780 research outputs found

    An automated ETL for online datasets

    Get PDF
    While using online datasets for machine learning is commonplace today, the quality of these datasets impacts on the performance of prediction algorithms. One method for improving the semantics of new data sources is to map these sources to a common data model or ontology. While semantic and structural heterogeneities must still be resolved, this provides a well established approach to providing clean datasets, suitable for machine learning and analysis. However, when there is a requirement for a close to real time usage of online data, a method for dynamic Extract-Transform-Load of new sources data must be developed. In this work, we present a framework for integrating online and enterprise data sources, in close to real time, to provide datasets for machine learning and predictive algorithms. An exhaustive evaluation compares a human built data transformation process with our system’s machine generated ETL process, with very favourable results, illustrating the value and impact of an automated approach

    An automated ETL for online datasets

    Get PDF
    While using online datasets for machine learning is commonplace today, the quality of these datasets impacts on the performance of prediction algorithms. One method for improving the semantics of new data sources is to map these sources to a common data model or ontology. While semantic and structural heterogeneities must still be resolved, this provides a well established approach to providing clean datasets, suitable for machine learning and analysis. However, when there is a requirement for a close to real time usage of online data, a method for dynamic Extract-Transform-Load of new sources data must be developed. In this work, we present a framework for integrating online and enterprise data sources, in close to real time, to provide datasets for machine learning and predictive algorithms. An exhaustive evaluation compares a human built data transformation process with our system’s machine generated ETL process, with very favourable results, illustrating the value and impact of an automated approach

    An automated ETL for online datasets

    Get PDF
    While using online datasets for machine learning is commonplace today, the quality of these datasets impacts on the performance of prediction algorithms. One method for improving the semantics of new data sources is to map these sources to a common data model or ontology. While semantic and structural heterogeneities must still be resolved, this provides a well established approach to providing clean datasets, suitable for machine learning and analysis. However, when there is a requirement for a close to real time usage of online data, a method for dynamic Extract-Transform-Load of new sources data must be developed. In this work, we present a framework for integrating online and enterprise data sources, in close to real time, to provide datasets for machine learning and predictive algorithms. An exhaustive evaluation compares a human built data transformation process with our system’s machine generated ETL process, with very favourable results, illustrating the value and impact of an automated approach

    Floral evolution of long-tubed Erica species

    Get PDF
    The genus Erica has undergone an extreme radiation in the Cape and exhibits a diversity of pollination syndromes and floral traits. This makes Erica well-suited to study the evolution of floral traits and how they impact speciation. The first chapter explored the role of ultraviolet colouration by recording its prevalence across Erica pollination syndromes. Ultraviolet was rare in wind-, rodent and small insect-pollinated species, but it was common in bird-pollinated species and ubiquitous in long-proboscid fly (LPF)-pollinated species. Testing their preference revealed that sunbirds can see ultraviolet, but they have no innate preference. LPFs on the other hand were not attracted to flowers where ultraviolet reflectance was removed, thus displaying a strong preference. Chapter 2 focused on the role of stickiness for nectar robbers. I experimentally added stickiness to Erica flowers of one species and further compared stickiness to nectar robbing across several communities. Stickiness appears to reduce damage due to nectar robbing within and between species. Further, I found that stickiness is strongly correlated with pollination by birds and LPFs which might be due to their large nectar rewards. Chapter 3 investigated how the sister species Erica shannonea and Erica ampullacea co-occur despite sharing a pollination syndrome. Pollination experiments and observations showed that they are pollinated by LPFs from two families. The horizontal flowers of E. shannonea are pollinated by a tabanid which has a fixed forward-pointing proboscis, while the vertical flowers of E. ampullacea are pollinated by a nemestrinid which can swivel its proboscis downwards. The nemestrinid in turn has a shorter proboscis which prevents it from accessing nectar in the long-tubed E. shannonea. Due to their different biomechanics, each fly can only access the flower it pollinates resulting in effective reproductive isolation between these species. Chapter 4 compared flower orientation in relation to the two LPF families across all LPF-pollinated species. Using a phylogenetically corrected analysis, I found that flowers pollinated by Tabanidae tend to be horizontal, while nemestrinid flowers are more variable in orientation and more often vertical. This confirms the importance of pollinator biomechanics for the evolution of floral traits. The last chapter investigated how pollen transfer efficiency differs between Erica pollination syndromes. I found that LPF- and bird-pollinated species have higher pollen transfer efficiency in comparison to bee-pollinated species which might have facilitated the shifts from ancestral bee pollination

    Exploring the Use of Marimba in Nordoff Robbins Music Therapy with Children with Autism Spectrum Disorder

    Get PDF
    This study explored the use of the marimba in Nordoff Robbins music therapy with children with Autism Spectrum Disorder. Three clients each received two individual 30 to 45-minute music therapy sessions over the course of three weeks. After reviewing the sessions an inductive thematic analysis was competed in order to determine themes that occurred throughout the study. Five themes were discussed using clinical examples and musical transcriptions. These themes included: (a) sharing the marimba to strengthen the therapeutic relationship, (b) the use of marimba as an accompanying instrument, (c) rhythmic structuring, (d) the use of Nordoff Robbins repertoire on marimba, and (e) the use of mallet techniques to reflect emotional qualities of the music. The use of mallet techniques in addition to the timbre and range of the instrument allowed for the marimba to be used clinically to engage clients in musical interactions. The examples included in the study indicated that the musical qualities of the marimba can be used to engage clients with ASD. There is still limited research on the use of the marimba in music therapy

    Validating a novel web-based method to capture disease progression outcomes in multiple sclerosis

    Get PDF
    The Expanded Disability Status Scale (EDSS) is the current ‘gold standard’ for monitoring disease severity in multiple sclerosis (MS). The EDSS is a physician-based assessment. A patient-related surrogate for the EDSS may be useful in remotely capturing information. Eighty-one patients (EDSS range 0–8) having EDSS as part of clinical trials were recruited. All patients carried out the web-based survey with minimal assistance. Full EDSS scores were available for 78 patients. The EDSS scores were compared to those generated by the online survey using analysis of variance, matched pair test, Pearson’s coefficient, weighted kappa coefficient, and the intra-class correlation coefficient. The internet-based EDSS scores showed good correlation with the physician-measured assessment (Pearson’s coefficient = 0.85). Weighted kappa for full agreement was 0.647. Full agreement was observed in 20 patient

    Transactions, Costs, and the Supply of Low-Income Rental Housing: The Case of Section 8 Rental Certificates and Vouchers in Dayton, Ohio

    Get PDF
    Either private markets or publicly-subsidized programs can provide low-income housing. Federal programs such as the Section 8 low-income housing assistance program combines public funds with the existing private housing stock to provide housing for low-income families. Tenants, (or consumers) receive rent subsidies through either vouchers or rental certificates. The housing authority pays the voucher or rental certificate amount directly to the landlord each month. Transaction costs, defined as the costs of negotiating, creating and enforcing contracts, determine in part the incentives for landlords to participate in government programs. Costs such as time, increased investment to maintain government housing quality standards, and the costs associated with the government bureaucracy of a rent subsidy program affect the willingness of landlords to participate in these programs. Understanding the impacts of transactions costs incurred through participation in housing programs is crucial for understanding and developing policies that can effectively utilize existing units in the private rental market to address social problems such as affordable housing and homelessness. A case study of landlords in the Dayton Metropolitan Housing Authority\u27s Section 8 program provides statistical analysis illustrating the role transactions costs play in determining private sector participation. The results suggest that the transactions costs involved with Section 8 program participation are not a significant deterrent. In particular, the guaranteed subsidized rent compensates landlords for the transactions costs associated with participating in the program

    Continuous Wave Ring-Down Spectroscopy for Velocity Distribution Measurements in Plasma

    Get PDF
    Cavity Ring-Down Spectroscopy CRDS is a proven, ultra-sensitive, cavity enhanced absorption spectroscopy technique. When combined with a continuous wavelength (CW) diode laser that has a sufficiently narrow line width, the Doppler broadened absorption line, i.e., the velocity distribution functions (VDFs) of the absorbing species, can be measured. Measurements of VDFs can be made using established techniques such as laser induced fluorescence (LIF). However, LIF suffers from the requirement that the initial state of the LIF sequence have a substantial density and that the excitation scheme fluoresces at an easily detectable wavelength. This usually limits LIF to ions and atoms with large metastable state densities for the given plasma conditions. CW-CRDS is considerably more sensitive than LIF and can potentially be applied to much lower density populations of ion and atom states. Also, as a direct absorption technique, CW-CRDS measurements only need to be concerned with the species\u27 absorption wavelength and provide an absolute measure of the line integrated initial state density. Presented in this work are measurements of argon ion and neutral VDFs in a helicon plasma using CW-CRDS and LIF

    Analyzing adverse events of drugs

    Get PDF
    corecore