220 research outputs found

    Beyond Volume: The Impact of Complex Healthcare Data on the Machine Learning Pipeline

    Full text link
    From medical charts to national census, healthcare has traditionally operated under a paper-based paradigm. However, the past decade has marked a long and arduous transformation bringing healthcare into the digital age. Ranging from electronic health records, to digitized imaging and laboratory reports, to public health datasets, today, healthcare now generates an incredible amount of digital information. Such a wealth of data presents an exciting opportunity for integrated machine learning solutions to address problems across multiple facets of healthcare practice and administration. Unfortunately, the ability to derive accurate and informative insights requires more than the ability to execute machine learning models. Rather, a deeper understanding of the data on which the models are run is imperative for their success. While a significant effort has been undertaken to develop models able to process the volume of data obtained during the analysis of millions of digitalized patient records, it is important to remember that volume represents only one aspect of the data. In fact, drawing on data from an increasingly diverse set of sources, healthcare data presents an incredibly complex set of attributes that must be accounted for throughout the machine learning pipeline. This chapter focuses on highlighting such challenges, and is broken down into three distinct components, each representing a phase of the pipeline. We begin with attributes of the data accounted for during preprocessing, then move to considerations during model building, and end with challenges to the interpretation of model output. For each component, we present a discussion around data as it relates to the healthcare domain and offer insight into the challenges each may impose on the efficiency of machine learning techniques.Comment: Healthcare Informatics, Machine Learning, Knowledge Discovery: 20 Pages, 1 Figur

    Hydrodynamic modelling of protein conformation in solution: ELLIPS and HYDRO

    Get PDF
    The last three decades has seen some important advances in our ability to represent the conformation of proteins in solution on the basis of hydrodynamic measurements. Advances in theoretical modeling capabilities have been matched by commensurate advances in the precision of hydrodynamic measurements. We consider the advances in whole-body (simple ellipsoid-based) modeling—still useful for providing an overall idea of molecular shape, particularly for those systems where only a limited amount of data is available—and outline the ELLIPS suite of algorithms which facilitates the use of this approach. We then focus on bead modeling strategies, particularly the surface or shell–bead approaches and the HYDRO suite of algorithms. We demonstrate how these are providing great insights into complex issues such as the conformation of immunoglobulins and other multi-domain complexes

    A Virulent Wolbachia Infection Decreases the Viability of the Dengue Vector Aedes aegypti during Periods of Embryonic Quiescence

    Get PDF
    A new approach for dengue control has been proposed that relies on life-shortening strains of the obligate intracellular bacterium Wolbachia pipientis to modify mosquito population age structure and reduce pathogen transmission. Previously we reported the stable transinfection of the major dengue vector Aedes aegypti with a life-shortening Wolbachia strain (wMelPop-CLA) from the vinegar fly Drosophila melanogaster. Here, we report a further characterization of the phenotypic effects of this virulent Wolbachia infection on several life-history traits of Ae. aegypti. Minor costs of wMelPop-CLA infection for pre-imaginal survivorship, development and adult size were found. However, we discovered that the wMelPop-CLA infection dramatically decreased the viability of desiccated Ae. aegypti eggs over time. Similarly, the reproductive fitness of wMelPop-CLA infected Ae. aegypti females declined with age. These results reveal a general pattern associated with wMelPop-CLA induced pathogenesis in this mosquito species, where host fitness costs increase during aging of both immature and adult life-history stages. In addition to influencing the invasion dynamics of this particular Wolbachia strain, we suggest that the negative impact of wMelPop-CLA on embryonic quiescence may have applied utility as a tool to reduce mosquito population size in regions with pronounced dry seasons or in regions that experience cool winters

    Genome-wide significant association with seven novel multiple sclerosis risk loci

    Get PDF
    Objective: A recent large-scale study in multiple sclerosis (MS) using the ImmunoChip platform reported on 11 loci that showed suggestive genetic association with MS. Additional data in sufficiently sized and independent data sets are needed to assess whether these loci represent genuine MS risk factors. Methods: The lead SNPs of all 11 loci were genotyped in 10 796 MS cases and 10 793 controls from Germany, Spain, France, the Netherlands, Austria and Russia, that were independent from the previously reported cohorts. Association analyses were performed using logistic regression based on an additive model. Summary effect size estimates were calculated using fixed-effect meta-analysis. Results: Seven of the 11 tested SNPs showed significant association with MS susceptibility in the 21 589 individuals analysed here. Meta-analysis across our and previously published MS case-control data (total sample size n=101 683) revealed novel genome-wide significant association with MS susceptibility (p<5×10−8) for all seven variants. This included SNPs in or near LOC100506457 (rs1534422, p=4.03×10−12), CD28 (rs6435203, p=1.35×10−9), LPP (rs4686953, p=3.35×10−8), ETS1 (rs3809006, p=7.74×10−9), DLEU1 (rs806349, p=8.14×10−12), LPIN3 (rs6072343, p=7.16×10−12) and IFNGR2 (rs9808753, p=4.40×10−10). Cis expression quantitative locus effects were observed in silico for rs6435203 on CD28 and for rs9808753 on several immunologically relevant genes in the IFNGR2 locus. Conclusions: This study adds seven loci to the list of genuine MS genetic risk factors and further extends the list of established loci shared across autoimmune diseases

    Genome-wide significant association with seven novel multiple sclerosis risk loci

    Get PDF
    Objective: A recent large-scale study in multiple sclerosis (MS) using the ImmunoChip platform reported on 11 loci that showed suggestive genetic association with MS. Additional data in sufficiently sized and independent data sets are needed to assess whether these loci represent genuine MS risk factors. Methods: The lead SNPs of all 11 loci were genotyped in 10 796 MS cases and 10 793 controls from Germany, Spain, France, the Netherlands, Austria and Russia, that were independent from the previously reported cohorts. Association analyses were performed using logistic regression based on an additive model. Summary effect size estimates were calculated using fixed-effect meta-analysis. Results: Seven of the 11 tested SNPs showed significant association with MS susceptibility in the 21 589 individuals analysed here. Meta-analysis across our and previously published MS case-control data (total sample size n=101 683) revealed novel genome-wide significant association with MS susceptibility (p<5×10−8) for all seven variants. This included SNPs in or near LOC100506457 (rs1534422, p=4.03×10−12), CD28 (rs6435203, p=1.35×10−9), LPP (rs4686953, p=3.35×10−8), ETS1 (rs3809006, p=7.74×10−9), DLEU1 (rs806349, p=8.14×10−12), LPIN3 (rs6072343, p=7.16×10−12) and IFNGR2 (rs9808753, p=4.40×10−10). Cis expression quantitative locus effects were observed in silico for rs6435203 on CD28 and for rs9808753 on several immunologically relevant genes in the IFNGR2 locus. Conclusions: This study adds seven loci to the list of genuine MS genetic risk factors and further extends the list of established loci shared across autoimmune diseases

    The influence of salinity on the effects of Multi-walled carbon nanotubes on polychaetes

    Get PDF
    Salinity shifts in estuarine and coastal areas are becoming a topic of concern and are one of the main factors influencing nanoparticles behaviour in the environment. For this reason, the impacts of multiwalled carbon nanotubes (MWCNTs) under different seawater salinity conditions were evaluated on the common ragworm Hediste diversicolor, a polychaete species widely used as bioindicator of estuarine environmental quality. An innovative method to assess the presence of MWCNT aggregates in the sediments was used for the first time. Biomarkers approach was used to evaluate the metabolic capacity, oxidative status and neurotoxicity of polychaetes after long-term exposure. The results revealed an alteration of energy-related responses in contaminated polychaetes under both salinity conditions, resulting in an increase of metabolism and expenditure of their energy reserves (lower glycogen and protein contents). Moreover, a concentration-dependent toxicity (higher lipid peroxidation, lower ratio between reduced and oxidized glutathione and activation of antioxidant defences and biotransformation mechanisms) was observed in H. diversicolor, especially when exposed to low salinity. Additionally, neurotoxicity was observed by inhibition of Cholinesterases activity in organisms exposed to MWCNTs at both salinities.publishe

    Gene expression patterns associated with blood-feeding in the malaria mosquito Anopheles gambiae

    Get PDF
    BACKGROUND: Blood feeding, or hematophagy, is a behavior exhibited by female mosquitoes required both for reproduction and for transmission of pathogens. We determined the expression patterns of 3,068 ESTs, representing ~2,000 unique gene transcripts using cDNA microarrays in adult female Anopheles gambiae at selected times during the first two days following blood ingestion, at 5 and 30 min during a 40 minute blood meal and at 0, 1, 3, 5, 12, 16, 24 and 48 hours after completion of the blood meal and compared their expression to transcript levels in mosquitoes with access only to a sugar solution. RESULTS: In blood-fed mosquitoes, 413 unique transcripts, approximately 25% of the total, were expressed at least two-fold above or below their levels in the sugar-fed mosquitoes, at one or more time points. These differentially expressed gene products were clustered using k-means clustering into Early Genes, Middle Genes, and Late Genes, containing 144, 130, and 139 unique transcripts, respectively. Several genes from each group were analyzed by quantitative real-time PCR in order to validate the microarray results. CONCLUSION: The expression patterns and annotation of the genes in these three groups (Early, Middle, and Late genes) are discussed in the context of female mosquitoes' physiological responses to blood feeding, including blood digestion, peritrophic matrix formation, egg development, and immunity

    Nanoparticles for Applications in Cellular Imaging

    Get PDF
    In the following review we discuss several types of nanoparticles (such as TiO2, quantum dots, and gold nanoparticles) and their impact on the ability to image biological components in fixed cells. The review also discusses factors influencing nanoparticle imaging and uptake in live cells in vitro. Due to their unique size-dependent properties nanoparticles offer numerous advantages over traditional dyes and proteins. For example, the photostability, narrow emission peak, and ability to rationally modify both the size and surface chemistry of Quantum Dots allow for simultaneous analyses of multiple targets within the same cell. On the other hand, the surface characteristics of nanometer sized TiO2allow efficient conjugation to nucleic acids which enables their retention in specific subcellular compartments. We discuss cellular uptake mechanisms for the internalization of nanoparticles and studies showing the influence of nanoparticle size and charge and the cell type targeted on nanoparticle uptake. The predominant nanoparticle uptake mechanisms include clathrin-dependent mechanisms, macropinocytosis, and phagocytosis

    Massively Parallel RNA Sequencing Identifies a Complex Immune Gene Repertoire in the lophotrochozoan Mytilus edulis

    Get PDF
    The marine mussel Mytilus edulis and its closely related sister species are distributed world-wide and play an important role in coastal ecology and economy. The diversification in different species and their hybrids, broad ecological distribution, as well as the filter feeding mode of life has made this genus an attractive model to investigate physiological and molecular adaptations and responses to various biotic and abiotic environmental factors. In the present study we investigated the immune system of Mytilus, which may contribute to the ecological plasticity of this species. We generated a large Mytilus transcriptome database from different tissues of immune challenged and stress treated individuals from the Baltic Sea using 454 pyrosequencing. Phylogenetic comparison of orthologous groups of 23 species demonstrated the basal position of lophotrochozoans within protostomes. The investigation of immune related transcripts revealed a complex repertoire of innate recognition receptors and downstream pathway members including transcripts for 27 toll-like receptors and 524 C1q domain containing transcripts. NOD-like receptors on the other hand were absent. We also found evidence for sophisticated TNF, autophagy and apoptosis systems as well as for cytokines. Gill tissue and hemocytes showed highest expression of putative immune related contigs and are promising tissues for further functional studies. Our results partly contrast with findings of a less complex immune repertoire in ecdysozoan and other lophotrochozoan protostomes. We show that bivalves are interesting candidates to investigate the evolution of the immune system from basal metazoans to deuterostomes and protostomes and provide a basis for future molecular work directed to immune system functioning in Mytilus
    corecore