3,481 research outputs found

    Machine Learning and Integrative Analysis of Biomedical Big Data.

    Get PDF
    Recent developments in high-throughput technologies have accelerated the accumulation of massive amounts of omics data from multiple sources: genome, epigenome, transcriptome, proteome, metabolome, etc. Traditionally, data from each source (e.g., genome) is analyzed in isolation using statistical and machine learning (ML) methods. Integrative analysis of multi-omics and clinical data is key to new biomedical discoveries and advancements in precision medicine. However, data integration poses new computational challenges as well as exacerbates the ones associated with single-omics studies. Specialized computational approaches are required to effectively and efficiently perform integrative analysis of biomedical data acquired from diverse modalities. In this review, we discuss state-of-the-art ML-based approaches for tackling five specific computational challenges associated with integrative analysis: curse of dimensionality, data heterogeneity, missing data, class imbalance and scalability issues

    A generalization of partial least squares regression and correspondence analysis for categorical and mixed data: An application with the ADNI data

    Get PDF
    The present and future of large scale studies of human brain and behaviorin typical and disease populationsis mutli-omics, deep-phenotyping, or other types of multi-source and multi-domain data collection initiatives. These massive studies rely on highly interdisciplinary teams that collect extremely diverse types of data across numerous systems and scales of measurement (e.g., genetics, brain structure, behavior, and demographics). Such large, complex, and heterogeneous data requires relatively simple methods that allow for exibility in analyses without the loss of the inherent properties of various data types. Here we introduce a method designed * Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimag-ing Initiative (ADNI) database (http://adni.loni.usc.edu/). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found a

    GIS and statistical analysis for landslide susceptibility mapping in the Daunia area, Italy

    Get PDF
    This study focuses on landslide susceptibility mapping in the Daunia area (Apulian Apennines, Italy) and achieves this by using a multivariate statistical method and data processing in a Geographical Information System (GIS). The Logistic Regression (hereafter LR) method was chosen to produce a susceptibility map over an area of 130 000 ha where small settlements are historically threatened by landslide phenomena. By means of LR analysis, the tendency to landslide occurrences was, therefore, assessed by relating a landslide inventory (dependent variable) to a series of causal factors (independent variables) which were managed in the GIS, while the statistical analyses were performed by means of the SPSS (Statistical Package for the Social Sciences) software. The LR analysis produced a reliable susceptibility map of the investigated area and the probability level of landslide occurrence was ranked in four classes. The overall performance achieved by the LR analysis was assessed by local comparison between the expected susceptibility and an independent dataset extrapolated from the landslide inventory. Of the samples classified as susceptible to landslide occurrences, 85% correspond to areas where landslide phenomena have actually occurred. In addition, the consideration of the regression coefficients provided by the analysis demonstrated that a major role is played by the "land cover" and "lithology" causal factors in determining the occurrence and distribution of landslide phenomena in the Apulian Apennines

    An Integrated Methodology for Enhancing Reverse Logistics Flows and Networks in Industry 5.0

    Get PDF
    Background: This paper explores the potential of Industry 5.0 in driving societal transition to a circular economy. We focus on the strategic role of reverse logistics in this context, underlining its significance in optimizing resource use, reducing waste, and enhancing sustainable production and consumption patterns. Adopting sustainable industrial practices is critical to addressing global environmental challenges. Industry 5.0 offers opportunities for achieving these goals, particularly through the enhancement of reverse logistics processes. Methods: We propose an integrated methodology that combines binary logistic regression and decision trees to predict and optimize reverse logistics flows and networks within the Industry 5.0 framework. Results: The methodology demonstrates effective quantitative modeling of influential predictors in reverse logistics and provides a structured framework for understanding their interrelations. It yields actionable insights that enhance decision-making processes in supply chain management. Conclusions: The methodology supports the integration of advanced technologies and human-centered approaches into industrial reverse logistics, thereby improving resource sustainability, systemic innovation, and contributing to the broader goals of a circular economy. Future research should explore the scalability of this methodology across different industrial sectors and its integration with other Industry 5.0 technologies. Continuous refinement and adaptation of the methodology will be necessary to keep pace with the evolving landscape of industrial sustainability.<br/
    • …
    corecore