Search CORE

3,481 research outputs found

Machine Learning and Integrative Analysis of Biomedical Big Data.

Author: Choi Howard
Chung Neo Christopher
Mirza Bilal
Ping Peipei
Wang Jie
Wang Wei
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Recent developments in high-throughput technologies have accelerated the accumulation of massive amounts of omics data from multiple sources: genome, epigenome, transcriptome, proteome, metabolome, etc. Traditionally, data from each source (e.g., genome) is analyzed in isolation using statistical and machine learning (ML) methods. Integrative analysis of multi-omics and clinical data is key to new biomedical discoveries and advancements in precision medicine. However, data integration poses new computational challenges as well as exacerbates the ones associated with single-omics studies. Specialized computational approaches are required to effectively and efficiently perform integrative analysis of biomedical data acquired from diverse modalities. In this review, we discuss state-of-the-art ML-based approaches for tackling five specific computational challenges associated with integrative analysis: curse of dimensionality, data heterogeneity, missing data, class imbalance and scalability issues

Multidisciplinary Digital Publishing Institute

Ezid

Directory of Open Access Journals

eScholarship - University of California

A generalization of partial least squares regression and correspondence analysis for categorical and mixed data: An application with the ADNI data

Author: Beaton Derek
Behavioral Herve,
Saporta Gilbert
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 07/02/2020
Field of study

The present and future of large scale studies of human brain and behaviorin typical and disease populationsis mutli-omics, deep-phenotyping, or other types of multi-source and multi-domain data collection initiatives. These massive studies rely on highly interdisciplinary teams that collect extremely diverse types of data across numerous systems and scales of measurement (e.g., genetics, brain structure, behavior, and demographics). Such large, complex, and heterogeneous data requires relatively simple methods that allow for exibility in analyses without the loss of the inherent properties of various data types. Here we introduce a method designed * Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimag-ing Initiative (ADNI) database (http://adni.loni.usc.edu/). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found a

Exploratory Analysis of Multivariate Data (Unsupervised Image Segmentation and Data Driven Linear and Nonlinear Decomposition)

Author: Hilger Klaus Baggesen
Publication venue
Publication date: 01/03/2002
Field of study

Online Research Database In Technology

GIS and statistical analysis for landslide susceptibility mapping in the Daunia area, Italy

Author: C. Ceppi
F. Mancini
G. Ritrovato
Publication venue: Copernicus Publications
Publication date: 01/01/2010
Field of study

This study focuses on landslide susceptibility mapping in the Daunia area (Apulian Apennines, Italy) and achieves this by using a multivariate statistical method and data processing in a Geographical Information System (GIS). The Logistic Regression (hereafter LR) method was chosen to produce a susceptibility map over an area of 130 000 ha where small settlements are historically threatened by landslide phenomena. By means of LR analysis, the tendency to landslide occurrences was, therefore, assessed by relating a landslide inventory (dependent variable) to a series of causal factors (independent variables) which were managed in the GIS, while the statistical analyses were performed by means of the SPSS (Statistical Package for the Social Sciences) software. The LR analysis produced a reliable susceptibility map of the investigated area and the probability level of landslide occurrence was ranked in four classes. The overall performance achieved by the LR analysis was assessed by local comparison between the expected susceptibility and an independent dataset extrapolated from the landslide inventory. Of the samples classified as susceptible to landslide occurrences, 85% correspond to areas where landslide phenomena have actually occurred. In addition, the consideration of the regression coefficients provided by the analysis demonstrated that a major role is played by the "land cover" and "lithology" causal factors in determining the occurrence and distribution of landslide phenomena in the Apulian Apennines

Crossref

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

An Integrated Methodology for Enhancing Reverse Logistics Flows and Networks in Industry 5.0

Author: Dabo Al-Amin
Hosseinian Far Amin
Publication venue
Publication date: 11/12/2023
Field of study

Background: This paper explores the potential of Industry 5.0 in driving societal transition to a circular economy. We focus on the strategic role of reverse logistics in this context, underlining its significance in optimizing resource use, reducing waste, and enhancing sustainable production and consumption patterns. Adopting sustainable industrial practices is critical to addressing global environmental challenges. Industry 5.0 offers opportunities for achieving these goals, particularly through the enhancement of reverse logistics processes. Methods: We propose an integrated methodology that combines binary logistic regression and decision trees to predict and optimize reverse logistics flows and networks within the Industry 5.0 framework. Results: The methodology demonstrates effective quantitative modeling of influential predictors in reverse logistics and provides a structured framework for understanding their interrelations. It yields actionable insights that enhance decision-making processes in supply chain management. Conclusions: The methodology supports the integration of advanced technologies and human-centered approaches into industrial reverse logistics, thereby improving resource sustainability, systemic innovation, and contributing to the broader goals of a circular economy. Future research should explore the scalability of this methodology across different industrial sectors and its integration with other Industry 5.0 technologies. Continuous refinement and adaptation of the methodology will be necessary to keep pace with the evolving landscape of industrial sustainability.<br/

University of Northampton's Research Explorer