Search CORE

3 research outputs found

Discriminant functional gene groups identiﬁcation with machine learning and prior knowledge

Author: Barla Annalisa
Di Camillo Barbara
Sanavia Tiziana
Squillario Margherita
Verri Alessandro
Zycinski Grzegorz
Publication venue: i6doc
Publication date: 01/01/2012
Field of study

Institutional Research Information System University of Turin

Analysis of a Parkinson's dataset: comparison between KDVS and the Standard pipeline

Author: Alessandro Verri
Annalisa Barla
Grzegorz Zycinski
Margherita Squillario
Salvatore Masecchia
Publication venue
Publication date: 01/01/2012
Field of study

Archivio istituzionale della ricerca - Università di Genova

Knowledge Driven Variable Selection (KDVS) – a new approach to enrichment analysis of gene signatures obtained from high–throughput data

Author: Alessandro Verri
Annalisa Barla
Barbara Di Camillo
Grzegorz Zycinski
Margherita Squillario
Tiziana Sanavia
Publication venue: Springer Nature
Publication date: 01/01/2013
Field of study

BACKGROUND: High–throughput (HT) technologies provide huge amount of gene expression data that can be used to identify biomarkers useful in the clinical practice. The most frequently used approaches first select a set of genes (i.e. gene signature) able to characterize differences between two or more phenotypical conditions, and then provide a functional assessment of the selected genes with an a posteriori enrichment analysis, based on biological knowledge. However, this approach comes with some drawbacks. First, gene selection procedure often requires tunable parameters that affect the outcome, typically producing many false hits. Second, a posteriori enrichment analysis is based on mapping between biological concepts and gene expression measurements, which is hard to compute because of constant changes in biological knowledge and genome analysis. Third, such mapping is typically used in the assessment of the coverage of gene signature by biological concepts, that is either score–based or requires tunable parameters as well, limiting its power. RESULTS: We present Knowledge Driven Variable Selection (KDVS), a framework that uses a priori biological knowledge in HT data analysis. The expression data matrix is transformed, according to prior knowledge, into smaller matrices, easier to analyze and to interpret from both computational and biological viewpoints. Therefore KDVS, unlike most approaches, does not exclude a priori any function or process potentially relevant for the biological question under investigation. Differently from the standard approach where gene selection and functional assessment are applied independently, KDVS embeds these two steps into a unified statistical framework, decreasing the variability derived from the threshold–dependent selection, the mapping to the biological concepts, and the signature coverage. We present three case studies to assess the usefulness of the method. CONCLUSIONS: We showed that KDVS not only enables the selection of known biological functionalities with accuracy, but also identification of new ones. An efficient implementation of KDVS was devised to obtain results in a fast and robust way. Computing time is drastically reduced by the effective use of distributed resources. Finally, integrated visualization techniques immediately increase the interpretability of results. Overall, KDVS approach can be considered as a viable alternative to enrichment–based approaches

Crossref

Springer - Publisher Connector

PubMed Central

Archivio istituzionale della ricerca - Università di Genova

Archivio istituzionale della ricerca - Università di Padova

Institutional Research Information System University of Turin