Search CORE

4 research outputs found

Reconstructing Yeasts Phylogenies and Ancestors from Whole Genome Data

Author: Bing Feng
Chao Liu
Fei Hu
Jijun Tang
Lingxi Zhou
Robert Friedman
Ruofan Xia
Yan Guo
Yu Lin
Publication venue: Scholar Commons
Publication date: 09/11/2017
Field of study

Phylogenetic studies aim to discover evolutionary relationships and histories. These studies are based on similarities of morphological characters and molecular sequences. Currently, widely accepted phylogenetic approaches are based on multiple sequence alignments, which analyze shared gene datasets and concatenate/coalesce these results to a final phylogeny with maximum support. However, these approaches still have limitations, and often have conflicting results with each other. Reconstructing ancestral genomes helps us understand mechanisms and corresponding consequences of evolution. Most existing genome level phylogeny and ancestor reconstruction methods can only process simplified real genome datasets or simulated datasets with identical genome content, unique genome markers, and limited types of evolutionary events. Here, we provide an alternative way to resolve phylogenetic problems based on analyses of real genome data. We use phylogenetic signals from all types of genome level evolutionary events, and overcome the conflicting issues existing in traditional phylogenetic approaches. Further, we build an automated computational pipeline to reconstruct phylogenies and ancestral genomes for two high-resolution real yeast genome datasets. Comparison results with recent studies and publications show that we reconstruct very accurate and robust phylogenies and ancestors. Finally, we identify and analyze the conserved syntenic blocks among reconstructed ancestral genomes and present yeast species

Directory of Open Access Journals

Scholar Commons - Institutional Repository of the University of South Carolina

Reconstructing Yeasts Phylogenies and Ancestors from Whole Genome Data

Author: Feng Bing
Hu Fei
Ling Yu
Liu Chao
Xia Roufan
Zhou Lingxi
Publication venue: Scholar Commons
Publication date: 09/11/2017
Field of study

Scholar Commons - Institutional Repository of the University of South Carolina

Mining the semantics of genome super-blocks to infer ancestral architectures

Author: Jean Géraldine
Nikolski Macha
Sherman David James
Publication venue: 'Mary Ann Liebert Inc'
Publication date: 01/01/2009
Field of study

International audienc

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Phylogeny, Ancestral Genome, And Disease Diagnoses Models Constructions Using Biological Data

Author: Feng Bing
Publication venue: Scholar Commons
Publication date: 01/01/2018
Field of study

Studies of bioinformatics develop methods and software tools to analyze the biological data and provide insight of the mechanisms of biological process. Machine learning techniques have been widely used by researchers for disease prediction, disease diagnosis, and bio-marker identification. Using machine-learning algorithms to diagnose diseases has a couple of advantages. Besides solely relying on the doctors’ experiences and stereotyped formulas, researchers could use learning algorithms to analyze sophisticated, high-dimensional and multimodal biomedical data, and construct prediction/classification models to make decisions even when some information was incomplete, unknown, or contradictory. In this study, first of all, we built an automated computational pipeline to reconstruct phylogenies and ancestral genomes for two high-resolution real yeast whole genome datasets. Furthermore, we compared the results with recent studies and publications to show that we reconstruct very accurate and robust phylogenies, as well as ancestors. We also identified and analyzed conserved syntenic blocks among reconstructed ancestral genomes and present yeast species. Next, we analyzed the metabolic level dataset obtained from positive mass spectrometry of human blood samples. We applied machine learning algorithms and feature selection algorithms to construct diagnosis models of Chronic kidney diseases (CKD). We also identified the most critical metabolite features and studied the correlations v among the metabolite features and the developments of CKD stages. The selected metabolite features provided insights into CKD early stage diagnosis, pathophysiological mechanisms, CKD treatments, and medicine development. Finally, we used deep learning techniques to build accurate Down Syndrome (DS) prediction/screening models based on the analysis of newly introduced Illumina human genome genotyping array. We proposed a bi-stream convolutional neural network (CNN) architecture with ten layers and two merged CNN models, which took two input chromosome SNP maps in combination. We evaluated and compared the performances of our CNN DS predictions models with conventional machine learning algorithms. We visualized the feature maps and trained filter weights from intermediate layers of our trained CNN model. We further discussed the advantages of our method and the underlying reasons for the differences of their performances

Scholar Commons - Institutional Repository of the University of South Carolina