Search CORE

270 research outputs found

Microarray platform consistency is revealed by biologically functional analysis of gene expression profiles

Author: Chen Tao
Li Zhiguang
Shi Leming
Su Zhenqiang
Wen Zhining
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

In silico method for systematic analysis of feature importance in microRNA-mRNA interactions

Author: Guang Xuanmin
Li Menglong
Li Yizhou
Wang Kelong
Wen Zhining
Xiao Jiamin
Zhang Lifang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background MicroRNA (miRNA), which is short non-coding RNA, plays a pivotal role in the regulation of many biological processes and affects the stability and/or translation of mRNA. Recently, machine learning algorithms were developed to predict potential miRNA targets. Most of these methods are robust but are not sensitive to redundant or irrelevant features. Despite their good performance, the relative importance of each feature is still unclear. With increasing experimental data becoming available, research interest has shifted from higher prediction performance to uncovering the mechanism of microRNA-mRNA interactions. Results Systematic analysis of sequence, structural and positional features was carried out for two different data sets. The dominant functional features were distinguished from uninformative features in single and hybrid feature sets. Models were developed using only statistically significant sequence, structural and positional features, resulting in area under the receiver operating curves (AUC) values of 0.919, 0.927 and 0.969 for one data set and of 0.926, 0.874 and 0.954 for another data set, respectively. Hybrid models were developed by combining various features and achieved AUC of 0.978 and 0.970 for two different data sets. Functional miRNA information is well reflected in these features, which are expected to be valuable in understanding the mechanism of microRNA-mRNA interactions and in designing experiments. Conclusions Differing from previous approaches, this study focused on systematic analysis of all types of features. Statistically significant features were identified and used to construct models that yield similar accuracy to previous studies in a shorter computation time.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Genome-wide association study combined with biological context can reveal more disease-related SNPs altering microRNA target seed sites

Author: Di Wu
Gang Yang
Jiwei Xue
Lifang Zhang
Menglong Li
Zhining Wen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

Springer - Publisher Connector

Comparative Analysis for the Performance of Variant Calling Pipelines on Detecting the de novo Mutations in Humans

Author: Chuan Li
Li He
Menglong Li
Xuemei Pu
Yifan Zhou
Yinyi Hao
Yiru Zhao
Yu Liang
Zhining Wen
Publication venue: 'Frontiers Media SA'
Publication date: 01/04/2019
Field of study

Despite of the low occurrence rate in the entire genomes, de novo mutation is proved to be deleterious and will lead to severe genetic diseases via impacting on the gene function. Considering the fact that the traditional family based linkage approaches and the genome-wide association studies are unsuitable for identifying the de novo mutations, in recent years, several pipelines have been proposed to detect them based on the whole-genome or whole-exome sequencing data and were used for calling them in the rare diseases. However, how the performance of these variant calling pipelines on detecting the de novo mutations is still unexplored. For the purpose of facilitating the appropriate choice of the pipelines and reducing the false positive rate, in this study, we thoroughly evaluated the performance of the commonly used trio calling methods on the detection of the de novo single-nucleotide variants (DNSNVs) by conducting a comparative analysis for the calling results. Our results exhibited that different pipelines have a specific tendency to detect the DNSNVs in the genomic regions with different GC contents. Additionally, to refine the calling results for a single pipeline, our proposed filter achieved satisfied results, indicating that the read coverage at the mutation positions can be used as an effective index to identify the high-confidence DNSNVs. Our findings should be good support for the committees to choose an appropriate way to explore the de novo mutations for the rare diseases

Directory of Open Access Journals

Know, Know Where, Know Where Graph:A densely connected, cross-domain knowledge graph and geo-enrichment service stack for applications in environmental intelligence

Author: Cai Ling
Currier Kitty
D’onofrio Anthony
Fisher Colby K.
Gonzalez Seila
Gu Zhining
Hitzler Pascal
Janowicz Krzysztof
Li Wenwen
Liu Zilong
Lopez-Carr Anna
Mai Gengchen
Mecum Bryce
Rehberger Dean
Schildhauer Mark
Schroeder Andrew
Shi Meilin
Shimizu Cogan
Smith David
Stephen Shirly
Tian Yuanyuan
Wang Sizhe
Wright Dawn
Zalewski Joseph
Zhou Lu
Zhu Rui
Publication venue: 'Wiley'
Publication date: 01/03/2022
Field of study

Explore Bristol Research

Predicting disease-associated substitution of a single amino acid by analyzing residue interactions

Author: A del Sol
A Liaw
AM Fernandez-Escamilla
B Li
C Ferrer-Costa
C Ferrer-Costa
C Kosiol
CT Saunders
DJ Watts
G Amitai
G Bagler
H Carter
Hui Yin
J Reumers
Jiamin Xiao
JS Kaminker
JS Kaminker
KV Brinda
L Bao
L Bao
L Breman
Lezheng Yu
LH Greene
Li Yang
M Mort
M Vendruscolo
MEJ Newman
Menglong Li
NV Dokholyan
P Yue
P Yue
P Yue
PA Alexander
PC Ng
PC Ng
PD Thomas
R Karchin
RA Gibbs
RJ Dobson
S Miyazawa
S Sunyaev
SF Altschul
ST Sherry
V Ramensky
W Kabsch
W Lee
Y Bromberg
Y Bromberg
Yizhou Li
YL Yip
YL Yip
Z Wang
Zhining Wen
ZQ Ye
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The rapid accumulation of data on non-synonymous single nucleotide polymorphisms (nsSNPs, also called SAPs) should allow us to further our understanding of the underlying disease-associated mechanisms. Here, we use complex networks to study the role of an amino acid in both local and global structures and determine the extent to which disease-associated and polymorphic SAPs differ in terms of their interactions to other residues. Results We found that SAPs can be well characterized by network topological features. Mutations are probably disease-associated when they occur at a site with a high centrality value and/or high degree value in a protein structure network. We also discovered that study of the neighboring residues around a mutation site can help to determine whether the mutation is disease-related or not. We compiled a dataset from the Swiss-Prot variant pages and constructed a model to predict disease-associated SAPs based on the random forest algorithm. The values of total accuracy and MCC were 83.0% and 0.64, respectively, as determined by 5-fold cross-validation. With an independent dataset, our model achieved a total accuracy of 80.8% and MCC of 0.59, respectively. Conclusions The satisfactory performance suggests that network topological features can be used as quantification measures to determine the importance of a site on a protein, and this approach can complement existing methods for prediction of disease-associated SAPs. Moreover, the use of this method in SAP studies would help to determine the underlying linkage between SAPs and diseases through extensive investigation of mutual interactions between residues.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central