Search CORE

15 research outputs found

Incorporating Nonlinear Relationships in Microarray Missing Value Imputation

Author: Peng Hesen
Sun Wei
Yu Tianwei
Publication venue
Publication date: 01/01/2011
Field of study

Microarray gene expression data often contain missing values. Accurate estimation of the missing values is important for down-stream data analyses that require complete data. Nonlinear relationships between gene expression levels have not been well-utilized in missing value imputation. We propose an imputation scheme based on nonlinear dependencies between genes. By simulations based on real microarray data, we show that incorporating non-linear relationships could improve the accuracy of missing value imputation, both in terms of normalized root mean squared error and in terms of the preservation of the list of significant genes in statistical testing. In addition, we studied the impact of artificial dependencies introduced by data normalization on the simulation results. Our results suggest that methods relying on global correlation structures may yield overly optimistic simulation results when the data has been subjected to row (gene) – wise mean removal

PubMed Central

Carolina Digital Repository

Quantification and deconvolution of asymmetric LC-MS peaks using the bi-Gaussian mixture model and statistical model selection

Author: A Felinger
A Felinger
AP Dempster
CA Smith
DY Youn
FE Ahmed
FE Ahmed
G Chen
G Schwarz
Hesen Peng
HJ Issaq
JL Griffin
JR TorresLapasio
JR TorresLapasio
K Dettmer
M Johansson
M Katajamaa
M Sturm
MJD Powell
RD Caballero
T Yu
Tianwei Yu
TS Buys
VB Di Marco
WB Dunn
Z Papai
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Hierarchical Clustering of High- Throughput Expression Data Based on General Dependences

Author: Hesen Peng
Tianwei Yu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Incorporating Nonlinear Relationships in Microarray Missing Value Imputation

Author: Hesen Peng
Tianwei Yu
Wei Sun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011
Field of study

Crossref

PubMed Central

Carolina Digital Repository

BUS Vignette

Author: Christine Nardini
Hesen Peng
Lei Wang
Raffaele Fronza
Yin Jin
Yuanhua Liu
Publication venue
Publication date
Field of study

GOAL: The BUS package allows the computation of two types of similarities (correlation [Sokal, 2003] and mutual information [Cover, 2001]) for two different goals: (i) identification of the similarity among the activity of molecules sampled across different experiments (we name this option Unsupervised, U), (ii) identification of the similarity between such molecules and other types of information (clinical, anagraphical, etc, we name this optio

CiteSeerX

MeDiA: Mean Distance Association and Its Applications in Nonlinear Gene Set Analysis

Author: Hesen Peng (730462)
Jianwei Lu (432174)
Junjie Ma (730463)
Tianwei Yu (17518)
Yun Bai (120475)
Publication venue
Publication date: 01/01/2015
Field of study

<div>Probabilistic association discovery aims at identifying the association between random vectors, regardless of number of variables involved or linear/nonlinear functional forms. Recently, applications in high-dimensional data have generated rising interest in probabilistic association discovery. We developed a framework based on functions on the observation graph, named MeDiA (Mean Distance Association). We generalize its property to a group of functions on the observation graph. The group of functions encapsulates major existing methods in association discovery, e.g. mutual information and Brownian Covariance, and can be expanded to more complicated forms. We conducted numerical comparison of the statistical power of related methods under multiple scenarios. We further demonstrated the application of MeDiA as a method of gene set analysis that captures a broader range of responses than traditional gene set analysis methods.</div

Crossref

Directory of Open Access Journals

PubMed Central

Philadelphia College of Osteopathic Medicine: DigitalCommons@PCOM

FigShare

Network interaction for celiac disease pathways.

Author: Hesen Peng (730462)
Jianwei Lu (432174)
Junjie Ma (730463)
Tianwei Yu (17518)
Yun Bai (120475)
Publication venue
Publication date
Field of study

Red edge indicates that the interaction between connected pathways are amplified in disease individuals. Blue edge indicates the interaction suppressed in disease individuals.</p

FigShare

Gene sets associated with the two-dimensional clinical outcome based on MeDiA.

Author: Hesen Peng (730462)
Jianwei Lu (432174)
Junjie Ma (730463)
Tianwei Yu (17518)
Yun Bai (120475)
Publication venue
Publication date
Field of study

* Superscripts by the GO terms are for easy reference from the main text.Gene sets associated with the two-dimensional clinical outcome based on MeDiA.</p

FigShare

Random samples generated from independent bivariate normal distribution (left), and mixture bivariate normal distribution with ±0.8 covariates (right).

Author: Hesen Peng (730462)
Jianwei Lu (432174)
Junjie Ma (730463)
Tianwei Yu (17518)
Yun Bai (120475)
Publication venue
Publication date
Field of study

The dashed lines connects two observations if they are nearest neighbors.</p

FigShare

Comparison between the independent bivariate normal distribution and mixture normal distribution in Fig 1.

Author: Hesen Peng (730462)
Jianwei Lu (432174)
Junjie Ma (730463)
Tianwei Yu (17518)
Yun Bai (120475)
Publication venue
Publication date
Field of study

Comparison between the independent bivariate normal distribution and mixture normal distribution in <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0124620#pone.0124620.g001" target="_blank">Fig 1</a>.</p

FigShare