9,245 research outputs found

Model-based clustering with data correction for removing artifacts in gene expression data

Author: Raftery Adrian E.
Yeung Ka Yee
Young William Chad
Publication date: 19/02/2016

The NIH Library of Integrated Network-based Cellular Signatures (LINCS) contains gene expression data from over a million experiments, using Luminex Bead technology. Only 500 colors are used to measure the expression levels of the 1,000 landmark genes, and the data for the resulting pairs of genes are deconvolved. The raw data are sometimes inadequate for reliable deconvolution, leading to artifacts in the final processed data. These include the expression levels of paired genes being flipped or given the same value, and clusters of values that are not at the true expression level. We propose a new method called model-based clustering with data correction (MCDC) that is able to identify and correct these three kinds of artifacts simultaneously. We show that MCDC improves the resulting gene expression data in terms of agreement with external baselines, as well as improving results from subsequent analysis.
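As a rough illustration of one of the artifacts named in the abstract above (flipped values for a gene pair), the following toy sketch reassigns each observation to whichever orientation lies closer to the current cluster mean and then re-estimates the mean. It is not the MCDC algorithm: the single cluster, the isotropic-Gaussian assumption (under which comparing likelihoods reduces to comparing squared distances), the function name `correct_flips`, and the sample data are all assumptions made for this example.

```python
def correct_flips(pairs, max_iter=20):
    """Toy flip correction for (gene_a, gene_b) expression pairs.

    Assumes one cluster with isotropic Gaussian noise, so the more likely
    orientation of a pair is the one closer to the cluster mean.
    """
    current = list(pairs)
    for _ in range(max_iter):
        n = len(current)
        # Re-estimate the cluster mean from the currently corrected data.
        mean = (sum(a for a, _ in current) / n,
                sum(b for _, b in current) / n)
        updated = []
        for a, b in current:
            d_keep = (a - mean[0]) ** 2 + (b - mean[1]) ** 2
            d_flip = (b - mean[0]) ** 2 + (a - mean[1]) ** 2
            # Swap the pair only if the flipped orientation fits better.
            updated.append((b, a) if d_flip < d_keep else (a, b))
        if updated == current:
            break  # assignments are stable
        current = updated
    return current


# Hypothetical data: the third observation has its two values swapped.
observed = [(2.0, 8.0), (2.1, 7.9), (8.1, 2.0), (1.9, 8.2)]
corrected = correct_flips(observed)
```

This hard-assignment loop only behaves sensibly when most observations are unflipped and the two genes have clearly different expression levels; the method in the paper handles three artifact types jointly, which this sketch does not attempt.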

arXiv.org e-Print Archive

University of Washington: UW Tacoma Digital Commons

Crossref

A Posterior Probability Approach for Gene Regulatory Network Inference in Genetic Perturbation Data

Author: Raftery Adrian E.
Yeung Ka Yee
Young William Chad
Publication date: 15/03/2016

Inferring gene regulatory networks is an important problem in systems biology. However, these networks can be hard to infer from experimental data because of the inherent variability in biological data as well as the large number of genes involved. We propose a fast, simple method for inferring regulatory relationships between genes from knockdown experiments in the NIH LINCS dataset by calculating posterior probabilities, incorporating prior information. We show that the method is able to find previously identified edges from TRANSFAC and JASPAR and discuss the merits and limitations of this approach.
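The abstract above does not spell out its model, so the following is only a generic sketch of how a prior edge probability can be combined with evidence through Bayes' rule. The function name `posterior_edge_probability` and the idea of summarizing the knockdown data as a single likelihood ratio are assumptions for illustration, not the authors' formulation.

```python
def posterior_edge_probability(prior, likelihood_ratio):
    """Posterior probability of a regulatory edge via Bayes' rule.

    prior            -- prior probability that the edge exists (0 < prior < 1)
    likelihood_ratio -- P(data | edge) / P(data | no edge); how this ratio
                        is obtained from knockdown data is left open here
    """
    if not 0.0 < prior < 1.0:
        raise ValueError("prior must lie strictly between 0 and 1")
    if likelihood_ratio < 0.0:
        raise ValueError("likelihood_ratio must be non-negative")
    numerator = prior * likelihood_ratio
    return numerator / (numerator + (1.0 - prior))


# A sparse prior (1% of candidate edges real) with strong evidence.
p_edge = posterior_edge_probability(prior=0.01, likelihood_ratio=99.0)
```

With a 1% prior, a likelihood ratio of 99 brings the posterior to about one half, which shows why sparse priors demand strong evidence before an edge is called.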

arXiv.org e-Print Archive

University of Washington: UW Tacoma Digital Commons

Crossref

Autonomous frequency domain identification: Theory and experiment

Author: Bayard D. S.
Hadaegh F. Y.
Mettler E.
Milman M. H.
Scheid R. E.
Yam Yeung

The analysis, design, and on-orbit tuning of robust controllers require more information about the plant than simply a nominal estimate of the plant transfer function. Information is also required concerning the uncertainty in the nominal estimate, or more generally, the identification of a model set within which the true plant is known to lie. The identification methodology that was developed and experimentally demonstrated makes use of a simple but useful characterization of the model uncertainty based on the output error. This is a characterization of the additive uncertainty in the plant model, which has found considerable use in many robust control analysis and synthesis techniques. The identification process is initiated by a stochastic input $u$ applied to the plant $p$, giving rise to the output $y$. Spectral estimation ($h = P_{uy}/P_{uu}$) is used as an estimate of $p$, and the model order is estimated using the product moment matrix (PMM) method. A parametric model $\hat{p}$ is then determined by curve fitting the spectral estimate to a rational transfer function. The additive uncertainty $\delta_m = p - \hat{p}$ is then estimated by the cross-spectral estimate $\delta = P_{ue}/P_{uu}$, where $e = y - \hat{y}$ is the output error and $\hat{y} = \hat{p}u$ is the computed output of the parametric model subjected to the actual input $u$. The experimental results demonstrate that the curve fitting algorithm produces the reduced-order plant model which minimizes the additive uncertainty. The nominal transfer function estimate $\hat{p}$ and the estimate $\delta$ of the additive uncertainty $\delta_m$ are subsequently available to be used for optimization of robust controller performance and stability.
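The two ratio estimates in the abstract above (the plant estimate P_uy/P_uu and the additive-uncertainty estimate P_ue/P_uu) can be sketched with averaged cross-periodograms. This is a minimal illustration under strong assumptions: the toy plant is a noise-free static gain of 2.0, the "fitted" parametric model is a hypothetical static gain of 1.5, and the segment length and helper names are invented for the example. It does not implement the PMM order estimation or the curve-fitting step.

```python
import cmath
import random


def dft(x):
    """Naive discrete Fourier transform (adequate for short segments)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def cross_spectrum(a, b, seg_len):
    """Averaged cross-periodogram: mean over segments of conj(A[k]) * B[k]."""
    n_seg = len(a) // seg_len
    acc = [0j] * seg_len
    for s in range(n_seg):
        seg_a = dft(a[s * seg_len:(s + 1) * seg_len])
        seg_b = dft(b[s * seg_len:(s + 1) * seg_len])
        for k in range(seg_len):
            acc[k] += seg_a[k].conjugate() * seg_b[k]
    return [v / n_seg for v in acc]


def ratio_estimate(u, z, seg_len):
    """Return P_uz[k] / P_uu[k] for every frequency bin k."""
    p_uz = cross_spectrum(u, z, seg_len)
    p_uu = cross_spectrum(u, u, seg_len)
    return [p_uz[k] / p_uu[k] for k in range(seg_len)]


random.seed(0)
u = [random.gauss(0.0, 1.0) for _ in range(128)]  # stochastic input
y = [2.0 * v for v in u]                          # toy plant: static gain 2.0
y_hat = [1.5 * v for v in u]                      # hypothetical model output
e = [yi - yh for yi, yh in zip(y, y_hat)]         # output error e = y - y_hat

h = ratio_estimate(u, y, seg_len=16)      # nonparametric plant estimate
delta = ratio_estimate(u, e, seg_len=16)  # additive-uncertainty estimate
```

Because the toy plant is a pure gain with no noise, `h` equals the gain and `delta` equals the gap between plant and model in every bin; with a dynamic plant or measurement noise, the estimates would vary with frequency and the averaging over segments would matter.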

NASA Technical Reports Server

Multiclass classification of microarray data with repeated measurements: application to cancer

Author: Bumgarner Roger E
Yeung Ka Yee
Publication venue: BioMed Central
Publication date: 24/11/2003

Prediction of the diagnostic category of a tissue sample from its gene-expression profile and selection of relevant genes for class prediction have important applications in cancer research. We have developed the uncorrelated shrunken centroid (USC) and error-weighted, uncorrelated shrunken centroid (EWUSC) algorithms that are applicable to microarray data with any number of classes. We show that removing highly correlated genes typically improves classification results using a small set of genes.
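The idea of removing highly correlated genes, mentioned in the abstract above, can be sketched as a greedy filter on pairwise Pearson correlation. This is not the USC or EWUSC algorithm: the greedy keep-first order, the 0.9 threshold, the function names, and the sample profiles are assumptions chosen for illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def drop_correlated(profiles, threshold=0.9):
    """Greedily keep genes whose |correlation| with every kept gene
    stays at or below the threshold.

    profiles -- dict mapping gene name to its expression values across
                samples; insertion order is treated as relevance order.
    """
    kept = []
    for gene, values in profiles.items():
        if all(abs(pearson(values, profiles[k])) <= threshold for k in kept):
            kept.append(gene)
    return kept


# Hypothetical expression profiles across four samples.
profiles = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.0, 6.0, 8.1],  # nearly a multiple of g1
    "g3": [4.0, 1.0, 3.0, 2.0],
}
selected = drop_correlated(profiles)
```

Here `g2` is dropped because it is almost perfectly correlated with `g1`, while `g3` (correlation of -0.4 with `g1`) is retained.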

University of Washington: UW Tacoma Digital Commons

Springer - Publisher Connector

PubMed Central

Effects of cold water immersion on muscle oxygenation during repeated bouts of fatiguing exercise: a randomized controlled study

Author: Cheng JC
Choi MM
Fung NY
Hon M
Ting KH
Yeung E
Yeung S
Publication venue: Ovid Technologies (Wolters Kluwer Health)
Publication date: 08/04/2016

2015-2016 > Academic research: refereed > Publication in refereed journal. Version of Record. Publishe

The Hong Kong Polytechnic University Pao Yue-kong Library

PolyU Institutional Repository