TROM: A Testing-based Method for Finding Transcriptomic Similarity of Biological Samples

Author: Chen Yiling
Li Jingyi Jessica
Li Wei Vivian
Publication venue: Springer Science and Business Media LLC
Publication date: 30/08/2016

Comparative transcriptomics has gained increasing popularity in genomic research thanks to the development of high-throughput technologies, including microarray and next-generation RNA sequencing, that have generated large amounts of transcriptomic data. An important question is to understand the conservation and differentiation of biological processes in different species. We propose a testing-based method, TROM (Transcriptome Overlap Measure), for comparing transcriptomes within or between different species, and provide a different perspective to interpret transcriptomic similarity in contrast to traditional correlation analyses. Specifically, the TROM method focuses on identifying associated genes that capture molecular characteristics of biological samples, and subsequently comparing the biological samples by testing the overlap of their associated genes. We use simulation and real data studies to demonstrate that TROM is more powerful in identifying similar transcriptomes and more robust to stochastic gene expression noise than Pearson and Spearman correlations. We apply TROM to compare the developmental stages of six Drosophila species, C. elegans, S. purpuratus, D. rerio and mouse liver, and find interesting correspondence patterns that imply conserved gene expression programs in the development of these species. The TROM method is available as an R package on CRAN (http://cran.r-project.org/) with manuals and source code available at http://www.stat.ucla.edu/~jingyi.li/software-and-data/trom.html
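The idea of "testing the overlap of associated genes" can be illustrated with a one-sided hypergeometric test, a standard choice for overlap testing. The abstract does not say which test or correction the TROM package applies, so the sketch below is an assumption for illustration only; the function name `overlap_pvalue` and its parameters are hypothetical and are not part of the TROM R package.

```python
from math import comb


def overlap_pvalue(genes_a, genes_b, population_size):
    """One-sided hypergeometric p-value for the overlap of two gene sets.

    Returns P(X >= k), where k is the observed overlap and X is the overlap
    expected if genes_b were drawn at random from a population of
    `population_size` genes that contains all of genes_a.
    """
    a, b = set(genes_a), set(genes_b)
    k = len(a & b)                      # observed overlap
    n_a, n_b = len(a), len(b)
    if population_size < max(n_a, n_b):
        raise ValueError("population_size must cover both gene sets")
    total = comb(population_size, n_b)  # all ways to choose genes_b
    upper = min(n_a, n_b)               # largest possible overlap
    tail = sum(
        comb(n_a, i) * comb(population_size - n_a, n_b - i)
        for i in range(k, upper + 1)
    )
    return tail / total


# Hypothetical gene identifiers, for illustration only.
stage_1 = {"g1", "g2", "g3"}
stage_2 = {"g2", "g3", "g4"}
p = overlap_pvalue(stage_1, stage_2, population_size=10)
print(f"overlap p-value: {p:.4f}")
```

A small p-value indicates that two samples share more associated genes than chance would suggest. When many sample pairs are compared, a multiple-testing correction would normally be applied on top of this.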

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

MSIQ: Joint Modeling of Multiple RNA-seq Samples for Accurate Isoform Quantification

Author: Li Jingyi Jessica
Li Wei Vivian
Zhang Shihua
Zhao Anqi
Publication date: 02/12/2017

Next-generation RNA sequencing (RNA-seq) technology has been widely used to assess full-length RNA isoform abundance in a high-throughput manner. RNA-seq data offer insight into gene expression levels and transcriptome structures, enabling us to better understand the regulation of gene expression and fundamental biological processes. Accurate isoform quantification from RNA-seq data is challenging due to the information loss in sequencing experiments. A recent accumulation of multiple RNA-seq data sets from the same tissue or cell type provides new opportunities to improve the accuracy of isoform quantification. However, existing statistical or computational methods for multiple RNA-seq samples either pool the samples into one sample or assign equal weights to the samples when estimating isoform abundance. These methods ignore the possible heterogeneity in the quality of different samples and could result in biased and non-robust estimates. In this article, we develop a method, which we call "joint modeling of multiple RNA-seq samples for accurate isoform quantification" (MSIQ), for more accurate and robust isoform quantification by integrating multiple RNA-seq samples under a Bayesian framework. Our method aims to (1) identify a consistent group of samples with homogeneous quality and (2) improve isoform quantification accuracy by jointly modeling multiple RNA-seq samples while allowing for higher weights on the consistent group. We show that MSIQ provides a consistent estimator of isoform abundance, and we demonstrate the accuracy and effectiveness of MSIQ compared with alternative methods through simulation studies on D. melanogaster genes. We justify MSIQ's advantages over existing approaches via application studies on real RNA-seq data from human embryonic stem cells, brain tissues, and the HepG2 immortalized cell line.

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

Global yield curve dynamics and interactions: a dynamic Nelson-Siegel approach

Author: Diebold Francis X.
Li Canlin
Yue Vivian Z.
Publication date: 01/01/2007

The popular Nelson-Siegel (1987) yield curve is routinely fit to cross sections of intra-country bond yields, and Diebold and Li (2006) have recently proposed a dynamized version. In this paper we extend Diebold-Li to a global context, modeling a potentially large set of country yield curves in a framework that allows for both global and country-specific factors. In an empirical analysis of term structures of government bond yields for Germany, Japan, the U.K. and the U.S., we find that global yield factors do indeed exist and are economically important, generally explaining significant fractions of country yield curve dynamics, with interesting differences across countries.
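For readers unfamiliar with the curve named above, the following sketch evaluates the three-factor Nelson-Siegel form in the level/slope/curvature parameterization commonly associated with Diebold and Li. It covers only the single-country cross-sectional curve, not the global factor model of the paper. The function name, parameter names and all numeric values are illustrative assumptions, not figures taken from the paper.

```python
import math


def nelson_siegel_yield(maturity, level, slope, curvature, decay):
    """Nelson-Siegel yield at a given (strictly positive) maturity.

    y(tau) = level
           + slope     * (1 - exp(-decay*tau)) / (decay*tau)
           + curvature * ((1 - exp(-decay*tau)) / (decay*tau) - exp(-decay*tau))
    """
    if maturity <= 0:
        raise ValueError("maturity must be positive")
    x = decay * maturity
    # -expm1(-x) equals 1 - exp(-x) but stays accurate for very small x.
    slope_loading = -math.expm1(-x) / x
    curvature_loading = slope_loading - math.exp(-x)
    return level + slope * slope_loading + curvature * curvature_loading


# Illustrative factor values (percent) and decay rate (per month of maturity).
for months in (3, 12, 60, 120):
    y = nelson_siegel_yield(months, level=5.0, slope=-2.0, curvature=1.0, decay=0.0609)
    print(f"{months:>3} months: {y:.3f}%")
```

The loadings show why the factors are read as level, slope and curvature: the first loading is constant across maturities, the second decays from one toward zero, and the third is hump-shaped, starting and ending near zero.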

CiteSeerX

Crossref

Hochschulschriftenserver - Universität Frankfurt am Main

Information Asymmetry in Corporate Bond Markets

Author: Li Vivian
Publication venue: ScholarlyCommons
Publication date: 06/05/2019

Using data from all U.S. corporate bond transactions in 2008, intermediation chains are identified. Dealer centrality and past experience are used as proxies for the amount of information that a dealer has about the valuation of a given bond. It is shown that dealers that are closer together on a given intermediation chain are also expected to have closer levels of information. These relationships hold for both investment-grade bonds and junk bonds, as well as both before and after the onset of the 2008 financial crisis. This implies that intermediation chains in an over-the-counter market can be an effective way of responding to the presence of high information asymmetries between dealers, end buyers, and end sellers.

ScholarlyCommons@Penn

Global Yield Curve Dynamics and Interactions: A Dynamic Nelson-Siegel Approach

Author: Canlin Li
Francis X. Diebold
Vivian Z. Yue

Research Papers in Economics

EpiAlign: an alignment-based bioinformatic tool for comparing chromatin state sequences.

Author: Ge Xinzhou
Kwon Soo Bin
Li Jingyi Jessica
Li Wei Vivian
Xie Lingjue
Zhang Haowen
Publication venue: eScholarship, University of California
Publication date: 01/01/2019

The availability of genome-wide epigenomic datasets enables in-depth studies of epigenetic modifications and their relationships with chromatin structures and gene expression. Various alignment tools have been developed to align nucleotide or protein sequences in order to identify structurally similar regions. However, there are currently no alignment methods specifically designed for comparing multi-track epigenomic signals and detecting common patterns that may explain functional or evolutionary similarities. We propose a new local alignment algorithm, EpiAlign, designed to compare chromatin state sequences learned from multi-track epigenomic signals and to identify locally aligned chromatin regions. EpiAlign is a dynamic programming algorithm that novelly incorporates varying lengths and frequencies of chromatin states. We demonstrate the efficacy of EpiAlign through extensive simulations and studies on real data from the NIH Roadmap Epigenomics project. EpiAlign is able to extract recurrent chromatin state patterns along a single epigenome, and many of these patterns carry cell-type-specific characteristics. EpiAlign can also detect common chromatin state patterns across multiple epigenomes, and it will serve as a useful tool to group and distinguish epigenomic samples based on genome-wide or local chromatin state patterns.
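As background for the local alignment idea described above, here is a minimal Smith-Waterman-style dynamic program over two sequences of chromatin state labels. It is the textbook algorithm with fixed match, mismatch and gap scores, not EpiAlign itself: the abstract says EpiAlign additionally accounts for state lengths and frequencies, which this sketch does not attempt. The function name, the default scores and the state labels are assumptions for illustration.

```python
def local_alignment_score(seq_a, seq_b, match=2, mismatch=-1, gap=-1):
    """Best local alignment score between two label sequences.

    Classic Smith-Waterman recurrence, keeping only two rows of the
    dynamic programming table because only the score is returned.
    """
    cols = len(seq_b) + 1
    prev = [0] * cols
    best = 0
    for i in range(1, len(seq_a) + 1):
        curr = [0] * cols
        for j in range(1, cols):
            pair = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            curr[j] = max(
                0,                 # start a new local alignment here
                prev[j - 1] + pair,  # align the two labels
                prev[j] + gap,       # gap in seq_b
                curr[j - 1] + gap,   # gap in seq_a
            )
            best = max(best, curr[j])
        prev = curr
    return best


# Hypothetical chromatin state labels along two regions.
region_1 = ["E1", "E1", "E3", "E5"]
region_2 = ["E2", "E1", "E3", "E5", "E2"]
print(local_alignment_score(region_1, region_2))
```

Resetting negative scores to zero is what makes the alignment local: a shared run of states can score highly even when the flanking regions differ.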

eScholarship - University of California

Women and Economic Empowerment

Author: Dukakis Kitty
Li Vivian
Publication venue: ScholarWorks at UMass Boston
Publication date: 20/03/1990

This article proposes a women's economic agenda to help fulfill the needs of working women. The first component outlined is the appointment of women who are sensitive to the needs of all women, including the poor, to key decision-making positions. The agenda then calls for employers to recognize changing workforce demographics by initiating programs that can accommodate the needs of single-person as well as dual-income households. The final component is an argument for the implementation of pay equity.

University of Massachusetts Boston: ScholarWorks at UMass

Issues arising from benchmarking single-cell RNA sequencing imputation methods

Author: Li Jingyi Jessica
Li Wei Vivian
Publication date: 19/08/2019

On June 25th, 2018, Huang et al. published a computational method, SAVER, in Nature Methods for imputing dropout gene expression levels in single-cell RNA sequencing (scRNA-seq) data. Huang et al. performed a set of comprehensive benchmarking analyses, including comparison with the data from RNA fluorescence in situ hybridization, to demonstrate that SAVER outperformed two existing scRNA-seq imputation methods, scImpute and MAGIC. However, their computational analyses were based on semi-synthetic data that the authors had generated following the Poisson-Gamma model used in the SAVER method. We have therefore re-examined Huang et al.'s study. We find that the semi-synthetic data have very different properties from those of real scRNA-seq data and that the cell clusters used for benchmarking are inconsistent with the cell types labeled by biologists. We show that a reanalysis based on real scRNA-seq data and grounded on biological knowledge of cell types leads to different results and conclusions from those of Huang et al.

arXiv.org e-Print Archive

eScholarship - University of California

Modeling and analysis of RNA-seq data: a review from a statistical perspective

Author: Li Jingyi Jessica
Li Wei Vivian
Publication date: 01/05/2018

Background: Since the invention of next-generation RNA sequencing (RNA-seq) technologies, they have become a powerful tool to study the presence and quantity of RNA molecules in biological samples and have revolutionized transcriptomic studies. The analysis of RNA-seq data at four different levels (samples, genes, transcripts, and exons) involves multiple statistical and computational questions, some of which remain challenging to date. Results: We review RNA-seq analysis tools at the sample, gene, transcript, and exon levels from a statistical perspective. We also highlight the biological and statistical questions of most practical concern. Conclusion: The development of statistical and computational methods for analyzing RNA-seq data has made significant advances in the past decade. However, methods developed to answer the same biological question often rely on diverse statistical models and exhibit different performance under different scenarios. This review discusses and compares multiple commonly used statistical models regarding their assumptions, in the hope of helping users select appropriate methods as needed, as well as assisting developers in future method development.

arXiv.org e-Print Archive

eScholarship - University of California