Search CORE

18 research outputs found

PENALIZED LIKELIHOOD AND BAYESIAN METHODS FOR SPARSE CONTINGENCY TABLES: AN ANALYSIS OF ALTERNATIVE SPLICING IN FULL-LENGTH cDNA LIBRARIES

Author: Buhlmann Peter
Dahinden Corinne
Emerick Mark C.
Parmigiani Giovanni
Publication venue: Collection of Biostatistics Research Archive
Publication date: 07/11/2006
Field of study

We develop methods to perform model selection and parameter estimation in loglinear models for the analysis of sparse contingency tables to study the interaction of two or more factors. Typically, datasets arising from so-called full-length cDNA libraries, in the context of alternatively spliced genes, lead to such sparse contingency tables. Maximum Likelihood estimation of log-linear model coefficients fails to work because of zero cell entries. Therefore new methods are required to estimate the coefficients and to perform model selection. Our suggestions include computationally efficient penalization (Lasso-type) approaches as well as Bayesian methods using MCMC. We compare these procedures in a simulation study and we apply the proposed methods to full-length cDNA libraries, yielding valuable insight into the biological process of alternative splicing

Collection Of Biostatistics Research Archive

Single-cell RNA sequencing identifies a paracrine interaction that may drive oncogenic notch signaling in human adenoid cystic carcinoma

Author: Aster Jon C
Bernstein Bradley E
Davis Daniel
Deschler Daniel G
Drier Yotam
Emerick Kevin S
Faquin William C
Lefranc-Torres Armida
Lin Derrick T
Miller Lauren E
Parikh Anuraag S
Puram Sidharth V
Rodarte-Rascon Alejandro I
Varvares Mark A
Wizel Avishai
Publication venue: 'Elsevier BV'
Publication date: 29/11/2022
Field of study

Salivary adenoid cystic carcinoma (ACC) is a rare, biologically unique biphasic tumor that consists of malignant myoepithelial and luminal cells. MYB and Notch signaling have been implicated in ACC pathophysiology, but in vivo descriptions of these two programs in human tumors and investigation into their active coordination remain incomplete. We utilize single-cell RNA sequencing to profile human head and neck ACC, including a comparison of primary ACC with a matched local recurrence. We define expression heterogeneity in these rare tumors, uncovering diversity in myoepithelial and luminal cell expression. We find differential expression of Notch ligands DLL1, JAG1, and JAG2 in myoepithelial cells, suggesting a paracrine interaction that may support oncogenic Notch signaling. We validate this selective expression in three published cohorts of patients with ACC. Our data provide a potential explanation for the biphasic nature of low- and intermediate-grade ACC and may help direct new therapeutic strategies against these tumors

Digital Commons@Becker

PubMed Central

Penalized likelihood for sparse contingency tables with an application to full-length cDNA libraries

Author: A Mironov
BS Everitt
C Southan
Corinne Dahinden
D Brett
D Brett
F Liang
Giovanni Parmigiani
International Human Genome Sequencing Consortium
International Human Genome Sequencing Consortium
M Yuan
M Zavolan
Mark C Emerick
MR Regan
Peter Bühlmann
R Christensen
R Tibshirani
S Rosset
SL Lauritzen
T Imanishi
The FANTOM Consortium
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background The joint analysis of several categorical variables is a common task in many areas of biology, and is becoming central to systems biology investigations whose goal is to identify potentially complex interaction among variables belonging to a network. Interactions of arbitrary complexity are traditionally modeled in statistics by log-linear models. It is challenging to extend these to the high dimensional and potentially sparse data arising in computational biology. An important example, which provides the motivation for this article, is the analysis of so-called full-length cDNA libraries of alternatively spliced genes, where we investigate relationships among the presence of various exons in transcript species. Results We develop methods to perform model selection and parameter estimation in log-linear models for the analysis of sparse contingency tables, to study the interaction of two or more factors. Maximum Likelihood estimation of log-linear model coefficients might not be appropriate because of the presence of zeros in the table's cells, and new methods are required. We propose a computationally efficient ℓ1-penalization approach extending the Lasso algorithm to this context, and compare it to other procedures in a simulation study. We then illustrate these algorithms on contingency tables arising from full-length cDNA libraries. Conclusion We propose regularization methods that can be used successfully to detect complex interaction patterns among categorical variables in a broad range of biological problems involving categorical variables.</p

Repository for Publications and Research Data

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes

BACKGROUND: RNA metabolism, through 'combinatorial splicing', can generate enormous structural diversity in the proteome. Alternative domains may interact, however, with unpredictable phenotypic consequences, necessitating integrated RNA-level regulation of molecular composition. Splicing correlations within transcripts of single genes provide valuable clues to functional relationships among molecular domains as well as genomic targets for higher-order splicing regulation. RESULTS: We present tools to visualize complex splicing patterns in full-length cDNA libraries. Developmental changes in pair-wise correlations are presented vectorially in 'clock plots' and linkage grids. Higher-order correlations are assessed statistically through Monte Carlo analysis of a log-linear model with an empirical-Bayes estimate of the true probabilities of observed and unobserved splice forms. Log-linear coefficients are visualized in a 'spliceprint,' a signature of splice correlations in the transcriptome. We present two novel metrics: the linkage change index, which measures the directional change in pair-wise correlation with tissue differentiation, and the accuracy index, a very simple goodness-of-fit metric that is more sensitive than the integrated squared error when applied to sparsely populated tables, and unlike chi-square, does not diverge at low variance. Considerable attention is given to sparse contingency tables, which are inherent to single-gene libraries. CONCLUSION: Patterns of splicing correlations are revealed, which span a broad range of interaction order and change in development. The methods have a broad scope of applicability, beyond the single gene – including, for example, multiple gene interactions in the complete transcriptome

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Collection Of Biostatistics Research Archive

Beyond the therapeutic: a Habermasian view of self-help groups’ place in the public sphere

Author: A Baldacchino
A Ben-Ari
A Giddens
A Giddens
A Katz
A McKee
A McLean
A Melucci
B Chamak
B Steinke
C Munn-Giddings
C Munn-Giddings
Carol Munn-Giddings
D Hayes
D Kelleher
DKP Wong
E Dunne
E Hatzidimitriadou
F Dickerson
F Williams
G Edwards
G Scambler
G Scambler
G Scambler
G Williams
G Williams
H Anheier
H Zoller
I Buchanan
I Goldstrom
J Ablon
J Alexander
J Bond
J Habermas
J Habermas
J Habermas
J Landes
J Rappaport
J Rosencrance
K Barker
K Barker
K Elsdon
K Karppinen
K Landzelius
L Adamsen
L Goode
L Medvene
L Rootes
M Gardiner
M Jacobs
M Karlsson
M Lieberman
M Stewart
Mark Avis
N Fyfe
P Brown
P Conrad
P Conrad
P Dahlgren
P Garrett
P Godin
P Radin
R Emerick
R Hedley
R Whelan
S Benhabib
S Chaudhary
S Daly
S Damen
S Hodge
S Houston
Sarah Chaudhary
T Borkman
T Borkman
T Graham
T Stolze
V Nash
Z Bauman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Self-help groups in the United Kingdom continue to grow in number and address virtually every conceivable health condition, but they remain the subject of very little theoretical analysis. The literature to date has predominantly focused on their therapeutic effects on individual members. And yet they are widely presumed to fulfil a broader civic role and to encourage democratic citizenship. The article uses Habermas’ model of the public sphere as an analytical tool with which to reconsider the literature on self-help groups in order to increase our knowledge of their civic functions. In doing this it also aims to illustrate the continuing relevance of Habermas’ work to our understanding of issues in health and social care. We consider, within the context of current health policies and practices, the extent to which self-help groups with a range of different forms and functions operate according to the principles of communicative rationality that Habermas deemed key to democratic legitimacy. We conclude that self-help groups’ civic role is more complex than is usually presumed and that various factors including groups’ leadership, organisational structure and links with public agencies can affect their efficacy within the public sphere

Nottingham ePrints

Nottingham eTheses

Crossref

Repository@Nottingham

Springer - Publisher Connector

Anglia Ruskin Research

PubMed Central

Penalized Likelihood for Sparse Contingency Tables with an Application to Full-Length cDNA Libraries

Author: Corinne Dahinden
Giovanni Parmigiani
Mark C Emerick
Peter Bühlmann
Peter Bühlmann
Publication venue
Publication date
Field of study

Background: The joint analysis of several categorical variables is a common task in many areas of biology, and is becoming central to systems biology investigations whose goal is to identify potentially complex interaction among variables belonging to a network. Interactions of arbitrary complexity are traditionally modeled in statistics by log-linear models. It is challenging to extend these to the high dimensional and potentially sparse data arising in computational biology. An important example, which provides the motivation for this article, is the analysis of so-called full-length cDNA libraries of alternatively spliced genes, where we investigate relationships among the presence of various exons in transcript species. Results: We develop methods to perform model selection and parameter estimation in log-linear models for the analysis of sparse contingency tables, to study the interaction of two or more factors. Maximum Likelihood estimation of log-linear model coefficients is not appropriate because of the presence of zeros in the table’s cells, and new methods are required. We propose a computationally efficient ℓ1- penalization approach extending the Lasso algorithm to this context, and compare it to other procedures in a simulation study. We then illustrate these algorithms on contingency tables arising from full-length cDNA libraries. Conclusions: We propose regularization methods that can be used successfully to detect complex interactio

CiteSeerX

Mucoepidermoid Carcinoma of the Parotid: Very Close Margins and Adjuvant Radiotherapy

Author: Anuraag Parikh
Daniel G. Deschler
Derrick T. Lin
Jenny X. Chen
Joseph Zenga
Kevin S. Emerick
Mark A. Varvares
William C. Faquin
Zizi Yu
Publication venue: 'S. Karger AG'
Publication date
Field of study

Crossref