Search CORE

221 research outputs found

pGQL: A probabilistic graphical query language for gene expression time courses

Author: A Schliep
A Schliep
Alexander Schliep
H Hochheiser
IG Costa
Ivan G Costa
J Ernst
KY Yeung
LR Rabiner
M Ashburner
MF Ramoni
R Durbin
Ruben Schilling
S Chu
Z Bar-Joseph
Z Bar-Joseph
Publication venue: BMC
Publication date: 01/01/2011
Field of study

Abstract Background Timeboxes are graphical user interface widgets that were proposed to specify queries on time course data. As queries can be very easily defined, an exploratory analysis of time course data is greatly facilitated. While timeboxes are effective, they have no provisions for dealing with noisy data or data with fluctuations along the time axis, which is very common in many applications. In particular, this is true for the analysis of gene expression time courses, which are mostly derived from noisy microarray measurements at few unevenly sampled time points. From a data mining point of view the robust handling of data through a sound statistical model is of great importance. Results We propose probabilistic timeboxes, which correspond to a specific class of Hidden Markov Models, that constitutes an established method in data mining. Since HMMs are a particular class of probabilistic graphical models we call our method Probabilistic Graphical Query Language. Its implementation was realized in the free software package pGQL. We evaluate its effectiveness in exploratory analysis on a yeast sporulation data set. Conclusions We introduce a new approach to define dynamic, statistical queries on time course data. It supports an interactive exploration of reasonably large amounts of data and enables users without expert knowledge to specify fairly complex statistical models with ease. The expressivity of our approach is by its statistical nature greater and more robust with respect to amplitude and frequency fluctuation than the prior, deterministic timeboxes.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

apex: phylogenetics with multiple genes.

Author: Archer F.
Goudet J.
Harris R.
Jombart T.
Kamvar Z.
Lapp H.
Paradis E.
Schliep K.
Publication venue: 'Wiley'
Publication date: 06/07/2016
Field of study

Genetic sequences of multiple genes are becoming increasingly common for a wide range of organisms including viruses, bacteria and eukaryotes. While such data may sometimes be treated as a single locus, in practice, a number of biological and statistical phenomena can lead to phylogenetic incongruence. In such cases, different loci should, at least as a preliminary step, be examined and analysed separately. The r software has become a popular platform for phylogenetics, with several packages implementing distance-based, parsimony and likelihood-based phylogenetic reconstruction, and an even greater number of packages implementing phylogenetic comparative methods. Unfortunately, basic data structures and tools for analysing multiple genes have so far been lacking, thereby limiting potential for investigating phylogenetic incongruence. In this study, we introduce the new r package apex to fill this gap. apex implements new object classes, which extend existing standards for storing DNA and amino acid sequences, and provides a number of convenient tools for handling, visualizing and analysing these data. In this study, we introduce the main features of the package and illustrate its functionalities through the analysis of a simple data set

Crossref

LSHTM Research Online

Serveur académique lausannois

PubMed Central

HAL Descartes

HAL-IRD

Spiral - Imperial College Digital Repository

Horizon / Pleins textes

HAL-CIRAD

Hal-Diderot

Conformational rearrangements upon start codon recognition in human 48S translation initiation complex

Author: Adio S.
Chari A.
Fischer N.
Goyal A.
Linden A.
Petrychenko V.
Rodnina M.
Schliep J.
Stark H.
Urlaub H.
Yi S.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 20/05/2022
Field of study

Selection of the translation start codon is a key step during protein synthesis in human cells. We obtained cryo-EM structures of human 48S initiation complexes and characterized the intermediates of codon recognition by kinetic methods using eIF1A as a reporter. Both approaches capture two distinct ribosome populations formed on an mRNA with a cognate AUG codon in the presence of eIF1, eIF1A, eIF2–GTP–Met-tRNAiMet and eIF3. The ‘open’ 40S subunit conformation differs from the human 48S scanning complex and represents an intermediate preceding the codon recognition step. The ‘closed’ form is similar to reported structures of complexes from yeast and mammals formed upon codon recognition, except for the orientation of eIF1A, which is unique in our structure. Kinetic experiments show how various initiation factors mediate the population distribution of open and closed conformations until 60S subunit docking. Our results provide insights into the timing and structure of human translation initiation intermediates and suggest the differences in the mechanisms of start codon selection between mammals and yeast

MPG.PuRe

Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data

Author: A Schliep
A Schliep
A Schliep
Alexander Schliep
B Edgar
C Niehrs
CLL Hendriks
D Tautz
EP Xing
G McLachlan
GJ McLachlan
H Ge
H Peng
H Peng
I Costa
I Lee
Ivan G Costa
J Bilmes
J Ernst
JY Pan
KY Yeung
KY Yeung
L Opitz
Lennart Opitz
M Ashburner
M Leptin
M Medvedovic
MB Eisen
MN Arbeitman
P Tomancak
P Tomancak
R Gonzalez
R Sokal
Roland Krause
SD Hooper
SK Ng
SVE Keränen
T Beissbarth
T Lange
V Stolc
W Pan
Y Luan
Z Bar-Joseph
Z Bar-Joseph
Z Lu
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background: Gene expression measurements during the development of the fly Drosophila melanogaster are routinely used to find functional modules of temporally co-expressed genes. Complimentary large data sets of in situ RNA hybridization images for different stages of the fly embryo elucidate the spatial expression patterns. Results: Using a semi-supervised approach, constrained clustering with mixture models, we can find clusters of genes exhibiting spatio-temporal similarities in expression, or syn-expression. The temporal gene expression measurements are taken as primary data for which pairwise constraints are computed in an automated fashion from raw in situ images without the need for manual annotation. We investigate the influence of these pairwise constraints in the clustering and discuss the biological relevance of our results. Conclusion: Spatial information contributes to a detailed, biological meaningful analysis of temporal gene expression data. Semi-supervised learning provides a flexible, robust and efficient framework for integrating data sources of differing quality and abundance

Crossref

Springer - Publisher Connector

PubMed Central

MPG.PuRe

Fast MCMC sampling for hidden markov models to determine copy number variations

Author: A Krogh
A Schliep
A Viterbi
AB Olshen
Alexander Schliep
AM Snijders
CM Bishop
D Pelleg
D Pinto
F Picard
H Willenbrock
J Fridlyand
J Fritsch
K Wang
L Rabiner
LE Baum
M Bredel
Md Pavel Mahmud
P Wang
PHC Eilers
Q McNemar
R Andersson
R Durbin
R Tibshirani
RJD Leeuw
S Chib
S Geman
S Guha
S Morganella
S Mozes
S Salvador
S Scott
S Srivastava
SP Shah
SP Shah
SR Eddy
T Harada
W Gilks
WR Lai
Y Nannya
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Hidden Markov Models (HMM) are often used for analyzing Comparative Genomic Hybridization (CGH) data to identify chromosomal aberrations or copy number variations by segmenting observation sequences. For efficiency reasons the parameters of a HMM are often estimated with maximum likelihood and a segmentation is obtained with the Viterbi algorithm. This introduces considerable uncertainty in the segmentation, which can be avoided with Bayesian approaches integrating out parameters using Markov Chain Monte Carlo (MCMC) sampling. While the advantages of Bayesian approaches have been clearly demonstrated, the likelihood based approaches are still preferred in practice for their lower running times; datasets coming from high-density arrays and next generation sequencing amplify these problems. Results We propose an approximate sampling technique, inspired by compression of discrete sequences in HMM computations and by <it>kd</it>-trees to leverage spatial relations between data points in typical data sets, to speed up the MCMC sampling. Conclusions We test our approximate sampling method on simulated and biological ArrayCGH datasets and high-density SNP arrays, and demonstrate a speed-up of 10 to 60 respectively 90 while achieving competitive results with the state-of-the art Bayesian approaches. <it>Availability: </it>An implementation of our method will be made available as part of the open source GHMM library from <url>http://ghmm.org</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Novel selective β1-adrenoceptor antagonists for concomitant cardiovascular and respiratory disease

Author: Australian and Swedish Pindolol Study Group
Barrie Kellam
Bell D. S. H.
Black J. W.
Christophe Fromont
CIBIS‐II
Finlayson K.
Gopal P. Jadhav
Hjalmarson A.
Jeanette Woolard
Jillian G. Baker
Kevin S. J. Thompson
Leopold G.
Lombardo F.
Morales D. R.
Odeh M.
Peter M. Fischer
Schliep H. J.
Shailesh N. Mistry
Sheila M. Gardiner
Stephen J. Hill
Valko K.
Williams I. P.
Xamoterol in Severe Heart Failure Study Group
Publication venue: 'FASEB'
Publication date: 12/04/2017
Field of study

β-Blockers reduce mortality and improve symptoms in people with heart disease. However, current clinically available β-blockers have poor selectivity for the cardiac β1-adrenoceptor (AR) over the lung β2-AR. Unwanted β2-blockade risks causing life-threatening bronchospasm and a reduction in the efficacy of β2-agonist emergency rescue therapy. Thus current life-prolonging β-blockers are contraindicated in people with both heart disease and asthma. Here we describe NDD-713 and NDD-825, novel highly β1-selective neutral antagonists with good pharmaceutical properties that can potentially overcome this limitation. Radioligand binding studies and functional assays using human receptors expressed in CHO cells demonstrate that NDD-713 and NDD-825 have nanomolar β1-AR affinity, greater than 500-fold β1-AR vs β2-AR selectivity and no agonism. Studies in conscious rats demonstrated that they are orally bioavailable and cause pronounced β1-mediated reduction of heart rate while showing no effect on β2-mediated hindquarters vasodilatation. The compounds also have good disposition properties and show no adverse toxicological effects. They potentially offer a truly cardioselective β-blocker therapy for the large number of people with heart and respiratory, or peripheral vascular comorbidities

Nottingham ePrints

Nottingham eTheses

Crossref

Repository@Nottingham

Dynamic metabolomic data analysis: a tutorial review

Author: A. Conesa
A. K. Smilde
A. K. Smilde
A. K. Smilde
A. K. Smilde
A. Schliep
B. R. Bakshi
C. Ambroise
C. M. Rubingh
D. Holtz-Eakin
D. J. Vis
D. J. Vis
E. Velzen van
E. Velzen van
F. Roelfsema
F. Wu
G. E. P. Box
G. Stephanopoulos
H. A. L. Kiers
H. Antti
H. C. J. Hoefsloot
H. C. Keun
H. Pijl
I. T. Jolliffe
J. A. Westerhuis
J. A. Westerhuis
J. Cao
J. D. Storey
J. Greef van der
J. Greef van der
J. J. Jansen
J. J. Jansen
J. J. Jansen
J. O. Ramsay
J. van der Greef
J. Xu
K. V. Mardia
L. Glass
L. Hood
L. Ljung
L. Stahle
M. E. Timmerman
M. H. Kaspar
M. J. L. Hoon de
M. J. Nueda
M. Rantalainen
M. Samoilov
O. E. Noord de
O. Wolkenhauer
P. D. Harrington
P. H. C. Eilers
P. Jonsson
P. Kok
P. Molenaar
R. Apostu
R. H. Jellema
R. J. P. Berlo van
R. Kleemann
R. Larsen
S. Bijlsma
S. Bijlsma
S. J. Qin
S. L. Rodriguez-Zas
S. R. Searle
S. W. Kok
S. Wold
T. Anderson
T. E. Fortmann
W. F. Ku
W. H. M. Heijne
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

In metabolomics, time-resolved, dynamic or temporal data is more and more collected. The number of methods to analyze such data, however, is very limited and in most cases the dynamic nature of the data is not even taken into account. This paper reviews current methods in use for analyzing dynamic metabolomic data. Moreover, some methods from other fields of science that may be of use to analyze such dynamic metabolomics data are described in some detail. The methods are put in a general framework after providing a formal definition on what constitutes a ‘dynamic’ method. Some of the methods are illustrated with real-life metabolomics examples

Crossref

PubMed Central

Leiden University Scholary Publications

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Multiconstrained gene clustering based on generalized projections

Author: A Schlicker
A Schliep
Alan Wee-Chung Liew
B Adryan
C Wolting
D Dembélé
D Hanisch
D Huang
D Tritchler
DM Blei
E Kreyszig
H Stark
Hong Yan
J Zeng
J Zeng
J Zeng
J Zeng
J Zeng
J Zeng
J Zeng
Jia Zeng
JL Sevilla
JZ Wang
L Tari
M Aubry
M Kanehisa
M Shiga
MB Eisen
MF Ramoni
MK Kerr
N Bolshakova
P Tamayo
PT Spellman
PW Lord
R Steuer
S Tavazoie
S Zhu
S Zhu
Shanfeng Zhu
TR Hughes
W Feng
W Pan
X Gan
X Guo
XQ Cao
XQ Cao
XQ Cao
Z Bar-Joseph
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Gene clustering for annotating gene functions is one of the fundamental issues in bioinformatics. The best clustering solution is often regularized by multiple constraints such as gene expressions, Gene Ontology (GO) annotations and gene network structures. How to integrate multiple pieces of constraints for an optimal clustering solution still remains an unsolved problem. Results We propose a novel multiconstrained gene clustering (MGC) method within the generalized projection onto convex sets (POCS) framework used widely in image reconstruction. Each constraint is formulated as a corresponding set. The generalized projector iteratively projects the clustering solution onto these sets in order to find a consistent solution included in the intersection set that satisfies all constraints. Compared with previous MGC methods, POCS can integrate multiple constraints from different nature without distorting the original constraints. To evaluate the clustering solution, we also propose a new performance measure referred to as Gene Log Likelihood (GLL) that considers genes having more than one function and hence in more than one cluster. Comparative experimental results show that our POCS-based gene clustering method outperforms current state-of-the-art MGC methods. Conclusions The POCS-based MGC method can successfully combine multiple constraints from different nature for gene clustering. Also, the proposed GLL is an effective performance measure for the soft clustering solutions.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Computational archaeology of the Pristionchus pacificus genome reveals evidence of horizontal gene transfers from insects

Author: AD Cutter
AJ Arvey
AM Weller
C Dieterich
Christian Rödelsperger
Consortium TGS
EGJ Danchin
H Tanaka
J Parkinson
JB Plotkin
JC Vaughn
JCD Hotopp
JG Lawrence
KP Schliep
L Breiman
M Blaxter
M Herrmann
M Mitreva
MG Mayer
N Borchert
P Abad
R Merkl
Ralf J Sommer
RC Edgar
RJ Sommer
S Karlin
S Karlin
S Rashkova
S Schaack
S Youngman
SB Daniels
SF Altschul
SQ Le
T Wittkop
V Zupunski
W Li
WE Mayer
WT Tay
Y Boucher
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The recent sequencing of nematode genomes has laid the basis for comparative genomics approaches to study the impact of horizontal gene transfer (HGT) on the adaptation to new environments and the evolution of parasitism. In the beetle associated nematode <it>Pristionchus pacificus </it>HGT events were found to involve cellulase genes of microbial origin and Diapausin genes that are known from beetles, but not from other nematodes. The insect-to-nematode horizontal transfer is of special interest given that <it>P. pacificus </it>shows a tight association with insects. Results In this study we utilized the observation that horizontally transferred genes often exhibit codon usage patterns more similar to that of the donor than that of the acceptor genome. We introduced GC-normalized relative codon frequencies as a measure to detect characteristic features of <it>P. pacificus </it>orphan genes that show no homology to other nematode genes. We found that atypical codon usage is particularly prevalent in <it>P. pacificus </it>orphans. By comparing codon usage profiles of 71 species, we detected the most significant enrichment in insect-like codon usage profiles. In cross-species comparisons, we identified 509 HGT candidates that show a significantly higher similarity to insect-like profiles than genes with nematode homologs. The most abundant gene family among these genes are non-LTR retrotransposons. Speculating that retrotransposons might have served as carriers of foreign genetic material, we found a significant local clustering tendency of orphan genes in the vicinity of retrotransposons. Conclusions Our study combined codon usage bias, phylogenetic analysis, and genomic colocalization into a general picture of the computational archaeology of the <it>P. pacificus </it>genome and suggests that a substantial fraction of the gene repertoire is of insect origin. We propose that the <it>Pristionchus</it>-beetle association has facilitated HGT and discuss potential vectors of these events.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

A biclustering algorithm based on a Bicluster Enumeration Tree: application to DNA microarray data

Author: A Ben-Dor
A Dharan
A Prelic
A Schliep
A Tanay
A Yip
B Pontes
C Cano
C Gallo
DD Lewis
EL Lehmann
F Angiulli
F Divina
GF Berriz
H Turner
H Wang
IS Dhillon
J Liu
J Yang
JA Hartigan
Jin-Kao Hao
JS Aguilar-Ruiz
K Bryan
K Cheng
L Lazzeroni
L Teng
Mourad Elloumi
R Agrawal
R Balasubramaniyan
S Barkow
S Bergmann
S Bleuler
S Mitra
S Tavazoie
SC Madeira
SC Madeira
SD Peddada
T Hofmann
U Maulik
W Gaul
Wassim Ayadi
X Liu
Y Cheng
Y Cheng
Y Christinat
Y Luan
Y Okada
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background In a number of domains, like in DNA microarray data analysis, we need to cluster simultaneously rows (genes) and columns (conditions) of a data matrix to identify groups of rows coherent with groups of columns. This kind of clustering is called <it>biclustering</it>. Biclustering algorithms are extensively used in DNA microarray data analysis. More effective biclustering algorithms are highly desirable and needed. Methods We introduce <it>BiMine</it>, a new enumeration algorithm for biclustering of DNA microarray data. The proposed algorithm is based on three original features. First, <it>BiMine </it>relies on a new evaluation function called <it>Average Spearman's rho </it>(ASR). Second, <it>BiMine </it>uses a new tree structure, called <it>Bicluster Enumeration Tree </it>(BET), to represent the different biclusters discovered during the enumeration process. Third, to avoid the combinatorial explosion of the search tree, <it>BiMine </it>introduces a parametric rule that allows the enumeration process to cut tree branches that cannot lead to good biclusters. Results The performance of the proposed algorithm is assessed using both synthetic and real DNA microarray data. The experimental results show that <it>BiMine </it>competes well with several other biclustering methods. Moreover, we test the biological significance using a gene annotation web-tool to show that our proposed method is able to produce biologically relevant biclusters. The software is available upon request from the authors to academic users.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Okina

Hal-Diderot