Search CORE

469 research outputs found

Skittle: A 2-Dimensional Genome Visualization Tool

Author: Birney
D Sussillo
E Lieberman-Aiden
EN Trifonov
EN Trifonov
G Benson
GM Weinstock
GS Baldwin
I López-Villaseñor
J Sánchez
JF Canny
John C Sanford
Josiah D Seaman
M Costantini
MB Gerstein
MK Rudd
P Schieg
S Kurtz
X She
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background It is increasingly evident that there are multiple and overlapping patterns within the genome, and that these patterns contain different types of information - regarding both genome function and genome history. In order to discover additional genomic patterns which may have biological significance, novel strategies are required. To partially address this need, we introduce a new data visualization tool entitled Skittle. Results This program first creates a 2-dimensional nucleotide display by assigning four colors to the four nucleotides, and then text-wraps to a user adjustable width. This nucleotide display is accompanied by a "repeat map" which comprehensively displays all local repeating units, based upon analysis of all possible local alignments. Skittle includes a smooth-zooming interface which allows the user to analyze genomic patterns at any scale. Skittle is especially useful in identifying and analyzing tandem repeats, including repeats not normally detectable by other methods. However, Skittle is also more generally useful for analysis of any genomic data, allowing users to correlate published annotations and observable visual patterns, and allowing for sequence and construct quality control. Conclusions Preliminary observations using Skittle reveal intriguing genomic patterns not otherwise obvious, including structured variations inside tandem repeats. The striking visual patterns revealed by Skittle appear to be useful for hypothesis development, and have already led the authors to theorize that imperfect tandem repeats could act as information carriers, and may form tertiary structures within the interphase nucleus.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Universality, limits and predictability of gold-medal performances at the Olympic Games

Author: A Guttmann
A Guttmann
A Petersen
AM Nevill
AM Nevill
AM Petersen
AM Petersen
B Efron
B Sjödin
C Holden
C Sire
E Ben-Naim
EA Codling
EE Peters
F Péronnet
F Radicchi
FD Desgorces
Filippo Radicchi
G Berthelot
G Waitt
G Wergen
G Wergen
G Yaari
GL Gerstein
H Preuss
HJ Grubb
J Beirlant
J Duch
J Swaddling
JS Katz
K Rice
M Atkinson
M Guillaume
MA Stephens
MB Wilk
MW Denny
NA Hill
NCC Sharp
PB Sparling
PE Di Prampero
R Tibshirani
RB D’Agostino
RB Knight
RD Mandell
Renaud Lambiotte
S Redner
S Reeve
S Saavedra
S Sabhapandit
S Savaglio
SS Shapiro
TW Anderson
W Alt
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

Inspired by the Games held in ancient Greece, modern Olympics represent the world's largest pageant of athletic skill and competitive spirit. Performances of athletes at the Olympic Games mirror, since 1896, human potentialities in sports, and thus provide an optimal source of information for studying the evolution of sport achievements and predicting the limits that athletes can reach. Unfortunately, the models introduced so far for the description of athlete performances at the Olympics are either sophisticated or unrealistic, and more importantly, do not provide a unified theory for sport performances. Here, we address this issue by showing that relative performance improvements of medal winners at the Olympics are normally distributed, implying that the evolution of performance values can be described in good approximation as an exponential approach to an a priori unknown limiting performance value. This law holds for all specialties in athletics-including running, jumping, and throwing-and swimming. We present a self-consistent method, based on normality hypothesis testing, able to predict limiting performance values in all specialties. We further quantify the most likely years in which athletes will breach challenging performance walls in running, jumping, throwing, and swimming events, as well as the probability that new world records will be established at the next edition of the Olympic Games.Comment: 8 pages, 3 figures, 1 table. Supporting information files and data are available at filrad.homelinux.or

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Improved annotation of 3' untranslated regions and complex loci by combination of strand-specific direct RNA sequencing, RNA-seq and ESTs

Author: A Derti
A Dobin
A Sherstnev
AI Reid
Alexander Sherstnev
B Langmead
BJ Haas
BJ Haas
BS Yoon
C Burge
C Cole
C Luo
C Trapnell
CE Joyce
CH Jan
Christian Cole
Céline Duc
D Brawand
DM Church
F Ozsolak
F Ozsolak
F Ozsolak
Geoffrey J. Barton
Gordon G. Simpson
H Stroud
H Zou
HK Saini
I Ulitsky
J Bracht
J Harrow
JE Collins
JH Yang
Junfang Song
Kate G. Storey
L Jiang
M Fujii
M Garber
M Yandell
MB Gerstein
Nicholas J. Schurch
P Lamesch
PE Boardman
Sara J. Brown
T Pelissier
Thomas Preiss
TS Becker
V Curwen
V Hamburger
W. H. Irwin McLean
X Cai
Y Kurihara
Y Lee
Z Moqtaderi
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 11/11/2013
Field of study

The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct annotation is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3-prime untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences which locates 3-prime polyadenylation sites to within +/- 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3-prime UTR re-annotation (including extension of one 3-prime UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publically available annotations in the context of their own experimental dataComment: 44 pages, 9 figure

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

University of Dundee Online Publications

GIFtS: annotation landscape analysis with GeneCards

Author: A Platzer
Aron Inger
Arye Harel
B Turgeon
Birney Eea
D Kemmer
DM Greenawalt
Doron Lancet
Gil Stelzer
Irina Dalah
Liora Strichman-Almashanu
M Ashburner
M Rebhan
M Safran
M Safran
Marilyn Safran
MB Gerstein
N Rosen
O Shmueli
P Saetre
S Washietl
TJ Buza
V Chalifa-Caspi
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Gene annotation is a pivotal component in computational genomics, encompassing prediction of gene function, expression analysis, and sequence scrutiny. Hence, quantitative measures of the annotation landscape constitute a pertinent bioinformatics tool. GeneCards® is a gene-centric compendium of rich annotative information for over 50,000 human gene entries, building upon 68 data sources, including Gene Ontology (GO), pathways, interactions, phenotypes, publications and many more. Results We present the GeneCards Inferred Functionality Score (GIFtS) which allows a quantitative assessment of a gene's annotation status, by exploiting the unique wealth and diversity of GeneCards information. The GIFtS tool, linked from the GeneCards home page, facilitates browsing the human genome by searching for the annotation level of a specified gene, retrieving a list of genes within a specified range of GIFtS value, obtaining random genes with a specific GIFtS value, and experimenting with the GIFtS weighting algorithm for a variety of annotation categories. The bimodal shape of the GIFtS distribution suggests a division of the human gene repertoire into two main groups: the high-GIFtS peak consists almost entirely of protein-coding genes; the low-GIFtS peak consists of genes from all of the categories. Cluster analysis of GIFtS annotation vectors provides the classification of gene groups by detailed positioning in the annotation arena. GIFtS also provide measures which enable the evaluation of the databases that serve as GeneCards sources. An inverse correlation is found (for GIFtS>25) between the number of genes annotated by each source, and the average GIFtS value of genes associated with that source. Three typical source prototypes are revealed by their GIFtS distribution: genome-wide sources, sources comprising mainly highly annotated genes, and sources comprising mainly poorly annotated genes. The degree of accumulated knowledge for a given gene measured by GIFtS was correlated (for GIFtS>30) with the number of publications for a gene, and with the seniority of this entry in the HGNC database. Conclusion GIFtS can be a valuable tool for computational procedures which analyze lists of large set of genes resulting from wet-lab or computational research. GIFtS may also assist the scientific community with identification of groups of uncharacterized genes for diverse applications, such as delineation of novel functions and charting unexplored areas of the human genome.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

FlexOracle: predicting flexible hinges by identification of stable domains

Author: AJ Bjorkman
AJ Rader
AJ Rader
AR Means
AS Siddiqui
BM Hespenheide
DJ Jacobs
EA Abbondanzieri
I Bahar
J Janin
J Lanman
J Painter
J Schymkowitz
J Zheng
JW Schymkowitz
L Holm
LW Yang
M Gerstein
M Shatsky
Mark B Gerstein
Mark Berjanskii and David S Wishart
MB Swindells
MC Anguera
MF Thorpe
MF Thorpe D.J. Jacob
N Armstrong
N Nagarajan
QC Dmitry A Kondrashov and George
R Linding
RJ M Gerstein T Johnson
S Flores
S Hayward
S Jones
S Kundu
S Wells
SA Ahmed
Samuel C Flores
SW Choi
T Ooi
V Alexandrov
WG Krebs
WG Krebs
WL Jorgensen
X Sun
YS Babu
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Protein motions play an essential role in catalysis and protein-ligand interactions, but are difficult to observe directly. A substantial fraction of protein motions involve hinge bending. For these proteins, the accurate identification of flexible hinges connecting rigid domains would provide significant insight into motion. Programs such as GNM and FIRST have made global flexibility predictions available at low computational cost, but are not designed specifically for finding hinge points. Results Here we present the novel FlexOracle hinge prediction approach based on the ideas that energetic interactions are stronger <it>within </it>structural domains than <it>between </it>them, and that fragments generated by cleaving the protein at the hinge site are independently stable. We implement this as a tool within the Database of Macromolecular Motions, MolMovDB.org. For a given structure, we generate pairs of fragments based on scanning all possible cleavage points on the protein chain, compute the energy of the fragments compared with the undivided protein, and predict hinges where this quantity is minimal. We present three specific implementations of this approach. In the first, we consider only pairs of fragments generated by cutting at a <it>single </it>location on the protein chain and then use a standard molecular mechanics force field to calculate the enthalpies of the two fragments. In the second, we generate fragments in the same way but instead compute their free energies using a knowledge based force field. In the third, we generate fragment pairs by cutting at <it>two </it>points on the protein chain and then calculate their free energies. Conclusion Quantitative results demonstrate our method's ability to predict known hinges from the Database of Macromolecular Motions.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Cancer somatic mutations cluster in a subset of regulatory sites predicted from the ENCODE data

Author: A Mortazavi
A Pohl
A Visel
AP Boyle
C Melton
David R. Westhead
DK Goode
FW Huang
J Ernst
JA Wamstad
JH Friedman
JR Landry
KD MacIsaac
M. S. Vijayabaskar
MB Gerstein
MS Lawrence
N Weinhold
Nisar A. Shar
NJ Fredriksson
PA Futreal
RE Thurman
RS Hansen
S Djebali
SA Forbes
TH Rabbitts
WJ Kent
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Background: Transcriptional regulation of gene expression is essential for cellular differentiation and function, and defects in the process are associated with cancer. The ENCODE project has mapped potential regulatory sites across the complete genome in many cell types, and these regions have been shown to harbour many of the somatic mutations that occur in cancer cells, suggesting that their effects may drive cancer initiation and development. The ENCODE data suggests a very large number of regulatory sites, and methods are needed to identify those that are most relevant and to connect them to the genes that they control. Methods: Predictive models of gene expression were developed by integrating the ENCODE data for regulation, including transcription factor binding and DNase1 hypersensitivity, with RNA-seq data for gene expression. A penalized regression method was used to identify the most predictive potential regulatory sites for each transcript. Known cancer somatic mutations from the COSMIC database were mapped to potential regulatory sites, and we examined differences in the mapping frequencies associated with sites chosen in regulatory models and other (rejected) sites. The effects of potential confounders, for example replication timing, were considered. Results: Cancer somatic mutations preferentially occupy those regulatory regions chosen in our models as most predictive of gene expression. Conclusion: Our methods have identified a significantly reduced set of regulatory sites that are enriched in cancer somatic mutations and are more predictive of gene expression. This has significance for the mechanistic interpretation of cancer mutations, and the understanding of genetic regulation

Crossref

Springer - Publisher Connector

PubMed Central

White Rose Research Online

FigShare

The Escherichia coli transcriptome mostly consists of independently regulated modules

Author: A Anand
A Biton
A Delorme
A Frigyesi
A Hyvärinen
A Santos-Zavaleta
A-M Martoglio
AE Teschendorff
B Dalrymple
B Langmead
B-K Cho
B-K Cho
BM Bolstad
C Vijayendran
CL Turnbough Jr
D Kim
D Marbach
D Risso
D-S Huang
DS Latchman
E Nudler
EJ O’Brien
ENCODE Project Consortium.
ER Gansner
F Pedregosa
GI Guzmán
GI Guzmán
H Zou
HS Rhee
I Kristoficova
IM Keseler
J Pouyssegur
J Utrilla
JE Galagan
JJ Faith
JM Buescher
JM Engreitz
JM Monk
JT Leek
K Valgepea
K-K Yan
KF Jensen
KJ Karczewski
L Wang
M Ester
M Kim
M Lawrence
M Moretto
M Scott
M Scott
MB Gerstein
MI Love
NE Lewis
O Alter
P Chiappetta
P Comon
PR Subbarayan
PV Phaneuf
R De Smet
R Kolter
RA LaCroix
RB D’agostino
S Gama-Castro
S Lin
SJ Larsen
SW Seo
T Baba
T Barrett
TM Henkin
W Kong
W Liebermeister
W Saelens
X Zhang
Xin Fang
XW Zhang
Y Gao
Y Yamanaka
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Underlying cellular responses is a transcriptional regulatory network (TRN) that modulates gene expression. A useful description of the TRN would decompose the transcriptome into targeted effects of individual transcriptional regulators. Here, we apply unsupervised machine learning to a diverse compendium of over 250 high-quality Escherichia coli RNA-seq datasets to identify 92 statistically independent signals that modulate the expression of specific gene sets. We show that 61 of these transcriptomic signals represent the effects of currently characterized transcriptional regulators. Condition-specific activation of signals is validated by exposure of E. coli to new environmental conditions. The resulting decomposition of the transcriptome provides: a mechanistic, systems-level, network-based explanation of responses to environmental and genetic perturbations; a guide to gene and regulator function discovery; and a basis for characterizing transcriptomic differences in multiple strains. Taken together, our results show that signal summation describes the composition of a model prokaryotic transcriptome

Crossref

ScholarWorks@UNIST

eScholarship - University of California

Online Research Database In Technology

Defining genes: a computational framework

Author: BO Palsson
Christian V. Forst
CV Forst
D Karolchik
David C. Krakauer
E Dicou
E Pennisi
G Berry
H Pearson
I Brigandt
JD Walton
K Scherrer
L Duret
MB Gerstein
MD Laubichler
MM Krem
Peter F. Stadler
RG Taylor
S Griffiths-Jones
SJ Prohaska
Sonja J. Prohaska
TR Gingeras
TS Furey
Y Tohsato
Publication venue: Springer-Verlag
Publication date: 01/01/2009
Field of study

The precise elucidation of the gene concept has become the subject of intense discussion in light of results from several, large high-throughput surveys of transcriptomes and proteomes. In previous work, we proposed an approach for constructing gene concepts that combines genomic heritability with elements of function. Here, we introduce a definition of the gene within a computational framework of cellular interactions. The definition seeks to satisfy the practical requirements imposed by annotation, capture logical aspects of regulation, and encompass the evolutionary property of homology

Crossref

Springer - Publisher Connector

Fraunhofer-ePrints

PubMed Central

Attention-dependent modulation of cortical taste circuits revealed by granger causality with signal-dependent noise

Author: A Roebroeck
A Roebroeck
A Roebroeck
AF Rossi
C McCabe
CJ McAdams
CM Hafner
CM Harris
CWJ Granger
DA Handwerker
DL Collins
DL Knepp
DM Beck
DM Small
DR Gitelman
E Todorov
Edmund T. Rolls
ET Rolls
ET Rolls
ET Rolls
ET Rolls
ET Rolls
F Grabenhorst
F Grabenhorst
F Grabenhorst
F Grabenhorst
F Grabenhorst
F Kouneiher
F Kruggel
Fabian Grabenhorst
FX Diebold
G Deco
G Deco
G Deshpande
GK Aguirre
GL Gerstein
IET de Araujo
IET de Araujo
IET de Araujo
J O'doherty
J Yacubian
JA Gottfried
JB Mandeville
JB Nitschke
JC Rajapakse
JF Geweke
JF Geweke
Jianfeng Feng
JL Wilson
JP Hamilton
K Friston
K Hwang
K Sakai
K Sakai
KJ Friston
KJ Friston
KJ Friston
KJ Friston
KJ Friston
KJ Friston
L Baccalá
L Barnett
L Barnett
L Haase
LPJ Selen
M Kamiński
MA Schoenfeld
MB Schippers
MG Veldhuizen
ML Kringelbach
O David
O David
O Yamashita
Olaf Sporns
PA Valdes-Sosa
Q Luo
Qiang Luo
R Desimone
R Engle
R Engle
R Goebel
RB Buxton
RF Engle
S Taylor
SL Bengtsson
SL Bressler
T Bollerslev
T Bollerslev
T Fawcett
T Ge
T Ge
T Pantelidis
Tian Ge
X Wen
Y Baba
YM Hong
YW Cheung
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

We show, for the first time, that in cortical areas, for example the insular, orbitofrontal, and lateral prefrontal cortex, there is signal-dependent noise in the fMRI blood-oxygen level dependent (BOLD) time series, with the variance of the noise increasing approximately linearly with the square of the signal. Classical Granger causal models are based on autoregressive models with time invariant covariance structure, and thus do not take this signal-dependent noise into account. To address this limitation, here we describe a Granger causal model with signal-dependent noise, and a novel, likelihood ratio test for causal inferences. We apply this approach to the data from an fMRI study to investigate the source of the top-down attentional control of taste intensity and taste pleasantness processing. The Granger causality with signal-dependent noise analysis reveals effects not identified by classical Granger causal analysis. In particular, there is a top-down effect from the posterior lateral prefrontal cortex to the insular taste cortex during attention to intensity but not to pleasantness, and there is a top-down effect from the anterior and posterior lateral prefrontal cortex to the orbitofrontal cortex during attention to pleasantness but not to intensity. In addition, there is stronger forward effective connectivity from the insular taste cortex to the orbitofrontal cortex during attention to pleasantness than during attention to intensity. These findings indicate the importance of explicitly modeling signal-dependent noise in functional neuroimaging, and reveal some of the processes involved in a biased activation theory of selective attention

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository

Control of intestinal stem cell function and proliferation by mitochondrial pyruvate metabolism.

Author: A Ootani
A Ralston
A Roostaee
AG Muntean
Aimee Flores
B Ohlstein
B-Z Chen
C Moolenbeek
C Stringari
C Yang
CA Micchelli
CA Micchelli
Carl S. Thummel
CCP Aires
Christian S. Earl
Claire Bensard
D Dutta
D Karolchik
Dean Y. Li
DK Bricker
Don Delker
Dona R. Wisidagama
E Berger
F Cao
H Jiang
H Li
H Li
H Tateno
Heather R. Christofk
Helong Zhao
I Martínez-Reyes
J Benavides
J Wang
J Wang
James E. Cox
Jared Rutter
Jason Tanner
JC Schell
Jeffrey Mohlman
JL Golob
John C. Schell
JS Wu
K Birsoy
K Ito
K Nishino
K Weber
Kristofor A. Olson
KT Pate
KW McCool
LB Sullivan
Lei Jiang
Lise K. Sorensen
LK Boroughs
LR Edmunds
LR Gray
M Uhlen
M Uhlen
MA Keller
Mary P. Bronner
MB Gerstein
MJ Dailey
MJ Rodríguez-Colman
N Barker
N Buchon
NM Vacanti
O Warburg
PA Vigueira
Peng Wei
Priyanka Kanth
R Camarda
Ralph J. DeBerardinis
RB Flavell
Ren Miao
RW Daniels
S Beyaz
S Herzig
S Simmini
T Bender
T Sato
T Sato
T Sato
T Simsek
T. Cameron Waller
William E. Lowry
X Yin
X-M Wang
Y-Y Fan
Publication venue: eScholarship, University of California
Publication date: 01/09/2017
Field of study

Most differentiated cells convert glucose to pyruvate in the cytosol through glycolysis, followed by pyruvate oxidation in the mitochondria. These processes are linked by the mitochondrial pyruvate carrier (MPC), which is required for efficient mitochondrial pyruvate uptake. In contrast, proliferative cells, including many cancer and stem cells, perform glycolysis robustly but limit fractional mitochondrial pyruvate oxidation. We sought to understand the role this transition from glycolysis to pyruvate oxidation plays in stem cell maintenance and differentiation. Loss of the MPC in Lgr5-EGFP-positive stem cells, or treatment of intestinal organoids with an MPC inhibitor, increases proliferation and expands the stem cell compartment. Similarly, genetic deletion of the MPC in Drosophila intestinal stem cells also increases proliferation, whereas MPC overexpression suppresses stem cell proliferation. These data demonstrate that limiting mitochondrial pyruvate metabolism is necessary and sufficient to maintain the proliferation of intestinal stem cells

Crossref

eScholarship - University of California