Search CORE

405 research outputs found

Assembly of body wall muscle and muscle cell attachment structures in Caenorhabditis elegans

Author: BD Williams
MC Hresko
RH Waterston
Publication venue: 'Rockefeller University Press'
Publication date
Field of study

Expedited batch processing and analysis of transposon insertions

Author: A Bohne
AL Price
BC Meyers
CM Bergman
CM Bergman
David A Ray
ES Lander
GD Schuler
JE Stajich
Jeremy D Smith
PL Deininger
RC Edgar
RC Edgar
RH Waterston
Sela
SF Altschul
Publication venue: BioMed Central
Publication date: 01/11/2011
Field of study

Abstract Background With advances in sequencing technology, greater and greater amounts of eukaryotic genome data are becoming available. Often, large portions of these genomes consist of transposable elements, frequently accounting for 50% or more in vertebrates. Each transposable element family may have thousands or tens of thousands of individual copies within a given genome, and therefore it can take an exorbitant amount of time and effort to process data in a meaningful fashion. Findings In order to combat this problem, we developed a set of bioinformatics techniques and programs to streamline the analysis. This includes a unique Perl script which automates the process of taking BLAST, Repeatmasker and similar data to extract and manipulate the hit sequences from the genome. This script, called Process_hits uses an object-oriented methodology to compile all hit locations from a given file for processing, organize this data into useable categories, and output it in multiple formats. Conclusions The program proved capable of handling large amounts of transposon data in an efficient fashion. It is equipped with a number of useful sub-functions, each of which is contained within its own sub-module to allow for greater expandability and as a foundation for future program design.</p

Crossref

Directory of Open Access Journals

PubMed Central

Melatonin Alters Age-Related Changes in Transcription Factors and Kinase Activation

Author: A Jagota
B Caballero
BP Rutten
CE Finch
D Lahiri
Debomoy K. Lahiri
DK Lahiri
E Head
E Sharman
EH Sharman
G Mayer
H Li
Huihui Li
Jason A. Bailey
JL Teeling
Jun Zhou
KG Sharman
KG Sharman
L Yang
M Griffiths
M Li
M Sánchez-Hidalgo
Meixia Wu
NL Sparkman
P Lemoine
R Sandhir
RH Waterston
S Denis-Donini
S Kaewsuk
SC Bondy
Stephen C. Bondy
V Vukic
VM Perreau
Y Marchalant
Y Uyanikgil
YH Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/06/2010
Field of study

Male mice were fed 40 ppm melatonin for 2 months prior to sacrifice at age 26 months, and compared with both 26 and 4 month-old untreated controls. The nuclear translocation of NF-κB increased with age in both brain and spleen and this was reversed by melatonin only in brain. Another transcription factor, AP-1 was increased with age in the spleen and not in brain and this could be blocked by melatonin treatment. The fraction of the active relative to the inactive form of several enabling kinases was compared. The proportion of activated ERK was elevated with age in brain and spleen but this change was unresponsive to melatonin. A similar age-related increase in glial fibrillary acidic protein (GFAP) was also refractory to melatonin treatment. The cerebral melatonin M1 receptor decreased with age in brain but increased in spleen. The potentially beneficial nature of melatonin for the preservation of brain function with aging was suggested by the finding that an age-related decline in cortical synaptophysin levels was prevented by dietary melatonin

Crossref

IUPUIScholarWorks

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

High-throughput sequencing of the DBA/2J mouse genome

Author: Daniel C Ciobanu
Deanna M Church
Donald B Thomason
JL Peirce
John A Capra
Katherine S Pollard
Khyobeni Mozhui
Lu Lu
Megan K Mulligan
RH Waterston
Richa Agarwala
Robert W Williams
Stanley F Nelson
Williams L Taylor
Xusheng Wang
Zhengsheng Li
Zugen Chen
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

The DBA/2J mouse is not only the oldest inbred strain, but also one of the most widely used strains. DBA/2J exhibits many unique anatomical, physiological, and behavior traits. In addition, DBA/2J is one parent of the large BXD family of recombinant inbred strains [1]. The genome of the other parent of this BXD family— C57BL/6J—has been sequenced and serves as the mouse reference genome [2]. We sequenced the genome of DBA/2J using SOLiD and Illumina high throughput short read protocols to generate a comprehensive set of ~5 million sequence variants segregating in the BXD family that ultimately cause developmental, anatomical, functional and behavioral differences among these 80+ strains

Crossref

DigitalCommons@University of Nebraska

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

GeneWaltz--A new method for reducing the false positives of gene finding

Author: AG Clark
AL Hughes
C Burge
ES Lander
G Parra
I Korf
IM Meyer
IM Meyer
J Shendure
J Wang
JC Venter
K Misawa
Kazuharu Misawa
L Stein
L Zhang
M Stanke
MR Brent
Reiko F Kikuno
RH Waterston
S Karlin
SF Altschul
SJ Jones
W Makalowski
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Identifying protein-coding regions in genomic sequences is an essential step in genome analysis. It is well known that the proportion of false positives among genes predicted by current methods is high, especially when the exons are short. These false positives are problematic because they waste time and resources of experimental studies. Methods We developed GeneWaltz, a new filtering method that reduces the risk of false positives in gene finding. GeneWaltz utilizes a codon-to-codon substitution matrix that was constructed by comparing protein-coding regions from orthologous gene pairs between mouse and human genomes. Using this matrix, a scoring scheme was developed; it assigned higher scores to coding regions and lower scores to non-coding regions. The regions with high scores were considered candidate coding regions. One-dimensional Karlin-Altschul statistics was used to test the significance of the coding regions identified by GeneWaltz. Results The proportion of false positives among genes predicted by GENSCAN and Twinscan were high, especially when the exons were short. GeneWaltz significantly reduced the ratio of false positives to all positives predicted by GENSCAN and Twinscan, especially when the exons were short. Conclusions GeneWaltz will be helpful in experimental genomic studies. GeneWaltz binaries and the matrix are available online at <url>http://en.sourceforge.jp/projects/genewaltz/</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Using ESTs to improve the accuracy of de novo gene prediction

Author: A Krogh
AA Salamov
AC Siepel
C Wei
Chaochun Wei
DR Maglott
E Birney
I Korf
JE Allen
JE Allen
KD Pruitt
KD Pruitt
KL Howe
L Stein
LW Hillier
M Stanke
MG Reese
Michael R Brent
MJ van Baren
MR Brent
MS Boguski
P Flicek
R Guigo
R Guigó
R Mott
RA Gibbs
RH Brown
RH Waterston
S Foissac
SS Gross
The MGC Project Team
TW Harris
TW Harris
VV Solovyev
WJ Kent
Publication venue: BioMed Central
Publication date: 01/07/2006
Field of study

BACKGROUND: ESTs are a tremendous resource for determining the exon-intron structures of genes, but even extensive EST sequencing tends to leave many exons and genes untouched. Gene prediction systems based exclusively on EST alignments miss these exons and genes, leading to poor sensitivity. De novo gene prediction systems, which ignore ESTs in favor of genomic sequence, can predict such "untouched" exons, but they are less accurate when predicting exons to which ESTs align. TWINSCAN is the most accurate de novo gene finder available for nematodes and N-SCAN is the most accurate for mammals, as measured by exact CDS gene prediction and exact exon prediction. RESULTS: TWINSCAN_EST is a new system that successfully combines EST alignments with TWINSCAN. On the whole C. elegans genome TWINSCAN_EST shows 14% improvement in sensitivity and 13% in specificity in predicting exact gene structures compared to TWINSCAN without EST alignments. Not only are the structures revealed by EST alignments predicted correctly, but these also constrain the predictions without alignments, improving their accuracy. For the human genome, we used the same approach with N-SCAN, creating N-SCAN_EST. On the whole genome, N-SCAN_EST produced a 6% improvement in sensitivity and 1% in specificity of exact gene structure predictions compared to N-SCAN. CONCLUSION: TWINSCAN_EST and N-SCAN_EST are more accurate than TWINSCAN and N-SCAN, while retaining their ability to discover novel genes to which no ESTs align. Thus, we recommend using the EST versions of these programs to annotate any genome for which EST information is available. TWINSCAN_EST and N-SCAN_EST are part of the TWINSCAN open source software package

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

SNPs Occur in Regions with Less Genomic Sequence Conservation

Author: A Grimson
A Siepel
BP Lewis
CF Baer
CM Wade
D Chasman
E Pennisi
ES Lander
FW Allendorf
GE Crooks
H Zhang
Ilya Ruvinsky
J Stapley
JC Venter
John C. Castle
JV Chamary
K Chen
L Cartegni
M Lynch
M Stratton
MA Saunders
MP Miller
PA Morin
RH Waterston
RM Durbin
RM Kuhn
ST Sherry
V Matys
WG Fairbrother
Publication venue: Public Library of Science
Publication date: 06/06/2011
Field of study

Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3′ splice sites, 5′ splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the codon third position is dramatically seen when separating exonic SNP rates based on intron phase. At polyadenylation sites, a peak of conservation and low SNP rate occurs from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3′ UTR target sites are predicted incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionarily conservation

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Characteristics of transposable element exonization within human and mouse

Author: A Athanasiadis
A Corvelo
A Gerber
A Goren
A Levy
A Magen
A Nekrutenko
A Resch
AFA Smit
Agnes Hotz-Wagenblatt
B Giardine
B Mersch
BR Graveley
Britta Mersch
C Liu
D Karolchik
D Labuda
DD Kim
E Kim
ES Lander
EY Levanon
G Ast
G Lev-Maor
G Lev-Maor
Gil Ast
H Xie
Ilya Ruvinsky
J Hull
J Jurka
J Jurka
JO Kriegs
JO Yang
JP Nemes
K Nakabayashi
KP Kister
L Lin
L Lin
M Amit
M Blow
M Krull
M Moller-Krull
M Roy
M Sironi
M Sironi
MA Batzer
MD Koob
N Gal-Mark
N Gal-Mark
N Sela
NH Gehring
Noa Sela
O Ram
P Deininger
PL Deininger
R Cordaux
R Sorek
R Sorek
R Sorek
RA Gibbs
RE Mills
RH Waterston
RM Kuhn
RT Hillman
S He
S Schwartz
SK Ng
SS Singer
ST Sherry
T Kwan
T Kwan
VV Kapitonov
W Makalowski
WJ Kent
WL Chen
WS Lo
XH Zhang
Y Xing
YF Chang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/06/2010
Field of study

Insertion of transposed elements within mammalian genes is thought to be an important contributor to mammalian evolution and speciation. Insertion of transposed elements into introns can lead to their activation as alternatively spliced cassette exons, an event called exonization. Elucidation of the evolutionary constraints that have shaped fixation of transposed elements within human and mouse protein coding genes and subsequent exonization is important for understanding of how the exonization process has affected transcriptome and proteome complexities. Here we show that exonization of transposed elements is biased towards the beginning of the coding sequence in both human and mouse genes. Analysis of single nucleotide polymorphisms (SNPs) revealed that exonization of transposed elements can be population-specific, implying that exonizations may enhance divergence and lead to speciation. SNP density analysis revealed differences between Alu and other transposed elements. Finally, we identified cases of primate-specific Alu elements that depend on RNA editing for their exonization. These results shed light on TE fixation and the exonization process within human and mouse genes.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Genome-Wide Survey for Biologically Functional Pseudogenes

Author: Graur D Shuali Y, Li
Jens Lagergren
Lars Arvestad
Orr HT Chung MY, Banfi S, Kwiatkowski TJ Jr, Servadio A, et al.
Waterston RH Lindblad-Toh K, Birney E, Rogers J, Abril JF, et al.
Yano Y Saito R, Yoshida N, Yoshiki A, Wynshaw-Boris A, et al.
Yitzhak Pilpel
Zheng D Zhang Z, Harrison P, Karro J, Carriero N, et al.
Örjan Svensson
Publication venue: Public Library of Science
Publication date: 01/01/2005
Field of study

According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, indeed, in vivo vital and also evolutionarily preserved, encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify to our knowledge the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human–mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i.e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function is retained since before the human–mouse species split, and also a larger group of primate-specific ones found from human–chimpanzee searches. Two processed sequences are notable, their conservation since the human–mouse split being as high as most protein-coding genes; one is derived from the protein Ataxin 7-like 3 (ATX7NL3), and one from the Spinocerebellar ataxia type 1 protein (ATX1). Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross-species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein-coding genes, we use standard methods, utilizing in-frame disablements, as well as a probabilistic filter based on Ka/Ks ratios

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Transcriptional Profiling Uncovers a Network of Cholesterol-Responsive Atherosclerosis Target Genes

Author: AC Nicholson
Alexander Kovacs
Anders Hamsten
B Efron
C Christoffersen
CL Grines
DA Hosack
E Stotz
ES Lander
HD Lieu
HT Ong
J Edqvist
J Tegner
J Tegner
JC Venter
Jesper Lundström
Jesper Tegnér
Johan Björkegren
Josefin Skogsberg
Marina Köhler
MM Veniant
N Hennuyer
P Libby
Peri Noori
RH Waterston
Roland Nilsson
S Schmidt
Shohreh Maleki
SK Basu
TG Redgrave
TT Tuomisto
W Cleveland
Wayne N. Frankel
Publication venue: Public Library of Science
Publication date: 01/03/2008
Field of study

Despite the well-documented effects of plasma lipid lowering regimes halting atherosclerosis lesion development and reducing morbidity and mortality of coronary artery disease and stroke, the transcriptional response in the atherosclerotic lesion mediating these beneficial effects has not yet been carefully investigated. We performed transcriptional profiling at 10-week intervals in atherosclerosis-prone mice with human-like hypercholesterolemia and a genetic switch to lower plasma lipoproteins (Ldlr−/−Apo100/100 Mttpflox/flox Mx1-Cre). Atherosclerotic lesions progressed slowly at first, then expanded rapidly, and plateaued after advanced lesions formed. Analysis of lesion expression profiles indicated that accumulation of lipid-poor macrophages reached a point that led to the rapid expansion phase with accelerated foam-cell formation and inflammation, an interpretation supported by lesion histology. Genetic lowering of plasma cholesterol (e.g., lipoproteins) at this point all together prevented the formation of advanced plaques and parallel transcriptional profiling of the atherosclerotic arterial wall identified 37 cholesterol-responsive genes mediating this effect. Validation by siRNA-inhibition in macrophages incubated with acetylated-LDL revealed a network of eight cholesterol-responsive atherosclerosis genes regulating cholesterol-ester accumulation. Taken together, we have identified a network of atherosclerosis genes that in response to plasma cholesterol-lowering prevents the formation of advanced plaques. This network should be of interest for the development of novel atherosclerosis therapies

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central