251 research outputs found

KAAS: an automatic genome annotation and pathway reconstruction server

Author: Booth Benjamin W
Celniker Susan E
Hammonds Ann S
Park Soo
Wan Kenneth H
Yu Charles
Publication venue: Oxford University Press
Publication date: 01/01/2007

The number of complete and draft genomes has grown rapidly in recent years, and it has become increasingly important to automate the identification of functional properties and biological roles of genes in these genomes. In the KEGG database, genes in complete genomes are annotated with KEGG orthology (KO) identifiers, or K numbers, based on best-hit information using Smith–Waterman scores as well as manual curation. Each K number represents an ortholog group of genes and is directly linked to an object in the KEGG pathway map or the BRITE functional hierarchy. Here, we have developed a web-based server called KAAS (KEGG Automatic Annotation Server: http://www.genome.jp/kegg/kaas/), an implementation of a rapid method to automatically assign K numbers to genes in the genome, enabling reconstruction of KEGG pathways and BRITE hierarchies. The method is based on sequence similarities, bi-directional best hit information and some heuristics, and has achieved a high degree of accuracy when compared with the manually curated KEGG GENES database.
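
The bi-directional best hit step mentioned in this abstract can be illustrated with a minimal sketch. This is not the KAAS implementation: the abstract gives no scoring thresholds or heuristics, so the function names, the input layout (gene mapped to a list of hit/score pairs) and the K number strings below are all assumptions made for illustration.

```python
def best_hits(hits):
    """Map each gene to its single highest-scoring hit.

    `hits` is an assumed layout: gene -> list of (other_gene, score).
    """
    return {
        gene: max(targets, key=lambda pair: pair[1])[0]
        for gene, targets in hits.items()
        if targets
    }


def bidirectional_best_hits(forward, reverse):
    """Keep query/reference pairs that are each other's best hit."""
    best_forward = best_hits(forward)   # query gene -> best reference gene
    best_reverse = best_hits(reverse)   # reference gene -> best query gene
    return {
        query: ref
        for query, ref in best_forward.items()
        if best_reverse.get(ref) == query
    }


def assign_k_numbers(pairs, ko_of_reference):
    """Transfer the reference gene's K number to its paired query gene."""
    return {
        query: ko_of_reference[ref]
        for query, ref in pairs.items()
        if ref in ko_of_reference
    }


# Hypothetical similarity scores; gene names and K numbers are placeholders.
forward = {
    "geneA": [("ref1", 250.0), ("ref2", 90.0)],
    "geneB": [("ref2", 120.0)],
    "geneC": [("ref1", 80.0)],
}
reverse = {
    "ref1": [("geneA", 245.0), ("geneC", 80.0)],
    "ref2": [("geneA", 88.0), ("geneB", 118.0)],
}
ko_of_reference = {"ref1": "K00001", "ref2": "K00002"}

pairs = bidirectional_best_hits(forward, reverse)
print(assign_k_numbers(pairs, ko_of_reference))
```

In this toy input, geneC receives no K number because its best hit, ref1, has geneA as its own best hit, so the pair is not reciprocal.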

CiteSeerX

Crossref

PubMed Central

eScholarship - University of California

Systematic image-driven analysis of the spatial Drosophila embryonic expression landscape

Author: Ann S Hammonds
Broihier HT
Erwin Frise
Hartenstein V
Hartenstein V
Ju T
Kumar S
Peng H
Reuter R
Su MT
Susan E Celniker
Publication venue: Nature Publishing Group
Publication date: 01/01/2010

We created an innovative virtual representation for our large-scale Drosophila in situ expression dataset. We aligned an elliptically shaped mesh composed of small triangular regions to the outline of each embryo. Each triangle defines a unique location in the embryo, and comparing corresponding triangles allows easy identification of similar expression patterns. The virtual representation was used to organize the expression landscape at stages 4-6. We identified regions with similar expression in the embryo and clustered genes with similar expression patterns. We created algorithms to mine the dataset for adjacent non-overlapping patterns and anti-correlated patterns. We were able to mine the dataset to identify co-expressed and putative interacting genes. Using co-expression, we were able to assign putative functions to unknown genes.
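
To illustrate how comparing corresponding triangles can expose similar and anti-correlated patterns, here is a minimal sketch. It assumes each embryo has already been reduced to one expression value per mesh triangle, listed in the same triangle order, and it uses Pearson correlation as the similarity measure. The abstract does not state which measure the authors used, so that choice, the function name and the values are assumptions.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation between two equal-length per-triangle vectors."""
    if len(a) != len(b) or not a:
        raise ValueError("vectors must be non-empty and the same length")
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat pattern carries no spatial information
    return cov / sqrt(var_a * var_b)


# Hypothetical four-triangle meshes; real meshes have far more triangles.
anterior = [1.0, 1.0, 0.0, 0.0]
anterior_like = [0.9, 0.8, 0.1, 0.0]
posterior = [0.0, 0.0, 1.0, 1.0]

print(pearson(anterior, anterior_like))  # close to +1: similar patterns
print(pearson(anterior, posterior))      # close to -1: anti-correlated
```

A pairwise matrix of such scores could then feed a clustering step, with strongly negative scores flagging candidate anti-correlated gene pairs.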

Crossref

Directory of Open Access Journals

PubMed Central

The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective.

Author: Ashburner Michael
Bergman Casey M
Carlson Joseph
Celniker Susan E
Frise Erwin
Kaminker Joshua S
Kronmiller Brent
Lewis Suzanna E
Patel Sandeep
Rubin Gerald M
Svirskas Robert
Wheeler David A
Publication venue: Genome Biol
Publication date: 01/01/2002

BACKGROUND: Transposable elements are found in the genomes of nearly all eukaryotes. The recent completion of the Release 3 euchromatic genomic sequence of Drosophila melanogaster by the Berkeley Drosophila Genome Project has provided precise sequence for the repetitive elements in the Drosophila euchromatin. We have used this genomic sequence to describe the euchromatic transposable elements in the sequenced strain of this species. RESULTS: We identified 85 known and eight novel families of transposable element varying in copy number from one to 146. A total of 1,572 full and partial transposable elements were identified, comprising 3.86% of the sequence. More than two-thirds of the transposable elements are partial. The density of transposable elements increases an average of 4.7 times in the centromere-proximal regions of each of the major chromosome arms. We found that transposable elements are preferentially found outside genes; only 436 of 1,572 transposable elements are contained within the 61.4 Mb of sequence that is annotated as being transcribed. A large proportion of transposable elements is found nested within other elements of the same or different classes. Lastly, an analysis of structural variation from different families reveals distinct patterns of deletion for elements belonging to different classes. CONCLUSIONS: This analysis represents an initial characterization of the transposable elements in the Release 3 euchromatic genomic sequence of D. melanogaster for which comparison to the transposable elements of other organisms can begin to be made. These data have been made available on the Berkeley Drosophila Genome Project website for future analyses.
RIGHTS: This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may: copy, distribute, and display the work; make derivative works; or make commercial use of the work, under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.

CiteSeerX

Springer - Publisher Connector

PubMed Central

The University of Manchester - Institutional Repository

Apollo (Cambridge)

The transposable elements of the Drosophila melanogaster

Author: Ashburner Michael
Bergman Casey M
Carlson Joseph
Celniker Susan E
Frise Erwin
Kaminker Joshua S
Kronmiller Brent
Lewis Suzanna E
Patel Sandeep
Rubin Gerald M
Svirskas Robert
Wheeler David A
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 23/12/2002

Background: Transposable elements are found in the genomes of nearly all eukaryotes. The recent completion of the Release 3 euchromatic genomic sequence of Drosophila melanogaster by the Berkeley Drosophila Genome Project has provided precise sequence for the repetitive elements in the Drosophila euchromatin. We have used this genomic sequence to describe the euchromatic transposable elements in the sequenced strain of this species. Results: We identified 85 known and eight novel families of transposable element varying in copy number from one to 146. A total of 1,572 full and partial transposable elements were identified, comprising 3.86% of the sequence. More than two-thirds of the transposable elements are partial. The density of transposable elements increases an average of 4.7 times in the centromere-proximal regions of each of the major chromosome arms. We found that transposable elements are preferentially found outside genes; only 436 of 1,572 transposable elements are contained within the 61.4 Mb of sequence that is annotated as being transcribed. A large proportion of transposable elements is found nested within other elements of the same or different classes. Lastly, an analysis of structural variation from different families reveals distinct patterns of deletion for elements belonging to different classes. Conclusions: This analysis represents an initial characterization of the transposable elements in the Release 3 euchromatic genomic sequence of D. melanogaster for which comparison to the transposable elements of other organisms can begin to be made. These data have been made available on the Berkeley Drosophila Genome Project website for future analyses.

Apollo (Cambridge)

Functional Evolution of cis-Regulatory Modules at a Homeotic Gene in Drosophila

Author: Allen John M
Bae Esther
Bender Welcome W.
Celniker Susan E.
Drewell Robert A.
Fisher William W.
Goetz Sara E.
Ho Margaret C. W.
Johnsen Holly
Rau Christoph
Schiller Benjamin J.
Shur Andrey S.
Tran Diana A.
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 27/03/2011

It is a long-held belief in evolutionary biology that the rate of molecular evolution for a given DNA sequence is inversely related to the level of functional constraint. This belief holds true for the protein-coding homeotic (Hox) genes originally discovered in Drosophila melanogaster. Expression of the Hox genes in Drosophila embryos is essential for body patterning and is controlled by an extensive array of cis-regulatory modules (CRMs). How the regulatory modules functionally evolve in different species is not clear. A comparison of the CRMs for the Abdominal-B gene from different Drosophila species reveals relatively low levels of overall sequence conservation. However, embryonic enhancer CRMs from other Drosophila species direct transgenic reporter gene expression in the same spatial and temporal patterns during development as their D. melanogaster orthologs. Bioinformatic analysis reveals the presence of short conserved sequences within defined CRMs, representing gap and pair-rule transcription factor binding sites. One predicted binding site for the gap transcription factor KRUPPEL in the IAB5 CRM was found to be altered in Superabdominal (Sab) mutations. In Sab mutant flies, the third abdominal segment is transformed into a copy of the fifth abdominal segment. A model for KRUPPEL-mediated repression at this binding site is presented. These findings challenge our current understanding of the relationship between sequence evolution at the molecular level and functional activity of a CRM. While the overall sequence conservation at Drosophila CRMs is not distinctive from neighboring genomic regions, functionally critical transcription factor binding sites within embryonic enhancer CRMs are highly conserved. These results have implications for understanding mechanisms of gene expression during embryonic development, enhancer function, and the molecular evolution of eukaryotic regulatory modules

Harvard University - DASH

Saccharomyces Genome Database: the genomics resource of budding yeast

Author: B. C. Hitz
Barros
Bharadwaj
Borneman
C. Amundsen
C. J. Krieger
Carlson
Celniker
D. G. Fisk
E. D. Wong
E. L. Hong
E. T. Chan
G. Binkley
J. E. Hirschman
J. M. Cherry
J. Park
K. Karra
K. R. Christie
Kane
Lyne
M. C. Costanzo
M. S. Skrzypek
M. Simison
M ller
Naumov
Petranovic
R. Balakrishnan
R. S. Nash
Rinaldi
S. R. Engel
S. R. Miyasato
S. S. Dwight
S. Weng
Stein
Publication venue: Oxford University Press
Publication date

The Saccharomyces Genome Database (SGD, http://www.yeastgenome.org) is the community resource for the budding yeast Saccharomyces cerevisiae. The SGD project provides the highest-quality manually curated information from peer-reviewed literature. The experimental results reported in the literature are extracted and integrated within a well-developed database. These data are combined with quality high-throughput results and provided through Locus Summary pages, a powerful query engine and rich genome browser. The acquisition, integration and retrieval of these data allow SGD to facilitate experimental design and analysis by providing an encyclopedia of the yeast genome, its chromosomal features, their functions and interactions. Public access to these data is provided to researchers and educators via web pages designed for optimal ease of use

Crossref

PubMed Central

ENCODE whole-genome data in the UCSC Genome Browser

Author: A. Pohl
A. S. Hinrichs
A. S. Zweig
B. J. Raney
B. Rhead
Celniker
D. Haussler
D. Karolchik
G. P. Barber
K. E. Smith
K. Learned
K. R. Rosenbloom
L. R. Meyer
M. Pheasant
P. A. Fujita
R. M. Kuhn
T. R. Dreszer
T. Wang
The ENCODE Project Consortium
W. J. Kent
Weinstock
Publication venue: Oxford University Press
Publication date: 01/01/2010

The Encyclopedia of DNA Elements (ENCODE) project is an international consortium of investigators funded to analyze the human genome with the goal of producing a comprehensive catalog of functional elements. The ENCODE Data Coordination Center at The University of California, Santa Cruz (UCSC) is the primary repository for experimental results generated by ENCODE investigators. These results are captured in the UCSC Genome Bioinformatics database and download server for visualization and data mining via the UCSC Genome Browser and companion tools (Rhead et al. The UCSC Genome Browser Database: update 2010, in this issue). The ENCODE web portal at UCSC (http://encodeproject.org or http://genome.ucsc.edu/ENCODE) provides information about the ENCODE data and convenient links for access

Crossref

PubMed Central

University of Queensland eSpace

Heterochromatic sequences in a Drosophila whole-genome shotgun assembly

Author: Carlson Joseph W
Carvalho A Bernardo
Celniker Susan E
Halpern Aaron
Hoskins Roger A
Kaminker Joshua S
Karpen Gary H
Kennedy Cameron
Mungall Chris J
Myers Eugene W
Rubin Gerald M
Smith Christopher D
Sullivan Beth A
Sutton Granger G
Wakimoto Barbara T
Yasuhara Jiro C
Publication venue: BioMed Central
Publication date: 01/01/2002

BACKGROUND: Most eukaryotic genomes include a substantial repeat-rich fraction termed heterochromatin, which is concentrated in centric and telomeric regions. The repetitive nature of heterochromatic sequence makes it difficult to assemble and analyze. To better understand the heterochromatic component of the Drosophila melanogaster genome, we characterized and annotated portions of a whole-genome shotgun sequence assembly. RESULTS: WGS3, an improved whole-genome shotgun assembly, includes 20.7 Mb of draft-quality sequence not represented in the Release 3 sequence spanning the euchromatin. We annotated this sequence using the methods employed in the re-annotation of the Release 3 euchromatic sequence. This analysis predicted 297 protein-coding genes and six non-protein-coding genes, including known heterochromatic genes, and regions of similarity to known transposable elements. Bacterial artificial chromosome (BAC)-based fluorescence in situ hybridization analysis was used to correlate the genomic sequence with the cytogenetic map in order to refine the genomic definition of the centric heterochromatin; on the basis of our cytological definition, the annotated Release 3 euchromatic sequence extends into the centric heterochromatin on each chromosome arm. CONCLUSIONS: Whole-genome shotgun assembly produced a reliable draft-quality sequence of a significant part of the Drosophila heterochromatin. Annotation of this sequence defined the intron-exon structures of 30 known protein-coding genes and 267 protein-coding gene models. The cytogenetic mapping suggests that an additional 150 predicted genes are located in heterochromatin at the base of the Release 3 euchromatic sequence. Our analysis suggests strategies for improving the sequence and annotation of the heterochromatic portions of the Drosophila and other complex genomes

CiteSeerX

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

ENCODE whole-genome data in the UCSC genome browser (2011 update)

Author: Andy Pohl
Angie S. Hinrichs
Ann S. Zweig
Baroni
Bernard B. Suh
Birney
Brian J. Raney
Brooke Rhead
Celniker
Cricket A. Sloan
David Haussler
Donna Karolchik
Galt P. Barber
Greenbaum
Harrow
Hershey
Hesselberth
Hiram Clawson
Kan
Kate R. Rosenbloom
Katrina Learned
Kayla E. Smith
Kent
Khatun
King
Krishna M. Roskin
Kuhn
Laurence R. Meyer
Li
Melissa S. Cline
Pauline A. Fujita
Robert M. Kuhn
Rosenbloom
Timothy R. Dreszer
Vanessa Kirkup
Venkat S. Malladi
Via
W. James Kent
Weirauch
Publication venue: Oxford University Press
Publication date: 01/01/2010

The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (http://genome.ucsc.edu/). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC (http://encodeproject.org/) provides information about the ENCODE data and links for access

CiteSeerX

Crossref

PubMed Central