Search CORE

9,421 research outputs found

Non-coding sequence retrieval system for comparative genomic analysis of gene regulatory elements

Author: Cai Li
Doh Sung Tae
Temple Matthew H
Zhang Yunyu
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Completion of the human genome sequence along with other species allows for greater understanding of the biochemical mechanisms and processes that govern healthy as well as diseased states. The large size of the genome sequences has made them difficult to study using traditional methods. There are many studies focusing on the protein coding sequences, however, not much is known about the function of non-coding regions of the genome. It has been demonstrated that parts of the non-coding region play a critical role as gene regulatory elements. Enhancers that regulate transcription processes have been found in intergenic regions. Furthermore, it is observed that regulatory elements found in non-coding regions are highly conserved across different species. However, the analysis of these regulatory elements is not as straightforward as it may first seem. The development of a centralized resource that allows for the quick and easy retrieval of non-coding sequences from multiple species and is capable of handing multi-gene queries is critical for the analysis of non-coding sequences. Here we describe the development of a web-based non-coding sequence retrieval system. RESULTS: This paper presents a Non-Coding Sequences Retrieval System (NCSRS). The NCSRS is a web-based bioinformatics tool that performs fast and convenient retrieval of non-coding and coding sequences from multiple species related to a specific gene or set of genes. This tool has compiled resources from multiple sources into one easy to use and convenient web based interface. With no software installation necessary, the user needs only internet access to use this tool. CONCLUSION: The unique features of this tool will be very helpful for those studying gene regulatory elements that exist in non-coding regions. The web based application can be accessed on the internet at:

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

TranspoGene and microTranspoGene: transposed elements influence on the transcriptome of seven vertebrates and invertebrates

Author: Asaf Levy
Biemont
Borchert
Callinan
Clark
Consortium
Dagan
Deininger
Deininger
Gasteiger
Giardine
Gil Ast
Griffiths-Jones
Han
Hedges
Houwing
Johnson
Jordan
Jurka
Karolchik
Kent
Kim
Kim
Kuhn
Lander
Lev-Maor
Lippman
Lorenc
Makalowski
Martignetti
McKusick
Morgan
Noa Sela
Pasyukova
Piriyapongsa
Pruitt
Sayah
Sela
Smalheiser
Smalheiser
Sorek
Sorek
Thornburg
Waterston
Publication venue: 'Oxford University Press (OUP)'
Publication date: 21/11/2008
Field of study

Transposed elements (TEs) are mobile genetic sequences. During the evolution of eukaryotes TEs were inserted into active protein-coding genes, affecting gene structure, expression and splicing patterns, and protein sequences. Genomic insertions of TEs also led to creation and expression of new functional non-coding RNAs such as micro- RNAs. We have constructed the TranspoGene database, which covers TEs located inside proteincoding genes of seven species: human, mouse, chicken, zebrafish, fruit fly, nematode and sea squirt. TEs were classified according to location within the gene: proximal promoter TEs, exonized TEs (insertion within an intron that led to exon creation), exonic TEs (insertion into an existing exon) or intronic TEs. TranspoGene contains information regarding specific type and family of the TEs, genomic and mRNA location, sequence, supporting transcript accession and alignment to the TE consensus sequence. The database also contains host gene specific data: gene name, genomic location, Swiss-Prot and RefSeq accessions, diseases associated with the gene and splicing pattern. In addition, we created microTranspoGene: a database of human, mouse, zebrafish and nematode TEderived microRNAs. The TranspoGene and micro- TranspoGene databases can be used by researchers interested in the effect of TE insertion on the eukaryotic transcriptome

arXiv.org e-Print Archive

Crossref

PubMed Central

WormBase: a multi-species resource for nematode biology and genomics

Author: Antoshechkin Igor
Bastiani Carol
Bieri Tamberlyn
Blasiar Darin
Bradnam Keith
Chan Juancarlos
Chen Chao-Kung
Chen Nansheng
Chen Wen J.
Cunningham Fiona
Davis Paul
Durbin Richard
Harris Todd W.
Kenny Eimear
Kishore Ranjana
Lawson Daniel
Lee Raymond Y. N.
Müller Hans-Michael
Nakamura Cecilia
Ozersky Philip
Petcherski Andrei
Rogers Anthony
Sabo Aniko
Schwarz Erich M.
Spieth John
Stein Lincoln D.
Sternberg Paul W.
Tello-Ruiz Marcela
Van Auken Kimberly
Wang Qinghua
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2004
Field of study

WormBase (http://www.wormbase.org/) is the central data repository for information about Caenorhabditis elegans and related nematodes. As a model organism database, WormBase extends beyond the genomic sequence, integrating experimental results with extensively annotated views of the genome. The WormBase Consortium continues to expand the biological scope and utility of WormBase with the inclusion of large-scale genomic analyses, through active data and literature curation, through new analysis and visualization tools, and through refinement of the user interface. Over the past year, the nearly complete genomic sequence and comparative analyses of the closely related species Caenorhabditis briggsae have been integrated into WormBase, including gene predictions, ortholog assignments and a new synteny viewer to display the relationships between the two species. Extensive site-wide refinement of the user interface now provides quick access to the most frequently accessed resources and a consistent browsing experience across the site. Unified single-page views now provide complete summaries of commonly accessed entries like genes. These advances continue to increase the utility of WormBase for C.elegans researchers, as well as for those researchers exploring problems in functional and comparative genomics in the context of a powerful genetic system

CiteSeerX

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

Caltech Authors

MIPSPlantsDB—plant database resource for integrative and comparative plant genome research

Author: Gundlach Heidrun
Haase Dirk
Haberer Georg
Hindemitt Tobias
Klee Kathrin
Mayer Klaus F. X.
Noubibou Octave
Schoof Heiko
Spannagl Manuel
Yang Li
Publication venue: Oxford University Press
Publication date: 01/01/2007
Field of study

Genome-oriented plant research delivers rapidly increasing amount of plant genome data. Comprehensive and structured information resources are required to structure and communicate genome and associated analytical data for model organisms as well as for crops. The increase in available plant genomic data enables powerful comparative analysis and integrative approaches. PlantsDB aims to provide data and information resources for individual plant species and in addition to build a platform for integrative and comparative plant genome research. PlantsDB is constituted from genome databases for Arabidopsis, Medicago, Lotus, rice, maize and tomato. Complementary data resources for cis elements, repetive elements and extensive cross-species comparisons are implemented. The PlantsDB portal can be reached at

SpBase: the sea urchin genome database and web site

Author: A. Yuan
Altschul
Cai
Curwen
D. He
Davidson
Davidson
E. Davidson
Gonzalez
Havlak
Howard-Ashby
Howe
Littlewood
M. Samanta
M ller
Oliveri
Poustka
R. A. Cameron
Salamov
Samanta
Sea Urchin Genome Sequencing Consortium
Sodergren
Springer
Stein
Wei
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2008
Field of study

SpBase is a system of databases focused on the genomic information from sea urchins and related echinoderms. It is exposed to the public through a web site served with open source software (http://spbase.org/). The enterprise was undertaken to provide an easily used collection of information to directly support experimental work on these useful research models in cell and developmental biology. The information served from the databases emerges from the draft genomic sequence of the purple sea urchin, Strongylocentrotus purpuratus and includes sequence data and genomic resource descriptions for other members of the echinoderm clade which in total span 540 million years of evolutionary time. This version of the system contains two assemblies of the purple sea urchin genome, associated expressed sequences, gene annotations and accessory resources. Search mechanisms for the sequences and the gene annotations are provided. Because the system is maintained along with the Sea Urchin Genome resource, a database of sequenced clones is also provided

The Chlamydomonas genome project: A decade on

Author: Aksoy M
Blaby IK
Blaby-Haas CE
Dutcher S
Goodstein D
Grimwood J
Grossman A
Harris EH
Hom EFY
King S
Lopez D
Merchant SS
Porter M
Prochnik S
Schmutz J
Stanke M
Tourasse N
Umen J
Vallon O
Witman GB
Publication venue: eScholarship, University of California
Publication date: 01/10/2014
Field of study

The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis, and micronutrient homeostasis. Ten years since its genome project was initiated an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the omics era. Housed at Phytozome, the plant genomics portal of the Joint Genome Institute (JGI), the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of whole transcriptome sequencing (RNA-Seq) data. We present here the past, present, and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions, and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes

PubMed Central

eScholarship - University of California

TOUCAN 2: the all-inclusive open source workbench for regulatory sequence analysis

Author: Aerts Stein
de Martin Rainer
De Moor Bart
Mayer Herbert
Moreau Yves
Thijs Gert
Van Loo Peter
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

We present the second and improved release of the TOUCAN workbench for cis-regulatory sequence analysis. TOUCAN implements and integrates fast state-of-the-art methods and strategies in gene regulation bioinformatics, including algorithms for comparative genomics and for the detection of cis-regulatory modules. This second release of TOUCAN has become open source and thereby carries the potential to evolve rapidly. The main goal of TOUCAN is to allow a user to come to testable hypotheses regarding the regulation of a gene or of a set of co-regulated genes. TOUCAN can be launched from this location:

CiteSeerX

Crossref

PubMed Central

NemaFootPrinter: a web based software for the identification of conserved non-coding genome sequence regions between C. elegans and C. briggsae

Author: Cassata Giuseppe
Guffanti Alessandro
Morandi Paolo
Rambaldi Davide
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: NemaFootPrinter (Nematode Transcription Factor Scan Through Philogenetic Footprinting) is a web-based software for interactive identification of conserved, non-exonic DNA segments in the genomes of C. elegans and C. briggsae. It has been implemented according to the following project specifications: a) Automated identification of orthologous gene pairs. b) Interactive selection of the boundaries of the genes to be compared. c) Pairwise sequence comparison with a range of different methods. d) Identification of putative transcription factor binding sites on conserved, non-exonic DNA segments. RESULTS: Starting from a C. elegans or C. briggsae gene name or identifier, the software identifies the putative ortholog (if any), based on information derived from public nematode genome annotation databases. The investigator can then retrieve the genome DNA sequences of the two orthologous genes; visualize graphically the genes' intron/exon structure and the surrounding DNA regions; select, through an interactive graphical user interface, subsequences of the two gene regions. Using a bioinformatics toolbox (Blast2seq, Dotmatcher, Ssearch and connection to the rVista database) the investigator is able at the end of the procedure to identify and analyze significant sequences similarities, detecting the presence of transcription factor binding sites corresponding to the conserved segments. The software automatically masks exons. DISCUSSION: This software is intended as a practical and intuitive tool for the researchers interested in the identification of non-exonic conserved sequence segments between C. elegans and C. briggsae. These sequences may contain regulatory transcriptional elements since they are conserved between two related, but rapidly evolving genomes. This software also highlights the power of genome annotation databases when they are conceived as an open resource and the possibilities offered by seamless integration of different web services via the http protocol. Availability: the program is freely available a

Crossref

AIR Universita degli studi di Milano

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Genome-wide analysis of 30 -untranslated regions supports the existence of post-transcriptional regulons controlling gene expression in trypanosomes

Author: Agüero Fernan Gonzalo
Carmona Santiago Javier
de Gaudenzi Javier Gerardo
Frasch Alberto Carlos C.
Publication venue: 'PeerJ'
Publication date: 01/07/2013
Field of study

In eukaryotic cells, a group of messenger ribonucleic acids (mRNAs) encoding functionally interrelated proteins together with the trans-acting factors that coordinately modulate their expression is termed a post-transcriptional regulon, due to their partial analogy to a prokaryotic polycistron. This mRNA clustering is organized by sequence-specific RNA-binding proteins (RBPs) that bind cis-regulatory elements in the noncoding regions of genes, and mediates the synchronized control of their fate. These recognition motifs are often characterized by conserved sequences and/or RNA structures, and it is likely that various classes of cis-elements remain undiscovered. Current evidence suggests that RNA regulons govern gene expression in trypanosomes, unicellular parasites which mainly use post-transcriptional mechanisms to control protein synthesis. In this study, we used motif discovery tools to test whether groups of functionally related trypanosomatid genes contain a common cis-regulatory element. We obtained conserved structured RNA motifs statistically enriched in the noncoding region of 38 out of 53 groups of metabolically related transcripts in comparison with a random control. These motifs have a hairpin loop structure, a preferred sense orientation and are located in close proximity to the open reading frames. We found that 15 out of these 38 groups represent unique motifs in which most 30 -UTR signature elements were group-specific. Two extensively studied Trypanosoma cruzi RBPs, TcUBP1 and TcRBP3 were found associated with a few candidate RNA regulons. Interestingly, 13 motifs showed a strong correlation with clusters of developmentally co-expressed genes and six RNA elements were enriched in gene clusters affected after hyperosmotic stress. Here we report a systematic genome-wide in silico screen to search for novel RNA-binding sites in transcripts, and describe an organized network of several coordinately regulated cohorts of mRNAs in T. cruzi. Moreover, we found that structured RNA elements are also conserved in other human pathogens. These results support a model of regulation of gene expression by multiple post-transcriptional regulons in trypanosomes.Fil: de Gaudenzi, Javier Gerardo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Instituto de Investigaciones Biotecnológicas "Dr. Raúl Alfonsín" (sede Chascomús). Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas. Instituto de Investigaciones Biotecnológicas "Dr. Raúl Alfonsín" (sede Chascomús); ArgentinaFil: Carmona, Santiago Javier. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Instituto de Investigaciones Biotecnológicas "Dr. Raúl Alfonsín" (sede Chascomús). Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas. Instituto de Investigaciones Biotecnológicas "Dr. Raúl Alfonsín" (sede Chascomús); ArgentinaFil: Agüero, Fernan Gonzalo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Instituto de Investigaciones Biotecnológicas "Dr. Raúl Alfonsín" (sede Chascomús). Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas. Instituto de Investigaciones Biotecnológicas "Dr. Raúl Alfonsín" (sede Chascomús); ArgentinaFil: Frasch, Alberto Carlos C.. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Instituto de Investigaciones Biotecnológicas "Dr. Raúl Alfonsín" (sede Chascomús). Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas. Instituto de Investigaciones Biotecnológicas "Dr. Raúl Alfonsín" (sede Chascomús); Argentin

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

CONICET Digital

Directory of Open Access Journals

PubMed Central