Search CORE

25 research outputs found

Improvements to GALA and dbERGE II: databases featuring genomic sequence alignment, annotation and experimental results

Author: Burhans Richard
Elnitski Laura
Giardine Belinda
Hardison Ross C.
Miller Webb
Riemer Cathy
Shah Prachi
Weirauch Matthew
Zhang Yi
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

We describe improvements to two databases that give access to information on genomic sequence similarities, functional elements in DNA and experimental results that demonstrate those functions. GALA, the database of Genome ALignments and Annotations, is now a set of interlinked relational databases for five vertebrate species, human, chimpanzee, mouse, rat and chicken. For each species, GALA records pairwise and multiple sequence alignments, scores derived from those alignments that reflect the likelihood of being under purifying selection or being a regulatory element, and extensive annotations such as genes, gene expression patterns and transcription factor binding sites. The user interface supports simple and complex queries, including operations such as subtraction and intersections as well as clustering and finding elements in proximity to features. dbERGE II, the database of Experimental Results on Gene Expression, contains experimental data from a variety of functional assays. Both databases are now run on the DB2 database management system. Improved hardware and tuning has reduced response times and increased querying capacity, while simplified query interfaces will help direct new users through the querying process. Links are available at http://www.bx.psu.edu/

Crossref

PubMed Central

Revealing mammalian evolutionary relationships by comparative analysis of gene clusters

Author: Abi-Rached
Akahoshi
Bailey
Benjamin Dickins
Birney
Cadavid
Cathy Riemer
Chen
Chih-Hao Hsu
Chiu
Colobran
Datta
Degenhardt
Dewey
Dufayard
Edwards
Eric D. Green
Fitch
Fitch
Fitch
Giltae Song
Gish
Gonzalez
Goodstadt
Graef
Guethlein
Guethlein
Han
Hardies
Hardison
Hardison
Hardison
Harris
Hie Lim Kim
Hoffmann
Hou
Hou
Hsu
Hsu
Hu
Huerta-Cepas
Jensen
Johnson
Kim
Kristensen
Lee
Levy
Li
Li
Lopez-Vazquez
Louxin Zhang
Margulies
Martin
Matsuya
Mi
Miyata
Muller
Murphy
NISC Comparative Sequencing Program
Opazo
Opazo
Ostlund
Ouzounis
Parham
Pianezza
Rajalingam
Ross C. Hardison
Sambrook
Shilling
Siepel
Smit
Song
Song
Song
Sonnhammer
Su
Tatusov
The ENCODE Project Consortium
Uchiyama
van der Heijden
Vilella
Wang
Wapinski
Waterhouse
Webb Miller
Wilson
Wilson
Woelk
Yu Zhang
Zhang
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2012
Field of study

Many software tools for comparative analysis of genomic sequence data have been released in recent decades. Despite this, it remains challenging to determine evolutionary relationships in gene clusters due to their complex histories involving duplications, deletions, inversions, and conversions. One concept describing these relationships is orthology. Orthologs derive from a common ancestor by speciation, in contrast to paralogs, which derive from duplication. Discriminating orthologs from paralogs is a necessary step in most multispecies sequence analyses, but doing so accurately is impeded by the occurrence of gene conversion events. We propose a refined method of orthology assignment based on two paradigms for interpreting its definition: by genomic context or by sequence content. X-orthology (based on context) traces orthology resulting from speciation and duplication only, while N-orthology (based on content) includes the influence of conversion events

Crossref

Nottingham Trent Institutional Repository (IRep)

PubMed Central

ScholarBank@NUS

Evaluation of methods for detecting conversion events in gene clusters

Author: A Siepel
A Siepel
C Hsu
C Spencer
C Strope
Cathy Riemer
Chih-Hao Hsu
D Husmeier
D Martin
D Martin
D Posada
E Holmes
G Hellenthal
Giltae Song
J Archer
J Archibald
J Chen
J Hein
J Huelsenbeck
J Kim
J Smith
J Stoye
K Lole
L Excoffier
L Liang
M Arenas
M Arenas
M Boni
M Gibbs
M Hasegawa
M Rosenberg
M Suchard
N Grassly
O Westesson
P Marjoram
R Cartwright
R Harris
R Hudson
S Pond
S Sawyer
S Schaffner
T Mailund
V Minin
W Miller
Webb Miller
Y Zhang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Background: Gene clusters are genetically important, but their analysis poses significant computational challenges. One of the major reasons for these difficulties is gene conversion among the duplicated regions of the cluster, which can obscure their true relationships. Many computational methods for detecting gene conversion events have been released, but their performance has not been assessed for wide deployment in evolutionary history studies due to a lack of accurate evaluation methods. Results: We designed a new method that simulates gene cluster evolution, including large-scale events of duplication, deletion, and conversion as well as small mutations. We used this simulation data to evaluate several different programs for detecting gene conversion events. Conclusions: Our evaluation identifies strengths and weaknesses of several methods for detecting gene conversion, which can contribute to more accurate analysis of gene cluster evolution

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Conversion events in gene clusters

Abstract Background Gene clusters containing multiple similar genomic regions in close proximity are of great interest for biomedical studies because of their associations with inherited diseases. However, such regions are difficult to analyze due to their structural complexity and their complicated evolutionary histories, reflecting a variety of large-scale mutational events. In particular, conversion events can mislead inferences about the relationships among these regions, as traced by traditional methods such as construction of phylogenetic trees or multi-species alignments. Results To correct the distorted information generated by such methods, we have developed an automated pipeline called CHAP (Cluster History Analysis Package) for detecting conversion events. We used this pipeline to analyze the conversion events that affected two well-studied gene clusters (α-globin and β-globin) and three gene clusters for which comparative sequence data were generated from seven primate species: CCL (chemokine ligand), IFN (interferon), and CYP2abf (part of cytochrome P450 family 2). CHAP is freely available at <url>http://www.bx.psu.edu/miller_lab</url>. Conclusions These studies reveal the value of characterizing conversion events in the context of studying gene clusters in complex genomes.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

Systematic documentation and analysis of human genetic variation in hemoglobinopathies using the microattribution approach

Author: A Nazli Basak
Adamantia Papachatzopoulou
Alain Francina
Alex E Felice
Barnaby Clark
Belinda Giardine
Belinda K Singleton
BK Singleton
BK Singleton
Branka Zukic
C Yu
Cathy Riemer
Claudia Wiemann
Cornelis L Harteveld
David H K Chui
David J Anstee
Donna Maglott
Douglas R Higgs
DP Heruth
DP Steensma
Emmanuel Kanavakis
Flavia C Costa
George P Patrinos
GP Patrinos
GP Patrinos
GP Patrinos
Halyna Fedosyuk
Henri Wajcman
I Amoyal
IF Fokkema
Iris Schrijver
J Borg
J Borg
J Xu
James D Hoyer
Jan Traeger-Synodinos
John Old
John S Waye
Joseph Borg
K Moradkhani
Kamran Moradkhani
Kenneth R Peterson
KR Peterson
L Arnaud
Lucia Perseu
M Siatecka
Maja Stojiljkovic
Manoussos N Papadakis
Marianthi Georgitsi
Martin Jarvis
MH Steinberg
Milena Radmilovic
MN Papadakis
Monica V E Gallivan
Panagoula Kollia
Paula Faustino
Petros Papadopoulos
Philippe Joly
Piero C Giordano
Q Ma
R Drissen
Ray Tully
RC Hardison
Renzo Galanello
Richard J Gibbons
RJ Gibbons
RJ Gibbons
RM Böhmer
Ross C Hardison
S Harju
S Harju-Baker
S Menzel
Sjaak Philipsen
SL Thein
Sonja Pavlovic
Stefania Satta
Stephan Menzel
Swee Lay Thein
Takahito Wada
V Viprakasit
VG Sankaran
Webb Miller
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

We developed a series of interrelated locus-specific databases to store all published and unpublished genetic variation related to hemoglobinopathies and thalassemia and implemented microattribution to encourage submission of unpublished observations of genetic variation to these public repositories. A total of 1,941 unique genetic variants in 37 genes, encoding globins and other erythroid proteins, are currently documented in these databases, with reciprocal attribution of microcitations to data contributors. Our project provides the first example of implementing microattribution to incentivise submission of all known genetic variation in a defined system. It has demonstrably increased the reporting of human variants, leading to a comprehensive online resource for systematically describing human genetic variation in the globin genes and other genes contributing to hemoglobinopathies and thalassemias. The principles established here will serve as a model for other systems and for the analysis of other common and/or complex human genetic diseases

Archivio istituzionale della ricerca - Università di Cagliari

Leiden University Scholary Publications

Erasmus University Digital Repository

Crossref

PubMed Central

Oxford University Research Archive

King's Research Portal

imagine

Repositório Científico do Instituto Nacional de Saúde

GALA, a Database for Genomic Sequence Alignments and Annotations

Author: Elnitski Laura
Giardine Belinda
Hardison Ross C.
Makalowska Izabela
Miller Webb
Riemer Cathy
Schwartz Scott
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/04/2003
Field of study

We have developed a relational database to contain whole genome sequence alignments between human and mouse with extensive annotations of the human sequence. Complex queries are supported on recorded features, both directly and on proximity among them. Searches can reveal a wide variety of relationships, such as finding all genes expressed in a designated tissue that have a highly conserved noncoding sequence 5′ to the start site. Other examples are finding single nucleotide polymorphisms that occur in conserved noncoding regions upstream of genes and identifying CpG islands that overlap the 5′ ends of divergently transcribed genes. The database is available online at http://globin.cse.psu.edu/ and http://bio.cse.psu.edu/

Crossref

PubMed Central

Improvements in the HbVar database of human hemoglobin variants and thalassemia mutations for population and sequence variation studies

Author: Anagnou Nicholas P.
Chui David H. K.
Giardine Belinda
Hardison Ross C.
Miller Webb
Patrinos George P.
Riemer Cathy
Wajcman Henri
Publication venue: Oxford University Press
Publication date: 01/01/2004
Field of study

HbVar (http://globin.cse.psu.edu/globin/hbvar/) is a relational database developed by a multi-center academic effort to provide up-to-date and high quality information on the genomic sequence changes leading to hemoglobin variants and all types of thalassemia and hemoglobinopathies. Extensive information is recorded for each variant and mutation, including sequence alterations, biochemical and hematological effects, associated pathology, ethnic occurrence and references. In addition to the regular updates to entries, we report two significant advances: (i) The frequencies for a large number of mutations causing β-thalassemia in at-risk populations have been extracted from the published literature and made available for the user to query upon. (ii) HbVar has been linked with the GALA (Genome Alignment and Annotation database, available at http://globin.cse.psu.edu/gala/) so that users can combine information on hemoglobin variants and thalassemia mutations with a wide spectrum of genomic data. It also expands the capacity to view and analyze the data, using tools within GALA and the University of California at Santa Cruz (UCSC) Genome Browser

Crossref

PubMed Central

EUR Research Repository

Erasmus University Digital Repository