Search CORE

Public Library of Science (PLOS)

The G Protein–Coupled Receptor Subset of the Chicken Genome

Author: Anders R Hellström
David E Gloriam
Helgi B Schiöth
International Human Genome Sequencing Consortium
Malin C Lagerström
Philip Bourne
Robert Fredriksson
Thomas P Larsson
Publication venue: Public Library of Science
Publication date: 01/01/2006
Field of study

G protein–coupled receptors (GPCRs) are one of the largest families of proteins, and here we scan the recently sequenced chicken genome for GPCRs. We use a homology-based approach, utilizing comparisons with all human GPCRs, to detect and verify chicken GPCRs from translated genomic alignments and Genscan predictions. We present 557 manually curated sequences for GPCRs from the chicken genome, of which 455 were previously not annotated. More than 60% of the chicken Genscan gene predictions with a human ortholog needed curation, which drastically changed the average percentage identity between the human–chicken orthologous pairs (from 56.3% to 72.9%). Of the non-olfactory chicken GPCRs, 79% had a one-to-one orthologous relationship to a human GPCR. The Frizzled, Secretin, and subgroups of the Rhodopsin families have high proportions of orthologous pairs, although the percentage of amino acid identity varies. Other groups show large differences, such as the Adhesion family and GPCRs that bind exogenous ligands. The chicken has only three bitter Taste 2 receptors, and it also lacks an ortholog to human TAS1R2 (one of three GPCRs in the human genome in the Taste 1 receptor family [TAS1R]), implying that the chicken's ability and mode of detecting both bitter and sweet taste may differ from the human's. The chicken genome contains at least 229 olfactory receptors, and the majority of these (218) originate from a chicken-specific expansion. To our knowledge, this dataset of chicken GPCRs is the largest curated dataset from a single gene family from a non-mammalian vertebrate. Both the updated human GPCR dataset, as well the chicken GPCR dataset, are available for download

CiteSeerX

Public Library of Science (PLOS)

The Influence of Recombination on Human Genetic Diversity

Author: Bernard Silverman
Chris C. A Spencer
David Bentley
Gil McVean
International Human Genome Sequencing Consortium
Jeffrey D Wall
Jim Mullikin
Panos Deloukas
Peter Donnelly
Sarah Hunt
Simon Myers
The International HapMap Consortium
Publication venue: Public Library of Science
Publication date: 01/01/2005
Field of study

In humans, the rate of recombination, as measured on the megabase scale, is positively associated with the level of genetic variation, as measured at the genic scale. Despite considerable debate, it is not clear whether these factors are causally linked or, if they are, whether this is driven by the repeated action of adaptive evolution or molecular processes such as double-strand break formation and mismatch repair. We introduce three innovations to the analysis of recombination and diversity: fine-scale genetic maps estimated from genotype experiments that identify recombination hotspots at the kilobase scale, analysis of an entire human chromosome, and the use of wavelet techniques to identify correlations acting at different scales. We show that recombination influences genetic diversity only at the level of recombination hotspots. Hotspots are also associated with local increases in GC content and the relative frequency of GC-increasing mutations but have no effect on substitution rates. Broad-scale association between recombination and diversity is explained through covariance of both factors with base composition. To our knowledge, these results are the first evidence of a direct and local influence of recombination hotspots on genetic variation and the fate of individual mutations. However, that hotspots have no influence on substitution rates suggests that they are too ephemeral on an evolutionary time scale to have a strong influence on broader scale patterns of base composition and long-term molecular evolution

Oxford University Research Archive

Genome-wide comparison of Asian and African rice reveals high recent activity of DNA transposons

Author: AH Paterson
AJ Hartlerode
B Edlinger
B Piegu
C Feschotte
F Sabot
G Yang
G Yang
G Yang
GTH Vu
International Human Genome Sequencing Consortium
International Rice Genome Sequencing Project
JM Richardson
JP Buchmann
K Fujino
K Kikuchi
L Duret
LS Symington
M Kimura
M Wang
N Jiang
P Cao
P SanMiguel
R Kalendar
RH Plasterk
S Moon
S Ouyang
T Nakazaki
T Wicker
T Wicker
T Wicker
T Wicker
TD Wu
TE Bureau
TE Bureau
The International Brachypodium Initiative
V Robert
WR Engels
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes

BACKGROUND: RNA metabolism, through 'combinatorial splicing', can generate enormous structural diversity in the proteome. Alternative domains may interact, however, with unpredictable phenotypic consequences, necessitating integrated RNA-level regulation of molecular composition. Splicing correlations within transcripts of single genes provide valuable clues to functional relationships among molecular domains as well as genomic targets for higher-order splicing regulation. RESULTS: We present tools to visualize complex splicing patterns in full-length cDNA libraries. Developmental changes in pair-wise correlations are presented vectorially in 'clock plots' and linkage grids. Higher-order correlations are assessed statistically through Monte Carlo analysis of a log-linear model with an empirical-Bayes estimate of the true probabilities of observed and unobserved splice forms. Log-linear coefficients are visualized in a 'spliceprint,' a signature of splice correlations in the transcriptome. We present two novel metrics: the linkage change index, which measures the directional change in pair-wise correlation with tissue differentiation, and the accuracy index, a very simple goodness-of-fit metric that is more sensitive than the integrated squared error when applied to sparsely populated tables, and unlike chi-square, does not diverge at low variance. Considerable attention is given to sparse contingency tables, which are inherent to single-gene libraries. CONCLUSION: Patterns of splicing correlations are revealed, which span a broad range of interaction order and change in development. The methods have a broad scope of applicability, beyond the single gene – including, for example, multiple gene interactions in the complete transcriptome

Collection Of Biostatistics Research Archive

A comparison of genomic copy number calls by Partek Genomics Suite, Genotyping Console and Birdsuite algorithms to quantitative PCR

Author: A Alonso
A Piotrowski
AB Olshen
AE Dellinger
AJ Iafrate
B Schuster-Böckler
B Xu
BE Stranger
Britney L Grayson
DL Bruno
EG Bochukova
H Fiegler
International Human Genome Sequencing C
J Sebat
JM Korn
JR Lupski
K Han
K Wang
L Forer
MK Rudd
NP Carter
P Cahan
P Hupe
S Colella
SA McCarroll
SC Greenway
SE Baranzini
The International SNP Map WG
Thomas M Aune
TL Yang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Copy number variants are >1 kb genomic amplifications or deletions that can be identified using array platforms. However, arrays produce substantial background noise that contributes to high false discovery rates of variants. We hypothesized that quantitative PCR could finitely determine copy number and assess the validity of calling algorithms. Results Using data from 29 Affymetrix SNP 6.0 arrays, we determined copy numbers using three programs: Partek Genomics Suite, Affymetrix Genotyping Console 2.0 and Birdsuite. We compared array calls at 25 chromosomal regions to those determined by qPCR and found nearly identical calls in regions of copy number 2. Conversely, agreement differed in regions called variant by at least one method. The highest overall agreement in calls, 91%, was between Birdsuite and quantitative PCR. Partek Genomics Suite calls agreed with quantitative PCR 76% of the time while the agreement of Affymetrix Genotyping Console 2.0 with quantitative PCR was 79%. Conclusions In 38 independent samples, 96% of Birdsuite calls agreed with quantitative PCR. Analysis of three copy number calling programs and quantitative PCR showed Birdsuite to have the greatest agreement with quantitative PCR.</p

The radial arrangement of the human chromosome 7 in the lymphocyte cell nucleus is associated with chromosomal band gene density

Author: A Bolzer
A Ono
AM Boutanaev
B Dutrillaux
BA Boggs
C Federico
C Federico
C Federico
C Federico
C Federico
Catia Daniela Cantarella
CM Clemson
Concetta Federico
D Zink
E Lukasova
ED Andrulis
EV Volpi
G Bernardi
G D’Onofrio
H Tanabe
H Tanabe
HA Foster
I Solovei
IHGSC (International Human Genome Sequencing Consortium)
J Ferreira
J Strouboulis
J Zhou
JA Croft
JJ Roix
JM Bridger
K Kupper
KE Brown
L Andreozzi
M Cockell
M Costantini
M Neusser
N Gilbert
N Sadoni
NV Petrova
Patrizia Di Mare
PS Masny
S Boyle
S D’Antoni
S Saccone
S Saccone
S Saccone
S Saccone
Sabrina Tosi
Salvatore Saccone
SW Scherer
T Cremer
TS Furey
U Francke
WA Bickmore
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/04/2008
Field of study

This is the author's accepted manuscript. The final published article is available from the link below. Copyright @ Springer-Verlag 2008.In the nuclei of human lymphocytes, chromosome territories are distributed according to the average gene density of each chromosome. However, chromosomes are very heterogeneous in size and base composition, and can contain both very gene-dense and very gene-poor regions. Thus, a precise analysis of chromosome organisation in the nuclei should consider also the distribution of DNA belonging to the chromosomal bands in each chromosome. To improve our understanding of the chromatin organisation, we localised chromosome 7 DNA regions, endowed with different gene densities, in the nuclei of human lymphocytes. Our results showed that this chromosome in cell nuclei is arranged radially with the gene-dense/GC-richest regions exposed towards the nuclear interior and the gene-poorest/GC-poorest ones located at the nuclear periphery. Moreover, we found that chromatin fibres from the 7p22.3 and the 7q22.1 bands are not confined to the territory of the bulk of this chromosome, protruding towards the inner part of the nucleus. Overall, our work demonstrates the radial arrangement of the territory of chromosome 7 in the lymphocyte nucleus and confirms that human genes occupy specific radial positions, presumably to enhance intra- and inter-chromosomal interaction among loci displaying a similar expression pattern, and/or similar replication timing

Brunel University Research Archive

Social and ethical checkpoints for bottom-up synthetic biology, or protocells

Author: A Levskaya
A Pottage
Brigitte Hantsche-Tangen
C Cranor
C Lartigue
D Endy
D-K Ro
DG Gibson
DY Zhang
E Andrianantoandro
Emily C. Parke
F Fukuyama
G Orive
HO Smith
International Human Genome Sequencing Consortium
J Boldt
J Kim
JC Venter
JW Szostak
L Serrano
MA Bedau
MA O’Malley
Mark A. Bedau
MK Cho
ML Simpson
PEM Purnick
PL Luisi
S Rasmussen
T Shinji
TA Lincoln
Uwe Tangen
VJJ Martin
Publication venue: Springer Netherlands
Publication date: 01/01/2009
Field of study

An alternative to creating novel organisms through the traditional “top-down” approach to synthetic biology involves creating them from the “bottom up” by assembling them from non-living components; the products of this approach are called “protocells.” In this paper we describe how bottom-up and top-down synthetic biology differ, review the current state of protocell research and development, and examine the unique ethical, social, and regulatory issues raised by bottom-up synthetic biology. Protocells have not yet been developed, but many expect this to happen within the next five to ten years. Accordingly, we identify six key checkpoints in protocell development at which particular attention should be given to specific ethical, social and regulatory issues concerning bottom-up synthetic biology, and make ten recommendations for responsible protocell science that are tied to the achievement of these checkpoints

University of Southern Denmark Research Output

Aspects of coverage in medical DNA sequencing

Author: C elegans Sequencing Consortium
DA Wheeler
DR Bentley
E Check
ER Mardis
ES Lander
F Sanger
GD Smith
GI Barenblatt
HE Robbins
International Human Genome Sequencing Consortium
J Glaz
J Kling
K Chen
K Virtaneva
L Clarke
L Holst
LW Hillier
M Cammarata
MC Wendl
MC Wendl
MC Wendl
MC Wendl
Michael C Wendl
N Whiteford
NE Breslow
P Nicolaidis
PC Ma
R Wilson
RF Service
Richard K Wilson
RK Wilson
RL Strausberg
RL Warren
S Levy
T Sjöblom
T Wicker
TJ Ley
V Rand
W Feller
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background DNA sequencing is now emerging as an important component in biomedical studies of diseases like cancer. Short-read, highly parallel sequencing instruments are expected to be used heavily for such projects, but many design specifications have yet to be conclusively established. Perhaps the most fundamental of these is the redundancy required to detect sequence variations, which bears directly upon genomic coverage and the consequent resolving power for discerning somatic mutations. Results We address the medical sequencing coverage problem via an extension of the standard mathematical theory of haploid coverage. The expected diploid multi-fold coverage, as well as its generalization for aneuploidy are derived and these expressions can be readily evaluated for any project. The resulting theory is used as a scaling law to calibrate performance to that of standard BAC sequencing at 8× to 10× redundancy, i.e. for expected coverages that exceed 99% of the unique sequence. A differential strategy is formalized for tumor/normal studies wherein tumor samples are sequenced more deeply than normal ones. In particular, both tumor alleles should be detected at least twice, while both normal alleles are detected at least once. Our theory predicts these requirements can be met for tumor and normal redundancies of approximately 26× and 21×, respectively. We explain why these values do not differ by a factor of 2, as might intuitively be expected. Future technology developments should prompt even deeper sequencing of tumors, but the 21× value for normal samples is essentially a constant. Conclusion Given the assumptions of standard coverage theory, our model gives pragmatic estimates for required redundancy. The differential strategy should be an efficient means of identifying potential somatic mutations for further study.</p

Public Library of Science (PLOS)

Digital Commons@Becker

The Diploid Genome Sequence of an Individual Human

Presented here is a genome sequence of an individual human. It was produced from ∼32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2–206 bp), 292,102 heterozygous insertion/deletion events (indels)(1–571 bp), 559,473 homozygous indels (1–82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information

CiteSeerX