172 research outputs found

To Know Ourselves

Author: The Human Genome Project
The U.S. Department of Energy
Publication venue: DigitalCommons@IMSA
Publication date: 01/07/1996

AT THE END OF THE ROAD in Little Cottonwood Canyon, near Salt Lake City, Alta is a place of near-mythic renown among skiers. In time it may well assume similar status among molecular geneticists. In December 1984, a conference there, co-sponsored by the U.S. Department of Energy, pondered a single question: Does modern DNA research offer a way of detecting tiny genetic mutations—and, in particular, of observing any increase in the mutation rate among the survivors of the Hiroshima and Nagasaki bombings and their descendants? In short the answer was, Not yet. But in an atmosphere of rare intellectual fertility, the seeds were sown for a project that would make such detection possible in the future—the Human Genome Project

Illinois Mathematics and Science Academy: DigitalCommons@IMSA

Robustness of Massively Parallel Sequencing Platforms

Author: Aksu Soner
Alkan Can
Güngör Tunga
Hach Faraz
Kavak Pınar
Kulekci M. Oguzhan
Sağıroğlu Mahmut Şamil
Turkish Human Genome Project
Yüksel Bayram
Şahinalp S. Cenk
Publication date: 01/01/2015

Improvements in high-throughput sequencing (HTS) technologies have made clinical sequencing projects such as ClinSeq and Genomics England feasible. Although the accuracy and reproducibility of HTS-based analyses have improved significantly, using these data for diagnostic and prognostic applications requires near-perfect data generation. To assess the robustness of a widely used HTS platform for accurate and reproducible clinical applications, we generated whole genome shotgun (WGS) sequence data from the genomes of two human individuals in two different genome sequencing centers. After analyzing the data to characterize SNPs and indels using the same tools (BWA, SAMtools, and GATK), we observed a significant number of discrepancies in the call sets. As expected, most of the disagreements between the call sets were found within genomic regions containing common repeats and segmental duplications, and only a small fraction of the discordant variants were within exons and other functionally relevant regions such as promoters. We conclude that although HTS platforms are sufficiently powerful to provide data for first-pass clinical tests, the variant predictions still need to be confirmed using orthogonal methods before use in clinical applications.
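The comparison of call sets described in this abstract can be illustrated with a minimal sketch. It assumes each variant is keyed by chromosome, position, reference allele and alternate allele; the function name `compare_call_sets`, the `Concordance` type and the toy data are hypothetical and are not taken from the study, which used BWA, SAMtools and GATK to produce the calls.

```python
from typing import NamedTuple, Set, Tuple

# A variant is identified here by (chromosome, 1-based position, ref allele, alt allele).
# This keying is an assumption made for illustration.
Variant = Tuple[str, int, str, str]


class Concordance(NamedTuple):
    shared: int     # variants called in both sets
    only_a: int     # variants called only in set A
    only_b: int     # variants called only in set B
    jaccard: float  # shared / union; 1.0 means identical call sets


def compare_call_sets(calls_a: Set[Variant], calls_b: Set[Variant]) -> Concordance:
    """Summarize agreement between two variant call sets."""
    shared = calls_a & calls_b
    union = calls_a | calls_b
    # Two empty call sets are treated as fully concordant.
    jaccard = len(shared) / len(union) if union else 1.0
    return Concordance(
        shared=len(shared),
        only_a=len(calls_a - calls_b),
        only_b=len(calls_b - calls_a),
        jaccard=jaccard,
    )


# Hypothetical toy data, not results from the paper.
center_1 = {("chr1", 100, "A", "G"), ("chr1", 250, "C", "T"), ("chr2", 40, "G", "GA")}
center_2 = {("chr1", 100, "A", "G"), ("chr2", 40, "G", "GA"), ("chr3", 7, "T", "C")}
result = compare_call_sets(center_1, center_2)
print(result)
```

In practice the discordant variants (`only_a` and `only_b`) would then be intersected with annotations such as repeats, segmental duplications and exons, which is the breakdown the abstract reports.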

Directory of Open Access Journals

Simon Fraser University Institutional Repository

PubMed Central

FigShare

Genomics, bio specimens, and other biological data: Current status and future directions

Author: AACR Project GENIE Consortium
Bombard
Burnet
Cancer Genome Atlas Research Network
Comprehensive molecular portraits of human breast tumours
Coyne
ENCODE Project Consortium
Ho
Iwakawa
Jochems
Kerns
Lander
Leek
Mayo
Rosenstein
Roychowdhury
Tym
West
West
Publication venue: 'Wiley'
Publication date: 01/10/2018

Peer Reviewed
https://deepblue.lib.umich.edu/bitstream/2027.42/146389/1/mp12912_am.pdf
https://deepblue.lib.umich.edu/bitstream/2027.42/146389/2/mp12912.pd

Crossref

IUPUIScholarWorks

Institute of Cancer Research Repository

Deep Blue Documents at the University of Michigan

Comparative studies of glycosylphosphatidylinositol-anchored high-density lipoprotein-binding protein 1: evidence for a eutherian mammalian origin for the GPIHBP1 gene from an LY6-like gene

Glycosylphosphatidylinositol-anchored high-density lipoprotein-binding protein 1 (GPIHBP1) functions as a platform and transport agent for lipoprotein lipase (LPL) which functions in the hydrolysis of chylomicrons, principally in heart, skeletal muscle and adipose tissue capillary endothelial cells. Previous reports of genetic deficiency for this protein have described severe chylomicronemia. Comparative GPIHBP1 amino acid sequences and structures and GPIHBP1 gene locations were examined using data from several mammalian genome projects. Mammalian GPIHBP1 genes usually contain four coding exons on the positive strand. Mammalian GPIHBP1 sequences shared 41–96% identities as compared with 9–32% sequence identities with other LY6-domain-containing human proteins (LY6-like). The human N-glycosylation site was predominantly conserved among other mammalian GPIHBP1 proteins except cow, dog and pig. Sequence alignments, key amino acid residues and conserved predicted secondary structures were also examined, including the N-terminal signal peptide, the acidic amino acid sequence region which binds LPL, the glycosylphosphatidylinositol linkage group, the Ly6 domain and the C-terminal α-helix. Comparative and phylogenetic studies of mammalian GPIHBP1 suggested that it originated in eutherian mammals from a gene duplication event of an ancestral LY6-like gene and subsequent integration of exon 2, which may have been derived from BCL11A (B-cell CLL/lymphoma 11A gene) encoding an extended acidic amino acid sequence
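The percentage identities quoted in this abstract come from pairwise comparison of aligned amino acid sequences. A minimal sketch of one such calculation follows. It assumes the two sequences are already aligned to equal length with `-` as the gap character and that gapped columns are excluded from the denominator; other tools use different denominators, so the convention here is an assumption, and the function name and example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns containing a gap in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Hypothetical aligned fragments, not real GPIHBP1 sequences.
identity = percent_identity("MKAL-VLL", "MRALGV-L")
print(f"{identity:.1f}%")
```

Here six columns are gap-free and five of them match, so the sketch reports roughly 83.3% identity for the toy pair.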

Crossref

Springer - Publisher Connector

PubMed Central

Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes

BACKGROUND: RNA metabolism, through 'combinatorial splicing', can generate enormous structural diversity in the proteome. Alternative domains may interact, however, with unpredictable phenotypic consequences, necessitating integrated RNA-level regulation of molecular composition. Splicing correlations within transcripts of single genes provide valuable clues to functional relationships among molecular domains as well as genomic targets for higher-order splicing regulation. RESULTS: We present tools to visualize complex splicing patterns in full-length cDNA libraries. Developmental changes in pair-wise correlations are presented vectorially in 'clock plots' and linkage grids. Higher-order correlations are assessed statistically through Monte Carlo analysis of a log-linear model with an empirical-Bayes estimate of the true probabilities of observed and unobserved splice forms. Log-linear coefficients are visualized in a 'spliceprint,' a signature of splice correlations in the transcriptome. We present two novel metrics: the linkage change index, which measures the directional change in pair-wise correlation with tissue differentiation, and the accuracy index, a very simple goodness-of-fit metric that is more sensitive than the integrated squared error when applied to sparsely populated tables, and unlike chi-square, does not diverge at low variance. Considerable attention is given to sparse contingency tables, which are inherent to single-gene libraries. CONCLUSION: Patterns of splicing correlations are revealed, which span a broad range of interaction order and change in development. The methods have a broad scope of applicability, beyond the single gene – including, for example, multiple gene interactions in the complete transcriptome

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Collection Of Biostatistics Research Archive

On the Origin and Evolution of Vertebrate Olfactory Receptor Genes: Comparative Genome Analysis Among 23 Chordate Species

Olfaction is a primitive sense in organisms. Both vertebrates and insects have receptors for detecting odor molecules in the environment, but the evolutionary origins of these genes are different. Among studied vertebrates, mammals have ∼1,000 olfactory receptor (OR) genes, whereas teleost fishes have much smaller (∼100) numbers of OR genes. To investigate the origin and evolution of vertebrate OR genes, I attempted to determine near-complete OR gene repertoires by searching whole-genome sequences of 14 nonmammalian chordates, including cephalochordates (amphioxus), urochordates (ascidian and larvacean), and vertebrates (sea lamprey, elephant shark, five teleost fishes, frog, lizard, and chicken), followed by a large-scale phylogenetic analysis in conjunction with mammalian OR genes identified from nine species. This analysis showed that the amphioxus has >30 vertebrate-type OR genes though it lacks distinctive olfactory organs, whereas all OR genes appear to have been lost in the urochordate lineage. Some groups of genes (θ, κ, and λ) that are phylogenetically nested within vertebrate OR genes showed few gene gains and losses, which is in sharp contrast to the evolutionary pattern of OR genes, suggesting that they are actually non-OR genes. Moreover, the analysis demonstrated a great difference in OR gene repertoires between aquatic and terrestrial vertebrates, reflecting the necessity for the detection of water-soluble and airborne odorants, respectively. However, a minor group (β) of genes that are atypically present in both aquatic and terrestrial vertebrates was also found. These findings should provide a critical foundation for further physiological, behavioral, and evolutionary studies of olfaction in various organisms

Crossref

PubMed Central

Law of Genome Evolution Direction : Coding Information Quantity Grows

Author: A. F. A. Smit
A. G. Matera
A. Mira
B. Charlesworth
C. L. Organ
C. Nusbaum
D. A. Petrov
D. L. Marais Des
D. R. Scannell
E. Schrodinger
E. T. Dermitzakis
F. Clark
G. Bejerano
G. Liu
G. Storz
H. H. Chou
H. H. Kazazian
H. Ozkan
H. Winter
I. J. Leitch
I. Wapinski
I. Wickelgren
International Human Genome Sequencing Consortium
J. Filkowski
J.M. Aury
K. M. Devos
L. F. Luo
L. F. Luo
L. F. Luo
L. He
L. Patthy
L. R. Zhang
Liao-fu Luo
R. J. Taft
R. P. Bininda-Edmonds
S. E. Peters
T. C. Stadtman
T. Kouzarides
T. R. Gregory
The ENCODE Project Consortium
W. Deng
W. Enard
W. H. Li
W. Makalowski
X. Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/08/2008

The problem of the directionality of genome evolution is studied. Based on an analysis of the C-value paradox and the evolution of genome size, we propose that the function-coding information quantity of a genome always grows in the course of evolution through sequence duplication, expansion of code, and gene transfer from outside. The function-coding information quantity of a genome consists of two parts: the p-coding information quantity, which encodes functional protein, and the n-coding information quantity, which encodes functional elements other than amino acid sequence. Evidence for this evolutionary law is listed. The need for function is the motive force for the expansion of coding information quantity, and this expansion is the route to functional innovation and extension for a species. The increase in the coding information quantity of a genome is therefore a measure of newly acquired function, and it determines the directionality of genome evolution.

Comment: 16 page

arXiv.org e-Print Archive

Crossref

Genome-wide comparison of Asian and African rice reveals high recent activity of DNA transposons

Author: AH Paterson
AJ Hartlerode
B Edlinger
B Piegu
C Feschotte
F Sabot
G Yang
G Yang
G Yang
GTH Vu
International Human Genome Sequencing Consortium
International Rice Genome Sequencing Project
JM Richardson
JP Buchmann
K Fujino
K Kikuchi
L Duret
LS Symington
M Kimura
M Wang
N Jiang
P Cao
P SanMiguel
R Kalendar
RH Plasterk
S Moon
S Ouyang
T Nakazaki
T Wicker
T Wicker
T Wicker
T Wicker
TD Wu
TE Bureau
TE Bureau
The International Brachypodium Initiative
V Robert
WR Engels
Publication venue: 'Springer Science and Business Media LLC'

Crossref

The bacterial and mitochondrial ribosomal A-site molecular switches possess different conformational substates

Author: Anderson
Barrell
Brünger
Brünger
Cannone
Carter
Collaborative Computational Project Number 4.
Dunham
Echols
Entelis
Eric Westhof
Fischel-Ghodsian
Florentz
Fokine
Fokine
François
François
Heckman
Hutchin
Hutchin
International Human Genome Sequencing Consortium.
Jiro Kondo
Jones
Koc
Koc
Kohanski
Komine
Kondo
Kondo
Kondo
Kondo
Krebs
Leontis
Leslie
Murphy
Navaza
Ogle
Ogle
Ogle
Ogle
O’Brien
Pfister
Prezant
Rossmann
Sanbonmatsu
Schneider
Selmer
Sengupta
Shandrick
Sprinzl
Suzuki
Suzuki
Terwilliger
Vicens
Vicens
Vicens
Weixlbaumer
Wimberly
Yokoyama
Zhao
Publication venue: Oxford University Press

The A site of the small ribosomal subunit participates in the fidelity of decoding by switching between two states, a resting ‘off’ state and an active decoding ‘on’ state. Eight crystal structures of RNA duplexes containing two minimal decoding A sites of the Homo sapiens mitochondrial wild-type, the A1555G mutant or bacteria have been solved. The resting ‘off’ state of the mitochondrial wild-type A site is surprisingly different from that of the bacterial A site. The mitochondrial A1555G mutant has two types of the ‘off’ states; one is similar to the mitochondrial wild-type ‘off’ state and the other is similar to the bacterial ‘off’ state. Our present results indicate that the dynamics of the A site in bacteria and mitochondria are different, a property probably related to the small number of tRNAs used for decoding in mitochondria. Based on these structures, we propose a hypothesis for the molecular mechanism of non-syndromic hearing loss due to the mitochondrial A1555G mutation

Crossref

PubMed Central

A genome-wide study of preferential amplification/hybridization in microarray-based pooled DNA experiments

Author: Akey
Arnheim
Bansal
Barcellos
Barratt
Buetow
Butcher
C.-H. Lin
C.S.J. Fann
Dubreuil
Guo
H.-C. Yang
Hillel
Hinds
Hinds
Holm
Hoogendoorn
Huang
J.-Y. Wu
Jawadi
Johnson
Kennedy
L.-H. Li
Le Hellard
Lindroos
Liu
M.-C. Huang
Macgregor
Matsuzaki
Meaburn
Meaburn
Mohlke
Moskvina
Nelson
Norton
Pan
Pusch
Sham
Shapiro
Shaw
Simpson
The ENCODE Project Consortium
The International HapMap Consortium
The International Human Genome Mapping Consortium
Uhl
Visscher
Werner
Wolford
Xu
Y.-J. Liang
Y.-T. Chen
Yang
Yang
Yang
Yang
Zou
Publication venue: Oxford University Press
Publication date: 23/08/2006

Microarray-based pooled DNA methods overcome the cost bottleneck of simultaneously genotyping more than 100 000 markers for numerous study individuals. The success of such methods relies on the proper adjustment of preferential amplification/hybridization to ensure accurate and reliable allele frequency estimation. We performed a hybridization-based genome-wide single nucleotide polymorphism (SNP) genotyping analysis to dissect preferential amplification/hybridization. The majority of SNPs had less than 2-fold signal amplification or suppression, and lognormal distributions adequately modeled preferential amplification/hybridization across the human genome. Comparative analyses suggested that the distributions of preferential amplification/hybridization differed among genotypes and with GC content. Patterns among different ethnic populations were similar; nevertheless, there were striking differences for a small proportion of SNPs, and a slight ethnic heterogeneity was observed. To fulfill appropriate and gratuitous adjustments, databases of preferential amplification/hybridization for African Americans, Caucasians and Asians were constructed based on the Affymetrix GeneChip Human Mapping 100 K Set. The robustness of allele frequency estimation using this database was validated by a pooled DNA experiment. This study provides a genome-wide investigation of preferential amplification/hybridization and suggests guidance for the reliable use of the database. Our results constitute an objective foundation for theoretical development of preferential amplification/hybridization and provide important information for future pooled DNA analyses.
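The adjustment this abstract refers to can be illustrated with a small sketch. It assumes one widely used form of correction, in which a per-SNP coefficient k is estimated as the mean ratio of allele A to allele B signal in heterozygous individuals, and the pooled frequency of allele A is then taken as A / (A + k·B). Whether the study's database stores coefficients in exactly this form is not stated in the abstract, so the formula, function names and numbers below are assumptions for illustration only.

```python
from statistics import mean
from typing import Sequence, Tuple


def estimate_k(het_signals: Sequence[Tuple[float, float]]) -> float:
    """Estimate the preferential amplification coefficient k for one SNP.

    het_signals holds (signal_A, signal_B) pairs measured in heterozygous
    individuals, where both alleles are present in equal amounts.
    k > 1 means allele A gives a stronger signal than allele B.
    """
    return mean(a / b for a, b in het_signals)


def corrected_allele_frequency(pool_a: float, pool_b: float, k: float = 1.0) -> float:
    """Frequency of allele A in a DNA pool, adjusted by the coefficient k.

    With k = 1.0 this reduces to the uncorrected signal proportion.
    """
    return pool_a / (pool_a + k * pool_b)


# Hypothetical signal intensities, not data from the study.
k = estimate_k([(1200.0, 800.0), (1500.0, 1000.0)])
raw = corrected_allele_frequency(600.0, 400.0)
adjusted = corrected_allele_frequency(600.0, 400.0, k)
print(k, raw, adjusted)
```

In this toy case allele A is over-represented 1.5-fold in heterozygotes, so an apparent pooled frequency of 0.6 is adjusted down to 0.5.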

Crossref

PubMed Central