40 research outputs found
Factors influencing success of clinical genome sequencing across a broad spectrum of disorders
To assess factors influencing the success of whole-genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases or families across a broad spectrum of disorders in whom previous screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritization. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease-causing variants in 21% of cases, with the proportion increasing to 34% (23/68) for mendelian disorders and 57% (8/14) in family trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, although only 4 were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis but also highlight many outstanding challenges
Factors influencing success of clinical genome sequencing across a broad spectrum of disorders
To assess factors influencing the success of whole-genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases or families across a broad spectrum of disorders in whom previous screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritization. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease-causing variants in 21% of cases, with the proportion increasing to 34% (23/68) for mendelian disorders and 57% (8/14) in family trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, although only 4 were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis but also highlight many outstanding challenges
A Gene Catalogue of the Euchromatic Male-Specific Region of the Horse Y Chromosome: Comparison with Human and Other Mammals
Studies of the Y chromosome in primates, rodents and carnivores provide compelling evidence that the male specific region of Y (MSY) contains functional genes, many of which have specialized roles in spermatogenesis and male-fertility. Little similarity, however, has been found between the gene content and sequence of MSY in different species. This hinders the discovery of species-specific male fertility genes and limits our understanding about MSY evolution in mammals. Here, a detailed MSY gene catalogue was developed for the horse – an odd-toed ungulate. Using direct cDNA selection from horse testis, and sequence analysis of Y-specific BAC clones, 37 horse MSY genes/transcripts were identified. The genes were mapped to the MSY BAC contig map, characterized for copy number, analyzed for transcriptional profiles by RT-PCR, examined for the presence of ORFs, and compared to other mammalian orthologs. We demonstrate that the horse MSY harbors 20 X-degenerate genes with known orthologs in other eutherian species. The remaining 17 genes are acquired or novel and have so far been identified only in the horse or donkey Y chromosomes. Notably, 3 transcripts were found in the heterochromatic part of the Y. We show that despite substantial differences between the sequence, gene content and organization of horse and other mammalian Y chromosomes, the functions of MSY genes are predominantly related to testis and spermatogenesis. Altogether, 10 multicopy genes with testis-specific expression were identified in the horse MSY, and considered likely candidate genes for stallion fertility. The findings establish an important foundation for the study of Y-linked genetic factors governing fertility in stallions, and improve our knowledge about the evolutionary processes that have shaped Y chromosomes in different mammalian lineages
100,000 Genomes Pilot on Rare-Disease Diagnosis in Health Care — Preliminary Report
BACKGROUND: The U.K. 100,000 Genomes Project is in the process of investigating the role of genome sequencing in patients with undiagnosed rare diseases after usual care and the alignment of this research with health care implementation in the U.K. National Health Service. Other parts of this project focus on patients with cancer and infection. METHODS: We conducted a pilot study involving 4660 participants from 2183 families, among whom 161 disorders covering a broad spectrum of rare diseases were present. We collected data on clinical features with the use of Human Phenotype Ontology terms, undertook genome sequencing, applied automated variant prioritization on the basis of applied virtual gene panels and phenotypes, and identified novel pathogenic variants through research analysis. RESULTS: Diagnostic yields varied among family structures and were highest in family trios (both parents and a proband) and families with larger pedigrees. Diagnostic yields were much higher for disorders likely to have a monogenic cause (35%) than for disorders likely to have a complex cause (11%). Diagnostic yields for intellectual disability, hearing disorders, and vision disorders ranged from 40 to 55%. We made genetic diagnoses in 25% of the probands. A total of 14% of the diagnoses were made by means of the combination of research and automated approaches, which was critical for cases in which we found etiologic noncoding, structural, and mitochondrial genome variants and coding variants poorly covered by exome sequencing. Cohortwide burden testing across 57,000 genomes enabled the discovery of three new disease genes and 19 new associations. Of the genetic diagnoses that we made, 25% had immediate ramifications for clinical decision making for the patients or their relatives. CONCLUSIONS: Our pilot study of genome sequencing in a national health care system showed an increase in diagnostic yield across a range of rare diseases. (Funded by the National Institute for Health Research and others.)
An animal model to evaluate the function and regulation of the adaptively evolving stress protein SEP53 in oesophageal bile damage responses
Squamous epithelium in mammals has evolved an atypical stress response involving down-regulation of the classic HSP70 protein and induction of sets of proteins including one named SEP53. This atypical stress response might be due to the unusual environmental pressures placed on squamous tissue. In fact, SEP53 plays a role as an anti-apoptotic factor in response to DNA damage induced by deoxycholic acid stresses implicated in oesophageal reflux disease. SEP53 also has a genetic signature characteristic of an adaptively and rapidly evolving gene, and this observation has been used to imply a role for SEP53 in immunity. Physiological models of squamous tissue are required to further define the regulation and function of SEP53. We examined whether porcine squamous epithelium would be a good model to study SEP53, since this animal suffers from a bile-reflux disease in squamous oesophageal tissue. We have (1) cloned and sequenced the porcine SEP53 locus from porcine bacterial artificial chromosome genomic DNA, (2) confirmed the strikingly divergent nature of the C-terminal portion of the SEP53 gene amongst mammals, (3) discovered that a function of the conserved N-terminal domain of the gene is to maintain cytoplasmic localisation, and (4) examined SEP53 expression in normal and diseased porcine pars oesophagea. SEP53 expression in porcine tissue was relatively confined to gastric squamous epithelium, consistent with its expression in normal human squamous epithelium. Immunohistochemical staining for SEP53 protein in normal and damaged pars oesophagea demonstrated significant stabilisation of SEP53 protein in the injured tissue. These results suggest that porcine squamous epithelium would be a robust physiological model to examine the evolution and function of the SEP53 stress pathway in modulating stress-induced responses in squamous tissue
A global reference for human genetic variation
The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.We thank the many people who were generous with contributing their samples to the project: the African Caribbean in Barbados; Bengali in Bangladesh; British in England and Scotland; Chinese Dai in Xishuangbanna, China; Colombians in Medellin, Colombia; Esan in Nigeria; Finnish in Finland; Gambian in Western Division – Mandinka; Gujarati Indians in Houston, Texas, USA; Han Chinese in Beijing, China; Iberian populations in Spain; Indian Telugu in the UK; Japanese in Tokyo, Japan; Kinh in Ho Chi Minh City, Vietnam; Luhya in Webuye, Kenya; Mende in Sierra Leone; people with African ancestry in the southwest USA; people with Mexican ancestry in Los Angeles, California, USA; Peruvians in Lima, Peru; Puerto Ricans in Puerto Rico; Punjabi in Lahore, Pakistan; southern Han Chinese; Sri Lankan Tamil in the UK; Toscani in Italia; Utah residents (CEPH) with northern and western European ancestry; and Yoruba in Ibadan, Nigeria. Many thanks to the people who contributed to this project: P. Maul, T. Maul, and C. Foster; Z. Chong, X. Fan, W. Zhou, and T. Chen; N. Sengamalay, S. Ott, L. Sadzewicz, J. Liu, and L. Tallon; L. Merson; O. Folarin, D. Asogun, O. Ikpwonmosa, E. Philomena, G. Akpede, S. Okhobgenin, and O. Omoniwa; the staff of the Institute of Lassa Fever Research and Control (ILFRC), Irrua Specialist Teaching Hospital, Irrua, Edo State, Nigeria; A. Schlattl and T. Zichner; S. Lewis, E. Appelbaum, and L. Fulton; A. Yurovsky and I. Padioleau; N. Kaelin and F. Laplace; E. Drury and H. Arbery; A. Naranjo, M. Victoria Parra, and C. Duque; S. Däkel, B. Lenz, and S. Schrinner; S. Bumpstead; and C. Fletcher-Hoppe. Funding for this work was from the Wellcome Trust Core Award 090532/Z/09/Z and Senior Investigator Award 095552/Z/11/Z (P.D.), and grants WT098051 (R.D.), WT095908 and WT109497 (P.F.), WT086084/Z/08/Z and WT100956/Z/13/Z (G.M.), WT097307 (W.K.), WT0855322/Z/08/Z (R.L.), WT090770/Z/09/Z (D.K.), the Wellcome Trust Major Overseas program in Vietnam grant 089276/Z.09/Z (S.D.), the Medical Research Council UK grant G0801823 (J.L.M.), the UK Biotechnology and Biological Sciences Research Council grants BB/I02593X/1 (G.M.) and BB/I021213/1 (A.R.L.), the British Heart Foundation (C.A.A.), the Monument Trust (J.H.), the European Molecular Biology Laboratory (P.F.), the European Research Council grant 617306 (J.L.M.), the Chinese 863 Program 2012AA02A201, the National Basic Research program of China 973 program no. 2011CB809201, 2011CB809202 and 2011CB809203, Natural Science Foundation of China 31161130357, the Shenzhen Municipal Government of China grant ZYC201105170397A (J.W.), the Canadian Institutes of Health Research Operating grant 136855 and Canada Research Chair (S.G.), Banting Postdoctoral Fellowship from the Canadian Institutes of Health Research (M.K.D.), a Le Fonds de Recherche duQuébec-Santé (FRQS) research fellowship (A.H.), Genome Quebec (P.A.), the Ontario Ministry of Research and Innovation – Ontario Institute for Cancer Research Investigator Award (P.A., J.S.), the Quebec Ministry of Economic Development, Innovation, and Exports grant PSR-SIIRI-195 (P.A.), the German Federal Ministry of Education and Research (BMBF) grants 0315428A and 01GS08201 (R.H.), the Max Planck Society (H.L., G.M., R.S.), BMBF-EPITREAT grant 0316190A (R.H., M.L.), the German Research Foundation (Deutsche Forschungsgemeinschaft) Emmy Noether Grant KO4037/1-1 (J.O.K.), the Beatriu de Pinos Program grants 2006 BP-A 10144 and 2009 BP-B 00274 (M.V.), the Spanish National Institute for Health Research grant PRB2 IPT13/0001-ISCIII-SGEFI/FEDER (A.O.), Ewha Womans University (C.L.), the Japan Society for the Promotion of Science Fellowship number PE13075 (N.P.), the Louis Jeantet Foundation (E.T.D.), the Marie Curie Actions Career Integration grant 303772 (C.A.), the Swiss National Science Foundation 31003A_130342 and NCCR “Frontiers in Genetics” (E.T.D.), the University of Geneva (E.T.D., T.L., G.M.), the US National Institutes of Health National Center for Biotechnology Information (S.S.) and grants U54HG3067 (E.S.L.), U54HG3273 and U01HG5211 (R.A.G.), U54HG3079 (R.K.W., E.R.M.), R01HG2898 (S.E.D.), R01HG2385 (E.E.E.), RC2HG5552 and U01HG6513 (G.T.M., G.R.A.), U01HG5214 (A.C.), U01HG5715 (C.D.B.), U01HG5718 (M.G.), U01HG5728 (Y.X.F.), U41HG7635 (R.K.W., E.E.E., P.H.S.), U41HG7497 (C.L., M.A.B., K.C., L.D., E.E.E., M.G., J.O.K., G.T.M., S.A.M., R.E.M., J.L.S., K.Y.), R01HG4960 and R01HG5701 (B.L.B.), R01HG5214 (G.A.), R01HG6855 (S.M.), R01HG7068 (R.E.M.), R01HG7644 (R.D.H.), DP2OD6514 (P.S.), DP5OD9154 (J.K.), R01CA166661 (S.E.D.), R01CA172652 (K.C.), P01GM99568 (S.R.B.), R01GM59290 (L.B.J., M.A.B.), R01GM104390 (L.B.J., M.Y.Y.), T32GM7790 (C.D.B., A.R.M.), P01GM99568 (S.R.B.), R01HL87699 and R01HL104608 (K.C.B.), T32HL94284 (J.L.R.F.), and contracts HHSN268201100040C (A.M.R.) and HHSN272201000025C (P.S.), Harvard Medical School Eleanor and Miles Shore Fellowship (K.L.), Lundbeck Foundation Grant R170-2014-1039 (K.L.), NIJ Grant 2014-DN-BX-K089 (Y.E.), the Mary Beryl Patch Turnbull Scholar Program (K.C.B.), NSF Graduate Research Fellowship DGE-1147470 (G.D.P.), the Simons Foundation SFARI award SF51 (M.W.), and a Sloan Foundation Fellowship (R.D.H.). E.E.E. is an investigator of the Howard Hughes Medical Institute
Genetic determinants of risk in pulmonary arterial hypertension: international genome-wide association studies and meta-analysis
Background Rare genetic variants cause pulmonary arterial hypertension, but the contribution of common genetic
variation to disease risk and natural history is poorly characterised. We tested for genome-wide association for pulmonary
arterial hypertension in large international cohorts and assessed the contribution of associated regions to outcomes.
Methods We did two separate genome-wide association studies (GWAS) and a meta-analysis of pulmonary arterial
hypertension. These GWAS used data from four international case-control studies across 11744 individuals with
European ancestry (including 2085 patients). One GWAS used genotypes from 5895 whole-genome sequences and
the other GWAS used genotyping array data from an additional 5849 individuals. Cross-validation of loci reaching
genome-wide significance was sought by meta-analysis. Conditional analysis corrected for the most significant variants
at each locus was used to resolve signals for multiple associations. We functionally annotated associated variants and
tested associations with duration of survival. All-cause mortality was the primary endpoint in survival analyses.
Findings A locus near SOX17 (rs10103692, odds ratio 1·80 [95% CI 1·55–2·08], p=5·13×10–
¹⁵) and a second locus in
HLA-DPA1 and HLA-DPB1 (collectively referred to as HLA-DPA1/DPB1 here; rs2856830, 1·56 [1·42–1·71],
p=7·65×10–
²⁰) within the class II MHC region were associated with pulmonary arterial hypertension. The SOX17 locus
had two independent signals associated with pulmonary arterial hypertension (rs13266183, 1·36 [1·25–1·48],
p=1·69×10–
¹²; and rs10103692). Functional and epigenomic data indicate that the risk variants near SOX17 alter gene
regulation via an enhancer active in endothelial cells. Pulmonary arterial hypertension risk variants determined
haplotype-specific enhancer activity, and CRISPR-mediated inhibition of the enhancer reduced SOX17 expression. The
HLA-DPA1/DPB1 rs2856830 genotype was strongly associated with survival. Median survival from diagnosis in
patients with pulmonary arterial hypertension with the C/C homozygous genotype was double (13·50 years [95% CI
12·07 to >13·50]) that of those with the T/T genotype (6·97 years [6·02–8·05]), despite similar baseline disease severity.
Interpretation This is the first study to report that common genetic variation at loci in an enhancer near SOX17 and in
HLA-DPA1/DPB1 is associated with pulmonary arterial hypertension. Impairment of SOX17 function might be more
common in pulmonary arterial hypertension than suggested by rare mutations in SOX17. Further studies are needed
to confirm the association between HLA typing or rs2856830 genotyping and survival, and to determine whether HLA
typing or rs2856830 genotyping improves risk stratification in clinical practice or trials.
Funding UK NIHR, BHF, UK MRC, Dinosaur Trust, NIH/NHLBI, ERS, EMBO, Wellcome Trust, EU, AHA,
ACClinPharm, Netherlands CVRI, Dutch Heart Foundation, Dutch Federation of UMC, Netherlands OHRD and
RNAS, German DFG, German BMBF, APH Paris, INSERM, Université Paris-Sud, and French ANR