50 research outputs found
Recommended from our members
<i>De Novo</i> and Rare Variants at Multiple Loci Support the Oligogenic Origins of Atrioventricular Septal Heart Defects
Congenital heart disease (CHD) has a complex genetic etiology, and recent studies suggest that high penetrance de novo mutations may account for only a small fraction of disease. In a multi-institutional cohort surveyed by exome sequencing, combining analysis of 987 individuals (discovery cohort of 59 affected trios and 59 control trios, and a replication cohort of 100 affected singletons and 533 unaffected singletons) we observe variation at novel and known loci related to a specific cardiac malformation the atrioventricular septal defect (AVSD). In a primary analysis, by combining developmental coexpression networks with inheritance modeling, we identify a de novo mutation in the DNA binding domain of NR1D2 (p.R175W). We show that p.R175W changes the transcriptional activity of Nr1d2 using an in vitro transactivation model in HUVEC cells. Finally, we demonstrate previously unrecognized cardiovascular malformations in the Nr1d2tm1-Dgen knockout mouse. In secondary analyses we map genetic variation to protein-interaction networks suggesting a role for two collagen genes in AVSD, which we corroborate by burden testing in a second replication cohort of 100 AVSDs and 533 controls (p = 8.37e-08). Finally, we apply a rare-disease inheritance model to identify variation in genes previously associated with CHD (ZFPM2, NSD1, NOTCH1, VCAN, and MYH6), cardiac malformations in mouse models (ADAM17, CHRD, IFT140, PTPRJ, RYR1 and ATE1), and hypomorphic alleles of genes causing syndromic CHD (EHMT1, SRCAP, BBS2, NOTCH2, and KMT2D) in 14 of 59 trios, greatly exceeding variation in control trios without CHD (p = 9.60e-06). In total, 32% of trios carried at least one putatively disease-associated variant across 19 loci,suggesting that inherited and de novo variation across a heterogeneous group of loci may contribute to disease risk
Correction: Exome Sequencing of Phenotypic Extremes Identifies CAV2 and TMC6 as Interacting Modifiers of Chronic Pseudomonas aeruginosa Infection in Cystic Fibrosis
Discovery of rare or low frequency variants in exome or genome data that are associated with complex traits often will require use of very large sample sizes to achieve adequate statistical power. For a fixed sample size, sequencing of individuals sampled from the tails of a phenotype distribution (i.e., extreme phenotypes design) maximizes power and this approach was recently validated empirically with the discovery of variants in DCTN4 that influence the natural history of P. aeruginosa airway infection in persons with cystic fibrosis (CF; MIM219700). The increasing availability of large exome/genome sequence datasets that serve as proxies for population-based controls affords the opportunity to test an alternative, potentially more powerful and generalizable strategy, in which the frequency of rare variants in a single extreme phenotypic group is compared to a control group (i.e., extreme phenotype vs. control population design). As proof-of-principle, we applied this approach to search for variants associated with risk for age-of-onset of chronic P. aeruginosa airway infection among individuals with CF and identified variants in CAV2 and TMC6 that were significantly associated with group status. These results were validated using a large, prospective, longitudinal CF cohort and confirmed a significant association of a variant in CAV2 with increased age-of-onset of P. aeruginosa airway infection (hazard ratio = 0.48, 95% CI=[0.32, 0.88]) and variants in TMC6 with diminished age-of-onset of P. aeruginosa airway infection (HR = 5.4, 95% CI=[2.2, 13.5]) A strong interaction between CAV2 and TMC6 variants was observed (HR=12.1, 95% CI=[3.8, 39]) for children with the deleterious TMC6 variant and without the CAV2 protective variant. Neither gene showed a significant association using an extreme phenotypes design, and conditions for which the power of an extreme phenotype vs. control population design was greater than that for the extreme phenotypes design were explored
Genome-Wide Association Study Identifies Genetic Loci Associated with Iron Deficiency
The existence of multiple inherited disorders of iron metabolism in man, rodents and other vertebrates suggests genetic contributions to iron deficiency. To identify new genomic locations associated with iron deficiency, a genome-wide association study (GWAS) was performed using DNA collected from white men aged ≥25 y and women ≥50 y in the Hemochromatosis and Iron Overload Screening (HEIRS) Study with serum ferritin (SF) ≤ 12 µg/L (cases) and iron replete controls (SF>100 µg/L in men, SF>50 µg/L in women). Regression analysis was used to examine the association between case-control status (336 cases, 343 controls) and quantitative serum iron measures and 331,060 single nucleotide polymorphism (SNP) genotypes, with replication analyses performed in a sample of 71 cases and 161 controls from a population of white male and female veterans screened at a US Veterans Affairs (VA) medical center. Five SNPs identified in the GWAS met genome-wide statistical significance for association with at least one iron measure, rs2698530 on chr. 2p14; rs3811647 on chr. 3q22, a known SNP in the transferrin (TF) gene region; rs1800562 on chr. 6p22, the C282Y mutation in the HFE gene; rs7787204 on chr. 7p21; and rs987710 on chr. 22q11 (GWAS observed P<1.51×10−7 for all). An association between total iron binding capacity and SNP rs3811647 in the TF gene (GWAS observed P = 7.0×10−9, corrected P = 0.012) was replicated within the VA samples (observed P = 0.012). Associations with the C282Y mutation in the HFE gene also were replicated. The joint analysis of the HEIRS and VA samples revealed strong associations between rs2698530 on chr. 2p14 and iron status outcomes. These results confirm a previously-described TF polymorphism and implicate one potential new locus as a target for gene identification
Recommended from our members
De novo variants in congenital diaphragmatic hernia identify MYRF as a new syndrome and reveal genetic overlaps with other developmental disorders
Congenital diaphragmatic hernia (CDH) is a severe birth defect that is often accompanied by other congenital anomalies. Previous exome sequencing studies for CDH have supported a role of de novo damaging variants but did not identify any recurrently mutated genes. To investigate further the genetics of CDH, we analyzed de novo coding variants in 362 proband-parent trios including 271 new trios reported in this study. We identified four unrelated individuals with damaging de novo variants in MYRF (P = 5.3x10-8), including one likely gene-disrupting (LGD) and three deleterious missense (D-mis) variants. Eight additional individuals with de novo LGD or missense variants were identified from our other genetic studies or from the literature. Common phenotypes of MYRF de novo variant carriers include CDH, congenital heart disease and genitourinary abnormalities, suggesting that it represents a novel syndrome. MYRF is a membrane associated transcriptional factor highly expressed in developing diaphragm and is depleted of LGD variants in the general population. All de novo missense variants aggregated in two functional protein domains. Analyzing the transcriptome of patient-derived diaphragm fibroblast cells suggest that disease associated variants abolish the transcription factor activity. Furthermore, we showed that the remaining genes with damaging variants in CDH significantly overlap with genes implicated in other developmental disorders. Gene expression patterns and patient phenotypes support pleiotropic effects of damaging variants in these genes on CDH and other developmental disorders. Finally, functional enrichment analysis implicates the disruption of regulation of gene expression, kinase activities, intra-cellular signaling, and cytoskeleton organization as pathogenic mechanisms in CDH
Finishing the euchromatic sequence of the human genome
The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead