87 research outputs found

    The functional spectrum of low-frequency coding variation

    Get PDF
    Background Rare coding variants constitute an important class of human genetic variation, but are underrepresented in current databases that are based on small population samples. Recent studies show that variants altering amino acid sequence and protein function are enriched at low variant allele frequency, 2 to 5%, but because of insufficient sample size it is not clear if the same trend holds for rare variants below 1% allele frequency. Results The 1000 Genomes Exon Pilot Project has collected deep-coverage exon-capture data in roughly 1,000 human genes, for nearly 700 samples. Although medical whole-exome projects are currently afoot, this is still the deepest reported sampling of a large number of human genes with next-generation technologies. According to the goals of the 1000 Genomes Project, we created effective informatics pipelines to process and analyze the data, and discovered 12,758 exonic SNPs, 70% of them novel, and 74% below 1% allele frequency in the seven population samples we examined. Our analysis confirms that coding variants below 1% allele frequency show increased population-specificity and are enriched for functional variants. Conclusions This study represents a large step toward detecting and interpreting low frequency coding variation, clearly lays out technical steps for effective analysis of DNA capture data, and articulates functional and population properties of this important class of genetic variatio

    Platypus globin genes and flanking loci suggest a new insertional model for beta-globin evolution in birds and mammals

    Get PDF
    Background: Vertebrate alpha (α)- and beta (β)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the α- and β-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil β-globin gene (ω) in the marsupial α-cluster, however, suggested that duplication of the α-β cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous α- and β-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. Results: The platypus α-globin cluster (chromosome 21) contains embryonic and adult α- globin genes, a β-like ω-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-ζ-ζ'-αD-α3-α2-α1-ω-GBY-3'. The platypus β-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-ε-β-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate α-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal β-globin clusters are embedded in olfactory genes. Thus, the mammalian α- and β-globin clusters are orthologous to the bird α- and β-globin clusters respectively. Conclusion: We propose that α- and β-globin clusters evolved from an ancient MPG-C16orf35-α-β-GBY-LUC7L arrangement 410 million years ago. A copy of the original β (represented by ω in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of β-globin genes with different expression profiles in different lineages.Vidushi S. Patel, Steven J.B. Cooper, Janine E. Deakin, Bob Fulton, Tina Graves, Wesley C. Warren, Richard K. Wilson and Jennifer A.M. Grave

    A second planet transiting LTT 1445A and a determination of the masses of both worlds

    Get PDF
    K.H. acknowledges support from STFC grant ST/R000824/1.LTT 1445 is a hierarchical triple M-dwarf star system located at a distance of 6.86 pc. The primary star LTT 1445A (0.257 M⊙) is known to host the transiting planet LTT 1445Ab with an orbital period of 5.36 days, making it the second-closest known transiting exoplanet system, and the closest one for which the host is an M dwarf. Using Transiting Exoplanet Survey Satellite data, we present the discovery of a second planet in the LTT 1445 system, with an orbital period of 3.12 days. We combine radial-velocity measurements obtained from the five spectrographs, Echelle Spectrograph for Rocky Exoplanets and Stable Spectroscopic Observations, High Accuracy Radial Velocity Planet Searcher, High-Resolution Echelle Spectrometer, MAROON-X, and Planet Finder Spectrograph to establish that the new world also orbits LTT 1445A. We determine the mass and radius of LTT 1445Ab to be 2.87 ± 0.25 M⊕ and 1.304-0.060+0.067 R⊕, consistent with an Earth-like composition. For the newly discovered LTT 1445Ac, we measure a mass of 1.54-0.19+0.20 M⊕ and a minimum radius of 1.15 R⊕, but we cannot determine the radius directly as the signal-to-noise ratio of our light curve permits both grazing and nongrazing configurations. Using MEarth photometry and ground-based spectroscopy, we establish that star C (0.161 M⊙) is likely the source of the 1.4 day rotation period, and star B (0.215 M⊙) has a likely rotation period of 6.7 days. We estimate a probable rotation period of 85 days for LTT 1445A. Thus, this triple M-dwarf system appears to be in a special evolutionary stage where the most massive M dwarf has spun down, the intermediate mass M dwarf is in the process of spinning down, while the least massive stellar component has not yet begun to spin down.Publisher PDFPeer reviewe

    Identification of the top TESS objects of interest for atmospheric characterization of transiting exoplanets with JWST

    Get PDF
    Funding: Funding for the TESS mission is provided by NASA's Science Mission Directorate. This work makes use of observations from the LCOGT network. Part of the LCOGT telescope time was granted by NOIRLab through the Mid-Scale Innovations Program (MSIP). MSIP is funded by NSF. This paper is based on observations made with the MuSCAT3 instrument, developed by the Astrobiology Center and under financial support by JSPS KAKENHI (grant No. JP18H05439) and JST PRESTO (grant No. JPMJPR1775), at Faulkes Telescope North on Maui, HI, operated by the Las Cumbres Observatory. This paper makes use of data from the MEarth Project, which is a collaboration between Harvard University and the Smithsonian Astrophysical Observatory. The MEarth Project acknowledges funding from the David and Lucile Packard Fellowship for Science and Engineering, the National Science Foundation under grant Nos. AST-0807690, AST-1109468, AST-1616624 and AST-1004488 (Alan T. Waterman Award), the National Aeronautics and Space Administration under grant No. 80NSSC18K0476 issued through the XRP Program, and the John Templeton Foundation. C.M. would like to gratefully acknowledge the entire Dragonfly Telephoto Array team, and Bob Abraham in particular, for allowing their telescope bright time to be put to use observing exoplanets. B.J.H. acknowledges support from the Future Investigators in NASA Earth and Space Science and Technology (FINESST) program (grant No. 80NSSC20K1551) and support by NASA under grant No. 80GSFC21M0002. K.A.C. and C.N.W. acknowledge support from the TESS mission via subaward s3449 from MIT. D.R.C. and C.A.C. acknowledge support from NASA through the XRP grant No. 18-2XRP18_2-0007. C.A.C. acknowledges that this research was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration (80NM0018D0004). S.Z. and A.B. acknowledge support from the Israel Ministry of Science and Technology (grant No. 3-18143). The research leading to these results has received funding from the ARC grant for Concerted Research Actions, financed by the Wallonia-Brussels Federation. TRAPPIST is funded by the Belgian Fund for Scientific Research (Fond National de la Recherche Scientifique, FNRS) under the grant No. PDR T.0120.21. The postdoctoral fellowship of K.B. is funded by F.R.S.-FNRS grant No. T.0109.20 and by the Francqui Foundation. H.P.O.'s contribution has been carried out within the framework of the NCCR PlanetS supported by the Swiss National Science Foundation under grant Nos. 51NF40_182901 and 51NF40_205606. F.J.P. acknowledges financial support from the grant No. CEX2021-001131-S funded by MCIN/AEI/ 10.13039/501100011033. A.J. acknowledges support from ANID—Millennium Science Initiative—ICN12_009 and from FONDECYT project 1210718. Z.L.D. acknowledges the MIT Presidential Fellowship and that this material is based upon work supported by the National Science Foundation Graduate Research Fellowship under grant No. 1745302. P.R. acknowledges support from the National Science Foundation grant No. 1952545. This work is partly supported by JSPS KAKENHI grant Nos. JP17H04574, JP18H05439, JP21K20376; JST CREST grant No. JPMJCR1761; and Astrobiology Center SATELLITE Research project AB022006. This publication benefits from the support of the French Community of Belgium in the context of the FRIA Doctoral Grant awarded to M.T. D.D. acknowledges support from TESS Guest Investigator Program grant Nos. 80NSSC22K1353, 80NSSC22K0185, and 80NSSC23K0769. A.B. acknowledges the support of M.V. Lomonosov Moscow State University Program of Development. T.D. was supported in part by the McDonnell Center for the Space Sciences. V.K. acknowledges support from the youth scientific laboratory project, topic FEUZ-2020-0038.JWST has ushered in an era of unprecedented ability to characterize exoplanetary atmospheres. While there are over 5000 confirmed planets, more than 4000 Transiting Exoplanet Survey Satellite (TESS) planet candidates are still unconfirmed and many of the best planets for atmospheric characterization may remain to be identified. We present a sample of TESS planets and planet candidates that we identify as “best-in-class” for transmission and emission spectroscopy with JWST. These targets are sorted into bins across equilibrium temperature Teq and planetary radius Rp and are ranked by a transmission and an emission spectroscopy metric (TSM and ESM, respectively) within each bin. We perform cuts for expected signal size and stellar brightness to remove suboptimal targets for JWST. Of the 194 targets in the resulting sample, 103 are unconfirmed TESS planet candidates, also known as TESS Objects of Interest (TOIs). We perform vetting and statistical validation analyses on these 103 targets to determine which are likely planets and which are likely false positives, incorporating ground-based follow-up from the TESS Follow-up Observation Program to aid the vetting and validation process. We statistically validate 18 TOIs, marginally validate 31 TOIs to varying levels of confidence, deem 29 TOIs likely false positives, and leave the dispositions for four TOIs as inconclusive. Twenty-one of the 103 TOIs were confirmed independently over the course of our analysis. We intend for this work to serve as a community resource and motivate formal confirmation and mass measurements of each validated planet. We encourage more detailed analysis of individual targets by the community.Peer reviewe

    New insights into the genetic etiology of Alzheimer's disease and related dementias

    Get PDF
    Characterization of the genetic landscape of Alzheimer's disease (AD) and related dementias (ADD) provides a unique opportunity for a better understanding of the associated pathophysiological processes. We performed a two-stage genome-wide association study totaling 111,326 clinically diagnosed/'proxy' AD cases and 677,663 controls. We found 75 risk loci, of which 42 were new at the time of analysis. Pathway enrichment analyses confirmed the involvement of amyloid/tau pathways and highlighted microglia implication. Gene prioritization in the new loci identified 31 genes that were suggestive of new genetically associated processes, including the tumor necrosis factor alpha pathway through the linear ubiquitin chain assembly complex. We also built a new genetic risk score associated with the risk of future AD/dementia or progression from mild cognitive impairment to AD/dementia. The improvement in prediction led to a 1.6- to 1.9-fold increase in AD risk from the lowest to the highest decile, in addition to effects of age and the APOE ε4 allele

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead
    corecore