103 research outputs found

    CG dinucleotide clustering is a species-specific property of the genome

    Get PDF
    Cytosines at cytosine-guanine (CG) dinucleotides are the near-exclusive target of DNA methyltransferases in mammalian genomes. Spontaneous deamination of methylcytosine to thymine makes methylated cytosines unusually susceptible to mutation and consequent depletion. The loci where CG dinucleotides remain relatively enriched, presumably due to their unmethylated status during the germ cell cycle, have been referred to as CpG islands. Currently, CpG islands are solely defined by base compositional criteria, allowing annotation of any sequenced genome. Using a novel bioinformatic approach, we show that CG clusters can be identified as an inherent property of genomic sequence without imposing a base compositional a priori assumption. We also show that the CG clusters co-localize in the human genome with hypomethylated loci and annotated transcription start sites to a greater extent than annotations produced by prior CpG island definitions. Moreover, this new approach allows CG clusters to be identified in a species-specific manner, revealing a degree of orthologous conservation that is not revealed by current base compositional approaches. Finally, our approach is able to identify methylating genomes (such as Takifugu rubripes) that lack CG clustering entirely, in which it is inappropriate to annotate CpG islands or CG clusters

    Comparing Aerodynamic Efficiency in Birds and Bats Suggests Better Flight Performance in Birds

    Get PDF
    Flight is one of the energetically most costly activities in the animal kingdom, suggesting that natural selection should work to optimize flight performance. The similar size and flight speed of birds and bats may therefore suggest convergent aerodynamic performance; alternatively, flight performance could be restricted by phylogenetic constraints. We test which of these scenarios fit to two measures of aerodynamic flight efficiency in two passerine bird species and two New World leaf-nosed bat species. Using time-resolved particle image velocimetry measurements of the wake of the animals flying in a wind tunnel, we derived the span efficiency, a metric for the efficiency of generating lift, and the lift-to-drag ratio, a metric for mechanical energetic flight efficiency. We show that the birds significantly outperform the bats in both metrics, which we ascribe to variation in aerodynamic function of body and wing upstroke: Bird bodies generated relatively more lift than bat bodies, resulting in a more uniform spanwise lift distribution and higher span efficiency. A likely explanation would be that the bat ears and nose leaf, associated with echolocation, disturb the flow over the body. During the upstroke, the birds retract their wings to make them aerodynamically inactive, while the membranous bat wings generate thrust and negative lift. Despite the differences in performance, the wake morphology of both birds and bats resemble the optimal wake for their respective lift-to-drag ratio regimes. This suggests that evolution has optimized performance relative to the respective conditions of birds and bats, but that maximum performance is possibly limited by phylogenetic constraints. Although ecological differences between birds and bats are subjected to many conspiring variables, the different aerodynamic flight efficiency for the bird and bat species studied here may help explain why birds typically fly faster, migrate more frequently and migrate longer distances than bats

    Refining transcriptional programs in kidney development by integration of deep RNA-sequencing and array-based spatial profiling

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The developing mouse kidney is currently the best-characterized model of organogenesis at a transcriptional level. Detailed spatial maps have been generated for gene expression profiling combined with systematic <it>in situ </it>screening. These studies, however, fall short of capturing the transcriptional complexity arising from each locus due to the limited scope of microarray-based technology, which is largely based on "gene-centric" models.</p> <p>Results</p> <p>To address this, the polyadenylated RNA and microRNA transcriptomes of the 15.5 dpc mouse kidney were profiled using strand-specific RNA-sequencing (RNA-Seq) to a depth sufficient to complement spatial maps from pre-existing microarray datasets. The transcriptional complexity of RNAs arising from mouse RefSeq loci was catalogued; including 3568 alternatively spliced transcripts and 532 uncharacterized alternate 3' UTRs. Antisense expressions for 60% of RefSeq genes was also detected including uncharacterized non-coding transcripts overlapping kidney progenitor markers, Six2 and Sall1, and were validated by section <it>in situ </it>hybridization. Analysis of genes known to be involved in kidney development, particularly during mesenchymal-to-epithelial transition, showed an enrichment of non-coding antisense transcripts extended along protein-coding RNAs.</p> <p>Conclusion</p> <p>The resulting resource further refines the transcriptomic cartography of kidney organogenesis by integrating deep RNA sequencing data with locus-based information from previously published expression atlases. The added resolution of RNA-Seq has provided the basis for a transition from classical gene-centric models of kidney development towards more accurate and detailed "transcript-centric" representations, which highlights the extent of transcriptional complexity of genes that direct complex development events.</p

    The Psychological Science Accelerator: Advancing Psychology Through a Distributed Collaborative Network

    Get PDF
    Source at https://doi.org/10.1177/2515245918797607.Concerns about the veracity of psychological research have been growing. Many findings in psychological science are based on studies with insufficient statistical power and nonrepresentative samples, or may otherwise be limited to specific, ungeneralizable settings or populations. Crowdsourced research, a type of large-scale collaboration in which one or more research projects are conducted across multiple lab sites, offers a pragmatic solution to these and other current methodological challenges. The Psychological Science Accelerator (PSA) is a distributed network of laboratories designed to enable and support crowdsourced research projects. These projects can focus on novel research questions or replicate prior research in large, diverse samples. The PSA’s mission is to accelerate the accumulation of reliable and generalizable evidence in psychological science. Here, we describe the background, structure, principles, procedures, benefits, and challenges of the PSA. In contrast to other crowdsourced research networks, the PSA is ongoing (as opposed to time limited), efficient (in that structures and principles are reused for different projects), decentralized, diverse (in both subjects and researchers), and inclusive (of proposals, contributions, and other relevant input from anyone inside or outside the network). The PSA and other approaches to crowdsourced psychological science will advance understanding of mental processes and behaviors by enabling rigorous research and systematic examination of its generalizability

    Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data.

    Get PDF
    Trio-based whole-exome sequence (WES) data have established confident genetic diagnoses in ∼40% of previously undiagnosed individuals recruited to the Deciphering Developmental Disorders (DDD) study. Here we aim to use the breadth of phenotypic information recorded in DDD to augment diagnosis and disease variant discovery in probands. Median Euclidean distances (mEuD) were employed as a simple measure of similarity of quantitative phenotypic data within sets of ≥10 individuals with plausibly causative de novo mutations (DNM) in 28 different developmental disorder genes. 13/28 (46.4%) showed significant similarity for growth or developmental milestone metrics, 10/28 (35.7%) showed similarity in HPO term usage, and 12/28 (43%) showed no phenotypic similarity. Pairwise comparisons of individuals with high-impact inherited variants to the 32 individuals with causative DNM in ANKRD11 using only growth z-scores highlighted 5 likely causative inherited variants and two unrecognized DNM resulting in an 18% diagnostic uplift for this gene. Using an independent approach, naive Bayes classification of growth and developmental data produced reasonably discriminative models for the 24 DNM genes with sufficiently complete data. An unsupervised naive Bayes classification of 6,993 probands with WES data and sufficient phenotypic information defined 23 in silico syndromes (ISSs) and was used to test a "phenotype first" approach to the discovery of causative genotypes using WES variants strictly filtered on allele frequency, mutation consequence, and evidence of constraint in humans. This highlighted heterozygous de novo nonsynonymous variants in SPTBN2 as causative in three DDD probands

    Germline variation at 8q24 and prostate cancer risk in men of European ancestry

    Get PDF
    Chromosome 8q24 is a susceptibility locus for multiple cancers, including prostate cancer. Here we combine genetic data across the 8q24 susceptibility region from 71,535 prostate cancer cases and 52,935 controls of European ancestry to define the overall contribution of germline variation at 8q24 to prostate cancer risk. We identify 12 independent risk signals for prostate cancer (p < 4.28 × 10−15), including three risk variants that have yet to be reported. From a polygenic risk score (PRS) model, derived to assess the cumulative effect of risk variants at 8q24, men in the top 1% of the PRS have a 4-fold (95%CI = 3.62–4.40) greater risk compared to the population average. These 12 variants account for ~25% of what can be currently explained of the familial risk of prostate cancer by known genetic risk factors. These findings highlight the overwhelming contribution of germline variation at 8q24 on prostate cancer risk which has implications for population risk stratification

    Fine-mapping of prostate cancer susceptibility loci in a large meta-analysis identifies candidate causal variants

    Get PDF
    Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increase the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling. © 2018 The Author(s).Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increase the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling. © 2018 The Author(s).Peer reviewe

    Prevalence and architecture of de novo mutations in developmental disorders.

    Get PDF
    The genomes of individuals with severe, undiagnosed developmental disorders are enriched in damaging de novo mutations (DNMs) in developmentally important genes. Here we have sequenced the exomes of 4,293 families containing individuals with developmental disorders, and meta-analysed these data with data from another 3,287 individuals with similar disorders. We show that the most important factors influencing the diagnostic yield of DNMs are the sex of the affected individual, the relatedness of their parents, whether close relatives are affected and the parental ages. We identified 94 genes enriched in damaging DNMs, including 14 that previously lacked compelling evidence of involvement in developmental disorders. We have also characterized the phenotypic diversity among these disorders. We estimate that 42% of our cohort carry pathogenic DNMs in coding sequences; approximately half of these DNMs disrupt gene function and the remainder result in altered protein function. We estimate that developmental disorders caused by DNMs have an average prevalence of 1 in 213 to 1 in 448 births, depending on parental age. Given current global demographics, this equates to almost 400,000 children born per year

    To which world regions does the valence–dominance model of social perception apply?

    Get PDF
    Over the past 10 years, Oosterhof and Todorov’s valence–dominance model has emerged as the most prominent account of how people evaluate faces on social dimensions. In this model, two dimensions (valence and dominance) underpin social judgements of faces. Because this model has primarily been developed and tested in Western regions, it is unclear whether these findings apply to other regions. We addressed this question by replicating Oosterhof and Todorov’s methodology across 11 world regions, 41 countries and 11,570 participants. When we used Oosterhof and Todorov’s original analysis strategy, the valence–dominance model generalized across regions. When we used an alternative methodology to allow for correlated dimensions, we observed much less generalization. Collectively, these results suggest that, while the valence–dominance model generalizes very well across regions when dimensions are forced to be orthogonal, regional differences are revealed when we use different extraction methods and correlate and rotate the dimension reduction solution.C.L. was supported by the Vienna Science and Technology Fund (WWTF VRG13-007); L.M.D. was supported by ERC 647910 (KINSHIP); D.I.B. and N.I. received funding from CONICET, Argentina; L.K., F.K. and Á. Putz were supported by the European Social Fund (EFOP-3.6.1.-16-2016-00004; ‘Comprehensive Development for Implementing Smart Specialization Strategies at the University of Pécs’). K.U. and E. Vergauwe were supported by a grant from the Swiss National Science Foundation (PZ00P1_154911 to E. Vergauwe). T.G. is supported by the Social Sciences and Humanities Research Council of Canada (SSHRC). M.A.V. was supported by grants 2016-T1/SOC-1395 (Comunidad de Madrid) and PSI2017-85159-P (AEI/FEDER UE). K.B. was supported by a grant from the National Science Centre, Poland (number 2015/19/D/HS6/00641). J. Bonick and J.W.L. were supported by the Joep Lange Institute. G.B. was supported by the Slovak Research and Development Agency (APVV-17-0418). H.I.J. and E.S. were supported by a French National Research Agency ‘Investissements d’Avenir’ programme grant (ANR-15-IDEX-02). T.D.G. was supported by an Australian Government Research Training Program Scholarship. The Raipur Group is thankful to: (1) the University Grants Commission, New Delhi, India for the research grants received through its SAP-DRS (Phase-III) scheme sanctioned to the School of Studies in Life Science; and (2) the Center for Translational Chronobiology at the School of Studies in Life Science, PRSU, Raipur, India for providing logistical support. K. Ask was supported by a small grant from the Department of Psychology, University of Gothenburg. Y.Q. was supported by grants from the Beijing Natural Science Foundation (5184035) and CAS Key Laboratory of Behavioral Science, Institute of Psychology. N.A.C. was supported by the National Science Foundation Graduate Research Fellowship (R010138018). We acknowledge the following research assistants: J. Muriithi and J. Ngugi (United States International University Africa); E. Adamo, D. Cafaro, V. Ciambrone, F. Dolce and E. Tolomeo (Magna Græcia University of Catanzaro); E. De Stefano (University of Padova); S. A. Escobar Abadia (University of Lincoln); L. E. Grimstad (Norwegian School of Economics (NHH)); L. C. Zamora (Franklin and Marshall College); R. E. Liang and R. C. Lo (Universiti Tunku Abdul Rahman); A. Short and L. Allen (Massey University, New Zealand), A. Ateş, E. Güneş and S. Can Özdemir (Boğaziçi University); I. Pedersen and T. Roos (Åbo Akademi University); N. Paetz (Escuela de Comunicación Mónica Herrera); J. Green (University of Gothenburg); M. Krainz (University of Vienna, Austria); and B. Todorova (University of Vienna, Austria). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.https://www.nature.com/nathumbehav/am2023BiochemistryGeneticsMicrobiology and Plant Patholog
    corecore