137 research outputs found

    Using self-organizing maps to investigate environmental factors regulating colony size and breeding success of the White Stork (Ciconia ciconia)

    Get PDF
    We studied variations in the size of breeding colonies and in breeding performance of White Storks Ciconia ciconia in 2006–2008 in north-east Algeria. Each colony site was characterized using 12 environmental variables describing the physical environment, land-cover categories, and human activities, and by three demographic parameters: the number of breeding pairs, the number of pairs with chicks, and the number of fledged chicks per pair. Generalized linear mixed models and the self-organizing map algorithm (SOM, neural network) were used to investigate effects of biotic, abiotic, and anthropogenic factors on demographic parameters and on their relationships. Numbers of breeding pairs and of pairs with chicks were affected by the same environmental factors, mainly anthropogenic, which differed from those affecting the number of fledged chicks per pair. Numbers of fledged chicks per pair was not affected by colony size or by the number of nests with chicks. The categorization of the environmental variables into natural and anthropogenic, in connection with demographic parameters, was relevant to detect factors explaining variation in colony size and breeding parameters. The SOM proved a relevant tool to help determine actual dynamics in White Stork colonies, and thus to support effective conservation decisions at a regional scale

    Predicting the impact of Lynch syndrome-causing missense mutations from structural calculations

    Get PDF
    Accurate methods to assess the pathogenicity of mutations are needed to fully leverage the possibilities of genome sequencing in diagnosis. Current data-driven and bioinformatics approaches are, however, limited by the large number of new variations found in each newly sequenced genome, and often do not provide direct mechanistic insight. Here we demonstrate, for the first time, that saturation mutagenesis, biophysical modeling and co-variation analysis, performed in silico, can predict the abundance, metabolic stability, and function of proteins inside living cells. As a model system, we selected the human mismatch repair protein, MSH2, where missense variants are known to cause the hereditary cancer predisposition disease, known as Lynch syndrome. We show that the majority of disease-causing MSH2 mutations give rise to folding defects and proteasome-dependent degradation rather than inherent loss of function, and accordingly our in silico modeling data accurately identifies disease-causing mutations and outperforms the traditionally used genetic disease predictors. Thus, in conclusion, in silico biophysical modeling should be considered for making genotype-phenotype predictions and for diagnosis of Lynch syndrome, and perhaps other hereditary diseases

    Analysis of exome data for 4293 trios suggests GPI-anchor biogenesis defects are a rare cause of developmental disorders.

    Get PDF
    Over 150 different proteins attach to the plasma membrane using glycosylphosphatidylinositol (GPI) anchors. Mutations in 18 genes that encode components of GPI-anchor biogenesis result in a phenotypic spectrum that includes learning disability, epilepsy, microcephaly, congenital malformations and mild dysmorphic features. To determine the incidence of GPI-anchor defects, we analysed the exome data from 4293 parent-child trios recruited to the Deciphering Developmental Disorders (DDD) study. All probands recruited had a neurodevelopmental disorder. We searched for variants in 31 genes linked to GPI-anchor biogenesis and detected rare biallelic variants in PGAP3, PIGN, PIGT (n=2), PIGO and PIGL, providing a likely diagnosis for six families. In five families, the variants were in a compound heterozygous configuration while in a consanguineous Afghani kindred, a homozygous c.709G>C; p.(E237Q) variant in PIGT was identified within 10-12 Mb of autozygosity. Validation and segregation analysis was performed using Sanger sequencing. Across the six families, five siblings were available for testing and in all cases variants co-segregated consistent with them being causative. In four families, abnormal alkaline phosphatase results were observed in the direction expected. FACS analysis of knockout HEK293 cells that had been transfected with wild-type or mutant cDNA constructs demonstrated that the variants in PIGN, PIGT and PIGO all led to reduced activity. Splicing assays, performed using leucocyte RNA, showed that a c.336-2A>G variant in PIGL resulted in exon skipping and p.D113fs*2. Our results strengthen recently reported disease associations, suggest that defective GPI-anchor biogenesis may explain ~0.15% of individuals with developmental disorders and highlight the benefits of data sharing

    The Medical Genome Reference Bank contains whole genome and phenotype data of 2570 healthy elderly

    Get PDF
    Population health research is increasingly focused on the genetic determinants of healthy ageing, but there is no public resource of whole genome sequences and phenotype data from healthy elderly individuals. Here we describe the first release of the Medical Genome Reference Bank (MGRB), comprising whole genome sequence and phenotype of 2570 elderly Australians depleted for cancer, cardiovascular disease, and dementia. We analyse the MGRB for single-nucleotide, indel and structural variation in the nuclear and mitochondrial genomes. MGRB individuals have fewer disease-associated common and rare germline variants, relative to both cancer cases and the gnomAD and UK Biobank cohorts, consistent with risk depletion. Age-related somatic changes are correlated with grip strength in men, suggesting blood-derived whole genomes may also provide a biologic measure of age-related functional deterioration. The MGRB provides a broadly applicable reference cohort for clinical genetics and genomic association studies, and for understanding the genetics of healthy ageing

    Primary brain calcification: an international study reporting novel variants and associated phenotypes.

    Get PDF
    Primary familial brain calcification (PFBC) is a rare cerebral microvascular calcifying disorder with a wide spectrum of motor, cognitive, and neuropsychiatric symptoms. It is typically inherited as an autosomal-dominant trait with four causative genes identified so far: SLC20A2, PDGFRB, PDGFB, and XPR1. Our study aimed at screening the coding regions of these genes in a series of 177 unrelated probands that fulfilled the diagnostic criteria for primary brain calcification regardless of their family history. Sequence variants were classified as pathogenic, likely pathogenic, or of uncertain significance (VUS), based on the ACMG-AMP recommendations. We identified 45 probands (25.4%) carrying either pathogenic or likely pathogenic variants (n = 34, 19.2%) or VUS (n = 11, 6.2%). SLC20A2 provided the highest contribution (16.9%), followed by XPR1 and PDGFB (3.4% each), and PDGFRB (1.7%). A total of 81.5% of carriers were symptomatic and the most recurrent symptoms were parkinsonism, cognitive impairment, and psychiatric disturbances (52.3%, 40.9%, and 38.6% of symptomatic individuals, respectively), with a wide range of age at onset (from childhood to 81 years). While the pathogenic and likely pathogenic variants identified in this study can be used for genetic counseling, the VUS will require additional evidence, such as recurrence in unrelated patients, in order to be classified as pathogenic

    A cre-inducible DUX4 transgenic mouse model for investigating facioscapulohumeral muscular dystrophy

    Get PDF
    The Double homeobox 4 (DUX4) gene is an important regulator of early human development and its aberrant expression is causal for facioscapulohumeral muscular dystrophy (FSHD). The DUX4-full length (DUX4-fl) mRNA splice isoform encodes a transcriptional activator; however, DUX4 and its unique DNA binding preferences are specific to old-world primates. Regardless, the somatic cytotoxicity caused by DUX4 expression is conserved when expressed in cells and animals ranging from fly to mouse. Thus, viable animal models based on DUX4-fl expression have been difficult to generate due in large part to overt developmental toxicity of low DUX4-fl expression from leaky transgenes. We have overcome this obstacle and here we report the generation and initial characterization of a line of conditional floxed DUX4-fl transgenic mice, FLExDUX4, that is viable and fertile. In the absence of cre, these mice express a very low level of DUX4-fl mRNA from the transgene, resulting in mild phenotypes. However, when crossed with appropriate cre-driver lines of mice, the double transgenic offspring readily express DUX4-fl mRNA, protein, and target genes with the spatiotemporal pattern of nuclear cre expression dictated by the chosen system. When cre is expressed from the ACTA1 skeletal muscle-specific promoter, the double transgenic animals exhibit a developmental myopathy. When crossed with tamoxifen-inducible cre lines, DUX4-mediated pathology can be induced in adult animals. Thus, the appearance and progression of pathology can be controlled to provide readily screenable phenotypes useful for assessing therapeutic approaches targeting DUX4-fl mRNA and protein. Overall, the FLExDUX4 line of mice is quite versatile and will allow new investigations into mechanisms of DUX4-mediated pathophysiology as well as much-needed pre-clinical testing of DUX4-targeted FSHD interventions in vivo

    Next-gen sequencing identifies non-coding variation disrupting miRNA-binding sites in neurological disorders

    Get PDF
    Understanding the genetic factors underlying neurodevelopmental and neuropsychiatric disorders is a major challenge given their prevalence and potential severity for quality of life. While large-scale genomic screens have made major advances in this area, for many disorders the genetic underpinnings are complex and poorly understood. To date the field has focused predominantly on protein coding variation, but given the importance of tightly controlled gene expression for normal brain development and disorder, variation that affects non-coding regulatory regions of the genome is likely to play an important role in these phenotypes. Herein we show the importance of 3 prime untranslated region (3'UTR) non-coding regulatory variants across neurodevelopmental and neuropsychiatric disorders. We devised a pipeline for identifying and functionally validating putatively pathogenic variants from next generation sequencing (NGS) data. We applied this pipeline to a cohort of children with severe specific language impairment (SLI) and identified a functional, SLI-associated variant affecting gene regulation in cells and post-mortem human brain. This variant and the affected gene (ARHGEF39) represent new putative risk factors for SLI. Furthermore, we identified 3'UTR regulatory variants across autism, schizophrenia and bipolar disorder NGS cohorts demonstrating their impact on neurodevelopmental and neuropsychiatric disorders. Our findings show the importance of investigating non-coding regulatory variants when determining risk factors contributing to neurodevelopmental and neuropsychiatric disorders. In the future, integration of such regulatory variation with protein coding changes will be essential for uncovering the genetic causes of complex neurological disorders and the fundamental mechanisms underlying health and disease

    Mutations in the histone methyltransferase gene KMT2B cause complex early-onset dystonia.

    Get PDF
    Histone lysine methylation, mediated by mixed-lineage leukemia (MLL) proteins, is now known to be critical in the regulation of gene expression, genomic stability, cell cycle and nuclear architecture. Despite MLL proteins being postulated as essential for normal development, little is known about the specific functions of the different MLL lysine methyltransferases. Here we report heterozygous variants in the gene KMT2B (also known as MLL4) in 27 unrelated individuals with a complex progressive childhood-onset dystonia, often associated with a typical facial appearance and characteristic brain magnetic resonance imaging findings. Over time, the majority of affected individuals developed prominent cervical, cranial and laryngeal dystonia. Marked clinical benefit, including the restoration of independent ambulation in some cases, was observed following deep brain stimulation (DBS). These findings highlight a clinically recognizable and potentially treatable form of genetic dystonia, demonstrating the crucial role of KMT2B in the physiological control of voluntary movement.Funding for the project was provided by the Wellcome Trust for UK10K (WT091310) and DDD Study. The DDD study presents independent research commissioned by the Health Innovation Challenge Fund [grant number HICF-1009-003] - see www.ddduk.org/access.html for full acknowledgement. This work was supported in part by the Intramural Research Program of the National Human Genome Research Institute and the Common Fund, NIH Office of the Director. This work was supported in part by the German Ministry of Research and Education (grant nos. 01GS08160 and 01GS08167; German Mental Retardation Network) as part of the National Genome Research Network to A.R. and D.W. and by the Deutsche Forschungsgemeinschaft (AB393/2-2) to A.R. Brain expression data was provided by the UK Human Brain Expression Consortium (UKBEC), which comprises John A. Hardy, Mina Ryten, Michael Weale, Daniah Trabzuni, Adaikalavan Ramasamy, Colin Smith and Robert Walker, affiliated with UCL Institute of Neurology (J.H., M.R., D.T.), King’s College London (M.R., M.W., A.R.) and the University of Edinburgh (C.S., R.W.)

    Improving genetic diagnosis in Mendelian disease with transcriptome sequencing

    Get PDF
    Exome and whole-genome sequencing are becoming increasingly routine approaches in Mendelian disease diagnosis. Despite their success, the current diagnostic rate for genomic analyses across a variety of rare diseases is approximately 25 to 50%. We explore the utility of transcriptome sequencing [RNA sequencing (RNA-seq)] as a complementary diagnostic tool in a cohort of 50 patients with genetically undiagnosed rare muscle disorders. We describe an integrated approach to analyze patient muscle RNA-seq, leveraging an analysis framework focused on the detection of transcript-level changes that are unique to the patient compared to more than 180 control skeletal muscle samples. We demonstrate the power of RNA-seq to validate candidate splice-disrupting mutations and to identify splice-altering variants in both exonic and deep intronic regions, yielding an overall diagnosis rate of 35%. We also report the discovery of a highly recurrent de novo intronic mutation in COL6A1 that results in a dominantly acting splice-gain event, disrupting the critical glycine repeat motif of the triple helical domain. We identify this pathogenic variant in a total of 27 genetically unsolved patients in an external collagen VI–like dystrophy cohort, thus explaining approximately 25% of patients clinically suggestive of having collagen VI dystrophy in whom prior genetic analysis is negative. Overall, this study represents a large systematic application of transcriptome sequencing to rare disease diagnosis and highlights its utility for the detection and interpretation of variants missed by current standard diagnostic approaches

    Interpreting short tandem repeat variations in humans using mutational constraint

    Get PDF
    Identifying regions of the genome that are depleted of mutations can reveal potentially deleterious variants. Short tandem repeats (STRs), also known as microsatellites, are among the largest contributors of de novo mutations in humans. However, per-locus studies of STR mutations have been limited to highly ascertained panels of several dozen loci. Here, we harnessed bioinformatics tools and a novel analytical framework to estimate mutation parameters for each STR in the human genome by correlating STR genotypes with local sequence heterozygosity. We applied our method to obtain robust estimates of the impact of local sequence features on mutation parameters and used this to create a framework for measuring constraint at STRs by comparing observed vs. expected mutation rates. Constraint scores identified known pathogenic variants with early onset effects. Our metric will provide a valuable tool for prioritizing pathogenic STRs in medical genetics studies
    corecore