74 research outputs found

    Community Genetics screening in a pandemic: solutions for pre-test education, informed consent, and specimen collection

    Full text link
    A Community Genetics carrier screening program for the Jewish community has operated on-site in high schools in Sydney (Australia) for 25 years. During 2020, in response to the COVID-19 pandemic, government-mandated social-distancing, ‘lock-down’ public health orders, and laboratory supply-chain shortages prevented the usual operation and delivery of the annual testing program. We describe development of three responses to overcome these challenges: (1) pivoting to online education sufficient to ensure informed consent for both genetic and genomic testing; (2) development of contactless telehealth with remote training and supervision for collecting genetic samples using buccal swabs; and (3) a novel patient and specimen identification ‘GeneTrustee’ protocol enabling fully identified clinical-grade specimens to be collected and DNA extracted by a research laboratory while maintaining full participant confidentiality and privacy. These telehealth strategies for education, consent, specimen collection and sample processing enabled uninterrupted delivery and operation of complex genetic testing and screening programs even amid pandemic restrictions. These tools remain available for future operation and can be adapted to other programs

    The Dynamics and Evolutionary Potential of Domain Loss and Emergence

    Get PDF
    The wealth of available genomic data presents an unrivaled opportunity to study the molecular basis of evolution. Studies on gene family expansions and site-dependent analyses have already helped establish important insights into how proteins facilitate adaptation. However, efforts to conduct full-scale cross-genomic comparisons between species are challenged by both growing amounts of data and the inherent difficulty in accurately inferring homology between deeply rooted species. Proteins, in comparison, evolve by means of domain rearrangements, a process more amenable to study given the strength of profile-based homology inference and the lower rates with which rearrangements occur. However, adapting to a constantly changing environment can require molecular modulations beyond reach of rearrangement alone. Here, we explore rates and functional implications of novel domain emergence in contrast to domain gain and loss in 20 arthropod species of the pancrustacean clade. Emerging domains are more likely disordered in structure and spread more rapidly within their genomes than established domains. Furthermore, although domain turnover occurs at lower rates than gene family turnover, we find strong evidence that the emergence of novel domains is foremost associated with environmental adaptation such as abiotic stress response. The results presented here illustrate the simplicity with which domain-based analyses can unravel key players of nature's adaptational machinery, complementing the classical site-based analyses of adaptation

    Whole genome sequencing for the genetic diagnosis of heterogenous dystonia phenotypes

    Get PDF
    Introduction: Dystonia is a clinically and genetically heterogeneous disorder and a genetic cause is often difficult to elucidate. This is the first study to use whole genome sequencing (WGS) to investigate dystonia in a large sample of affected individuals. Methods: WGS was performed on 111 probands with heterogenous dystonia phenotypes. We performed analysis for coding and non-coding variants, copy number variants (CNVs), and structural variants (SVs). We assessed for an association between dystonia and 10 known dystonia risk variants. Results: A genetic diagnosis was obtained for 11.7% (13/111) of individuals. We found that a genetic diagnosis was more likely in those with an earlier age at onset, younger age at testing, and a combined dystonia phenotype. We identified pathogenic/likely-pathogenic variants in ADCY5 (n = 1), ATM (n = 1), GNAL (n = 2), GLB1 (n = 1), KMT2B (n = 2), PRKN (n = 2), PRRT2 (n = 1), SGCE (n = 2), and THAP1 (n = 1). CNVs were detected in 3 individuals. We found an association between the known risk variant ARSG rs11655081 and dystonia (p = 0.003). Conclusion: A genetic diagnosis was found in 11.7% of individuals with dystonia. The diagnostic yield was higher in those with an earlier age of onset, younger age at testing, and a combined dystonia phenotype. WGS may be particularly relevant for dystonia given that it allows for the detection of CNVs, which accounted for 23% of the genetically diagnosed cases. © 2019 The Author

    Just how versatile are domains?

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Creating new protein domain arrangements is a frequent mechanism of evolutionary innovation. While some domains always form the same combinations, others form many different arrangements. This ability, which is often referred to as versatility or promiscuity of domains, its a random evolutionary model in which a domain's promiscuity is based on its relative frequency of domains.</p> <p>Results</p> <p>We show that there is a clear relationship across genomes between the promiscuity of a given domain and its frequency. However, the strength of this relationship differs for different domains. We thus redefine domain promiscuity by defining a new index, <it>DV I </it>("domain versatility index"), which eliminates the effect of domain frequency. We explore links between a domain's versatility, when unlinked from abundance, and its biological properties.</p> <p>Conclusion</p> <p>Our results indicate that domains occurring as single domain proteins and domains appearing frequently at protein termini have a higher <it>DV I</it>. This is consistent with previous observations that the evolution of domain re-arrangements is primarily driven by fusion of pre-existing arrangements and single domains as well as loss of domains at protein termini. Furthermore, we studied the link between domain age, defined as the first appearance of a domain in the species tree, and the <it>DV I</it>. Contrary to previous studies based on domain promiscuity, it seems as if the <it>DV I </it>is age independent. Finally, we find that contrary to previously reported findings, versatility is lower in Eukaryotes. In summary, our measure of domain versatility indicates that a random attachment process is sufficient to explain the observed distribution of domain arrangements and that several views on domain promiscuity need to be revised.</p

    Evolution of protein domain architectures

    Get PDF
    This chapter reviews current research on how protein domain architectures evolve. We begin by summarizing work on the phylogenetic distribution of proteins, as this will directly impact which domain architectures can be formed in different species. Studies relating domain family size to occurrence have shown that they generally follow power law distributions, both within genomes and larger evolutionary groups. These findings were subsequently extended to multi-domain architectures. Genome evolution models that have been suggested to explain the shape of these distributions are reviewed, as well as evidence for selective pressure to expand certain domain families more than others. Each domain has an intrinsic combinatorial propensity, and the effects of this have been studied using measures of domain versatility or promiscuity. Next, we study the principles of protein domain architecture evolution and how these have been inferred from distributions of extant domain arrangements. Following this, we review inferences of ancestral domain architecture and the conclusions concerning domain architecture evolution mechanisms that can be drawn from these. Finally, we examine whether all known cases of a given domain architecture can be assumed to have a single common origin (monophyly) or have evolved convergently (polyphyly). We end by a discussion of some available tools for computational analysis or exploitation of protein domain architectures and their evolution

    The automatic annotation of bacterial genomes

    Get PDF
    Salmonella is one of the most important pathogens of mankind and animals alike, causing several billion pounds worth of damage worldwide each year. We have sequenced, annotated and published 4 genomes of Salmonella of well-defined virulence in farm animals. This provides valuable measures of intraserovar diversity and opportunities to formally link genotypes to phenotypes in target animals. Specifically, we have examined pathway detrition and mutagenesis and linked this to host specificity of the serovars. With the advent of next generation sequencing there has been a boom in genomic sequence submission, and an onslaught of -omics data has ensued. Integrating these different data types is complex and there is little available to visualise this data in the context of its genome. We present GeneBook, a web-based tool that synchronously integrates disparate datasets, displaying a fully annotated genome, enriched with publicly available data and the user's private experiments. It is accessed through a user-friendly interface that allows scientists to interrogate genomic features across multiple, heterogeneous, experiments

    Sequence Similarity Network Reveals Common Ancestry of Multidomain Proteins

    Get PDF
    We address the problem of homology identification in complex multidomain families with varied domain architectures. The challenge is to distinguish sequence pairs that share common ancestry from pairs that share an inserted domain but are otherwise unrelated. This distinction is essential for accuracy in gene annotation, function prediction, and comparative genomics. There are two major obstacles to multidomain homology identification: lack of a formal definition and lack of curated benchmarks for evaluating the performance of new methods. We offer preliminary solutions to both problems: 1) an extension of the traditional model of homology to include domain insertions; and 2) a manually curated benchmark of well-studied families in mouse and human. We further present Neighborhood Correlation, a novel method that exploits the local structure of the sequence similarity network to identify homologs with great accuracy based on the observation that gene duplication and domain shuffling leave distinct patterns in the sequence similarity network. In a rigorous, empirical comparison using our curated data, Neighborhood Correlation outperforms sequence similarity, alignment length, and domain architecture comparison. Neighborhood Correlation is well suited for automated, genome-scale analyses. It is easy to compute, does not require explicit knowledge of domain architecture, and classifies both single and multidomain homologs with high accuracy. Homolog predictions obtained with our method, as well as our manually curated benchmark and a web-based visualization tool for exploratory analysis of the network neighborhood structure, are available at http://www.neighborhoodcorrelation.org. Our work represents a departure from the prevailing view that the concept of homology cannot be applied to genes that have undergone domain shuffling. In contrast to current approaches that either focus on the homology of individual domains or consider only families with identical domain architectures, we show that homology can be rationally defined for multidomain families with diverse architectures by considering the genomic context of the genes that encode them. Our study demonstrates the utility of mining network structure for evolutionary information, suggesting this is a fertile approach for investigating evolutionary processes in the post-genomic era
    corecore