1,402 research outputs found

    Challenges and directions: an analysis of Genetic Analysis Workshop 17 data by collapsing rare variants within family data

    Get PDF
    Recent studies suggest that the traditional case-control study design does not have sufficient power to discover rare risk variants. Two different methods—collapsing and family data—are suggested as alternatives for discovering these rare variants. Compared with common variants, rare variants have unique characteristics. In this paper, we assess the distribution of rare variants in family data. We notice that a large number of rare variants exist only in one or two families and that the association result is largely shaped by those families. Therefore we explore the possibility of integrating both the collapsing method and the family data method. This combinational approach offers a potential power boost for certain causal genes, including VEGFA, VEGFC, SIRT1, SREBF1, PIK3R3, VLDLR, PLAT, and FLT4, and thus deserves further investigation

    Incorporating predicted functions of nonsynonymous variants into gene-based analysis of exome sequencing data: a comparative study

    Get PDF
    Next-generation sequencing has opened up new avenues for the genetic study of complex traits. However, because of the small number of observations for any given rare allele and high sequencing error, it is a challenge to identify functional rare variants associated with the phenotype of interest. Recent research shows that grouping variants by gene and incorporating computationally predicted functions of variants may provide higher statistical power. On the other hand, many algorithms are available for predicting the damaging effects of nonsynonymous variants. Here, we use the simulated mini-exome data of Genetic Analysis Workshop 17 to study and compare the effects of incorporating the functional predictions of single-nucleotide polymorphisms using two popular algorithms, SIFT and PolyPhen-2, into a gene-based association test. We also propose a simple mixture model that can effectively combine test results based on different functional prediction algorithms

    Establishing the precise evolutionary history of a gene improves prediction of disease-causing missense mutations

    Get PDF
    PURPOSE: Predicting the phenotypic effects of mutations has become an important application in clinical genetic diagnostics. Computational tools evaluate the behavior of the variant over evolutionary time and assume that variations seen during the course of evolution are probably benign in humans. However, current tools do not take into account orthologous/paralogous relationships. Paralogs have dramatically different roles in Mendelian diseases. For example, whereas inactivating mutations in the NPC1 gene cause the neurodegenerative disorder Niemann-Pick C, inactivating mutations in its paralog NPC1L1 are not disease-causing and, moreover, are implicated in protection from coronary heart disease. METHODS: We identified major events in NPC1 evolution and revealed and compared orthologs and paralogs of the human NPC1 gene through phylogenetic and protein sequence analyses. We predicted whether an amino acid substitution affects protein function by reducing the organism’s fitness. RESULTS: Removing the paralogs and distant homologs improved the overall performance of categorizing disease-causing and benign amino acid substitutions. CONCLUSION: The results show that a thorough evolutionary analysis followed by identification of orthologs improves the accuracy in predicting disease-causing missense mutations. We anticipate that this approach will be used as a reference in the interpretation of variants in other genetic diseases as well. Genet Med 18 10, 1029–1036

    The Phyre2 web portal for protein modeling, prediction and analysis

    Get PDF
    Phyre2 is a suite of tools available on the web to predict and analyze protein structure, function and mutations. The focus of Phyre2 is to provide biologists with a simple and intuitive interface to state-of-the-art protein bioinformatics tools. Phyre2 replaces Phyre, the original version of the server for which we previously published a paper in Nature Protocols. In this updated protocol, we describe Phyre2, which uses advanced remote homology detection methods to build 3D models, predict ligand binding sites and analyze the effect of amino acid variants (e.g., nonsynonymous SNPs (nsSNPs)) for a user's protein sequence. Users are guided through results by a simple interface at a level of detail they determine. This protocol will guide users from submitting a protein sequence to interpreting the secondary and tertiary structure of their models, their domain composition and model quality. A range of additional available tools is described to find a protein structure in a genome, to submit large number of sequences at once and to automatically run weekly searches for proteins that are difficult to model. The server is available at http://www.sbg.bio.ic.ac.uk/phyre2. A typical structure prediction will be returned between 30 min and 2 h after submission

    Identification of functional genetic variation in exome sequence analysis

    Get PDF
    Recent technological advances have allowed us to study individual genomes at a base-pair resolution and have demonstrated that the average exome harbors more than 15,000 genetic variants. However, our ability to understand the biological significance of the identified variants and to connect these observed variants with phenotypes is limited. The first step in this process is to identify genetic variation that is likely to result in changes to protein structure and function, because detailed studies, either population based or functional, for each of the identified variants are not practicable. Therefore algorithms that yield valid predictions of a variant’s functional significance are needed. Over the past decade, several programs have been developed to predict the probability that an observed sequence variant will have a deleterious effect on protein function. These algorithms range from empirical programs that classify using known biochemical properties to statistical algorithms trained using a variety of data sources, including sequence conservation data, biochemical properties, and functional data. Using data from the pilot3 study of the 1000 Genomes Project available through Genetic Analysis Workshop 17, we compared the results of four programs (SIFT, PolyPhen, MAPP, and VarioWatch) used to predict the functional relevance of variants in 101 genes. Analysis was conducted without knowledge of the simulation model. Agreement between programs was modest ranging from 59.4% to 71.4% and only 3.5% of variants were classified as deleterious and 10.9% as tolerated across all four programs

    Deriving a mutation index of carcinogenicity using protein structure and protein interfaces

    Get PDF
    With the advent of Next Generation Sequencing the identification of mutations in the genomes of healthy and diseased tissues has become commonplace. While much progress has been made to elucidate the aetiology of disease processes in cancer, the contributions to disease that many individual mutations make remain to be characterised and their downstream consequences on cancer phenotypes remain to be understood. Missense mutations commonly occur in cancers and their consequences remain challenging to predict. However, this knowledge is becoming more vital, for both assessing disease progression and for stratifying drug treatment regimes. Coupled with structural data, comprehensive genomic databases of mutations such as the 1000 Genomes project and COSMIC give an opportunity to investigate general principles of how cancer mutations disrupt proteins and their interactions at the molecular and network level. We describe a comprehensive comparison of cancer and neutral missense mutations; by combining features derived from structural and interface properties we have developed a carcinogenicity predictor, InCa (Index of Carcinogenicity). Upon comparison with other methods, we observe that InCa can predict mutations that might not be detected by other methods. We also discuss general limitations shared by all predictors that attempt to predict driver mutations and discuss how this could impact high-throughput predictions. A web interface to a server implementation is publicly available at http://inca.icr.ac.uk/

    Gene Expression Pattern in Transmitochondrial Cytoplasmic Hybrid Cells Harboring Type 2 Diabetes-Associated Mitochondrial DNA Haplogroups

    Get PDF
    Decreased mitochondrial function plays a pivotal role in the pathogenesis of type 2 diabetes mellitus (T2DM). Recently, it was reported that mitochondrial DNA (mtDNA) haplogroups confer genetic susceptibility to T2DM in Koreans and Japanese. Particularly, mtDNA haplogroup N9a is associated with a decreased risk of T2DM, whereas haplogroups D5 and F are associated with an increased risk. To examine functional consequences of these haplogroups without being confounded by the heterogeneous nuclear genomic backgrounds of different subjects, we constructed transmitochondrial cytoplasmic hybrid (cybrid) cells harboring each of the three haplogroups (N9a, D5, and F) in a background of a shared nuclear genome. We compared the functional consequences of the three haplogroups using cell-based assays and gene expression microarrays. Cell-based assays did not detect differences in mitochondrial functions among the haplogroups in terms of ATP generation, reactive oxygen species production, mitochondrial membrane potential, and cellular dehydrogenase activity. However, differential expression and clustering analyses of microarray data revealed that the three haplogroups exhibit a distinctive nuclear gene expression pattern that correlates with their susceptibility to T2DM. Pathway analysis of microarray data identified several differentially regulated metabolic pathways. Notably, compared to the T2DM-resistant haplogroup N9a, the T2DM-susceptible haplogroup F showed down-regulation of oxidative phosphorylation and up-regulation of glycolysis. These results suggest that variations in mtDNA can affect the expression of nuclear genes regulating mitochondrial functions or cellular energetics. Given that impaired mitochondrial function caused by T2DM-associated mtDNA haplogroups is compensated by the nuclear genome, we speculate that defective nuclear compensation, under certain circumstances, might lead to the development of T2DM

    Novel homozygous missense mutation in GAN associated with Charcot-Marie-Tooth disease type 2 in a large consanguineous family from Israel.

    Get PDF
    BACKGROUND: CMT-2 is a clinically and genetically heterogeneous group of peripheral axonal neuropathies characterized by slowly progressive weakness and atrophy of distal limb muscles resulting from length-dependent motor and sensory neurodegeneration. Classical giant axonal neuropathy (GAN) is an autosomal recessively inherited progressive neurodegenerative disorder of the peripheral and central nervous systems, typically diagnosed in early childhood and resulting in death by the end of the third decade. Distinctive phenotypic features are the presence of "kinky" hair and long eyelashes. The genetic basis of the disease has been well established, with over 40 associated mutations identified in the gene GAN, encoding the BTB-KELCH protein gigaxonin, involved in intermediate filament regulation. METHODS: An Illumina Human CytoSNP-12 array followed by whole exome sequence analysis was used to identify the disease associated gene mutation in a large consanguineous family diagnosed with Charcot-Marie-Tooth disease type 2 (CMT-2) from which all but one affected member had straight hair. RESULTS: Here we report the identification of a novel GAN missense mutation underlying the CMT-2 phenotype observed in this family. Although milder forms of GAN, with and without the presence of kinky hair have been reported previously, a phenotype distinct from that was investigated in this study. All family members lacked common features of GAN, including ataxia, nystagmus, intellectual disability, seizures, and central nervous system involvement. CONCLUSIONS: Our findings broaden the spectrum of phenotypes associated with GAN mutations and emphasize a need to proceed with caution when providing families with diagnostic or prognostic information based on either clinical or genetic findings alone
    corecore