69 research outputs found

    An integrated approach to the interpretation of Single Amino Acid Polymorphisms within the framework of CATH and Gene3D

    Get PDF
    Background The phenotypic effects of sequence variations in protein-coding regions come about primarily via their effects on the resulting structures, for example by disrupting active sites or affecting structural stability. In order better to understand the mechanisms behind known mutant phenotypes, and predict the effects of novel variations, biologists need tools to gauge the impacts of DNA mutations in terms of their structural manifestation. Although many mutations occur within domains whose structure has been solved, many more occur within genes whose protein products have not been structurally characterized.<p></p> Results Here we present 3DSim (3D Structural Implication of Mutations), a database and web application facilitating the localization and visualization of single amino acid polymorphisms (SAAPs) mapped to protein structures even where the structure of the protein of interest is unknown. The server displays information on 6514 point mutations, 4865 of them known to be associated with disease. These polymorphisms are drawn from SAAPdb, which aggregates data from various sources including dbSNP and several pathogenic mutation databases. While the SAAPdb interface displays mutations on known structures, 3DSim projects mutations onto known sequence domains in Gene3D. This resource contains sequences annotated with domains predicted to belong to structural families in the CATH database. Mappings between domain sequences in Gene3D and known structures in CATH are obtained using a MUSCLE alignment. 1210 three-dimensional structures corresponding to CATH structural domains are currently included in 3DSim; these domains are distributed across 396 CATH superfamilies, and provide a comprehensive overview of the distribution of mutations in structural space.<p></p> Conclusion The server is publicly available at http://3DSim.bioinfo.cnio.es/ webcite. In addition, the database containing the mapping between SAAPdb, Gene3D and CATH is available on request and most of the functionality is available through programmatic web service access.<p></p&gt

    Characterization of pathogenic germline mutations in human Protein Kinases

    Get PDF
    Background Protein Kinases are a superfamily of proteins involved in crucial cellular processes such as cell cycle regulation and signal transduction. Accordingly, they play an important role in cancer biology. To contribute to the study of the relation between kinases and disease we compared pathogenic mutations to neutral mutations as an extension to our previous analysis of cancer somatic mutations. First, we analyzed native and mutant proteins in terms of amino acid composition. Secondly, mutations were characterized according to their potential structural effects and finally, we assessed the location of the different classes of polymorphisms with respect to kinase-relevant positions in terms of subfamily specificity, conservation, accessibility and functional sites.<p></p> Results Pathogenic Protein Kinase mutations perturb essential aspects of protein function, including disruption of substrate binding and/or effector recognition at family-specific positions. Interestingly these mutations in Protein Kinases display a tendency to avoid structurally relevant positions, what represents a significant difference with respect to the average distribution of pathogenic mutations in other protein families.<p></p> Conclusions Disease-associated mutations display sound differences with respect to neutral mutations: several amino acids are specific of each mutation type, different structural properties characterize each class and the distribution of pathogenic mutations within the consensus structure of the Protein Kinase domain is substantially different to that for non-pathogenic mutations. This preferential distribution confirms previous observations about the functional and structural distribution of the controversial cancer driver and passenger somatic mutations and their use as a proxy for the study of the involvement of somatic mutations in cancer development.<p></p&gt

    A second generation human haplotype map of over 3.1 million SNPs

    Full text link
    We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r(2) of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r(2) of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/62863/1/nature06258.pd

    2.6 Mb YAC contig of the human X inactivation center region in Xq13: physical linkage of the RPS4X, PHKA1, XIST and DXS128E genes.

    No full text
    X chromosome inactivation is a mechanism of dosage compensation that regulates the expression of mammalian X-linked genes between XY males and XX females. This phenomenon is cis-acting, clonally heritable, and requires the presence of an X inactivation center (XIC). In our attempts to characterize this phenomenon, we have focused on the physical organization of the human XIC localized to Xq13. From previous studies, we had determined that the candidate XIC interval contained two loci (DXS128 and XIST) and was bound by the breakpoints of two structurally abnormal inactivated X chromosomes, a t(X;14) and an idic(Xp). Here we present a refined mapping of the XIC-containing region using the breakpoint of a late replicating rearranged X (rea(X)), and the initial characterization of a set of 40 yeast artificial chromosomes (YACs) derived from the XIC-containing region. These YACs form a 2.6 Mb contig which completely covers the XIC, and physically links the RPS4X, PHKA1, XIST, and DXS128E genes, as well as a laminin receptor pseudogene (LAMRP4). Furthermore, we have determined the relative orientations of these four genes, and have derived a restriction map of the region using the rare cutter enzymes BssHII, EagI, MluI, NruI, SalI, SfiI, SstII (or SacII), and NotI. We have identified at least 9 CpG-rich islands within this region, and have discovered a large (approximately 125 kb) inverted duplication proximal to the XIC based on symmetrical restriction patterns and homologous probes. We estimate the maximum size of the XIC-containing interval to be between 680 kb and 1200 kb, based on the localization of the breakpoints of the rearranged X chromosomes mentioned above. This lays the groundwork for the further characterization of the XIC region and the isolation of other expressed sequences therefrom

    2.6 Mb YAC contig of the human X inactivation center region in Xq13: physical linkage of the RPS4X, PHKA1, XIST and DXS128E genes.

    No full text
    X chromosome inactivation is a mechanism of dosage compensation that regulates the expression of mammalian X-linked genes between XY males and XX females. This phenomenon is cis-acting, clonally heritable, and requires the presence of an X inactivation center (XIC). In our attempts to characterize this phenomenon, we have focused on the physical organization of the human XIC localized to Xq13. From previous studies, we had determined that the candidate XIC interval contained two loci (DXS128 and XIST) and was bound by the breakpoints of two structurally abnormal inactivated X chromosomes, a t(X;14) and an idic(Xp). Here we present a refined mapping of the XIC-containing region using the breakpoint of a late replicating rearranged X (rea(X)), and the initial characterization of a set of 40 yeast artificial chromosomes (YACs) derived from the XIC-containing region. These YACs form a 2.6 Mb contig which completely covers the XIC, and physically links the RPS4X, PHKA1, XIST, and DXS128E genes, as well as a laminin receptor pseudogene (LAMRP4). Furthermore, we have determined the relative orientations of these four genes, and have derived a restriction map of the region using the rare cutter enzymes BssHII, EagI, MluI, NruI, SalI, SfiI, SstII (or SacII), and NotI. We have identified at least 9 CpG-rich islands within this region, and have discovered a large (approximately 125 kb) inverted duplication proximal to the XIC based on symmetrical restriction patterns and homologous probes. We estimate the maximum size of the XIC-containing interval to be between 680 kb and 1200 kb, based on the localization of the breakpoints of the rearranged X chromosomes mentioned above. This lays the groundwork for the further characterization of the XIC region and the isolation of other expressed sequences therefrom
    corecore