72 research outputs found

    Enhanced genetic maps from family-based disease studies: population-specific comparisons

    Get PDF
    Abstract Background Accurate genetic maps are required for successful and efficient linkage mapping of disease genes. However, most available genome-wide genetic maps were built using only small collections of pedigrees, and therefore have large sampling errors. A large set of genetic studies genotyped by the NHLBI Mammalian Genotyping Service (MGS) provide appropriate data for generating more accurate maps. Results We collected a large sample of uncleaned genotype data for 461 markers generated by the MGS using the Weber screening sets 9 and 10. This collection includes genotypes for over 4,400 pedigrees containing over 17,000 genotyped individuals from different populations. We identified and cleaned numerous relationship and genotyping errors, as well as verified the marker orders. We used this dataset to test for population-specific genetic maps, and to re-estimate the genetic map distances with greater precision; standard errors for all intervals are provided. The map-interval sizes from the European (or European descent), Chinese, and Hispanic samples are in quite good agreement with each other. We found one map interval on chromosome 8p with a statistically significant size difference between the European and Chinese samples, and several map intervals with significant size differences between the African American and Chinese samples. When comparing Palauan with European samples, a statistically significant difference was detected at the telomeric region of chromosome 11p. Several significant differences were also identified between populations in chromosomal and genome lengths. Conclusions Our new population-specific screening set maps can be used to improve the accuracy of disease-mapping studies. As a result of the large sample size, the average length of the 95% confidence interval (CI) for a 10 cM map interval is only 2.4 cM, which is considerably smaller than on previously published maps.http://deepblue.lib.umich.edu/bitstream/2027.42/112826/1/12881_2010_Article_748.pd

    Whole-Genome Sequencing of a Single Proband Together with Linkage Analysis Identifies a Mendelian Disease Gene

    Get PDF
    Although more than 2,400 genes have been shown to contain variants that cause Mendelian disease, there are still several thousand such diseases yet to be molecularly defined. The ability of new whole-genome sequencing technologies to rapidly indentify most of the genetic variants in any given genome opens an exciting opportunity to identify these disease genes. Here we sequenced the whole genome of a single patient with the dominant Mendelian disease, metachondromatosis (OMIM 156250), and used partial linkage data from her small family to focus our search for the responsible variant. In the proband, we identified an 11 bp deletion in exon four of PTPN11, which alters frame, results in premature translation termination, and co-segregates with the phenotype. In a second metachondromatosis family, we confirmed our result by identifying a nonsense mutation in exon 4 of PTPN11 that also co-segregates with the phenotype. Sequencing PTPN11 exon 4 in 469 controls showed no such protein truncating variants, supporting the pathogenicity of these two mutations. This combination of a new technology and a classical genetic approach provides a powerful strategy to discover the genes responsible for unexplained Mendelian disorders

    Personalized medicine: new genomics, old lessons

    Get PDF
    Personalized medicine uses traditional, as well as emerging concepts of the genetic and environmental basis of disease to individualize prevention, diagnosis and treatment. Personalized genomics plays a vital, but not exclusive role in this evolving model of personalized medicine. The distinctions between genetic and genomic medicine are more quantitative than qualitative. Personalized genomics builds on principles established by the integration of genetics into medical practice. Principles shared by genetic and genomic aspects of medicine, include the use of variants as markers for diagnosis, prognosis, prevention, as well as targets for treatment, the use of clinically validated variants that may not be functionally characterized, the segregation of these variants in non-Mendelian as well as Mendelian patterns, the role of gene–environment interactions, the dependence on evidence for clinical utility, the critical translational role of behavioral science, and common ethical considerations. During the current period of transition from investigation to practice, consumers should be protected from harms of premature translation of research findings, while encouraging the innovative and cost-effective application of those genomic discoveries that improve personalized medical care

    Identifying micro-inversions using high-throughput sequencing reads

    Full text link
    Background: The identification of inversions of DNA segments shorter than read length (e.g., 100 bp), defined as micro-inversions (MIs), remains challenging for next-generation sequencing reads. It is acknowledged that MIs are important genomic variation and may play roles in causing genetic disease. However, current alignment methods are generally insensitive to detect MIs. Here we develop a novel tool, MID (Micro-Inversion Detector), to identify MIs in human genomes using next-generation sequencing reads. Results: The algorithm of MID is designed based on a dynamic programming path-finding approach. What makes MID different from other variant detection tools is that MID can handle small MIs and multiple breakpoints within an unmapped read. Moreover, MID improves reliability in low coverage data by integrating multiple samples. Our evaluation demonstrated that MID outperforms Gustaf, which can currently detect inversions from 30 bp to 500 bp. Conclusions: To our knowledge, MID is the first method that can efficiently and reliably identify MIs from unmapped short next-generation sequencing reads. MID is reliable on low coverage data, which is suitable for large-scale projects such as the 1000 Genomes Project (1KGP). MID identified previously unknown MIs from the 1KGP that overlap with genes and regulatory elements in the human genome. We also identified MIs in cancer cell lines from Cancer Cell Line Encyclopedia (CCLE). Therefore our tool is expected to be useful to improve the study of MIs as a type of genetic variant in the human genome. The source code can be downloaded from: http://cqb.pku.edu.cn/ZhuLab/MID.NCI NIH HHS [CA182360, R33 CA182360]; NHGRI NIH HHS [HG007352, R01 HG007352]SCI(E)PubMedARTICLESuppl 141

    Genomewide linkage scan of schizophrenia in a large multicenter pedigree sample using single nucleotide polymorphisms

    Get PDF
    A genomewide linkage scan was carried out in eight clinical samples of informative schizophrenia families. After all quality control checks, the analysis of 707 European-ancestry families included 1615 affected and 1602 unaffected genotyped individuals, and the analysis of all 807 families included 1900 affected and 1839 unaffected individuals. Multipoint linkage analysis with correction for marker–marker linkage disequilibrium was carried out with 5861 single nucleotide polymorphisms (SNPs; Illumina version 4.0 linkage map). Suggestive evidence for linkage (European families) was observed on chromosomes 8p21, 8q24.1, 9q34 and 12q24.1 in nonparametric and/or parametric analyses. In a logistic regression allele-sharing analysis of linkage allowing for intersite heterogeneity, genomewide significant evidence for linkage was observed on chromosome 10p12. Significant heterogeneity was also observed on chromosome 22q11.1. Evidence for linkage across family sets and analyses was most consistent on chromosome 8p21, with a one-LOD support interval that does not include the candidate gene NRG1, suggesting that one or more other susceptibility loci might exist in the region. In this era of genomewide association and deep resequencing studies, consensus linkage regions deserve continued attention, given that linkage signals can be produced by many types of genomic variation, including any combination of multiple common or rare SNPs or copy number variants in a region

    DNA variants at the LPL gene locus associate with angiographically defined severity of atherosclerosis and serum lipoprotein levels in a Welsh population.

    No full text
    Coronary artery disease (CAD) patients (n = 235), comprising minimal (CAD-, n = 124) and severe (CAD+, n = 111) CAD, were recruited on the basis of their angiographic scores. Male control subjects (n = 123) were selected randomly from the Caerphilly Heart Study cohort. Subjects were genotyped for the Ser447-Ter mutation and HindIII/Pvu II restriction fragment length polymorphisms of the lipoprotein lipase gene and investigated for associations with severity and development of CAD and lipid and lipoprotein levels. The Ser447-Ter mutation showed no significant associations with CAD or dyslipidemia but was related to favorable lipid and lipoprotein profiles. The H2H2 genotype (P < .05) and H2 allele (P = .05) were significantly more frequent in CAD+ versus CAD- and control subjects versus CAD-. H2H2 subjects, among the entire male cohort, had significantly higher levels of apolipoprotein B (P = .0002), total cholesterol (P < .004), and triglycerides (P < .04) than alternative genotypes. P2P2 associated with significantly lower high-density lipoprotein cholesterol levels (P < .01). The H2 allele had most significant associations with raised apolipoprotein B levels compared with other biochemical parameters. Our data suggest that the H2 allele may be a linkage marker for an etiologic mutation for dyslipidemia and the severity and development of atherosclerosis; this is not the Ser447-Ter mutatio

    Transcriptome sequencing of an ecologically important graminivorous sawfly:a resource for marker development

    No full text
    Sawfly larvae (Hymenoptera: Symphyta) are an important, highly nutritious, invertebrate food source for farmland birds. Reduced numbers of farmland invertebrates are a possible factor contributing to the observed declines in many farmland bird populations associated with post-1950s intensification of agriculture. To date, studies on sawfly populations have been census-based and, therefore, the genetic factors underlying their declines are unknown. We have produced the first genetic resource for any sawfly species in the form of a de novo transcriptome assembly comprising 18,539 contiguous sequences (contigs) and 260 singletons. The assembly was sequenced using 454 pyrosequencing technology and produced 1,284 microsatellite markers for the common farmland sawfly Dolerus aeneus, which may also be useful in other closely-related species. These markers will facilitate monitoring of changes in genetic diversity and gene flow in sawfly populations therefore helping to predict extinction risk.</p
    corecore