62 research outputs found

    Rapid haplotype inference for nuclear families

    Get PDF
    Hapi is a new dynamic programming algorithm that ignores uninformative states and state transitions in order to efficiently compute minimum-recombinant and maximum likelihood haplotypes. When applied to a dataset containing 103 families, Hapi performs 3.8 and 320 times faster than state-of-the-art algorithms. Because Hapi infers both minimum-recombinant and maximum likelihood haplotypes and applies to related individuals, the haplotypes it infers are highly accurate over extended genomic distances.National Institutes of Health (U.S.) (NIH grant 5-T90-DK070069)National Institutes of Health (U.S.) (Grant 5-P01-NS055923)National Science Foundation (U.S.) (Graduate Research Fellowship

    Spin Caloritronics

    Get PDF
    This is a brief overview of the state of the art of spin caloritronics, the science and technology of controlling heat currents by the electron spin degree of freedom (and vice versa).Comment: To be published in "Spin Current", edited by S. Maekawa, E. Saitoh, S. Valenzuela and Y. Kimura, Oxford University Pres

    Efficient and Accurate Construction of Genetic Linkage Maps from the Minimum Spanning Tree of a Graph

    Get PDF
    Genetic linkage maps are cornerstones of a wide spectrum of biotechnology applications, including map-assisted breeding, association genetics, and map-assisted gene cloning. During the past several years, the adoption of high-throughput genotyping technologies has been paralleled by a substantial increase in the density and diversity of genetic markers. New genetic mapping algorithms are needed in order to efficiently process these large datasets and accurately construct high-density genetic maps. In this paper, we introduce a novel algorithm to order markers on a genetic linkage map. Our method is based on a simple yet fundamental mathematical property that we prove under rather general assumptions. The validity of this property allows one to determine efficiently the correct order of markers by computing the minimum spanning tree of an associated graph. Our empirical studies obtained on genotyping data for three mapping populations of barley (Hordeum vulgare), as well as extensive simulations on synthetic data, show that our algorithm consistently outperforms the best available methods in the literature, particularly when the input data are noisy or incomplete. The software implementing our algorithm is available in the public domain as a web tool under the name MSTmap

    Haplotype association analyses in resources of mixed structure using Monte Carlo testing

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genomewide association studies have resulted in a great many genomic regions that are likely to harbor disease genes. Thorough interrogation of these specific regions is the logical next step, including regional haplotype studies to identify risk haplotypes upon which the underlying critical variants lie. Pedigrees ascertained for disease can be powerful for genetic analysis due to the cases being enriched for genetic disease. Here we present a Monte Carlo based method to perform haplotype association analysis. Our method, hapMC, allows for the analysis of full-length and sub-haplotypes, including imputation of missing data, in resources of nuclear families, general pedigrees, case-control data or mixtures thereof. Both traditional association statistics and transmission/disequilibrium statistics can be performed. The method includes a phasing algorithm that can be used in large pedigrees and optional use of pseudocontrols.</p> <p>Results</p> <p>Our new phasing algorithm substantially outperformed the standard expectation-maximization algorithm that is ignorant of pedigree structure, and hence is preferable for resources that include pedigree structure. Through simulation we show that our Monte Carlo procedure maintains the correct type 1 error rates for all resource types. Power comparisons suggest that transmission-disequilibrium statistics are superior for performing association in resources of only nuclear families. For mixed structure resources, however, the newly implemented pseudocontrol approach appears to be the best choice. Results also indicated the value of large high-risk pedigrees for association analysis, which, in the simulations considered, were comparable in power to case-control resources of the same sample size.</p> <p>Conclusions</p> <p>We propose hapMC as a valuable new tool to perform haplotype association analyses, particularly for resources of mixed structure. The availability of meta-association and haplotype-mining modules in our suite of Monte Carlo haplotype procedures adds further value to the approach.</p

    Folliculin mutations are not associated with severe COPD

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Rare loss-of-function folliculin (<it>FLCN</it>) mutations are the genetic cause of Birt-Hogg-Dubรฉ syndrome, a monogenic disorder characterized by spontaneous pneumothorax, fibrofolliculomas, and kidney tumors. Loss-of-function folliculin mutations have also been described in pedigrees with familial spontaneous pneumothorax. Because the majority of patients with folliculin mutations have radiographic evidence of pulmonary cysts, folliculin has been hypothesized to contribute to the development of emphysema.</p> <p>To determine whether folliculin sequence variants are risk factors for severe COPD, we genotyped seven previously reported Birt-Hogg-Dubรฉ or familial spontaneous pneumothorax associated folliculin mutations in 152 severe COPD probands participating in the Boston Early-Onset COPD Study. We performed bidirectional resequencing of all 14 folliculin exons in a subset of 41 probands and subsequently genotyped four identified variants in an independent sample of345 COPD subjects from the National Emphysema Treatment Trial (cases) and 420 male smokers with normal lung function from the Normative Aging Study (controls).</p> <p>Results</p> <p>None of the seven previously reported Birt-Hogg-Dubรฉ or familial spontaneous pneumothorax mutations were observed in the 152 severe, early-onset COPD probands. Exon resequencing identified 31 variants, including two non-synonymous polymorphisms and two common non-coding polymorphisms. No significant association was observed for any of these four variants with presence of COPD or emphysema-related phenotypes.</p> <p>Conclusion</p> <p>Genetic variation in folliculin does not appear to be a major risk factor for severe COPD. These data suggest that familial spontaneous pneumothorax and COPD have distinct genetic causes, despite some overlap in radiographic characteristics.</p

    Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Cucumber, <it>Cucumis sativus </it>L. is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are frequently favored as genetic markers due to their high level of polymorphism and codominant inheritance. Data from previously characterized genomes has shown that these repeats vary in frequency, motif sequence, and genomic location across taxa. During the last year, the genomes of two cucumber genotypes were sequenced including the Chinese fresh market type inbred line '9930' and the North American pickling type inbred line 'Gy14'. These sequences provide a powerful tool for developing markers in a large scale. In this study, we surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp assembled Gy14 DNA sequences, representing 55% of its nuclear genome, and in cucumber EST sequences. Similar analyses were performed in genomic and EST data from seven other plant species, and the results were compared with those of cucumber.</p> <p>Results</p> <p>A total of 112,073 perfect repeats were detected in the Gy14 cucumber genome sequence, accounting for 0.9% of the assembled Gy14 genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were the most frequent microsatellites in genomic DNA sequence, dinucleotide repeats, which had more repeat units than any other SSR type, had the highest cumulative sequence length. Coding regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic sequence, with trinucleotides predominating in EST sequences. AAG was the most frequent repeat in cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to the other species examined, cucumber genomic sequence had the highest density of SSRs (although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between 9930 and Gy14 at 1,006 SSR loci, and found unexpectedly high degree of polymorphism (48.3%) between the two genotypes. The level of polymorphism seems to be positively associated with the number of repeat units in the microsatellite. The <it>in silico </it>PCR results were validated empirically in 660 of the 1,006 SSR loci. In addition, primer sequences for more than 83,000 newly-discovered cucumber microsatellites, and their exact positions in the Gy14 genome assembly were made publicly available.</p> <p>Conclusions</p> <p>The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats were characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community.</p

    Microsatellite isolation and marker development in carrot - genomic distribution, linkage mapping, genetic diversity analysis and marker transferability across Apiaceae

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Apiaceae family includes several vegetable and spice crop species among which carrot is the most economically important member, with ~21 million tons produced yearly worldwide. Despite its importance, molecular resources in this species are relatively underdeveloped. The availability of informative, polymorphic, and robust PCR-based markers, such as microsatellites (or SSRs), will facilitate genetics and breeding of carrot and other Apiaceae, including integration of linkage maps, tagging of phenotypic traits and assisting positional gene cloning. Thus, with the purpose of isolating carrot microsatellites, two different strategies were used; a hybridization-based library enrichment for SSRs, and bioinformatic mining of SSRs in BAC-end sequence and EST sequence databases. This work reports on the development of 300 carrot SSR markers and their characterization at various levels.</p> <p>Results</p> <p>Evaluation of microsatellites isolated from both DNA sources in subsets of 7 carrot F<sub>2 </sub>mapping populations revealed that SSRs from the hybridization-based method were longer, had more repeat units and were more polymorphic than SSRs isolated by sequence search. Overall, 196 SSRs (65.1%) were polymorphic in at least one mapping population, and the percentage of polymophic SSRs across F<sub>2 </sub>populations ranged from 17.8 to 24.7. Polymorphic markers in one family were evaluated in the entire F<sub>2</sub>, allowing the genetic mapping of 55 SSRs (38 codominant) onto the carrot reference map. The SSR loci were distributed throughout all 9 carrot linkage groups (LGs), with 2 to 9 SSRs/LG. In addition, SSR evaluations in carrot-related taxa indicated that a significant fraction of the carrot SSRs transfer successfully across Apiaceae, with heterologous amplification success rate decreasing with the target-species evolutionary distance from carrot. SSR diversity evaluated in a collection of 65 <it>D. carota </it>accessions revealed a high level of polymorphism for these selected loci, with an average of 19 alleles/locus and 0.84 expected heterozygosity.</p> <p>Conclusions</p> <p>The addition of 55 SSRs to the carrot map, together with marker characterizations in six other mapping populations, will facilitate future comparative mapping studies and integration of carrot maps. The markers developed herein will be a valuable resource for assisting breeding, genetic, diversity, and genomic studies of carrot and other Apiaceae.</p

    Recombinational Landscape and Population Genomics of Caenorhabditis elegans

    Get PDF
    Recombination rate and linkage disequilibrium, the latter a function of population genomic processes, are the critical parameters for mapping by linkage and association, and their patterns in Caenorhabditis elegans are poorly understood. We performed high-density SNP genotyping on a large panel of recombinant inbred advanced intercross lines (RIAILs) of C. elegans to characterize the landscape of recombination and, on a panel of wild strains, to characterize population genomic patterns. We confirmed that C. elegans autosomes exhibit discrete domains of nearly constant recombination rate, and we show, for the first time, that the pattern holds for the X chromosome as well. The terminal domains of each chromosome, spanning about 7% of the genome, exhibit effectively no recombination. The RIAILs exhibit a 5.3-fold expansion of the genetic map. With median marker spacing of 61 kb, they are a powerful resource for mapping quantitative trait loci in C. elegans. Among 125 wild isolates, we identified only 41 distinct haplotypes. The patterns of genotypic similarity suggest that some presumed wild strains are laboratory contaminants. The Hawaiian strain, CB4856, exhibits genetic isolation from the remainder of the global population, whose members exhibit ample evidence of intercrossing and recombining. The population effective recombination rate, estimated from the pattern of linkage disequilibrium, is correlated with the estimated meiotic recombination rate, but its magnitude implies that the effective rate of outcrossing is extremely low, corroborating reports of selection against recombinant genotypes. Despite the low population, effective recombination rate and extensive linkage disequilibrium among chromosomes, which are techniques that account for background levels of genomic similarity, permit association mapping in wild C. elegans strains

    Identification of RNF213 as a Susceptibility Gene for Moyamoya Disease and Its Possible Role in Vascular Development

    Get PDF
    ใ‚‚ใ‚„ใ‚‚ใ‚„็—…ๆ„Ÿๅ—ๆ€ง้บไผๅญใฎ็‰นๅฎšใจใใฎๆฉŸ่ƒฝใซใคใ„ใฆใฎ็™บ่ฆ‹. ไบฌ้ƒฝๅคงๅญฆใƒ—ใƒฌใ‚นใƒชใƒชใƒผใ‚น. 2011-7-21.Background Moyamoya disease is an idiopathic vascular disorder of intracranial arteries. Its susceptibility locus has been mapped to 17q25.3 in Japanese families, but the susceptibility gene is unknown. Methodology/Principal Findings Genome-wide linkage analysis in eight three-generation families with moyamoya disease revealed linkage to 17q25.3 (P<10-4). Fine mapping demonstrated a 1.5-Mb disease locus bounded by D17S1806 and rs2280147. We conducted exome analysis of the eight index cases in these families, with results filtered through Ng criteria. There was a variant of p.N321S in PCMTD1 and p.R4810K in RNF213 in the 1.5-Mb locus of the eight index cases. The p.N321S variant in PCMTD1 could not be confirmed by the Sanger method. Sequencing RNF213 in 42 index cases confirmed p.R4810K and revealed it to be the only unregistered variant. Genotyping 39 SNPs around RNF213 revealed a founder haplotype transmitted in 42 families. Sequencing the 260-kb region covering the founder haplotype in one index case did not show any coding variants except p.R4810K. A case-control study demonstrated strong association of p.R4810K with moyamoya disease in East Asian populations (251 cases and 707 controls) with an odds ratio of 111.8 (P = 10โˆ’119). Sequencing of RNF213 in East Asian cases revealed additional novel variants: p.D4863N, p.E4950D, p.A5021V, p.D5160E, and p.E5176G. Among Caucasian cases, variants p.N3962D, p.D4013N, p.R4062Q and p.P4608S were identified. RNF213 encodes a 591-kDa cytosolic protein that possesses two functional domains: a Walker motif and a RING finger domain. These exhibit ATPase and ubiquitin ligase activities. Although the mutant alleles (p.R4810K or p.D4013N in the RING domain) did not affect transcription levels or ubiquitination activity, knockdown of RNF213 in zebrafish caused irregular wall formation in trunk arteries and abnormal sprouting vessels. Conclusions/Significance We provide evidence suggesting, for the first time, the involvement of RNF213 in genetic susceptibility to moyamoya disease

    Comparative Developmental Expression Profiling of Two C. elegans Isolates

    Get PDF
    Gene expression is known to change during development and to vary among genetically diverse strains. Previous studies of temporal patterns of gene expression during C. elegans development were incomplete, and little is known about how these patterns change as a function of genetic background. We used microarrays that comprehensively cover known and predicted worm genes to compare the landscape of genetic variation over developmental time between two isolates of C. elegans. We show that most genes vary in expression during development from egg to young adult, many genes vary in expression between the two isolates, and a subset of these genes exhibit isolate-specific changes during some developmental stages. This subset is strongly enriched for genes with roles in innate immunity. We identify several novel motifs that appear to play a role in regulating gene expression during development, and we propose functional annotations for many previously unannotated genes. These results improve our understanding of gene expression and function during worm development and lay the foundation for linkage studies of the genetic basis of developmental variation in gene expression in this important model organism
    • โ€ฆ
    corecore