962 research outputs found

    HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data

    Get PDF
    As the more recent next-generation sequencing (NGS) technologies provide longer read sequences, the use of sequencing datasets for complete haplotype phasing is fast becoming a reality, allowing haplotype reconstruction of a single sequenced genome. Nearly all previous haplotype reconstruction studies have focused on diploid genomes and are rarely scalable to genomes with higher ploidy. Yet computational investigations into polyploid genomes carry great importance, impacting plant, yeast and fish genomics, as well as the studies of the evolution of modern-day eukaryotes and (epi)genetic interactions between copies of genes. In this paper, we describe a novel maximum-likelihood estimation framework, HapTree, for polyploid haplotype assembly of an individual genome using NGS read datasets. We evaluate the performance of HapTree on simulated polyploid sequencing read data modeled after Illumina sequencing technologies. For triploid and higher ploidy genomes, we demonstrate that HapTree substantially improves haplotype assembly accuracy and efficiency over the state-of-the-art; moreover, HapTree is the first scalable polyplotyping method for higher ploidy. As a proof of concept, we also test our method on real sequencing data from NA12878 (1000 Genomes Project) and evaluate the quality of assembled haplotypes with respect to trio-based diplotype annotation as the ground truth. The results indicate that HapTree significantly improves the switch accuracy within phased haplotype blocks as compared to existing haplotype assembly methods, while producing comparable minimum error correction (MEC) values. A summary of this paper appears in the proceedings of the RECOMB 2014 conference, April 2–5.National Science Foundation (U.S.) (NSF/NIH BIGDATA Grant R01GM108348-01)National Science Foundation (U.S.) (Graduate Research Fellowship)Simons Foundatio

    Using Extended Genealogy to Estimate Components of Heritability for 23 Quantitative and Dichotomous Traits

    Get PDF
    Important knowledge about the determinants of complex human phenotypes can be obtained from the estimation of heritability, the fraction of phenotypic variation in a population that is determined by genetic factors. Here, we make use of extensive phenotype data in Iceland, long-range phased genotypes, and a population-wide genealogical database to examine the heritability of 11 quantitative and 12 dichotomous phenotypes in a sample of 38,167 individuals. Most previous estimates of heritability are derived from family-based approaches such as twin studies, which may be biased upwards by epistatic interactions or shared environment. Our estimates of heritability, based on both closely and distantly related pairs of individuals, are significantly lower than those from previous studies. We examine phenotypic correlations across a range of relationships, from siblings to first cousins, and find that the excess phenotypic correlation in these related individuals is predominantly due to shared environment as opposed to dominance or epistasis. We also develop a new method to jointly estimate narrow-sense heritability and the heritability explained by genotyped SNPs. Unlike existing methods, this approach permits the use of information from both closely and distantly related pairs of individuals, thereby reducing the variance of estimates of heritability explained by genotyped SNPs while preventing upward bias. Our results show that common SNPs explain a larger proportion of the heritability than previously thought, with SNPs present on Illumina 300K genotyping arrays explaining more than half of the heritability for the 23 phenotypes examined in this study. Much of the remaining heritability is likely to be due to rare alleles that are not captured by standard genotyping arrays

    Breeding histories and selection criteria for oilseed rape in Europe and China identified by genome wide pedigree dissection

    Get PDF
    Selection breeding has played a key role in the improvement of seed yield and quality in oilseed rape (Brassica napus L.). We genotyped Tapidor (European), Ningyou7 (Chinese) and their progenitors with the Brassica 60 K Illumina Infinium SNP array and mapped a total of 29,347 SNP markers onto the reference genome of Darmor-bzh. Identity by descent (IBD) refers to a haplotype segment of a chromosome inherited from a shared common ancestor. IBDs identified on the C subgenome were larger than those on the A subgenome within both the Tapidor and Ningyou7 pedigrees. IBD number and length were greater in the Ningyou7 pedigree than in the Tapidor pedigree. Seventy nine QTLs for flowering time, seed quality and root morphology traits were identified in the IBDs of Tapidor and Ningyou7. Many more candidate genes had been selected within the Ningyou7 pedigree than within the Tapidor pedigree. These results highlight differences in the transfer of favorable gene clusters controlling key traits during selection breeding in Europe and China

    The Role of Individual Variables, Organizational Variables and Moral Intensity Dimensions in Libyan Management Accountants’ Ethical Decision Making

    Get PDF
    This study investigates the association of a broad set of variables with the ethical decision making of management accountants in Libya. Adopting a cross-sectional methodology, a questionnaire including four different ethical scenarios was used to gather data from 229 participants. For each scenario, ethical decision making was examined in terms of the recognition, judgment and intention stages of Rest’s model. A significant relationship was found between ethical recognition and ethical judgment and also between ethical judgment and ethical intention, but ethical recognition did not significantly predict ethical intention—thus providing support for Rest’s model. Organizational variables, age and educational level yielded few significant results. The lack of significance for codes of ethics might reflect their relative lack of development in Libya, in which case Libyan companies should pay attention to their content and how they are supported, especially in the light of the under-development of the accounting profession in Libya. Few significant results were also found for gender, but where they were found, males showed more ethical characteristics than females. This unusual result reinforces the dangers of gender stereotyping in business. Personal moral philosophy and moral intensity dimensions were generally found to be significant predictors of the three stages of ethical decision making studied. One implication of this is to give more attention to ethics in accounting education, making the connections between accounting practice and (in Libya) Islam. Overall, this study not only adds to the available empirical evidence on factors affecting ethical decision making, notably examining three stages of Rest’s model, but also offers rare insights into the ethical views of practising management accountants and provides a benchmark for future studies of ethical decision making in Muslim majority countries and other parts of the developing world

    Rapid haplotype inference for nuclear families

    Get PDF
    Hapi is a new dynamic programming algorithm that ignores uninformative states and state transitions in order to efficiently compute minimum-recombinant and maximum likelihood haplotypes. When applied to a dataset containing 103 families, Hapi performs 3.8 and 320 times faster than state-of-the-art algorithms. Because Hapi infers both minimum-recombinant and maximum likelihood haplotypes and applies to related individuals, the haplotypes it infers are highly accurate over extended genomic distances.National Institutes of Health (U.S.) (NIH grant 5-T90-DK070069)National Institutes of Health (U.S.) (Grant 5-P01-NS055923)National Science Foundation (U.S.) (Graduate Research Fellowship

    Genome-wide analyses for personality traits identify six genomic loci and show correlations with psychiatric disorders

    Get PDF
    Personality is influenced by genetic and environmental factors1 and associated with mental health. However, the underlying genetic determinants are largely unknown. We identified six genetic loci, including five novel loci2,3, significantly associated with personality traits in a meta-analysis of genome-wide association studies (N = 123,132–260,861). Of these genomewide significant loci, extraversion was associated with variants in WSCD2 and near PCDH15, and neuroticism with variants on chromosome 8p23.1 and in L3MBTL2. We performed a principal component analysis to extract major dimensions underlying genetic variations among five personality traits and six psychiatric disorders (N = 5,422–18,759). The first genetic dimension separated personality traits and psychiatric disorders, except that neuroticism and openness to experience were clustered with the disorders. High genetic correlations were found between extraversion and attention-deficit– hyperactivity disorder (ADHD) and between openness and schizophrenia and bipolar disorder. The second genetic dimension was closely aligned with extraversion–introversion and grouped neuroticism with internalizing psychopathology (e.g., depression or anxiety)

    The effect of incorrect scanning distance on boundary detection errors and macular thickness measurements by spectral domain optical coherence tomography: a cross sectional study

    Get PDF
    BACKGROUND: To investigate the influence of scan distance on retinal boundary detection errors (RBDEs) and retinal thickness measurements by spectral domain optical coherence tomography (SD-OCT). METHODS: 10 eyes of healthy subjects, 10 eyes with diabetic macular edema (DME) and 10 eyes with neovascular age-related macular degeneration (AMD) were examined with RTVue SD-OCT. The MM5 protocol was used in two consecutive sessions to scan the macula. For the first session, the device was set 3.5 cm from the eye in order to obtain detectable signal with low fundus image quality (suboptimal setting) while in the second session a distance of 2.5 cm was set with a good quality fundus image. The signal strength (SSI) value was recorded. The score for retinal boundary detection errors (RBDE) was calculated for ten scans of each examination. RBDE scores were recorded for the whole scan and also for the peripheral 1.0 mm region. RBDE scores, regional retinal thickness values and SSI values between the two sessions were compared. The correlation between SSI and the number of RBDEs was also examined. RESULTS: The SSI was significantly lower with suboptimal settings compared to optimal settings (63.9+/-12.0 vs. 68.3+/-12.2, respectively, p = 0.001) and the number of RBDEs was significantly higher with suboptimal settings in the "all-eyes" group along with the group of healthy subjects and eyes with DME (9.1+/-6.5 vs. 6.8+/-6.3, p = 0.007; 4.4+/-2.6 vs. 2.5+/-1.6, p = 0.035 and 9.7+/-3.3 vs. 5.1+/-3.7, p = 0.008, respectively). For these groups, significant negative correlation was found between the SSI and the number of RBDEs. In the AMD group, the number of RBDEs was markedly higher compared to the other groups and there was no difference in RBDEs between optimal and suboptimal settings with the errors being independent of the SSI. There were significantly less peripheral RBDEs with optimal settings in the "all-eyes" group and the DME subgroup (2.7+/-2.6 vs. 4.2+/-2.8, p = 0.001 and 1.4+/-1.7 vs. 4.1+/-2.2, p = 0.007, respectively). Retinal thickness in the two settings was significantly different only in the outer-superior region in DME. CONCLUSIONS: Optimal distance settings improve SD-OCT SSI with a decrease in RBDEs while retinal thickness measurements are independent of scanning distance

    A Systematic Mapping Approach of 16q12.2/FTO and BMI in More Than 20,000 African Americans Narrows in on the Underlying Functional Variation: Results from the Population Architecture using Genomics and Epidemiology (PAGE) Study

    Get PDF
    Genetic variants in intron 1 of the fat mass- and obesity-associated (FTO) gene have been consistently associated with body mass index (BMI) in Europeans. However, follow-up studies in African Americans (AA) have shown no support for some of the most consistently BMI-associated FTO index single nucleotide polymorphisms (SNPs). This is most likely explained by different race-specific linkage disequilibrium (LD) patterns and lower correlation overall in AA, which provides the opportunity to fine-map this region and narrow in on the functional variant. To comprehensively explore the 16q12.2/FTO locus and to search for second independent signals in the broader region, we fine-mapped a 646-kb region, encompassing the large FTO gene and the flanking gene RPGRIP1L by investigating a total of 3,756 variants (1,529 genotyped and 2,227 imputed variants) in 20,488 AAs across five studies. We observed associations between BMI and variants in the known FTO intron 1 locus: the SNP with the most significant p-value, rs56137030 (8.3×10-6) had not been highlighted in previous studies. While rs56137030was correlated at r2>0.5 with 103 SNPs in Europeans (including the GWAS index SNPs), this number was reduced to 28 SNPs in AA. Among rs56137030 and the 28 correlated SNPs, six were located within candidate intronic regulatory elements, including rs1421085, for which we predicted allele-specific binding affinity for the transcription factor CUX1, which has recently been implicated in the regulation of FTO. We did not find strong evidence for a second independent signal in the broader region. In summary, this large fine-mapping study in AA has substantially reduced the number of common alleles that are likely to be functional candidates of the known FTO locus. Importantly our study demonstrated that comprehensive fine-mapping in AA provides a powerful approach to narrow in on the functional candidate(s) underlying the initial GWAS findings in European populations

    A combined long-range phasing and long haplotype imputation method to impute phase for SNP genotypes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Knowing the phase of marker genotype data can be useful in genome-wide association studies, because it makes it possible to use analysis frameworks that account for identity by descent or parent of origin of alleles and it can lead to a large increase in data quantities via genotype or sequence imputation. Long-range phasing and haplotype library imputation constitute a fast and accurate method to impute phase for SNP data.</p> <p>Methods</p> <p>A long-range phasing and haplotype library imputation algorithm was developed. It combines information from surrogate parents and long haplotypes to resolve phase in a manner that is not dependent on the family structure of a dataset or on the presence of pedigree information.</p> <p>Results</p> <p>The algorithm performed well in both simulated and real livestock and human datasets in terms of both phasing accuracy and computation efficiency. The percentage of alleles that could be phased in both simulated and real datasets of varying size generally exceeded 98% while the percentage of alleles incorrectly phased in simulated data was generally less than 0.5%. The accuracy of phasing was affected by dataset size, with lower accuracy for dataset sizes less than 1000, but was not affected by effective population size, family data structure, presence or absence of pedigree information, and SNP density. The method was computationally fast. In comparison to a commonly used statistical method (fastPHASE), the current method made about 8% less phasing mistakes and ran about 26 times faster for a small dataset. For larger datasets, the differences in computational time are expected to be even greater. A computer program implementing these methods has been made available.</p> <p>Conclusions</p> <p>The algorithm and software developed in this study make feasible the routine phasing of high-density SNP chips in large datasets.</p

    Extensive Copy-Number Variation of Young Genes across Stickleback Populations

    Get PDF
    MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript
    corecore