74,706 research outputs found
Bayesian robust analysis for genetic architecture of quantitative traits
Motivation: In most quantitative trait locus (QTL) mapping studies, phenotypes are assumed to follow normal distributions. Deviations from this assumption may affect the accuracy of QTL detection and lead to detection of spurious QTLs. To improve the robustness of QTL mapping methods, we replaced the normal distribution for residuals in multiple interacting QTL models with the normal/independent distributions that are a class of symmetric and long-tailed distributions and are able to accommodate residual outliers. Subsequently, we developed a Bayesian robust analysis strategy for dissecting genetic architecture of quantitative traits and for mapping genome-wide interacting QTLs in line crosses
Genome-wide association studies for canine hip dysplasia in single and multiple populations – implications and potential novel risk loci
Background Association mapping studies of quantitative trait loci (QTL) for canine hip dysplasia (CHD) can contribute to the understanding of the genetic background of this common and debilitating disease and might contribute to its genetic improvement. The power of association studies for CHD is limited by relatively small sample numbers for CHD records within countries, suggesting potential benefits of joining data across countries. However, this is complicated due to the use of different scoring systems across countries. In this study, we incorporated routinely assessed CHD records and genotype data of German Shepherd dogs from two countries (UK and Sweden) to perform genome-wide association studies (GWAS) within populations using different variations of CHD phenotypes. As phenotypes, dogs were either classified into cases and controls based on the Federation Cynologique Internationale (FCI) five-level grading of the worst hip or the FCI grade was treated as an ordinal trait. In a subsequent meta-analysis, we added publicly available data from a Finnish population and performed the GWAS across all populations. Genetic associations for the CHD phenotypes were evaluated in a linear mixed model using 62,089 SNPs. Results Multiple SNPs with genome-wide significant and suggestive associations were detected in single-population GWAS and the meta-analysis. Few of these SNPs overlapped between populations or between single-population GWAS and the meta-analysis, suggesting that many CHD-related QTL are population-specific. More significant or suggestive SNPs were identified when FCI grades were used as phenotypes in comparison to the case-control approach. MED13 (Chr 9) and PLEKHA7 (Chr 21) emerged as novel positional candidate genes associated with hip dysplasia. Conclusions Our findings confirm the complex genetic nature of hip dysplasia in dogs, with multiple loci associated with the trait, most of which are population-specific. Routinely assessed CHD information collected across countries provide an opportunity to increase sample sizes and statistical power for association studies. While the lack of standardisation of CHD assessment schemes across countries poses a challenge, we showed that conversion of traits can be utilised to overcome this obstacle
A Simple Method for Combining Genetic Mapping Data from Multiple Crosses and Experimental Designs
Over the past decade many linkage studies have defined chromosomal intervals containing polymorphisms that modulate a variety of traits. Many phenotypes are now associated with enough mapping data that meta-analysis could help refine locations of known QTLs and detect many novel QTLs.We describe a simple approach to combining QTL mapping results for multiple studies and demonstrate its utility using two hippocampus weight loci. Using data taken from two populations, a recombinant inbred strain set and an advanced intercross population we demonstrate considerable improvements in significance and resolution for both loci. 1-LOD support intervals were improved 51% for Hipp1a and 37% for Hipp9a. We first generate locus-wise permuted P-values for association with the phenotype from multiple maps, which can be done using a permutation method appropriate to each population. These results are then assigned to defined physical positions by interpolation between markers with known physical and genetic positions. We then use Fisher's combination test to combine position-by-position probabilities among experiments. Finally, we calculate genome-wide combined P-values by generating locus-specific P-values for each permuted map for each experiment. These permuted maps are then sampled with replacement and combined. The distribution of best locus-specific P-values for each combined map is the null distribution of genome-wide adjusted P-values.Our approach is applicable to a wide variety of segregating and non-segregating mapping populations, facilitates rapid refinement of physical QTL position, is complementary to other QTL fine mapping methods, and provides an appropriate genome-wide criterion of significance for combined mapping results
Recommended from our members
Clustering and principal-components approach based on heritability for mapping multiple gene expressions
When the number of phenotypes in a genetic study is on the scale of thousands, such as in studies concerning thousands of gene expression levels, the single-trait analysis is computationally intensive, and heavy adjustment of multiple comparisons is required. Traditional multivariate genetic linkage analysis for quantitative traits focuses on mapping only a few phenotypes and is not feasible for a large number of traits. To cope with high-dimensional phenotype data, clustering analysis and principal-component analysis (PCA) are proposed to reduce the data dimensionality and to map shared genetic contributions for multiple traits. However, standard clustering analysis and PCA are applicable for independent observations. In most genetic studies, where family data are collected, these standard analyses can only be applied to founders and can lead to the loss of information. Here, we proposed a clustering method that can exploit family structure information and applied the method to 29 gene expression levels mapped to a reported hot spot on chromosome 14. We then used a PCA approach based on heritability applicable to small number of traits to combine phenotypes in the clusters. Lastly, we used a penalized PCA approach based on heritability applicable to arbitrary number of traits to combine 150 gene expression levels with the highest heritability. Genome-wide multipoint linkage analysis was carried out on the individual traits and on the combined traits. Two previously reported peaks on chromosomes 14 and 20 were identified. Linkage evidence was stronger for traits derived from methods that incorporate family structure information
How to understand the cell by breaking it: network analysis of gene perturbation screens
Modern high-throughput gene perturbation screens are key technologies at the
forefront of genetic research. Combined with rich phenotypic descriptors they
enable researchers to observe detailed cellular reactions to experimental
perturbations on a genome-wide scale. This review surveys the current
state-of-the-art in analyzing perturbation screens from a network point of
view. We describe approaches to make the step from the parts list to the wiring
diagram by using phenotypes for network inference and integrating them with
complementary data sources. The first part of the review describes methods to
analyze one- or low-dimensional phenotypes like viability or reporter activity;
the second part concentrates on high-dimensional phenotypes showing global
changes in cell morphology, transcriptome or proteome.Comment: Review based on ISMB 2009 tutorial; after two rounds of revisio
Automated data integration for developmental biological research
In an era exploding with genome-scale data, a major challenge for developmental biologists is how to extract significant clues from these publicly available data to benefit our studies of individual genes, and how to use them to improve our understanding of development at a systems level. Several studies have successfully demonstrated new approaches to classic developmental questions by computationally integrating various genome-wide data sets. Such computational approaches have shown great potential for facilitating research: instead of testing 20,000 genes, researchers might test 200 to the same effect. We discuss the nature and state of this art as it applies to developmental research
A major genetic locus in <i>Trypanosoma brucei</i> is a determinant of host pathology
The progression and variation of pathology during infections can be due to components from both host or pathogen, and/or the interaction between them. The influence of host genetic variation on disease pathology during infections with trypanosomes has been well studied in recent years, but the role of parasite genetic variation has not been extensively studied. We have shown that there is parasite strain-specific variation in the level of splenomegaly and hepatomegaly in infected mice and used a forward genetic approach to identify the parasite loci that determine this variation. This approach allowed us to dissect and identify the parasite loci that determine the complex phenotypes induced by infection. Using the available trypanosome genetic map, a major quantitative trait locus (QTL) was identified on T. brucei chromosome 3 (LOD = 7.2) that accounted for approximately two thirds of the variance observed in each of two correlated phenotypes, splenomegaly and hepatomegaly, in the infected mice (named <i>TbOrg1</i>). In addition, a second locus was identified that contributed to splenomegaly, hepatomegaly and reticulocytosis (<i>TbOrg2</i>). This is the first use of quantitative trait locus mapping in a diploid protozoan and shows that there are trypanosome genes that directly contribute to the progression of pathology during infections and, therefore, that parasite genetic variation can be a critical factor in disease outcome. The identification of parasite loci is a first step towards identifying the genes that are responsible for these important traits and shows the power of genetic analysis as a tool for dissecting complex quantitative phenotypic traits
Recommended from our members
Mapping genetic interactions in cancer: a road to rational combination therapies.
The discovery of synthetic lethal interactions between poly (ADP-ribose) polymerase (PARP) inhibitors and BRCA genes, which are involved in homologous recombination, led to the approval of PARP inhibition as a monotherapy for patients with BRCA1/2-mutated breast or ovarian cancer. Studies following the initial observation of synthetic lethality demonstrated that the reach of PARP inhibitors is well beyond just BRCA1/2 mutants. Insights into the mechanisms of action of anticancer drugs are fundamental for the development of targeted monotherapies or rational combination treatments that will synergize to promote cancer cell death and overcome mechanisms of resistance. The development of targeted therapeutic agents is premised on mapping the physical and functional dependencies of mutated genes in cancer. An important part of this effort is the systematic screening of genetic interactions in a variety of cancer types. Until recently, genetic-interaction screens have relied either on the pairwise perturbations of two genes or on the perturbation of genes of interest combined with inhibition by commonly used anticancer drugs. Here, we summarize recent advances in mapping genetic interactions using targeted, genome-wide, and high-throughput genetic screens, and we discuss the therapeutic insights obtained through such screens. We further focus on factors that should be considered in order to develop a robust analysis pipeline. Finally, we discuss the integration of functional interaction data with orthogonal methods and suggest that such approaches will increase the reach of genetic-interaction screens for the development of rational combination therapies
Genome-wide association study in two cohorts from a multi-generational mouse advanced intercross line highlights the difficulty of replication due to study-specific heterogeneity
There has been extensive discussion of the Replication Crisis in many fields, including genome-wide association studies
- …