673 research outputs found
CONTRIBUTION TO LINKAGE AND ASSOCIATION MAPPING OF TRAIT LOCI IN LIVESTOCK.
Until recently, breeding values were estimated based on phenotypes measured on the individual and its relatives, and the notion that the covariance between breeding values is proportionate to the kinship coefficient. Advances in genomics now allow for direct analysis of the genome and identification of the loci that determine the breeding values of individuals. As a consequence, marker assisted selection and genomic selection have become more effective and are replacing conventional selection.
The identification of loci influencing the traits of interest requires the use of advanced statistical methods that are constantly evolving. In the context of this thesis, we have (i) contributed to the development of gene mapping methods, (ii) applied these methods to map loci influencing both metric and meristic traits, and (iii) contributed to the development of methods for the integration of genomic information in livestock breeding and management.
The mapping methods that we have helped developing distinguish themselves mainly by the fact that (i) they exploit haplotype information (by means of a hidden markov model) which should increase the linkage disequilibrium with causative variants and hence detection power, (ii) they can simultaneously extract linkage information within families, and linkage disequilibrium information across the population, and (iii) they correct for population stratification by means of a random polygenic effect, and (iv) they can be applied to binary as well as quantitative traits.
We have applied these and other methods to map loci influencing (i) quantitative hematological parameters in a porcine line-cross, and (ii) binary traits including diseases in bovine and non-syntenic Copy Number Variants in cattle, horse and human.
In fine, we have contributed to the development of methods for the utilization of marker information in animal selection and production. We have extended the haplotype-based mapping method to allow imputation and have evaluated the utility of this approach in scenarios mimicking reality. We have also contributed to the development of a method to quantify somatic cell counts in the milk of individual cows by genotyping a sample of milk from the farmβs tank (hence a mixture of milk from all cows on the farm)
Our work has resulted in the development of a software package (βGLASCOWβ) that is increasingly used by the community to map genes influencing complex traits, primarily binary. By using this tool, we have contributed to the localization of several trait loci in pig, cattle, horse and human. We have contributed to the development of approaches that reduce the costs of genomic analyses in livestock by, on the one hand, complementing real SNP genotypes with genotypes obtained in silico by means imputation, and, on the other hand, by developing a method to deconvolute genotypes obtained on DNA pools
Genome-wide Association Studies in Ancestrally Diverse Populations: Opportunities, Methods, Pitfalls, and Recommendations
Genome-wide association studies (GWASs) have focused primarily on populations of European descent, but it is essential that diverse populations become better represented. Increasing diversity among study participants will advance our understanding of genetic architecture in all populations and ensure that genetic research is broadly applicable. To facilitate and promote research in multi-ancestry and admixed cohorts, we outline key methodological considerations and highlight opportunities, challenges, solutions, and areas in need of development. Despite the perception that analyzing genetic data from diverse populations is difficult, it is scientifically and ethically imperative, and there is an expanding analytical toolbox to do it well
Mapping genes underlying ethnic differences in tuberculosis risk by linkage disequilibrium in the South African coloured population of the Western Cape
Includes bibliographical references.The South Africa Coloured population of the Western Cape is the result of unions between Europeans, Africans (Bantu and Khoisan), and various other populations (Malaysian or Indonesian descent). The world-wide burden of tuberculosis remains an enormous problem, and is particularly severe in this population. In general, admixed populations that have arisen in historical times can make an important contribution to the discovery of disease susceptibility genes if the parental populations exhibit substantial variation in susceptibility. Despite numerous successful genome-wide association studies, detecting variants that have low disease risk still poses a challenge. Furthermore, admixture association studies for multi-way admixed populations pose constant challenges, including the choice of an accurate ancestral panel to infer ancestry and for imputing missing genotypes to identify possible genetic variants causing susceptibility to disease. This thesis addresses some of these challenges. We first developed PROXYANC, an approach to select the best proxy ancestral populations for admixed populations. From the simulation of a multi-way admixed population, we demonstrated the ability and accuracy of PROXYANC in selecting the best proxy ancestry and illustrated the importance of the choice of ancestries in both estimating admixture proportions and imputing missing genotypes. We applied this approach to the South African Coloured population, to refine both the choice of ancestral populations and their genetic contributions. We also demonstrated that the ancestral allele frequency differences correlated with increased linkage disequilibrium in the SAC, and that the increased LD originates from admixture events rather than population bottlenecks. Secondly, we conducted a study to determine whether ancestry-specific genetic contributions affect tuberculosis risk. We additionally conducted imputation genome-wide association studies and a meta-analysis incorporating previous genome-wide association studies of tuberculosis
Polygenicity and Epistasis Underlie Fitness-Proximal Traits in the Caenorhabditis elegans Multiparental Experimental Evolution (CeMEE) Panel
The deposited article is a pre-print version and it has not been submitted to peer reviewing. This article version was provided by bioRxiv and is the preprint first posted online Mar. 26, 2017. This publication hasn't any creative commons license associated. The deposited article version contains attached the supplementary materials within the pdf.Understanding the genetic basis of complex traits remains a major challenge in biology. Polygenicity, phenotypic plasticity and epistasis contribute to phenotypic variance in ways that are rarely clear. This uncertainty can be problematic for estimating heritability, for predicting individual phenotypes from genomic data, and for parameterizing models of phenotypic evolution. Here we report an advanced recombinant inbred line (RIL) quantitative trait locus (QTL) mapping panel for the hermaphroditic nematode Caenorhabditis elegans, the C. elegans multiparental experimental evolution (CeMEE) panel. The CeMEE panel, comprising 507 RILs at present, was created by hybridization of 16 wild isolates, experimental evolution for 140-190 generations, and inbreeding by selfing for 13-16 generations. The panel contains 22% of single nucleotide polymorphisms known to segregate in natural populations, and complements existing C. elegans mapping resources by providing fine resolution and high nucleotide diversity across >95% of the genome. We apply it to study the genetic basis of two fitness components, fertility and hermaphrodite body size at time of reproduction, with high broad sense heritability in the CeMEE. While simulations show we should detect common alleles with additive effects as small as 5%, at gene-level resolution, the genetic architectures of these traits does not feature such alleles. We instead find that a significant fraction of trait variance, approaching 40% for fertility, can be explained by sign epistasis with main effects below the detection limit. In congruence, phenotype prediction from genomic similarity, while generally poor (r2 < 10%), requires modeling epistasis for optimal accuracy, with most variance attributed to the rapidly evolving chromosome arms.National Science Foundation grant: (PHY-1125915); National Institutes of Health grants: (R25-GM-067110, R01-GM-089972, R01-GM-121828); Gordon and Betty Moore Foundation grant: (2919.01); Human Frontiers Science Program (RGP0045/2010); European Research Council grant: (FP7/2007-2013/243285); Agence Nationale de la Recherche grant: (ANR-14-ACHN-0032-01).info:eu-repo/semantics/publishedVersio
Genetic variation and health in rural Caribbean village
Bwa Mawego is a small-scale horticultural community (~500 people) on the island of Dominica that has been the site of a longitudinal health research project for more than 30 years. Cardiovascular diseases and metabolic health are growing local concerns. Here we analyze longitudinal growth data, cardiometabolic metrics, and genome-wide single nucleotide polymorphism (SNP) data from this population to investigate sources of variation in anthropometric and cardiometabolic outcomes. Mixed effect heritability models indicate that (1) variation in body mass index (BMI) is significantly shaped by genetic variation, and (2) variation between longitudinal BMI curves has not been consistently impacted by secular environmental trends from 1997 2017. In order to assess genetic variation in more detail, we first characterize the population structure and admixture in this Caribbean community using high-density SNP data and global reference samples in the Human Genome Diversity Panel. We detect four distinct family clusters and admixture from African, European, and Amerindian ancestral populations that occurred 5-6 generations ago (~130-150 years). Amerindian haplotypes represented in Bwa Mawego associate with deeply diverged lineages in Karitiana and Surui peoples, highlighting the regionally variable nature of admixture throughout the Caribbean and unique historical outcomes in Dominica. Genome-wide association tests of cardiometabolic phenotypes identify a genomic region of interest downstream of the ANK3 gene that associates with BMI in Bwa Mawego, after controlling for confounding variation from ancestral population structure and relatedness. Any functional relationship between ANK3 and BMI is currently uncharacterized, and there is unique potential to further explore complex gene-environment-phenotype landscapes in Bwa Mawego.Includes bibliographical reference
Recommended from our members
International meta-analysis of PTSD genome-wide association studies identifies sex- and ancestry-specific genetic risk loci.
The risk of posttraumatic stress disorder (PTSD) following trauma is heritable, but robust common variants have yet to be identified. In a multi-ethnic cohort including over 30,000 PTSD cases and 170,000 controls we conduct a genome-wide association study of PTSD. We demonstrate SNP-based heritability estimates of 5-20%, varying by sex. Three genome-wide significant loci are identified, 2 in European and 1 in African-ancestry analyses. Analyses stratified by sex implicate 3 additional loci in men. Along with other novel genes and non-coding RNAs, a Parkinson's disease gene involved in dopamine regulation, PARK2, is associated with PTSD. Finally, we demonstrate that polygenic risk for PTSD is significantly predictive of re-experiencing symptoms in the Million Veteran Program dataset, although specific loci did not replicate. These results demonstrate the role of genetic variation in the biology of risk for PTSD and highlight the necessity of conducting sex-stratified analyses and expanding GWAS beyond European ancestry populations
Including diverse and admixed populations in genetic epidemiology research
The inclusion of ancestrally diverse participants in genetic studies can lead to new discoveries and is important to ensure equitable health care benefit from research advances. Here, members of the Ethical, Legal, Social, Implications (ELSI) committee of the International Genetic Epidemiology Society (IGES) offer perspectives on methods and analysis tools for the conduct of inclusive genetic epidemiology research, with a focus on admixed and ancestrally diverse populations in support of reproducible research practices. We emphasize the importance of distinguishing socially defined population categorizations from genetic ancestry in the design, analysis, reporting, and interpretation of genetic epidemiology research findings. Finally, we discuss the current state of genomic resources used in genetic association studies, functional interpretation, and clinical and public health translation of genomic findings with respect to diverse populations
μ μ₯μ μ 체μ°κ΄λΆμκ³Ό ννν μμΈ‘ μ°κ΅¬λ₯Ό μν μ ννΌν©λͺ¨ν
νμλ
Όλ¬Έ (μμ¬)-- μμΈλνκ΅ λνμ : λμ
μλͺ
κ³Όνλν λμλͺ
곡νλΆ, 2019. 2. κΉν¬λ°.With the advance of sequencing and genotyping technologies, a large amount of genomic data has been accumulated and is available for biological studies. Along with the development of statistical models and computational capabilities, sizable genomic data can be analyzed thoroughly. Processing large genomic data via statistical computation enables discerning the relationship between genotypes and phenotypes.
In this thesis, the main concern was how differences in genotypes are related to phenotypes. I conducted genome-wide association study to discover genetic variants correlated with phenotypes. Also, I constructed prediction models to precisely estimate phenotypes from genotypes. In the studies, various linear mixed models were applied to calculate the effects of genetic variants.
In chapter 2, genome-wide association study on intramuscular fat content of pig was performed. Statistically significant single nucleotide polymorphisms were found and annotated to genes. Genes related to mitogen-activated protein kinase pathway were identified as candidate genes affecting the intramuscular fat content of pigs.
In chapter 3, genomic prediction models using haplotype alleles were constructed. The models attempt to predict carcass weight in Hanwoo. Different haplotype defining methods were implemented and the prediction accuracies of them were compared. As a result, genomic prediction accuracy was higher when haplotype alleles were used compared to when individual SNPs were used.
In chapter 4, models predicting human height from genotype were developed. I designed a genomic best linear unbiased prediction model adjusted with parental height. In addition, variables having highest effects on height were selected using bootstrap resampling. Models using only the selected variables were tested, and consequently I could obtain a model with high prediction ability.
Through these studies, I could understand how linear mixed models can be applied to explain relationships between genotypic variation and phenotypic variation. The findings of this dissertation will help to extend the use of linear mixed models for understanding the genetic architectures in animals and human.μΌκΈ°μμ΄λΆμκ³Ό μ μ νμ§λΆμ κΈ°μ μ λ°μ μΌλ‘ μλ¬Όνμ μ°κ΅¬μ μ΄μ©ν μ μλ λ§μ μμ λ°μ΄ν°κ° μΆμ λμλ€. λΏλ§ μλλΌ ν΅κ³μ λͺ¨νμ΄ λ°λ¬νκ³ ν° λ°μ΄ν°λ₯Ό κ³μ°νλ λ₯λ ₯μ΄ ν₯μλλ©΄μ λ°©λν μμ μ μ μ 보μ λν λ³΄λ€ μ λ°ν λΆμμ΄ κ°λ₯ν΄μ‘λ€. ν΅κ³μ κ³μ°μ νμ©ν μ μ μ 보μ λΆμμΌλ‘λΆν° μ μ νμ§κ³Ό ννν κ°μ κ΄κ³λ₯Ό λ°νλΌ μ μλ€.
λ³Έ μ‘Έμ
λ
Όλ¬Έμμλ μ μ νμ§μ μ°¨μ΄κ° μ΄λ»κ² νννκ³Ό κ΄λ ¨μ΄ μλμ§λ₯Ό μ£Όλ‘ λ€λ£¨κ³ μλ€. μ°μ , μ μ₯μ μ 체μ°κ΄λΆμμ ν΅ν΄ νννκ³Ό μ°κ΄μ±μ΄ λμ μ μ λ³μ΄λ₯Ό μ°Ύμλ΄κ³ μ νμλ€. λν, μ μ νμ§λ‘λΆν° νννμ μ ννκ² μμΈ‘ν μ μλ λͺ¨νμ κ°λ°νκ³ μ νμλ€. νννμ λν μ μ λ³μ΄λ€μ ν¨κ³Όλ₯Ό μΆμ νκΈ° μν΄μ λ€μν μ ννΌν©λͺ¨νμ μ μ©νμλ€.
2μ₯μμλ λΌμ§μ κ·Όλ΄μ§λ°©λμ λν΄μ μ μ₯μ μ 체μ°κ΄λΆμμ μ€μνμλ€. μ΄λ‘λΆν° ν΅κ³μ μΌλ‘ μ μν ν¨κ³Όλ₯Ό κ°μ§λ λ¨μΌμΌκΈ°λ€νμ±λ€μ λ°κ²¬νκ³ , μ μν λ¨μΌμΌκΈ°λ€νμ±μ΄ ν¬ν¨λμ΄ μκ±°λ 물리μ μΌλ‘ κ°κΉμ΄ μλ μ μ μλ€μ μ°Ύμλ΄μλ€. μ°Ύμμ§ μ μ μλ€ μ€, λ§μ΄ν κ² νμ±ν λ¨λ°±μ§ ν€λμ κ²½λ‘μ κ΄λ ¨λ μ μ μλ€μ λΌμ§μ κ·Όλ΄μ§λ°©λμ μν₯μ μ£Όλ ν보 μ μ μλ€λ‘ μ μνμλ€.
3μ₯μμλ νμ°μ μ μ μνμΌλ‘λΆν° λ체μ€μ μμΈ‘νκΈ° μν΄ λ°μ체μ λ립 νμ§μ μ΄μ©ν μ μ 체 μμΈ‘μ μ§ννμλ€. λ€μν λ°©λ²μΌλ‘ λ°μ체λ₯Ό μ μνμκ³ , μ΄λ‘λΆν° μ»μ΄μ§ λ립 νμ§μ μ¬μ©νμ λ μ μ 체 μμΈ‘μ μ νλλ₯Ό λΉκ΅νμλ€. μ΄ λ, λ°μ체λ₯Ό μ΄μ©νμμ λμ μ νλκ° κ°κ°μ λ¨μΌμΌκΈ°λ€νμ±μ μ΄μ©νμ λμ μ νλλ³΄λ€ λκ² λνλ¬λ€.
4μ₯μμλ μ¬λμ μ μ μνμΌλ‘λΆν° ν€λ₯Ό μμΈ‘νλ λͺ¨νμ μ€κ³νμλ€. μμΈ‘ λͺ¨νμΌλ‘λ λΆλͺ¨μ ν€λ‘ 보μ λ μ΅μ μ νλΆνΈμΆμ λͺ¨νμ μ¬μ©νμλ€. λλΆμ΄ λΆνΈμ€νΈλ© μ¬μΆμΆμ νμ©νμ¬ ν€μ λ―ΈμΉλ μν₯μ΄ ν° λ¨μΌμΌκΈ°λ€νμ±μ μ ννμλ€. μ νλ λ¨μΌμΌκΈ°λ€νμ±λ§μ λ³μλ‘ μ¬μ©νλ λͺ¨νμ κ²μ¦ν κ²°κ³Ό, μμΈ‘λ ₯μ΄ λκ² λνλ¬λ€.
μμ μ°κ΅¬λ€μ ν΅ν΄ μ μ μνμμμ λ³μ΄μ νννμμμ λ³μ΄ μ¬μ΄μ κ΄κ³λ₯Ό μ€λͺ
νκΈ° μν΄ μ ννΌν©λͺ¨νμ μ΄λ»κ² μ μ©ν μ μλμ§ μ΄ν΄ν μ μμλ€. μ°κ΅¬μμ μ»μ΄μ§ κ²°κ³Όλ λλ¬Όκ³Ό μ¬λμ μ μ μ ꡬ쑰λ₯Ό μ΄ν΄νκΈ° μν μ ννΌν©λͺ¨νμ μ μ©μ νμ₯νλ κ²μ νμ©λ μ μλ€.Abstract β
°
Table of Contents β
³
List of Figures β
΅
List of Tables β
·
Chapter 1. Literature Review 1
1.1 Linear Mixed Models 2
1.2 Genome-Wide Association Study 2
1.3 Genomic Prediction 7
Chapter 2. Identification of genes related to intramuscular fat content of pig using genome-wide association study 13
2.1 Abstract 14
2.2 Introduction 15
2.3 Materials and Method 17
2.4 Results 20
2.5 Discussion 26
Chapter 3. Genomic prediction accuracies using haplotypes defined by different methods in Hanwoo 53
3.1 Abstract 54
3.2 Introduction 56
3.3 Materials and Method 59
3.4 Results 64
3.5 Discussion 72
Chapter 4. A height prediction model using selected genetic markers and parental heights in Korean 75
2.1 Abstract 76
2.2 Introduction 77
2.3 Materials and Method 79
2.4 Results and Discussion 83
Reference 91
κ΅λ¬Έμ΄λ‘ 99Maste
- β¦