673 research outputs found

    CONTRIBUTION TO LINKAGE AND ASSOCIATION MAPPING OF TRAIT LOCI IN LIVESTOCK.

    Full text link
    Until recently, breeding values were estimated based on phenotypes measured on the individual and its relatives, and the notion that the covariance between breeding values is proportionate to the kinship coefficient. Advances in genomics now allow for direct analysis of the genome and identification of the loci that determine the breeding values of individuals. As a consequence, marker assisted selection and genomic selection have become more effective and are replacing conventional selection. The identification of loci influencing the traits of interest requires the use of advanced statistical methods that are constantly evolving. In the context of this thesis, we have (i) contributed to the development of gene mapping methods, (ii) applied these methods to map loci influencing both metric and meristic traits, and (iii) contributed to the development of methods for the integration of genomic information in livestock breeding and management. The mapping methods that we have helped developing distinguish themselves mainly by the fact that (i) they exploit haplotype information (by means of a hidden markov model) which should increase the linkage disequilibrium with causative variants and hence detection power, (ii) they can simultaneously extract linkage information within families, and linkage disequilibrium information across the population, and (iii) they correct for population stratification by means of a random polygenic effect, and (iv) they can be applied to binary as well as quantitative traits. We have applied these and other methods to map loci influencing (i) quantitative hematological parameters in a porcine line-cross, and (ii) binary traits including diseases in bovine and non-syntenic Copy Number Variants in cattle, horse and human. In fine, we have contributed to the development of methods for the utilization of marker information in animal selection and production. We have extended the haplotype-based mapping method to allow imputation and have evaluated the utility of this approach in scenarios mimicking reality. We have also contributed to the development of a method to quantify somatic cell counts in the milk of individual cows by genotyping a sample of milk from the farm’s tank (hence a mixture of milk from all cows on the farm) Our work has resulted in the development of a software package (β€œGLASCOW”) that is increasingly used by the community to map genes influencing complex traits, primarily binary. By using this tool, we have contributed to the localization of several trait loci in pig, cattle, horse and human. We have contributed to the development of approaches that reduce the costs of genomic analyses in livestock by, on the one hand, complementing real SNP genotypes with genotypes obtained in silico by means imputation, and, on the other hand, by developing a method to deconvolute genotypes obtained on DNA pools

    Genome-wide Association Studies in Ancestrally Diverse Populations: Opportunities, Methods, Pitfalls, and Recommendations

    Get PDF
    Genome-wide association studies (GWASs) have focused primarily on populations of European descent, but it is essential that diverse populations become better represented. Increasing diversity among study participants will advance our understanding of genetic architecture in all populations and ensure that genetic research is broadly applicable. To facilitate and promote research in multi-ancestry and admixed cohorts, we outline key methodological considerations and highlight opportunities, challenges, solutions, and areas in need of development. Despite the perception that analyzing genetic data from diverse populations is difficult, it is scientifically and ethically imperative, and there is an expanding analytical toolbox to do it well

    Mapping genes underlying ethnic differences in tuberculosis risk by linkage disequilibrium in the South African coloured population of the Western Cape

    Get PDF
    Includes bibliographical references.The South Africa Coloured population of the Western Cape is the result of unions between Europeans, Africans (Bantu and Khoisan), and various other populations (Malaysian or Indonesian descent). The world-wide burden of tuberculosis remains an enormous problem, and is particularly severe in this population. In general, admixed populations that have arisen in historical times can make an important contribution to the discovery of disease susceptibility genes if the parental populations exhibit substantial variation in susceptibility. Despite numerous successful genome-wide association studies, detecting variants that have low disease risk still poses a challenge. Furthermore, admixture association studies for multi-way admixed populations pose constant challenges, including the choice of an accurate ancestral panel to infer ancestry and for imputing missing genotypes to identify possible genetic variants causing susceptibility to disease. This thesis addresses some of these challenges. We first developed PROXYANC, an approach to select the best proxy ancestral populations for admixed populations. From the simulation of a multi-way admixed population, we demonstrated the ability and accuracy of PROXYANC in selecting the best proxy ancestry and illustrated the importance of the choice of ancestries in both estimating admixture proportions and imputing missing genotypes. We applied this approach to the South African Coloured population, to refine both the choice of ancestral populations and their genetic contributions. We also demonstrated that the ancestral allele frequency differences correlated with increased linkage disequilibrium in the SAC, and that the increased LD originates from admixture events rather than population bottlenecks. Secondly, we conducted a study to determine whether ancestry-specific genetic contributions affect tuberculosis risk. We additionally conducted imputation genome-wide association studies and a meta-analysis incorporating previous genome-wide association studies of tuberculosis

    Polygenicity and Epistasis Underlie Fitness-Proximal Traits in the Caenorhabditis elegans Multiparental Experimental Evolution (CeMEE) Panel

    Get PDF
    The deposited article is a pre-print version and it has not been submitted to peer reviewing. This article version was provided by bioRxiv and is the preprint first posted online Mar. 26, 2017. This publication hasn't any creative commons license associated. The deposited article version contains attached the supplementary materials within the pdf.Understanding the genetic basis of complex traits remains a major challenge in biology. Polygenicity, phenotypic plasticity and epistasis contribute to phenotypic variance in ways that are rarely clear. This uncertainty can be problematic for estimating heritability, for predicting individual phenotypes from genomic data, and for parameterizing models of phenotypic evolution. Here we report an advanced recombinant inbred line (RIL) quantitative trait locus (QTL) mapping panel for the hermaphroditic nematode Caenorhabditis elegans, the C. elegans multiparental experimental evolution (CeMEE) panel. The CeMEE panel, comprising 507 RILs at present, was created by hybridization of 16 wild isolates, experimental evolution for 140-190 generations, and inbreeding by selfing for 13-16 generations. The panel contains 22% of single nucleotide polymorphisms known to segregate in natural populations, and complements existing C. elegans mapping resources by providing fine resolution and high nucleotide diversity across >95% of the genome. We apply it to study the genetic basis of two fitness components, fertility and hermaphrodite body size at time of reproduction, with high broad sense heritability in the CeMEE. While simulations show we should detect common alleles with additive effects as small as 5%, at gene-level resolution, the genetic architectures of these traits does not feature such alleles. We instead find that a significant fraction of trait variance, approaching 40% for fertility, can be explained by sign epistasis with main effects below the detection limit. In congruence, phenotype prediction from genomic similarity, while generally poor (r2 < 10%), requires modeling epistasis for optimal accuracy, with most variance attributed to the rapidly evolving chromosome arms.National Science Foundation grant: (PHY-1125915); National Institutes of Health grants: (R25-GM-067110, R01-GM-089972, R01-GM-121828); Gordon and Betty Moore Foundation grant: (2919.01); Human Frontiers Science Program (RGP0045/2010); European Research Council grant: (FP7/2007-2013/243285); Agence Nationale de la Recherche grant: (ANR-14-ACHN-0032-01).info:eu-repo/semantics/publishedVersio

    Genetic variation and health in rural Caribbean village

    Get PDF
    Bwa Mawego is a small-scale horticultural community (~500 people) on the island of Dominica that has been the site of a longitudinal health research project for more than 30 years. Cardiovascular diseases and metabolic health are growing local concerns. Here we analyze longitudinal growth data, cardiometabolic metrics, and genome-wide single nucleotide polymorphism (SNP) data from this population to investigate sources of variation in anthropometric and cardiometabolic outcomes. Mixed effect heritability models indicate that (1) variation in body mass index (BMI) is significantly shaped by genetic variation, and (2) variation between longitudinal BMI curves has not been consistently impacted by secular environmental trends from 1997 2017. In order to assess genetic variation in more detail, we first characterize the population structure and admixture in this Caribbean community using high-density SNP data and global reference samples in the Human Genome Diversity Panel. We detect four distinct family clusters and admixture from African, European, and Amerindian ancestral populations that occurred 5-6 generations ago (~130-150 years). Amerindian haplotypes represented in Bwa Mawego associate with deeply diverged lineages in Karitiana and Surui peoples, highlighting the regionally variable nature of admixture throughout the Caribbean and unique historical outcomes in Dominica. Genome-wide association tests of cardiometabolic phenotypes identify a genomic region of interest downstream of the ANK3 gene that associates with BMI in Bwa Mawego, after controlling for confounding variation from ancestral population structure and relatedness. Any functional relationship between ANK3 and BMI is currently uncharacterized, and there is unique potential to further explore complex gene-environment-phenotype landscapes in Bwa Mawego.Includes bibliographical reference

    Including diverse and admixed populations in genetic epidemiology research

    Get PDF
    The inclusion of ancestrally diverse participants in genetic studies can lead to new discoveries and is important to ensure equitable health care benefit from research advances. Here, members of the Ethical, Legal, Social, Implications (ELSI) committee of the International Genetic Epidemiology Society (IGES) offer perspectives on methods and analysis tools for the conduct of inclusive genetic epidemiology research, with a focus on admixed and ancestrally diverse populations in support of reproducible research practices. We emphasize the importance of distinguishing socially defined population categorizations from genetic ancestry in the design, analysis, reporting, and interpretation of genetic epidemiology research findings. Finally, we discuss the current state of genomic resources used in genetic association studies, functional interpretation, and clinical and public health translation of genomic findings with respect to diverse populations

    μ „μž₯μœ μ „μ²΄μ—°κ΄€λΆ„μ„κ³Ό ν‘œν˜„ν˜• 예츑 연ꡬλ₯Ό μœ„ν•œ μ„ ν˜•ν˜Όν•©λͺ¨ν˜•

    Get PDF
    ν•™μœ„λ…Όλ¬Έ (석사)-- μ„œμšΈλŒ€ν•™κ΅ λŒ€ν•™μ› : 농업생λͺ…κ³Όν•™λŒ€ν•™ 농생λͺ…곡학뢀, 2019. 2. κΉ€ν¬λ°œ.With the advance of sequencing and genotyping technologies, a large amount of genomic data has been accumulated and is available for biological studies. Along with the development of statistical models and computational capabilities, sizable genomic data can be analyzed thoroughly. Processing large genomic data via statistical computation enables discerning the relationship between genotypes and phenotypes. In this thesis, the main concern was how differences in genotypes are related to phenotypes. I conducted genome-wide association study to discover genetic variants correlated with phenotypes. Also, I constructed prediction models to precisely estimate phenotypes from genotypes. In the studies, various linear mixed models were applied to calculate the effects of genetic variants. In chapter 2, genome-wide association study on intramuscular fat content of pig was performed. Statistically significant single nucleotide polymorphisms were found and annotated to genes. Genes related to mitogen-activated protein kinase pathway were identified as candidate genes affecting the intramuscular fat content of pigs. In chapter 3, genomic prediction models using haplotype alleles were constructed. The models attempt to predict carcass weight in Hanwoo. Different haplotype defining methods were implemented and the prediction accuracies of them were compared. As a result, genomic prediction accuracy was higher when haplotype alleles were used compared to when individual SNPs were used. In chapter 4, models predicting human height from genotype were developed. I designed a genomic best linear unbiased prediction model adjusted with parental height. In addition, variables having highest effects on height were selected using bootstrap resampling. Models using only the selected variables were tested, and consequently I could obtain a model with high prediction ability. Through these studies, I could understand how linear mixed models can be applied to explain relationships between genotypic variation and phenotypic variation. The findings of this dissertation will help to extend the use of linear mixed models for understanding the genetic architectures in animals and human.μ—ΌκΈ°μ„œμ—΄λΆ„μ„κ³Ό μœ μ „ν˜•μ§ˆλΆ„μ„ 기술의 λ°œμ „μœΌλ‘œ 생물학적 연ꡬ에 μ΄μš©ν•  수 μžˆλŠ” λ§Žμ€ μ–‘μ˜ 데이터가 μΆ•μ λ˜μ—ˆλ‹€. 뿐만 μ•„λ‹ˆλΌ 톡계적 λͺ¨ν˜•μ΄ λ°œλ‹¬ν•˜κ³  큰 데이터λ₯Ό κ³„μ‚°ν•˜λŠ” λŠ₯λ ₯이 ν–₯μƒλ˜λ©΄μ„œ λ°©λŒ€ν•œ μ–‘μ˜ μœ μ „μ •λ³΄μ— λŒ€ν•œ 보닀 μ •λ°€ν•œ 뢄석이 κ°€λŠ₯ν•΄μ‘Œλ‹€. 톡계적 계산을 ν™œμš©ν•œ μœ μ „μ •λ³΄μ˜ λΆ„μ„μœΌλ‘œλΆ€ν„° μœ μ „ν˜•μ§ˆκ³Ό ν‘œν˜„ν˜• κ°„μ˜ 관계λ₯Ό λ°ν˜€λ‚Ό 수 μžˆλ‹€. λ³Έ μ‘Έμ—…λ…Όλ¬Έμ—μ„œλŠ” μœ μ „ν˜•μ§ˆμ˜ 차이가 μ–΄λ–»κ²Œ ν‘œν˜„ν˜•κ³Ό 관련이 μžˆλŠ”μ§€λ₯Ό 주둜 닀루고 μžˆλ‹€. μš°μ„ , μ „μž₯μœ μ „μ²΄μ—°κ΄€λΆ„μ„μ„ 톡해 ν‘œν˜„ν˜•κ³Ό 연관성이 높은 μœ μ „ 변이λ₯Ό μ°Ύμ•„λ‚΄κ³ μž ν•˜μ˜€λ‹€. λ˜ν•œ, μœ μ „ν˜•μ§ˆλ‘œλΆ€ν„° ν‘œν˜„ν˜•μ„ μ •ν™•ν•˜κ²Œ μ˜ˆμΈ‘ν•  수 μžˆλŠ” λͺ¨ν˜•μ„ κ°œλ°œν•˜κ³ μž ν•˜μ˜€λ‹€. ν‘œν˜„ν˜•μ— λŒ€ν•œ μœ μ „ λ³€μ΄λ“€μ˜ 효과λ₯Ό μΆ”μ •ν•˜κΈ° μœ„ν•΄μ„œ λ‹€μ–‘ν•œ μ„ ν˜•ν˜Όν•©λͺ¨ν˜•μ„ μ μš©ν•˜μ˜€λ‹€. 2μž₯μ—μ„œλŠ” λΌμ§€μ˜ 근내지방도에 λŒ€ν•΄μ„œ μ „μž₯μœ μ „μ²΄μ—°κ΄€λΆ„μ„μ„ μ‹€μ‹œν•˜μ˜€λ‹€. μ΄λ‘œλΆ€ν„° ν†΅κ³„μ μœΌλ‘œ μœ μ˜ν•œ 효과λ₯Ό κ°€μ§€λŠ” λ‹¨μΌμ—ΌκΈ°λ‹€ν˜•μ„±λ“€μ„ λ°œκ²¬ν–ˆκ³ , μœ μ˜ν•œ λ‹¨μΌμ—ΌκΈ°λ‹€ν˜•μ„±μ΄ ν¬ν•¨λ˜μ–΄ μžˆκ±°λ‚˜ 물리적으둜 κ°€κΉŒμ΄ μžˆλŠ” μœ μ „μžλ“€μ„ μ°Ύμ•„λ‚΄μ—ˆλ‹€. 찾아진 μœ μ „μžλ“€ 쀑, λ§ˆμ΄ν† κ² ν™œμ„±ν™” λ‹¨λ°±μ§ˆ ν‚€λ‚˜μ œ κ²½λ‘œμ™€ κ΄€λ ¨λœ μœ μ „μžλ“€μ„ λΌμ§€μ˜ 근내지방도에 영ν–₯을 μ£ΌλŠ” 후보 μœ μ „μžλ“€λ‘œ μ œμ‹œν•˜μ˜€λ‹€. 3μž₯μ—μ„œλŠ” ν•œμš°μ˜ μœ μ „μžν˜•μœΌλ‘œλΆ€ν„° 도체쀑을 μ˜ˆμΈ‘ν•˜κΈ° μœ„ν•΄ 반수체의 λŒ€λ¦½ ν˜•μ§ˆμ„ μ΄μš©ν•œ μœ μ „μ²΄ μ˜ˆμΈ‘μ„ μ§„ν–‰ν•˜μ˜€λ‹€. λ‹€μ–‘ν•œ λ°©λ²•μœΌλ‘œ 반수체λ₯Ό μ •μ˜ν•˜μ˜€κ³ , μ΄λ‘œλΆ€ν„° 얻어진 λŒ€λ¦½ ν˜•μ§ˆμ„ μ‚¬μš©ν–ˆμ„ λ•Œ μœ μ „μ²΄ 예츑의 정확도λ₯Ό λΉ„κ΅ν•˜μ˜€λ‹€. 이 λ•Œ, 반수체λ₯Ό μ΄μš©ν•˜μ˜€μ„ λ•Œμ˜ 정확도가 개개의 λ‹¨μΌμ—ΌκΈ°λ‹€ν˜•μ„±μ„ μ΄μš©ν–ˆμ„ λ•Œμ˜ 정확도보닀 λ†’κ²Œ λ‚˜νƒ€λ‚¬λ‹€. 4μž₯μ—μ„œλŠ” μ‚¬λžŒμ˜ μœ μ „μžν˜•μœΌλ‘œλΆ€ν„° ν‚€λ₯Ό μ˜ˆμΈ‘ν•˜λŠ” λͺ¨ν˜•μ„ μ„€κ³„ν•˜μ˜€λ‹€. 예츑 λͺ¨ν˜•μœΌλ‘œλŠ” λΆ€λͺ¨μ˜ ν‚€λ‘œ λ³΄μ •λœ μ΅œμ μ„ ν˜•λΆˆνŽΈμΆ”μ • λͺ¨ν˜•μ„ μ‚¬μš©ν•˜μ˜€λ‹€. λ”λΆˆμ–΄ λΆ€νŠΈμŠ€νŠΈλž© μž¬μΆ”μΆœμ„ ν™œμš©ν•˜μ—¬ 킀에 λ―ΈμΉ˜λŠ” 영ν–₯이 큰 λ‹¨μΌμ—ΌκΈ°λ‹€ν˜•μ„±μ„ μ„ νƒν•˜μ˜€λ‹€. μ„ νƒλœ λ‹¨μΌμ—ΌκΈ°λ‹€ν˜•μ„±λ§Œμ„ λ³€μˆ˜λ‘œ μ‚¬μš©ν•˜λŠ” λͺ¨ν˜•μ„ κ²€μ¦ν•œ κ²°κ³Ό, 예츑λ ₯이 λ†’κ²Œ λ‚˜νƒ€λ‚¬λ‹€. μœ„μ˜ 연ꡬ듀을 톡해 μœ μ „μžν˜•μ—μ„œμ˜ 변이와 ν‘œν˜„ν˜•μ—μ„œμ˜ 변이 μ‚¬μ΄μ˜ 관계λ₯Ό μ„€λͺ…ν•˜κΈ° μœ„ν•΄ μ„ ν˜•ν˜Όν•©λͺ¨ν˜•μ„ μ–΄λ–»κ²Œ μ μš©ν•  수 μžˆλŠ”μ§€ 이해할 수 μžˆμ—ˆλ‹€. μ—°κ΅¬μ—μ„œ 얻어진 κ²°κ³ΌλŠ” 동물과 μ‚¬λžŒμ˜ μœ μ „μ  ꡬ쑰λ₯Ό μ΄ν•΄ν•˜κΈ° μœ„ν•œ μ„ ν˜•ν˜Όν•©λͺ¨ν˜•μ˜ μ μš©μ„ ν™•μž₯ν•˜λŠ” 것에 ν™œμš©λ  수 μžˆλ‹€.Abstract β…° Table of Contents β…³ List of Figures β…΅ List of Tables β…· Chapter 1. Literature Review 1 1.1 Linear Mixed Models 2 1.2 Genome-Wide Association Study 2 1.3 Genomic Prediction 7 Chapter 2. Identification of genes related to intramuscular fat content of pig using genome-wide association study 13 2.1 Abstract 14 2.2 Introduction 15 2.3 Materials and Method 17 2.4 Results 20 2.5 Discussion 26 Chapter 3. Genomic prediction accuracies using haplotypes defined by different methods in Hanwoo 53 3.1 Abstract 54 3.2 Introduction 56 3.3 Materials and Method 59 3.4 Results 64 3.5 Discussion 72 Chapter 4. A height prediction model using selected genetic markers and parental heights in Korean 75 2.1 Abstract 76 2.2 Introduction 77 2.3 Materials and Method 79 2.4 Results and Discussion 83 Reference 91 ꡭ문초둝 99Maste
    • …
    corecore