35 research outputs found
Mapping genes with longitudinal phenotypes via Bayesian posterior probabilities
Most association studies focus on disease risk, with less attention paid to disease progression or severity. These phenotypes require longitudinal data. This paper presents a new method for analyzing longitudinal data to map genes in both population-based and family-based studies. Using simulated systolic blood pressure measurements obtained from Genetic Analysis Workshop 18, we cluster the phenotype data into trajectory subgroups. We then use the Bayesian posterior probability of being in the high subgroup as a quantitative trait in an association analysis with genotype data. This method maintains high power (\u3e80%) in locating genes known to affect the simulated phenotype for most specified significance levels (a). We believe that this method can be useful to aid in the discovery of genes that affect severity or progression of disease
Identity-by-descent estimation with population- and pedigree-based imputation in admixed family data
Background: In the past few years, imputation approaches have been mainly used in population-based designs of genome-wide association studies, although both family- and population-based imputation methods have been proposed. With the recent surge of family-based designs, family-based imputation has become more important. Imputation methods for both designs are based on identity-by-descent (IBD) information. Apart from imputation, the use of IBD information is also common for several types of genetic analysis, including pedigree-based linkage analysis.
Methods: We compared the performance of several family- and population-based imputation methods in large pedigrees provided by Genetic Analysis Workshop 19 (GAW19). We also evaluated the performance of a new IBD mapping approach that we propose, which combines IBD information from known pedigrees with information from unrelated individuals.
Results: Different combinations of the imputation methods have varied imputation accuracies. Moreover, we showed gains from the use of both known pedigrees and unrelated individuals with our IBD mapping approach over the use of known pedigrees only.
Conclusions: Our results represent accuracies of different combinations of imputation methods that may be useful for data sets similar to the GAW19 pedigree data. Our IBD mapping approach, which uses both known pedigree and unrelated individuals, performed better than classical linkage analysis
Estimating relationships between phenotypes and subjects drawn from admixed families.
Background: Estimating relationships among subjects in a sample, within family structures or caused by population substructure, is complicated in admixed populations. Inaccurate allele frequencies can bias both kinship estimates and tests for association between subjects and a phenotype. We analyzed the simulated and real family data from Genetic Analysis Workshop 19, and were aware of the simulation model.
Results: We found that kinship estimation is more accurate when marker data include common variants whose frequencies are less variable across populations. Estimates of heritability and association vary with age for longitudinally measured traits. Accounting for local ancestry identified different true associations than those identified by a traditional approach. Principal components aid kinship estimation and tests for association, but their utility is influenced by the frequency of the markers used to generate them.
Conclusions: Admixed families can provide a powerful resource for detecting disease loci, as well as analytical challenges. Allele frequencies, although difficult to adequately estimate in admixed populations, have a strong impact on the estimation of kinship, ancestry, and association with phenotypes. Approaches that acknowledge population structure in admixed families outperform those which ignore it
Genetic Candidate Variants in Two Multigenerational Families with Childhood Apraxia of Speech
Childhood apraxia of speech (CAS) is a severe and socially debilitating form of speech sound disorder with suspected genetic involvement, but the genetic etiology is not yet well understood. Very few known or putative causal genes have been identified to date, e.g., FOXP2 and BCL11A. Building a knowledge base of the genetic etiology of CAS will make it possible to identify infants at genetic risk and motivate the development of effective very early intervention programs. We investigated the genetic etiology of CAS in two large multigenerational families with familial CAS. Complementary genomic methods included Markov chain Monte Carlo linkage analysis, copy-number analysis, identity-by-descent sharing, and exome sequencing with variant filtering. No overlaps in regions with positive evidence of linkage between the two families were found. In one family, linkage analysis detected two chromosomal regions of interest, 5p15.1-p14.1, and 17p13.1-q11.1, inherited separately from the two founders. Single-point linkage analysis of selected variants identified CDH18 as a primary gene of interest and additionally, MYO10, NIPBL, GLP2R, NCOR1, FLCN, SMCR8, NEK8, and ANKRD12, possibly with additive effects. Linkage analysis in the second family detected five regions with LOD scores approaching the highest values possible in the family. A gene of interest was C4orf21(ZGRF1) on 4q25-q28.2. Evidence for previously described causal copy-number variations and validated or suspected genes was not found. Results are consistent with a heterogeneous CAS etiology, as is expected in many neurogenic disorders. Future studies will investigate genome variants in these and other families with CAS
Multipoint genome-wide linkage scan for nonword repetition in a multigenerational family further supports chromosome 13q as a locus for verbal trait disorders
Verbal trait disorders encompass a wide range of conditions and are marked by deficits in five domains that impair a person’s ability to communicate: speech, language, reading, spelling, and writing. Nonword repetition is a robust endophenotype for verbal trait disorders that is sensitive to cognitive processes critical to verbal development, including auditory processing, phonological working memory, and motor planning and programming. In the present study, we present a six-generation extended pedigree with a history of verbal trait disorders. Using genome-wide multipoint variance component linkage analysis of nonword repetition, we identified a region spanning chromosome 13q14–q21 with LOD = 4.45 between 52 and 55 cM, spanning approximately 5.5 Mb on chromosome 13. This region overlaps with SLI3, a locus implicated in reading disability in families with a history of specific language impairment. Our study of a large multigenerational family with verbal trait disorders further implicates the SLI3 region in verbal trait disorders. Future studies will further refine the specific causal genetic factors in this locus on chromosome 13q that contribute to language traits
Admixture mapping implicates 13q33.3 as ancestry-of-origin locus for Alzheimer disease in Hispanic and Latino populations
Alzheimer disease (AD) is the most common form of senile dementia, with high incidence late in life in many populations including Caribbean Hispanic (CH) populations. Such admixed populations, descended from more than one ancestral population, can present challenges for genetic studies, including limited sample sizes and unique analytical constraints. Therefore, CH populations and other admixed populations have not been well represented in studies of AD, and much of the genetic variation contributing to AD risk in these populations remains unknown. Here, we conduct genome-wide analysis of AD in multiplex CH families from the Alzheimer Disease Sequencing Project (ADSP). We developed, validated, and applied an implementation of a logistic mixed model for admixture mapping with binary traits that leverages genetic ancestry to identify ancestry-of-origin loci contributing to AD. We identified three loci on chromosome 13q33.3 associated with reduced risk of AD, where associations were driven by Native American (NAM) ancestry. This AD admixture mapping signal spans the FAM155A, ABHD13, TNFSF13B, LIG4, and MYO16 genes and was supported by evidence for association in an independent sample from the Alzheimer's Genetics in Argentina—Alzheimer Argentina consortium (AGA-ALZAR) study with considerable NAM ancestry. We also provide evidence of NAM haplotypes and key variants within 13q33.3 that segregate with AD in the ADSP whole-genome sequencing data. Interestingly, the widely used genome-wide association study approach failed to identify associations in this region. Our findings underscore the potential of leveraging genetic ancestry diversity in recently admixed populations to improve genetic mapping, in this case for AD-relevant loci.Fil: Horimoto, Andrea R.V.R.. University of Washington; Estados UnidosFil: Boyken, Lisa A.. University of Washington; Estados UnidosFil: Blue, Elizabeth E.. University of Washington; Estados Unidos. Brotman Baty Institute for Precision Medicine; Estados UnidosFil: Grinde, Kelsey E.. University of Washington; Estados Unidos. Macalester College; Estados UnidosFil: Nafikov, Rafael A.. University of Washington; Estados UnidosFil: Sohi, Harkirat K.. University of Washington; Estados UnidosFil: Nato, Alejandro Q.. University of Washington; Estados Unidos. Marshall University; Estados UnidosFil: Bis, Joshua C.. University of Washington; Estados UnidosFil: Brusco, Luis Ignacio. Universidad de Buenos Aires. Facultad de Medicina; Argentina. Consejo Nacional de Investigaciones CientÃficas y Técnicas; ArgentinaFil: Morelli, Laura. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones BioquÃmicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones BioquÃmicas de Buenos Aires; ArgentinaFil: Ramirez, Alfredo Jose. University Of Cologne; Alemania. Universitat Bonn; Alemania. German Center for Neurodegenerative Diseases; Alemania. University Of Texas Health Science Center At San Antonio (ut Health San Antonio) ; University Of Texas At San Antonio; . Universidad Nacional Arturo Jauretche. Unidad Ejecutora de Estudios en Neurociencias y Sistemas Complejos. Provincia de Buenos Aires. Ministerio de Salud. Hospital Alta Complejidad en Red El Cruce Dr. Néstor Carlos Kirchner Samic. Unidad Ejecutora de Estudios en Neurociencias y Sistemas Complejos. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Centro CientÃfico Tecnológico Conicet - La Plata. Unidad Ejecutora de Estudios en Neurociencias y Sistemas Complejos; ArgentinaFil: Dalmasso, Maria Carolina. Universidad Nacional Arturo Jauretche. Unidad Ejecutora de Estudios en Neurociencias y Sistemas Complejos. Provincia de Buenos Aires. Ministerio de Salud. Hospital Alta Complejidad en Red El Cruce Dr. Néstor Carlos Kirchner Samic. Unidad Ejecutora de Estudios en Neurociencias y Sistemas Complejos. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Centro CientÃfico Tecnológico Conicet - La Plata. Unidad Ejecutora de Estudios en Neurociencias y Sistemas Complejos; Argentina. University Of Cologne; AlemaniaFil: Temple, Seth. University of Washington; Estados UnidosFil: Satizabal, Claudia. University Of Texas Health Science Center At San Antonio (ut Health San Antonio) ; University Of Texas At San Antonio; . University of Texas at San Antonio; Estados UnidosFil: Browning, Sharon R.. University of Washington; Estados UnidosFil: Seshadri, Sudha. University Of Texas Health Science Center At San Antonio (ut Health San Antonio) ; University Of Texas At San Antonio; . University of Texas at San Antonio; Estados UnidosFil: Wijsman, Ellen M.. University of Washington; Estados UnidosFil: Thornton, Timothy A.. University of Washington; Estados Unido
Identity-by-descent estimation with population- and pedigree-based imputation in admixed family data
BACKGROUND: In the past few years, imputation approaches have been mainly used in population-based designs of genome-wide association studies, although both family- and population-based imputation methods have been proposed. With the recent surge of family-based designs, family-based imputation has become more important. Imputation methods for both designs are based on identity-by-descent (IBD) information. Apart from imputation, the use of IBD information is also common for several types of genetic analysis, including pedigree-based linkage analysis. METHODS: We compared the performance of several family- and population-based imputation methods in large pedigrees provided by Genetic Analysis Workshop 19 (GAW19). We also evaluated the performance of a new IBD mapping approach that we propose, which combines IBD information from known pedigrees with information from unrelated individuals. RESULTS: Different combinations of the imputation methods have varied imputation accuracies. Moreover, we showed gains from the use of both known pedigrees and unrelated individuals with our IBD mapping approach over the use of known pedigrees only. CONCLUSIONS: Our results represent accuracies of different combinations of imputation methods that may be useful for data sets similar to the GAW19 pedigree data. Our IBD mapping approach, which uses both known pedigree and unrelated individuals, performed better than classical linkage analysis
Lack of associations of ten candidate coronary heart disease risk genetic variants and subclinical atherosclerosis in four U.S. populations: The Population Architecture using Genomics and Epidemiology (PAGE) study
A number of genetic variants have been discovered by recent genome-wide association studies for their associations with clinical coronary heart disease (CHD). However, it is unclear whether these variants are also associated with the development of CHD as measured by subclinical atherosclerosis phenotypes, ankle brachial index (ABI), carotid artery intima-media thickness (cIMT) and carotid plaque
No evidence of interaction between known lipid-associated genetic variants and smoking in the multi-ethnic PAGE population
Genome-wide association studies (GWAS) have identified many variants that influence high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, and/or triglycerides. However, environmental modifiers, such as smoking, of these known genotype–phenotype associations are just recently emerging in the literature. We have tested for interactions between smoking and 49 GWAS-identified variants in over 41,000 racially/ethnically diverse samples with lipid levels from the Population Architecture Using Genomics and Epidemiology (PAGE) study. Despite their biological plausibility, we were unable to detect significant SNP × smoking interactions
Nuclear energy in the post-genomic era
\u27Genomics,\u27 the term first proposed in 1986, is the discipline of understanding genomes powered by an unprecedented accumulation of information on genetic sequences of a wide archive of life forms, foremost of which is the genome of man. After more than two decades of international effort in decoding life\u27s alphabet and culminating with the first working draft of the human DNA this year, the \u27postgenomic\u27 era aims to annotate the existing genetic data, elucidate gene functions and find revolutionary applications in almost all branches of life and physical sciences, leaving no bough untouched, not even nuclear science