57 research outputs found

    The Genomics of Oral Poliovirus Vaccine Response in Bangladeshi Infants

    Get PDF
    The success of Oral Poliovirus Vaccine (OPV) in eradicating poliovirus has set an example for the immense potential of oral vaccines in preventing enteric infections. It is widely considered the standard for oral vaccines aiming to elicit a mucosal immune response. Despite being validated in diverse populations worldwide, there still remain some individuals that fail to mount an adequate response to vaccination with OPV. It has been hypothesized that this may be due to host genetics, as the heritability is estimated to be high (60%) and there have been ethnic differences in response. To address this question we conducted a genome-wide association study (GWAS) in 357 Bangladeshi children comparing individuals that fail to mount an immune response to high responders of OPV. Four different approaches were conducted to elucidate genetic risk loci: (1) a traditional GWAS analysis, (2) a correlation of the GWAS results with signatures of positive selection, (3) an application of gene-level methods to the GWAS results, and (4) an application of pathway-level methods to the GWAS results. Because there is no consensus as to the best gene- and pathway-level methods, a simulation experiment was conducted to systematically evaluate their relative performance. The traditional GWAS assessed the association of 6.6 million single nucleotide polymorphisms (SNPs) across the human genome, adjusted for stunting (height-for-age Z-score (HAZ) < -2). While there were not any genome-wide significant results (P<5x10-8), several suggestive associations were found on chromosomes 7 and 14 (P<5x10-6). On chromosome 7, the top association was found at rs55906254 (OR=0.31, P=3.5x10-6). Found upstream of SHH (sonic hedgehog), the minor allele of this SNP conferred decreased odds of high seropositive status versus seronegative. On chromosome 14, the top association was downstream of MAPK1IP1L (mitogen-activated protein kinase 1 interacting protein 1-like) at rs113427985 (OR=0.22, P=2.9x10-6). To measure regions under positive selection, the cross-population extended haplotype homozygosity (XP-EHH) was calculated. To correlate these with the GWAS results, a filter was used in which SNPs had to have a P-value from the GWAS less than 0.001 and a P-value from the selection scan below 0.01. A total of 32 SNPs reached this threshold, half of which were between FAM86A (family with sequence similarity 86, member A) and RBFOX1 (RNA-binding protein, fox-1 homolog). The non-ancestral alleles of these SNPs were associated with high seropositive status. Therefore, it is likely that mutations arose in this region that were beneficial to either OPV immunity or another ancestral pathogen and were preserved. Before the gene- and pathway-level methods were applied to the OPV GWAS, a simulation experiment was conducted to determine which methods were the best. These methods were developed to aggregate signals from the GWAS into gene- and pathway-level units, increasing the power to detect associations and offering biological interpretation. Using genotypic data from the Wellcome Trust Case Control Consortium (WTCCC), a phenotype was simulated assuming an additive polygenic model. A total of 12 gene-level methods and 10 pathway-level methods were systematically evaluated. The gene-level method with the best balance of sensitivity and specificity was VEGAS using only the top 10% of the associated SNPs within the gene. MAGENTA and GSA-SNP had the best performance of all the pathway-level methods. These methods were then applied to the GWAS of OPV. The gene-level results highlighted the potential role of histone modifications as the top results included many histone marks within histone cluster 1 on chromosome 6. Pathway-level methods using the Gene Ontology Biological Processes showed enrichment in gene sets related to cyclic AMP as a second messenger and its relationship with G-protein signaling. Additional associations were found in neurological development. Taken together, this dissertation seeks to elucidate the host genomics of immunity to OPV. The four different approaches were complementary to each other, highlighting different genes and pathways that may relate to the underlying mechanisms of the immunological response. The population-level results may be related to the individual response. Further investigation into the associations may reveal potential adjuvants and improved vaccines, not only for oral poliovirus vaccine but also for other mucosal vaccines for enteric infections

    Strategies for Enriching Variant Coverage in Candidate Disease Loci on a Multiethnic Genotyping Array

    Get PDF
    Investigating genetic architecture of complex traits in ancestrally diverse populations is imperative to understand the etiology of disease. However, the current paucity of genetic research in people of African and Latin American ancestry, Hispanic and indigenous peoples in the United States is likely to exacerbate existing health disparities for many common diseases. The Population Architecture using Genomics and Epidemiology, Phase II (PAGE II), Study was initiated in 2013 by the National Human Genome Research Institute to expand our understanding of complex trait loci in ethnically diverse and well characterized study populations. To meet this goal, the Multi-Ethnic Genotyping Array (MEGA) was designed to substantially improve fine-mapping and functional discovery by increasing variant coverage across multiple ethnicities at known loci for metabolic, cardiovascular, renal, inflammatory, anthropometric, and a variety of lifestyle traits. Studying the frequency distribution of clinically relevant mutations, putative risk alleles, and known functional variants across multiple populations will provide important insight into the genetic architecture of complex diseases and facilitate the discovery of novel, sometimes population-specific, disease associations. DNA samples from 51,650 self-identified African ancestry (17,328), Hispanic/Latino (22,379), Asian/Pacific Islander (8,640), and American Indian (653) and an additional 2,650 participants of either South Asian or European ancestry, and other reference panels have been genotyped on MEGA by PAGE II. MEGA was designed as a new resource for studying ancestrally diverse populations. Here, we describe the methodology for selecting trait-specific content for use in multi-ethnic populations and how enriching MEGA for this content may contribute to deeper biological understanding of the genetic etiology of complex disease

    Discovering prescription patterns in pediatric acute-onset neuropsychiatric syndrome patients

    Get PDF
    OBJECTIVE: Pediatric acute-onset neuropsychiatric syndrome (PANS) is a complex neuropsychiatric syndrome characterized by an abrupt onset of obsessive-compulsive symptoms and/or severe eating restrictions, along with at least two concomitant debilitating cognitive, behavioral, or neurological symptoms. A wide range of pharmacological interventions along with behavioral and environmental modifications, and psychotherapies have been adopted to treat symptoms and underlying etiologies. Our goal was to develop a data-driven approach to identify treatment patterns in this cohort. MATERIALS AND METHODS: In this cohort study, we extracted medical prescription histories from electronic health records. We developed a modified dynamic programming approach to perform global alignment of those medication histories. Our approach is unique since it considers time gaps in prescription patterns as part of the similarity strategy. RESULTS: This study included 43 consecutive new-onset pre-pubertal patients who had at least 3 clinic visits. Our algorithm identified six clusters with distinct medication usage history which may represent clinician\u27s practice of treating PANS of different severities and etiologies i.e., two most severe groups requiring high dose intravenous steroids; two arthritic or inflammatory groups requiring prolonged nonsteroidal anti-inflammatory drug (NSAID); and two mild relapsing/remitting group treated with a short course of NSAID. The psychometric scores as outcomes in each cluster generally improved within the first two years. DISCUSSION AND CONCLUSION: Our algorithm shows potential to improve our knowledge of treatment patterns in the PANS cohort, while helping clinicians understand how patients respond to a combination of drugs

    Genome-Wide Association Study of Cryptosporidiosis in Infants Implicates PRKCA.

    Get PDF
    Diarrhea is a major cause of both morbidity and mortality worldwide, especially among young children. Cryptosporidiosis is a leading cause of diarrhea in children, particularly in South Asia and sub-Saharan Africa, where it is responsible for over 200,000 deaths per year. Beyond the initial clinical presentation of diarrhea, it is associated with long-term sequelae such as malnutrition and neurocognitive developmental deficits. Risk factors include poverty and overcrowding, and yet not all children with these risk factors and exposure are infected, nor do all infected children develop symptomatic disease. One potential risk factor to explain these differences is their human genome. To identify genetic variants associated with symptomatic cryptosporidiosis, we conducted a genome-wide association study (GWAS) examining 6.5 million single nucleotide polymorphisms (SNPs) in 873 children from three independent cohorts in Dhaka, Bangladesh, namely, the Dhaka Birth Cohort (DBC), the Performance of Rotavirus and Oral Polio Vaccines in Developing Countries (PROVIDE) study, and the Cryptosporidiosis Birth Cohort (CBC). Associations were estimated separately for each cohort under an additive model, adjusting for length-for-age Z-score at 12 months of age, the first two principal components to account for population substructure, and genotyping batch. The strongest meta-analytic association was with rs58296998 (P = 3.73 × 10-8), an intronic SNP and expression quantitative trait locus (eQTL) of protein kinase C alpha (PRKCA). Each additional risk allele conferred 2.4 times the odds of Cryptosporidium-associated diarrhea in the first year of life. This genetic association suggests a role for protein kinase C alpha in pediatric cryptosporidiosis and warrants further investigation.IMPORTANCE Globally, diarrhea remains one of the major causes of pediatric morbidity and mortality. The initial symptoms of diarrhea can often lead to long-term consequences for the health of young children, such as malnutrition and neurocognitive developmental deficits. Despite many children having similar exposures to infectious causes of diarrhea, not all develop symptomatic disease, indicating a possible role for human genetic variation. Here, we conducted a genetic study of susceptibility to symptomatic disease associated with Cryptosporidium infection (a leading cause of diarrhea) in three independent cohorts of infants from Dhaka, Bangladesh. We identified a genetic variant within protein kinase C alpha (PRKCA) associated with higher risk of cryptosporidiosis in the first year of life. These results indicate a role for human genetics in susceptibility to cryptosporidiosis and warrant further research to elucidate the mechanism

    Identification of functional genetic variation in exome sequence analysis

    Get PDF
    Recent technological advances have allowed us to study individual genomes at a base-pair resolution and have demonstrated that the average exome harbors more than 15,000 genetic variants. However, our ability to understand the biological significance of the identified variants and to connect these observed variants with phenotypes is limited. The first step in this process is to identify genetic variation that is likely to result in changes to protein structure and function, because detailed studies, either population based or functional, for each of the identified variants are not practicable. Therefore algorithms that yield valid predictions of a variant’s functional significance are needed. Over the past decade, several programs have been developed to predict the probability that an observed sequence variant will have a deleterious effect on protein function. These algorithms range from empirical programs that classify using known biochemical properties to statistical algorithms trained using a variety of data sources, including sequence conservation data, biochemical properties, and functional data. Using data from the pilot3 study of the 1000 Genomes Project available through Genetic Analysis Workshop 17, we compared the results of four programs (SIFT, PolyPhen, MAPP, and VarioWatch) used to predict the functional relevance of variants in 101 genes. Analysis was conducted without knowledge of the simulation model. Agreement between programs was modest ranging from 59.4% to 71.4% and only 3.5% of variants were classified as deleterious and 10.9% as tolerated across all four programs

    A Sex-Stratified Genome-Wide Association Study of Tuberculosis Using a Multi-Ethnic Genotyping Array

    Get PDF
    Tuberculosis (TB), caused by Mycobacterium tuberculosis, is a complex disease with a known human genetic component. Males seem to be more affected than females and in most countries the TB notification rate is twice as high in males than in females. While socio-economic status, behavior and sex hormones influence the male bias they do not fully account for it. Males have only one copy of the X chromosome, while diploid females are subject to X chromosome inactivation. In addition, the X chromosome codes for many immune-related genes, supporting the hypothesis that X-linked genes could contribute to TB susceptibility in a sex-biased manner. We report the first TB susceptibility genome-wide association study (GWAS) with a specific focus on sex-stratified autosomal analysis and the X chromosome. A total of 810 individuals (410 cases and 405 controls) from an admixed South African population were genotyped using the Illumina Multi Ethnic Genotyping Array, specifically designed as a suitable platform for diverse and admixed populations. Association testing was done on the autosome (8,27,386 variants) and X chromosome (20,939 variants) in a sex stratified and combined manner. SNP association testing was not statistically significant using a stringent cut-off for significance but revealed likely candidate genes that warrant further investigation. A genome wide interaction analysis detected 16 significant interactions. Finally, the results highlight the importance of sex-stratified analysis as strong sex-specific effects were identified on both the autosome and X chromosome

    Genetic identification of a common collagen disease in Puerto Ricans via identity-by-descent mapping in a health system

    Get PDF
    Achieving confidence in the causality of a disease locus is a complex task that often requires supporting data from both statistical genetics and clinical genomics. Here we describe a combined approach to identify and characterize a genetic disorder that leverages distantly related patients in a health system and population-scale mapping. We utilize genomic data to uncover components of distant pedigrees, in the absence of recorded pedigree information, in the multi-ethnic BioMe biobank in New York City. By linking to medical records, we discover a locus associated with both elevated genetic relatedness and extreme short stature. We link the gene, COL27A1, with a little-known genetic disease, previously thought to be rare and recessive. We demonstrate that disease manifests in both heterozygotes and homozygotes, indicating a common collagen disorder impacting up to 2% of individuals of Puerto Rican ancestry, leading to a better understanding of the continuum of complex and Mendelian disease

    Polymorphisms in Toll-like receptor genes influence antibody responses to cytomegalovirus glycoprotein B vaccine

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Congenital Cytomegalovirus (CMV) infection is an important medical problem that has yet no current solution. A clinical trial of CMV glycoprotein B (gB) vaccine in young women showed promising efficacy. Improved understanding of the basis for prevention of CMV infection is essential for developing improved vaccines.</p> <p>Results</p> <p>We genotyped 142 women previously vaccinated with three doses of CMV gB for single nucleotide polymorphisms (SNPs) in TLR 1-4, 6, 7, 9, and 10, and their associated intracellular signaling genes. SNPs in the platelet-derived growth factor receptor (PDGFRA) and integrins were also selected based on their role in binding gB. Specific SNPs in TLR7 and IKBKE (inhibitor of nuclear factor kappa-B kinase subunit epsilon) were associated with antibody responses to gB vaccine. Homozygous carriers of the minor allele at four SNPs in TLR7 showed higher vaccination-induced antibody responses to gB compared to heterozygotes or homozygotes for the common allele. SNP rs1953090 in IKBKE was associated with changes in antibody level from second to third dose of vaccine; homozygotes for the minor allele exhibited lower antibody responses while homozygotes for the major allele showed increased responses over time.</p> <p>Conclusions</p> <p>These data contribute to our understanding of the immunogenetic mechanisms underlying variations in the immune response to CMV vaccine.</p

    Admixture mapping of peripheral artery disease in a Dominican population reveals a putative risk locus on 2q35

    Get PDF
    Peripheral artery disease (PAD) is a form of atherosclerotic cardiovascular disease, affecting ∼8 million Americans, and is known to have racial and ethnic disparities. PAD has been reported to have a significantly higher prevalence in African Americans (AAs) compared to non-Hispanic European Americans (EAs). Hispanic/Latinos (HLs) have been reported to have lower or similar rates of PAD compared to EAs, despite having a paradoxically high burden of PAD risk factors; however, recent work suggests prevalence may differ between sub-groups. Here, we examined a large cohort of diverse adults in the BioMe biobank in New York City. We observed the prevalence of PAD at 1.7% in EAs vs. 8.5% and 9.4% in AAs and HLs, respectively, and among HL sub-groups, the prevalence was found at 11.4% and 11.5% in Puerto Rican and Dominican populations, respectively. Follow-up analysis that adjusted for common risk factors demonstrated that Dominicans had the highest increased risk for PAD relative to EAs [OR = 3.15 (95% CI 2.33–4.25), p &lt; 6.44 × 10−14]. To investigate whether genetic factors may explain this increased risk, we performed admixture mapping by testing the association between local ancestry and PAD in Dominican BioMe participants (N = 1,813) separately from European, African, and Native American (NAT) continental ancestry tracts. The top association with PAD was an NAT ancestry tract at chromosome 2q35 [OR = 1.96 (SE = 0.16), p &lt; 2.75 × 10−05) with 22.6% vs. 12.9% PAD prevalence in heterozygous NAT tract carriers versus non-carriers, respectively. Fine-mapping at this locus implicated tag SNP rs78529201 located within a long intergenic non-coding RNA (lincRNA) LINC00607, a gene expression regulator of key genes related to thrombosis and extracellular remodeling of endothelial cells, suggesting a putative link of the 2q35 locus to PAD etiology. Efforts to reproduce the signal in other Hispanic cohorts were unsuccessful. In summary, we showed how leveraging health system data helped understand nuances of PAD risk across HL sub-groups and admixture mapping approaches elucidated a putative risk locus in a Dominican population
    corecore