153 research outputs found

    Efficient Generalized Least Squares Method for Mixed Population and Family‐based Samples in Genome‐wide Association Studies

    Full text link
    Genome‐wide association studies (GWAS) that draw samples from multiple studies with a mixture of relationship structures are becoming more common. Analytical methods exist for using mixed‐sample data, but few methods have been proposed for the analysis of genotype‐by‐environment (G×E) interactions. Using GWAS data from a study of sarcoidosis susceptibility genes in related and unrelated African Americans, we explored the current analytic options for genotype association testing in studies using both unrelated and family‐based designs. We propose a novel method—generalized least squares (GLX)—to estimate both SNP and G×E interaction effects for categorical environmental covariates and compared this method to generalized estimating equations (GEE), logistic regression, the Cochran–Armitage trend test, and the W QLS and M QLS methods. We used simulation to demonstrate that the GLX method reduces type I error under a variety of pedigree structures. We also demonstrate its superior power to detect SNP effects while offering computational advantages and comparable power to detect G×E interactions versus GEE. Using this method, we found two novel SNPs that demonstrate a significant genome‐wide interaction with insecticide exposure—rs10499003 and rs7745248, located in the intronic and 3' UTR regions of the FUT9 gene on chromosome 6q16.1.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/107571/1/gepi21811.pd

    Single Cell Transcriptomics Implicate Novel Monocyte and T Cell Immune Dysregulation in Sarcoidosis

    Get PDF
    Sarcoidosis is a systemic inflammatory disease characterized by infiltration of immune cells into granulomas. Previous gene expression studies using heterogeneous cell mixtures lack insight into cell-type-specific immune dysregulation. We performed the first single-cell RNA-sequencing study of sarcoidosis in peripheral immune cells in 48 patients and controls. Following unbiased clustering, differentially expressed genes were identified for 18 cell types and bioinformatically assessed for function and pathway enrichment. Our results reveal persistent activation of circulating classical monocytes with subsequent upregulation of trafficking molecules. Specifically, classical monocytes upregulated distinct markers of activation including adhesion molecules, pattern recognition receptors, and chemokine receptors, as well as enrichment of immunoregulatory pathways HMGB1, mTOR, and ephrin receptor signaling. Predictive modeling implicated TGFβ and mTOR signaling as drivers of persistent monocyte activation. Additionally, sarcoidosis T cell subsets displayed patterns of dysregulation. CD4 naïve T cells were enriched for markers of apoptosis and Th17/T(reg) differentiation, while effector T cells showed enrichment of anergy-related pathways. Differentially expressed genes in regulatory T cells suggested dysfunctional p53, cell death, and TNFR2 signaling. Using more sensitive technology and more precise units of measure, we identify cell-type specific, novel inflammatory and regulatory pathways. Based on our findings, we suggest a novel model involving four convergent arms of dysregulation: persistent hyperactivation of innate and adaptive immunity via classical monocytes and CD4 naïve T cells, regulatory T cell dysfunction, and effector T cell anergy. We further our understanding of the immunopathology of sarcoidosis and point to novel therapeutic targets

    Biological and economic management strategy evaluations of the eastern king prawn fishery

    Get PDF
    Stock assessment of the eastern king prawn (EKP) fishery, and the subsequent advice to management and industry, could be improved by addressing a number of issues. The recruitment dynamics of EKP in the northern (i.e., North Reef to the Swain Reefs) parts of the fishery need to be clarified. Fishers report that the size of the prawns from these areas when they recruit to the fishing grounds is resulting in suboptimal sizes/ages at first capture, and therefore localised growth overfishing. There is a need to assess alternative harvest strategies of the EKP fishery, via computer simulations, particularly seasonal and monthly or lunar-based closures to identify scenarios that improve the value of the catch, decrease costs and reduce the risk of overfishing, prior to implementing new management measures

    Detrimental effects of duplicate reads and low complexity regions on RNA- and ChIP-seq data

    Get PDF
    Background Adapter trimming and removal of duplicate reads are common practices in next-generation sequencing pipelines. Sequencing reads ambiguously mapped to repetitive and low complexity regions can also be problematic for accurate assessment of the biological signal, yet their impact on sequencing data has not received much attention. We investigate how trimming the adapters, removing duplicates, and filtering out reads overlapping low complexity regions influence the significance of biological signal in RNA- and ChIP-seq experiments. Methods We assessed the effect of data processing steps on the alignment statistics and the functional enrichment analysis results of RNA- and ChIP-seq data. We compared differentially processed RNA-seq data with matching microarray data on the same patient samples to determine whether changes in pre-processing improved correlation between the two. We have developed a simple tool to remove low complexity regions, RepeatSoaker, available at https://github.com/mdozmorov/RepeatSoaker, and tested its effect on the alignment statistics and the results of the enrichment analyses. Results Both adapter trimming and duplicate removal moderately improved the strength of biological signals in RNA-seq and ChIP-seq data. Aggressive filtering of reads overlapping with low complexity regions, as defined by RepeatMasker, further improved the strength of biological signals, and the correlation between RNA-seq and microarray gene expression data. Conclusions Adapter trimming and duplicates removal, coupled with filtering out reads overlapping low complexity regions, is shown to increase the quality and reliability of detecting biological signals in RNA-seq and ChIP-seq data

    Performance of HLA allele prediction methods in African Americans for class II genes HLA-DRB1, -DQB1, and -DPB1

    Get PDF
    BACKGROUND: The expense of human leukocyte antigen (HLA) allele genotyping has motivated the development of imputation methods that use dense single nucleotide polymorphism (SNP) genotype data and the region’s haplotype structure, but the performance of these methods in admixed populations (such as African Americans) has not been adequately evaluated. We compared genotype-based—derived from both genome-wide genotyping and targeted sequencing—imputation results to existing allele data for HLA–DRB1, −DQB1, and –DPB1. RESULTS: In European Americans, the newly-developed HLA Genotype Imputation with Attribute Bagging (HIBAG) method outperformed HLA*IMP:02. In African Americans, HLA*IMP:02 performed marginally better than HIBAG pre-built models, but HIBAG models constructed using a portion of our African American sample with both SNP genotyping and four-digit HLA class II allele typing had consistently higher accuracy than HLA*IMP:02. However, HIBAG was significantly less accurate in individuals heterozygous for local ancestry (p ≤0.04). Accuracy improved in models with equal numbers of African and European chromosomes. Variants added by targeted sequencing and SNP imputation further improved both imputation accuracy and the proportion of high quality calls. CONCLUSION: Combining the HIBAG approach with local ancestry and dense variant data can produce highly-accurate HLA class II allele imputation in African Americans

    Trans-Ethnic Mapping of BANK1 Identifies Two Independent SLE-Risk Linkage Groups Enriched for Co-Transcriptional Splicing Marks

    Get PDF
    BANK1 is a susceptibility gene for several systemic autoimmune diseases in several populations. Using the genome-wide association study (GWAS) data from Europeans (EUR) and African Americans (AA), we performed an extensive fine mapping of ankyrin repeats 1 (BANK1). To increase the SNP density, we used imputation followed by univariate and conditional analysis, combinedwith a haplotypic and expression quantitative trait locus (eQTL) analysis. The data from Europeans showed that the associated region was restricted to a minimal and dependent set of SNPs covering introns two and three, and exon two. In AA, the signal found in the Europeans was split into two independent effects. All of the major risk associated SNPs were eQTLs, and the risks were associated with an increased BANK1 gene expression. Functional annotation analysis revealed the enrichment of repressive B cell epigenomicmarks (EZH2 and H3K27me3) and a strong enrichment of splice junctions. Furthermore, one eQTL located in intron two, rs13106926, was found within the binding site for RUNX3, a transcriptional activator. These results connect the local genome topography, chromatin structure, and the regulatory landscape of BANK1 with co-transcriptional splicing of exon two. Our data defines a minimal set of risk associated eQTLs predicted to be involved in the expression of BANK1 modulated through epigenetic regulation and splicing. These findings allow us to suggest that the increased expression of BANK1 will have an impact on B-cell mediated disease pathways.The work presented in this paper has been supported by the Ministerio de Economía y Competitividad, Spain (SAF2016-78631-P), partly co-financed by FEDER funds of the European Union, the Gustaf den V:e-80-års Fond and the Swedish Association against Rheumatism to M.E.A-R. In addition, this work was financed by the NIH P01 grant P01-AI-083194 to C.D.L., J.B.H., R.K., and M.E.A-R. JBH: NIH grants: R01 AI024717, U01 HG00866, P30 AR070549 and U01 AI130830 and the US Department of Veterans Affairs: I01 BX001834.C.D.L.: Center for Public Health Genomics. R.K.: NIH grant R01-AR33062. J.A.J.: NIH grants U54GM104938, P30AR053483

    A Replication Study of GWAS-Derived Lipid Genes in Asian Indians: The Chromosomal Region 11q23.3 Harbors Loci Contributing to Triglycerides

    Get PDF
    Recent genome-wide association scans (GWAS) and meta-analysis studies on European populations have identified many genes previously implicated in lipid regulation. Validation of these loci on different global populations is important in determining their clinical relevance, particularly for development of novel drug targets for treating and preventing diabetic dyslipidemia and coronary artery disease (CAD). In an attempt to replicate GWAS findings on a non-European sample, we examined the role of six of these loci (CELSR2-PSRC1-SORT1 rs599839; CDKN2A-2B rs1333049; BUD13-ZNF259 rs964184; ZNF259 rs12286037; CETP rs3764261; APOE-C1-C4-C2 rs4420638) in our Asian Indian cohort from the Sikh Diabetes Study (SDS) comprising 3,781 individuals (2,902 from Punjab and 879 from the US). Two of the six SNPs examined showed convincing replication in these populations of Asian Indian origin. Our study confirmed a strong association of CETP rs3764261 with high-density lipoprotein cholesterol (HDL-C) (p = 2.03×10−26). Our results also showed significant associations of two GWAS SNPs (rs964184 and rs12286037) from BUD13-ZNF259 near the APOA5-A4-C3-A1 genes with triglyceride (TG) levels in this Asian Indian cohort (rs964184: p = 1.74×10−17; rs12286037: p = 1.58×10−2). We further explored 45 SNPs in a ∼195 kb region within the chromosomal region 11q23.3 (encompassing the BUD13-ZNF259, APOA5-A4-C3-A1, and SIK3 genes) in 8,530 Asian Indians from the London Life Sciences Population (LOLIPOP) (UK) and SDS cohorts. Five more SNPs revealed significant associations with TG in both cohorts individually as well as in a joint meta-analysis. However, the strongest signal for TG remained with BUD13-ZNF259 (rs964184: p = 1.06×10−39). Future targeted deep sequencing and functional studies should enhance our understanding of the clinical relevance of these genes in dyslipidemia and hypertriglyceridemia (HTG) and, consequently, diabetes and CAD

    Variants at multiple loci implicated in both innate and adaptive immune responses are associated with Sjögren’s syndrome

    Get PDF
    Sjögren’s syndrome is a common autoimmune disease (~0.7% of European Americans) typically presenting as keratoconjunctivitis sicca and xerostomia. In addition to strong association within the HLA region at 6p21 (Pmeta=7.65×10−114), we establish associations with IRF5-TNPO3 (Pmeta=2.73×10−19), STAT4 (Pmeta=6.80×10−15), IL12A (Pmeta =1.17×10−10), FAM167A-BLK (Pmeta=4.97×10−10), DDX6-CXCR5 (Pmeta=1.10×10−8), and TNIP1 (Pmeta=3.30×10−8). Suggestive associations with Pmeta<5×10−5 were observed with 29 regions including TNFAIP3, PTTG1, PRDM1, DGKQ, FCGR2A, IRAK1BP1, ITSN2, and PHIP amongst others. These results highlight the importance of genes involved in both innate and adaptive immunity in Sjögren’s syndrome

    Sex differences in the genetics of sarcoidosis across European and African ancestry populations

    Get PDF
    BackgroundSex differences in the susceptibility of sarcoidosis are unknown. The study aims to identify sex-dependent genetic variations in two clinical sarcoidosis phenotypes: Löfgren’s syndrome (LS) and non-Löfgren’s syndrome (non-LS).MethodsA meta-analysis of genome-wide association studies was conducted on Europeans and African Americans, totaling 10,103 individuals from three population-based cohorts, Sweden (n = 3,843), Germany (n = 3,342), and the United States (n = 2,918), followed by an SNP lookup in the UK Biobank (UKB, n = 387,945). A genome-wide association study based on Immunochip data consisting of 141,000 single nucleotide polymorphisms (SNPs) was conducted in the sex groups. The association test was based on logistic regression using the additive model in LS and non-LS sex groups independently. Additionally, gene-based analysis, gene expression, expression quantitative trait loci (eQTL) mapping, and pathway analysis were performed to discover functionally relevant mechanisms related to sarcoidosis and biological sex.ResultsWe identified sex-dependent genetic variations in LS and non-LS sex groups. Genetic findings in LS sex groups were explicitly located in the extended Major Histocompatibility Complex (xMHC). In non-LS, genetic differences in the sex groups were primarily located in the MHC class II subregion and ANXA11. Gene-based analysis and eQTL enrichment revealed distinct sex-specific gene expression patterns in various tissues and immune cell types. In LS sex groups, a pathway map related to antigen presentation machinery by IFN-gamma. In non-LS, pathway maps related to immune response lectin-induced complement pathway in males and related to maturation and migration of dendritic cells in skin sensitization in females were identified.ConclusionOur findings provide new evidence for a sex bias underlying sarcoidosis genetic architecture, particularly in clinical phenotypes LS and non-LS. Biological sex likely plays a role in disease mechanisms in sarcoidosis
    corecore