31 research outputs found

    A test for comparing two groups of samples when analyzing multiple omics profiles

    Get PDF
    Background: A number of statistical models has been proposed for studying the association between gene expression and copy number data in integrated analysis. The next step is to compare association patterns between different groups of samples.Results: We propose a method, named dSIM, to find differences in association between copy number and gene expression, when comparing two groups of samples. Firstly, we use ridge regression to correct for the baseline associations between copy number and gene expression. Secondly, the global test is applied to the corrected data in order to find differences in association patterns between two groups of samples. We show that dSIM detects differences even in small genomic regions in a simulation study. We also apply dSIM to two publicly available breast cancer datasets and identify chromosome arms where copy number led gene expression regulation differs between positive and negative estrogen receptor samples. In spite of differing genomic coverage, some selected arms are identified in both datasets.Conclusion: We developed a flexible and robust method for studying association differences between two groups of samples while integrating genomic data from different platforms. dSIM can be used with most types of microarray/sequencing data, including methylation and microRNA expression. The method is implemented in R and will be made part of the BioConductor package SIM

    Genome-to-genome analysis highlights the effect of the human innate and adaptive immune systems on the hepatitis C virus

    Get PDF
    Outcomes of hepatitis C virus (HCV) infection and treatment depend on viral and host genetic factors. Here we use human genome-wide genotyping arrays and new whole-genome HCV viral sequencing technologies to perform a systematic genome-to-genome study of 542 individuals who were chronically infected with HCV, predominantly genotype 3. We show that both alleles of genes encoding human leukocyte antigen molecules and genes encoding components of the interferon lambda innate immune system drive viral polymorphism. Additionally, we show that IFNL4 genotypes determine HCV viral load through a mechanism dependent on a specific amino acid residue in the HCV NS5A protein. These findings highlight the interplay between the innate immune system and the viral genome in HCV control

    Human genetic variants and age are the strongest predictors of humoral immune responses to common pathogens and vaccines.

    Get PDF
    Humoral immune responses to infectious agents or vaccination vary substantially among individuals, and many of the factors responsible for this variability remain to be defined. Current evidence suggests that human genetic variation influences (i) serum immunoglobulin levels, (ii) seroconversion rates, and (iii) intensity of antigen-specific immune responses. Here, we evaluated the impact of intrinsic (age and sex), environmental, and genetic factors on the variability of humoral response to common pathogens and vaccines. We characterized the serological response to 15 antigens from common human pathogens or vaccines, in an age- and sex-stratified cohort of 1000 healthy individuals (Milieu Intérieur cohort). Using clinical-grade serological assays, we measured total IgA, IgE, IgG, and IgM levels, as well as qualitative (serostatus) and quantitative IgG responses to cytomegalovirus, Epstein-Barr virus, herpes simplex virus 1 and 2, varicella zoster virus, Helicobacter pylori, Toxoplasma gondii, influenza A virus, measles, mumps, rubella, and hepatitis B virus. Following genome-wide genotyping of single nucleotide polymorphisms and imputation, we examined associations between ~ 5 million genetic variants and antibody responses using single marker and gene burden tests. We identified age and sex as important determinants of humoral immunity, with older individuals and women having higher rates of seropositivity for most antigens. Genome-wide association studies revealed significant associations between variants in the human leukocyte antigen (HLA) class II region on chromosome 6 and anti-EBV and anti-rubella IgG levels. We used HLA imputation to fine map these associations to amino acid variants in the peptide-binding groove of HLA-DRβ1 and HLA-DPβ1, respectively. We also observed significant associations for total IgA levels with two loci on chromosome 2 and with specific KIR-HLA combinations. Using extensive serological testing and genome-wide association analyses in a well-characterized cohort of healthy individuals, we demonstrated that age, sex, and specific human genetic variants contribute to inter-individual variability in humoral immunity. By highlighting genes and pathways implicated in the normal antibody response to frequently encountered antigens, these findings provide a basis to better understand disease pathogenesis. ClinicalTrials.gov , NCT01699893

    Viral genome wide association study identifies novel hepatitis C virus polymorphisms associated with sofosbuvir treatment failure

    Get PDF
    Persistent hepatitis C virus (HCV) infection is a major cause of chronic liver disease, worldwide. With the development of direct-acting antivirals, treatment of chronically infected patients has become highly effective, although a subset of patients responds less well to therapy. Sofosbuvir is a common component of current de novo or salvage combination therapies, that targets the HCV NS5B polymerase. We use pre-treatment whole-genome sequences of HCV from 507 patients infected with HCV subtype 3a and treated with sofosbuvir containing regimens to detect viral polymorphisms associated with response to treatment. We find three common polymorphisms in non-targeted HCV NS2 and NS3 proteins are associated with reduced treatment response. These polymorphisms are enriched in post-treatment HCV sequences of patients unresponsive to treatment. They are also associated with lower reductions in viral load in the first week of therapy. Using in vitro short-term dose-response assays, these polymorphisms do not cause any reduction in sofosbuvir potency, suggesting an indirect mechanism of action in decreasing sofosbuvir efficacy. The identification of polymorphisms in NS2 and NS3 proteins associated with poor treatment outcomes emphasises the value of systematic genome-wide analyses of viruses in uncovering clinically relevant polymorphisms that impact treatment

    Correcting for Population Stratification Reduces False Positive and False Negative Results in Joint Analyses of Host and Pathogen Genomes

    No full text
    Studies of host genetic determinants of pathogen sequence variations can identify sites of genomic conflicts, by highlighting variants that are implicated in immune response on the host side and adaptive escape on the pathogen side. However, systematic genetic differences in host and pathogen populations can lead to inflated type I (false positive) and type II (false negative) error rates in genome-wide association analyses. Here, we demonstrate through a simulation that correcting for both host and pathogen stratification reduces spurious signals and increases power to detect real associations in a variety of tested scenarios. We confirm the validity of the simulations by showing comparable results in an analysis of paired human and HIV genomes

    A test for detecting differential indirect trans effects between two groups of samples

    Get PDF
    Integrative analysis of copy number and gene expression data can help in understanding the cis and trans effect of copy number aberrations on transcription levels of genes involved in a pathway. To analyse how these copy number mediated gene-gene interactions differ between groups of samples we propose a new method, named dNET. Our method uses ridge regression to model the network topology involving one gene's expression level, its gene dosage and the expression levels of other genes in the network. The interaction parameters are estimated by fitting the model per gene for all samples together. However, instead of testing for differential network topology per gene, dNET tests for an overall difference in estimated parameters between two groups of samples and produces a single p-value. With the help of several simulation studies, we show that dNET can detect differential network nodes with high accuracy and low rate of false positives even in the presence of differential cis effects. We also apply dNET to publicly available TCGA cancer datasets and identify pathways where copy number mediated gene-gene interactions differ between samples with cancer stage lower than stage 3 and samples with cancer stage 3 or above
    corecore