236 research outputs found
Identification and HLA-Tetramer-Validation of Human CD4(+) and CD8(+) T Cell Responses against HCMV Proteins IE1 and IE2
Human cytomegalovirus (HCMV) is an important human pathogen. It is a leading cause of congenital infection and a leading infectious threat to recipients of solid organ transplants as well as of allogeneic hematopoietic cell transplants. Moreover, it has recently been suggested that HCMV may promote tumor development. Both CD4+ and CD8+ T cell responses are important for long-term control of the virus, and adoptive transfer of HCMV-specific T cells has led to protection from reactivation and HCMV disease. Identification of HCMV-specific T cell epitopes has primarily focused on CD8+ T cell responses against the pp65 phosphoprotein. In this study, we have focused on CD4+ and CD8+ T cell responses against the immediate early 1 and 2 proteins (IE1 and IE2). Using overlapping peptides spanning the entire IE1 and IE2 sequences, peripheral blood mononuclear cells from 16 healthy, HLA-typed, donors were screened by ex vivo IFN-γ ELISpot and in vitro intracellular cytokine secretion assays. The specificities of CD4+ and CD8+ T cell responses were identified and validated by HLA class II and I tetramers, respectively. Eighty-one CD4+ and 44 CD8+ T cell responses were identified representing at least seven different CD4 epitopes and 14 CD8 epitopes restricted by seven and 11 different HLA class II and I molecules, respectively, in total covering 91 and 98% of the Caucasian population, respectively. Presented in the context of several different HLA class II molecules, two epitope areas in IE1 and IE2 were recognized in about half of the analyzed donors. These data may be used to design a versatile anti-HCMV vaccine and/or immunotherapy strategy
A study assessing the association of glycated hemoglobin a1C (HbA1C) associated variants with HbA1C, chronic kidney disease and diabetic retinopathy in populations of asian ancestry
10.1371/journal.pone.0079767PLoS ONE811-POLN
Genome-wide association study identifies multiple susceptibility loci for glioma
Previous genome-wide association studies (GWASs) have shown that common genetic variation contributes to the heritable risk of glioma. To identify new glioma susceptibility loci, we conducted a meta-analysis of four GWAS (totalling 4,147 cases and 7,435 controls), with imputation using 1000 Genomes and UK10K Project data as reference. After genotyping an additional 1,490 cases and 1,723 controls we identify new risk loci for glioblastoma (GBM) at 12q23.33 (rs3851634, near POLR3B, P=3.02 × 10−9) and non-GBM at 10q25.2 (rs11196067, near VTI1A, P=4.32 × 10−8), 11q23.2 (rs648044, near ZBTB16, P=6.26 × 10−11), 12q21.2 (rs12230172, P=7.53 × 10−11) and 15q24.2 (rs1801591, near ETFA, P=5.71 × 10−9). Our findings provide further insights into the genetic basis of the different glioma subtypes
Generating samples for association studies based on HapMap data
<p>Abstract</p> <p>Background</p> <p>With the completion of the HapMap project, a variety of computational algorithms and tools have been proposed for haplotype inference, tag SNP selection and genome-wide association studies. Simulated data are commonly used in evaluating these new developed approaches. In addition to simulations based on population models, empirical data generated by perturbing real data, has also been used because it may inherit specific properties from real data. However, there is no tool that is publicly available to generate large scale simulated variation data by taking into account knowledge from the HapMap project.</p> <p>Results</p> <p>A computer program (<it>gs</it>) was developed to quickly generate a large number of samples based on real data that are useful for a variety of purposes, including evaluating methods for haplotype inference, tag SNP selection and association studies. Two approaches have been implemented to generate dense SNP haplotype/genotype data that share similar local <it>linkage disequilibrium </it>(LD) patterns as those in human populations. The first approach takes haplotype pairs from samples as inputs, and the second approach takes patterns of haplotype block structures as inputs. Both quantitative and qualitative traits have been incorporated in the program. Phenotypes are generated based on a disease model, or based on the effect of a quantitative trait nucleotide, both of which can be specified by users. In addition to single-locus disease models, two-locus disease models have also been implemented that can incorporate any degree of epistasis. Users are allowed to specify all nine parameters in a 3 × 3 penetrance table. For several commonly used two-locus disease models, the program can automatically calculate penetrances based on the population prevalence and marginal effects of a disease that users can conveniently specify.</p> <p>Conclusion</p> <p>The program <it>gs </it>can effectively generate large scale genetic and phenotypic variation data that can be used for evaluating new developed approaches. It is freely available from the authors' web site at <url>http://www.eecs.case.edu/~jxl175/gs.html</url>.</p
Statistical Power of Model Selection Strategies for Genome-Wide Association Studies
Genome-wide association studies (GWAS) aim to identify genetic variants related to diseases by examining the associations between phenotypes and hundreds of thousands of genotyped markers. Because many genes are potentially involved in common diseases and a large number of markers are analyzed, it is crucial to devise an effective strategy to identify truly associated variants that have individual and/or interactive effects, while controlling false positives at the desired level. Although a number of model selection methods have been proposed in the literature, including marginal search, exhaustive search, and forward search, their relative performance has only been evaluated through limited simulations due to the lack of an analytical approach to calculating the power of these methods. This article develops a novel statistical approach for power calculation, derives accurate formulas for the power of different model selection strategies, and then uses the formulas to evaluate and compare these strategies in genetic model spaces. In contrast to previous studies, our theoretical framework allows for random genotypes, correlations among test statistics, and a false-positive control based on GWAS practice. After the accuracy of our analytical results is validated through simulations, they are utilized to systematically evaluate and compare the performance of these strategies in a wide class of genetic models. For a specific genetic model, our results clearly reveal how different factors, such as effect size, allele frequency, and interaction, jointly affect the statistical power of each strategy. An example is provided for the application of our approach to empirical research. The statistical approach used in our derivations is general and can be employed to address the model selection problems in other random predictor settings. We have developed an R package markerSearchPower to implement our formulas, which can be downloaded from the Comprehensive R Archive Network (CRAN) or http://bioinformatics.med.yale.edu/group/
Identity-by-descent estimation with population- and pedigree-based imputation in admixed family data
BACKGROUND: In the past few years, imputation approaches have been mainly used in population-based designs of genome-wide association studies, although both family- and population-based imputation methods have been proposed. With the recent surge of family-based designs, family-based imputation has become more important. Imputation methods for both designs are based on identity-by-descent (IBD) information. Apart from imputation, the use of IBD information is also common for several types of genetic analysis, including pedigree-based linkage analysis. METHODS: We compared the performance of several family- and population-based imputation methods in large pedigrees provided by Genetic Analysis Workshop 19 (GAW19). We also evaluated the performance of a new IBD mapping approach that we propose, which combines IBD information from known pedigrees with information from unrelated individuals. RESULTS: Different combinations of the imputation methods have varied imputation accuracies. Moreover, we showed gains from the use of both known pedigrees and unrelated individuals with our IBD mapping approach over the use of known pedigrees only. CONCLUSIONS: Our results represent accuracies of different combinations of imputation methods that may be useful for data sets similar to the GAW19 pedigree data. Our IBD mapping approach, which uses both known pedigree and unrelated individuals, performed better than classical linkage analysis
On the Use of Variance per Genotype as a Tool to Identify Quantitative Trait Interaction Effects: A Report from the Women's Genome Health Study
Testing for genetic effects on mean values of a quantitative trait has been a very successful strategy. However, most studies to date have not explored genetic effects on the variance of quantitative traits as a relevant consequence of genetic variation. In this report, we demonstrate that, under plausible scenarios of genetic interaction, the variance of a quantitative trait is expected to differ among the three possible genotypes of a biallelic SNP. Leveraging this observation with Levene's test of equality of variance, we propose a novel method to prioritize SNPs for subsequent gene–gene and gene–environment testing. This method has the advantageous characteristic that the interacting covariate need not be known or measured for a SNP to be prioritized. Using simulations, we show that this method has increased power over exhaustive search under certain conditions. We further investigate the utility of variance per genotype by examining data from the Women's Genome Health Study. Using this dataset, we identify new interactions between the LEPR SNP rs12753193 and body mass index in the prediction of C-reactive protein levels, between the ICAM1 SNP rs1799969 and smoking in the prediction of soluble ICAM-1 levels, and between the PNPLA3 SNP rs738409 and body mass index in the prediction of soluble ICAM-1 levels. These results demonstrate the utility of our approach and provide novel genetic insight into the relationship among obesity, smoking, and inflammation
Anisotropic behaviors of massless Dirac fermions in graphene under periodic potential
Charge carriers of graphene show neutrino-like linear energy dispersions as
well as chiral behavior near the Dirac point. Here we report highly unusual and
unexpected behaviors of these carriers in applied external periodic potentials,
i.e., in graphene superlattices. The group velocity renormalizes highly
anisotropically even to a degree that it is not changed at all for states with
wavevector in one direction but is reduced to zero in another, implying the
possibility that one can make nanoscale electronic circuits out of graphene not
by cutting it but by drawing on it in a non-destructive way. Also, the type of
charge carrier species (e.g. electron, hole or open orbit) and their density of
states vary drastically with the Fermi energy, enabling one to tune the Fermi
surface-dominant properties significantly with gate voltage. These results
address the fundamental question of how chiral massless Dirac fermions
propagate in periodic potentials and point to a new possible path for nanoscale
electronics.Comment: 10 pages, 9 figure
Comparison of Strategies to Detect Epistasis from eQTL Data
Genome-wide association studies have been instrumental in identifying genetic variants associated with complex traits such as human disease or gene expression phenotypes. It has been proposed that extending existing analysis methods by considering interactions between pairs of loci may uncover additional genetic effects. However, the large number of possible two-marker tests presents significant computational and statistical challenges. Although several strategies to detect epistasis effects have been proposed and tested for specific phenotypes, so far there has been no systematic attempt to compare their performance using real data. We made use of thousands of gene expression traits from linkage and eQTL studies, to compare the performance of different strategies. We found that using information from marginal associations between markers and phenotypes to detect epistatic effects yielded a lower false discovery rate (FDR) than a strategy solely using biological annotation in yeast, whereas results from human data were inconclusive. For future studies whose aim is to discover epistatic effects, we recommend incorporating information about marginal associations between SNPs and phenotypes instead of relying solely on biological annotation. Improved methods to discover epistatic effects will result in a more complete understanding of complex genetic effects
Interactions between genes involved in the antioxidant defence system and breast cancer risk
The aim of the study is to examine the association between multilocus genotypes across 10 genes encoding proteins in the antioxidant defence system and breast cancer. The 10 genes are SOD1, SOD2, GPX1, GPX4, GSR, CAT, TXN, TXN2, TXNRD1 and TXNRD2. In all, 2271 cases and 2280 controls were used to examine gene–gene interactions between 52 single nucleotide polymorphisms (SNPs) that are hypothesised to tag all common variants in the 10 genes. The statistical analysis is based on three methods: unconditional logistic regression, multifactor dimensionality reduction and hierarchical cluster analysis. We examined all two- and three-way combinations with unconditional logistic regression and multifactor dimensionality reduction, and used a global approach with all SNPs in the hierarchical cluster analysis. Single-locus studies of an association of genetic variants in the antioxidant defence genes and breast cancer have been contradictory and inconclusive. It is the first time, to our knowledge, the association between multilocus genotypes across genes coding for antioxidant defence enzymes and breast cancer is investigated. We found no evidence of an association with breast cancer with our multilocus approach. The search for two-way interactions gave experiment-wise significance levels of P=0.24 (TXN [t2715c] and TXNRD2 [g23524a]) and P=0.58 (GSR [c39396t] and TXNRD2 [a442g]), for the unconditional logistic regression and multifactor dimensionality reduction, respectively. The experiment-wise significance levels for the three-way interactions were P=0.94 (GPX4 [t2572c], TXN [t2715c] and TXNRD2 [g23524a]) and P=0.29 (GSR [c39396t], TXN [t2715c] and TXNRD2 [a442g]) for the unconditional logistic regression and multifactor dimensionality reduction, respectively. In the hierarchical cluster analysis neither the average across four rounds with replacement of missing values at random (P=0.12) nor a fifth round with more balanced proportion of missing values between cases and controls (P=0.17) was significant
- …
