28 research outputs found

    Specificity of the STAT4 Genetic Association for Severe Disease Manifestations of Systemic Lupus Erythematosus

    Get PDF
    Systemic lupus erythematosus (SLE) is a genetically complex disease with heterogeneous clinical manifestations. A polymorphism in the STAT4 gene has recently been established as a risk factor for SLE, but the relationship with specific SLE subphenotypes has not been studied. We studied 137 SNPs in the STAT4 region genotyped in 4 independent SLE case series (total n = 1398) and 2560 healthy controls, along with clinical data for the cases. Using conditional testing, we confirmed the most significant STAT4 haplotype for SLE risk. We then studied a SNP marking this haplotype for association with specific SLE subphenotypes, including autoantibody production, nephritis, arthritis, mucocutaneous manifestations, and age at diagnosis. To prevent possible type-I errors from population stratification, we reanalyzed the data using a subset of subjects determined to be most homogeneous based on principal components analysis of genome-wide data. We confirmed that four SNPs in very high LD (r2 = 0.94 to 0.99) were most strongly associated with SLE, and there was no compelling evidence for additional SLE risk loci in the STAT4 region. SNP rs7574865 marking this haplotype had a minor allele frequency (MAF) = 31.1% in SLE cases compared with 22.5% in controls (OR = 1.56, p = 10−16). This SNP was more strongly associated with SLE characterized by double-stranded DNA autoantibodies (MAF = 35.1%, OR = 1.86, p<10−19), nephritis (MAF = 34.3%, OR = 1.80, p<10−11), and age at diagnosis<30 years (MAF = 33.8%, OR = 1.77, p<10−13). An association with severe nephritis was even more striking (MAF = 39.2%, OR = 2.35, p<10−4 in the homogeneous subset of subjects). In contrast, STAT4 was less strongly associated with oral ulcers, a manifestation associated with milder disease. We conclude that this common polymorphism of STAT4 contributes to the phenotypic heterogeneity of SLE, predisposing specifically to more severe disease

    Role of STAT4 polymorphisms in systemic lupus erythematosus in a Japanese population: a case-control association study of the STAT1-STAT4 region

    Get PDF
    IntroductionRecent studies identified STAT4 (signal transducers and activators of transcription-4) as a susceptibility gene for systemic lupus erythematosus (SLE). STAT1 is encoded adjacently to STAT4 on 2q32.2-q32.3, upregulated in peripheral blood mononuclear cells from SLE patients, and functionally relevant to SLE. This study was conducted to test whether STAT4 is associated with SLE in a Japanese population also, to identify the risk haplotype, and to examine the potential genetic contribution of STAT1. To accomplish these aims, we carried out a comprehensive association analysis of 52 tag single nucleotide polymorphisms (SNPs) encompassing the STAT1-STAT4 region.MethodsIn the first screening, 52 tag SNPs were selected based on HapMap Phase II JPT (Japanese in Tokyo, Japan) data, and case-control association analysis was carried out on 105 Japanese female patients with SLE and 102 female controls. For associated SNPs, additional cases and controls were genotyped and association was analyzed using 308 SLE patients and 306 controls. Estimation of haplotype frequencies and an association study using the permutation test were performed with Haploview version 4.0 software. Population attributable risk percentage was estimated to compare the epidemiological significance of the risk genotype among populations.ResultsIn the first screening, rs7574865, rs11889341, and rs10168266 in STAT4 were most significantly associated (P < 0.01). Significant association was not observed for STAT1. Subsequent association studies of the three SNPs using 308 SLE patients and 306 controls confirmed a strong association of the rs7574865T allele (SLE patients: 46.3%, controls: 33.5%, P = 4.9 × 10-6, odds ratio 1.71) as well as TTT haplotype (rs10168266/rs11889341/rs7574865) (P = 1.5 × 10-6). The association was stronger in subgroups of SLE with nephritis and anti-double-stranded DNA antibodies. Population attributable risk percentage was estimated to be higher in the Japanese population (40.2%) than in Americans of European descent (19.5%).ConclusionsThe same STAT4 risk allele is associated with SLE in Caucasian and Japanese populations. Evidence for a role of STAT1 in genetic susceptibility to SLE was not detected. The contribution of STAT4 for the genetic background of SLE may be greater in the Japanese population than in Americans of European descent

    Advances in Computational Protein Design: Development of More Efficient Search Algorithms and their Application to the Full-Sequence Design of Larger Proteins

    Get PDF
    Protein design is the art of choosing an amino acid sequence that will fold into a desired structure. Computational protein design aims to quantify and automate this process. In computational protein design, various metrics may be used to calculate an energy score for a sequence with respect to a desired protein structure. An ongoing challenge is to find the lowest-energy sequences from amongst the vast multitude of sequence possibilities. A variety of exact and approximate algorithms may be used in this search. The work in this thesis focuses on the development and testing of four search algorithms. The first algorithm, HERO, is an exact algorithm, meaning that it will always find the lowest-energy sequence if the algorithm converges. We show that HERO is faster than other exact algorithms and converges on some previously intractable designs. The second algorithm, Vegas, is an approximate algorithm, meaning that it may not find the lowest-energy sequence. We show that, under certain conditions, Vegas finds the lowest-energy sequence in less time than HERO. The third algorithm, Monte Carlo, is an approximate algorithm that had been developed previously. We tested whether Monte Carlo was thorough enough to do a challenging computational design: the full-sequence design of a protein. Monte Carlo didn’t find the lowest-energy sequence, although a similar sequence from Vegas folded into the desired structure. Several biophysical methods suggested that the Monte Carlo sequence should also fold into the desired structure. Nevertheless, the Monte Carlo structure as determined by X-ray crystallography was markedly different from the predicted structure. We attribute this discrepancy to the presence of a high concentration of dioxane in the crystallization conditions. The fourth algorithm, FC_FASTER, is an approximate algorithm for designs of fixed amino acid composition. Such designs may accelerate improvements to the physical model. We show that FC_FASTER finds lower-energy sequences and is faster than our current fixed-composition algorithm.</p

    A search algorithm for fixed-composition protein design

    No full text
    We present a computational protein design algorithm for finding low-energy sequences of fixed amino acid composition. The search algorithms used in protein design typically do not restrict amino acid composition. However, the random energy model of Shakhnovich suggests that the use of fixed-composition sequences may circumvent defects in the modeling of the denatured state. Our algorithm, FC_FASTER, links fixed-composition versions of Monte Carlo and the FASTER algorithm. As proof of principle, FC_FASTER was tested on an experimentally validated, full-sequence design of the ÎČ1 domain of protein G. For the wild-type composition, FC_FASTER found a lower energy sequence than the experimentally validated sequence. Also, for a different composition, FC_FASTER found the hypothetical lowest-energy sequence in 14 out of 32 trials

    Acknowledgements

    No full text
    Advances in computational protein design: Development of more efficient search algorithms and their application to the full-sequence design of larger proteins Thesis b

    Preprocessing of rotamers for protein design calculations

    No full text
    We have developed a process that significantly reduces the number of rotamers in computational protein design calculations. This process, which we call Vegas, results in dramatic computational performance increases when used with algorithms based on the dead-end elimination (DEE) theorem. Vegas estimates the energy of each rotamer at each position by fixing each rotamer in turn and utilizing various search algorithms to optimize the remaining positions. Algorithms used for this context specific optimization can include Monte Carlo, self-consistent mean field, and the evaluation of an expression that generates a lower bound energy for the fixed rotamer. Rotamers with energies above a user-defined cutoff value are eliminated. We found that using Vegas to preprocess rotamers significantly reduced the calculation time of subsequent DEE-based algorithms while retaining the global minimum energy conformation. For a full boundary design of a 51 amino acid fragment of engrailed homeodomain, the total calculation time was reduced by 12-fold

    Dioxane contributes to the altered conformation and oligomerization state of a designed engrailed homeodomain variant

    No full text
    Our goal was to compute a stable, full-sequence design of the Drosophila melanogaster engrailed homeodomain. Thermal and chemical denaturation data indicated the design was significantly more stable than was the wild-type protein. The data were also nearly identical to those for a similar, later full-sequence design, which was shown by NMR to adopt the homeodomain fold: a three-helix, globular monomer. However, a 1.65 Å crystal structure of the design described here turned out to be of a completely different fold: a four-helix, rodlike tetramer. The crystallization conditions included approximately ~25% dioxane, and subsequent experiments by circular dichroism and sedimentation velocity analytical ultracentrifugation indicated that dioxane increases the helicity and oligomerization state of the designed protein. We attribute at least part of the discrepancy between the target fold and the crystal structure to the presence of a high concentration of dioxane

    Exact rotamer optimization for protein design

    No full text
    Computational methods play a central role in the rational design of novel proteins. The present work describes a new hybrid exact rotamer optimization (HERO) method that builds on previous dead-end elimination algorithms to yield dramatic performance enhancements. Measured on experimentally validated physical models, these improvements make it possible to perform previously intractable designs of entire protein core, surface, or boundary regions. Computational demonstrations include a full core design of the variable domains of the light and heavy chains of catalytic antibody 48G7 FAB with 74 residues and 10^(128) conformations, a full core/boundary design of the ÎČ1 domain of protein G with 25 residues and 10^(53) conformations, and a full surface design of the ÎČ1 domain of protein G with 27 residues and 10^(60) conformations. In addition, a full sequence design of the ÎČ1 domain of protein G is used to demonstrate the strong dependence of algorithm performance on the exact form of the potential function and the fidelity of the rotamer library. These results emphasize that search algorithm performance for protein design can only be meaningfully evaluated on physical models that have been subjected to experimental scrutiny. The new algorithm greatly facilitates ongoing efforts to engineer increasingly complex protein features

    Full-sequence computational design and solution structure of a thermostable protein variant

    No full text
    Computational protein design procedures were applied to the redesign of the entire sequence of a 51 amino acid residue protein, Drosophila melanogaster engrailed homeodomain. Various sequence optimization algorithms were compared and two resulting designed sequences were experimentally evaluated. The two sequences differ by 11 mutations and share 22% and 24% sequence identity with the wild-type protein. Both computationally designed proteins were considerably more stable than the naturally occurring protein, with midpoints of thermal denaturation greater than 99 degrees C. The solution structure was determined for one of the two sequences using multidimensional heteronuclear NMR spectroscopy, and the structure was found to closely match the original design template scaffold
    corecore