24 research outputs found

    CTdatabase: a knowledge-base of high-throughput and curated data on cancer-testis antigens

    Get PDF
    The potency of the immune response has still to be harnessed effectively to combat human cancers. However, the discovery of T-cell targets in melanomas and other tumors has raised the possibility that cancer vaccines can be used to induce a therapeutically effective immune response against cancer. The targets, cancer-testis (CT) antigens, are immunogenic proteins preferentially expressed in normal gametogenic tissues and different histological types of tumors. Therapeutic cancer vaccines directed against CT antigens are currently in late-stage clinical trials testing whether they can delay or prevent recurrence of lung cancer and melanoma following surgical removal of primary tumors. CT antigens constitute a large, but ill-defined, family of proteins that exhibit a remarkably restricted expression. Currently, there is a considerable amount of information about these proteins, but the data are scattered through the literature and in several bioinformatic databases. The database presented here, CTdatabase (http://www.cta.lncc.br), unifies this knowledge to facilitate both the mining of the existing deluge of data, and the identification of proteins alleged to be CT antigens, but that do not have their characteristic restricted expression pattern. CTdatabase is more than a repository of CT antigen data, since all the available information was carefully curated and annotated with most data being specifically processed for CT antigens and stored locally. Starting from a compilation of known CT antigens, CTdatabase provides basic information including gene names and aliases, RefSeq accession numbers, genomic location, known splicing variants, gene duplications and additional family members. Gene expression at the mRNA level in normal and tumor tissues has been collated from publicly available data obtained by several different technologies. Manually curated data related to mRNA and protein expression, and antigen-specific immune responses in cancer patients are also available, together with links to PubMed for relevant CT antigen article

    Extensive pleiotropism and allelic heterogeneity mediate metabolic effects of IRX3 and IRX5

    Get PDF
    While coding variants often have pleiotropic effects across multiple tissues, non-coding variants are thought to mediate their phenotypic effects by specific tissue and temporal regulation of gene expression. Here, we dissected the genetic and functional architecture of a genomic region within the FTO gene that is strongly associated with obesity risk. We show that multiple variants on a common haplotype modify the regulatory properties of several enhancers targeting IRX3 and IRX5 from megabase distances. We demonstrate that these enhancers impact gene expression in multiple tissues, including adipose and brain, and impart regulatory effects during a restricted temporal window. Our data indicate that the genetic architecture of disease-associated loci may involve extensive pleiotropy, allelic heterogeneity, shared allelic effects across tissues, and temporally-restricted effects

    Asthma-associated genetic variants induce IL33 differential expression through an enhancer-blocking regulatory region

    Get PDF
    Genome-wide association studies (GWAS) have implicated the IL33 locus in asthma, but the underlying mechanisms remain unclear. Here, we identify a 5鈥塳b region within the GWAS-defined segment that acts as an enhancer-blocking element in vivo and in vitro. Chromatin conformation capture showed that this 5鈥塳b region loops to the IL33 promoter, potentially regulating its expression. We show that the asthma-associated single nucleotide polymorphism (SNP) rs1888909, located within the 5鈥塳b region, is associated with IL33 gene expression in human airway epithelial cells and IL-33 protein expression in human plasma, potentially through differential binding of OCT-1 (POU2F1) to the asthma-risk allele. Our data demonstrate that asthma-associated variants at the IL33 locus mediate allele-specific regulatory activity and IL33 expression, providing a mechanism through which a regulatory SNP contributes to genetic risk of asthma.This work was supported by NIH grants R01 HL118758, R01 HL128075, R01 HL119577, R01 HL085197, U19 AI095230, UG3 OD023282 and UM1 AI114271

    Genome-wide discovery of human heart enhancers

    No full text
    The various organogenic programs deployed during embryonic development rely on the precise expression of a multitude of genes in time and space. Identifying the cis-regulatory elements responsible for this tightly orchestrated regulation of gene expression is an essential step in understanding the genetic pathways involved in development. We describe a strategy to systematically identify tissue-specific cis-regulatory elements that share combinations of sequence motifs. Using heart development as an experimental framework, we employed a combination of Gibbs sampling and linear regression to build a classifier that identifies heart enhancers based on the presence and/or absence of various sequence features, including known and putative transcription factor (TF) binding specificities. In distinguishing heart enhancers from a large pool of random noncoding sequences, the performance of our classifier is vastly superior to four commonly used methods, with an accuracy reaching 92% in cross-validation. Furthermore, most of the binding specificities learned by our method resemble the specificities of TFs widely recognized as key players in heart development and differentiation, such as SRF, MEF2, ETS1, SMAD, and GATA. Using our classifier as a predictor, a genome-wide scan identified over 40,000 novel human heart enhancers. Although the classifier used no gene expression information, these novel enhancers are strongly associated with genes expressed in the heart. Finally, in vivo tests of our predictions in mouse and zebrafish achieved a validation rate of 62%, significantly higher than what is expected by chance. These results support the existence of underlying cis-regulatory codes dictating tissue-specific transcription in mammalian genomes and validate our enhancer classifier strategy as a method to uncover these regulatory codes

    A large-scale study of SNPs in regulatory elements of

    No full text
    The synthesis of a protein in eukaryotic organisms involves the transcription of a molecule of DNA to RNA (pre-mRNA), the excision of introns and fusion of exons by a process named splicing (yielding the mature mRNA molecule) and finally the translation of mRNA to protein. Not all exons of a gene are necessarily included in a mRNA, generating different isoforms of a protein, a phenomenon named alternative splicing. As a result, the mature mRNA may contain different combinations of exons, longer or shorter exons and even incorporate intronic sequences. The recognition of splice sites (exon-intron borders) involves i-specific sequences in the mRNA known as exonic or intronic splicing enhancers/silencers and iimany protein and ribonucleoprotein factors that recognize the sequences described in (i). Mutations that alter regulatory elements in the mRNA (i), thus interfering in the recognition by the factors described in (ii) may significantly alter the inclusion of a given exon [1]. It is known that mutations in a single nucleotide in specific positions of regulatory elements can interfere in the splicing of exons regulated by such sequences [2]. A common form of variation between two genomes of the same organism is the single nucleotide polimorfisms (SNPs) [3]. There are many studies that correlate SNPs to human diseases, since a single nucleotide change may radically affect the biochemical properties of a protein and may change its expression patterns. Our aim is to perform a large scale analysis of the occurrence of human SNPs in regulatory elements of alternative splicing, using a transcriptome database available in our lab. The database allows one to verify the frequency of SNPs in exons, introns and regulatory elements and make correlation with transcripts. OMIM [4] wi..
    corecore