47 research outputs found

    Evolutionarily conserved and diverged alternative splicing events show different expression and functional profiles

    Get PDF
    To better decipher the functional impact of alternative splicing, we classified alternative splicing events in 10 818 pairs of human and mouse genes based on conservation at genome and transcript levels. Expression levels of conserved alternative splices in human and mouse expressed sequence tag databases show strong correlation, indicating that alternative splicing is similarly regulated in both species. A total of 43% (8921) of mouse alternative splices could be found in the human genome but not in human transcripts. Five of eleven tested mouse predictions were observed in human tissues, demonstrating that mouse transcripts provide a valuable resource for identifying alternative splicing events in human genes. Combining gene-specific measures of conserved and diverged alternative splicing with both gene classification based on Gene Ontology (GO) and microarray-determined gene expression in 52 diverse human tissues and cell lines, we found conserved alternative splicing most enriched in brain-expressed signaling pathways. Diverged alternative splicing is more prevalent in testis and cancerous cell line up-regulated processes, including protein biosynthesis, responses to stress and responses to endogenous stimuli. Using conservation as a surrogate for functional significance, these results suggest that alternative splicing plays an important role in enhancing the functional capacity of central nervous systems, while non-functional splicing more frequently occurs in testis and cell lines, possibly as a result of cellular stress and rapid proliferation

    An Intronic Signal for Alternative Splicing in the Human Genome

    Get PDF
    An important level at which the expression of programmed cell death (PCD) genes is regulated is alternative splicing. Our previous work identified an intronic splicing regulatory element in caspase-2 (casp-2) gene. This 100-nucleotide intronic element, In100, consists of an upstream region containing a decoy 3′ splice site and a downstream region containing binding sites for splicing repressor PTB. Based on the signal of In100 element in casp-2, we have detected the In100-like sequences as a family of sequence elements associated with alternative splicing in the human genome by using computational and experimental approaches. A survey of human genome reveals the presence of more than four thousand In100-like elements in 2757 genes. These In100-like elements tend to locate more frequent in intronic regions than exonic regions. EST analyses indicate that the presence of In100-like elements correlates with the skipping of their immediate upstream exons, with 526 genes showing exon skipping in such a manner. In addition, In100-like elements are found in several human caspase genes near exons encoding the caspase active domain. RT-PCR experiments show that these caspase genes indeed undergo alternative splicing in a pattern predicted to affect their functional activity. Together, these results suggest that the In100-like elements represent a family of intronic signals for alternative splicing in the human genome

    A comprehensive transcript index of the human genome generated using microarrays and computational approaches

    Get PDF
    BACKGROUND: Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. RESULTS: The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. CONCLUSIONS: These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized

    Oncogenic ERBB3 Mutations in Human Cancers

    Get PDF
    SummaryThe human epidermal growth factor receptor (HER) family of tyrosine kinases is deregulated in multiple cancers either through amplification, overexpression, or mutation. ERBB3/HER3, the only member with an impaired kinase domain, although amplified or overexpressed in some cancers, has not been reported to carry oncogenic mutations. Here, we report the identification of ERBB3 somatic mutations in ∼11% of colon and gastric cancers. We found that the ERBB3 mutants transformed colonic and breast epithelial cells in a ligand-independent manner. However, the mutant ERBB3 oncogenic activity was dependent on kinase-active ERBB2. Furthermore, we found that anti-ERBB antibodies and small molecule inhibitors effectively blocked mutant ERBB3-mediated oncogenic signaling and disease progression in vivo

    ACC2 Is Expressed at High Levels Human White Adipose and Has an Isoform with a Novel N-Terminus

    Get PDF
    Acetyl-CoA carboxylases ACC1 and ACC2 catalyze the carboxylation of acetyl-CoA to malonyl-CoA, regulating fatty-acid synthesis and oxidation, and are potential targets for treatment of metabolic syndrome. Expression of ACC1 in rodent lipogenic tissues and ACC2 in rodent oxidative tissues, coupled with the predicted localization of ACC2 to the mitochondrial membrane, have suggested separate functional roles for ACC1 in lipogenesis and ACC2 in fatty acid oxidation. We find, however, that human adipose tissue, unlike rodent adipose, expresses more ACC2 mRNA relative to the oxidative tissues muscle and heart. Human adipose, along with human liver, expresses more ACC2 than ACC1. Using RT-PCR, real-time PCR, and immunoprecipitation we report a novel isoform of ACC2 (ACC2.v2) that is expressed at significant levels in human adipose. The protein generated by this isoform has enzymatic activity, is endogenously expressed in adipose, and lacks the N-terminal sequence. Both ACC2 isoforms are capable of de novo lipogenesis, suggesting that ACC2, in addition to ACC1, may play a role in lipogenesis. The results demonstrate a significant difference in ACC expression between human and rodents, which may introduce difficulties for the use of rodent models for development of ACC inhibitors

    Deep RNA sequencing analysis of readthrough gene fusions in human prostate adenocarcinoma and reference samples

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Readthrough fusions across adjacent genes in the genome, or transcription-induced chimeras (TICs), have been estimated using expressed sequence tag (EST) libraries to involve 4-6% of all genes. Deep transcriptional sequencing (RNA-Seq) now makes it possible to study the occurrence and expression levels of TICs in individual samples across the genome.</p> <p>Methods</p> <p>We performed single-end RNA-Seq on three human prostate adenocarcinoma samples and their corresponding normal tissues, as well as brain and universal reference samples. We developed two bioinformatics methods to specifically identify TIC events: a targeted alignment method using artificial exon-exon junctions within 200,000 bp from adjacent genes, and genomic alignment allowing splicing within individual reads. We performed further experimental verification and characterization of selected TIC and fusion events using quantitative RT-PCR and comparative genomic hybridization microarrays.</p> <p>Results</p> <p>Targeted alignment against artificial exon-exon junctions yielded 339 distinct TIC events, including 32 gene pairs with multiple isoforms. The false discovery rate was estimated to be 1.5%. Spliced alignment to the genome was less sensitive, finding only 18% of those found by targeted alignment in 33-nt reads and 59% of those in 50-nt reads. However, spliced alignment revealed 30 cases of TICs with intervening exons, in addition to distant inversions, scrambled genes, and translocations. Our findings increase the catalog of observed TIC gene pairs by 66%.</p> <p>We verified 6 of 6 predicted TICs in all prostate samples, and 2 of 5 predicted novel distant gene fusions, both private events among 54 prostate tumor samples tested. Expression of TICs correlates with that of the upstream gene, which can explain the prostate-specific pattern of some TIC events and the restriction of the <it>SLC45A3-ELK4 </it>e4-e2 TIC to <it>ERG</it>-negative prostate samples, as confirmed in 20 matched prostate tumor and normal samples and 9 lung cancer cell lines.</p> <p>Conclusions</p> <p>Deep transcriptional sequencing and analysis with targeted and spliced alignment methods can effectively identify TIC events across the genome in individual tissues. Prostate and reference samples exhibit a wide range of TIC events, involving more genes than estimated previously using ESTs. Tissue specificity of TIC events is correlated with expression patterns of the upstream gene. Some TIC events, such as <it>MSMB-NCOA4</it>, may play functional roles in cancer.</p

    The BioMart community portal: an innovative alternative to large, centralized data repositories.

    Get PDF
    The BioMart Community Portal (www.biomart.org) is a community-driven effort to provide a unified interface to biomedical databases that are distributed worldwide. The portal provides access to numerous database projects supported by 30 scientific organizations. It includes over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. All resources available through the portal are independently administered and funded by their host organizations. The BioMart data federation technology provides a unified interface to all the available data. The latest version of the portal comes with many new databases that have been created by our ever-growing community. It also comes with better support and extensibility for data analysis and visualization tools. A new addition to our toolbox, the enrichment analysis tool is now accessible through graphical and web service interface. The BioMart community portal averages over one million requests per day. Building on this level of service and the wealth of information that has become available, the BioMart Community Portal has introduced a new, more scalable and cheaper alternative to the large data stores maintained by specialized organizations
    corecore