Search CORE

7 research outputs found

SERpredict: Detection of tissue- or tumor-specific isoforms generated through exonization of transposable elements

Author: Ast Gil
Mersch Britta
Sela Noa
Suhai Sandor
Wagenblatt Agnes Hotz-
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Background: Transposed elements (TEs) are known to affect transcriptomes, because either new exons are generated from intronic transposed elements (this is called exonization), or the element inserts into the exon, leading to a new transcript. Several examples in the literature show that isoforms generated by an exonization are specific to a certain tissue (for example the heart muscle) or inflict a disease. Thus, exonizations can have negative effects for the transcriptome of an organism. Results: As we aimed at detecting other tissue- or tumor-specific isoforms in human and mouse genomes which were generated through exonization of a transposed element, we designed the automated analysis pipeline SERpredict (SER = Specific Exonized Retroelement) making use of Bayesian Statistics. With this pipeline, we found several genes in which a transposed element formed a tissue- or tumor-specific isoform. Conclusion: Our results show that SERpredict produces relevant results, demonstrating the importance of transposed elements in shaping both the human and the mouse transcriptomes. The effect of transposed elements on the human transcriptome is several times higher than the effect on the mouse transcriptome, due to the contribution of the primate-specific Alu element

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

PubMed Central

cDNA2Genome: A tool for mapping and annotating cDNAs

Author: del Val Coral
Glatting Karl-Heinz
Suhai Sandor
Publication venue: BioMed Central
Publication date: 01/01/2003
Field of study

BACKGROUND: In the last years several high-throughput cDNA sequencing projects have been funded worldwide with the aim of identifying and characterizing the structure of complete novel human transcripts. However some of these cDNAs are error prone due to frameshifts and stop codon errors caused by low sequence quality, or to cloning of truncated inserts, among other reasons. Therefore, accurate CDS prediction from these sequences first require the identification of potentially problematic cDNAs in order to speed up the posterior annotation process. RESULTS: cDNA2Genome is an application for the automatic high-throughput mapping and characterization of cDNAs. It utilizes current annotation data and the most up to date databases, especially in the case of ESTs and mRNAs in conjunction with a vast number of approaches to gene prediction in order to perform a comprehensive assessment of the cDNA exon-intron structure. The final result of cDNA2Genome is an XML file containing all relevant information obtained in the process. This XML output can easily be used for further analysis such us program pipelines, or the integration of results into databases. The web interface to cDNA2Genome also presents this data in HTML, where the annotation is additionally shown in a graphical form. cDNA2Genome has been implemented under the W3H task framework which allows the combination of bioinformatics tools in tailor-made analysis task flows as well as the sequential or parallel computation of many sequences for large-scale analysis. CONCLUSIONS: cDNA2Genome represents a new versatile and easily extensible approach to the automated mapping and annotation of human cDNAs. The underlying approach allows sequential or parallel computation of sequences for high-throughput analysis of cDNAs

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Exchange energy density and exchange potential via a hartree-fock plus mp2 study of the electron liquid in the ground-state conformer of glycine

Author: Knapp-Mohammady Michaela
March Norman H.
Suhai Sandor
Van Alsenoy Christian
Publication venue
Publication date: 01/01/2008
Field of study

Institutional Repository Universiteit Antwerpen

High-throughput protein analysis integrating bioinformatics and experimental assays

Author: del Val Coral
Falkenhahn Mechthild
Glatting Karl-Heinz
Mehrle Alexander
Poustka Annemarie
Seiler Markus
Suhai Sandor
Wiemann Stefan
Publication venue: Oxford University Press
Publication date: 03/02/2004
Field of study

The wealth of transcript information that has been made publicly available in recent years requires the development of high-throughput functional genomics and proteomics approaches for its analysis. Such approaches need suitable data integration procedures and a high level of automation in order to gain maximum benefit from the results generated. We have designed an automatic pipeline to analyse annotated open reading frames (ORFs) stemming from full-length cDNAs produced mainly by the German cDNA Consortium. The ORFs are cloned into expression vectors for use in large-scale assays such as the determination of subcellular protein localization or kinase reaction specificity. Additionally, all identified ORFs undergo exhaustive bioinformatic analysis such as similarity searches, protein domain architecture determination and prediction of physicochemical characteristics and secondary structure, using a wide variety of bioinformatic methods in combination with the most up-to-date public databases (e.g. PRINTS, BLOCKS, INTERPRO, PROSITE SWISSPROT). Data from experimental results and from the bioinformatic analysis are integrated and stored in a relational database (MS SQL-Server), which makes it possible for researchers to find answers to biological questions easily, thereby speeding up the selection of targets for further analysis. The designed pipeline constitutes a new automatic approach to obtaining and administrating relevant biological data from high-throughput investigations of cDNAs in order to systematically identify and characterize novel genes, as well as to comprehensively describe the function of the encoded proteins

Crossref

PubMed Central

CAFTAN: a tool for fast mapping, and quality assessment of cDNAs

Author: del Val Coral
Ernst Peter
Glatting Karl-Heinz
Hotz-Wagenblatt Agnes
Kuryshev Vladimir
Poustka Annemarie
Suhai Sandor
Wiemann Stefan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2006
Field of study

Abstract Background The German cDNA Consortium has been cloning full length cDNAs and continued with their exploitation in protein localization experiments and cellular assays. However, the efficient use of large cDNA resources requires the development of strategies that are capable of a speedy selection of truly useful cDNAs from biological and experimental noise. To this end we have developed a new high-throughput analysis tool, CAFTAN, which simplifies these efforts and thus fills the gap between large-scale cDNA collections and their systematic annotation and application in functional genomics. Results CAFTAN is built around the mapping of cDNAs to the genome assembly, and the subsequent analysis of their genomic context. It uses sequence features like the presence and type of PolyA signals, inner and flanking repeats, the GC-content, splice site types, etc. All these features are evaluated in individual tests and classify cDNAs according to their sequence quality and likelihood to have been generated from fully processed mRNAs. Additionally, CAFTAN compares the coordinates of mapped cDNAs with the genomic coordinates of reference sets from public available resources (e.g., VEGA, ENSEMBL). This provides detailed information about overlapping exons and the structural classification of cDNAs with respect to the reference set of splice variants. The evaluation of CAFTAN showed that is able to correctly classify more than 85% of 5950 selected "known protein-coding" VEGA cDNAs as high quality multi- or single-exon. It identified as good 80.6 % of the single exon cDNAs and 85 % of the multiple exon cDNAs. The program is written in Perl and in a modular way, allowing the adoption of this strategy to other tasks like EST-annotation, or to extend it by adding new classification rules and new organism databases as they become available. We think that it is a very useful program for the annotation and research of unfinished genomes. Conclusion CAFTAN is a high-throughput sequence analysis tool, which performs a fast and reliable quality prediction of cDNAs. Several thousands of cDNAs can be analyzed in a short time, giving the curator/scientist a first quick overview about the quality and the already existing annotation of a set of cDNAs. It supports the rejection of low quality cDNAs and helps in the selection of likely novel splice variants, and/or completely novel transcripts for new experiments.</p

Directory of Open Access Journals

Epigenetic Reactivation of Tumor Suppressor Genes by a Novel Small-Molecule Inhibitor of Human DNA Methyltransferases

Author: Bodo Brueckner
Frank Lyko
H. Christian Kliem
Manfred Wiessler
Pawel Siedlecki
Piotr Zielenkiewicz
Regine Garcia Boy
Sandor Suhai
Tanja Musch
Publication venue: 'American Association for Cancer Research (AACR)'
Publication date
Field of study

Crossref

CAFTAN: a tool for fast mapping, and quality assessment of cDNAs

Author: Agnes Hotz-Wagenblatt
Annemarie Poustka
B Knebelmann
B Modrek
C Burge
C del Val
C Del Val
CA Hayden
Coral del Val
D Bannasch
DF Colgan
DHHWSCLURHBSMABDSISM Arlt
DJ Rowold
E Beaudoing
E Beaudoing
F Gebauer
G Lev-Maor
H Kiyosawa
H Konno
J Takeda
JC Simpson
JE Stajich
JL Ashurst
Karl-Heinz Glatting
KV Prasanth
L Florea
L Stein
LF Lareau
M Gardiner-Garden
M Zhang
MK Sakharkar
N Volfovsky
NJ Sakabe
P Carninci
P Ernst
P Rice
PA Galante
Peter Ernst
R Kikuno
R Mott
R Wellenreuther
R Yamashita
RL Strausberg
S Wiemann
S Wiemann
Sandor Suhai
SJ Wheelan
Stefan Wiemann
T Hubbard
T Imanishi
T Ota
V Gotea
V Starkuviene
Vladimir Yurjevich Kuryshev
WJ Kent
Y Okazaki
Y Suzuki
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref