198 research outputs found

    Bioinformatics approaches for cancer research

    Get PDF
    Cancer is the consequence of genetic alterations that influence the behavior of affected cells. While the phenotypic effects of cancer like infinite proliferation are common hallmarks of this complex class of diseases, the connections between the genetic alterations and these effects are not always evident. The growth of information generated by experimental high-throughput techniques makes it possible to combine heterogeneous data from different sources to gain new insights into these complex molecular processes. The demand on computational biology to develop tools and methods to facilitate the evaluation of such data has increased accordingly. To this end, we developed new approaches and bioinformatics tools for the analysis of high-throughput data. Additionally, we integrated these new approaches into our comprehensive C++ framework GeneTrail. GeneTrail presents a powerful package that combines information retrieval, statistical evaluation of gene sets, result presentation, and data exchange. To make GeneTrail';s capabilities available to the research community, we implemented a graphical user interface in PHP and set up a webserver that is world-wide accessible. In this thesis, we discuss newly integrated algorithms and extensions of GeneTrail, as well as some comprehensive studies that have been performed with GeneTrail in the context of cancer research. We applied GeneTrail to analyze properties of tumor-associated antigens to elucidate the mechanisms of antigen candidate selection. Furthermore, we performed an extensive analysis of miRNAs and their putative target pathways and networks in cancer. In the field of differential network analysis, we employed a combination of expression values and topological data to identify patterns of deregulated subnetworks and putative key players for the deregulation. Signatures of deregulated subnetworks may help to predict the sensitivity of tumor subtypes to therapeutic agents and, hence, may be used in the future to guide the selection of optimal agents. Furthermore, the identified putative key players may represent oncogenes, tumor suppressor genes, or other genes that contribute to crucial changes of regulatory and signaling processes in cancer cells and may serve as potential targets for an individualized tumor therapy. With these applications, we demonstrate the usefulness of our GeneTrail package and hope that our work will contribute to a better understanding of cancer.Krebs ist eine Folge von tiefgreifenden genetischen VerĂ€nderungen, die das Verhalten der betroffenen Zellen beeinflussen. WĂ€hrend phĂ€notypische Effekte wie unaufhörliches Wachstum augenscheinliche Merkmale dieser komplexen Klasse von Krankheiten sind, sind die ZusammenhĂ€nge zwischen genetischen VerĂ€nderungen und diesen Effekten oftmals weit weniger offensichtlich. Mit der stetigen Zunahme an Daten, die aus Hochdurchsatz-Verfahren stammen, ist es möglich geworden, heterogene Daten aus verschiedenen Quellen zu kombinieren und neue Erkenntnisse ĂŒber diese ZusammenhĂ€nge zu gewinnen. Dementsprechend sind auch die Anforderungen an die Bioinformatik gewachsen, geeignete Applikationen und Verfahren zu entwickeln, um die Auswertung solcher Daten zu vereinfachen. Zu diesem Zweck haben wir neue AnsĂ€tze und bioinformatische Werkzeuge fĂŒr die Analyse von entsprechenden Daten fĂŒr die Krebsforschung entwickelt, welche wir in unser umfangreiches C++ System GeneTrail integriert haben. GeneTrail stellt ein mĂ€chtiges Softwarepaket dar, das Informationsgewinnung, statistische Auswertung von Gen Mengen, visuelle Darstellung der Resultate und Datenaustausch kombiniert. Um GeneTrail';s FĂ€higkeiten der Forschungsgemeinschaft zugĂ€nglich zu machen, haben wir eine graphische Benutzerschnittstelle in PHP implementiert und einen Webserver aufgesetzt, auf den weltweit zugegriffen werden kann. In der vorliegenden Arbeit diskutieren wir neu integrierte Algorithmen und Erweiterungen von GeneTrail, sowie umfangreiche Untersuchungen im Bereich Krebsforschung, die mit GeneTrail durchgefĂŒhrt wurden. Wir haben GeneTrail angewendet, um Eigenschaften von Tumorantigenen zu untersuchen, um aufzuklĂ€ren, welche dieser Eigenschaften zur Selektion dieser Proteine als Antigene beitragen. Des Weiteren haben wir eine umfangreiche Analyse von miRNAs und deren potentiellen Zielpfaden und -netzen in verschiedenen Krebsarten durchgefĂŒhrt. Im Bereich differentieller Netzwerkanalyse kombinierten wir Expressionswerte und topologische Netzwerkdaten, um Muster deregulierter Teilnetzwerke und mögliche SchlĂŒsselgene fĂŒr die Deregulation zu identifizieren. Signaturen deregulierter Teilnetzwerke können helfen die SensitivitĂ€t verschiedener Tumorarten gegenĂŒber Therapeutika vorherzusagen und damit zukĂŒnftig eine optimal angepasste Therapie zu ermöglichen. Außerdem können die identifizierten potentiellen SchlĂŒsselgene Oncogene, Tumorsuppressorgene, oder andere Gene darstellen, die zu wichtigen Änderungen von regulatorischen Prozessen in Krebszellen beitragen, und damit auch als potentielle Ziele fĂŒr eine individuelle Tumortherapie in Frage kommen. Mit diesen Anwendungen untermauern wir den Nutzen von GeneTrail und hoffen, dass unsere Arbeit in Zukunft zu einem besseren VerstĂ€ndnis von Krebs beitrĂ€gt

    DynaVenn: web-based computation of the most significant overlap between ordered sets

    Get PDF
    Background: In many research disciplines, ordered lists are compared. One example is to compare a subset of all significant genes or proteins in a primary study to those in a replication study. Often, the top of the lists are compared using Venn diagrams, ore more precisely Euler diagrams (set diagrams showing logical relations between a finite collection of different sets). If different cohort sizes, different techniques or algorithms for evaluation were applied, a direct comparison of significant genes with a fixed threshold can however be misleading and approaches comparing lists would be more appropriate. Results: We developed DynaVenn, a web-based tool that incrementally creates all possible subsets from two or three ordered lists and computes for each combination a p-value for the overlap. Respectively, dynamic Venn diagrams are generated as graphical representations. Additionally an animation is generated showing how the most significant overlap is reached by backtracking. We demonstrate the improved performance of DynaVenn over an arbitrary cut-off approach on an Alzheimer’s Disease biomarker set. Conclusion: DynaVenn combines the calculation of the most significant overlap of different cohorts with an intuitive visualization of the results. It is freely available as a web service at http://www.ccb.uni-saarland.de/dynavenn

    PLSDB: a resource of complete bacterial plasmids

    Get PDF
    The study of bacterial isolates or communities requires the analysis of the therein included plasmids in order to provide an extensive characterization of the organisms. Plasmids harboring resistance and virulence factors are of especial interest as they contribute to the dissemination of antibiotic resistance. As the number of newly sequenced bacterial genomes is growing a comprehensive resource is required which will allow to browse and filter the available plasmids, and to perform sequence analyses. Here, we present PLSDB, a resource containing 13 789 plasmid records collected from the NCBI nucleotide database. The web server provides an interactive view of all obtained plasmids with additional meta information such as sequence characteristics, sample-related information and taxonomy. Moreover, nucleotide sequence data can be uploaded to search for short nucleotide sequences (e.g. specific genes) in the plasmids, to compare a given plasmid to the records in the collection or to determine whether a sample contains one or multiple of the known plasmids (containment analysis). The resource is freely accessible under https://ccbmicrobe.cs.uni-saarland.de/plsdb/

    GraBCas: a bioinformatics tool for score-based prediction of Caspase- and Granzyme B-cleavage sites in protein sequences

    Get PDF
    Caspases and granzyme B are proteases that share the primary specificity to cleave at the carboxyl terminal of aspartate residues in their substrates. Both, caspases and granzyme B are enzymes that are involved in fundamental cellular processes and play a central role in apoptotic cell death. Although various targets are described, many substrates still await identification and many cleavage sites of known substrates are not identified or experimentally verified. A more comprehensive knowledge of caspase and granzyme B substrates is essential to understand the biological roles of these enzymes in more detail. The relatively high variability in cleavage site recognition sequence often complicates the identification of cleavage sites. As of yet there is no software available that allows identification of caspase and/or granzyme with cleavage sites differing from the consensus sequence. Here, we present a bioinformatics tool ‘GraBCas’ that provides score-based prediction of potential cleavage sites for the caspases 1–9 and granzyme B including an estimation of the fragment size. We tested GraBCas on already known substrates and showed its usefulness for protein sequence analysis. GraBCas is available at

    Microrna response of primary human macrophages to Arcobacter Butzleri infection

    Get PDF
    The role of microRNAs (miRNAs) in infectious diseases is becoming more and more apparent, and the use of miRNAs as a diagnostic tool and their therapeutic application has become the major focus of investigation. The aim of this study was to identify miRNAs involved in the immune signaling of macrophages in response to Arcobacter (A.) butzleri infection, an emerging foodborne pathogen causing gastroenteritis. Therefore, primary human macrophages were isolated and infected, and miRNA expression was studied by means of RNAseq. Analysis of the data revealed the expression of several miRNAs, which were previously associated with bacterial infections such as miR-155, miR-125, and miR-212. They were shown to play a key role in Toll-like receptor signaling where they act as fine-tuners to establish a balanced immune response. In addition, miRNAs which have yet not been identified during bacterial infections such as miR-3613, miR-2116, miR-671, miR-30d, and miR-629 were differentially regulated in A. butzleri-infected cells. Targets of these miRNAs accumulated in pathways such as apoptosis and endocytosis — processes that might be involved in A. butzleri pathogenesis. Our study contributes new findings about the interaction of A. butzleri with human innate immune cells helping to understand underlying regulatory mechanisms in macrophages during infection

    Prospect and challenge of detecting dynamic gene copy number increases in stem cells by whole genome sequencing

    Get PDF
    Gene amplification is an evolutionarily well-conserved and highly efficient mechanism to increase the amount of specific proteins. In humans, gene amplification is a hallmark of cancer and has recently been found during stem cell differentiation. Amplifications in stem cells are restricted to specific tissue areas and time windows, rendering their detection difficult. Here, we report on the performance of deep WGS sequencing (average 82-fold depth of coverage) on the BGISEQ with nanoball technology to detect amplifications in human mesenchymal and neural stem cells. As reference technology, we applied arraybased comparative genomic hybridization (aCGH), fluorescence in situ hybridization (FISH), and qPCR. Using different in silico strategies for amplification detection, we analyzed the potential of WGS for amplification detection. Our results provide evidence that WGS accurately identifies changes of the copy number profiles in human stem cell differentiation. However, the identified changes are not in all cases consistent between WGS and aCGH. The results between WGS and the validation by qPCR were concordant in 83.3% of all tested 36 cases. In sum, both genome-wide techniques, aCGH and WGS, have unique advantages and specific challenges, calling for locus-specific confirmation by the low-throughput approaches qPCR or FISH

    MicroRNA signature in spermatozoa and seminal plasma of proven fertile men and in testicular tissue of men with obstructive azoospermia

    Get PDF
    MicroRNAs (miRNAs) have recently received a significant amount of attention due to their remarkable influence on post-transcriptional gene regulation. In this study, we aim to provide a catalogue of miRNAs present in spermatozoa, seminal plasma and testicular tissue. Expression profiles of miRNA in spermatozoa and seminal plasma of 16 proven fertile men and testicular tissue of eight men with morphologically and/or histologically confirmed obstructive azoospermia were determined by microarray and RT-qPCR in combination with bioinformatics analyses. A total of 123, 156 and 133 miRNAs were consistently detected in spermatozoa, seminal plasma and testicular tissue respectively. Sixty-four miRNAs were shared across all sample types. Based on miRNAs expression level present in each group, correlation analysis showed moderate-to-strong correlations within the spermatozoa and seminal plasma samples and a wider range of correlations within the testicular tissue samples. The target genes of known miRNAs appeared to be involved in a wide range of biological processes related to reproduction, development and differentiation of germ cells. Our results suggest that there is a certain similarity between spermatozoa and seminal plasma for the relative miRNA expression changes with respect to testicular tissue and provide an overview of the miRNAs present in each sample type

    Transcriptome analysis by GeneTrail revealed regulation of functional categories in response to alterations of iron homeostasis in Arabidopsis thaliana

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>High-throughput technologies have opened new avenues to study biological processes and pathways. The interpretation of the immense amount of data sets generated nowadays needs to be facilitated in order to enable biologists to identify complex gene networks and functional pathways. To cope with this task multiple computer-based programs have been developed. GeneTrail is a freely available online tool that screens comparative transcriptomic data for differentially regulated functional categories and biological pathways extracted from common data bases like KEGG, Gene Ontology (GO), TRANSPATH and TRANSFAC. Additionally, GeneTrail offers a feature that allows screening of individually defined biological categories that are relevant for the respective research topic.</p> <p>Results</p> <p>We have set up GeneTrail for the use of <it>Arabidopsis thaliana</it>. To test the functionality of this tool for plant analysis, we generated transcriptome data of root and leaf responses to Fe deficiency and the Arabidopsis metal homeostasis mutant <it>nas4x-1</it>. We performed Gene Set Enrichment Analysis (GSEA) with eight meaningful pairwise comparisons of transcriptome data sets. We were able to uncover several functional pathways including metal homeostasis that were affected in our experimental situations. Representation of the differentially regulated functional categories in Venn diagrams uncovered regulatory networks at the level of whole functional pathways. Over-Representation Analysis (ORA) of differentially regulated genes identified in pairwise comparisons revealed specific functional plant physiological categories as major targets upon Fe deficiency and in <it>nas4x-1</it>.</p> <p>Conclusion</p> <p>Here, we obtained supporting evidence, that the <it>nas4x-1 </it>mutant was defective in metal homeostasis. It was confirmed that <it>nas4x-1 </it>showed Fe deficiency in roots and signs of Fe deficiency and Fe sufficiency in leaves. Besides metal homeostasis, biotic stress, root carbohydrate, leaf photosystem and specific cell biological categories were discovered as main targets for regulated changes in response to - Fe and <it>nas4x-1</it>. Among 258 differentially expressed genes in response to - Fe and <it>nas4x-1 </it>five functional categories were enriched covering metal homeostasis, redox regulation, cell division and histone acetylation. We proved that GeneTrail offers a flexible and user-adapted way to identify functional categories in large-scale plant transcriptome data sets. The distinguished feature that allowed analysis of individually assembled functional categories facilitated the study of the <it>Arabidopsis thaliana </it>transcriptome.</p
    • 

    corecore