Integrative Bioinformatics Analysis of Proteins Associated with the Cardiorenal Syndrome
The cardiorenal syndrome refers to the coexistence of kidney and cardiovascular disease, where cardiovascular events are the most common cause of death in patients with chronic kidney disease. Both cardiovascular and kidney diseases have been extensively analyzed on a molecular level, resulting in molecular features and associated processes indicating a cross-talk of the two disease etiologies on a pathophysiological level. In order to gain a comprehensive picture of molecular factors contributing to the bidirectional interplay between kidney and cardiovascular system, we mined the scientific literature for molecular features reported as associated with the cardiorenal syndrome, resulting in 280 unique genes/proteins. These features were then analyzed on the level of molecular processes and pathways utilizing various types of protein interaction networks. Next to well-established molecular features associated with the renin-angiotensin system, numerous proteins involved in signal transduction and cell communication were found, involving specific molecular functions covering receptor binding, with the natriuretic peptide receptor and its ligands as a well-known example. An integrated analysis of identified features pinpointed a protein interaction network involving mediators of hemodynamic change and an accumulation of features associated with the endothelin and VEGF signaling pathways. Some of these features may function as novel therapeutic targets.
Improving tuberculosis surveillance by detecting international transmission using publicly available whole genome sequencing data
Improving the surveillance of tuberculosis (TB) is one of the eight core activities identified by the World Health Organization (WHO) and the European Respiratory Society to achieve TB elimination, defined as less than one incident case per million [1]. Monitoring transmission is especially important for multidrug-resistant (MDR) Mycobacterium tuberculosis isolates – defined as being resistant to rifampicin and isoniazid – and for extensively drug-resistant (XDR) M. tuberculosis isolates – defined as MDR isolates with additional resistance to at least one of the fluoroquinolones and at least one of the second-line injectable drugs. In 2017, the WHO estimated that worldwide more than 450,000 people fell ill with MDR-TB and among these, more than 38,000 fell ill with XDR-TB [2].
The rapid advance in molecular typing technology – especially the availability of whole genome sequencing (WGS) to identify and characterise pathogens – gives us the chance to integrate this information into disease surveillance. For TB surveillance, it is possible to combine the results of molecular typing of isolates from the M. tuberculosis complex with traditional epidemiological information to infer or to exclude TB transmission [3,4]. This is of particular relevance if transmission occurs among multiple countries, where epidemiological data such as social contacts are more difficult to get and where data exchange is more difficult to organise. The European Centre for Disease Prevention and Control (ECDC) reported 44 events of international transmission (international clusters) of MDR-TB in different European countries between 2012 and 2015 [5]. In that report, the authors inferred TB transmission using the mycobacterial interspersed repetitive units variable number of tandem repeats (MIRU-VNTR) typing method. However, this method has limitations such as low correlation with epidemiological information in outbreak settings and low discriminatory power [3,6]. In comparison, WGS analysis offers a much higher discriminatory power and allows inferring (or excluding) TB transmission at a higher resolution [4]. In a recent systematic review, van der Werf et al. identified three studies that used WGS to investigate the international transmission of TB [7].
In recent years, the amount of available WGS data has been increasing, especially because sequencing has become cheaper [8]. In addition, more and more authors deposit the raw data of their projects in open access public repositories such as the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI) [9]. These publicly available raw WGS data for thousands of isolates enable re-use and additional analyses at a large and global scale [10]. For example, it is possible to compare genomic data among different studies or countries since the data are available in a single place. Moreover, new software tools can be tested using the same raw WGS data [11]. However, standards in bioinformatics analysis and interpretation of these WGS data for surveillance purposes are not yet fully established [12].
We aimed to assess the usefulness of raw WGS data of global MDR/XDR M. tuberculosis isolates available in public repositories to improve TB surveillance. Specifically, we wanted to identify potential international events of TB transmission and to compare the international isolates with a collection of M. tuberculosis isolates collected in Germany in 2012 and 2013.
The IntAct molecular interaction database in 2012
IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As of September 2011, IntAct contains approximately 275 000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intac
Computational pan-genome mapping and pairwise SNP-distance improve detection of Mycobacterium tuberculosis transmission clusters.
Next-generation sequencing based base-by-base distance measures have become an integral complement to epidemiological investigation of infectious disease outbreaks. This study introduces PANPASCO, a computational pan-genome mapping based, pairwise distance method that is highly sensitive to differences between cases, even when located in regions of lineage specific reference genomes. We show that our approach is superior to previously published methods in several datasets and across different Mycobacterium tuberculosis lineages, as its characteristics allow the comparison of a high number of diverse samples in one analysis, a scenario that becomes more and more likely with the increased usage of whole-genome sequencing in transmission surveillance.
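To illustrate the general idea of a base-by-base pairwise distance, the following is a minimal sketch and not the PANPASCO implementation. It assumes that all samples have already been aligned to a common coordinate system of equal length, and that `N` and `-` mark positions without a reliable base call. The function names and the toy sequences are hypothetical. Positions are excluded per pair of samples, so a gap in one sample does not remove that position from comparisons between the other samples.

```python
from itertools import combinations

# Assumption: these symbols mark positions without a reliable base call.
MISSING = {"N", "-"}


def pairwise_snp_distance(a: str, b: str) -> int:
    """Count differing positions, skipping any position missing in either sample."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for x, y in zip(a.upper(), b.upper())
        if x not in MISSING and y not in MISSING and x != y
    )


def distance_matrix(samples: dict[str, str]) -> dict[tuple[str, str], int]:
    """Return the distance for every unordered pair of sample names."""
    return {
        (p, q): pairwise_snp_distance(samples[p], samples[q])
        for p, q in combinations(sorted(samples), 2)
    }


# Hypothetical toy data, not taken from any study.
samples = {
    "isolate_A": "ACGTACGTAC",
    "isolate_B": "ACGTACGTAT",
    "isolate_C": "ACNTACGAAT",
}
distances = distance_matrix(samples)
for pair, d in distances.items():
    print(pair, d)
```

In a surveillance setting, such distances would typically be compared against a threshold to propose transmission clusters; the choice of threshold is study-specific and is not implied by this sketch.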
seq-seq-pan: building a computational pan-genome data structure on whole genome alignment
Background: The increasing application of next generation sequencing technologies has led to the availability of thousands of reference genomes, often providing multiple genomes for the same or closely related species. The current approach to represent a species or a population with a single reference sequence and a set of variations cannot represent their full diversity and introduces bias towards the chosen reference. There is a need for the representation of multiple sequences in a composite way that is compatible with existing data sources for annotation and suitable for established sequence analysis methods. At the same time, this representation needs to be easily accessible and extendable to account for the constant change of available genomes. Results: We introduce seq-seq-pan, a framework that provides methods for adding or removing new genomes from a set of aligned genomes and uses these to construct a whole genome alignment. Throughout the sequential workflow the alignment is optimized for generating a representative linear presentation of the aligned set of genomes, that enables its usage for annotation and in downstream analyses. Conclusions: By providing dynamic updates and optimized processing, our approach enables the usage of whole genome alignment in the field of pan-genomics. In addition, the sequential workflow can be used as a fast alternative to existing whole genome aligners for aligning closely related genomes. seq-seq-pan is freely available at https://gitlab.com/rki_bioinformatic
Additional file 1 of seq-seq-pan: building a computational pan-genome data structure on whole genome alignment
Supplementary Information for “seq-seq-pan: Building a computational pan-genome data structure on whole genome alignment”. Additional file 1 provides details on implementation of the sequential workflow for whole genome alignment. (PDF 183 kb)
Clinical and Molecular Heterogeneity of RTEL1 Deficiency
Typical features of dyskeratosis congenita (DC) resulting from excessive telomere shortening include bone marrow failure (BMF), mucosal fragility, and pulmonary or liver fibrosis. In more severe cases, immune deficiency and recurring infections can add to disease severity. RTEL1 deficiency has recently been described as a major genetic etiology, but the molecular basis and clinical consequences of RTEL1-associated DC are incompletely characterized. We report our observations in a cohort of six patients: five with novel biallelic RTEL1 mutations p.Trp456Cys, p.Ile425Thr, p.Cys1244ProfsX17, p.Pro884_Gln885ins53X13, and one with novel heterozygous mutation p.Val796AlafsX4. The most unifying features were hypocellular BMF in 6/6 and B-/NK-cell lymphopenia in 5/6 patients. In addition, three patients with homozygous mutations p.Trp456Cys or p.Ile425Thr also suffered from immunodeficiency, cerebellar hypoplasia, and enteropathy, consistent with Hoyeraal-Hreidarsson syndrome. Chromosomal breakage resembling a homologous recombination defect was detected in patient-derived fibroblasts but not in hematopoietic compartment. Notably, in both cellular compartments, differential expression of 1243aa and 1219/1300aa RTEL1 isoforms was observed. In fibroblasts, response to ionizing irradiation and non-homologous end joining were not impaired. Telomeric circles did not accumulate in patient-derived primary cells and lymphoblastoid cell lines, implying alternative pathomechanisms for telomeric loss. Overall, RTEL1-deficient cells exhibited a phenotype of replicative exhaustion, spontaneous apoptosis and senescence. Specifically, CD34(+) cells failed to expand in vitro, B-cell development was compromised, and T-cells did not proliferate in long-term culture. Finally, we report on the natural history and outcome of our patients. While two patients died from infections, hematopoietic stem cell transplantation (HSCT) resulted in sustained engraftment in two patients. 
Whether chemotherapy negatively affects the course and onset of other DC-related symptoms remains open at present. Early-onset lung disease occurred in one of our patients after HSCT. In conclusion, RTEL1 deficiency can show a heterogeneous clinical picture ranging from mild hypocellular BMF with B/NK cell lymphopenia to early-onset, very severe, and rapidly progressing cellular deficiency.