Search CORE

26 research outputs found

Regulation of the nucleosome repeat length in vivo by the DNA sequence, protein concentrations and long-range interactions.

Author: Beshnova Daria A
Cherstvy Andrey G
Teif Vladimir B
Vainshtein Yevhen
Publication venue: Public Library of Science (PLoS)
Publication date: 01/07/2014
Field of study

The nucleosome repeat length (NRL) is an integral chromatin property important for its biological functions. Recent experiments revealed several conflicting trends of the NRL dependence on the concentrations of histones and other architectural chromatin proteins, both in vitro and in vivo, but a systematic theoretical description of NRL as a function of DNA sequence and epigenetic determinants is currently lacking. To address this problem, we have performed an integrative biophysical and bioinformatics analysis in species ranging from yeast to frog to mouse where NRL was studied as a function of various parameters. We show that in simple eukaryotes such as yeast, a lower limit for the NRL value exists, determined by internucleosome interactions and remodeler action. For higher eukaryotes, also the upper limit exists since NRL is an increasing but saturating function of the linker histone concentration. Counterintuitively, smaller H1 variants or non-histone architectural proteins can initiate larger effects on the NRL due to entropic reasons. Furthermore, we demonstrate that different regimes of the NRL dependence on histone concentrations exist depending on whether DNA sequence-specific effects dominate over boundary effects or vice versa. We consider several classes of genomic regions with apparently different regimes of the NRL variation. As one extreme, our analysis reveals that the period of oscillations of the nucleosome density around bound RNA polymerase coincides with the period of oscillations of positioning sites of the corresponding DNA sequence. At another extreme, we show that although mouse major satellite repeats intrinsically encode well-defined nucleosome preferences, they have no unique nucleosome arrangement and can undergo a switch between two distinct types of nucleosome positioning

University of Essex Research Repository

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Nucleosome repositioning links DNA (de)methylation and differential CTCF binding during stem cell development

Author: Beshnova Daria A
Höfer Thomas
Mallm Jan-Philipp
Marth Caroline
Rippe Karsten
Teif Vladimir B
Vainshtein Yevhen
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 09/05/2014
Field of study

During differentiation of embryonic stem cells, chromatin reorganizes to establish cell type-specific expression programs. Here, we have dissected the linkages between DNA methylation (5mC), hydroxymethylation (5hmC), nucleosome repositioning, and binding of the transcription factor CTCF during this process. By integrating MNase-seq and ChIP-seq experiments in mouse embryonic stem cells (ESC) and their differentiated counterparts with biophysical modeling, we found that the interplay between these factors depends on their genomic context. The mostly unmethylated CpG islands have reduced nucleosome occupancy and are enriched in cell type-independent binding sites for CTCF. The few remaining methylated CpG dinucleotides are preferentially associated with nucleosomes. In contrast, outside of CpG islands most CpGs are methylated, and the average methylation density oscillates so that it is highest in the linker region between nucleosomes. Outside CpG islands, binding of TET1, an enzyme that converts 5mC to 5hmC, is associated with labile, MNase-sensitive nucleosomes. Such nucleosomes are poised for eviction in ESCs and become stably bound in differentiated cells where the TET1 and 5hmC levels go down. This process regulates a class of CTCF binding sites outside CpG islands that are occupied by CTCF in ESCs but lose the protein during differentiation. We rationalize this cell type-dependent targeting of CTCF with a quantitative biophysical model of competitive binding with the histone octamer, depending on the TET1, 5hmC, and 5mC state

University of Essex Research Repository

Crossref

PubMed Central

Novel Transaminase and Laccase from Streptomyces spp. Using Combined Identification Approaches

Author: Ferrandi Erica E.
Grumaz Christian
Monti Daniela
Nikodinović-Runić Jasmina
Senthamaraikannan Ramsankar
Spasić Jelena
Vainshtein Yevhen
Vojnović Sandra
Đokić Lidija
Publication venue: 'MDPI AG'
Publication date: 01/01/2021
Field of study

Three Streptomyces sp. strains with a multitude of target enzymatic activities confirmed by functional screening, namely BV129, BV286 and BV333, were subjected to genome sequencing aiming at the annotation of genes of interest, in-depth bioinformatics characterization and functional expression of the biocatalysts. A whole-genome shotgun sequencing followed by de novo genome assembly and annotation was performed revealing genomes of 6.4, 9.4 and 7.3 Mbp, respectively. Functional annotation of the proteins of interest resulted in between 2047 and 2763 putative targets. Among the various enzymatic activities that the three Streptomyces strains demonstrated to produce by functional screening, we focused our attention on transaminases (TAs) and laccases due to their high biocatalytic potential. Bioinformatics search allowed the identification of a putative TA from Streptomyces sp. BV333 as a potentially novel broad substrate scope TA and a putative laccase from Streptomyces sp. BV286 as potentially novel blue multicopper oxidase. The two sequences were cloned and overexpressed in Escherichia coli and the two novel enzymes, transaminase Sbv333-TA and laccase Sbv286-LAC, were characterized. Interestingly, both enzymes resulted to be exceptionally thermostable, Sbv333-TA showing a melting temperature (T-M = 85 degrees C) only slightly lower compared to the T-M of the most thermostable transaminases described to date (87-88 degrees C) and Sbv286-LAC being even thermoactivated at temperature gt 60 degrees C. Moreover, Sbv333-TA showed a broad substrate scope and remarkably demonstrated to be active in the transamination of beta-ketoesters, which are rarely accepted by currently known TAs. On the other hand, Sbv286-LAC showed an improved activity in the presence of the cosolvent acetonitrile. Overall, it was shown that a combination of approaches from standard microbiological and biochemical screens to genome sequencing and analysis is required to afford novel and functional biocatalysts

ZENODO

Directory of Open Access Journals

imagine

The IronChip evaluation package: a package of perl modules for robust analysis of custom microarrays

Author: A Grewal
A Richter
Alvis Brazma
C Lu
C Schwager
H Eickhoff
M Kapushevsky
M Muckenthaler
M Muckenthaler
Martina U Muckenthaler
Matthias W Hentze
Mayka Sanchez
R Gentleman
S Dudoit
Thomas Dandekar
Yevhen Vainshtein
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Gene expression studies greatly contribute to our understanding of complex relationships in gene regulatory networks. However, the complexity of array design, production and manipulations are limiting factors, affecting data quality. The use of customized DNA microarrays improves overall data quality in many situations, however, only if for these specifically designed microarrays analysis tools are available. Results The IronChip Evaluation Package (ICEP) is a collection of Perl utilities and an easy to use data evaluation pipeline for the analysis of microarray data with a focus on data quality of custom-designed microarrays. The package has been developed for the statistical and bioinformatical analysis of the custom cDNA microarray IronChip but can be easily adapted for other cDNA or oligonucleotide-based designed microarray platforms. ICEP uses decision tree-based algorithms to assign quality flags and performs robust analysis based on chip design properties regarding multiple repetitions, ratio cut-off, background and negative controls. Conclusions ICEP is a stand-alone Windows application to obtain optimal data quality from custom-designed microarrays and is freely available here (see "Additional Files" section) and at: <url>http://www.alice-dsl.net/evgeniy.vainshtein/ICEP/</url></p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Online-Publikations-Server der Universität Würzburg

Nucleosome reorganisation in breast cancer tissues.

Author: Agrawal Shivam P
Armstrong Chris M
Bikova Milena
Brooke Greg N
Chandrasekharan Sankaran
Ciuta Isabella
Clarkson Christopher T
Correa Clark
Dellow Emily
Drobysevskaja Anastasija
Gretton Svetlana
Guiblet Wilfried M
Jacob Divya R
Klenova Elena
Lam Jonathan DG
Mamayusupova Hulkar
Pradeepa Madapura M
Ruje Luminita
Shafiei Navid
Shtumpf Mariya
Sohn Kai
Teif Vladimir B
Thorn Graeme J
Vainshtein Yevhen
Zhurkin Victor B
Publication venue: BioMed Central
Publication date: 01/04/2024
Field of study

Background Nucleosome repositioning in cancer is believed to cause many changes in genome organisation and gene expression. Understanding these changes is important to elucidate fundamental aspects of cancer. It is also important for medical diagnostics based on cell-free DNA (cfDNA), which originates from genomic DNA regions protected from digestion by nucleosomes. Results We have generated high-resolution nucleosome maps in paired tumour and normal tissues from the same breast cancer patients using MNase-assisted histone H3 ChIP-seq and compared them with the corresponding cfDNA from blood plasma. This analysis has detected single-nucleosome repositioning at key regulatory regions in a patient-specific manner and common cancer-specific patterns across patients. The nucleosomes gained in tumour versus normal tissue were particularly informative of cancer pathways, with ~ 20-fold enrichment at CpG islands, a large fraction of which marked promoters of genes encoding DNA-binding proteins. The tumour tissues were characterised by a 5-10 bp decrease in the average distance between nucleosomes (nucleosome repeat length, NRL), which is qualitatively similar to the differences between pluripotent and differentiated cells. This effect was correlated with gene activity, differential DNA methylation and changes in local occupancy of linker histone variants H1.4 and H1X. Conclusions Our study offers a novel resource of high-resolution nucleosome maps in breast cancer patients and reports for the first time the effect of systematic decrease of NRL in paired tumour versus normal breast tissues from the same patient. Our findings provide a new mechanistic understanding of nucleosome repositioning in tumour tissues that can be valuable for patient diagnostics, stratification and monitoring

University of Essex Research Repository

Directory of Open Access Journals

NucTools: analysis of chromatin feature occupancy profiles from high-throughput sequencing data

Author: A Kundaje
A Mammana
A Nellore
A Polishko
A Polishko
A Valouev
A Weiner
AF Bardet
AL Hughes
AN Schep
AR Quinlan
B Langmead
B Wen
BE Bernstein
BS Sexton
C Angelini
C Jiang
C Zang
CY McLean
D Park
DA Beshnova
DA Orlando
DJ Gaffney
DS Johnson
E Eden
EM Berkowitz
EY Chen
F Krueger
F Ramirez
F Zambelli
FJ Sedlazeck
G Längst
G Moyle-Heyrman
GA Orsi
H Ishii
H Ji
H Li
H Thorvaldsdottir
H Younesy
HA Cole
HS Rhee
I Dubchak
I Livyatan
J Becker
J Feng
J Rozowsky
JA West
JD Buenrostro
K Brogaard
K Chen
K Fu
K Liang
Karsten Rippe
KE Holde van
L Teytelman
L Teytelman
L Wang
LD Stein
LN Voong
MJ Guertin
N Kaplan
N Krietenstein
NR Zabet
NU Nair
O Bell
O Flores
O Nikolayeva
P Humburg
PF Kuan
PJ Park
PV Kharchenko
R Jothi
R Schöpflin
RK Auerbach
S Heinz
S Kubik
S Ramachandran
S Woo
T Bailey
TK Kelly
V Gesu Di
VB Teif
VB Teif
VB Teif
VB Teif
Vladimir B. Teif
W Chen
W Huang da
W Ma
WB Langdon
X Zhang
Y Fu
Y Zhang
Y Zhang
Y Zhang
Yevhen Vainshtein
YL Jung
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Background: Biomedical applications of high-throughput sequencing methods generate a vast amount of data in which numerous chromatin features are mapped along the genome. The results are frequently analysed by creating binary data sets that link the presence/absence of a given feature to specific genomic loci. However, the nucleosome occupancy or chromatin accessibility landscape is essentially continuous. It is currently a challenge in the field to cope with continuous distributions of deep sequencing chromatin readouts and to integrate the different types of discrete chromatin features to reveal linkages between them. Results: Here we introduce the NucTools suite of Perl scripts as well as MATLAB- and R-based visualization programs for a nucleosome-centred downstream analysis of deep sequencing data. NucTools accounts for the continuous distribution of nucleosome occupancy. It allows calculations of nucleosome occupancy profiles averaged over several replicates, comparisons of nucleosome occupancy landscapes between different experimental conditions, and the estimation of the changes of integral chromatin properties such as the nucleosome repeat length. Furthermore, NucTools facilitates the annotation of nucleosome occupancy with other chromatin features like binding of transcription factors or architectural proteins, and epigenetic marks like histone modifications or DNA methylation. The applications of NucTools are demonstrated for the comparison of several datasets for nucleosome occupancy in mouse embryonic stem cells (ESCs) and mouse embryonic fibroblasts (MEFs). Conclusions: The typical workflows of data processing and integrative analysis with NucTools reveal information on the interplay of nucleosome positioning with other features such as for example binding of a transcription factor CTCF, regions with stable and unstable nucleosomes, and domains of large organized chromatin K9me2 modifications (LOCKs). As potential limitations and problems we discuss how inter-replicate variability of MNase-seq experiments can be addressed

University of Essex Research Repository

Crossref

Springer - Publisher Connector

Fraunhofer-ePrints

PubMed Central

Anwendung von Mikroarrayanalysen um Genexpressionsmuster zu untersuchen: Ein bioinformatischer Ansatz

Author: Vainshtein Yevhen
Publication venue
Publication date: 01/01/2010
Field of study

The regulation and maintenance of iron homeostasis is critical to human health. As a constituent of hemoglobin, iron is essential for oxygen transport and significant iron deficiency leads to anemia. Eukaryotic cells require iron for survival and proliferation. Iron is part of hemoproteins, iron-sulfur (Fe-S) proteins, and other proteins with functional groups that require iron as a cofactor. At the cellular level, iron uptake, utilization, storage, and export are regulated at different molecular levels (transcriptional, mRNA stability, translational, and posttranslational). Iron regulatory proteins (IRPs) 1 and 2 post-transcriptionally control mammalian iron homeostasis by binding to iron-responsive elements (IREs), conserved RNA stem-loop structures located in the 5’- or 3‘- untranslated regions of genes involved in iron metabolism (e.g. FTH1, FTL, and TFRC). To identify novel IRE-containing mRNAs, we integrated biochemical, biocomputational, and microarray-based experimental approaches. Gene expression studies greatly contribute to our understanding of complex relationships in gene regulatory networks. However, the complexity of array design, production and manipulations are limiting factors, affecting data quality. The use of customized DNA microarrays improves overall data quality in many situations, however, only if for these specifically designed microarrays analysis tools are available. Methods In this project response to the iron treatment was examined under different conditions using bioinformatical methods. This would improve our understanding of an iron regulatory network. For these purposes we used microarray gene expression data. To identify novel IRE-containing mRNAs biochemical, biocomputational, and microarray-based experimental approaches were integrated. IRP/IRE messenger ribonucleoproteins were immunoselected and their mRNA composition was analysed using an IronChip microarray enriched for genes predicted computationally to contain IRE-like motifs. Analysis of IronChip microarray data requires specialized tool which can use all advantages of a customized microarray platform. Novel decision-tree based algorithm was implemented using Perl in IronChip Evaluation Package (ICEP). Results IRE-like motifs were identified from genomic nucleic acid databases by an algorithm combining primary nucleic acid sequence and RNA structural criteria. Depending on the choice of constraining criteria, such computational screens tend to generate a large number of false positives. To refine the search and reduce the number of false positive hits, additional constraints were introduced. The refined screen yielded 15 IRE-like motifs. A second approach made use of a reported list of 230 IRE-like sequences obtained from screening UTR databases. We selected 6 out of these 230 entries based on the ability of the lower IRE stem to form at least 6 out of 7 bp. Corresponding ESTs were spotted onto the human or mouse versions of the IronChip and the results were analysed using ICEP. Our data show that the immunoselection/microarray strategy is a feasible approach for screening bioinformatically predicted IRE genes and the detection of novel IRE-containing mRNAs. In addition, we identified a novel IRE-containing gene CDC14A (Sanchez M, et al. 2006). The IronChip Evaluation Package (ICEP) is a collection of Perl utilities and an easy to use data evaluation pipeline for the analysis of microarray data with a focus on data quality of custom-designed microarrays. The package has been developed for the statistical and bioinformatical analysis of the custom cDNA microarray IronChip, but can be easily adapted for other cDNA or oligonucleotide-based designed microarray platforms. ICEP uses decision tree-based algorithms to assign quality flags and performs robust analysis based on chip design properties regarding multiple repetitions, ratio cut-off, background and negative controls (Vainshtein Y, et al., 2010).Die Regulierung und Aufrechterhaltung der Eisen-Homeostase ist bedeutend für die menschliche Gesundheit. Als Bestandteil des Hämoglobins ist es wichtig für den Transport von Sauerstoff, ein Mangel führt zu Blutarmut. Eukaryotische Zellen benötigen Eisen zum Überleben und zum Proliferieren. Eisen ist am Aufbau von Hämo- und Eisenschwefelproteinen (Fe-S) beteiligt und kann als Kofaktor dienen. Die Aufnahme, Nutzung, Speicherung und der Export von Eisen ist zellulär auf verschiedenen molekularen Ebenen reguliert (Transkription, mRNA-Level, Translation, Protein-Level). Die iron regulatory proteins (IRPs) 1 und 2 kontrollieren die Eisen-Homeostase in Säugetieren posttranslational durch die Bindung an Iron-responsive elements (IREs). IREs sind konservierte RNA stem-loop Strukturen in den 5' oder 3' untranslatierten Bereichen von Genen, die im Eisenmetabolismus involviert sind (z.B. FTH1, FTL und TFRC). In dieser Arbeit wurden biochemische und bioinformatische Methoden mit Microarray-Experimenten kombiniert, um neue mRNAs mit IREs zu identifizieren. Genexpressionsstudien verbessern unser Verständnis über die komplexen Zusammenhänge in genregulatorischen Netzwerken. Das komplexe Design von Microarrays, deren Produktion und Manipulation sind dabei die limitierenden Faktoren bezüglich der Datenqualität. Die Verwendung von angepassten DNA Microarrays verbessert häufig die Datenqualität, falls entsprechende Analysemöglichkeiten für diese Arrays existieren. Methoden Um unser Verständnis von eisenregulierten Netzwerken zu verbessern, wurde im Rahmen dieses Projektes die Auswirkung einer Behandlung mit Eisen bzw. von Knockout Mutation unter verschiedenen Bedingungen mittels bioinformatischer Methoden untersucht. Hierfür nutzen wir Expressionsdaten aus Microarray-Experimenten. Durch die Verknüpfung von biochemischen, bioinformatischen und Microarray Ansätzen können neue Proteine mit IREs identifiziert werden. IRP/IRE messenger Ribonucleoproteine wurden immunpräzipitiert. Die Zusammensetzung der enthaltenen mRNAs wurde mittels einem IronChip Microarray analysiert: Für diesen Chip wurden bioinformatisch Gene vorhergesagt, die IRE-like Motive aufweisen. Der Chip wurde mit solchen Oligonucleotiden beschichtet und durch Hybridisierung überprüft, ob die präzipitierten mRNA sich hieran binden. Die Analyse der erhaltenen Daten erfordert ein spezialisiertes Werkzeug um von allen Vorteilen der angepassten Microarrays zu profitieren. Ein neuer Entscheidungsbaum-basierter Algorithmus wurde in Perl im IronChip Evaluation Package (ICEP) implementiert. Ergebnisse Aus großen Sequenz-Datenbanken wurden IRE-like Motive identifiziert. Dazu kombiniert der Algorithmus, insbesondere RNA-Primärsequenz und RNA-Strukturdaten. Solche Datenbankanalysen tendieren dazu, eine große Anzahl falsch positiver Treffer zu generieren. Daher wurden zusätzliche Bedingungen formuliert, um die Suche zu verfeinern und die Anzahl an falsch positiven Treffer zu reduzieren. Die angepassten Suchkriterien ergaben 15 IRE-like Motive. In einem weiteren Ansatz verwendeten wir eine Liste von 230 IRE-like Sequenzen aus UTR-Datenbanken. Daraus wurden 6 Sequenzen ausgewählt, die auch im unteren Teil stabil sind (untere Helix über 6 bp stabil). Die korrespondierenden Expressed Sequence Tags (ESTs) wurden auf die humane oder murine Version des IronChips aufgetragen. Die Microarray Ergebnisse wurden mit dem ICEP Programm ausgewertet. Unsere Ergebnisse zeigen, dass die Immunpräzipitation mit anschließender Microarrayanalyse ein nützlicher Ansatz ist, um bioinformatisch vorhergesagte IRE-Gene zu identifizieren. Darüber hinaus ermöglicht uns dieser Ansatz die Detektion neuer mRNAs, die IREs enthalten, wie das von uns gefundene Gen CDC14A (Sanchez et al., 2006). ICEP ist ein optimiertes Programmpaket aus Perl Programmen (Vainshtein et al., BMC Bioinformatics, 2010). Es ermöglicht die einfache Auswertung von Microarray Daten mit dem Fokus auf selbst entwickelten Microarray Designs. ICEP diente für die statistische und bioinformatische Analyse von selbst entwickelten IronChips, kann aber auch leicht an die Analyse von oligonucleotidbasierten oder cDNA Microarrays adaptiert werden. ICEP nutzt einen Entscheidungsbaum-basierten Algorithmus um die Qualität zu bewerten und führt eine robuste Analyse basierend auf Chipeigenschaften, wie mehrfachen Wiederholungen, Signal/Rausch Verhältnis, Hintergrund und Negativkontrollen durch

Online-Publikations-Server der Universität Würzburg

Additional file 2: Table S2. of NucTools: analysis of chromatin feature occupancy profiles from high-throughput sequencing data

Author: Karsten Rippe (22511)
Vladimir Teif (3747643)
Yevhen Vainshtein (592162)
Publication venue
Publication date
Field of study

EnrichR analysis of the enrichment of DNA sequence motifs based on TRANSFAC and JASPAR PWMs in 100-bp genomic regions which lost nucleosomes in MEFs. (PDF 29 kb

FigShare

Additional file 1: Table S1. of NucTools: analysis of chromatin feature occupancy profiles from high-throughput sequencing data

Author: Karsten Rippe (22511)
Vladimir Teif (3747643)
Yevhen Vainshtein (592162)
Publication venue
Publication date
Field of study

EnrichR analysis of the enrichment of DNA sequence motifs based on TRANSFAC and JASPAR PWMs in 100-bp genomic regions which gained nucleosomes in MEFs. (PDF 29 kb

FigShare

Genetic repertoires of anaerobic microbiomes driving generation of biogas

Author: Anja Grohmann
Christian Grumaz
Dieter Bryniok
Ellen Euchner
Kai Sohn
Ralf Rabus
Yevhen Vainshtein
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2018
Field of study

Abstract Background Biogas production is an attractive technology for a sustainable generation of renewable energy. Although the microbial community is fundamental for such production, the process control is still limited to technological and chemical parameters. Currently, most of the efforts on microbial management system (MiMaS) are focused on process-specific marker species and community dynamics, but a practical implementation is in its infancy. The high number of unknown and uncharacterized microorganisms in general is one of the reasons hindering further advancements. Results A Biogas Metagenomics Hybrid Assembly (BioMETHA) database, derived from microbiomes of biogas plants, was generated using a dedicated assembly strategy for different metagenomic datasets. Long reads from nanopore sequencing (MinION) were combined with short, more accurate second-generation sequencing reads (Illumina). The hybrid assembly resulted in 231 genomic bins each representing a taxonomic unit with an average completeness of 47%. Functional annotation identified 13,190 non-redundant genes covering roughly 207 k coding sequences. Mapping rates of metagenomics DNA derived from diverse biogas plants and laboratory reactors increased up to 73%. In addition, an EC (enzyme commission) reference sequence collection (ERSC) was generated whose genes are crucial for biogas-related processes, consisting of 235 unique EC numbers organized in 52 metabolic modules. Mapping rates of metatranscriptomic data to this ERSC revealed coverages of up to 93%. Process parameters and imbalances of laboratory reactors could be reconstructed by evaluating abundance of biogas-specific metabolic modules using metatranscriptomic data derived from various fermenter systems. Conclusion This newly established metagenomic hybrid assembly in combination with an EC reference sequence collection might help to shed light on the microbial dark matter of biogas plants by contributing to the development of a reference for biogas plant microbiome-specific gene sequences. Considering a biogas microbiome as a complex meta-organism expressing a meta-transcriptome, the approach established here could lay the foundation for a function-based microbial management system

Directory of Open Access Journals

Fraunhofer-ePrints