Search CORE

12 research outputs found

Syncopy - Systems Neuroscience Computing in Python

Author: Mönke G.
Publication venue
Publication date: 01/01/2022
Field of study

Syncopy (www.syncopy.org) is aimed to be a completely open source, user-friendly yet powerful data analysis suite for the Neurosciences. It is developed in Python and makes extensive use of distributed computing via Dask, and achieves low memory footprints by using on-disc hdf5 data structures in the backend per default. For our users, we supply highly abstracted frontend functions, which allow using the same analysis code irrespective of whether the code is run on their local machines or on an HPC cluster. We aim to interface with existing data formats (e.g. Neurodata Without Borders, NWB) and community tools (e.g. Fieldtrip), and foster reproducibility by creating and preserving a lot of meta-information during processing

MPG.PuRe

De-Novo Discovery of Differentially Abundant Transcription Factor Binding Sites Including Their Positional Preference

Author: AD Smith
AM Benotmane
C Linhart
CE Lawrence
CT Harbison
DJ Galas
DJ Lockhart
DJC MacKay
DS Johnson
E Redhead
E Wingender
G Mönke
G Pavesi
GA Wray
GK Sandve
H Wettig
Harmen J. Bussemaker
HM Wallach
IA Paponov
Ivan A. Paponov
Ivo Grosse
J Cerquides
J Davis
J Wu
Jan Grau
JC Bryne
JD Hughes
Jens Keilwagen
LM Hellman
LV Sun
M Tompa
Marc Strickert
NK Kim
O Elemento
S Sonnenburg
S Sonnenburg
Stefan Posch
T Ulmasov
T Ulmasov
TD Schneider
TJ Guilfoyle
TL Bailey
V Matys
VV Raghavan
W Ao
W Thompson
WA Thompson
WD Teale
Publication venue: Public Library of Science
Publication date: 10/02/2011
Field of study

Transcription factors are a main component of gene regulation as they activate or repress gene expression by binding to specific binding sites in promoters. The de-novo discovery of transcription factor binding sites in target regions obtained by wet-lab experiments is a challenging problem in computational biology, which has not been fully solved yet. Here, we present a de-novo motif discovery tool called Dispom for finding differentially abundant transcription factor binding sites that models existing positional preferences of binding sites and adjusts the length of the motif in the learning process. Evaluating Dispom, we find that its prediction performance is superior to existing tools for de-novo motif discovery for 18 benchmark data sets with planted binding sites, and for a metazoan compendium based on experimental data from micro-array, ChIP-chip, ChIP-DSL, and DamID as well as Gene Ontology data. Finally, we apply Dispom to find binding sites differentially abundant in promoters of auxin-responsive genes extracted from Arabidopsis thaliana microarray data, and we find a motif that can be interpreted as a refined auxin responsive element predominately positioned in the 250-bp region upstream of the transcription start site. Using an independent data set of auxin-responsive genes, we find in genome-wide predictions that the refined motif is more specific for auxin-responsive genes than the canonical auxin-responsive element. In general, Dispom can be used to find differentially abundant motifs in sequences of any origin. However, the positional distribution learned by Dispom is especially beneficial if all sequences are aligned to some anchor point like the transcription start site in case of promoter sequences. We demonstrate that the combination of searching for differentially abundant motifs and inferring a position distribution from the data is beneficial for de-novo motif discovery. Hence, we make the tool freely available as a component of the open-source Java framework Jstacs and as a stand-alone application at http://www.jstacs.de/index.php/Dispom

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Recommended from our members

An integrated omics analysis reveals molecular mechanisms that are associated with differences in seed oil content between Glycine max and Brassica napus

Author: A Graf
A Kozomara
A Misra
AJ Crowe
AM Murad
AS Hsiao
B Chalhoub
B Li
B O'Leary
C Jako
C Jiang
C Xie
CW Min
D Klaus
D Posada
EL Sonnhammer
ER Cober
EY Hwang
F Liu
F Sun
FL Wang
G Mönke
H Erp van
H Lin
H Tan
H Vigeolas
H Wan
H Wang
HM Xu
HW Wang
I Letunic
J Ernst
J Jin
J Jin
J Mu
J Niu
J Ohlrogge
J Schmutz
J Yang
J Yu
Jim M. Dunwell
JJ Thelen
JS Thrower
K McGlew
K Oishi
K Roesler
K Wang
KO Chung
L Li
L Loic
L Wang
L Zhang
M Gargouri
M Lechner
M Li
M Miquel
M Sun
M Zhou
MA Troncoso-Ponce
MS Davis
N Wang
N Wong
P Roos-Mattjus
P Zheng
PJ Horn
QT Li
QX Song
R Angelovici
RC Edgar
RG Uhrig
RJ Weselake
RJ Weselake
RK Jain
RL Tatusov
S Anders
S Baud
S Dongen Van
S Gennidakis
S Guindon
S Killcoyne
S Kumar
S Maisonneuve
S Rawsthorne
SI Jones
SK Gidda
SW Ritchie
T Konishi
T Voelker
TL Bailey
TL Shimada
TP Durrett
W Ma
X Chen
X Dai
X Daniel
X Li
X Lu
X. Li
Y Benjamini
Y Cao
Y Kennedy
YF Liu
YK Madoka
YQ Zhang
Yuan-Ming Zhang
Z Xu
Z Yang
Zhibin Zhang
ZY Hu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2018
Field of study

Abstract Background: Rapeseed (Brassica napus L.) and soybean (Glycine max L.) seeds are rich in both protein and oil, which are major sources of biofuels and nutrition. Although the difference in seed oil content between soybean (~ 20%) and rapeseed (~ 40%) exists, little is known about its underlying molecular mechanism. Results: An integrated omics analysis was performed in soybean, rapeseed, Arabidopsis (Arabidopsis thaliana L. Heynh), and sesame (Sesamum indicum L.), based on Arabidopsis acyl-lipid metabolism- and carbon metabolism-related genes. As a result, candidate genes and their transcription factors and microRNAs, along with phylogenetic analysis and co-expression network analysis of the PEPC gene family, were found to be largely associated with the difference between the two species. First, three soybean genes (Glyma.13G148600, Glyma.13G207900 and Glyma.12G122900) co-expressed with GmPEPC1 are specifically enriched during seed storage protein accumulation stages, while the expression of BnPEPC1 is putatively inhibited by bna-miR169, and two genes BnSTKA and BnCKII are co-expressed with BnPEPC1 and are specifically associated with plant circadian rhythm, which are related to seed oil biosynthesis. Then, in de novo fatty acid synthesis there are rapeseed-specific genes encoding subunits β-CT (BnaC05g37990D) and BCCP1 (BnaA03g06000D) of heterogeneous ACCase, which could interfere with synthesis rate, and β-CT is positively regulated by four transcription factors (BnaA01g37250D, BnaA02g26190D, BnaC01g01040D and BnaC07g21470D). In triglyceride synthesis, GmLPAAT2 is putatively inhibited by three miRNAs (gma-miR171, gma-miR1516 and gma-miR5775). Finally, in rapeseed there was evidence for the expansion of gene families, CALO, OBO and STERO, related to lipid storage, and the contraction of gene families, LOX, LAH and HSI2, related to oil degradation. Conclusions: The molecular mechanisms associated with differences in seed oil content provide the basis for future breeding efforts to improve seed oil content

Central Archive at the University of Reading

Crossref

Directory of Open Access Journals

FigShare