17 research outputs found

MDAT- Aligning multiple domain arrangements

Author: Bitard-Feildel T. (Tristan)
Bornberg-Bauer E. (Erich)
Kemena C. (Carsten)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/01/2015

Background: Proteins are composed of domains, protein segments that fold independently from the rest of the protein and have a specific function. During evolution the arrangement of domains can change: domains are gained, lost or their order is rearranged. To facilitate the analysis of these changes we propose the use of multiple domain alignments. Results: We developed an alignment program, called MDAT, which aligns multiple domain arrangements. MDAT extends earlier programs which perform pairwise alignments of domain arrangements. MDAT uses a domain similarity matrix to score domain pairs and aligns the domain arrangements using a consistency supported progressive alignment method. Conclusion: MDAT will be useful for analysing changes in domain arrangements within and between protein families and will thus provide valuable insights into the evolution of proteins and their domains. MDAT is coded in C++, and the source code is freely available for download at http://www.bornberglab.org/pages/mda
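The pairwise step that MDAT builds on can be illustrated with a minimal sketch. It assumes a plain global (Needleman-Wunsch style) alignment over domain identifiers; the function name `align_arrangements`, the gap and mismatch penalties, and the dictionary form of the similarity matrix are hypothetical choices for illustration, not MDAT's actual implementation, and the consistency-supported progressive stage is not shown.

```python
def align_arrangements(a, b, similarity=None, gap=-1.0, mismatch=-1.0):
    """Globally align two domain arrangements (lists of domain IDs).

    `similarity` is an assumed dict mapping (domain, domain) pairs to scores;
    pairs not listed score 1.0 when identical and `mismatch` otherwise.
    Returns (aligned pairs, total score); None marks a gap.
    """
    similarity = similarity or {}

    def pair_score(x, y):
        if (x, y) in similarity:
            return similarity[(x, y)]
        if (y, x) in similarity:
            return similarity[(y, x)]
        return 1.0 if x == y else mismatch

    n, m = len(a), len(b)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    # Border cells: aligning a prefix against nothing costs one gap per domain.
    for i in range(1, n + 1):
        score[i][0] = score[i - 1][0] + gap
    for j in range(1, m + 1):
        score[0][j] = score[0][j - 1] + gap

    # Fill the dynamic programming table.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + pair_score(a[i - 1], b[j - 1]),
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    aligned = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and
                score[i][j] == score[i - 1][j - 1] + pair_score(a[i - 1], b[j - 1])):
            aligned.append((a[i - 1], b[j - 1]))
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            aligned.append((a[i - 1], None))
            i -= 1
        else:
            aligned.append((None, b[j - 1]))
            j -= 1
    aligned.reverse()
    return aligned, score[n][m]
```

For example, aligning the made-up arrangements `["PF1", "PF2", "PF3"]` and `["PF1", "PF3"]` with the default penalties pairs the two shared domains and places a gap opposite `PF2`, which is how a domain loss or gain would show up in such an alignment.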

Springer - Publisher Connector

PubMed Central

Münstersches Informations und Archivsystem für Multimediale Inhalte

Domain similarity based orthology detection

Author: Bitard-Feildel T. (Tristan)
Bornberg-Bauer E. (Erich)
Greenwood J.M. (Jenny)
Kemena C. (Carsten)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/05/2015

Background: Orthologous protein detection software mostly uses pairwise comparisons of amino-acid sequences to assert whether two proteins are orthologous or not. Accordingly, when the number of sequences for comparison increases, the number of comparisons to compute grows in a quadratic order. A current challenge of bioinformatic research, especially when taking into account the increasing number of sequenced organisms available, is to make this ever-growing number of comparisons computationally feasible in a reasonable amount of time. We propose to speed up the detection of orthologous proteins by using strings of domains to characterize the proteins. Results: We present two new protein similarity measures, a cosine and a maximal weight matching score based on domain content similarity, and new software, named porthoDom. The qualities of the cosine and the maximal weight matching similarity measures are compared against curated datasets. The measures show that domain content similarities are able to correctly group proteins into their families. Accordingly, the cosine similarity measure is used inside porthoDom, the wrapper developed for proteinortho. porthoDom makes use of domain content similarity measures to group proteins together before searching for orthologs. By using domains instead of amino acid sequences, the reduction of the search space decreases the computational complexity of an all-against-all sequence comparison. Conclusion: We demonstrate that representing and comparing proteins as strings of discrete domains, i.e. as a concatenation of their unique identifiers, allows a drastic simplification of search space. porthoDom has the advantage of speeding up orthology detection while maintaining a degree of accuracy similar to proteinortho. The implementation of porthoDom is released using Python and C++ languages and is available under the GNU GPL licence 3 at http://www.bornberglab.org/pages/porthoda.
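The cosine measure on domain content can be sketched as follows. The abstract does not give the exact weighting porthoDom uses, so this sketch assumes raw domain counts as vector components; the function name `domain_cosine` and the example identifiers are hypothetical.

```python
from collections import Counter
from math import sqrt


def domain_cosine(arrangement_a, arrangement_b):
    """Cosine similarity between two proteins given as lists of domain IDs.

    Assumption: each protein is represented by a vector of raw domain counts.
    """
    counts_a = Counter(arrangement_a)
    counts_b = Counter(arrangement_b)
    # Only domains present in both proteins contribute to the dot product.
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[d] * counts_b[d] for d in shared)
    norm_a = sqrt(sum(c * c for c in counts_a.values()))
    norm_b = sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a protein without annotated domains matches nothing
    return dot / (norm_a * norm_b)
```

Because the comparison works on short lists of identifiers rather than residue sequences, a score like this could serve as a cheap pre-filter that groups proteins before a more expensive sequence-based orthology search.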

Springer - Publisher Connector

PubMed Central

Münstersches Informations und Archivsystem für Multimediale Inhalte

MDAT- Aligning multiple domain arrangements

Author: AD Moore
AR Kersting
B Paten
C Notredame
Carsten Kemena
CO Buckee
D Ekman
DG Higgins
E Bornberg-Bauer
Erich Bornberg-Bauer
F Sievers
H Fang
JA Marsh
JD Thompson
JS Papadopoulos
K Forslund
K Katoh
L Leclère
LA Ait
LY Geer
M Levitt
M Punta
MOSRM Dayhoff
N Terrapon
O Gotoh
RA de Maagd
RD Finn
RD Finn
S Henikoff
SR Eddy
Söding J
T Kawashima
Tristan Bitard-Feildel
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Evolution of protein domain architectures

Author: A Heger
A Marchler-Bauer
A Nagy
A Nagy
A Nagy
A Nasir
A Rijk van
A Rzhetsky
A-L Barabási
AD Moore
AD Moore
AD Moore
AH Brivanlou
AR Kersting
B Lee
B Snel
C Bru
C Chothia
C Feschotte
C Haider
C Vogel
C Vogel
C-H Hsu
C-H Hsu
CM Zmasek
D Ekman
D Wilson
DP Syamaladevi
E Bornberg-Bauer
E Dohmen
E Gogvadze
E Nimwegen van
EE Schmidt
EM Marcotte
EV Koonin
G Apic
G Apic
GP Karev
H Tordai
I Cohen-Gihon
I Letunic
I Yanai
J Gough
J Qian
J Weiner
J Weiner
J Weiner III
J Wiedenhoeft
J-M Chandonia
JAG Ranea
JH Fong
JM Eirin-Lopez
JP Demuth
JS Farris
K Forslund
L Grassi
L Leclère
L Li
L Patthy
LY Geer
M Bashton
M Buljan
M Buljan
M d C Orozco-Mosqueda
M Itoh
M Liu
M Sharma
M Stolzer
M Toll-Riera
MA Huynen
MK Basu
MK Basu
N Terrapon
N Vera-Parra
NC Brissett
NL Dawson
NM Luscombe
R Cordaux
RD Finn
RD Finn
RF Doolittle
S Wuchty
S Yang
SD Lam
SK Kummerfeld
SK Kummerfeld
T Bitard-Feildel
T Doğan
T Koestler
T Przytycka
TE Lewis
UniProt Consortium
V Hollich
VA Kuznetsov
W-D Heyer
X Xie
X-C Zhang
Y-C Wu
ÅK Björklund
ÅK Björklund
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

This chapter reviews current research on how protein domain architectures evolve. We begin by summarizing work on the phylogenetic distribution of proteins, as this will directly impact which domain architectures can be formed in different species. Studies relating domain family size to occurrence have shown that they generally follow power law distributions, both within genomes and larger evolutionary groups. These findings were subsequently extended to multi-domain architectures. Genome evolution models that have been suggested to explain the shape of these distributions are reviewed, as well as evidence for selective pressure to expand certain domain families more than others. Each domain has an intrinsic combinatorial propensity, and the effects of this have been studied using measures of domain versatility or promiscuity. Next, we study the principles of protein domain architecture evolution and how these have been inferred from distributions of extant domain arrangements. Following this, we review inferences of ancestral domain architecture and the conclusions concerning domain architecture evolution mechanisms that can be drawn from these. Finally, we examine whether all known cases of a given domain architecture can be assumed to have a single common origin (monophyly) or have evolved convergently (polyphyly). We end by a discussion of some available tools for computational analysis or exploitation of protein domain architectures and their evolution

Crossref

MDC Repository

Critical assessment of protein intrinsic disorder prediction

Author: Aykac-Fas Burcu
Bassot Claudio
Benítez Guillermo Ignacio
Bevilacqua Martina
Bitard-Feildel Tristan
Caid Predictors
Callebaut Isabelle
Chasapi Anastasia
Chemes Lucia Beatriz
Cheng Jianlin
Cozzetto Domenico
Davey Norman
Davidović Radoslav
Disprot Curators
Dosztányi Zsuzsanna
Dunker A. Keith
Elofsson Arne
Erdős Gábor
Galzitskaya Oxana Valerianovna
Gao Jianzhao
González-Foutel Nicolás S.
Govindarajan Sudha
Gsponer Jörg
Guharoy Mainak
Hajdu-Soltész Borbála
Hanson Jack
Hatos András
Hoque Md Tamjidul
Horvath Tamas
Hu Gang
Iglesias Valentin
Iqbal Sumaiya
Jones David T.
Kajava Andrey V.
Kovacs Orsolya Panna
Kurgan Lukasz
Lamb John
Lambrughi Matteo
Lazar Tamas
Leclercq Jeremy Y.
Leonardi Emanuela
Litfin Thomas
Lobanov Michail Yu
Macedo-Ribeiro Sandra
Macossay-Castillo Mauricio
Maiani Emiliano
Malhis Nawar
Manso Jose Antonio
Marino-Buslje Cristina
Martínez-Pérez Elizabeth
Meng Fanchi
Minervini Giovanni
Mirabello Claudio
Mičetić Ivan
Monzon Alexander Miguel
Murvai Nikoletta
Mészáros Bálint
Necci Marco
Orlando Gabriele
Ouzounis Christos
Pajkos Mátyás
Paladin Lisanna
Paliwal Kuldip
Palopoli Nicolás
Pancsa Rita
Papaleo Elena
Parisi Gustavo
Peng Zhenling
Pereira Pedro José Barbosa
Piovesan Damiano
Promponas Vasilis J.
Pujols Jordi
Quaglia Federica
Raimondi Daniele
Salvatore Marco
Schad Eva
Sharma Alok
Sharma Ronesh
Sormanni Pietro
Szabo Beata
Szaniszló Tamás
Tamana Stella
Tantos Agnes
Tompa Peter
Tosatto Silvio C. E.
Veljkovic Nevena
Vendruscolo Michele
Ventura Salvador
Vranken Wim
Wallner Björn
Walsh Ian
Wang Chen
Wang Kui
Wang Sheng
Wu Tianqi
Wu Zhonghua
Xu Jinbo
Yan Jing
Zhou Yaoqi
Álvarez Lucía
Publication venue: Nature Methods
Publication date: 01/01/2021

Abstract: Intrinsically disordered proteins, defying the traditional protein structure–function paradigm, are a challenge to study experimentally. Because a large part of our knowledge rests on computational predictions, it is crucial that their accuracy is high. The Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment was established as a community-based blind test to determine the state of the art in prediction of intrinsically disordered regions and the subset of residues involved in binding. A total of 43 methods were evaluated on a dataset of 646 proteins from DisProt. The best methods use deep learning techniques and notably outperform physicochemical methods. The top disorder predictor has Fmax = 0.483 on the full dataset and Fmax = 0.792 following filtering out of bona fide structured regions. Disordered binding regions remain hard to predict, with Fmax = 0.231. Interestingly, computing times among methods can vary by up to four orders of magnitude
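The Fmax values quoted above can be read with the following minimal sketch, which assumes the common definition of Fmax as the maximum F1 score (harmonic mean of precision and recall) over all score thresholds. The function name `f_max` and the toy data are hypothetical, and the exact CAID evaluation protocol is not described in the abstract.

```python
def f_max(scores, labels, thresholds=None):
    """Maximum F1 over thresholds for per-residue predictions.

    scores: predicted disorder propensities, one per residue.
    labels: 1 for a disordered residue, 0 for an ordered one.
    Assumption: a residue is predicted disordered when score >= threshold.
    """
    if thresholds is None:
        thresholds = sorted(set(scores))
    best = 0.0
    for t in thresholds:
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 0)
        fn = sum(1 for s, y in zip(scores, labels) if s < t and y == 1)
        if tp == 0:
            continue  # precision and recall are both zero or undefined
        precision = tp / (tp + fp)
        recall = tp / (tp + fn)
        best = max(best, 2 * precision * recall / (precision + recall))
    return best
```

Under this definition, a predictor that ranks every disordered residue above every ordered one reaches Fmax = 1.0, so values such as 0.483 or 0.231 indicate that no single threshold gives both high precision and high recall.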

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

CONICET Digital

HAL-IRD

Diposit Digital de Documents de la UAB

Apollo (Cambridge)

Improving pairwise comparison of protein sequences with domain co-occurrence

Author: A Ghouila
A Heger
A Ochoa
A Ochoa
A Prakash
BE Suzek
C Vogel
Christophe Menichelli
CM Zmasek
DA Triant
E Bornberg-Bauer
F Servant
GM Boratyn
I Callebaut
J Bernardes
J Soding
JC Wootton
JS Bernardes
KA Dill
Laurent Bréhélin
M Ashburner
M Gouy
N Terrapon
N Terrapon
Olivier Gascuel
PJ Keeling
R Durbin
RC Edgar
RD Finn
S Altschul
Scott Markel
SF Altschul
SR Eddy
T Bitard-Feildel
WR Pearson
Y Ye
Z Zhang
Publication venue: 'Public Library of Science (PLoS)'

Crossref

Biochemical characterization of a glycosyltransferase Gtf3 from Mycobacterium smegmatis: a case study of improved protein solubilization

Author: AM Mulichak
AM Mulichak
B Ren
C Breton
C Deshayes
C Klammt
CL Young
D Jeevarajah
D Matsui
D Veesler
E Bramucci
EA Isiorho
EA Isiorho
F Zhu
G Sulzenbacher
GL Rosano
J Schmid
JS Schorey
K Guild
KM Thayer
LA Kelley
M Biasini
M Dolzan
M Kushwaha
M Vandermies
MC Deller
NW Warne
P Lieutaud
PJ Brennan
PK Qasba
R Mukherjee
R Vincentelli
RR Burgess
RW Gantt
T Bitard-Feildel
T Sengoku
UK Laemmli
V Lombard
Y Miyamoto
Y-L Chen
ZM Ali
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Torches, Candles, Lamps, Lanterns, Flashlights, Spotlights, Night Vision Goggles… You Need Them All to See in Darkness

Articles assembled in the second part of this Special Issue describe some experimental and computational approaches for the structural and functional characterization of intrinsically disordered proteins. Since these tools represent specialized gear for the focused analysis of various aspects of dark proteome, they can be viewed as torches, candles, lamps, lanterns, flashlights, spotlights, night vision goggles, and other means needed to see in darkness

Crossref

USFSP Digital Archive

Scholar Commons - University of South Florida

Fact or fiction: updates on how protein-coding genes might emerge de novo from previously non-coding DNA

Crossref

Critical assessment of protein intrinsic disorder prediction

Author: Alvarez L.
Aykac-Fas B.
Bassot C.
Benitez G. I.
Bevilacqua M.
Bitard-Feildel T.
Callebaut I.
Chasapi A.
Chemes L. B.
Cheng J.
Cozzetto D.
Davey N.
Davidovic R.
Dosztanyi Z.
Dunker A. K.
Elofsson A.
Erdos G.
Galzitskaya O. V.
Gao J.
Gonzalez-Foutel N. S.
Govindarajan S.
Gsponer J.
Guharoy M.
Hajdu-Soltesz B.
Hanson J.
Hatos A.
Hoque M. T.
Horvath T.
Hu G.
Iglesias V.
Iqbal S.
Jones D. T.
Kajava A. V.
Kovacs O. P.
Kurgan L.
Lamb J.
Lambrughi M.
Lazar T.
Leclercq J. Y.
Leonardi E.
Litfin T.
Lobanov M. Y.
Macedo-Ribeiro S.
Macossay-Castillo M.
Maiani E.
Malhis N.
Manso J. A.
Marino-Buslje C.
Martinez-Perez E.
Meng F.
Meszaros B.
Micetic I.
Minervini G.
Mirabello C.
Monzon A. M.
Murvai N.
Necci M.
Orlando G.
Ouzounis C.
Pajkos M.
Paladin L.
Paliwal K.
Palopoli N.
Pancsa R.
Papaleo E.
Parisi G.
Peng Z.
Pereira P. J. B.
Piovesan D.
Promponas V. J.
Pujols J.
Quaglia F.
Raimondi D.
Salvatore M.
Schad E.
Sharma A.
Sharma R.
Sormanni P.
Szabo B.
Szaniszlo T.
Tamana S.
Tantos A.
Tompa P.
Tosatto S. C. E.
Veljkovic N.
Vendruscolo M.
Ventura S.
Vranken W.
Wallner B.
Walsh I.
Wang C.
Wang K.
Wang S.
Wu T.
Wu Z.
Xu J.
Yan J.
Zhou Y.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021

Intrinsically disordered proteins, defying the traditional protein structure–function paradigm, are a challenge to study experimentally. Because a large part of our knowledge rests on computational predictions, it is crucial that their accuracy is high. The Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment was established as a community-based blind test to determine the state of the art in prediction of intrinsically disordered regions and the subset of residues involved in binding. A total of 43 methods were evaluated on a dataset of 646 proteins from DisProt. The best methods use deep learning techniques and notably outperform physicochemical methods. The top disorder predictor has Fmax = 0.483 on the full dataset and Fmax = 0.792 following filtering out of bona fide structured regions. Disordered binding regions remain hard to predict, with Fmax = 0.231. Interestingly, computing times among methods can vary by up to four orders of magnitude

Diposit Digital de Documents de la UAB

Archivio istituzionale della ricerca - Università di Padova