Benchmarks in antimicrobial peptide prediction are biased due to the selection of negative data

Author: Bakala Laura
Burdukiewicz Michal
Cooke Ira R.
Fingerhut Legana C.H.W.
Gagat Przemyslaw
Kala Jakub
Kolenda Rafal
Mackiewicz Pawel
Pietluch Filip
Rafacz Dominik
Rodiger Stefan
Sidorczuk Katarzyna
Slowik Jadwiga
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2022

Antimicrobial peptides (AMPs) are a heterogeneous group of short polypeptides that target not only microorganisms but also viruses and cancer cells. Because they select for resistance less strongly than traditional antibiotics, AMPs have been attracting ever-growing attention from researchers, including bioinformaticians. Machine learning represents the most cost-effective method for novel AMP discovery, and consequently many computational tools for AMP prediction have recently been developed. In this article, we investigate the impact of negative data sampling on model performance and benchmarking. We generated 660 predictive models using 12 machine learning architectures, a single positive data set and 11 negative data sampling methods; the architectures and methods were defined on the basis of published AMP prediction software. Our results clearly indicate that similar training and benchmark data sets, i.e. those produced by the same or a similar negative data sampling method, positively affect model performance. Consequently, all the benchmark analyses that have been performed for AMP prediction models are significantly biased and, moreover, we do not know which model is the most accurate. To provide researchers with reliable information about the performance of AMP predictors, we also created a web server, AMPBenchmark, for fair model benchmarking. AMPBenchmark is available at http://BioGenies.info/AMPBenchmark.

ResearchOnline at James Cook University

PubMed Central

Diposit Digital de Documents de la UAB

AmyloGraph: a comprehensive database of amyloid-amyloid interactions

Author: Barbach Agnieszka
Burdukiewicz Michał
Bąkała Laura
Chilimoniuk Jarosław
Gąsior-Głogowska Marlena
Hubicka Katarzyna
Jęśkowiak Izabela
Kotulska Małgorzata
Kozakiewicz Dominika
Lassota Anna
Rafacz Dominik
Stecko Jakub
Szulc Natalia
Szymańska Natalia
Wojciechowski Jakub W
Publication date: 01/01/2022

Information about the impact of interactions between amyloid proteins on their fibrillization propensity is scattered among many experimental articles and presented in unstructured form. We manually curated information located in almost 200 publications (selected out of 562 initially considered), obtaining details of 883 experimentally studied interactions between 46 amyloid proteins or peptides. We also proposed a novel standardized terminology for the description of amyloid-amyloid interactions, which is included in our database and covers all currently known types of such cross-talk, including inhibition of fibrillization, cross-seeding and other phenomena. The new approach allows for more specific studies on amyloids and their interactions by providing very well-defined data. AmyloGraph, an online database presenting information on amyloid-amyloid interactions, is available online. Its functionalities are also accessible as an R package. AmyloGraph is the only publicly available repository for experimentally determined amyloid-amyloid interactions.

Diposit Digital de Documents de la UAB

Conference Report: Why R? 2019

Author: Burdukiewicz Michal
Chilimoniuk Jaroslaw
Jessen Leon Eyrich
Kosinski Marcin
Pietluch Filip
Rafacz Dominik
Roediger Stefan
Sidorczuk Katarzyna
Wojcik Piotr
Publication date: 01/01/2020

Online Research Database In Technology