88 research outputs found
Structuprint: a scalable and extensible tool for two-dimensional representation of protein surfaces
© 2016 Kontopoulos et al. Background: The term molecular cartography encompasses a family of computational methods for two-dimensional transformation of protein structures and analysis of their physicochemical properties. The underlying algorithms comprise multiple manual steps, whereas the few existing implementations typically restrict the user to a very limited set of molecular descriptors. Results: We present Structuprint, a free standalone software tool that fully automates the rendering of protein surface maps, given - at the very least - a directory with a PDB file and an amino acid property. The tool comes with a default database of 328 descriptors, which can be extended or substituted by user-provided ones. The core algorithm generates a mould of the protein surface, which is subsequently converted to a sphere and mapped to two dimensions using the Miller cylindrical projection. Structuprint is partly optimized for multicore computers, making the rendering of animations of entire molecular dynamics simulations feasible. Conclusions: Structuprint is an efficient application implementing a molecular cartography algorithm for protein surfaces. According to the results of a benchmark, its memory requirements and execution time are reasonable, allowing it to run even on low-end personal computers. We believe that it will be of use - primarily but not exclusively - to structural biologists and computational biochemists.
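The final sphere-to-plane step named in the abstract, the Miller cylindrical projection, is a standard cartographic formula: x = λ and y = (5/4)·ln(tan(π/4 + 2φ/5)). A minimal sketch (this helper is hypothetical and not Structuprint's code):

```python
import math

def miller_projection(lon_deg, lat_deg):
    """Map spherical coordinates (degrees) to 2D plane coordinates
    using the Miller cylindrical projection, the same projection the
    abstract names for Structuprint's sphere-to-plane step.
    Illustrative helper only, not taken from the tool itself."""
    lam = math.radians(lon_deg)
    phi = math.radians(lat_deg)
    x = lam                                            # longitude is preserved linearly
    y = 1.25 * math.log(math.tan(math.pi / 4 + 0.4 * phi))
    return x, y

# The equator maps to y = 0, and the projection is symmetric in latitude.
x0, y0 = miller_projection(0.0, 0.0)
```

Unlike the plain Mercator projection, Miller's y stays finite at the poles, which is why it suits mapping an entire closed surface.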
Protein signatures using electrostatic molecular surfaces in harmonic space
We developed a novel method based on the Fourier analysis of protein molecular surfaces to speed up the analysis of the vast structural data generated in the post-genomic era. This method computes the power spectrum of surfaces of the molecular electrostatic potential, whose three-dimensional coordinates have been either experimentally or theoretically determined. We thus reduce the initial three-dimensional information on the molecular surface to one-dimensional information on pairs of points a fixed scale apart. Consequently, the similarity search in our method is computationally less demanding and significantly faster than shape-comparison methods. As proof of principle, we applied our method to a training set of viral proteins that are involved in major diseases such as Hepatitis C, Dengue fever, Yellow fever, Bovine viral diarrhea and West Nile fever. The training set contains proteins of four different protein families, as well as a representative mammalian enzyme. We found that the power spectrum successfully assigns a unique signature to each protein included in our training set, thus providing a direct probe of functional similarity among proteins. The results agree with established biological data from conventional structural biochemistry analyses. (9 pages, 10 figures; published in PeerJ, 2013: https://peerj.com/articles/185)
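The reduction described above rests on the discrete Fourier power spectrum of a sampled signal. A minimal from-scratch sketch, with electrostatic potential values along a surface path standing in as the signal (an illustrative toy, not the paper's pipeline):

```python
import cmath
import math

def power_spectrum(samples):
    """Discrete Fourier power spectrum of a real-valued signal.
    Here the samples stand in for electrostatic potential values
    sampled along a path on the molecular surface (illustrative
    only; the published method defines its own sampling scheme)."""
    n = len(samples)
    spectrum = []
    for k in range(n):
        coeff = sum(samples[j] * cmath.exp(-2j * cmath.pi * k * j / n)
                    for j in range(n))
        spectrum.append(abs(coeff) ** 2 / n)   # power at frequency k
    return spectrum

# A pure cosine concentrates its power at one frequency (and its mirror).
signal = [math.cos(2 * math.pi * 3 * t / 32) for t in range(32)]
ps = power_spectrum(signal)
```

Two signals can then be compared through their 1D spectra instead of their full 3D shapes, which is the source of the speed-up the abstract claims.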
AmalgamScope: merging annotations data across the human genome
The past years have seen enormous advances in sequencing and array-based technologies, producing supplementary or alternative views of the genome stored in various formats and databases. Their sheer volume and differing data scope pose a challenge to jointly visualizing and integrating diverse data types. We present AmalgamScope, a new interactive software tool that assists scientists with the annotation of the human genome and, in particular, the integration of annotation files from multiple data types, using gene identifiers and genomic coordinates. Supported platforms include next-generation sequencing and microarray technologies. The available features of AmalgamScope range from the annotation of diverse data types across the human genome to integration of the data based on the annotation information and visualization of the merged files within chromosomal regions or the whole genome. Additionally, users can define custom transcriptome library files for any species and use the tool's remote file-exchange server options.
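Integration keyed on gene identifiers, as described above, amounts to joining per-gene records from several annotation tracks. A minimal sketch with dictionaries (the field names and structure are hypothetical; AmalgamScope's actual file formats differ):

```python
def merge_annotations(*tracks):
    """Merge several annotation dicts, each keyed by gene identifier,
    into one record per gene. Mimics the identifier-based integration
    step described above; the structure is a hypothetical stand-in
    for AmalgamScope's real annotation file formats."""
    merged = {}
    for track in tracks:
        for gene_id, fields in track.items():
            # Accumulate fields from every track under the same gene key.
            merged.setdefault(gene_id, {}).update(fields)
    return merged

# Toy tracks: one from sequencing, one from a microarray platform.
ngs = {"BRCA1": {"coverage": 42}, "TP53": {"coverage": 17}}
array = {"BRCA1": {"log2fc": 1.3}}
combined = merge_annotations(ngs, array)
```

Genes present in only one track keep their partial record, so downstream visualization can still place them on the genome.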
3D structural analysis of proteins using electrostatic surfaces based on image segmentation
Herein, we present a novel strategy to analyse and characterize proteins using protein molecular electrostatic surfaces. Our approach starts by calculating a series of distinct molecular surfaces for each protein that are subsequently flattened out, thus reducing 3D information noise. RGB images are appropriately scaled by means of standard image processing techniques whilst retaining the weight information of each protein's molecular electrostatic surface. Then homogeneous areas in the protein surface are estimated based on unsupervised clustering of the 3D images, while performing similarity searches. This is a computationally fast approach, which efficiently highlights interesting structural areas among a group of proteins. Multiple protein electrostatic surfaces can be combined together and, in conjunction with their processed images, they can provide the starting material for protein structural similarity and molecular docking experiments.
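The "homogeneous areas via unsupervised clustering" step can be illustrated with the simplest such algorithm, k-means on scalar pixel intensities. This is a generic sketch of the technique class, not the paper's actual pipeline or parameters:

```python
def kmeans_1d(values, k=2, iters=20):
    """Tiny k-means on scalar pixel intensities, illustrating the
    kind of unsupervised clustering that groups homogeneous surface
    regions. Generic sketch only; the published strategy clusters
    full RGB images, not 1D intensities."""
    # Seed with the extreme values for k=2, else the first k samples.
    centroids = [min(values), max(values)] if k == 2 else list(values[:k])
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in values:
            idx = min(range(k), key=lambda i: abs(v - centroids[i]))
            clusters[idx].append(v)
        # Recompute each centroid as its cluster mean; keep empty clusters fixed.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids

# Two clearly separated intensity groups recover two centroids.
cents = sorted(kmeans_1d([10, 12, 11, 200, 205, 198], k=2))
```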
Outcome prediction based on microarray analysis: a critical perspective on methods
Background: Information extraction from microarrays has not yet been widely used in diagnostic or prognostic decision-support systems, owing to the diversity of results produced by the available techniques, their instability on different data sets and the inability to relate statistical significance to biological relevance. There is therefore an urgent need to address the statistical framework of microarray analysis and identify its drawbacks and limitations, which will enable us to compare methodologies thoroughly under the same experimental set-up and associate results with confidence intervals meaningful to clinicians. In this study we consider gene-selection algorithms with the aim of revealing inefficiencies in performance evaluation and addressing aspects that can reduce uncertainty in algorithmic validation. Results: A computational study is performed on the performance of several gene-selection methodologies using publicly available microarray data. Three basic types of experimental scenario are evaluated: the independent test-set, and 10-fold cross-validation (CV) using maximum and average performance measures. Feature-selection methods behave differently under different validation strategies. The performance results from CV do not match those from the independent test-set well, except for the support vector machine (SVM) and least-squares SVM methods. However, these wrapper methods achieve variable (often low) performance, whereas the hybrid methods attain consistently higher accuracies. The use of an independent test-set within CV is important for evaluating the predictive power of algorithms. The optimal size of the selected gene-set also appears to depend on the evaluation scheme. The consistency of selected genes over variation of the training set is another aspect important in reducing uncertainty in the evaluation of the derived gene signature. In all cases the presence of outlier samples can seriously affect algorithmic performance. Conclusion: Multiple parameters can influence the selection of a gene signature and its predictive power, so possible biases in validation methods must always be accounted for. This paper illustrates that independent test-set evaluation reduces the bias of CV, and that case-specific measures reveal stability characteristics of the gene signature over changes of the training set. Moreover, frequency measures on gene selection address the algorithmic consistency in selecting the same gene signature under different training conditions. These issues contribute to the development of an objective evaluation framework and aid the derivation of statistically consistent gene signatures that could eventually be correlated with biological relevance. The benefits of the proposed framework are supported by the evaluation results and methodological comparisons performed for several gene-selection algorithms on three publicly available datasets.
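The 10-fold cross-validation scheme compared in the study partitions the samples into ten disjoint test folds, training on the remainder each time. A minimal from-scratch sketch of the index bookkeeping (not taken from the paper, and deliberately avoiding any particular ML library):

```python
def ten_fold_indices(n_samples, n_folds=10):
    """Yield (train, test) index lists for k-fold cross-validation,
    the evaluation scheme the study compares against an independent
    test-set. From-scratch illustrative sketch; no shuffling or
    stratification, which a real evaluation would add."""
    # Distribute the remainder so fold sizes differ by at most one.
    fold_sizes = [n_samples // n_folds + (1 if i < n_samples % n_folds else 0)
                  for i in range(n_folds)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size

folds = list(ten_fold_indices(25))
```

Every sample appears in exactly one test fold, so averaging per-fold accuracy uses each sample once, unlike repeated random splits.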
On a meaningful integration of web services in data-intensive biomedical environments: The DICODE approach
This paper reports on an innovative approach that aims to reduce information management costs in data-intensive and cognitively complex biomedical environments. Recognizing the importance of prominent high-performance computing paradigms and large-scale data processing technologies, as well as collaboration support systems, in remedying data-intensive issues, it adopts a hybrid approach built on the synergy of these technologies. The proposed approach provides innovative Web-based workbenches that integrate and orchestrate a set of interoperable services, reducing data-intensiveness and complexity overload at critical decision points to a manageable level, thus permitting stakeholders to be more productive and to concentrate on creative activities.
Ordered weighted average based grouping of nanomaterials with Arsinh and dose response similarity models
Environmental Biology
Erratum to: Structuprint: a scalable and extensible tool for two-dimensional representation of protein surfaces
Data S1: Original datasets, raw data results, summary file results for each dataset separated in folders
Deliverable Report D4.6: Tools for generating QMRF and QPRF reports
Scientific reports carry significant importance for the straightforward and effective transfer of knowledge, results and ideas. Good practice dictates that reports should be well structured and concise. This deliverable describes the reporting services for models, predictions and validation tasks that have been integrated within the eNanoMapper (eNM) modelling infrastructure. Validation services have been added to the Jaqpot Quattro (JQ) modelling platform and the nano-lazar read-across framework developed within WP4 to support eNM modelling activities. Moreover, we have proceeded with the development of reporting services for predictions and models, namely QPRF and QMRF reports respectively. In this deliverable, we therefore first describe the three validation schemes created, namely training-set split, cross-validation and external validation, in detail, and demonstrate their functionality at both API and UI levels. We then proceed with the description of the read-across functionalities and, finally, present and describe the QPRF and QMRF reporting services.
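The first of the three validation schemes named above, a training-set split, simply partitions the dataset at random into training and held-out portions. A generic sketch of the idea (this is not the Jaqpot Quattro API, whose endpoints and parameters differ):

```python
import random

def train_test_split_ids(ids, train_fraction=0.8, seed=0):
    """Random training-set split: shuffle the sample identifiers and
    cut at the requested fraction. Generic illustration of the first
    validation scheme described above, not Jaqpot Quattro code."""
    rng = random.Random(seed)           # fixed seed for reproducible splits
    shuffled = list(ids)
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

train, test = train_test_split_ids(range(10))
```

Cross-validation and external validation then differ only in where the held-out data comes from: rotated folds of the same set, or an entirely separate dataset.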