Search CORE

32 research outputs found

A discriminative method for family-based protein remote homology detection that combines inductive logic programming and propositional models

Author: A Andreeva
A Ben-Hur
A Karwath
A Karwath
A Shah
Alessandra Carbone
B Liu
B Qian
B Webb-Robertson
C Ferreira
C Leslie
D Higgins
F Wilcoxon
G Yona
Gerson Zaverucha
H Rangwala
H Saigo
J Bernardes
J Davis
J Gough
J Quinlan
J Soeding
J Weston
Juliana S Bernardes
L De Raedt
L Dehaspe
L Liao
N Shan-Hwei
Q Dong
Q Su
R Agrawal
R Hughey
R King
R King
R Kuang
R Sadreyev
S Altschul
S Altschul
S Brenner
S Eddy
S Eddy
S Kawashima
S Lee
T Handstad
T Jaakkola
T Lingner
U Syed
V Alexandrov
V Atalay
Y Hou
Y Hou
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Remote homology detection is a hard computational problem. Most approaches have trained computational models by using either full protein sequences or multiple sequence alignments (MSA), including all positions. However, when we deal with proteins in the "twilight zone" we can observe that only some segments of sequences (motifs) are conserved. We introduce a novel logical representation that allows us to represent physico-chemical properties of sequences, conserved amino acid positions and conserved physico-chemical positions in the MSA. From this, Inductive Logic Programming (ILP) finds the most frequent patterns (motifs) and uses them to train propositional models, such as decision trees and support vector machines (SVM). Results We use the SCOP database to perform our experiments by evaluating protein recognition within the same superfamily. Our results show that our methodology when using SVM performs significantly better than some of the state of the art methods, and comparable to other. However, our method provides a comprehensible set of logical rules that can help to understand what determines a protein function. Conclusions The strategy of selecting only the most frequent patterns is effective for the remote homology detection. This is possible through a suitable first-order logical representation of homologous properties, and through a set of frequent patterns, found by an ILP system, that summarizes essential features of protein functions.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

HAL-Inserm

PubMed Central

CheS-Mapper - Chemical Space Mapping and Visualization in 3D

Author: A Maunz
Andreas Karwath
B Hardy
C Steinbeck
DH Fisher
DK Agrafiotis
E Papa
G Patlewicz
J Oksanen
JD Leeuw
JJW Sammon
KR Przybylak
L van der Maaten
M Hall
M Seeland
M Wawer
Martin Gütlein
N Jeliazkova
N O'Boyle
NL Allinger
P Langfelder
R Development Core Team
R Guha
S Dasgupta
Stefan Kramer
Susan Schiffman FWY M Lance Reynolds
T CaliÅ„ski
TA Halgren
TJ Hou
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Analyzing chemical datasets is a challenging task for scientific researchers in the field of chemoinformatics. It is important, yet difficult to understand the relationship between the structure of chemical compounds, their physico-chemical properties, and biological or toxic effects. To that respect, visualization tools can help to better comprehend the underlying correlations. Our recently developed 3D molecular viewer CheS-Mapper (Chemical Space Mapper) divides large datasets into clusters of similar compounds and consequently arranges them in 3D space, such that their spatial proximity reflects their similarity. The user can indirectly determine similarity, by selecting which features to employ in the process. The tool can use and calculate different kind of features, like structural fragments as well as quantitative chemical descriptors. These features can be highlighted within CheS-Mapper, which aids the chemist to better understand patterns and regularities and relate the observations to established scientific knowledge. As a final function, the tool can also be used to select and export specific subsets of a given dataset for further analysis

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

Directory of Open Access Journals

PubMed Central

Gutenberg Open Science

Collaborative development of predictive toxicology applications

OpenTox provides an interoperable, standards-based Framework for the support of predictive toxicology data management, algorithms, modelling, validation and reporting. It is relevant to satisfying the chemical safety assessment requirements of the REACH legislation as it supports access to experimental data, (Quantitative) Structure-Activity Relationship models, and toxicological information through an integrating platform that adheres to regulatory requirements and OECD validation principles. Initial research defined the essential components of the Framework including the approach to data access, schema and management, use of controlled vocabularies and ontologies, architecture, web service and communications protocols, and selection and integration of algorithms for predictive modelling. OpenTox provides end-user oriented tools to non-computational specialists, risk assessors, and toxicological experts in addition to Application Programming Interfaces (APIs) for developers of new applications. OpenTox actively supports public standards for data representation, interfaces, vocabularies and ontologies, Open Source approaches to core platform components, and community-based collaboration approaches, so as to progress system interoperability goals

Queen's University Belfast Research Portal

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

Directory of Open Access Journals

Fraunhofer-ePrints

PubMed Central

DSpace at NTUA

IMT Institutional Repository

Gutenberg Open Science

Gradient-based boosting for statistical relational learning: The relational dependency network case

Author: A. Assche Van
A. Karwath
B. Gutmann
B. Milch
Bernd Gutmann
C. Boutilier
C. Parker
C. Vens
D. Chickering
D. Fierens
D. Heckerman
D. Koller
D. Poole
D. Poole
H. Blockeel
H. Poon
J. Davis
J. H. Friedman
J. Neville
J. Neville
J. Neville
J. Pearl
Jude Shavlik
K. Kersting
K. Kersting
K. Kersting
Kristian Kersting
L. Breiman
L. Getoor
L. Getoor
L. Getoor
L. Mihalkova
L. Raedt De
M. Bilenko
M. Craven
M. Jaeger
P. Domingos
P. Singla
P. Singla
R. de Salvo Braz
R. Sutton
S. Kok
S. Kok
S. Lawrence
S. Muggleton
S. Natarajan
Sriraam Natarajan
T. G. Dietterich
T. Sato
T. Truyen
Tushar Khot
Y. Freund
Y. Jing
Z. Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Open Babel: An open chemical toolbox

Author: A Amini
A Andronico
A Bender
A Gakh
A Karwath
A Maunz
A Maunz
A Poater
A Rappe
AA Gakh
AD Hill
B-b Yan
BD McKay
C Helma
C Reynès
Chris Morley
CR Jacob
Craig A James
CW Bullock
D Filimonov
D Lagorce
D Lagorce
D Weininger
DC Bas
DC Lonie
DR Koes
F Fontaine
Geoffrey R Hutchison
GL Holliday
HL Morgan
I Wallach
I Wallach
IV Filippov
IV Tetko
J Ahmed
J Ahmed
J Kazius
J Myers
J Wang
J Wang
JH Chen
JJ Langham
JL Melville
JL Sharman
K Fogel
K Martin
L Fabian
L Liu
L Schietgat
M Brüstle
M Buehler
M Dehmer
M Konyk
M Krier
M Kuhn
MA Meineke
MA Miteva
Michael Banck
MJ Gómez
N O'Boyle
N Zonta
NM O'Boyle
NM O'Boyle
Noel M O'Boyle
O Sperandio
P Lind
P Murray-Rust
P Murray-Rust
P Murray-Rust
P Murray-Rust
P Rydberg
P Tosco
P Tosco
R Esposito
RA Bauer
RA Bauer
RS Armen
S Arbor
S Ingsriswang
SV Trepalin
T Cheng
T Halgren
T Halgren
T Halgren
T Halgren
T Halgren
T Kogej
T Pencheva
Tim Vandermeersch
TWH Backman
U Schmidt
VV Mihaleva
William H Green
X Jiang
X Wang
YD Paila
Z Huang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Background: A frequent problem in computational modeling is the interconversion of chemical structures between different formats. While standard interchange formats exist (for example, Chemical Markup Language) and de facto standards have arisen (for example, SMILES format), the need to interconvert formats is a continuing problem due to the multitude of different application areas for chemistry data, differences in the data stored by different formats (0D versus 3D, for example), and competition between software along with a lack of vendorneutral formats. Results: We discuss, for the first time, Open Babel, an open-source chemical toolbox that speaks the many languages of chemical data. Open Babel version 2.3 interconverts over 110 formats. The need to represent such a wide variety of chemical and molecular data requires a library that implements a wide range of cheminformatics algorithms, from partial charge assignment and aromaticity detection, to bond order perception and canonicalization. We detail the implementation of Open Babel, describe key advances in the 2.3 release, and outline a variety of uses both in terms of software products and scientific research, including applications far beyond simple format interconversion. Conclusions: Open Babel presents a solution to the proliferation of multiple chemical file formats. In addition, it provides a variety of useful utilities from conformer searching and 2D depiction, to filtering, batch conversion, and substructure and similarity searching. For developers, it can be used as a programming library to handle chemical data in areas such as organic chemistry, drug design, materials science, and computational chemistry. It is freely available under an open-source license fro

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Irish Universities

PubMed Central

Cork Open Research Archive

Cost curves: An improved method for visualizing classifier performance

Author: A. Karwath
A. P. Bradley
B. Efron
B. J. McNeil
C. E. Metz
C. J. van Rijsbergen
Chris Drummond
D. J. Hand
E. J. Halpern
F. P. Preparata
F. Provost
G. M. Weiss
G. Ma
G. Webb
I. H. Witten
J. A. Swets
J. A. Swets
J. Hilden
J. R. Quinlan
J. Tilbury
K. H. Zou
K. Jensen
L. Breiman
L. Saitta
M. Kubat
N. M. Adams
P. Clark
P. D. Turney
R. C. Holte
R. O. Duda
Robert C. Holte
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Runs of homozygosity reveal signatures of positive selection for reproduction traits in breed and non-breed horses

Author: A Khanshour
A McKenna
AH Freedman
AMP O’Brien
AR Boyko
B Giardine
B Wehrle-Haller
C Beechey
C Mau
CB Newgard
CS Ku
D Blankenberg
D Warde-Farley
DC Purfield
E Axelsson
E Cunningham
E-S Kim
EE Dymek
EJ Huang
FS Alkuraya
G-C Fan
H Ai
H Hamann
H Kayserili
H Li
H Li
H Mi
IM MacLeod
Ivo Glynne Gut
J Drgonova
J Galan
J Gibson
J Goecks
J Gu
J Metzger
J Reimand
J Reimand
J-P Gong
JJ Seitz
JL Petersen
Julia Metzger
K Ishibashi
KN Gregory
KS Aberle
L Orlando
LA Mavrogiannis
Lídia Águeda
M Bosse
M Ferencakovic
M Ferenčaković
M Kirin
M Mirotsou
M Nalls
M Nothnagel
M Pinheiro
M Taipale
Marta Gut
Matthias Karwath
MC Hunt
Ottmar Distl
P Cingolani
P Danecek
P Kumar
P Petrou
P Urbanska
PA Watkins
R Gahlmann
R McQuillan
Raul Tonda
S Beckmann
S MacEachern
S Makvandi-Nejad
S Pfahler
S Purcell
S Qanbari
S Qanbari
S Qanbari
S Wilkinson
SA Brooks
Sergi Beltran
T Lencz
T Nastasi
TM Foos
TS Korneliussen
V Warmuth
W McLaren
Y Liang
Y Matsui
Y Wang
Y Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

BIOINFORMATICS Functional Bioinformatics for Arabidopsis thaliana

Author: A. Clare A
A. Karwath B
R. D. King A
Publication venue
Publication date
Field of study

Motivation: The genome of Arabidopsis thaliana, which has the best understood plant genome, still has approximately one third of its genes with no functional annotation at all from either MIPS or TAIR. We have applied our Data Mining Prediction (DMP) method to the problem of predicting the functional classes of these protein sequences. This method is based on using a hybrid machine-learning/data-mining method to identify patterns in the bioinformatic data about sequences that are predictive of function. We use data about sequence, predicted secondary structure, predicted structural domain, InterPro patterns, sequence similarity profile, and expressions data. Results: We predicted the functional class of a high percentage of the Arabidopsis genes with currently unknown function. These predictions are interpretable and have good test accuracies. We describe in detail seven of the rules produced

CiteSeerX

Ocean Acidification Reduces Growth and Calcification in a Marine Dinoflagellate

Author: AD Moy
AG Dickson
AG Dickson
Appy Sluijs
AR Taylor
B Karwath
B Rost
B Rost
BE Bemis
Björn Rost
C Le Quere
C Mehrbach
CJM Hoppe
CL Sabine
DA Wolf-Gladrow
Dedmer B. Van de Waal
E Paasche
Gert-Jan Reichart
HJ Freeman
HJ Spero
Howard I. Browman
I Inouye
I Wendler
J Zhang
JM McCrea
JM Pandolfi
K Tangen
K Zonneveld
KAF Zonneveld
KJS Meier
KT Lohbeck
KW Beyenbach
L Beaufort
L Mackinder
L Mackinder
LT Bach
M Kohn
Mirja Hoins
N Gussone
N Uehlein
P Ziveri
P Ziveri
P Ziveri
Patrizia Ziveri
RE Zeebe
S Collins
S Trimborn
SA Kranz
SD Rokitta
ST Nelson
T Hildebrand-Habel
U Riebesell
U Riebesell
Uwe John
WC Beck
WG Mook
Y Araki
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Ocean acidification is considered a major threat to marine ecosystems and may particularly affect calcifying organisms such as corals, foraminifera and coccolithophores. Here we investigate the impact of elevated pCO2 and lowered pH on growth and calcification in the common calcareous dinoflagellate Thoracosphaera heimii. We observe a substantial reduction in growth rate, calcification and cyst stability of T. heimii under elevated pCO2. Furthermore, transcriptomic analyses reveal CO2 sensitive regulation of many genes, particularly those being associated to inorganic carbon acquisition and calcification. Stable carbon isotope fractionation for organic carbon production increased with increasing pCO2 whereas it decreased for calcification, which suggests interdependence between both processes. We also found a strong effect of pCO2 on the stable oxygen isotopic composition of calcite, in line with earlier observations concerning another T. heimii strain. The observed changes in stable oxygen and carbon isotope composition of T. heimii cysts may provide an ideal tool for reconstructing past seawater carbonate chemistry, and ultimately past pCO2. Although the function of calcification in T. heimii remains unresolved, this trait likely plays an important role in the ecological and evolutionary success of this species. Acting on calcification as well as growth, ocean acidification may therefore impose a great threat for T. heimii

OceanRep

Public Library of Science (PLOS)

Crossref

VU Research Portal

Directory of Open Access Journals

PubMed Central

Electronic Publication Information Center

Diposit Digital de Documents de la UAB

Utrecht University Repository

KNAW Repository