Search CORE

187 research outputs found

Visualising biological data: a semantic approach to tool and database integration

Author: AL Mitchell
Alice Villéger
C Bru
C Notredame
David Thorne
DJ Parry-Smith
DN Perkins
Douglas B Kell
E Sonnhammer
I Letunic
J McCarthy
J Pérez
J Schultz
James Marsh
JD Thompson
L Devereux
L Pritchard
MY Galperin
N Hulo
NF Noy
NJ Mulder
P McDermott
Philip McDermott
PW Lord
RC Edgar
RD Finn
S Bergamaschi
S Hunter
S Pettifer
Steve Pettifer
Teresa K Attwood
TK Attwood
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Motivation In the biological sciences, the need to analyse vast amounts of information has become commonplace. Such large-scale analyses often involve drawing together data from a variety of different databases, held remotely on the internet or locally on in-house servers. Supporting these tasks are <it>ad hoc </it>collections of data-manipulation tools, scripting languages and visualisation software, which are often combined in arcane ways to create cumbersome systems that have been customised for a particular purpose, and are consequently not readily adaptable to other uses. For many day-to-day bioinformatics tasks, the sizes of current databases, and the scale of the analyses necessary, now demand increasing levels of automation; nevertheless, the unique experience and intuition of human researchers is still required to interpret the end results in any meaningful biological way. Putting humans in the loop requires tools to support real-time interaction with these vast and complex data-sets. Numerous tools do exist for this purpose, but many do not have optimal interfaces, most are effectively isolated from other tools and databases owing to incompatible data formats, and many have limited real-time performance when applied to realistically large data-sets: much of the user's cognitive capacity is therefore focused on controlling the software and manipulating esoteric file formats rather than on performing the research. Methods To confront these issues, harnessing expertise in human-computer interaction (HCI), high-performance rendering and distributed systems, and guided by bioinformaticians and end-user biologists, we are building reusable software components that, together, create a toolkit that is both architecturally sound from a computing point of view, and addresses both user and developer requirements. Key to the system's usability is its direct exploitation of semantics, which, crucially, gives individual components knowledge of their own functionality and allows them to interoperate seamlessly, removing many of the existing barriers and bottlenecks from standard bioinformatics tasks. Results The toolkit, named Utopia, is freely available from <url>http://utopia.cs.man.ac.uk/</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Manchester - Institutional Repository

Biodegradative mechanism of the brown rot basidiomycete Gloeophyllum trabeum: evidence for an extracellular hydroquinone-driven fenton reaction

Author: Attwood M
Baserga SJ
Briggs JW
Cockman ME
di Summa FM
Dinman JD
Fischer R
Gleizes P-E
Houtkooper RH
Kuijpers TW
MacInnes AW
O'Donohue M-F
Paolini NA
Rachuri S
Ratcliffe P
Sondalle SB
van Adrichem AM
Vieira CM
von Lindern M
Wlodarski MW
Publication venue: Federation of European Biochemical Societies. Published by Elsevier B.V.
Publication date: 05/03/1999
Field of study

AbstractWe have identified key components of the extracellular oxidative system that the brown rot fungus Gloeophyllum trabeum uses to degrade a recalcitrant polymer, polyethylene glycol, via hydrogen abstraction reactions. G. trabeum produced an extracellular metabolite, 2,5-dimethoxy-1,4-benzoquinone, and reduced it to 2,5-dimethoxyhydroquinone. In the presence of 2,5-dimethoxy-1,4-benzoquinone, the fungus also reduced extracellular Fe3+ to Fe2+ and produced extracellular H2O2. Fe3+ reduction and H2O2 formation both resulted from a direct, non-enzymatic reaction between 2,5-dimethoxyhydroquinone and Fe3+. polyethylene glycol depolymerization by G. trabeum required both 2,5-dimethoxy-1,4-benzoquinone and Fe3+ and was completely inhibited by catalase. These results provide evidence that G. trabeum uses a hydroquinone-driven Fenton reaction to cleave polyethylene glycol. We propose that similar reactions account for the ability of G. trabeum to attack lignocellulose

Elsevier - Publisher Connector

Crossref

PubMed Central

Oxford University Research Archive

Recommended from our members

A behavioral comparison of male and female adults with high functioning autism spectrum conditions

Author: A Ben-Sasson
A Le Couteur
AJ Russell
Amber N. V. Ruigrok
AS Carter
AT Beck
AT Beck
B Auyeung
B Auyeung
B Hofvander
Bhismadev Chakrabarti
C Allison
C Gillberg
C Gillberg
C Lord
C Lord
C Lord
C Lord
C Zahn-Waxler
CJ Newschaffer
D Wechsler
DG Murphy
DH Skuse
DH Skuse
E Fombonne
E Schwarz
EB Foa
ER Ritvo
FR Volkmar
FR Volkmar
G Baird
G Iarocci
GB Mesibov
Greg Pasco
H Asperger
H Markram
H Ring
JA Bastiaansen
James G. Scott
JD McLennan
JG Williams
JK Miller
K Gotham
K Gotham
L Ernsperger
L Kanner
L Mottron
L Ruta
L Tsai
L Wing
L Wing
L Wing
L Wing
LH Willey
LY Tsai
M Holtmann
M Rutter
Meng-Chuan Lai
Michael V. Lombardo
MV Lombardo
MV Lombardo
NR Crick
R Parke
R Simone
S Baron-Cohen
S Baron-Cohen
S Baron-Cohen
S Baron-Cohen
S Baron-Cohen
S Baron-Cohen
S Baron-Cohen
S Baron-Cohen
S Kopp
S Wheelwright
S Wolff
Sally J. Wheelwright
Simon Baron-Cohen
SL Hartley
Susan A. Sadek
T Attwood
T Attwood
T Pilowsky
V Lotter
W Jones
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011
Field of study

Autism spectrum conditions (ASC) affect more males than females in the general population. However, within ASC it is unclear if there are phenotypic sex differences. Testing for similarities and differences between the sexes is important not only for clinical assessment but also has implications for theories of typical sex differences and of autism. Using cognitive and behavioral measures, we investigated similarities and differences between the sexes in age- and IQ-matched adults with ASC (high-functioning autism or Asperger syndrome). Of the 83 (45 males and 38 females) participants, 62 (33 males and 29 females) met Autism Diagnostic Interview-Revised (ADI-R) cut-off criteria for autism in childhood and were included in all subsequent analyses. The severity of childhood core autism symptoms did not differ between the sexes. Males and females also did not differ in self-reported empathy, systemizing, anxiety, depression, and obsessive-compulsive traits/symptoms or mentalizing performance. However, adult females with ASC showed more lifetime sensory symptoms (p = 0.036), fewer current socio-communication difficulties (p = 0.001), and more self-reported autistic traits (p = 0.012) than males. In addition, females with ASC who also had developmental language delay had lower current performance IQ than those without developmental language delay (p<0.001), a pattern not seen in males. The absence of typical sex differences in empathizing-systemizing profiles within the autism spectrum confirms a prediction from the extreme male brain theory. Behavioral sex differences within ASC may also reflect different developmental mechanisms between males and females with ASC. We discuss the importance of the superficially better socio-communication ability in adult females with ASC in terms of why females with ASC may more often go under-recognized, and receive their diagnosis later, than males

Central Archive at the University of Reading

Public Library of Science (PLOS)

CiteSeerX

Crossref

Online Research @ Cardiff

Directory of Open Access Journals

PubMed Central

UCL Discovery

King's Research Portal

PhenoFam-gene set enrichment analysis through protein structural information

Author: A Keller
A Subramanian
ASL Cheng
CA Worby
CH Wu
daW Huang
DH Haft
F Al-Shahrour
F Corpet
F Hahne
F Pearl
Frank Buchholz
G Dennis
H Mi
I Letunic
J Gough
JD Storey
JD Storey
JJ Fuster
LN Nguyen
M Ashburner
M Kanehisa
M Teresa Pisabarro
MA Sartor
Maciej Paszkowski-Rogacz
Mikolaj Slabicki
N Hulo
P Khatri
PD Thomas
PD Thomas
R Kittler
RD Finn
RJ Hernstein
S Hunter
SY Kim
T Li
T Lima
TJ Hubbard
TK Attwood
U Fuchs
VK Mootha
WJ Gehring
Y Ben-Shaul
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background With the current technological advances in high-throughput biology, the necessity to develop tools that help to analyse the massive amount of data being generated is evident. A powerful method of inspecting large-scale data sets is gene set enrichment analysis (GSEA) and investigation of protein structural features can guide determining the function of individual genes. However, a convenient tool that combines these two features to aid in high-throughput data analysis has not been developed yet. In order to fill this niche, we developed the user-friendly, web-based application, PhenoFam. Results PhenoFam performs gene set enrichment analysis by employing structural and functional information on families of protein domains as annotation terms. Our tool is designed to analyse complete sets of results from quantitative high-throughput studies (gene expression microarrays, functional RNAi screens, <it>etc</it>.) without prior pre-filtering or hits-selection steps. PhenoFam utilizes Ensembl databases to link a list of user-provided identifiers with protein features from the InterPro database, and assesses whether results associated with individual domains differ significantly from the overall population. To demonstrate the utility of PhenoFam we analysed a genome-wide RNA interference screen and discovered a novel function of plexins containing the cytoplasmic RasGAP domain. Furthermore, a PhenoFam analysis of breast cancer gene expression profiles revealed a link between breast carcinoma and altered expression of PX domain containing proteins. Conclusions PhenoFam provides a user-friendly, easily accessible web interface to perform GSEA based on high-throughput data sets and structural-functional protein information, and therefore aids in functional annotation of genes.</p

Qucosa

Crossref

HSSS - Hochschulschriftenserver der SLUB

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Technische Universität Dresden: Qucosa

Keeping Pace with Your Eating: Visual Feedback Affects Eating Rate in Humans

Author: A Field
AJ Stunkard
AL Ford
AS Attwood
B Wansink
BA Cassady
C de Graaf
C de Graaf
C de Graaf
CG Forde
Danielle Ferriday
DP Bolhuis
DP Bolhuis
E Almiron-Roig
E Robinson
EC Hinton
EM Blass
EM Bobroff
GB Haber
JD Davis
JD Davis
JE Cecil
Jeffrey M. Brunstrom
JM Brunstrom
JM Brunstrom
JM Brunstrom
K Maruyama
KS Burger
Laura L. Wilkinson
M Viskaal-van Dongen
Marcello Costantini
Matthew L. Bosworth
MR Yeomans
Nathalie Martin
Nicolas Godinot
Peter J. Rogers
PS Hogenkamp
R Otsuka
RE Oldham-Cooper
S Sasaki
SC Wooley
SH Fay
SW Hill
TA Spiegel
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/02/2016
Field of study

Deliberately eating at a slower pace promotes satiation and eating quickly has been associated with a higher body mass index. Therefore, understanding factors that affect eating rate should be given high priority. Eating rate is affected by the physical/textural properties of a food, by motivational state, and by portion size and palatability. This study explored the prospect that eating rate is also influenced by a hitherto unexplored cognitive process that uses ongoing perceptual estimates of the volume of food remaining in a container to adjust intake during a meal. A 2 (amount seen; 300ml or 500ml) x 2 (amount eaten; 300ml or 500ml) between-subjects design was employed (10 participants in each condition). In two ‘congruent’ conditions, the same amount was seen at the outset and then subsequently consumed (300ml or 500ml). To dissociate visual feedback of portion size and actual amount consumed, food was covertly added or removed from a bowl using a peristaltic pump. This created two additional ‘incongruent’ conditions, in which 300ml was seen but 500ml was eaten or vice versa. We repeated these conditions using a savoury soup and a sweet dessert. Eating rate (ml per second) was assessed during lunch. After lunch we assessed fullness over a 60-minute period. In the congruent conditions, eating rate was unaffected by the actual volume of food that was consumed (300ml or 500ml). By contrast, we observed a marked difference across the incongruent conditions. Specifically, participants who saw 300ml but actually consumed 500ml ate at a faster rate than participants who saw 500ml but actually consumed 300ml. Participants were unaware that their portion size had been manipulated. Nevertheless, when it disappeared faster or slower than anticipated they adjusted their rate of eating accordingly. This suggests that the control of eating rate involves visual feedback and is not a simple reflexive response to orosensory stimulatio

Public Library of Science (PLOS)

Crossref

Online Research @ Cardiff

Directory of Open Access Journals

PubMed Central

Cronfa at Swansea University

Explore Bristol Research

FigShare

Human phenotype ontology annotation and cluster analysis to unravel genetic defects in 707 cases with unexplained bleeding and platelet disorders

Author: Attwood A
Austin S
Bakchoul T
Bariana TK
Crisp-Hihn A
Erber WN
Favier R
Foad N
Freson K
Furie B
Gattens M
Gomez K
Greene D
Jansen SBG
Jolley JD
Kelly AM
Laffan MA
Lambert MP
Lentaigne C
Liesner R
Meacham S
Millar CM
Mumford AD
Nurden AT
Nurden P
Ouwehand WH
Peerlinck K
Perry DJ
Pillois X
Poudel P
Rendon A
Richardson S
Robinson PN
Schulman S
Schulze H
Simeoni I
Stephens JC
Turro E
Van Geet C
Westbury SK
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/03/2015
Field of study

Spiral - Imperial College Digital Repository

Light-induced transcriptional responses associated with proteorhodopsin-enhanced growth in a marine flavobacterium

Author: A Martinez
Asuncion Martinez
BL Taylor
CC Häse
Curtis R Young
Edward F DeLong
EF DeLong
FJ Stewart
FX Cunningham
G Armstrong
G Sabehi
H Futamata
H Tokuda
Hiroyuki Kimura
J McCarren
J McCarren
JD Storey
JL Spudich
JM González
JM Walter
JR de la Torre
K Izui
KC Keiler
L Gómez-Consarnau
LS Brown
M Gomelsky
M Reich
MA Moran
MB Eisen
N-U Frigaard
O Béjà
O Béjà
PP Gardner
PV Attwood
R Gillet
R Lami
S Jitrapakdee
SD Moore
SJ Giovannoni
U Stingl
V Anantharaman
Y Benjamini
Y Shi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2011
Field of study

Proteorhodopsin (PR) is a photoprotein that functions as a light-driven proton pump in diverse marine Bacteria and Archaea. Recent studies have suggested that PR may enhance both growth rate and yield in some flavobacteria when grown under nutrient-limiting conditions in the light. The direct involvement of PR, and the metabolic details enabling light-stimulated growth, however, remain uncertain. Here, we surveyed transcriptional and growth responses of a PR-containing marine flavobacterium during carbon-limited growth in the light and the dark. As previously reported (Gómez-Consarnau et al., 2007), Dokdonia strain MED134 exhibited light-enhanced growth rates and cell yields under low carbon growth conditions. Inhibition of retinal biosynthesis abolished the light-stimulated growth response, supporting a direct role for retinal-bound PR in light-enhanced growth. Among protein-coding transcripts, both PR and retinal biosynthetic enzymes showed significant upregulation in the light. Other light-associated proteins, including bacterial cryptochrome and DNA photolyase, were also expressed at significantly higher levels in the light. Membrane transporters for Na+/phosphate and Na+/alanine symporters, and the Na+-translocating NADH-quinone oxidoreductase (NQR) linked electron transport chain, were also significantly upregulated in the light. Culture experiments using a specific inhibitor of Na+-translocating NQR indicated that sodium pumping via NQR is a critical metabolic process in the light-stimulated growth of MED134. In total, the results suggested the importance of both the PR-enabled, light-driven proton gradient, as well as the generation of a Na+ ion gradient, as essential components for light-enhanced growth in these flavobacteria.Gordon and Betty Moore FoundationNational Science Foundation (U.S.) (NSF Science and Technology Center Award EF0424599.)Japan Society for the Promotion of Science (Postdoctoral Fellowships for Research Abroad

DSpace@MIT

Crossref

Investigation of G72 (DAOA) expression in the human brain

Author: A Siepel
AD Medhurst
B John
C Burge
CM van Drunen
E Birney
E Hattori
EL Sonnhammer
F Larsen
Fiona Kelly
G Pesole
I Chumakov
Isabel Benzel
J Cheng
Jacqueline de Belleroche
James NC Kew
JD Bendtsen
JL Ashurst
JNC Kew
L Falquet
L Verrall
LM Melnick
M Korostishevsky
M Kvajo
M Rehmsmeier
MC Frith
MJ van Baren
P Rice
Peter R Maycox
R Kapoor
Ramya Viknaraja
SD Detera-Wadleigh
Steven Hirsch
T Bakheet
T Werner
T Wiehe
TA Down
Thirza H Sanderson
TK Attwood
VG Levitsky
WJ Kent
Publication venue: BioMed Central
Publication date: 01/12/2008
Field of study

Abstract Background Polymorphisms at the G72/G30 locus on chromosome 13q have been associated with schizophrenia or bipolar disorder in more than ten independent studies. Even though the genetic findings are very robust, the physiological role of the predicted G72 protein has thus far not been resolved. Initial reports suggested G72 as an activator of D-amino acid oxidase (DAO), supporting the glutamate dysfunction hypothesis of schizophrenia. However, these findings have subsequently not been reproduced and reports of endogenous human G72 mRNA and protein expression are extremely limited. In order to better understand the function of this putative schizophrenia susceptibility gene, we attempted to demonstrate G72 mRNA and protein expression in relevant human brain regions. Methods The expression of G72 mRNA was studied by northern blotting and semi-quantitative SYBR-Green and Taqman RT-PCR. Protein expression in human tissue lysates was investigated by western blotting using two custom-made specific anti-G72 peptide antibodies. An in-depth <it>in silico </it>analysis of the G72/G30 locus was performed in order to try and identify motifs or regulatory elements that provide insight to G72 mRNA expression and transcript stability. Results Despite using highly sensitive techniques, we failed to identify significant levels of G72 mRNA in a variety of human tissues (e.g. adult brain, amygdala, caudate nucleus, fetal brain, spinal cord and testis) human cell lines or schizophrenia/control post mortem BA10 samples. Furthermore, using western blotting in combination with sensitive detection methods, we were also unable to detect G72 protein in a number of human brain regions (including cerebellum and amygdala), spinal cord or testis. A detailed <it>in silico </it>analysis provides several lines of evidence that support the apparent low or absent expression of G72. Conclusion Our results suggest that native G72 protein is not normally present in the tissues that we analysed in this study. We also conclude that the lack of demonstrable G72 expression in relevant brain regions does not support a role for G72 in modulation of DAO activity and the pathology of schizophrenia via a DAO-mediated mechanism. <it>In silico </it>analysis suggests that G72 is not robustly expressed and that the transcript is potentially labile. Further studies are required to understand the significance of the G72/30 locus to schizophrenia.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Spiral - Imperial College Digital Repository

Gastroesophageal reflux leads to esophageal cancer in a surgical model with mice

Author: A Hofmann
A Yang
AG Casson
Ba Liu
Chung S Yang
CP Hsu
DH Stamp
DT McManus
E Piazuelo
FH Ellis Jr
G Cammarota
G Mercante
J Que
JD Long
Jing Hao
K Miwa
K Nishijima
KH Chen
KJ Levin
KN Christie
LA Feagins
M Fein
M Harvey
M Lechpammer
M Pera
M Pera
M Pera
M Sarbia
M Tachibana
M Vieth
MD Duncan
N Hashimoto
RC Martin
RS Holmes
SE Attwood
SM Chung
SR Goldstein
SR Goldstein
X Chen
X Chen
X Chen
X Chen
X Chen
Xiaoxin Chen
Y Kuroiwa
Y Su
Y Wang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Esophago-gastroduodenal anastomosis with rats mimics the development of human Barrett's esophagus and esophageal adenocarcinoma by introducing mixed reflux of gastric and duodenal contents into the esophagus. However, use of this rat model for mechanistic and chemopreventive studies is limited due to lack of genetically modified rat strains. Therefore, a mouse model of esophageal adenocarcinoma is needed. Methods We performed reflux surgery on wild-type, <it>p53</it><it>A</it>135<it>V </it>transgenic, and <it>INK4a/Arf</it>+/- mice of A/J strain. Some mice were also treated with omeprazole (1,400 ppm in diet), iron (50 mg/kg/m, <it>i.p</it>.), or gastrectomy plus iron. Mouse esophagi were harvested at 20, 40 or 80 weeks after surgery for histopathological analysis. Results At week 20, we observed metaplasia in wild-type mice (5%, 1/20) and <it>p53</it><it>A</it>135<it>V </it>mice (5.3%, 1/19). At week 40, metaplasia was found in wild-type mice (16.2%, 6/37), <it>p53</it><it>A</it>135<it>V </it>mice (4.8%, 2/42), and wild-type mice also receiving gastrectomy and iron (6.7%, 1/15). Esophageal squamous cell carcinoma developed in <it>INK4a/Arf</it>+/- mice (7.1%, 1/14), and wild-type mice receiving gastrectomy and iron (21.4%, 3/14). Among 13 wild-type mice which were given iron from week 40 to 80, twelve (92.3%) developed squamous cell carcinoma at week 80. None of these mice developed esophageal adenocarcinoma. Conclusion Surgically induced gastroesophageal reflux produced esophageal squamous cell carcinoma, but not esophageal adenocarcinoma, in mice. Dominant negative <it>p53 </it>mutation, heterozygous loss of <it>INK4a/Arf</it>, antacid treatment, iron supplementation, or gastrectomy failed to promote esophageal adenocarcinoma in these mice. Further studies are needed in order to develop a mouse model of esophageal adenocarcinoma.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Gene ontology based transfer learning for protein subcellular localization

Author: A Bateman
A Dijk
A Hoglund
A Hoglund
A Pierleoni
C Chen
C Leslie
C Leslie
DH Haft
E Marcotte
EM Zdobnov
F Corpet
FM Li
G Lanckriet
G Schneider
H Ding
H Lin
H Lin
H Liu
H Rangwala
H Shen
HB Shen
HB Shen
HB Shen
HB Shen
HB Shen
J Cedano
J Schultz
J Shen
JD Qiu
JD Qiu
K Chou
K Chou
K Chou
K Hofmann
K Lee
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
L Nanni
M Ashburner
M Esmaeili
M Mak
M Wang
Q Gu
Q Yang
R Apweiler
R Kuang
R Kuang
S Mei
S Pan
Shuigeng Zhou
Suyu Mei
T Blum
T Tung
TK Attwood
W Dai
W Dai
W Huang
W Huang
Wang Fei
X Jiang
X Xiao
XB Zhou
YH Zeng
YS Ding
YS Ding
Z Lei
Z Lu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of data information may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources for exploiting multi-aspect protein feature information. Gene ontology, hereinafter referred to as <it>GO</it>, uses a controlled vocabulary to depict biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenated the <it>GO </it>terms into a flat binary vector or applied majority-vote based ensemble learning for protein subcellular localization, both of which can not estimate the individual discriminative abilities of the three aspects of gene ontology. Results In this paper, we propose a Gene Ontology Based Transfer Learning Model (<it>GO-TLM</it>) for large-scale protein subcellular localization. The model transfers the signature-based homologous <it>GO </it>terms to the target proteins, and further constructs a reliable learning system to reduce the adverse affect of the potential false <it>GO </it>terms that are resulted from evolutionary divergence. We derive three <it>GO </it>kernels from the three aspects of gene ontology to measure the <it>GO </it>similarity of two proteins, and derive two other spectrum kernels to measure the similarity of two protein sequences. We use simple non-parametric cross validation to explicitly weigh the discriminative abilities of the five kernels, such that the time & space computational complexities are greatly reduced when compared to the complicated semi-definite programming and semi-indefinite linear programming. The five kernels are then linearly merged into one single kernel for protein subcellular localization. We evaluate <it>GO-TLM </it>performance against three baseline models: <it>MultiLoc, MultiLoc-GO </it>and <it>Euk-mPLoc </it>on the benchmark datasets the baseline models adopted. 5-fold cross validation experiments show that <it>GO-TLM </it>achieves substantial accuracy improvement against the baseline models: 80.38% against model <it>Euk-mPLoc </it>67.40% with <it>12.98% </it>substantial increase; 96.65% and 96.27% against model <it>MultiLoc-GO </it>89.60% and 89.60%, with <it>7.05% </it>and <it>6.67% </it>accuracy increase on dataset <it>MultiLoc plant </it>and dataset <it>MultiLoc animal</it>, respectively; 97.14%, 95.90% and 96.85% against model <it>MultiLoc-GO </it>83.70%, 90.10% and 85.70%, with accuracy increase <it>13.44%</it>, <it>5.8% </it>and <it>11.15% </it>on dataset <it>BaCelLoc plant</it>, dataset <it>BaCelLoc fungi </it>and dataset <it>BaCelLoc animal </it>respectively. For <it>BaCelLoc </it>independent sets, <it>GO-TLM </it>achieves 81.25%, 80.45% and 79.46% on dataset <it>BaCelLoc plant holdout</it>, dataset <it>BaCelLoc plant holdout </it>and dataset <it>BaCelLoc animal holdout</it>, respectively, as compared against baseline model <it>MultiLoc-GO </it>76%, 60.00% and 73.00%, with accuracy increase <it>5.25%</it>, <it>20.45% </it>and <it>6.46%</it>, respectively. Conclusions Since direct homology-based <it>GO </it>term transfer may be prone to introducing noise and outliers to the target protein, we design an explicitly weighted kernel learning system (called Gene Ontology Based Transfer Learning Model, <it>GO-TLM</it>) to transfer to the target protein the known knowledge about related homologous proteins, which can reduce the risk of outliers and share knowledge between homologous proteins, and thus achieve better predictive performance for protein subcellular localization. Cross validation and independent test experimental results show that the homology-based <it>GO </it>term transfer and explicitly weighing the <it>GO </it>kernels substantially improve the prediction performance.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central