Search CORE

ResearchOnline at James Cook University

NSU Works

MPG.PuRe

The Francis Crick Institute

Public Library of Science (PLOS)

Repository of the Academy's Library

University of East Anglia digital repository

High-order graph matching kernel for early carcinoma EUS image classification

Author: A Das
A Săftoiu
B Julesz
C De Angelis
CE Shannon
CJ Buskens
DE Loren
DG Lowe
Edwin R. Hancock
F Eriksson
G Lerman
GL Scott
H Tamura
HC Van
HC Van
ID Norton
J Levman
J Munkres
K Olowe
L Bai
Lu Bai
M Zhang
M Zhu
MC Kolios
N Shervashidze
N Shervashidze
O Pech
Peng Ren
R Jenssen
T Jebara
VX Nguyen
WS Noble
Y Nagami
Zhihong Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

White Rose Research Online

Predicting mental imagery based BCI performance from personality, cognitive profile and neurophysiological patterns

Mental-Imagery based Brain-Computer Interfaces (MI-BCIs) allow their users to send commands to a computer using their brain-activity alone (typically measured by ElectroEncephaloGraphy— EEG), which is processed while they perform specific mental tasks. While very promising, MI-BCIs remain barely used outside laboratories because of the difficulty encountered by users to control them. Indeed, although some users obtain good control performances after training, a substantial proportion remains unable to reliably control an MI-BCI. This huge variability in user-performance led the community to look for predictors of MI-BCI control ability. However, these predictors were only explored for motor-imagery based BCIs, and mostly for a single training session per subject. In this study, 18 participants were instructed to learn to control an EEG-based MI-BCI by performing 3 MI-tasks, 2 of which were non-motor tasks, across 6 training sessions, on 6 different days. Relationships between the participants’ BCI control performances and their personality, cognitive profile and neurophysiological markers were explored. While no relevant relationships with neurophysiological markers were found, strong correlations between MI-BCI performances and mental-rotation scores (reflecting spatial abilities) were revealed. Also, a predictive model of MI-BCI performance based on psychometric questionnaire scores was proposed. A leave-one-subject-out cross validation process revealed the stability and reliability of this model: it enabled to predict participants’ performance with a mean error of less than 3 points. This study determined how users’ profiles impact their MI-BCI control ability and thus clears the way for designing novel MI-BCI training protocols, adapted to the profile of each user

Public Library of Science (PLOS)

INRIA a CCSD electronic archive server

HAL: Hyper Article en Ligne

Sussex Research Online

SVM Classifier – a comprehensive java interface for support vector machine classification of microarray data

Author: B Schölkopf
B Schölkopf
C Cortes
Chang Chih-Chung
I Guyon
I Hedenfalk
Mehdi Pirooznia
MPS Brown
P Pavlidis
T Joachims
VN Vapnik
WS Noble
Youping Deng
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

MOTIVATION: Graphical user interface (GUI) software promotes novelty by allowing users to extend the functionality. SVM Classifier is a cross-platform graphical application that handles very large datasets well. The purpose of this study is to create a GUI application that allows SVM users to perform SVM training, classification and prediction. RESULTS: The GUI provides user-friendly access to state-of-the-art SVM methods embodied in the LIBSVM implementation of Support Vector Machine. We implemented the java interface using standard swing libraries. We used a sample data from a breast cancer study for testing classification accuracy. We achieved 100% accuracy in classification among the BRCA1–BRCA2 samples with RBF kernel of SVM. CONCLUSION: We have developed a java GUI application that allows SVM users to perform SVM training, classification and prediction. We have demonstrated that support vector machines can accurately classify genes into functional categories based upon expression data from DNA microarray hybridization experiments. Among the different kernel functions that we examined, the SVM that uses a radial basis kernel function provides the best performance. The SVM Classifier is available at

Aquila Digital Community (University of Southern Mississippi, USM)

A Regression-based K nearest neighbor algorithm for gene function prediction from heterogeneous data

Author: A Enright
A Gavin
A Grigoriev
A Hoerl
AJ Dobson
EG WS Cleveland
G GH
GRG Lanckriet
H Ge
M Deng
M Eisen
M Fellenberg
MPS Brown
O Troyanskaya
P Liang
P Pavlidis
P Pavlidis
R Overbeek
R Tibshirani
Walter L Ruzzo
WS Noble
Y Zheng
Zizhen Yao
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: As a variety of functional genomic and proteomic techniques become available, there is an increasing need for functional analysis methodologies that integrate heterogeneous data sources. METHODS: In this paper, we address this issue by proposing a general framework for gene function prediction based on the k-nearest-neighbor (KNN) algorithm. The choice of KNN is motivated by its simplicity, flexibility to incorporate different data types and adaptability to irregular feature spaces. A weakness of traditional KNN methods, especially when handling heterogeneous data, is that performance is subject to the often ad hoc choice of similarity metric. To address this weakness, we apply regression methods to infer a similarity metric as a weighted combination of a set of base similarity measures, which helps to locate the neighbors that are most likely to be in the same class as the target gene. We also suggest a novel voting scheme to generate confidence scores that estimate the accuracy of predictions. The method gracefully extends to multi-way classification problems. RESULTS: We apply this technique to gene function prediction according to three well-known Escherichia coli classification schemes suggested by biologists, using information derived from microarray and genome sequencing data. We demonstrate that our algorithm dramatically outperforms the naive KNN methods and is competitive with support vector machine (SVM) algorithms for integrating heterogenous data. We also show that by combining different data sources, prediction accuracy can improve significantly. CONCLUSION: Our extension of KNN with automatic feature weighting, multi-class prediction, and probabilistic inference, enhance prediction accuracy significantly while remaining efficient, intuitive and flexible. This general framework can also be applied to similar classification problems involving heterogeneous datasets

Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo

Author: AE Carpenter
BE Boser
CC Chang
G Lin
G Lin
JI Murray
JI Murray
John I Murray
M Wang
M Wang
MR Lamprecht
MS Vokes
R Wollman
RA Russell
Robert H Waterston
S Hamahashi
S Sanei
TJ Boyle
William S Noble
WS Noble
X Chen
Z Bao
Zafer Aydin
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Image analysis is an essential component in many biological experiments that study gene expression, cell cycle progression, and protein localization. A protocol for tracking the expression of individual <it>C. elegans </it>genes was developed that collects image samples of a developing embryo by 3-D time lapse microscopy. In this protocol, a program called StarryNite performs the automatic recognition of fluorescently labeled cells and traces their lineage. However, due to the amount of noise present in the data and due to the challenges introduced by increasing number of cells in later stages of development, this program is not error free. In the current version, the error correction (<it>i.e</it>., editing) is performed manually using a graphical interface tool named AceTree, which is specifically developed for this task. For a single experiment, this manual annotation task takes several hours. Results In this paper, we reduce the time required to correct errors made by StarryNite. We target one of the most frequent error types (movements annotated as divisions) and train a support vector machine (SVM) classifier to decide whether a division call made by StarryNite is correct or not. We show, via cross-validation experiments on several benchmark data sets, that the SVM successfully identifies this type of error significantly. A new version of StarryNite that includes the trained SVM classifier is available at <url>http://starrynite.sourceforge.net</url>. Conclusions We demonstrate the utility of a machine learning approach to error annotation for StarryNite. In the process, we also provide some general methodologies for developing and validating a classifier with respect to a given pattern recognition task.</p

Public Library of Science (PLOS)

High Resolution Models of Transcription Factor-DNA Affinities Improve In Vitro and In Vivo Binding Predictions

Author: Aaron Arvey
C Kissinger
C Leslie
C Zhu
Christina Leslie
CT Harbison
D Fulton
DE Newburger
E Bolotin
E Fraenkel
G Badis
G Badis
G Pavesi
MF Berger
O Wallerman
P Kharchenko
Phaedra Agius
R Kuang
S Georgiev
Uwe Ohler
William Chang
William Stafford Noble
WS Noble
X Chen
X Chen
XS Liu
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Accurately modeling the DNA sequence preferences of transcription factors (TFs), and using these models to predict in vivo genomic binding sites for TFs, are key pieces in deciphering the regulatory code. These efforts have been frustrated by the limited availability and accuracy of TF binding site motifs, usually represented as position-specific scoring matrices (PSSMs), which may match large numbers of sites and produce an unreliable list of target genes. Recently, protein binding microarray (PBM) experiments have emerged as a new source of high resolution data on in vitro TF binding specificities. PBM data has been analyzed either by estimating PSSMs or via rank statistics on probe intensities, so that individual sequence patterns are assigned enrichment scores (E-scores). This representation is informative but unwieldy because every TF is assigned a list of thousands of scored sequence patterns. Meanwhile, high-resolution in vivo TF occupancy data from ChIP-seq experiments is also increasingly available. We have developed a flexible discriminative framework for learning TF binding preferences from high resolution in vitro and in vivo data. We first trained support vector regression (SVR) models on PBM data to learn the mapping from probe sequences to binding intensities. We used a novel -mer based string kernel called the di-mismatch kernel to represent probe sequence similarities. The SVR models are more compact than E-scores, more expressive than PSSMs, and can be readily used to scan genomics regions to predict in vivo occupancy. Using a large data set of yeast and mouse TFs, we found that our SVR models can better predict probe intensity than the E-score method or PBM-derived PSSMs. Moreover, by using SVRs to score yeast, mouse, and human genomic regions, we were better able to predict genomic occupancy as measured by ChIP-chip and ChIP-seq experiments. Finally, we found that by training kernel-based models directly on ChIP-seq data, we greatly improved in vivo occupancy prediction, and by comparing a TF's in vitro and in vivo models, we could identify cofactors and disambiguate direct and indirect binding

CiteSeerX

Physicochemical property distributions for accurate and rapid pairwise protein homology detection

Author: A Ben-Hur
A Kumar
AG Murzin
AR Shah
B Liu
BJ Webb-Robertson
BJ Webb-Robertson
BJ Webb-Robertson
Bobbie-Jo M Webb-Robertson
C Leslie
Christopher S Oehmen
CS Leslie
H Rangwala
H Saigo
I Jung
I Melvin
I Melvin
J Weston
Kyle G Ratuiste
L Liao
NH Anderson
QW Dong
R Kuang
S Hochreiter
SF Altschul
SF Altschul
T Damoulas
T Lingner
TF Smith
WS Noble
WS Noble
Y Hou
Y Hou
Y Yang
Y Yuan
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The challenge of remote homology detection is that many evolutionarily related sequences have very little similarity at the amino acid level. Kernel-based discriminative methods, such as support vector machines (SVMs), that use vector representations of sequences derived from sequence properties have been shown to have superior accuracy when compared to traditional approaches for the task of remote homology detection. Results We introduce a new method for feature vector representation based on the physicochemical properties of the primary protein sequence. A distribution of physicochemical property scores are assembled from 4-mers of the sequence and normalized based on the null distribution of the property over all possible 4-mers. With this approach there is little computational cost associated with the transformation of the protein into feature space, and overall performance in terms of remote homology detection is comparable with current state-of-the-art methods. We demonstrate that the features can be used for the task of pairwise remote homology detection with improved accuracy versus sequence-based methods such as BLAST and other feature-based methods of similar computational cost. Conclusions A protein feature method based on physicochemical properties is a viable approach for extracting features in a computationally inexpensive manner while retaining the sensitivity of SVM protein homology detection. Furthermore, identifying features that can be used for generic pairwise homology detection in lieu of family-based homology detection is important for applications such as large database searches and comparative genomics.</p

University of Liverpool Repository

Population‐based cohort study of outcomes following cholecystectomy for benign gallbladder diseases

Author: Aawsaj Y
Abayomi M
Acharya V
Aggarwal SK
Ahmad A
Ahmed I
Ahmed J
Ahmed J
Ahmed T
Ainley P
Akhtar M
Al Amari K
Al-Abed YA
Al-Akash M
Al-Bahrani AZ
Al-Khyatt W
Al-Muhktar A
Al-Taan O
Alagaratnam S
Alam I
Alberts J
Alderson D
Aldridge RC
Aleksandrov D
Ali A
Ali H
Ali S
Amin K
Andrew DR
Appleton S
Arhi C
Aroori S
Aryal K
Asad M
Aseem R
Asprou FM
Awan A
Ayaani S
Babu BI
Bajwa DS
Bajwa FM
Baker AL
Balfe P
Ball WR
Ballance L
Banwell V
Barnes J
Barrie J
Bartlett F
Basheer M
Bashir G
Basra M
Basu S
Basynat P
Batt J
Bausbacher H
Bawa S
Beasley WD
Belding R
Bellini MI
Bennett SP
Bhandari S
Bhattacharya V
Blane C
Blazeby JM
Boal M
Boddy A
Boland M
Bond-Smith G
Booth MI
Borowski DW
Bowen L
Boyce KM
Boyd AT
Bradley D
Bradley N
Brennan S
Brett D
Broadhurst J
Brooke E
Brown TH
Bryant A
Buchan J
Bullen N
Bunnell C
Bunting DM
Burke P
Byrne JP
Byrnes CK
Campbell W
Carney K
Carney K
Carroll P
Carter CR
Carter NC
Carty N
Cassidy JT
Chambers A
Chambers A
Chambers A
Chan WM
Chang J
Chauhan N
Cheung M
Chishti IA
Chitre V
Choy A
Cieplucha K
Clements JM
Coats M
Cockbain AJ
Coe PO
Collaborative WMR
Connelly M
Cooke F
Courtney MJ
Cowley JB
Cross KS
Cuming T
Cunha P
D'Costa C
Dalgaty F
Dar F
Date R
Dave RV
Davey P
Davies J
Dawson S
De Marchi JA
de Siqueira J
Deguara J
Delisle TG
Dennison AR
Derbyshire L
Devoto L
Dhillon A
Diamond T
Dickson-Lowe R
Digney R
Dindyal S
Dobbins B
Doe M
Dorrian E
Doulias T
Downing J
Downing J
Drake B
Driscoll PJ
Duke D
Durkin D
Dwerryhouse SJ
Ebdewi H
Ebied H
Eisawi A
El-Dhuwaib Y
El-Hasani SS
Elkheir N
Elmasry M
Elsayed M
Elshaer M
Emeshi S
Farag SF
Farooq A
Farouk M
Fasih T
Fawole AS
Fenwick S
Ferguson G
Ferris P
Finch D
Finch JG
Finlay I
Fleming K
Fletcher T
Fordham IJ
Forouzanfar A
Forrest CR
Forster L
Fothergill L
Frame RJ
Francombe J
Frank L
Fusai G
Galanopoulos G
Gallagher PV
Garcea G
Gardner-Thorpe J
Geoghegan JG
Geogloma I
Gerakopoulos S
Ghareeb E
Ghazanfar MA
Gibson S
Giles M
Giovinazzo F
Goh YL
Gopalswamy S
Goscimski A
Gossage JA
Gough M
Gould L
Gould L
Grant AJ
Gravante G
Griffiths EA
Grp CS
Gull S
Gunasekera RT
Gurung K
Guthrie GJK
Hafiz S
Halkias C
Hall C
Hamady Z
Hamdan A
Hamdan MF
Hamilton E
Hancorn K
Haque M
Hardy TJ
Hargreaves A
Harilingam ACM
Harris A
Harrison E
Hawkins H
Heath J
Hebbar M
Henderson LT
Henley N
Heshaishi M
Hewes J
Hewes JC
Heywood N
Higgs SM
Hill ADK
Hill ADK
Hindley C
Hirst NA
Hisham E
Ho W-M
Hoban JR
Hodgkins KA
Holroyd DJ
Hooks G
Hopwood B
Hornby ST
Hornsby J
Hosie KB
Hossack MR
Hossain T
Hoti E
Hou D
Hrycaiczuk A
Hughes M
Hunter DI
Hussain A
Hussain A
Hussain AA
Hussain N
Iftikhar SY
Iqbal LGN
Iskandar E
Issa E
Ivanovski I
Jackson A
Jackson C
Jafferbhoy SF
Jambulingam P
Jamdar S
Jamel S
Janeczko A
Jaunoo S
Jayanthi NVG
Jelley C
Jennings NA
Joel A
Johnpulle MA
Johnston D
Johnstone M
Johnstone M
Joji N
Jones C
Jones E
Jones GH
Jones MA
Jones MJ
Jones RP
Jones SM
Kadhim A
Kadirkamanathan S
Kallaway C
Kanakala V
Kanavati O
Kansal N
Kaptanis S
Karavadra B
Karim L
Karunakaran P
Kaur P
Kausar A
Kelly ME
Kendal C
Kennedy D
Kenny R
Keogh K
Khalil AM
Khan A
Khan MA
Khan MAS
Khan RB
Khan S
Khan S
Khawaja A
Khawaja Z
Khogali E
Kimble A
Kinghorn A
Kirkham AJ
Knight B
Knight K
Kourkulos M
Krysztopik R
Kumar P
Kynaston J
Kyriakidis D
Lalla K
Lammy S
Lane R
Larkin D
Law J
Lawther R
Lazim T
Lee K
Lee M
Leeder P
Lennon H
Lewis M
Lim HCC
Lim PJ
Lindsay D
Lloyd D
Lo C
Lodhia J
Longbotham DA
Loughlin P
Lovett B
Luhmann A
Luther A
Lyons EM
Macano CAW
MacArtney M
Macdonald A
Madanipour S
Madbak K
Madurska M
Magee CJ
Maguire D
Maharaj G
Majeed A
Malcolm C
Malde D
Malik NS
Mallik M
Mansour S
Manu M
Manzelli A
Marriott P
Martin J
Martin S
Martin ST
Marudanayagam R
Mason D
Mathur P
Mawhinney A
Mbuvi J
McAree B
McCain S
McCallum IJD
McClarey A
McCune K
McDermott F
McEntee GP
McGlone ER
McGuigan A
McIlmunn C
Mckay D
McKenzie S
McNair AGK
Mealy K
Menzies D
Mercer SJ
Milburn J
Mirza AK
Mirza D
Miu V
Mockford KA
Mohamed S
Mohan HM
Mok J
Monk D
Morcous P
Morrison TEM
Mothe BS
Mowbray N
Mozolowski KL
Mukherjee D
Murphy JO
Murtaza G
Nally DM
Napetti S
Nassar AH
Naumann D
Nedujchelyn Y
Needham PJ
Newton K
Newton RC
Ngu WS
Nicholson GA
Nicholson J
Nicholson JA
Nicol L
Nicolay CR
Nieto T
Nijjar RS
Nilsen-Nunn A
Nilsson F
Noble F
Nofal E
Noor N
Nunes Q
O'Dwyer P
O'Neill MA
O'Neill S
O'Reilly DA
O'Shea KM
O'Sullivan AW
Obeidallah MR
Ogedegbe A
Old OJ
Orchard P
Orizu MN
Packer JR
Padwick R
Pannu A
Panteleimonitis S
Parappally CP
Parmar J
Pasquali S
Patel A
Patel K
Patel S
Pati P
Pearce B
Pearson KL
Peirce C
Pellen M
Peter MB
Peterson M
Pezzuto R
Photi E
Phull M
Pinho-Gomes AC
Pollock S
Popplewell M
Pore N
Porter J
Prasad AR
Prasad R
Preston SR
Preziosi G
Prince S
Psica A
Puig S
Puntis DJ
Puri Y
Qandeel H
Rabie M
Radwan R
Rae DM
Rahman S
Rajaganeshan R
Rajendran I
Ramage MI
Rao STV
Rate AJ
Raza S
Read E
Reddy M
Reece-Bolton O
Reed J
Reid A
Rengifo C
Rhys T
Richards CH
Riera M
Ritchie J
Robb W
Roberts GP
Robinson S
Robinson S
Robinson SJ
Rodriguez D
Rodriguez DU
Roebuck A
Rogers PN
Rolph RC
Roy S
Rutherford CL
Sadat MM
Sakai N
Sampat K
Samuel N
Sarmah PB
Sarvananthan K
Sarveswaran J
Satheesan S
Saunders M
Sayegh M
Segaran A
Sekhar H
Sen G
Seymour K
Shabo W
Shah N
Shah R
Shahin Y
Sheel ARG
Shenoy H
Shetty V
Shier D
Shingler GM
Shiwani MH
Shrestha AK
Siddiqi N
Siddiqui MN
Siddiqui Z
Sillo TO
Silva M
Simpson DJ
Singh S
Singh-Ranger D
Sinha S
Skelly BL
Slavin JP
Slawinski C
Sloane J
Smith SR
Smith SR
Spearman J
Spence G
Spreadborough P
Spreadborough P
Starmer BZ
Stevenson TEJ
Stewart DJ
Stirland E
Subramanium D
Sufi PA
Sukha A
Sutcliffe R
Suttie S
Sutton CMLR
Szentpali K
Tang C-B
Tanner N
Tate S
Tayeh S
Taylor C
Taylor GW
Teasdale RL
Templeton A
Terrace J
Thakkar R
Thomas G
Thompson RLE
Tilston MP
Toh SKC
Tolofari S
Tomlinson MA
Trevatt AEJ
Trotter M
Tsang A
Tsavellas G
Tuck L
Turner P
Tutton MG
Ul Haque S
Ullah MF
Upchurch EA
Urbonas T
Varghase J
Vass DG
Vaughan EM
Vijay V
Vijayan D
Vijayanand D
Villatoro E
Vitone LJ
Vohra RS
Wa K
Wadley M
Wainwright D
Wajed S
Wallace T
Wardle S
Warren C
Watkin H
Watson AJM
Watson NF
Wayman J
Weaver S
Weber B
Weber B
Weerasinghe C
Welbourn H
Wheatley K
White TJ
Wijetunga I
Wild JRL
Wilkinson MD
Williams SV
Williamson TK
Willson PD
Wilson MSJ
Wilson TR
Winter DC
Winter H
Wood CPJ
Wood P
Wood S
Woodman T
Yahya S
Yao C
Yeldham G
Yoganathan T
Youssef H
Zafar S
Zafrani Z
Zarsadias P
Zeeshan S
Zhang C
Ziprin P
Publication venue: 'Wiley'
Publication date: 01/11/2016
Field of study

Background The aim was to describe the management of benign gallbladder disease and identify characteristics associated with all‐cause 30‐day readmissions and complications in a prospective population‐based cohort. Methods Data were collected on consecutive patients undergoing cholecystectomy in acute UK and Irish hospitals between 1 March and 1 May 2014. Potential explanatory variables influencing all‐cause 30‐day readmissions and complications were analysed by means of multilevel, multivariable logistic regression modelling using a two‐level hierarchical structure with patients (level 1) nested within hospitals (level 2). Results Data were collected on 8909 patients undergoing cholecystectomy from 167 hospitals. Some 1451 cholecystectomies (16·3 per cent) were performed as an emergency, 4165 (46·8 per cent) as elective operations, and 3293 patients (37·0 per cent) had had at least one previous emergency admission, but had surgery on a delayed basis. The readmission and complication rates at 30 days were 7·1 per cent (633 of 8909) and 10·8 per cent (962 of 8909) respectively. Both readmissions and complications were independently associated with increasing ASA fitness grade, duration of surgery, and increasing numbers of emergency admissions with gallbladder disease before cholecystectomy. No identifiable hospital characteristics were linked to readmissions and complications. Conclusion Readmissions and complications following cholecystectomy are common and associated with patient and disease characteristics

Functional SNP allele discovery (fSNPd): an approach to find highly penetrant, environmental-triggered genotypes underlying complex human phenotypes

Author: C. Geoffrey Woods
David Menon
EL Kwak
G Gibson
JL Haines
Kaitlin Stouffer
MD DH
Michael Lee
Michael Nahorski
MW Foster
Nivedita Sarveswaran
Pablo Moreno
PM Visscher
RA Wilke
S Sawcer
TR Prezant
VM Ingram
WH Chung
WS Noble
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study