
    Data mining: a tool for detecting cyclical disturbances in supply networks.

    Disturbances in supply chains may be either exogenous or endogenous. The ability to automatically detect, diagnose, and distinguish between the causes of disturbances is of prime importance to decision makers seeking to reduce uncertainty. The spectral principal component analysis (SPCA) technique has previously been used to distinguish between real and rogue disturbances in a steel supply network. The data set used was collected from four different business units in the network and consists of 43 variables, each described by 72 data points. The present paper uses the same data set to test an alternative approach to SPCA for detecting the disturbances. The new approach employs statistical data pre-processing, clustering, and classification learning techniques to analyse the supply network data. In particular, the incremental k-means clustering and RULES-6 classification rule-learning algorithms, developed by the present authors’ team, were applied to identify important patterns in the data set. Results show that the proposed approach can automatically detect and characterize network-wide cyclical disturbances and generate hypotheses about their root cause.
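    The RULES-6 implementation is not publicly packaged, so the following is only a minimal sketch of the pre-processing and clustering stages in Python, with scikit-learn's MiniBatchKMeans standing in for the authors' incremental k-means; the 72-by-43 data shape mirrors the abstract, but the synthetic data, cluster count, and all names are placeholder assumptions.

```python
# Sketch of the pre-processing and clustering stages only; MiniBatchKMeans
# stands in for the authors' incremental k-means, and the data is synthetic.
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import MiniBatchKMeans

rng = np.random.default_rng(0)
X = rng.normal(size=(72, 43))          # 72 data points x 43 supply-network variables

X_std = StandardScaler().fit_transform(X)     # statistical pre-processing step

km = MiniBatchKMeans(n_clusters=3, n_init=10, random_state=0)
labels = km.fit_predict(X_std)         # group data points with similar disturbance patterns

# Each centroid summarizes one candidate disturbance pattern; a rule learner
# such as RULES-6 would then induce rules that characterize the clusters.
print(labels)
```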

    Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification

    Separating multiple music sources from a single-channel mixture is a challenging problem. We present a new approach to this problem based on non-negative matrix factorization (NMF) and note classification, assuming that the instruments used to play the sound signals are known a priori. The spectrogram of the mixture signal is first decomposed into building components (musical notes) using an NMF algorithm. The Mel frequency cepstrum coefficients (MFCCs) of both the decomposed components and the signals in the training dataset are extracted. The mean squared errors (MSEs) between the MFCC features of each decomposed component and those of the training signals are used as similarity measures for the decomposed music notes. Each note is then labelled with the corresponding instrument type by the K-nearest neighbors (K-NN) classification algorithm based on the MSEs. Finally, the source signals are reconstructed from the classified notes and the weighting matrices obtained from the NMF algorithm. Simulations are provided to show the performance of the proposed system. © 2011 Springer-Verlag Berlin Heidelberg.
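    As a hedged sketch of the pipeline shape (not the authors' implementation), the fragment below decomposes a toy mixture spectrogram with scikit-learn's NMF, summarizes each component with librosa MFCCs, and labels components with a K-NN classifier; the two-tone signal, component count, and the placeholder training set are all assumptions.

```python
# Sketch of the decompose -> describe -> classify pipeline; librosa and
# scikit-learn stand in for the paper's implementation.
import numpy as np
import librosa
from sklearn.decomposition import NMF
from sklearn.neighbors import KNeighborsClassifier

sr = 22050
t = np.linspace(0, 2.0, 2 * sr, endpoint=False)
y = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 587 * t)  # toy two-note mixture

S = np.abs(librosa.stft(y))              # magnitude spectrogram of the mixture
nmf = NMF(n_components=4, init="nndsvda", max_iter=500)
W = nmf.fit_transform(S)                 # spectral bases, one per note component
H = nmf.components_                      # time activations (weighting matrix)

def mfcc_mean(spec, sr):
    """Mean MFCC vector of one component's reconstructed spectrogram."""
    mel = librosa.feature.melspectrogram(S=spec**2, sr=sr)
    return librosa.feature.mfcc(S=librosa.power_to_db(mel), sr=sr).mean(axis=1)

comp_feats = [mfcc_mean(np.outer(W[:, k], H[k]), sr) for k in range(W.shape[1])]

# Placeholder training set: in practice these would be MFCC vectors of
# isolated notes from the known instruments, computed exactly as above.
rng = np.random.default_rng(0)
train_feats = rng.normal(size=(40, 20))
train_labels = np.repeat(["piano", "violin"], 20)

knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(train_feats, train_labels)
pred = knn.predict(comp_feats)           # instrument label per note component

# Reconstruct each source by summing the spectrograms of its notes
sources = {lab: sum(np.outer(W[:, k], H[k]) for k in range(len(pred)) if pred[k] == lab)
           for lab in set(pred)}
```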

    MCMC implementation for Bayesian hidden semi-Markov models with illustrative applications

    Copyright © Springer 2013. The final publication is available at Springer via http://dx.doi.org/10.1007/s11222-013-9399-z. Hidden Markov models (HMMs) are flexible, well-established models useful in a diverse range of applications. However, one potential limitation of such models lies in their inability to explicitly structure the holding times of each hidden state. Hidden semi-Markov models (HSMMs) are more useful in the latter respect, as they incorporate additional temporal structure by explicitly modelling the holding times. However, HSMMs have generally received less attention in the literature, mainly due to their intensive computational requirements. Here a Bayesian implementation of HSMMs is presented. Recursive algorithms are proposed in conjunction with Metropolis-Hastings in such a way as to avoid sampling from the distribution of the hidden state sequence in the MCMC sampler. This provides a computationally tractable estimation framework for HSMMs, avoiding the limitations associated with the conventional EM algorithm regarding model flexibility. Performance of the proposed implementation is demonstrated through simulation experiments as well as an illustrative application relating to recurrent failures in a network of underground water pipes, where random effects are also included in the HSMM to allow for pipe heterogeneity.
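    To make the central trick concrete, here is a minimal sketch for a plain two-state Gaussian HMM rather than an HSMM: the forward recursion marginalises out the hidden state sequence, so Metropolis-Hastings only needs to propose model parameters. The HSMM recursions in the paper additionally sum over explicit holding-time distributions; the toy data, fixed transition matrix, and flat prior below are all illustrative assumptions.

```python
# Minimal sketch for a plain HMM: the forward recursion integrates out the
# hidden states, so MH proposes only parameters (here, the two state means).
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)
y = np.concatenate([rng.normal(0, 1, 50), rng.normal(3, 1, 50)])  # toy series

A = np.array([[0.95, 0.05], [0.05, 0.95]])   # fixed transition matrix (assumption)
pi0 = np.array([0.5, 0.5])

def log_marginal_likelihood(mu):
    """Normalized forward recursion; no hidden state sequence is sampled."""
    logp = 0.0
    alpha = pi0 * norm.pdf(y[0], mu, 1.0)
    for t in range(1, len(y)):
        c = alpha.sum()
        logp += np.log(c)
        alpha = (alpha / c @ A) * norm.pdf(y[t], mu, 1.0)
    return logp + np.log(alpha.sum())

# Random-walk Metropolis-Hastings over the state means, under a flat prior
mu = np.array([0.0, 1.0])
cur = log_marginal_likelihood(mu)
for it in range(2000):
    prop = mu + rng.normal(0, 0.2, size=2)
    lp = log_marginal_likelihood(prop)
    if np.log(rng.uniform()) < lp - cur:
        mu, cur = prop, lp
print(mu)   # should approach the true means (0, 3), up to label switching
```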

    Automated detection of regions of interest for tissue microarray experiments: an image texture analysis

    BACKGROUND: Recent research with tissue microarrays has led to rapid progress toward quantifying the expression of large sets of biomarkers in normal and diseased tissue. However, standard procedures for sampling tissue for molecular profiling have not yet been established. METHODS: This study presents a high-throughput analysis of texture heterogeneity on breast tissue images for the purpose of identifying regions of interest in the tissue for molecular profiling via tissue microarray technology. Image texture of breast histology slides was described in terms of three parameters: the percentage of the area in an image block occupied by chromatin (B), the percentage occupied by stroma-like regions (P), and a statistical heterogeneity index (H) commonly used in image analysis. Texture parameters were defined and computed for each of the thousands of image blocks in our dataset using both gray-scale and color segmentation. The image blocks were then classified into three categories using the texture feature parameters in a novel statistical learning algorithm: image blocks specific to normal breast tissue, blocks specific to cancerous tissue, and image blocks non-specific to the normal and disease states. RESULTS: Gray-scale and color segmentation techniques led to identification of the same regions in histology slides as cancer-specific. Moreover, the image blocks identified as cancer-specific belonged to the cell-crowded regions in whole-section image slides that were marked by two pathologists as regions of interest for further histological studies. CONCLUSION: These results indicate the high efficiency of our automated method for identifying pathologic regions of interest on histology slides. Automation of critical region identification will help minimize inter-rater variability among different raters (pathologists), as the hundreds of tumors used to develop an array have typically been evaluated (graded) by different pathologists. The region-of-interest information gathered from the whole-section images will guide the excision of tissue for constructing tissue microarrays and for high-throughput profiling of global gene expression.
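    As a hedged illustration of the block-wise feature extraction, the sketch below computes a chromatin-like fraction, a stroma-like fraction, and a Shannon-entropy heterogeneity index per image block; the thresholds, block size, and entropy definition are illustrative assumptions, not the paper's exact B, P, and H.

```python
# Hedged sketch of block-wise texture features on a grayscale slide image;
# thresholds and the heterogeneity index are illustrative, not the paper's.
import numpy as np

def block_features(block, dark_thr=0.35, light_thr=0.75):
    """block: 2-D array of gray levels in [0, 1]."""
    B = np.mean(block < dark_thr)       # fraction of chromatin-like (dark) pixels
    P = np.mean(block > light_thr)      # fraction of stroma-like (light) pixels
    hist, _ = np.histogram(block, bins=16, range=(0, 1))
    p = hist / hist.sum()
    p = p[p > 0]
    H = -(p * np.log(p)).sum()          # Shannon entropy as a heterogeneity index
    return B, P, H

rng = np.random.default_rng(0)
image = rng.random((512, 512))          # placeholder for a digitized slide
bs = 64                                 # block size (assumption)
feats = [block_features(image[i:i + bs, j:j + bs])
         for i in range(0, 512, bs) for j in range(0, 512, bs)]
# feats would feed a 3-class learner: normal-specific / cancer-specific / non-specific
```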

    Gene selection for classification of microarray data based on the Bayes error

    Background: With DNA microarray data, selecting a compact subset of discriminative genes from thousands of genes is a critical step for accurate classification of phenotypes, e.g., for disease diagnosis. Several widely used gene selection methods select the top-ranked genes according to their individual discriminative power in classifying samples into distinct categories, without considering correlations among genes. A limitation of these methods is that they may produce gene sets with some redundancy and yield an unnecessarily large number of candidate genes for classification analyses. Some recent studies show that incorporating gene-to-gene correlations into gene selection can remove redundant genes and improve classification accuracy. Results: In this study, we propose a new method, Based Bayes error Filter (BBF), to select relevant genes and remove redundant genes in classification analyses of microarray data. The effectiveness and accuracy of this method are demonstrated through analyses of five publicly available microarray datasets. The results show that our gene selection method achieves better accuracies than previous studies, while effectively selecting relevant genes, removing redundant genes, and obtaining efficient and small gene sets for sample classification purposes. Conclusion: The proposed method can effectively identify a compact set of genes with high classification accuracy. This study also indicates that application of the Bayes error is a feasible and effective way of removing redundant genes in gene selection.
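    The BBF criterion itself is not reproduced here; as a hedged sketch of the general relevance-plus-redundancy idea, the fragment below scores each gene with a Gaussian Bhattacharyya bound on the two-class Bayes error and then greedily drops genes that correlate strongly with an already-selected gene. The score, correlation threshold, and toy data are all assumptions.

```python
# Sketch of a relevance-plus-redundancy gene filter; the exact BBF
# criterion from the paper is not reproduced.
import numpy as np

def bhattacharyya(x0, x1):
    """Gaussian Bhattacharyya distance; larger means lower Bayes-error bound."""
    m0, m1 = x0.mean(), x1.mean()
    v0, v1 = x0.var() + 1e-12, x1.var() + 1e-12
    v = (v0 + v1) / 2
    return (m0 - m1) ** 2 / (8 * v) + 0.5 * np.log(v / np.sqrt(v0 * v1))

def select_genes(X, ylab, n_genes=20, corr_thr=0.8):
    """X: samples x genes expression matrix; ylab: binary class labels."""
    scores = np.array([bhattacharyya(X[ylab == 0, g], X[ylab == 1, g])
                       for g in range(X.shape[1])])
    chosen = []
    for g in np.argsort(scores)[::-1]:      # most separable genes first
        if all(abs(np.corrcoef(X[:, g], X[:, c])[0, 1]) < corr_thr for c in chosen):
            chosen.append(g)                # keep only non-redundant genes
        if len(chosen) == n_genes:
            break
    return chosen

rng = np.random.default_rng(0)
X = rng.normal(size=(60, 500))
ylab = np.repeat([0, 1], 30)                # toy two-class microarray data
X[ylab == 1, :5] += 2.0                     # make the first 5 genes informative
print(select_genes(X, ylab, n_genes=5))
```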

    Pattern Recognition Based Speed Forecasting Methodology for Urban Traffic Network

    A full methodology of short-term traffic prediction is proposed for urban road traffic networks via Artificial Neural Networks (ANNs). The goal of the forecasting is to provide speed estimates 5, 15, and 30 min ahead. Unlike similar research results in this field, the investigated method aims to predict traffic speed for signalized urban road links rather than for highways or arterial roads. The methodology contains an efficient feature selection algorithm to determine the appropriate input parameters required for neural network training. As another contribution of the paper, built-in handling of incomplete data is provided, as input data (originating from traffic sensors or Floating Car Data (FCD)) might be absent or biased in practice. This input data handling therefore ensures robust operation of the speed forecasting even when data are missing. The proposed algorithm is trained, tested, and analysed in a test network built up in a microscopic traffic simulator using the daily course of real-world traffic.
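    As a hedged sketch of the overall shape (not the paper's network or features), the fragment below chains a missing-value imputation step with a small neural network that maps the last few 5-minute link speeds to the speed 5 minutes ahead; the synthetic daily cycle, lag count, and layer size are placeholder assumptions.

```python
# Sketch: imputation for missing sensor/FCD values feeding a small neural
# network for 5-minute-ahead speed forecasting; all settings are placeholders.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
t = np.arange(2000)
speed = 40 + 10 * np.sin(2 * np.pi * t / 288) + rng.normal(0, 2, t.size)  # toy daily cycle

lags = 6                                      # last 30 min of 5-min samples
X = np.column_stack([speed[i:i - lags] for i in range(lags)])
y = speed[lags:]                              # 5-minute-ahead target
X[rng.random(X.shape) < 0.05] = np.nan        # simulate missing detector data

model = make_pipeline(SimpleImputer(strategy="mean"),
                      StandardScaler(),
                      MLPRegressor(hidden_layer_sizes=(32,), max_iter=1000,
                                   random_state=0))
model.fit(X[:1500], y[:1500])
print(model.score(X[1500:], y[1500:]))        # R^2 on held-out samples
```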

    Differentiation of Gram-Negative Bacterial Aerosol Exposure Using Detected Markers in Bronchial-Alveolar Lavage Fluid

    The identification of biosignatures of aerosol exposure to pathogens has the potential to provide useful diagnostic information. In particular, exposure to different types of respiratory pathogens may yield diverse sets of markers that can be used to differentiate exposure. We examine a mouse model of aerosol exposure to the known Gram-negative bacterial pathogens Francisella tularensis novicida and Pseudomonas aeruginosa. Mice were subjected to either a pathogen or control exposure, and bronchial alveolar lavage fluid (BALF) was collected at four and twenty-four hours post exposure. Small protein and peptide markers within the BALF were detected by matrix-assisted laser desorption/ionization (MALDI) mass spectrometry (MS) and analyzed using both exploratory and predictive data analysis methods: principal component analysis and degree of association. The detected markers were successfully used to accurately distinguish the four-hour exposed samples from the control samples. This report demonstrates the potential for small protein and peptide marker profiles to identify aerosol exposure in a short post-exposure time frame.
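    For the exploratory step only, here is a hedged sketch of PCA applied to a samples-by-peaks intensity matrix of the kind produced by MALDI-MS profiling; the synthetic data and the fraction of shifted markers are assumptions, and the paper's degree-of-association analysis is not reproduced.

```python
# Sketch of the exploratory PCA step on synthetic MALDI-MS peak intensities
# (samples x m/z features); not the paper's data or full analysis.
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
control = rng.lognormal(size=(10, 200))
exposed = rng.lognormal(size=(10, 200))
exposed[:, :15] *= 3.0                  # a subset of markers shifts with exposure

X = np.vstack([control, exposed])
scores = PCA(n_components=2).fit_transform(StandardScaler().fit_transform(X))
# Plotting the first two components typically separates exposed samples
# from controls when discriminating markers are present.
print(scores[:3])
```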

    Scuba: Scalable kernel-based gene prioritization

    Background: The uncovering of genes linked to human diseases is a pressing challenge in molecular biology and precision medicine. This task is often hindered by the large number of candidate genes and by the heterogeneity of the available information. Computational methods for the prioritization of candidate genes can help to cope with these problems. In particular, kernel-based methods are a powerful resource for the integration of heterogeneous biological knowledge; however, their practical implementation is often precluded by their limited scalability. Results: We propose Scuba, a scalable kernel-based method for gene prioritization. It implements a novel multiple kernel learning approach, based on a semi-supervised perspective and on the optimization of the margin distribution. Scuba is optimized to cope with strongly unbalanced settings where known disease genes are few and large-scale predictions are required. Importantly, it is able to deal efficiently both with a large number of candidate genes and with an arbitrary number of data sources. As a direct consequence of its scalability, Scuba also integrates a new efficient strategy to select optimal kernel parameters for each data source. We performed cross-validation experiments and simulated a realistic usage setting, showing that Scuba outperforms a wide range of state-of-the-art methods. Conclusions: Scuba achieves state-of-the-art performance and has enhanced scalability compared to existing kernel-based approaches for genomic data. This method can be useful for prioritizing candidate genes, particularly when their number is large or when the input data are highly heterogeneous. The code is freely available at https://github.com/gzampieri/Scuba.
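    As a hedged sketch of the kernel-integration idea only, the fragment below combines two synthetic data sources as a weighted sum of kernel matrices and ranks every candidate gene by its combined-kernel similarity to a handful of known disease genes; Scuba's margin-distribution optimization and kernel-weight learning are not reproduced, and the weights, seed genes, and data are assumptions.

```python
# Sketch of multi-source kernel integration for gene prioritization;
# Scuba's actual learning procedure is not reproduced here.
import numpy as np
from sklearn.metrics.pairwise import rbf_kernel

rng = np.random.default_rng(0)
n = 300                                  # candidate genes
expr = rng.normal(size=(n, 50))          # data source 1: e.g. expression profiles
ppi = rng.normal(size=(n, 30))           # data source 2: e.g. network embeddings

kernels = [rbf_kernel(expr), rbf_kernel(ppi)]
w = np.array([0.6, 0.4])                 # fixed weights; Scuba learns these
K = sum(wi * Ki for wi, Ki in zip(w, kernels))

seeds = [0, 1, 2]                        # indices of known disease genes (few)
score = K[:, seeds].mean(axis=1)         # combined-kernel similarity to the seeds
ranking = np.argsort(score)[::-1]        # prioritized candidate list
print(ranking[:10])
```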

    Early structural and functional defects in synapses and myelinated axons in stratum lacunosum moleculare in two preclinical models for tauopathy

    The stratum lacunosum moleculare (SLM) is the connection hub between the entorhinal cortex (ERC) and the hippocampus, two of the brain regions most vulnerable in Alzheimer’s disease. We recently identified a specific synaptic deficit of Nectin-3 in transgenic models for tauopathy. Here we defined cognitive impairment and electrophysiological problems in the SLM of Tau.P301L mice, which corroborated the structural defects in synapses and dendritic spines. Reduced diffusion of DiI from the ERC to the hippocampus indicated defective myelinated axonal pathways. Ultrastructurally, myelinated axons in the temporoammonic pathway (TA) that connects the ERC to CA1 were damaged in Tau.P301L mice at a young age. Unexpectedly, the myelin defects were even more severe in bigenic biGT mice that co-express GSK3β with Tau.P301L in neurons. Combined, our data demonstrate that neuronal expression of protein Tau profoundly affected the functional and structural organization of the entorhinal-hippocampal complex, in particular the synapses and myelinated axons in the SLM. White matter pathology deserves further attention in patients suffering from tauopathy and Alzheimer’s disease.