Search CORE

45 research outputs found

Random forests with random projections of the output space for high dimensional multi-label classification

Author: D. Achlioptas
D. Kocev
E.J. Candes
F. Pedregosa
G. Madjarov
G. Tsoumakas
G. Tsoumakas
J. Read
J.L. Faulon
L. Breiman
P. Geurts
W.B. Johnson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

We adapt the idea of random projections applied to the output space, so as to enhance tree-based ensemble methods in the context of multi-label classification. We show how learning time complexity can be reduced without affecting computational complexity and accuracy of predictions. We also show that random output space projections may be used in order to reach different bias-variance tradeoffs, over a broad panel of benchmark problems, and that this may lead to improved accuracy while reducing significantly the computational burden of the learning stage

arXiv.org e-Print Archive

Crossref

Open Repository and Bibliography - Liège

On Aggregation in Ensembles of Multilabel Classifiers

Author: C Shi
C Shi
D Kocev
G Madjarov
G Tsoumakas
G Tsoumakas
J Read
J Read
JM Moyano
JR Quinlan
K Dembczyński
L Breiman
ML Zhang
N Li
SK Murthy
TG Dietterich
W Waegeman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/06/2020
Field of study

While a variety of ensemble methods for multilabel classification have been proposed in the literature, the question of how to aggregate the predictions of the individual members of the ensemble has received little attention so far. In this paper, we introduce a formal framework of ensemble multilabel classification, in which we distinguish two principal approaches: "predict then combine" (PTC), where the ensemble members first make loss minimizing predictions which are subsequently combined, and "combine then predict" (CTP), which first aggregates information such as marginal label probabilities from the individual ensemble members, and then derives a prediction from this aggregation. While both approaches generalize voting techniques commonly used for multilabel ensembles, they allow to explicitly take the target performance measure into account. Therefore, concrete instantiations of CTP and PTC can be tailored to concrete loss functions. Experimentally, we show that standard voting techniques are indeed outperformed by suitable instantiations of CTP and PTC, and provide some evidence that CTP performs well for decomposable loss functions, whereas PTC is the better choice for non-decomposable losses.Comment: 14 pages, 2 figure

arXiv.org e-Print Archive

Crossref

Ontology of core data mining entities

Author: A Bernstein
A Golbraikh
A Karalic
B Smith
B Smith
B Smith
C Silla
C Vens
D Demšar
D Kocev
D Kocev
D Qi
D Young
DJ Hand
F Serban
G Madjarov
G Tsoumakas
GH Bakir
H Mannila
HP Kriegel
I Slavkov
J Vanschoren
K Button
Larisa Soldatova
LN Soldatova
M Courtot
M Ford
M Žáková
MA Avery
MA Avery
MF López
O Spjuth
P Robinson
Panče Panov
Q Yang
R Caruana
R Guha
R Guha
RD King
RD King
RR Brinkman
Sašo Džeroski
T Dietterich
V Podpečan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/07/2014
Field of study

In this article, we present OntoDM-core, an ontology of core data mining entities. OntoDM-core defines themost essential datamining entities in a three-layered ontological structure comprising of a specification, an implementation and an application layer. It provides a representational framework for the description of mining structured data, and in addition provides taxonomies of datasets, data mining tasks, generalizations, data mining algorithms and constraints, based on the type of data. OntoDM-core is designed to support a wide range of applications/use cases, such as semantic annotation of data mining algorithms, datasets and results; annotation of QSAR studies in the context of drug discovery investigations; and disambiguation of terms in text mining. The ontology has been thoroughly assessed following the practices in ontology engineering, is fully interoperable with many domain resources and is easy to extend

Crossref

Brunel University Research Archive

Binary relevance efficacy for multilabel classification

Author: C Bielza
G Madjarov
G Tsoumakas
G Tsoumakas
J Read
JR Quevedo
ML Zhang
R Schapire
W Cheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Timed written picture naming in 14 European languages

Author: Alves RA
Arfe B
Chanquoy L
Chukharev-Hudilainen E
Dimakos I
Fidalgo R
Hyona J
Johannesson OI
Madjarov G
Nottbusch G
Pauly DN
Torrance M
Uppstad PH
van Waes L
Vernon M
Wengelin A
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/10/2022
Field of study

We describe the Multilanguage Written Picture Naming Dataset. This gives trial-level data and time and agreement norms for written naming of the 260 pictures of everyday objects that compose the colorized Snodgrass and Vanderwart picture set (Rossion & Pourtois in Perception, 33, 217-236, 2004). Adult participants gave keyboarded responses in their first language under controlled experimental conditions (N = 1,274, with subsamples responding in Bulgarian, Dutch, English, Finnish, French, German, Greek, Icelandic, Italian, Norwegian, Portuguese, Russian, Spanish, and Swedish). We measured the time to initiate a response (RT) and interkeypress intervals, and calculated measures of name and spelling agreement. There was a tendency across all languages for quicker RTs to pictures with higher familiarity, image agreement, and name frequency, and with higher name agreement. Effects of spelling agreement and effects on output rates after writing onset were present in some, but not all, languages. Written naming therefore shows name retrieval effects that are similar to those found in speech, but our findings suggest the need for cross-language comparisons as we seek to understand the orthographic retrieval and/or assembly processes that are specific to written output

UTUPub

Multi-label classification via multi-target regression on data streams

Author: A Bifet
A Shaker
Aljaž Osojnik
C Largeron
C Vens
E Gibaja
E Ikonomovska
E Ikonomovska
E Ikonomovska
ES Xioufis
G Madjarov
G Tsoumakas
I Triguero
J Demšar
J Fürnkranz
J Gama
J Read
J Read
J Read
L Rutkowski
M Friedman
Panče Panov
Sašo Džeroski
W Cheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Common Variants in the COL4A4 Gene Confer Susceptibility to Lattice Degeneration of the Retina

Author: A Av-shalom
A Meguro
A Oka
AK Lancaster
Akira Meguro
Akira Oka
B Madjarov
B Tazón Vega
BG Hudson
BR Staatsma
C Pescucci
Eiichi Okada
F Murakami
G Tamiya
HE Collins
Hidenao Ideta
Hidetoshi Inoko
I Longo
J Hui
JC Barrett
Junichi Yonemoto
K Sato
K Sato
K Yatsu
M Buzza
M Kawashima
M Kawashima
M Slajpah
Masaki Takeuchi
Masao Ota
NE Byer
NE Byer
Nobuhisa Mizuki
Norihiko Ito
O Gross
PW Hedrick
Riyo Uemoto
RY Foos
Ryuichi Ideta
S Shaikh
Struan Frederick Airth Grant
Tadayuki Nishide
Tatsukata Kawagoe
Tomoko Shiota
Y Michikawa
Yasuhito Iijima
Yuta Hagihara
Publication venue: Public Library of Science
Publication date: 19/06/2012
Field of study

Lattice degeneration of the retina is a vitreoretinal disorder characterized by a visible fundus lesion predisposing the patient to retinal tears and detachment. The etiology of this degeneration is still uncertain, but it is likely that both genetic and environmental factors play important roles in its development. To identify genetic susceptibility regions for lattice degeneration of the retina, we performed a genome-wide association study (GWAS) using a dense panel of 23,465 microsatellite markers covering the entire human genome. This GWAS in a Japanese cohort (294 patients with lattice degeneration and 294 controls) led to the identification of one microsatellite locus, D2S0276i, in the collagen type IV alpha 4 (COL4A4) gene on chromosome 2q36.3. To validate the significance of this observation, we evaluated the D2S0276i region in the GWAS cohort and in an independent Japanese cohort (280 patients and 314 controls) using D2S0276i and 47 single nucleotide polymorphisms covering the region. The strong associations were observed in D2S0276i and rs7558081 in the COL4A4 gene (Pc = 5.8×10−6, OR = 0.63 and Pc = 1.0×10−5, OR = 0.69 in a total of 574 patients and 608 controls, respectively). Our findings suggest that variants in the COL4A4 gene may contribute to the development of lattice degeneration of the retina

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Ceramic Microbial Fuel Cells Stack: Power generation in standard and supercapacitive mode

Author: A Baudler
A Deeke
A Dewan
A Dewan
A Kumar
A Rinaldi
AP Borole
B Erable
B Jiang
B Mecheri
BE Logan
BE Logan
BE Logan
C Borsje
C Donovan
C Donovan
C Santoro
C Santoro
C Santoro
C Santoro
C Santoro
C Santoro
C Santoro
C Santoro
C Santoro
C Santoro
C Santoro
D Leech
D Pankratov
D Pant
D Ucar
E Antolini
E Martin
F Soavi
F Zhang
F Zhang
G Lu
G Papaharalabos
G Papaharalabos
H Liu
H Ren
H Ren
H Rismani-Yazdi
H Wang
H Wang
H Wang
H Yuan
I Gajda
I Gajda
I Ieropoulos
I Ieropoulos
I Merino-Jimenez
IA Ieropoulos
IA Ieropoulos
IA Ieropoulos
J Houghton
J Madjarov
J Masa
J Wei
J-Y Nam
JD Park
JD Park
K Guo
K Kinoshita
K Kinoshita
L Birry
L Xiao
L Zhuang
M Behera
M Ghasemi
M Grattieri
M Kodali
M Lu
M Minson
M Oliot
M Rasmussen
M Santini
MA Rosenbaum
MAC Oliveira de
MJ Cooney
MN Young
MT Nguyen
Mustakeem
MY Nguyen
NS Malvankar
P Atanassov
P Choudhury
P Pandey
S Brocato
S Rojas-Carbonell
S Sevda
S Wu
SD Minteer
U Salaj-Kosla
X Xie
X Zhang
X Zhang
XA Walter
XA Walter
XA Walter
Y Hou
Y Meriah Arias-Thode
Z Ge
Z Ge
Z He
Z Wang
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

© 2018 The Author(s). In this work, a microbial fuel cell (MFC) stack containing 28 ceramic MFCs was tested in both standard and supercapacitive modes. The MFCs consisted of carbon veil anodes wrapped around the ceramic separator and air-breathing cathodes based on activated carbon catalyst pressed on a stainless steel mesh. The anodes and cathodes were connected in parallel. The electrolytes utilized had different solution conductivities ranging from 2.0 mScm-1 to 40.1 mScm-1, simulating diverse wastewaters. Polarization curves of MFCs showed a general enhancement in performance with the increase of the electrolyte solution conductivity. The maximum stationary power density was 3.2 mW (3.2 Wm-3) at 2.0 mScm-1 that increased to 10.6 mW (10.6 Wm-3) at the highest solution conductivity (40.1 mScm-1). For the first time, MFCs stack with 1 L operating volume was also tested in supercapacitive mode, where full galvanostatic discharges are presented. Also in the latter case, performance once again improved with the increase in solution conductivity. Particularly, the increase in solution conductivity decreased dramatically the ohmic resistance and therefore the time for complete discharge was elongated, with a resultant increase in power. Maximum power achieved varied between 7.6 mW (7.6 Wm-3) at 2.0 mScm-1 and 27.4 mW (27.4 Wm-3) at 40.1 mScm-1

Crossref

Southampton (e-Prints Soton)

Directory of Open Access Journals

UWE Bristol Research Repository

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

The University of Manchester - Institutional Repository

Mosaicking and enhancement of slit lamp biomicroscopic fundus images

Author: Asmuth J.
Berger J.
Madjarov B.
Sajda P.
Publication venue
Publication date: 01/05/2001
Field of study

AIMS—To process video slit lamp biomicroscopic fundus image sequences in order to generate wide field, high quality fundus image montages which might be suitable for photodocumentation. METHODS—Slit lamp biomicroscopic fundus examination was performed on human volunteers with a contact or non-contact lens. A stock, charge coupled device camera permitted image capture and storage of the image sequence at 30 frames per second. Acquisition time was approximately 30 seconds. Individual slit lamp biomicroscope fundus image frames were aligned and blended with custom developed software. RESULTS—The developed algorithms allowed for highly accurate alignment and blending of partially overlapping slit lamp biomicroscopic fundus images to generate a seamless, high quality, wide field montage. CONCLUSIONS—Video image acquisition and processing algorithms allow for mosaicking and enhancement of slit lamp biomicroscopic fundus images. The improved quality and wide field of view may confer suitability for inexpensive, real time photodocumentation of disc and macular abnormalities. 

Crossref

PubMed Central