Search CORE

5 research outputs found

μHEM for identification of differentially expressed miRNAs using hypercuboid equivalence partition matrix

Author: A Budhu
A Keller
A Keller
A Schaefer
AD McIver
B Efron
BS Taylor
C Ambroise
C Blenkiron
C Clape
C Ding
C Wang
CW Tseng
D Slezak
D Slezak
FJ Ortega
H Hirata
H Zhao
J Fang
J Guo
J Lu
JJ Valdes
JM Wei
JR Quinlan
KP Porkka
L Wang
M Hart
M Ozen
M Pesta
M Raponi
M Zhu
MG Schrauder
MV Iorio
P Buelmann
P Maji
P Maji
P Maji
P Maji
P Maji
P Maji
P Maji
PM Pereira
Pradipta Maji
Q Jiang
R Tibshirani
R Xu
S Arora
S Li
S Nasser
S Paul
S Paul
S Paul
S Volinia
Sushmita Paul
T Hastie
TR Golub
U Lehmann
U Ralfkiaer
Vapnik V
Y Chen
Y Sylvestre
Z Pawlak
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Rule Extraction on Numeric Datasets Using Hyper-rectangles

Author: de Giusti Armando Eduardo
Hasperué Waldo
Lanzarini Laura Cristina
Publication venue: Canadian Center of Science and Education
Publication date: 01/01/2012
Field of study

When there is a need to understand the data stored in a database, one of the main requirements is being able to extract knowledge in the form of rules. Classification strategies allow extracting rules almost naturally. In this paper, a new classification strategy is presented that uses hyper-rectangles as data descriptors to achieve a model that allows extracting knowledge in the form of classification rules. The participation of an expert for training the model is discussed. Finally, the results obtained using the databases from the UCI repository are presented and compared with other existing classification models, showing that the algorithm presented requires less computational resources and achieves the same accuracy level and number of extracted rules.Fil: Hasperué, Waldo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Facultad de Informática. Instituto de Investigación en Informática Lidi; ArgentinaFil: Lanzarini, Laura Cristina. Universidad Nacional de La Plata. Facultad de Informática. Instituto de Investigación en Informática Lidi; ArgentinaFil: de Giusti, Armando Eduardo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Facultad de Informática. Instituto de Investigación en Informática Lidi; Argentin

CiteSeerX

CONICET Digital

Healthcare data mining from multi-source data

Author: Chen Ling
Publication venue: 'University of Queensland Library'
Publication date: 28/02/2017
Field of study

The "big data" challenge is changing the way we acquire, store, analyse, and draw conclusions from data. How we effectively and efficiently "mine" the data from possibly multiple sources and extract useful information is a critical question. Increasing research attention has been drawn to healthcare data mining, with an ultimate goal to improve the quality of care. The human body is complex and so too the data collected in treating it. Data noise that is often introduced via the collection process makes building Data Mining models a challenging task. This thesis focuses on the classification tasks of mining healthcare data, with the goal of improving the effectiveness of health risk prediction. In particular, we developed algorithms to address issues identified from real healthcare data, such as feature extraction, heterogeneity, label uncertainty, and large unlabeled data. The three main contributions of this research are as follows. First, we developed a new health index called Personal Health Index (PHI) that scores a person's health status based on the examination records of a given population. Second, we identified the key characteristics of the real datasets and issues that were associated with the data. Third, we developed classification algorithms to cope with those issues, particularly, the label uncertainty and large unlabeled data issues. This research takes one step forward towards scoring personal health based on mining increasingly large health records. Particularly, it pioneers exploring the mining of GHE data and tackles the associated challenges. It is our anticipation that in the near future, more robust data-mining-based health scoring systems will be available for healthcare professionals to understand people's health status and thus improve the quality of care

University of Queensland eSpace