Search CORE

6 research outputs found

An insight into imbalanced Big Data classification: outcomes and challenges

Author: A Fernández
A Fernández
A Thusoo
B Krawczyk
C Bunkhumpornpat
CP Chen
D Lyubimov
E Elsebakhi
E Ramentol
F Hu
F Hu
G Haixiang
GEAPA Batista
GM Weiss
H He
H Yu
I Triguero
I Triguero
J Alcalá-Fdez
J Dean
J Huang
J Li
JA Sáez
JM Tomczak
K Kambatla
L Rokach
M Galar
M Galar
M Wasikowski
NV Chawla
NV Chawla
PC Zikopoulos
R Baeza-Yates
R Barandela
R Blagus
RC Prati
S Alshomrani
S Barua
S Elhag
S Kamal
S Owen
S Río
S Río
S-H Park
T Jo
T White
V García
V López
V López
V López
X Meng
X Wu
Y Guo
Y Sun
Y-S Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Big Data applications are emerging during the last years, and researchers from many disciplines are aware of the high advantages related to the knowledge extraction from this type of problem. However, traditional learning approaches cannot be directly applied due to scalability issues. To overcome this issue, the MapReduce framework has arisen as a “de facto” solution. Basically, it carries out a “divide-and-conquer” distributed procedure in a fault-tolerant way to adapt for commodity hardware. Being still a recent discipline, few research has been conducted on imbalanced classification for Big Data. The reasons behind this are mainly the difficulties in adapting standard techniques to the MapReduce programming style. Additionally, inner problems of imbalanced data, namely lack of data and small disjuncts, are accentuated during the data partitioning to fit the MapReduce programming style. This paper is designed under three main pillars. First, to present the first outcomes for imbalanced classification in Big Data problems, introducing the current research state of this area. Second, to analyze the behavior of standard pre-processing techniques in this particular framework. Finally, taking into account the experimental results obtained throughout this work, we will carry out a discussion on the challenges and future directions for the topic.This work has been partially supported by the Spanish Ministry of Science and Technology under Projects TIN2014-57251-P and TIN2015-68454-R, the Andalusian Research Plan P11-TIC-7765, the Foundation BBVA Project 75/2016 BigDaPTOOLS, and the National Science Foundation (NSF) Grant IIS-1447795

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

Repositorio Institucional Universidad de Granada

A Systematic Review of Techniques and Sources of Big Data in the Healthcare Sector

Author: A Alyass
A Moskowitz
A O’Driscoll
C Tu
CC Buchanan
CL Philip Chen
DM Trifiletti
DS Wishart
E Elsebakhi
FF Costa
G Pérez
H Wang
Isabel de la Torre Díez
J Andreu-Perez
J Cunha
J Manuel
Joel J. P. C. Rodrigues
Miguel López-Coronado
MM Fouad
N Garg
N Payakachat
NM Saravana Kumar
PW Rose
S Khan
SD Young
Sofiane Hamrioui
Susel Góngora Alonso
T Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Quasi-linear score for capturing heterogeneous structure in biomarkers

Author: A Oghabian
AK Jain
BR Thompson
E Elsebakhi
H Zou
HC Bravo
IJ Goodfellow
J Friedman
J McQueen
J Naudts
JA Nelder
JHJ Ward
JJ Goeman
JW Lee
Katsuhiro Omae
KR Foster
L Yan
LJ van’t Veer
M Brimacombe
M Buyse
M Dettling
MY Park
Osamu Komori
RA Jacobs
S Boyd
S Eguchi
S Setlur
SC Madeira
Shinto Eguchi
SL Meier
T Sørie
T Yun
W Lu
WJ Youden
Y Li
Y Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce

Author: A Stateczny
C Chen
C Cheng
C Doulkeridis
C Ren
E Elsebakhi
F Yu
F Zhang
GG Wang
GG Wang
GG Wang
GG Wang
GL Jing
H Chiroma
H Chiroma
H Chiroma
H Mohamed
Hao Shi
Hongyan Cui
HS Wang
J Liu
J Yan
JF Cao
JH Zhang
Jianfang Cao
JQ Feng
Lijuan Jiao
LM Xu
ML Liu
MX Hu
NM Nawi
Q Jin
Q Zou
Q Zou
Quan Zou
RR Pan
S Scardapane
SF Ding
V Roberge
WK Jia
WS Zhu
XW Zheng
Y Hu
Y Kim
Y Liu
Y Saadi
YH Guo
YM Gao
ZH Guo
Publication venue: 'Public Library of Science (PLoS)'
Publication date
Field of study

Approaches of enhancing interoperations among high performance computing and big data analytics via augmentation

Author: A Merzky
A Pérez
Ajeet Ram Pathak
B Nicolae
B-A Yassour
C Xu
DA Reed
E Elsebakhi
EW Biederman
F Zahid
F Zhang
G Bianchini
G Mackey
G Zhao
GM Kurtzer
H Jo
HA Duran-Limon
HA Duran-Limon
I Mavridis
J Bézivin
J Prades
J Ren
J Veiga
J Wang
JL Reyes-Ortiz
JP Martin
JT Daly
M Anderson
M Asch
M De Benedictis
M Eisler
M Katevenis
M Wasi-ur-Rahman
M Welsh
Manjusha Pandey
MW Rahman
O Yildiz
P Xuan
Q Liu
R Gad
S Soltesz
Siddharth S. Rautaray
SW Son
T White
V Costan
V Medel
W Bhimji
X Zhang
Y You
Z Fadika
Z Kozhirbayev
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

System-Wide Pollution of Biomedical Data: Consequence of the Search for Hub Genes of Hepatocellular Carcinoma Without Spatiotemporal Consideration

Author: A Bah
A Bommert
A Giuliani
A Polo
A Polo
A Polo
A Regev
A Sharma
A Sharma
A-L Barabási
A-L Barabási
AM Schnoes
AS Holehouse
B Palsson
B Rong
B Snel
B-J Breitkreutz
BC Bernhardt
C Lewis
C Stark
C Zhang
CH Lee
CT Bergstrom
D Huang
D Lazer
D Salas
D Szklarczyk
D-Y Wen
E Elsebakhi
E Guerriero
EF Civillico
EV Poverennaya
F Meng
F Murray-Zmijewski
H Chen
H Jeong
H Xuo
H Yan
HU Buhl
I Martincorena
J Clark
J Kyte
J Zhang
JC Navarro-Muñoz
K-I Goh
KL Simpson
L Li
L Liang
L Sang
L Venkataramana
L Wang
L Zhou
L Zhu
M Di Stasio
M Fondi
M Necci
M-R Yang
MJE Ardakani
MR Boland
MS Househ
MS Szalay
N Potenza
N Shea
NK Gale
O Güell
P Chen
P Hanus
P Lin
P Shannon
P Sorokowski
PY Wu
R Albert
R Kemp
R Kohavi
R Lokers
R Zhang
RR Vallabhajosyula
S Costantini
S Costantini
S Guariniello
S Maslov
S Singh
SA Billings
SD Ghiassian
SK Parr
T Hase
T van Mierlo
T Xing
T Yamada
V Marcel
V Spirin
VN Uversky
W Chen
W Lou
W Xu
WQ Hu
X Zhu
Y Cheng
Y Yang
Y Zheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

core

core