Search CORE

4 research outputs found

A Framework for Classifying Web Attacks While Respecting ML Requirements

Author: A Khraisat
A Michael
C Yin
F Pedregosa
GG Sundarkumar
N Farnaaz
P Probst
R Vinayakumar
TM Cover
TT Wong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/04/2020
Field of study

International audienceInjection and Cross Site Scripting attacks are among the ten critical security risks to web-based applications. It is difficult, to provide a complete signature for firewalls that detect such attacks. Therefore, there are several proposals based on Machine Learning (ML) methods capable of detecting various web attacks from evolutive, heterogeneous data at large scale, without the need for expert knowledge. Unfortunately, web attacks detection have been addressed only from a ML algorithm viewpoint, there is a lack of clarity regarding the quality and amount of the training data, the hyperparameters tuning and the evaluation method. Low and poor data quality may compromise the success of the most powerful ML methods. Additionally, it is easy to build a model that is perfectly adapted to the dataset but unable to generalize the new unseen data. This paper introduces F2MW, a framework for multi-classifying web attacks with respect to the ML requirements

Client churn prediction with call log analysis

Author: A Keramati
AM Almana
B Yee Liau
C-P Wei
GG Sundarkumar
GS Linoff
HP Luhn
JW Pennebaker
K Coussement
K Ravi
K Sparck Jones
MAH Farquad
OG Ali
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

© Springer International Publishing AG, part of Springer Nature 2018. Client churn prediction is a classic business problem of retaining customers. Recently, machine learning algorithms have been applied to predict client churn and have shown promising performance comparing to traditional methods. Despite of its success, existing machine learning approach mainly focus on structured data such as demographic and transactional data, while unstructured data, such as emails and phone calls, have been largely overlooked. In this work, we propose to improve existing churn prediction models by analysing customer characteristics and behaviours from unstructured data, particularly, audio calls. To be specific, we developed a text mining model combined with gradient boosting tree to predict client churn. We collected and conducted extensive experiments on 900 thousand audio calls from 200 thousand customers, and experimental results show that our approach can significantly improve the previous model by exploiting the additional unstructured data

Crossref

OPUS - University of Technology Sydney

Road accident prediction and model interpretation using a hybrid K-means and random forest algorithm approach

Author: A Baru
A Persson
AA Yahya
AD Laytin
AJP Tixier
C Lee
D Deme
DMW Powers
F Asefa
G Vinodhini
GG Hordofa
GG Sundarkumar
J Balogun
J Xiao
JA Hartigan
JK Kim
K Haleem
L Breiman
L Wahab
M Alikhani
M Bedard
M Seid
MI Sameen
MT Habib
N Casado-Sanz
N Lee
OH Kwon
QA Al-Radaideh
S Ansari
S Kumar
S Sarkar
S Seid
SK Singh
SS Zajac
T Abegaz
W Gissane
W Odero
WH Chen
X Gu
Y Abebe
Y Castro
Z Regassa
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Health stages diagnostics of underwater thruster using sound features with imbalanced dataset

Author: A Cheveigné de
A Nayal
A Samat
B Lei
C Seiffert
Cheng Siong Chin
CHF Santos Dos
E Alexandre
E Omerdic
G Douzas
G Haixiang
G-B Huang
GA Susto
GG Sundarkumar
Guang-bin Huang
H Guo
H He
I Nekooeimehr
I Nekooeimehr
J Kowalski
J Lee
J Salamon
J Yu
J-H Shin
M Galar
N Wang
N-Y Liang
NV Chawla
P Xia
Q Fan
S Barua
Teck Kai Chan
WWY Ng
Y Cui
Y Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref