30 research outputs found

Analyzing First-Person Stories Based on Socializing, Eating and Sedentary Patterns

Author: A Cartas
A Natekin
A Torralba
AR Doherty
BC Russell
CJ Burges
E Talavera
F Pedregosa
M Bolanos
M Dimiccoli
N Srivastava
O Kramer
O Russakovsky
Publication date: 25/07/2017

First-person stories can be analyzed by means of egocentric pictures acquired throughout the whole active day with wearable cameras. This manuscript presents an egocentric dataset with more than 45,000 pictures from four people in different environments such as working or studying. All the images were manually labeled to identify three patterns of interest regarding people's lifestyle: socializing, eating and sedentary. Additionally, two different approaches are proposed to classify egocentric images into one of the 12 target categories defined to characterize these three patterns. The approaches are based on machine learning and deep learning techniques, including traditional classifiers and state-of-the-art convolutional neural networks. The experimental results obtained when applying these methods to the egocentric dataset demonstrated their adequacy for the problem at hand.

Comment: Accepted at First International Workshop on Social Signal Processing and Beyond, 19th International Conference on Image Analysis and Processing (ICIAP), September 201

arXiv.org e-Print Archive

Crossref

Deep Convolutional Neural Networks for Breast Cancer Histology Image Analysis

Author: A Natekin
A Tiulpin
AC Ruifrok
BE Bejnordi
CW Elston
JG Elmore
JS Meyer
RL Siegel
S Robertson
T Araújo
T Ching
Y Guo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/04/2018

Breast cancer is one of the main causes of cancer death worldwide. Early diagnosis significantly increases the chances of correct treatment and survival, but this process is tedious and often leads to disagreement between pathologists. Computer-aided diagnosis systems have shown potential for improving diagnostic accuracy. In this work, we develop a computational approach based on deep convolutional neural networks for breast cancer histology image classification. The hematoxylin and eosin stained breast histology microscopy image dataset is provided as part of the ICIAR 2018 Grand Challenge on Breast Cancer Histology Images. Our approach utilizes several deep neural network architectures and a gradient boosted trees classifier. For the 4-class classification task, we report 87.2% accuracy. For the 2-class classification task to detect carcinomas, we report 93.8% accuracy, AUC 97.3%, and sensitivity/specificity 96.5/88.0% at the high-sensitivity operating point. To our knowledge, this approach outperforms other common methods in automated histopathological image classification. The source code for our approach is made publicly available at https://github.com/alexander-rakhlin/ICIAR2018

Comment: 8 pages, 4 figures

arXiv.org e-Print Archive

Crossref

Forecasting Player Behavioral Data and Simulating in-Game Events

Author: A Natekin
AJ Fox
C Bauckhage
Colin Chen
DH Ackley
G Ridgeway
G Schwarz
G Zhang
GE Box
GE Hinton
H Akaike
JG Cragg
JG Gooijer De
JH Friedman
KD Lawrence
L Deng
L Dwyer
M Gilliland
M Längkvist
MS El-Nasr
N Srivastava
NE Breslow
PH Eilers
PJ Brockwell
RJ Hyndman
S Asmussen
S Hochreiter
S Makridakis
SN Wood
T Hastie
T Zhang
TJ Hastie
Y Bengio
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/10/2017

Understanding player behavior is fundamental in game data science. Video games evolve as players interact with the game, so being able to foresee player experience would help to ensure successful game development. In particular, game developers need to evaluate beforehand the impact of in-game events. Simulation optimization of these events is crucial to increase player engagement and maximize monetization. We present an experimental analysis of several methods to forecast game-related variables, with two main aims: to obtain accurate predictions of in-app purchases and playtime in an operational production environment, and to perform simulations of in-game events in order to maximize sales and playtime. Our ultimate purpose is to take a step towards the data-driven development of games. The results suggest that, even though the performance of traditional approaches such as ARIMA is still better, the outcomes of state-of-the-art techniques like deep learning are promising. Deep learning emerges as a well-suited general model that could be used to forecast a variety of time series with different dynamic behaviors.

arXiv.org e-Print Archive

Crossref

Predicting physical properties of woven fabrics via automated machine learning and textile design and finishing features

Author: A Natekin
D Cook
GO Campos
HE Eltayib
J Fan
J Han
J Hu
JA Nelder
K Gibert
L Breiman
P Cortez
P Domingos
P Geurts
P Yildirim
PH Yap
R Beltran
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020

This paper presents a novel Machine Learning (ML) approach to support the creation of woven fabrics. Using data from a textile company, two CRoss-Industry Standard Process for Data Mining (CRISP-DM) iterations were executed, aiming to compare three input feature representation strategies related to fabric design and finishing processes. During the modeling stage of CRISP-DM, an Automated ML (AutoML) procedure was used to select the best regression model among six distinct state-of-the-art ML algorithms. A total of nine textile physical properties were modeled (e.g., abrasion, elasticity, pilling). Overall, the simpler yarn representation strategy obtained better predictive results. Moreover, for eight fabric properties (e.g., elasticity, pilling) the addition of finishing features improved the quality of the predictions. The best ML models obtained low predictive errors (from 2% to 7%) and are potentially valuable for the textile company, since they can be used to reduce the number of production attempts (saving time and costs).

This work was carried out within the project “TexBoost: less Commodities more Specialities”, reference POCI-01-0247-FEDER-024523, co-funded by Fundo Europeu de Desenvolvimento Regional (FEDER), through Portugal 2020 (P2020).

Universidade do Minho: RepositoriUM

Crossref

Machine learning–XGBoost analysis of language networks to classify patients with epilepsy

Author: A Alvarez
A Ardila
A Besga
A Möller
A Natekin
A Sharan
A Thiel
BC Munsell
D Badre
D Tamayo
DF Abbott
E Cousin
F Pedregosa
G Josse
G Ojemann
GC Cawley
H Wieser
J Friedman
J Hernández-Orallo
J Mbwana
J Springer
JA Wada
JH Friedman
JR Binder
JR Booth
KJ Friston
KK Dijkstra
L Thivard
LR Rosenberger
LY Fan
M Baciu
M Bertolotti
M Perrone-Bertolotti
M Ries
MM Bahn
MM Berl
QS Xu
RE Goldmann
RL Billingsley
S Noachtar
S Raschka
SD Spritzer
T Gazit
T Kaufmann
T Nowotny
VR Steele
W Dubitzky
WD Gaillard
Z Han
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Using gradient boosting regression to improve ambient solar wind model predictions

Author: Allen R. C.
Altschuler M. D.
Arden W. M.
Arge C. N.
Bloomfield D. S.
Bussy‐Virat C. D.
Camporeale E.
Chandorkar M.
Chen T.
Devos A.
Dewey R. M.
Feng X.
Friedman J. H.
Hanssen A.
Hastie T.
Henney C. J.
Hickmann K. S.
Hinterreiter J.
Jian L. K.
Kohutova P.
Lee C.
Linker J. A.
Liu D. D.
Luhmann J. G.
MacNeice P.
Mays M. L.
Merkin V. G.
Nakagawa Y.
Natekin A.
Odstrcil D.
Owens M.
Owens M. J.
Pomoell J.
Reiss M. A.
Riley P.
Schatten K. H.
Scolini C.
Shen F.
Taktakishvili A.
Temmer M.
Tóth G.
Verbanac G.
Verbeke C.
Wang Y.‐M.
Wilks D. S.
Wintoft P.
Wold A. M.
Worden J.
Yang Y.
Zhang J.
Publication venue: 'American Geophysical Union (AGU)'
Publication date: 23/03/2021

Studying the ambient solar wind, a continuous pressure‐driven plasma flow emanating from our Sun, is an important component of space weather research. The ambient solar wind flows in interplanetary space determine how solar storms evolve through the heliosphere before reaching Earth, and, especially during solar minimum, are themselves a driver of activity in the Earth’s magnetic field. Accurately forecasting the ambient solar wind flow is therefore imperative to space weather awareness. Here we present a machine learning approach in which solutions from magnetic models of the solar corona are used to output the solar wind conditions near the Earth. The results are compared to observations and existing models in a comprehensive validation analysis, and the new model outperforms existing models in almost all measures. In addition, this approach offers a new perspective from which to discuss the role of different input data in ambient solar wind modeling, and what this tells us about the underlying physical processes. The final model discussed here represents an extremely fast, well‐validated and open‐source approach to the forecasting of ambient solar wind at Earth.

arXiv.org e-Print Archive

Central Archive at the University of Reading

Crossref