Search CORE

14 research outputs found

Statistical learning in complex and temporal data: distances, two-sample testing, clustering, classification and Big Data

Author: Montero Manso Pablo
Publication venue
Publication date: 01/01/2019
Field of study

Programa Oficial de Doutoramento en Estatística e Investigación Operativa. 555V01[Resumo] Esta tesis trata sobre aprendizaxe estatístico en obxetos complexos, con énfase en series temporais. O problema abórdase introducindo coñecemento sobre o dominio do fenómeno subxacente, mediante distancias e características. Proponse un contraste de dúas mostras basado en distancias e estúdase o seu funcionamento nun gran abanico de escenarios. As distancias para clasificación e clustering de series temporais acadan un incremento da potencia estatística cando se aplican a contrastes de dúas mostras. O noso test compárase de xeito favorable con outros métodos gracias á súa flexibilidade ante diferentes alternativas. Defínese unha nova distancia entre series temporais mediante un xeito innovador de comparar as distribucións retardadas das series. Esta distancia herda o bo funcionamento empírico doutros métodos pero elimina algunhas das súas limitacións. Proponse un método de predicción baseada en características das series. O método combina diferentes algoritmos estándar de predicción mediante unha suma ponderada. Os pesos desta suma veñen dun modelo que se axusta a un conxunto de entrenamento de gran tamaño. Propónse un método de clasificación distribuida, baseado en comparar, mediante unha distancia, as funcións de distribución empíricas do conxuto de proba común e as dos datos que recibe cada nodo de cómputo.[Resumen] Esta tesis trata sobre aprendizaje estadístico en objetos complejos, con énfasis en series temporales. El problema se aborda introduciendo conocimiento del dominio del fenómeno subyacente, mediante distancias y características. Se propone un test de dos muestras basado en distancias y se estudia su funcionamiento en un gran abanico de escenarios. La distancias para clasificación y clustering de series temporales consiguen un incremento de la potencia estadística cuando se aplican al tests de dos muestras. Nuestro test se compara favorablemente con otros métodos gracias a su flexibilidad antes diferentes alternativas. Se define una nueva distancia entre series temporales mediante una manera innovadora de comparar las distribuciones retardadas de la series. Esta distancia hereda el buen funcionamiento empírico de otros métodos pero elimina algunas de sus limitaciones. Se propone un método de predicción basado en características de las series. El método combina diferentes algoritmos estándar de predicción mediante una suma ponderada. Los pesos de esta suma salen de un modelo que se ajusta a un conjunto de entrenamiento de gran tamaño. Se propone un método de clasificación distribuida, basado en comparar, mediante una distancia, las funciones de distribución empírica del conjuto de prueba común y las de los datos que recibe cada nodo de cómputo.[Abstract] This thesis deals with the problem of statistical learning in complex objects, with emphasis on time series data. The problem is approached by facilitating the introduction of domain knoweldge of the underlying phenomena by means of distances and features. A distance-based two sample test is proposed, and its performance is studied under a wide range of scenarios. Distances for time series classification and clustering are also shown to increase statistical power when applied to two-sample testing. Our test compares favorably to other methods regarding its flexibility against different alternatives. A new distance for time series is defined by considering an innovative way of comparing lagged distributions of the series. This distance inherits the good empirical performance of existing methods while removing some of their limitations. A forecast method based on times series features is proposed. The method works by combining individual standard forecasting algorithms using a weighted average. These weights come from a learning model fitted on a large training set. A distributed classification algorithm is proposed, based on comparing, using a distance, the empirical distribution functions between the dataset that each computing node receives and the test set

Repositorio da Universidade da Coruña

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Augmented Judgments: The affordances of artificial intelligence in improving accuracy of new product launch decisions.

Author: Montero-Manso Pablo
Vazquez-Hernandez Carlos
Publication venue: The University of Sydney Business School
Publication date: 26/04/2023
Field of study

The uncertainty behind new product in market makes judging its success a complex endeavour. The extant literature does not accurately explain whether with the help of an artificial intelligence (AI) such uncertainty can be managed. In our paper, we aim at measuring to which extent new product success judgments improve when information provided by an artificial intelligence model is present. We conducted three pilot experiments to measure the effects of different amounts of information given by the artificial intelligence. In the first experiment, participants are presented with the AI’s predicted probability of success. In the second experiment, participants are presented with AI’s probability of success coupled with an explanation how the AI reached its predictions based on variables of the product. In the third experiment, we measured participants improvement on their own judgments after (not while!) being exposed to the information provided by AI. We use new wine products as a context for the experiments. Ground-truth for success is based on a large database of historical product launches. Participants were recruited via a panel, and exposed to new product launch scenarios in an online service. We found that the predicted judgments are significantly improved (p-value: 0.011) when AI information is provided. We also found that participants significantly improved (p-value:0.05) after receiving the AI stimulus. However, we did not find strong evidence that exposing participants to an explanation is better than exposing them to just a probability of success. With our pilot experiments, we also identified required samples sizes and modifications in the experimental design to increase statistical power. Our findings contribute with empirical evidence on the affordances of AI in improving new product success judgments, and the effect of applying novel AI explainability techniques in real-world users. Further, our findings pave the way for further experimentation in human-AI interaction for augmenting new product judgments

Sydney eScholarship

Iberoamerican Reviews

Author: Andreu Miralles Xavier
Aranha Patricia
Barruso Barés Pedro
Büschges Christian
Chacón Nelson
Faust-Scalisi Mario
Fernández Redondo Iñaki
Galindo Pérez Miguel Adolfo
Gil Mariño Cecilia Nuria
Gil Montero Raquel
Giuliana Virginie
González Abellás Miguel
Helgueta Manso Javier
Hernández García Mariano
Herráez Cubino Guillermo
Hevia Jordán Evelyn
Larrinaga Carlos
Loaiza Bejarano Juan José
Pancorbo Fernando J.
Phaf-Rheinberger Ineke
Plötz Jochen
Prutsch Ursula
Rodríguez-Rodríguez Ana M.
Rojas López Estefanía
Rojas Pablo
Ruderer Stephan
Soler Lizbeth
Tribaldos Milla Jaime
Vilches-De Frutos Francisca
Von Tschilschke Christian
Zubiaga Arana Erik
Publication venue: Ibero-Amerikanisches Institut Preußischer Kulturbesitz (Berlin)
Publication date: 22/11/2022
Field of study

Reseñas iberoamericanasIberoamerican Review

Revistas del Instituto Ibero-Americano (IAI), Berlín (Open Access)

Assessment of regional differences of electrical load profiles

Author: Astigarraga Leira
Borges Hernandez Cruz Enrique
Casado-Mansilla Diego
Merveille Chris
Montero-Manso Pablo
Pflugradt Noah
Quesada Carlos
Publication venue
Publication date: 01/01/2021
Field of study

Juelich Shared Electronic Resources

FFORMA: Feature-based forecast model averaging

Author: Bates
Chen
Clemen
Engle
George Athanasopoulos
Hyndman
Kang
Kück
Lemke
Pablo Montero-Manso
Prudêncio
Rob J. Hyndman
Smith
Thiyanga S. Talagala
Timmermann
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Distributed classification based on distances between probability distributions in feature space

Author: Amparo Alonso-Betanzos
Bekkerman
Bello-Orgaz
Breiman
Chan
Chen
Daumé
Dick
Dietterich
Du Plessis
Grigorios
Huang
José A. Vilar
Kittler
Laura Morán-Fernández
Lazarevic
Liaw
Pablo Montero-Manso
Peteiro-Barral
Quionero-Candela
Rodríguez
Sokolova
Székely
Székely
Tsoumakas
Upadhyaya
Venables
Verónica Bolón-Canedo
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Comparison and Evaluation of Methods for a Predict+Optimize Problem in Renewable Energy

Author: Abolghasemi Mahdi
Bean Richard
Bergmeir Christoph
Betts John
Bui Quang
de Nijs Frits
Dinh Nam Trong
Einecke Nils
Esmaeilbeigi Rasul
Ferraro Scott
Galketiya Priya
Genov Evgenii
Glasgow Robert
Godahewa Rakshitha
Kang Yanfei
Kumar Yogesh Pipada Sunil
Limmer Steffen
Magdalena Luis
Montero-Manso Pablo
Peralta Daniel
Rosales-Pérez Alejandro
Ruddick Julian
Sriramulu Abishek
Stratigakos Akylas
Stuckey Peter
Tack Guido
Triguero Isaac
Yuan Rui
Publication venue: HAL CCSD
Publication date: 26/05/2023
Field of study

Algorithms that involve both forecasting and optimization are at the core of solutions to many difficult real-world problems, such as in supply chains (inventory optimization), traffic, and in the transition towards carbon-free energy generation in battery/load/production scheduling in sustainable energy systems. Typically, in these scenarios we want to solve an optimization problem that depends on unknown future values, which therefore need to be forecast. As both forecasting and optimization are difficult problems in their own right, relatively few research has been done in this area. This paper presents the findings of the ``IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling," held in 2021. We present a comparison and evaluation of the seven highest-ranked solutions in the competition, to provide researchers with a benchmark problem and to establish the state of the art for this benchmark, with the aim to foster and facilitate research in this area. The competition used data from the Monash Microgrid, as well as weather data and energy market data. It then focused on two main challenges: forecasting renewable energy production and demand, and obtaining an optimal schedule for the activities (lectures) and on-site batteries that lead to the lowest cost of energy. The most accurate forecasts were obtained by gradient-boosted tree and random forest models, and optimization was mostly performed using mixed integer linear and quadratic programming. The winning method predicted different scenarios and optimized over all scenarios jointly using a sample average approximation method

HAL-MINES ParisTech

European Covid-19 Forecast Hub

Author: Ajitesh Srivastava
Alessio Farcomeni
Alexander Kuhlmann
Alexander Ullrich
Andrea Kraus
Aniruddha Adiga
Anna Gambin
Antonello Maruotti
Antoni Moszynski
Arne Rodloff
Barbara Pabjan
Barbara Tarantino
Bartosz Krupa
Bastian Prasse
Benjamin Bejar
Benjamin Hurt
Berit Lange
Bertsimas Dimitris
Borja Reina
Bradley Suchoski
Bryan Lewis
Cesar Perez Alvarez
Clara Prats
Daniel Lopez
Daniel Sheldon
Daniel Wolffram
David E Singh
David Kraus
David Morina
Dorina Thanou
Ekaterina Krymova
Enric Alvarez
Evan L Ray
Ewa Szczurek
Fabio Divino
Filip Dreger
Francesco Bartolucci
Franciszek Rakowski
Frank Sandman
Fulvia Pennoni
Gianfranco Lovison
Giovanna Jona Lasinio
Giovanni Ardenghi
Giovanni Ziarelli
Graham Gibson
Grzegorz Redlarski
Guillaume Obozinski
Heidi Gurung
Helen Johnson
Hugo Gruson
Inmaculada Villanueva
Isti Rodiah
Jakub Zielinski
Jan Fuhrmann
Jan Kisielewski
Jan Mohring
Jan Pablo Burgard
Jan Trnka
Janez Zibert
Jannik Deuschel
Jaroslaw Wlazlo
Jedrzej Nowosielski
Johanna Schneider
Johannes Bracher
Jonas Dehning
Jose L Aznarte
Jozef Budzinski
Karol Niedzielewski
Katharine Sherratt
Kirsten Holger
Krzysztof Gogolewski
Lenka Pribylova
Lijing Wang
Loic Pottier
Maciej Filinski
Maciej Radwan
Madhav Marathe
Magdalena Gruziel-Slomka
Marcin Bodych
Marcin Semeniuk
Marco Mingione
Maria Vittoria Barbarossa
Markus Scholz
Marti Catala
Martin Smid
Michael Lingzhi Li
Miguel Guzman-Merino
Milan Zajicek
Neele Leithauser
Nicholas G Reich
Nicola Parolini
Nikos I Bosse
Nutcha Wattanachit
Pablo Montero-Manso
Paolo Giudici
Pierfrancesco Alaimo Di Loro
Prasith Baccam
Przemyslaw Porebski
Radoslaw Idzikowski
Rafal Bartczuk
Rene Niehus
Robert Walraven
Sam Abbott
Sebastian Funk
Sebastian Mohr
Sergio Alonso
Soni Saksham
Sophie Meakin
Srinivasan Venkatramanan
Stefan Heyder
Steven Stage
Tao Sun
Thomas Hotz
Tom Zimmermann
Tomasz Ozanski
Tyll Krueger
Veronika Eclerova
Viola Priesemann
Vit Tucek
Wolfgang Bock
Yijin Wang
Yuri Kheifetz
Publication venue
Publication date: 01/01/2022
Field of study

European Covid-19 Forecast Hub

Archivio istituzionale della ricerca - Politecnico di Milano

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Predictive performance of multi-model ensemble forecasts of COVID-19 across European nations

Author: Abbott Sam
Adiga Aniruddha
Alaimo Di Loro Pierfrancesco
Alonso Sergio
Ardenghi Giovanni
Aznarte Jose L
Baccam Prasith
Barbarossa Maria Vittoria
Bartczuk Rafal P
Bartolucci Francesco
Bejar Benjamin
Biecek Przemyslaw
Bock Wolfgang
Bodych Marcin
Bosse Nikos I
Bracher Johannes
Budzinski Jozef
Burgard Jan Pablo
Castro Lauren
Català Marti
Dehning Jonas
Deuschel Jannik
Dimitris Bertsimas
Divino Fabio
Dreger Filip
Eclerová Veronika
Fairchild Geoffrey
Farcomeni Alessio
Filinski Maciej
Fuhrmann Jan
Funk Sebastian
Gambin Anna
Gibson Graham
Giudici Paolo
Gogolewski Krzysztof
Grah Rok
Gruson Hugo
Gruziel-Słomka Magdalena
Gurung Heidi
Guzman-Merino Miguel
Heyder Stefan
Hotz Thomas
Hurt Benjamin
Idzikowski Radoslaw
Johnson Helen
Jona Lasinio Giovanna
Kheifetz Yuri
Kirsten Holger
Kisielewski Jan
Kraus Andrea
Kraus David
Krueger Tyll
Krupa Bartosz
Krymova Ekaterina
Kuhlmann Alexander
Lange Berit
Leithäuser Neele
Lewis Bryan
Lovison Gianfranco
López Daniel
Marathe Madhav
Maruotti Antonello
Meakin Sophie R
Meinke Jan H
Michael Lingzhi Li
Michaud Isaac
Mingione Marco
Mohr Sebastian
Mohring Jan
Montero-Manso Pablo
Moriña David
Moszyński Antoni
Niedzielewski Karol
Niehus Rene
Nowosielski Jedrzej
Obozinski Guillaume
Osthus Dave
Ozanski Tomasz
Pabjan Barbara
Parolini Nicola
Pennoni Fulvia
Porebski Przemyslaw
Pottier Loic
Prasse Bastian
Prats Clara
Pribylova Lenka
Priesemann Viola
Pérez Álvarez Cesar
Radwan Maciej
Rakowski Franciszek
Ray Evan L
Redlarski Grzegorz
Reich Nicholas G
Reina Borja
Rodiah Isti
Rodloff Arne
Saksham Soni
Sandmann Frank
Schneider Johanna
Scholz Markus
Semeniuk Marcin
Sheldon Daniel
Sherratt Katharine
Singh David E
Smid Martin
Srivastava Ajitesh
Stage Steven
Suchoski Bradley
Sun Tao
Szczurek Ewa
Tarantino Barbara
Thanou Dorina
Trnka Jan
Tucek Vit
Ullrich Alexander
Venkatramanan Srinivasan
Villanueva Inmaculada
Walraven Robert
Wang Lijing
Wang Yijin
Wattanachit Nutcha
Wolffram Daniel
Włazło Jaroslaw
Zajíček Milan
Ziarelli Giovanni
Zibert Janez
Zieliński Jakub
Zimmermann Tom
Álvarez Enric
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/01/2022
Field of study

Università degli Studi del Molise: IRIS