Search CORE

57 research outputs found

General supervision via probabilistic transformations

Author: Mazuelas S.
Pérez A.
Publication venue
Publication date: 24/01/2019
Field of study

Different types of training data have led to numerous schemes for supervised classification. Current learning techniques are tailored to one specific scheme and cannot handle general ensembles of training samples. This paper presents a unifying framework for supervised classification with general ensembles of training samples, and proposes the learning methodology of generalized robust risk minimization (GRRM). The paper shows how current and novel supervision schemes can be addressed under the proposed framework by representing the relationship between examples at prediction and training via probabilistic transformations. The results show that GRRM can handle different types of training samples in a unified manner, and enable new supervision schemes that aggregate general ensembles of training samples.RYC-2016-1938

arXiv.org e-Print Archive

BCAM's Institutional Repository Data

Female Models in AI and the Fight Against COVID-19

Author: Guerrero C.
Mazuelas S.
Publication venue
Publication date: 01/11/2022
Field of study

Gender imbalance has persisted over time and is well documented in science, technology, engineering and mathematics (STEM) and singularly in artificial intelligence (AI). In this article we emphasize the importance of increasing the visibility and recognition of women researchers to attract and retain women in the AI field. We review the ratio of women in STEM and AI, its evolution through time, and the differences among disciplines. Then, we discuss the main sources of this gender imbalance highlighting the lack of female role models and the problems which may arise; such as the so called Marie Curie complex, suvivorship bias, and impostor syndrome. We also emphasize the importance of active participation of women researchers in conferences, providing statistics corresponding with the leading conferences. Finally, we give examples of several prestigious female researchers in the field and we review their research work related to COVID-19 displayed in the workshop “Artificial Intelligence for the Fight Against COVID-19” (AI4FA COVID-19), which is an example of a more balanced participation between genders.AXA Research Fund through the project “Early Prognosis of COVID-19 Infections via Machine Learning” under the Exceptional Flash Call “Mitigating risk in the wake of the COVID-19 pandemic” Basque Government through the project “Mathematical Modeling Applied to Health

BCAM's Institutional Repository Data

Deep GEM-based network for weakly supervised UWB ranging error mitigation

Author: Li Y
Mazuelas S.
Shen Y.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 30/12/2021
Field of study

Ultra-wideband (UWB)-based techniques, while becoming mainstream approaches for high-accurate positioning, tend to be challenged by ranging bias in harsh environments. The emerging learning-based methods for error mitigation have shown great performance improvement via exploiting high semantic features from raw data. However, these methods rely heavily on fully labeled data, leading to a high cost for data acquisition. We present a learning framework based on weak supervision for UWB ranging error mitigation. Specifically, we propose a deep learning method based on the generalized expectation-maximization (GEM) algorithm for robust UWB ranging error mitigation under weak supervision. Such method integrate probabilistic modeling into the deep learning scheme, and adopt weakly supervised labels as prior information. Extensive experiments in various supervision scenarios illustrate the superiority of the proposed method.Ramon y Cajal Grant RYC-2016-1938

BCAM's Institutional Repository Data

Minimax Classification with 0-1 Loss and Performance Guarantees

Author: Mazuelas S.
Pérez A.
Zanoni A.
Publication venue
Publication date: 01/12/2020
Field of study

Supervised classification techniques use training samples to find classification rules with small expected 0-1 loss. Conventional methods achieve efficient learning and out-of-sample generalization by minimizing surrogate losses over specific families of rules. This paper presents minimax risk classifiers (MRCs) that do not rely on a choice of surrogate loss and family of rules. MRCs achieve efficient learning and out-of-sample generalization by minimizing worst-case expected 0-1 loss w.r.t. uncertainty sets that are defined by linear constraints and include the true underlying distribution. In addition, MRCs’ learning stage provides performance guarantees as lower and upper tight bounds for expected 0-1 loss. We also present MRCs’ finite-sample generalization bounds in terms of training size and smallest minimax risk, and show their competitive classification performance w.r.t. state-of-the-art techniques using benchmark datasets.Ramon y Cajal Grant RYC-2016-1938

BCAM's Institutional Repository Data

Double-Weighting for Covariate Shift Adaptation

Author: Liu A.
Mazuelas S.
Segovia J.I
Publication venue
Publication date: 01/07/2023
Field of study

Supervised learning is often affected by a covariate shift in which the marginal distributions of instances (covariates

x

) of training and testing samples

p_\text{tr}(x)

and

p_\text{te}(x)

are different but the label conditionals coincide. Existing approaches address such covariate shift by either using the ratio

p_\text{te}(x)/p_\text{tr}(x)

to weight training samples (reweighted methods) or using the ratio

p_\text{tr}(x)/p_\text{te}(x)

to weight testing samples (robust methods). However, the performance of such approaches can be poor under support mismatch or when the above ratios take large values. We propose a minimax risk classification (MRC) approach for covariate shift adaptation that avoids such limitations by weighting both training and testing samples. In addition, we develop effective techniques that obtain both sets of weights and generalize the conventional kernel mean matching method. We provide novel generalization bounds for our method that show a significant increase in the effective sample size compared with reweighted methods. The proposed method also achieves enhanced classification performance in both synthetic and empirical experiments.CNS2022-135203, “Early Prognosis of COVID-19 Infections via Machine Learning” funded by the AXA Research Fun

BCAM's Institutional Repository Data

A Deep Learning Approach for Generating Soft Range Information from RF Data

Author: li Y.
Mazuelas S.
Shen Y.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 24/01/2022
Field of study

Radio frequency (RF)-based techniques are widely adopted for indoor localization despite the challenges in extracting sufficient information from measurements. Soft range information (SRI) offers a promising alternative for highly accurate localization that gives all probable range values rather than a single estimate of distance. We propose a deep learning approach to generate accurate SRI from RF measurements. In particular, the proposed approach is implemented by a network with two neural modules and conducts the generation directly from raw data. Extensive experiments on a case study with two public datasets are conducted to quantify the efficiency in different indoor localization tasks. The results show that the proposed approach can generate highly accurate SRI, and significantly outperforms conventional techniques in both nonline-of-sight (NLOS) detection and ranging error mitigation.Ramon y Cajal Grant RYC-2016-1938

BCAM's Institutional Repository Data

Generalized Maximum Entropy for Supervised Classification

Author: Mazuelas S.
Pérez A.
Shen Y.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 15/12/2021
Field of study

The maximum entropy principle advocates to evaluate events’ probabilities using a distribution that maximizes entropy among those that satisfy certain expectations’ constraints. Such principle can be generalized for arbitrary decision problems where it corresponds to minimax approaches. This paper establishes a framework for supervised classification based on the generalized maximum entropy principle that leads to minimax risk classifiers (MRCs). We develop learning techniques that determine MRCs for general entropy functions and provide performance guarantees by means of convex optimization. In addition, we describe the relationship of the presented techniques with existing classification methods, and quantify MRCs performance in comparison with the proposed bounds and conventional methods.RYC-2016-1938

arXiv.org e-Print Archive

BCAM's Institutional Repository Data

A Variational Learning Approach for Concurrent Distance Estimation and Environmental Identification

Author: Li Y
Mazuelas S.
Shen Y.
Publication venue
Publication date: 01/02/2023
Field of study

Wireless propagated signals encapsulate rich information for high-accuracy localization and environment sensing. However, the full exploitation of positional and environmental features as well as their correlation remains challenging in complex propagation environments. In this paper, we propose a methodology of variational inference over deep neural networks for concurrent distance estimation and environmental identification. The proposed approach, namely inter-instance variational auto-encoders (IIns-VAEs), conducts inference with latent variables that encapsulate information about both distance and environmental labels. A deep learning network with instance normalization is designed to approximate the inference concurrently via deep learning. We conduct extensive experiments on real-world datasets and the results show the superiority of the proposed IIns-VAE in both distance estimation and environmental identification compared to conventional approaches

BCAM's Institutional Repository Data

Efficient Learning of Minimax Risk Classifiers in High Dimensions

Author: Bondugula K.R.
Mazuelas S.
Pérez A.
Publication venue
Publication date: 01/08/2023
Field of study

High-dimensional data is common in multiple areas, such as health care and genomics, where the number of features can be tens of thousands. In such scenarios, the large number of features often leads to inefficient learning. Constraint generation methods have recently enabled efficient learning of L1-regularized support vector machines (SVMs). In this paper, we leverage such methods to obtain an efficient learning algorithm for the recently proposed minimax risk classifiers (MRCs). The proposed iterative algorithm also provides a sequence of worst-case error probabilities and performs feature selection. Experiments on multiple high-dimensional datasets show that the proposed algorithm is efficient in high-dimensional scenarios. In addition, the worst-case error probability provides useful information about the classifier performance, and the features selected by the algorithm are competitive with the state-of-the-art.CNS2022-13520

BCAM's Institutional Repository Data

Probabilistic Load Forecasting Based on Adaptive Online Learning

Author: Lozano J.A.
Mazuelas S.
Álvarez V.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

Load forecasting is crucial for multiple energy management tasks such as scheduling generation capacity, planning supply and demand, and minimizing energy trade costs. Such relevance has increased even more in recent years due to the integration of renewable energies, electric cars, and microgrids. Conventional load forecasting techniques obtain singlevalue load forecasts by exploiting consumption patterns of past load demand. However, such techniques cannot assess intrinsic uncertainties in load demand, and cannot capture dynamic changes in consumption patterns. To address these problems, this paper presents a method for probabilistic load forecasting based on the adaptive online learning of hidden Markov models. We propose learning and forecasting techniques with theoretical guarantees, and experimentally assess their performance in multiple scenarios. In particular, we develop adaptive online learning techniques that update model parameters recursively, and sequential prediction techniques that obtain probabilistic forecasts using the most recent parameters. The performance of the method is evaluated using multiple datasets corresponding with regions that have different sizes and display assorted time-varying consumption patterns. The results show that the proposed method can significantly improve the performance of existing techniques for a wide range of scenarios.Ramon y Cajal Grant RYC-2016-19383 Basque Government under the grant "Artificial Intelligence in BCAM number EXP. 2019/00432" Iberdrola Foundation under the 2019 Research Grant

BCAM's Institutional Repository Data