57 research outputs found
General supervision via probabilistic transformations
Different types of training data have led to numerous schemes for supervised classification. Current learning techniques are tailored to one specific scheme and cannot handle general ensembles of training samples. This paper presents a unifying framework for supervised classification with general ensembles of training samples, and proposes the learning methodology of generalized robust risk minimization (GRRM). The paper shows how current and novel supervision schemes can be addressed under the proposed framework by representing the relationship between examples at prediction and training via probabilistic transformations. The results show that GRRM can handle different types of training samples in a unified manner, and enable new supervision schemes that aggregate general ensembles of training samples.RYC-2016-1938
Female Models in AI and the Fight Against COVID-19
Gender imbalance has persisted over time and is well documented in science, technology,
engineering and mathematics (STEM) and singularly in artificial intelligence
(AI). In this article we emphasize the importance of increasing the visibility and
recognition of women researchers to attract and retain women in the AI field. We
review the ratio of women in STEM and AI, its evolution through time, and the differences
among disciplines. Then, we discuss the main sources of this gender
imbalance highlighting the lack of female role models and the problems which may
arise; such as the so called Marie Curie complex, suvivorship bias, and impostor
syndrome. We also emphasize the importance of active participation of women researchers
in conferences, providing statistics corresponding with the leading conferences.
Finally, we give examples of several prestigious female researchers in
the field and we review their research work related to COVID-19 displayed in the
workshop âArtificial Intelligence for the Fight Against COVID-19â (AI4FA COVID-19), which is an example of a more balanced participation between genders.AXA Research Fund through the project âEarly Prognosis of COVID-19 Infections via Machine
Learningâ under the Exceptional Flash Call âMitigating risk in the wake of the COVID-19 pandemicâ
Basque Government through the project âMathematical Modeling Applied to Health
Deep GEM-based network for weakly supervised UWB ranging error mitigation
Ultra-wideband (UWB)-based techniques, while becoming
mainstream approaches for high-accurate positioning,
tend to be challenged by ranging bias in harsh environments.
The emerging learning-based methods for error mitigation have
shown great performance improvement via exploiting high semantic
features from raw data. However, these methods rely
heavily on fully labeled data, leading to a high cost for data
acquisition. We present a learning framework based on weak
supervision for UWB ranging error mitigation. Specifically,
we propose a deep learning method based on the generalized
expectation-maximization (GEM) algorithm for robust UWB
ranging error mitigation under weak supervision. Such method
integrate probabilistic modeling into the deep learning scheme,
and adopt weakly supervised labels as prior information. Extensive
experiments in various supervision scenarios illustrate the
superiority of the proposed method.Ramon y Cajal Grant RYC-2016-1938
Minimax Classification with 0-1 Loss and Performance Guarantees
Supervised classification techniques use training samples to find classification
rules with small expected 0-1 loss. Conventional methods achieve efficient learning
and out-of-sample generalization by minimizing surrogate losses over specific
families of rules. This paper presents minimax risk classifiers (MRCs) that do not
rely on a choice of surrogate loss and family of rules. MRCs achieve efficient
learning and out-of-sample generalization by minimizing worst-case expected 0-1
loss w.r.t. uncertainty sets that are defined by linear constraints and include the
true underlying distribution. In addition, MRCsâ learning stage provides performance
guarantees as lower and upper tight bounds for expected 0-1 loss. We also
present MRCsâ finite-sample generalization bounds in terms of training size and
smallest minimax risk, and show their competitive classification performance w.r.t.
state-of-the-art techniques using benchmark datasets.Ramon y Cajal Grant RYC-2016-1938
Double-Weighting for Covariate Shift Adaptation
Supervised learning is often affected by a covariate shift in which the marginal distributions of instances (covariates ) of training and testing samples and are different but the label conditionals coincide.
Existing approaches address such covariate shift by either using the ratio to weight training samples (reweighted methods) or using the ratio to weight testing samples (robust methods).
However, the performance of such approaches can be poor under support mismatch or when the above ratios take large values.
We propose a minimax risk classification (MRC) approach for covariate shift adaptation that avoids such limitations by weighting both training and testing samples.
In addition, we develop effective techniques that obtain both sets of weights and generalize the conventional kernel mean matching method.
We provide novel generalization bounds for our method that show a significant increase in the effective sample size compared with reweighted methods.
The proposed method also achieves enhanced classification performance in both synthetic and empirical experiments.CNS2022-135203, âEarly Prognosis of COVID-19 Infections via Machine Learningâ funded by the AXA Research Fun
A Deep Learning Approach for Generating Soft Range Information from RF Data
Radio frequency (RF)-based techniques are widely
adopted for indoor localization despite the challenges in extracting
sufficient information from measurements. Soft range
information (SRI) offers a promising alternative for highly
accurate localization that gives all probable range values rather
than a single estimate of distance. We propose a deep learning
approach to generate accurate SRI from RF measurements. In
particular, the proposed approach is implemented by a network
with two neural modules and conducts the generation directly
from raw data. Extensive experiments on a case study with
two public datasets are conducted to quantify the efficiency
in different indoor localization tasks. The results show that
the proposed approach can generate highly accurate SRI, and
significantly outperforms conventional techniques in both nonline-of-sight (NLOS) detection and ranging error mitigation.Ramon y Cajal Grant RYC-2016-1938
Generalized Maximum Entropy for Supervised Classification
The maximum entropy principle advocates to
evaluate eventsâ probabilities using a distribution that maximizes
entropy among those that satisfy certain expectationsâ constraints. Such principle can be generalized for arbitrary decision
problems where it corresponds to minimax approaches. This
paper establishes a framework for supervised classification based
on the generalized maximum entropy principle that leads to
minimax risk classifiers (MRCs). We develop learning techniques
that determine MRCs for general entropy functions and provide
performance guarantees by means of convex optimization. In
addition, we describe the relationship of the presented techniques
with existing classification methods, and quantify MRCs performance in comparison with the proposed bounds and conventional
methods.RYC-2016-1938
A Variational Learning Approach for Concurrent Distance Estimation and Environmental Identification
Wireless propagated signals encapsulate rich information
for high-accuracy localization and environment sensing.
However, the full exploitation of positional and environmental
features as well as their correlation remains challenging in
complex propagation environments. In this paper, we propose
a methodology of variational inference over deep neural networks
for concurrent distance estimation and environmental
identification. The proposed approach, namely inter-instance
variational auto-encoders (IIns-VAEs), conducts inference with
latent variables that encapsulate information about both distance
and environmental labels. A deep learning network with instance
normalization is designed to approximate the inference concurrently
via deep learning. We conduct extensive experiments on
real-world datasets and the results show the superiority of the
proposed IIns-VAE in both distance estimation and environmental
identification compared to conventional approaches
Efficient Learning of Minimax Risk Classifiers in High Dimensions
High-dimensional data is common in multiple areas, such as health care and genomics, where the
number of features can be tens of thousands. In
such scenarios, the large number of features often leads to inefficient learning. Constraint generation methods have recently enabled efficient learning of L1-regularized support vector machines
(SVMs). In this paper, we leverage such methods
to obtain an efficient learning algorithm for the recently proposed minimax risk classifiers (MRCs).
The proposed iterative algorithm also provides a
sequence of worst-case error probabilities and performs feature selection. Experiments on multiple
high-dimensional datasets show that the proposed
algorithm is efficient in high-dimensional scenarios. In addition, the worst-case error probability
provides useful information about the classifier
performance, and the features selected by the algorithm are competitive with the state-of-the-art.CNS2022-13520
Probabilistic Load Forecasting Based on Adaptive Online Learning
Load forecasting is crucial for multiple energy management
tasks such as scheduling generation capacity, planning
supply and demand, and minimizing energy trade costs. Such
relevance has increased even more in recent years due to the
integration of renewable energies, electric cars, and microgrids.
Conventional load forecasting techniques obtain singlevalue
load forecasts by exploiting consumption patterns of past
load demand. However, such techniques cannot assess intrinsic
uncertainties in load demand, and cannot capture dynamic
changes in consumption patterns. To address these problems,
this paper presents a method for probabilistic load forecasting
based on the adaptive online learning of hidden Markov models.
We propose learning and forecasting techniques with theoretical
guarantees, and experimentally assess their performance in
multiple scenarios. In particular, we develop adaptive online
learning techniques that update model parameters recursively,
and sequential prediction techniques that obtain probabilistic
forecasts using the most recent parameters. The performance of
the method is evaluated using multiple datasets corresponding
with regions that have different sizes and display assorted
time-varying consumption patterns. The results show that the
proposed method can significantly improve the performance of
existing techniques for a wide range of scenarios.Ramon y Cajal Grant RYC-2016-19383
Basque Government under the grant "Artificial Intelligence in BCAM number EXP. 2019/00432"
Iberdrola
Foundation under the 2019 Research Grant
- âŠ