57 research outputs found

    General supervision via probabilistic transformations

    Get PDF
    Different types of training data have led to numerous schemes for supervised classification. Current learning techniques are tailored to one specific scheme and cannot handle general ensembles of training samples. This paper presents a unifying framework for supervised classification with general ensembles of training samples, and proposes the learning methodology of generalized robust risk minimization (GRRM). The paper shows how current and novel supervision schemes can be addressed under the proposed framework by representing the relationship between examples at prediction and training via probabilistic transformations. The results show that GRRM can handle different types of training samples in a unified manner, and enable new supervision schemes that aggregate general ensembles of training samples.RYC-2016-1938

    Female Models in AI and the Fight Against COVID-19

    Get PDF
    Gender imbalance has persisted over time and is well documented in science, technology, engineering and mathematics (STEM) and singularly in artificial intelligence (AI). In this article we emphasize the importance of increasing the visibility and recognition of women researchers to attract and retain women in the AI field. We review the ratio of women in STEM and AI, its evolution through time, and the differences among disciplines. Then, we discuss the main sources of this gender imbalance highlighting the lack of female role models and the problems which may arise; such as the so called Marie Curie complex, suvivorship bias, and impostor syndrome. We also emphasize the importance of active participation of women researchers in conferences, providing statistics corresponding with the leading conferences. Finally, we give examples of several prestigious female researchers in the field and we review their research work related to COVID-19 displayed in the workshop “Artificial Intelligence for the Fight Against COVID-19” (AI4FA COVID-19), which is an example of a more balanced participation between genders.AXA Research Fund through the project “Early Prognosis of COVID-19 Infections via Machine Learning” under the Exceptional Flash Call “Mitigating risk in the wake of the COVID-19 pandemic” Basque Government through the project “Mathematical Modeling Applied to Health

    Deep GEM-based network for weakly supervised UWB ranging error mitigation

    Get PDF
    Ultra-wideband (UWB)-based techniques, while becoming mainstream approaches for high-accurate positioning, tend to be challenged by ranging bias in harsh environments. The emerging learning-based methods for error mitigation have shown great performance improvement via exploiting high semantic features from raw data. However, these methods rely heavily on fully labeled data, leading to a high cost for data acquisition. We present a learning framework based on weak supervision for UWB ranging error mitigation. Specifically, we propose a deep learning method based on the generalized expectation-maximization (GEM) algorithm for robust UWB ranging error mitigation under weak supervision. Such method integrate probabilistic modeling into the deep learning scheme, and adopt weakly supervised labels as prior information. Extensive experiments in various supervision scenarios illustrate the superiority of the proposed method.Ramon y Cajal Grant RYC-2016-1938

    Minimax Classification with 0-1 Loss and Performance Guarantees

    Get PDF
    Supervised classification techniques use training samples to find classification rules with small expected 0-1 loss. Conventional methods achieve efficient learning and out-of-sample generalization by minimizing surrogate losses over specific families of rules. This paper presents minimax risk classifiers (MRCs) that do not rely on a choice of surrogate loss and family of rules. MRCs achieve efficient learning and out-of-sample generalization by minimizing worst-case expected 0-1 loss w.r.t. uncertainty sets that are defined by linear constraints and include the true underlying distribution. In addition, MRCs’ learning stage provides performance guarantees as lower and upper tight bounds for expected 0-1 loss. We also present MRCs’ finite-sample generalization bounds in terms of training size and smallest minimax risk, and show their competitive classification performance w.r.t. state-of-the-art techniques using benchmark datasets.Ramon y Cajal Grant RYC-2016-1938

    Double-Weighting for Covariate Shift Adaptation

    Get PDF
    Supervised learning is often affected by a covariate shift in which the marginal distributions of instances (covariates xx) of training and testing samples ptr(x)p_\text{tr}(x) and pte(x)p_\text{te}(x) are different but the label conditionals coincide. Existing approaches address such covariate shift by either using the ratio pte(x)/ptr(x)p_\text{te}(x)/p_\text{tr}(x) to weight training samples (reweighted methods) or using the ratio ptr(x)/pte(x)p_\text{tr}(x)/p_\text{te}(x) to weight testing samples (robust methods). However, the performance of such approaches can be poor under support mismatch or when the above ratios take large values. We propose a minimax risk classification (MRC) approach for covariate shift adaptation that avoids such limitations by weighting both training and testing samples. In addition, we develop effective techniques that obtain both sets of weights and generalize the conventional kernel mean matching method. We provide novel generalization bounds for our method that show a significant increase in the effective sample size compared with reweighted methods. The proposed method also achieves enhanced classification performance in both synthetic and empirical experiments.CNS2022-135203, “Early Prognosis of COVID-19 Infections via Machine Learning” funded by the AXA Research Fun

    A Deep Learning Approach for Generating Soft Range Information from RF Data

    Get PDF
    Radio frequency (RF)-based techniques are widely adopted for indoor localization despite the challenges in extracting sufficient information from measurements. Soft range information (SRI) offers a promising alternative for highly accurate localization that gives all probable range values rather than a single estimate of distance. We propose a deep learning approach to generate accurate SRI from RF measurements. In particular, the proposed approach is implemented by a network with two neural modules and conducts the generation directly from raw data. Extensive experiments on a case study with two public datasets are conducted to quantify the efficiency in different indoor localization tasks. The results show that the proposed approach can generate highly accurate SRI, and significantly outperforms conventional techniques in both nonline-of-sight (NLOS) detection and ranging error mitigation.Ramon y Cajal Grant RYC-2016-1938

    Generalized Maximum Entropy for Supervised Classification

    Get PDF
    The maximum entropy principle advocates to evaluate events’ probabilities using a distribution that maximizes entropy among those that satisfy certain expectations’ constraints. Such principle can be generalized for arbitrary decision problems where it corresponds to minimax approaches. This paper establishes a framework for supervised classification based on the generalized maximum entropy principle that leads to minimax risk classifiers (MRCs). We develop learning techniques that determine MRCs for general entropy functions and provide performance guarantees by means of convex optimization. In addition, we describe the relationship of the presented techniques with existing classification methods, and quantify MRCs performance in comparison with the proposed bounds and conventional methods.RYC-2016-1938

    A Variational Learning Approach for Concurrent Distance Estimation and Environmental Identification

    Get PDF
    Wireless propagated signals encapsulate rich information for high-accuracy localization and environment sensing. However, the full exploitation of positional and environmental features as well as their correlation remains challenging in complex propagation environments. In this paper, we propose a methodology of variational inference over deep neural networks for concurrent distance estimation and environmental identification. The proposed approach, namely inter-instance variational auto-encoders (IIns-VAEs), conducts inference with latent variables that encapsulate information about both distance and environmental labels. A deep learning network with instance normalization is designed to approximate the inference concurrently via deep learning. We conduct extensive experiments on real-world datasets and the results show the superiority of the proposed IIns-VAE in both distance estimation and environmental identification compared to conventional approaches

    Efficient Learning of Minimax Risk Classifiers in High Dimensions

    Get PDF
    High-dimensional data is common in multiple areas, such as health care and genomics, where the number of features can be tens of thousands. In such scenarios, the large number of features often leads to inefficient learning. Constraint generation methods have recently enabled efficient learning of L1-regularized support vector machines (SVMs). In this paper, we leverage such methods to obtain an efficient learning algorithm for the recently proposed minimax risk classifiers (MRCs). The proposed iterative algorithm also provides a sequence of worst-case error probabilities and performs feature selection. Experiments on multiple high-dimensional datasets show that the proposed algorithm is efficient in high-dimensional scenarios. In addition, the worst-case error probability provides useful information about the classifier performance, and the features selected by the algorithm are competitive with the state-of-the-art.CNS2022-13520

    Probabilistic Load Forecasting Based on Adaptive Online Learning

    Get PDF
    Load forecasting is crucial for multiple energy management tasks such as scheduling generation capacity, planning supply and demand, and minimizing energy trade costs. Such relevance has increased even more in recent years due to the integration of renewable energies, electric cars, and microgrids. Conventional load forecasting techniques obtain singlevalue load forecasts by exploiting consumption patterns of past load demand. However, such techniques cannot assess intrinsic uncertainties in load demand, and cannot capture dynamic changes in consumption patterns. To address these problems, this paper presents a method for probabilistic load forecasting based on the adaptive online learning of hidden Markov models. We propose learning and forecasting techniques with theoretical guarantees, and experimentally assess their performance in multiple scenarios. In particular, we develop adaptive online learning techniques that update model parameters recursively, and sequential prediction techniques that obtain probabilistic forecasts using the most recent parameters. The performance of the method is evaluated using multiple datasets corresponding with regions that have different sizes and display assorted time-varying consumption patterns. The results show that the proposed method can significantly improve the performance of existing techniques for a wide range of scenarios.Ramon y Cajal Grant RYC-2016-19383 Basque Government under the grant "Artificial Intelligence in BCAM number EXP. 2019/00432" Iberdrola Foundation under the 2019 Research Grant
    • 

    corecore