    Risk-Sensitive Reinforcement Learning: A Constrained Optimization Viewpoint

    Full text link
    The classic objective in a reinforcement learning (RL) problem is to find a policy that minimizes, in expectation, a long-run objective such as the infinite-horizon discounted or long-run average cost. In many practical applications, optimizing the expected value alone is not sufficient, and it may be necessary to include a risk measure in the optimization process, either as the objective or as a constraint. Various risk measures have been proposed in the literature, e.g., the mean-variance tradeoff, exponential utility, percentile performance, value at risk, conditional value at risk, and prospect theory together with its later enhancement, cumulative prospect theory. In this article, we focus on the combination of risk criteria and reinforcement learning in a constrained optimization framework, i.e., a setting where the goal is to find a policy that optimizes the usual objective of infinite-horizon discounted/average cost while ensuring that an explicit risk constraint is satisfied. We introduce the risk-constrained RL framework, cover popular risk measures based on variance, conditional value-at-risk, and cumulative prospect theory, and present a template for a risk-sensitive RL algorithm. We survey some of our recent work on this topic, covering problems encompassing discounted cost, average cost, and stochastic shortest path settings, together with the aforementioned risk measures, in a constrained framework. This non-exhaustive survey is aimed at giving a flavor of the challenges involved in solving a risk-sensitive RL problem and at outlining some potential future research directions.
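    Schematically, the constrained formulation above can be written as follows (the notation is illustrative, not taken verbatim from the article):

    \[
    \min_{\theta}\; J(\theta) = \mathbb{E}\Big[\sum_{t=0}^{\infty} \gamma^{t}\, c(s_t, a_t) \,\Big|\, \pi_{\theta}\Big]
    \quad \text{subject to} \quad G(\theta) \le \alpha,
    \]

    where $G$ is the chosen risk measure of the cumulative cost (e.g., its variance or CVaR) and $\alpha$ is a risk budget. A standard route is the Lagrangian relaxation $\mathcal{L}(\theta, \lambda) = J(\theta) + \lambda\,(G(\theta) - \alpha)$ with $\lambda \ge 0$, updating $\theta$ and $\lambda$ on separate timescales.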

    Asynchronous Gossip for Averaging and Spectral Ranking

    Full text link
    We consider two variants of the classical gossip algorithm. The first variant is a version of asynchronous stochastic approximation. We highlight a fundamental difficulty associated with the classical asynchronous gossip scheme, viz., that it may not converge to a desired average, and suggest an alternative scheme based on reinforcement learning that has guaranteed convergence to the desired average. We then discuss a potential application to a wireless network setting with simultaneous link activation constraints. The second variant is a gossip algorithm for distributed computation of the Perron-Frobenius eigenvector of a nonnegative matrix. While the first variant draws upon a reinforcement learning algorithm for an average cost controlled Markov decision problem, the second variant draws upon a reinforcement learning algorithm for risk-sensitive control. We then discuss potential applications of the second variant to ranking schemes, reputation networks, and principal component analysis.
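    A minimal sketch of the two underlying primitives in Python (classic pairwise gossip averaging and a centralized power iteration; these are illustrative stand-ins, not the paper's reinforcement-learning-based schemes):

        import numpy as np

        def gossip_average(x, steps, rng=None):
            # Pairwise gossip: two random nodes repeatedly replace their
            # values with the pair's average. Each exchange preserves the
            # sum, so the values contract toward the global average.
            rng = np.random.default_rng(0) if rng is None else rng
            x = np.array(x, dtype=float)
            for _ in range(steps):
                i, j = rng.choice(len(x), size=2, replace=False)
                x[i] = x[j] = (x[i] + x[j]) / 2.0
            return x

        def perron_vector(A, iters=200):
            # Power iteration for the Perron-Frobenius eigenvector of a
            # nonnegative matrix A (the paper computes this in a
            # distributed, gossip-based fashion instead).
            v = np.ones(A.shape[0])
            for _ in range(iters):
                v = A @ v
                v /= np.abs(v).sum()  # L1-normalize at every step
            return v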

    CogBoost: Boosting for Fast Cost-Sensitive Graph Classification

    Full text link
    Graph classification has drawn great interest in recent years due to the increasing number of applications involving objects with complex structural relationships. To date, all existing graph classification algorithms assume, explicitly or implicitly, that misclassifying instances in different classes incurs an equal amount of cost (or risk), which is often not the case in real-life applications (where misclassifying a certain class of samples, such as diseased patients, incurs a higher cost than others). Although cost-sensitive learning has been extensively studied, all existing methods are based on data with an instance-feature representation. Graphs, however, do not have features readily available for learning, and the feature space of graph data is likely infinite and needs to be carefully explored in order to favor classes with a higher cost. In this paper, we propose CogBoost, a fast cost-sensitive graph classification algorithm that aims to minimize misclassification costs (instead of errors) and achieve fast learning on large-scale graph data sets. To minimize misclassification costs, CogBoost iteratively selects the most discriminative subgraph by considering the costs of different classes, and then solves a linear programming problem in each iteration using an optimal loss function based on the Bayes decision rule. In addition, a cutting plane algorithm is derived to speed up the solution of the linear programs for fast learning on large-scale data sets. Experiments and comparisons on real-world large graph data sets demonstrate the effectiveness and efficiency of our algorithm.
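    The Bayes decision rule underlying cost-sensitive classification can be sketched in a few lines of Python (a generic illustration of the principle, not CogBoost's linear-programming formulation):

        def bayes_cost_decision(p_pos, cost_fn, cost_fp):
            # Predict the label with the lower expected misclassification
            # cost. p_pos estimates P(y = +1 | x); cost_fn is the cost of
            # missing a positive, cost_fp the cost of a false alarm.
            exp_cost_neg = p_pos * cost_fn          # predict -1, truth is +1
            exp_cost_pos = (1.0 - p_pos) * cost_fp  # predict +1, truth is -1
            return +1 if exp_cost_pos <= exp_cost_neg else -1

        # With a 10:1 penalty on missed positives, even a 20% positive
        # probability already yields a positive prediction:
        assert bayes_cost_decision(0.2, cost_fn=10.0, cost_fp=1.0) == +1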

    Development and Optimization of a Machine-Learning Prediction Model for Acute Desquamation After Breast Radiation Therapy in the Multicenter REQUITE Cohort.

    Get PDF
    Some patients with breast cancer treated by surgery and radiation therapy experience clinically significant toxicity, which may adversely affect cosmesis and quality of life. There is a paucity of validated clinical prediction models for radiation toxicity. We used machine learning (ML) algorithms to develop and optimise a clinical prediction model for acute breast desquamation after whole breast external beam radiation therapy in the prospective multicenter REQUITE cohort study. Using demographic and treatment-related features (m = 122) from patients (n = 2058) at 26 centers, we trained 8 ML algorithms with 10-fold cross-validation in a 50:50 random-split data set with class stratification to predict acute breast desquamation. Based on performance in the validation data set, the logistic model tree, random forest, and naïve Bayes models were taken forward to cost-sensitive learning optimisation. One hundred and ninety-two patients experienced acute desquamation. Resampling and cost-sensitive learning optimisation facilitated an improvement in classification performance. Based on maximising sensitivity (true positives), the "hero" model was the cost-sensitive random forest algorithm with a false-negative to false-positive misclassification penalty of 90:1, containing m = 114 predictive features. Model sensitivity and specificity were 0.77 and 0.66, respectively, with an area under the curve of 0.77 in the validation cohort. ML algorithms with resampling and cost-sensitive learning generated clinically valid prediction models for acute desquamation using patient demographic and treatment features. Further external validation and inclusion of genomic markers in ML prediction models are worthwhile to identify patients at increased risk of toxicity who may benefit from supportive intervention or even a change in treatment plan.
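    A minimal sketch of the cost-sensitive random forest idea in Python, with scikit-learn's class_weight standing in for the 90:1 false-negative to false-positive penalty (the synthetic data and all parameter choices are illustrative, not the study's exact pipeline):

        from sklearn.datasets import make_classification
        from sklearn.ensemble import RandomForestClassifier
        from sklearn.metrics import recall_score, roc_auc_score
        from sklearn.model_selection import train_test_split

        # Synthetic stand-in for the demographic/treatment features and the
        # binary desquamation label (1 = acute desquamation, ~10% prevalence).
        X, y = make_classification(n_samples=2000, n_features=20,
                                   weights=[0.9, 0.1], random_state=0)

        # 50:50 random split with class stratification, as in the study.
        X_tr, X_val, y_tr, y_val = train_test_split(
            X, y, test_size=0.5, stratify=y, random_state=0)

        # Approximate the 90:1 misclassification penalty by up-weighting
        # the positive class.
        clf = RandomForestClassifier(class_weight={0: 1, 1: 90}, random_state=0)
        clf.fit(X_tr, y_tr)

        print("sensitivity:", recall_score(y_val, clf.predict(X_val)))
        print("AUC:", roc_auc_score(y_val, clf.predict_proba(X_val)[:, 1]))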

    Cost-Sensitive Selective Classification and its Applications to Online Fraud Management

    Get PDF
    Fraud is defined as the use of deception for illegal gain by hiding the true nature of the activity. Organizations lose around $3.7 trillion in revenue worldwide to financial crimes and fraud, which affect all levels of society. In this dissertation, I focus on credit card fraud in online transactions. Every online transaction comes with a fraud risk, and it is the merchant's liability to detect and stop fraudulent transactions. Merchants utilize various mechanisms to prevent and manage fraud, such as automated fraud detection systems and manual transaction reviews by expert fraud analysts. Most proposed solutions focus on fraud detection accuracy and ignore financial considerations; the highly effective manual review process is also overlooked. First, I propose the Profit Optimizing Neural Risk Manager (PONRM), a selective classifier that (a) constitutes optimal collaboration between machine learning models and human expertise under industrial constraints, and (b) is cost and profit sensitive. I suggest directions on how to characterize fraudulent behavior and assess the risk of a transaction, and show that my framework outperforms cost-sensitive and cost-insensitive baselines on three real-world merchant datasets. While PONRM is able to work with many supervised learners and obtain convincing results, using probability outputs directly from the trained model itself can pose problems, especially in deep learning, as softmax output is not a true uncertainty measure. This phenomenon, together with the wide and rapid adoption of deep learning by practitioners, has had unintended consequences in many situations, such as the infamous case of Google Photos' racist image recognition algorithm, and necessitates the use of quantified uncertainty for each prediction. There have been recent efforts toward quantifying uncertainty in conventional deep learning methods (e.g., dropout as a Bayesian approximation); however, their optimal use in decision making is often overlooked and understudied. Thus, I present a mixed-integer programming framework for selective classification, called MIPSC, that investigates and combines model uncertainty and the predictive mean to identify optimal classification and rejection regions. I also extend this framework to cost-sensitive settings (MIPCSC) and apply it to the critical real-world problem of online fraud management, showing that my approach significantly outperforms industry-standard methods in real-world settings.
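    The cost-sensitive selective-classification idea can be sketched in Python as a simple expected-cost comparison (an illustrative reduction, not the dissertation's mixed-integer programming formulation; the cost figures are made up):

        def route_transaction(p_fraud, cost_fn, cost_fp, review_cost):
            # Pick whichever action has the lowest expected cost:
            #   accept  -> pay cost_fn if the transaction was fraud,
            #   decline -> pay cost_fp if it was legitimate,
            #   review  -> pay a flat analyst cost.
            exp_accept = p_fraud * cost_fn
            exp_decline = (1.0 - p_fraud) * cost_fp
            if review_cost < min(exp_accept, exp_decline):
                return "review"
            return "accept" if exp_accept <= exp_decline else "decline"

        # A mid-risk transaction is worth a manual review when analyst
        # time is cheap relative to chargeback and lost-customer costs:
        assert route_transaction(0.3, cost_fn=100, cost_fp=50, review_cost=10) == "review"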

    A Low-Cost Machine Learning Based Network Intrusion Detection System With Data Privacy Preservation

    Get PDF
    Network intrusion is a well-studied area of cyber security. Current machine learning-based network intrusion detection systems (NIDSs) monitor network data and the patterns within those data, but at the cost of presenting significant privacy issues that may threaten end users. Therefore, to mitigate risk and preserve a balance between security and privacy, it is imperative to protect user privacy with respect to intrusion data. Moreover, cost is a key driver for a machine learning-based NIDS, because such systems are increasingly being deployed on resource-limited edge devices. To address these issues, in this paper we propose a NIDS called PCC-LSM-NIDS, composed of a Pearson Correlation Coefficient (PCC) based feature selection algorithm and a Least Square Method (LSM) based privacy-preserving algorithm, to achieve low-cost intrusion detection while preserving the privacy of sensitive data. The proposed PCC-LSM-NIDS is tested on the benchmark intrusion database UNSW-NB15 using five popular classifiers. The experimental results show that the proposed PCC-LSM-NIDS requires less computational time while offering an appropriate degree of privacy protection.
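    PCC-based feature selection as described can be sketched in Python as follows (the top-k rule is illustrative; the paper's exact selection criterion may differ):

        import numpy as np

        def pcc_select(X, y, k):
            # Keep the k features whose absolute Pearson correlation
            # coefficient with the label is largest.
            Xc = X - X.mean(axis=0)
            yc = y - y.mean()
            denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum()) + 1e-12
            pcc = (Xc * yc[:, None]).sum(axis=0) / denom
            keep = np.argsort(-np.abs(pcc))[:k]
            return X[:, keep], keep

    Dropping weakly correlated features shrinks the classifier's input dimension, which is where the computational savings on resource-limited edge devices come from.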
