Search CORE

11 research outputs found

Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning

Author: Berkenkamp Felix
Kolter J. Zico
Manek Gaurav
Roderick Melrose
Publication venue
Publication date: 24/11/2023
Field of study

A key problem in off-policy Reinforcement Learning (RL) is the mismatch, or distribution shift, between the dataset and the distribution over states and actions visited by the learned policy. This problem is exacerbated in the fully offline setting. The main approach to correct this shift has been through importance sampling, which leads to high-variance gradients. Other approaches, such as conservatism or behavior-regularization, regularize the policy at the cost of performance. In this paper, we propose a new approach for stable off-policy Q-Learning. Our method, Projected Off-Policy Q-Learning (POP-QL), is a novel actor-critic algorithm that simultaneously reweights off-policy samples and constrains the policy to prevent divergence and reduce value-approximation error. In our experiments, POP-QL not only shows competitive performance on standard benchmarks, but also out-performs competing methods in tasks where the data-collection policy is significantly sub-optimal.Comment: 10 page

arXiv.org e-Print Archive

Truly multi-modal YouTube-8M video classification with video, audio, and text

Author: et al
FANG Yuan
KUAN Kingsley
MANEK Gaurav
RAVANT Mathieu
SONG Sibo
WANG Zhe
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2017
Field of study

Institutional Knowledge at Singapore Management University

Region average pooling for context-aware object detection

Author: CHANDRASEKHAR Vijay
FANG Yuan
KUAN Kingsley
LIN Jie
MANEK Gaurav
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/08/2017
Field of study

Crossref

Institutional Knowledge at Singapore Management University

Recommended from our members

The Role of Dapagliflozin in the Management of Heart Failure: An Update on the Emerging Evidence.

Author: Fonarow Gregg C
Ghosh Raktim K
Gupta Manasvi
Manek Gaurav
Rao Shiavax
Publication venue: eScholarship, University of California
Publication date: 01/01/2021
Field of study

The burden and cost of heart failure management, primarily in the form of hospitalization in the setting of decompensated heart failure, continue to be some of the biggest clinical challenges in cardiovascular medicine. In recently published randomized controlled trials, including DAPA-HF, sodium-glucose cotransporter 2 (SGLT2) inhibitor dapagliflozin was shown to reduce hospitalization from heart failure or mortality associated with cardiovascular causes, when added to existing guideline-directed medical therapy. The American College of Cardiology (ACC) released a Clinical Pathway guideline that recommends the use of dapagliflozin in clinical management of heart failure, with or without diabetes. Furthermore, the results of the DAPA-CKD trial broaden the utility of dapagliflozin as a therapeutic option in patients with advanced kidney disease. In this article, the authors explore the existing evidence on dapagliflozin in heart failure with reduced ejection fraction and highlight the need for further research on uses of dapagliflozin in the world of heart failure

eScholarship - University of California

Proposed Pathogenesis, Characteristics, and Management of COVID-19 mRNA Vaccine-Related Myopericarditis

Author: Ashish Kumar
Bandyopadhyay Dhrubajyoti
Ghosh Binita
Gupta Manasvi
Hajra Adrija
Lavie Carl J
Manek Gaurav
Patel Neelkumar
Rai Devesh
Publication venue: RocScholar
Publication date: 24/11/2021
Field of study

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the novel coronavirus causing coronavirus disease 2019 (COVID-19), has affected human lives across the globe. On 11 December 2020, the US FDA granted an emergency use authorization for the first COVID-19 vaccine, and vaccines are now widely available. Undoubtedly, the emergence of these vaccines has led to substantial relief, helping alleviate the fear and anxiety around the COVID-19 illness for both the general public and clinicians. However, recent cases of vaccine complications, including myopericarditis, have been reported after administration of COVID-19 vaccines. This article discusses the cases, possible pathogenesis of myopericarditis, and treatment of the condition. Most cases were mild and should not yet change vaccine policies, although prospective studies are needed to better assess the risk-benefit ratios in different groups

RocScholar (Rochester Regional Health)