Search CORE

44 research outputs found

Identifiability in inverse reinforcement learning

Author: Cao Haoyang
Cohen Samuel N.
Szpruch Lukasz
Publication venue
Publication date: 08/11/2021
Field of study

Inverse reinforcement learning attempts to reconstruct the reward function in a Markov decision problem, using observations of agent actions. As already observed in Russell [1998] the problem is ill-posed, and the reward function is not identifiable, even under the presence of perfect information about optimal behavior. We provide a resolution to this non-identifiability for problems with entropy regularization. For a given environment, we fully characterize the reward functions leading to a given policy and demonstrate that, given demonstrations of actions for the same reward under two distinct discount factors, or under sufficiently different environments, the unobserved reward can be recovered up to a constant. We also give general necessary and sufficient conditions for reconstruction of time-homogeneous rewards on finite horizons, and for action-independent rewards, generalizing recent results of Kim et al. [2021] and Fu et al. [2018]

arXiv.org e-Print Archive

Oxford University Research Archive

Urania: Visualizing Data Analysis Pipelines for Natural Language-Based Data Exploration

Author: Cao Nan
Chen Qing
Guo Yi
Li Haoyang
Qi Xiaoyu
Shi Danqing
Weiskopf Daniel
Zhang Jing
Publication venue
Publication date: 13/06/2023
Field of study

Exploratory Data Analysis (EDA) is an essential yet tedious process for examining a new dataset. To facilitate it, natural language interfaces (NLIs) can help people intuitively explore the dataset via data-oriented questions. However, existing NLIs primarily focus on providing accurate answers to questions, with few offering explanations or presentations of the data analysis pipeline used to uncover the answer. Such presentations are crucial for EDA as they enhance the interpretability and reliability of the answer, while also helping users understand the analysis process and derive insights. To fill this gap, we introduce Urania, a natural language interactive system that is able to visualize the data analysis pipelines used to resolve input questions. It integrates a natural language interface that allows users to explore data via questions, and a novel data-aware question decomposition algorithm that resolves each input question into a data analysis pipeline. This pipeline is visualized in the form of a datamation, with animated presentations of analysis operations and their corresponding data changes. Through two quantitative experiments and expert interviews, we demonstrated that our data-aware question decomposition algorithm outperforms the state-of-the-art technique in terms of execution accuracy, and that Urania can help people explore datasets better. In the end, we discuss the observations from the studies and the potential future works

arXiv.org e-Print Archive

Combined early palliative care for non-small-cell lung cancer patients: a randomized controlled trial in Chongqing, China

Author: Haoyang Cao
Hong Yang
Huiqing Yu
Huiqing Yu
Lei Lei
Liejun Yang
Liling Ma
Ling Tian
Mengting Chen
Shihong Liu
Sixiong Wang
Publication venue: Frontiers Media S.A.
Publication date: 01/09/2023
Field of study

PurposeMore effective approaches are needed to improve the prognosis of non-small-cell lung cancer (NSCLC) patients. Thus, we used the E-warm model to assess how early integration of interdisciplinary palliative care was related to the quality of life (QoL), psychological functioning, pain management, and nutrition factors of NSCLC patients.MethodsThis randomized controlled trial enrolled 280 newly diagnosed NSCLC patients, which were randomly divided (1:1) into combined early palliative care (CEPC) and standard oncological care (SC) groups. At baseline and after 24 weeks, the Functional Assessment of Cancer Therapy-Lung (FACT-L) scale, Hospital Anxiety and Depression Scale (HADS), and the Patient Health Questionnaire-9 (PHQ-9) were used to assess QoL and psychological function, respectively. The Numerical Rating Scale (NRS) and Patient-Generated Subjective Global Assessment (PG-SGA) were used to assess cancer patients’ pain and nutrition levels. The primary outcome was overall survival (OS). Secondary outcomes comprised changes in the QoL, psychological functioning, pain, and nutrition state. The intention-to-treat method was applied for analysis. This study was registered at www.chictr.org.cn (ChiCTR2200062617).ResultsOf the 140 patients enrolled in the CEPC and SC groups, 102 and 82 completed the research. The CEPC group presented higher QoL than the SC group (p < 0.05). Additionally, fewer patients presented depressive symptoms in the CEPC group than in the SC group (p < 0.05), as well as better nutritional status (p = 0.007) and pain management (p = 0.003). Compared to the SC group, CEPC patients had significantly longer OS (20.4 vs. 24.6 months, p = 0.042; HR: 0.19; 95% CI: 0.04-0.85, p = 0.029).ConclusionWith combined early palliative care, NSCLC patients lived longer, had better QoL, were psychologically stable, were in less pain, and were more nutritionally satisfied

Directory of Open Access Journals

arrayMap: A Reference Resource for Genomic Copy Number Imbalances in Human Malignancies

Author: A Bergamaschi
A Kallioniemi
AM Krasinskas
BA Weir
C Curtis
C F
CC Laurie
CG Mullighan
E Alloza
F Bourdeaut
GR Bignell
H Bengtsson
H Bengtsson
H Bengtsson
H Parkinson
Haoyang Cai
HV Firth
I Scheinin
J Climent
J Gearhart
J Greshock
J Li
JI van der Rhee
JR Pollack
JS Smith
K Chin
K Jin
KN Stevens
LO Baumbusch
M Baudis
M Baudis
M Lubin
Michael Baudis
Network Cancer Genome Atlas Research
NI Park
Nitin Kumar
NP Carter
PA Fujita
Q Cao
R Dalla-Favera
R Wiedemeyer
RL Stallings
RN Wiltshire
S Heinrichs
S Horvath
S Myllykangas
SA Forbes
T Barrett
W Zhang
X Hu
Ying Xu
Z Kan
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 11/01/2012
Field of study

Background: The delineation of genomic copy number abnormalities (CNAs) from cancer samples has been instrumental for identification of tumor suppressor genes and oncogenes and proven useful for clinical marker detection. An increasing number of projects have mapped CNAs using high-resolution microarray based techniques. So far, no single resource does provide a global collection of readily accessible oncoge- nomic array data. Methodology/Principal Findings: We here present arrayMap, a curated reference database and bioinformatics resource targeting copy number profiling data in human cancer. The arrayMap database provides a platform for meta-analysis and systems level data integration of high-resolution oncogenomic CNA data. To date, the resource incorporates more than 40,000 arrays in 224 cancer types extracted from several resources, including the NCBI's Gene Expression Omnibus (GEO), EBIs ArrayExpress (AE), The Cancer Genome Atlas (TCGA), publication supplements and direct submissions. For the majority of the included datasets, probe level and integrated visualization facilitate gene level and genome wide data re- view. Results from multi-case selections can be connected to downstream data analysis and visualization tools. Conclusions/Significance: To our knowledge, currently no data source provides an extensive collection of high resolution oncogenomic CNA data which readily could be used for genomic feature mining, across a representative range of cancer entities. arrayMap represents our effort for providing a long term platform for oncogenomic CNA data independent of specific platform considerations or specific project dependence. The online database can be accessed at http://www.arraymap.org.Comment: 17 pages, 5 inline figures, 3 tables, supplementary figures/tables split into 4 PDF files; manuscript submitted to PLoS ON

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ZORA

FigShare

Recommended from our members

Connecting Mean-field Games and Generative Adversarial Networks

Author: Cao Haoyang
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

The theory of mean-field games (MFGs) belongs to a branch of game theory that studies a large population of (weakly) interacting players. It serves as an analytically feasible framework to approximate stochastic differential games when the number of players is large and detailed characterization of the interactions is computationally expensive. As an effective modeling tool, the theory of MFGs attracts the attention of a variety of application fields in economics, finance and engineering. On the computation front, the development of machine learning provides abundant computational methods of solving for MFGs, which is remarkably meaningful in practice. At the same time, people may also wonder if the theory of MFGs, or stochastic analysis in general, could benefit the machine learning community.This thesis starts with two MFG models, with singular and impulse types of controls, respectively. Theses two control types allow certain degrees of discontinuity, making them better mathematical models compared with regular controls where the interventions must be absolute continuous. However, due to the theoretical challenges brought by the discontinuous nature of the controls, these two models are less explored in existing literature compared with MFGs with regular controls. Both models are motivated by real-world problems. Explicit solutions to the MFGs are presented and shown to approximate Nash equilibria of the corresponding N-player games with an error of the order O(1⁄√N). Further analysis of the solutions reveals the game effect from interacting with the mean-field.Obtaining analytical solutions of MFGs is difficult in general. The thesis then turns to the computation side of MFGs and establish the connection with generative adversarial networks, a celebrated deep learning tool that enjoys tremendous empirical success since its introduction to the machine learning community. It first shows a conceptual connection between GANs and MFGs: MFGs have the structure of GANs, and GANs are MFGs under the Pareto Optimality criterion. Interpreting MFGs as GANs, on one hand, enables a GANs-based algorithm (MFGANs) to solve MFGs: one neural network (NN) for the backward HJB equation and one NN for the forward FP equation, with the two NNs trained in an adversarial way. Viewing GANs as MFGs, on the other hand, reveals a new and probabilistic aspect of GANs. This new perspective, moreover, leads to an analytical connection between GANs and Optimal Transport (OT) problems, and sufficient conditions for the minimax games of GANs to be reformulated in the framework of OT. Numerical experiments demonstrate superior performance of this proposed algorithm, especially in higher dimensional case, when compared with existing NN approaches.Finally, the thesis explores the possibility of enriching the theoretical understanding of the training of GANs from the perspective of stochastic analysis. It establishes approximations, with precise error bound analysis, for the training of GANs under stochastic gradient algorithms (SGAs). The approximations are in the form of coupled stochastic differential equations (SDEs). The analysis of the SDEs and the associated invariant measures yields conditions for the stability and the convergence of GANs training. Further analysis of the invariant measure for the coupled SDEs gives rise to a fluctuation-dissipation relations (FDRs) for GANs, revealing the trade-off of the loss landscape between the generator and the discriminator and providing guidance for learning rate scheduling

eScholarship - University of California

Stationary Discounted and Ergodic Mean Field Games of Singular Control

Author: Cao Haoyang
Dianetti Jodi
Ferrari Giorgio
Publication venue: Center for Mathematical Economics
Publication date: 01/01/2021
Field of study

Cao H, Dianetti J, Ferrari G. Stationary Discounted and Ergodic Mean Field Games of Singular Control. Center for Mathematical Economics Working Papers. Vol 650. Bielefeld: Center for Mathematical Economics; 2021.We study stationary mean field games with singular controls in which the representative player interacts with a long-time weighted average of the population through a discounted and an erodic performance criterion. This class of games finds natural applications in the context of optimal productivity expansion in dynamic oligopolies. We prove existence and uniqueness of the mean field equilibria for the discounted and the ergodic games by showing the validity of an Abelian limit. The latter allows also to approximate Nash equilibria of - so far unexplored - symmetric N-player ergodic singular control games through the mean field equilibrium of the discounted game. Numerical examples finally illustrate in a case study the dependency of the mean field equilibria with respect to the parameters of the games

Publications at Bielefeld University