
    Controlled Decoding from Language Models

    We propose controlled decoding (CD), a novel off-policy reinforcement learning method to steer autoregressive generation from language models towards high-reward outcomes. CD solves an off-policy reinforcement learning problem through a value function for the reward, which we call a prefix scorer. The prefix scorer is used at inference time to steer the generation towards higher-reward outcomes. We show that the prefix scorer may be trained on (possibly) off-policy data to predict the expected reward when decoding is continued from a partially decoded response. We empirically demonstrate that CD is effective as a control mechanism on the Reddit conversations corpus. We also show that the modularity of CD's design makes it possible to control for multiple rewards, effectively solving a multi-objective reinforcement learning problem with no additional complexity. Finally, we show that CD can be applied in a novel blockwise fashion at inference time, again without any training-time changes, essentially bridging the gap between the popular best-of-K strategy and token-level reinforcement learning. This makes CD a promising approach for the alignment of language models.
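
    The prefix scorer can be read as a per-token value estimate, so a minimal sketch of token-level CD is to shift each candidate token's logit by a weighted value of the extended prefix and then sample from the re-weighted distribution. The sketch below uses toy stand-ins for the language model and the prefix scorer; the names (lm_logits, prefix_scorer) and the weight cd_lambda are illustrative assumptions, not the paper's implementation.

        import numpy as np

        rng = np.random.default_rng(0)
        VOCAB = ["good", "bad", "neutral", "<eos>"]

        def lm_logits(prefix):
            # Stand-in for the base language model's next-token logits.
            return rng.normal(size=len(VOCAB))

        def prefix_scorer(prefix):
            # Stand-in for the learned value function: the expected final
            # reward if decoding continues from this partial response.
            return 1.0 if "good" in prefix else 0.0

        def cd_step(prefix, cd_lambda=2.0):
            # Shift each token's logit by lambda * value of the extended
            # prefix, then sample from the re-weighted distribution.
            logits = lm_logits(prefix)
            adjusted = np.array([logits[i] + cd_lambda * prefix_scorer(prefix + [tok])
                                 for i, tok in enumerate(VOCAB)])
            probs = np.exp(adjusted - adjusted.max())
            probs /= probs.sum()
            return VOCAB[rng.choice(len(VOCAB), p=probs)]

        response = []
        for _ in range(5):
            tok = cd_step(response)
            response.append(tok)
            if tok == "<eos>":
                break
        print(" ".join(response))

    The blockwise variant mentioned above would apply the same scorer at a coarser granularity: sample several candidate continuations per block and keep the one the prefix scorer values most, which is why it sits between best-of-K reranking and token-level reinforcement learning.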

    Deep Learning for Entity Matching: A Design Space Exploration

    Entity matching (EM) finds data instances that refer to the same real-world entity. In this thesis we examine applying deep learning (DL) to EM, to understand DL's benefits and limitations. We review many DL solutions that have been developed for related matching tasks in text processing (e.g., entity linking and textual entailment). We categorize these solutions and define a space of DL solutions for EM, as embodied by four solutions with varying representational power: SIF, RNN, Attention, and Hybrid. Next, we investigate the types of EM problems for which DL can be helpful. We consider three such problem types, which match structured data instances, textual instances, and dirty instances, respectively. We empirically compare the above four DL solutions with Magellan, a state-of-the-art learning-based EM solution. The results show that DL does not outperform current solutions on structured EM, but it can significantly outperform them on textual and dirty EM. For practitioners, this suggests that they should seriously consider using DL for textual and dirty EM problems. We then analyze DL's performance and discuss future research directions. Finally, we present Deepmatcher, a Python package for performing entity matching using deep learning.
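
    As a concrete illustration, below is a hedged sketch of the Deepmatcher workflow on a labeled dataset of candidate record pairs; the directory and file names are hypothetical, and the argument names follow the package's public examples from memory, so details may differ.

        import deepmatcher as dm

        # Each CSV row holds one candidate pair of entity records plus a
        # match / non-match label.
        train, validation, test = dm.data.process(
            path='sample_data',          # hypothetical directory of labeled pairs
            train='train.csv',
            validation='validation.csv',
            test='test.csv')

        # 'hybrid' picks the most expressive of the four solutions studied
        # here (SIF, RNN, Attention, Hybrid).
        model = dm.MatchingModel(attr_summarizer='hybrid')
        model.run_train(train, validation, best_save_path='hybrid_model.pth')
        model.run_eval(test)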