Search CORE

202 research outputs found

The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation

Author: Arora Kushal
Cheung Jackie C. K.
O'Donnell Timothy J.
Precup Doina
Weston Jason
Publication venue
Publication date: 13/02/2023
Field of study

State-of-the-art language generation models can degenerate when applied to open-ended generation problems such as text completion, story generation, or dialog modeling. This degeneration usually shows up in the form of incoherence, lack of vocabulary diversity, and self-repetition or copying from the context. In this paper, we postulate that ``human-like'' generations usually lie in a narrow and nearly flat entropy band, and violation of these entropy bounds correlates with degenerate behavior. Our experiments show that this stable narrow entropy zone exists across models, tasks, and domains and confirm the hypothesis that violations of this zone correlate with degeneration. We then use this insight to propose an entropy-aware decoding algorithm that respects these entropy bounds resulting in less degenerate, more contextual, and "human-like" language generation in open-ended text generation settings

arXiv.org e-Print Archive

Information theoretic approach to interactive learning

Author: Atkinson A. C. Bogacka B. Zhiglkilavskify A. A. (Editors)
Balcan M.-F.
Box G.
Dasgupta S.
Engel A.
Fedorov V. V.
Pack-Kaelbling L.
S. Still
Schmidhuber J.
Shannon C. E.
Still S. Bialek W.
Still S. Crutchfield J. P. Ellison C.
Still S. Precup D.
Sutton R. S.
Tishby N.
Vapnik V.
Publication venue: 'IOP Publishing'
Publication date: 30/01/2009
Field of study

The principles of statistical mechanics and information theory play an important role in learning and have inspired both theory and the design of numerous machine learning algorithms. The new aspect in this paper is a focus on integrating feedback from the learner. A quantitative approach to interactive learning and adaptive behavior is proposed, integrating model- and decision-making into one theoretical framework. This paper follows simple principles by requiring that the observer's world model and action policy should result in maximal predictive power at minimal complexity. Classes of optimal action policies and of optimal models are derived from an objective function that reflects this trade-off between prediction and complexity. The resulting optimal models then summarize, at different levels of abstraction, the process's causal organization in the presence of the learner's actions. A fundamental consequence of the proposed principle is that the learner's optimal action policies balance exploration and control as an emerging property. Interestingly, the explorative component is present in the absence of policy randomness, i.e. in the optimal deterministic behavior. This is a direct result of requiring maximal predictive power in the presence of feedback.Comment: 6 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Numerical reconstruction of brain tumours

Author: Bastay G
Bastay G
Cocosco C
Jaroudi R
Kozlov VA
Murray JD
Murray JD
Nocedal J
Precup R
Release name: Male subject
Publication venue: 'Informa UK Limited'
Publication date: 29/03/2018
Field of study

We propose a nonlinear Landweber method for the inverse problem of locating the brain tumour source (origin where the tumour formed) based on well-established models of reaction–diffusion type for brain tumour growth. The approach consists of recovering the initial density of the tumour cells starting from a later state, which can be given by a medical image, by running the model backwards. Moreover, full three-dimensional simulations are given of the tumour source localization on two types of data, the three-dimensional Shepp–Logan phantom and an MRI T1-weighted brain scan. These simulations are obtained using standard finite difference discretizations of the space and time derivatives, generating a simple approach that performs well

Crossref

Aston Publications Explorer

Why highly expressed proteins evolve slowly

Author: Akashi
Akashi
Akashi
Bloom
Bucciantini
C. Adami
C. O. Wilke
Cho
Coghlan
D. A. Drummond
Dong
Duret
Ellis
F. H. Arnold
Fraser
Ghaemmaghami
Goldberg
Greenbaum
Gu
Herbeck
Hirsh
Holstege
Hurst
J. D. Bloom
Kellis
Kellis
Kurtzman
Marais
Pal
Pal
Parker
Precup
P l
Rokas
Seoighe
Sharp
Sharp
Spreitzer
Subramanian
Wall
Yang
Zuckerkandl
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 12/08/2005
Field of study

Much recent work has explored molecular and population-genetic constraints on the rate of protein sequence evolution. The best predictor of evolutionary rate is expression level, for reasons which have remained unexplained. Here, we hypothesize that selection to reduce the burden of protein misfolding will favor protein sequences with increased robustness to translational missense errors. Pressure for translational robustness increases with expression level and constrains sequence evolution. Using several sequenced yeast genomes, global expression and protein abundance data, and sets of paralogs traceable to an ancient whole-genome duplication in yeast, we rule out several confounding effects and show that expression level explains roughly half the variation in Saccharomyces cerevisiae protein evolutionary rates. We examine causes for expression's dominant role and find that genome-wide tests favor the translational robustness explanation over existing hypotheses that invoke constraints on function or translational efficiency. Our results suggest that proteins evolve at rates largely unrelated to their functions, and can explain why highly expressed proteins evolve slowly across the tree of life.Comment: 40 pages, 3 figures, with supporting informatio

arXiv.org e-Print Archive

Crossref

PubMed Central

Caltech Authors

Determinants of translation efficiency and accuracy

Author: Akashi H
Andersson SG
Barbarese E
Bennetzen JL
Bulmer M
de Sousa Abreu R
Hila Gingold
Ikemura T
Precup J
Shields DC
Yitzhak Pilpel
Publication venue: Nature Publishing Group
Publication date
Field of study

A given protein sequence can be encoded by an astronomical number of alternative nucleotide sequences. Recent research has revealed that this flexibility provides evolution with multiple ways to tune the efficiency and fidelity of protein translation and folding

Crossref

PubMed Central