Search CORE

20 research outputs found

Generalization on the Unseen, Logic Reasoning and Degree Curriculum

Author: Abbe Emmanuel
Bengio Samy
Lotfi Aryo
Rizk Kevin
Publication venue
Publication date: 28/06/2023
Field of study

This paper considers the learning of logical (Boolean) functions with focus on the generalization on the unseen (GOTU) setting, a strong case of out-of-distribution generalization. This is motivated by the fact that the rich combinatorial nature of data in certain reasoning tasks (e.g., arithmetic/logic) makes representative data sampling challenging, and learning successfully under GOTU gives a first vignette of an 'extrapolating' or 'reasoning' learner. We then study how different network architectures trained by (S)GD perform under GOTU and provide both theoretical and experimental evidence that for a class of network models including instances of Transformers, random features models, and diagonal linear networks, a min-degree-interpolator is learned on the unseen. We also provide evidence that other instances with larger learning rates or mean-field networks reach leaky min-degree solutions. These findings lead to two implications: (1) we provide an explanation to the length generalization problem (e.g., Anil et al. 2022); (2) we introduce a curriculum learning algorithm called Degree-Curriculum that learns monomials more efficiently by incrementing supports.Comment: To appear in ICML 202

arXiv.org e-Print Archive

Multi-Objective GFlowNets

Author: Bengio Emmanuel
Bengio Yoshua
Hernandez-Garcia Alex
Jain Moksh
Miret Santiago
Raparthy Sharath Chandra
Rector-Brooks Jarrid
Publication venue
Publication date: 17/07/2023
Field of study

We study the problem of generating diverse candidates in the context of Multi-Objective Optimization. In many applications of machine learning such as drug discovery and material design, the goal is to generate candidates which simultaneously optimize a set of potentially conflicting objectives. Moreover, these objectives are often imperfect evaluations of some underlying property of interest, making it important to generate diverse candidates to have multiple options for expensive downstream evaluations. We propose Multi-Objective GFlowNets (MOGFNs), a novel method for generating diverse Pareto optimal solutions, based on GFlowNets. We introduce two variants of MOGFNs: MOGFN-PC, which models a family of independent sub-problems defined by a scalarization function, with reward-conditional GFlowNets, and MOGFN-AL, which solves a sequence of sub-problems defined by an acquisition function in an active learning loop. Our experiments on wide variety of synthetic and benchmark tasks demonstrate advantages of the proposed methods in terms of the Pareto performance and importantly, improved candidate diversity, which is the main contribution of this work.Comment: 23 pages, 8 figures. ICML 2023. Code at: https://github.com/GFNOrg/multi-objective-gf

arXiv.org e-Print Archive

Learning GFlowNets from partial episodes for improved convergence and stability

Author: Bengio Emmanuel
Bengio Yoshua
Bosc Tom
Jain Moksh
Korablyov Maksym
Madan Kanika
Malkin Nikolay
Nica Andrei
Rector-Brooks Jarrid
Publication venue
Publication date: 03/06/2023
Field of study

Generative flow networks (GFlowNets) are a family of algorithms for training a sequential sampler of discrete objects under an unnormalized target density and have been successfully used for various probabilistic modeling tasks. Existing training objectives for GFlowNets are either local to states or transitions, or propagate a reward signal over an entire sampling trajectory. We argue that these alternatives represent opposite ends of a gradient bias-variance tradeoff and propose a way to exploit this tradeoff to mitigate its harmful effects. Inspired by the TD(

\lambda

) algorithm in reinforcement learning, we introduce subtrajectory balance or SubTB(

\lambda

), a GFlowNet training objective that can learn from partial action subsequences of varying lengths. We show that SubTB(

\lambda

) accelerates sampler convergence in previously studied and new environments and enables training GFlowNets in environments with longer action sequences and sparser reward landscapes than what was possible before. We also perform a comparative analysis of stochastic gradient dynamics, shedding light on the bias-variance tradeoff in GFlowNet training and the advantages of subtrajectory balance.Comment: ICML 202

arXiv.org e-Print Archive