Search CORE

18 research outputs found

Improving exploration in policy gradient search: Application to symbolic optimization

Author: Faissol Daniel M.
Glatt Ruben
Kim Soo K.
Landajuela Mikel
Mundhenk T. Nathan
Petersen Brenden K.
Pettit Jacob F.
Santiago Claudio P.
Publication venue
Publication date: 19/07/2021
Field of study

Many machine learning strategies designed to automate mathematical tasks leverage neural networks to search large combinatorial spaces of mathematical symbols. In contrast to traditional evolutionary approaches, using a neural network at the core of the search allows learning higher-level symbolic patterns, providing an informed direction to guide the search. When no labeled data is available, such networks can still be trained using reinforcement learning. However, we demonstrate that this approach can suffer from an early commitment phenomenon and from initialization bias, both of which limit exploration. We present two exploration methods to tackle these issues, building upon ideas of entropy regularization and distribution initialization. We show that these techniques can improve the performance, increase sample efficiency, and lower the complexity of solutions for the task of symbolic regression.Comment: Published in 1st Mathematical Reasoning in General Artificial Intelligence Workshop, ICLR 202

arXiv.org e-Print Archive

Recommended from our members

Artificial Intelligence for Climate Change Mitigation Roadmap

Author: d'Aspremont Alexandre
Fan Zhiyuan
Friedman Julio
Glatt Ruben
Halff Antoine M.
Karl Kevin
Kucukelbir Alp
McCormick Colin
Méndez Leal Elena
Nagrani Trishna
Ruane Alexander
Sandalow David B.
Publication venue: Innovation for Cool Earth Forum
Publication date: 01/01/2023
Field of study

The ICEF Roadmap on Artificial Intelligence for Climate Change Mitigation explores high potential opportunities for using artificial intelligence (AI) to fight climate change, including GHG emissions monitoring, the power sector, manufacturing, materials innovation, the food system and road transport. The roadmap also examine barriers and risks, and policies for addressing them. It concludes with findings and recommendations

Columbia University Academic Commons

Mapping genomic loci implicates genes and synaptic biology in schizophrenia

Author: Adams Mark
Adolfsson Rolf
Agartz Ingrid
Agerbo Esben
Al Eissa Mariam
Albus Margot
Alexander Madeline
Alizadeh Behrooz Z
Alptekin Köksal
Als Thomas D
Amin Farooq
Andreassen Ole A
Arango Celso
Arolt Volker
Arrojo Manuel
Atbaşoğlu Eşref Cem
Athanasiu Lavinia
Atkinson Elizabeth G
Awasthi Swapnil
Ayub Muhammad
Azevedo Maria Helena
Bacanu Silviu A
Bass Nicholas J
Baune Bernhard T
Begemann Martin
Belangero Sintia Iole
Belliveau Richard A
Bene Judit
Benner Christian
Benyamin Beben
Bergen Sarah E
Bertolino Alessandro
Bigdeli Tim B
Black Donald W
Blasi Giuseppe
Bobes Julio
Bonassi Stefano
Braff David
Bramon Elvira
Braun Alice
Bray Nicholas J
Breen Gerome
Bressan Rodrigo Affonseca
Bromet Evelyn J
Bruggeman Richard
Bryois Julien
Buccola Nancy G
Buckley Peter F
Buckner Randy L
Buxbaum Joseph D
Bybjerg-Grauholm Jonas
Byerley William F
Børglum Anders D
Cahn Wiepke
Cairns Murray J
Calkins Monica E
Campion Dominique
Carr Vaughan J
Castle David
Catts Stanley V
Cervilla Jorge A
Chambert Kimberley D
Chan Raymond CK
Chaumette Boris
Chen Chia-Yen
Chen Wei J
Cheng Wei
Cheung Eric FC
Chong Siow Ann
Cichon Sven
Cloninger C Robert
Cohen David
Collier David A
Consoli Angèle
Cordeiro Quirino
Corvin Aiden
Costas Javier
Crespo-Facorro Benedicto
Curtis Charles
Curtis David
Daly Mark J
Davidson Michael
Davis Kenneth L
de Haan Lieuwe
Degenhardt Franziska
DeLisi Lynn E
Demontis Ditte
Dennison Charlotte A
Dickerson Faith
Dikeos Dimitris
Dinan Timothy
Djurovic Srdjan
Domenici Enrico
Donohoe Gary
Duan Jubao
Ducci Giuseppe
Dudbridge Frank
Ehrenreich Hannelore
Eriksson Johan G
Escott-Price Valentina
Esko Tõnu
Fanous Ayman H
Faraone Stephen V
Fañanás Lourdes
Fiorentino Alessia
Forstner Andreas
Forti Marta Di
Frank Josef
Freedman Robert
Frei Oleksandr
Freimer Nelson B
Fromer Menachem
Frustaci Alessandra
Gadelha Ary
Galletly Cherrie
Gandal Michael J
Gareeva Anna
Gawlik Micha
Ge Tian
Gejman Pablo V
Gennarelli Massimo
Genovese Giulio
Gershon Elliot S
Giannitelli Marianna
Giegling Ina
Gill Michael
Giusti-Rodríguez Paola
Glatt Stephen J
Godard Stephanie
Goldstein Jacqueline I
Golimbet Vera
González Peñas Javier
González-Pinto Ana
Gopal Srihari
Gratten Jacob
Green Michael F
Greenwood Tiffany A
Grove Jakob
Guillin Olivier
Gur Raquel E
Gur Ruben C
Gutiérrez Blanca
Gülöksüz Sinan
Hahn Eric
Hakonarson Hakon
Hall Lynsey S
Haroutunian Vahram
Hartmann Annette M
Harvey Carol
Harwood Janet C
Hayward Caroline
Henskens Frans A
Herms Stefan
Hoffmann Per
Holmans Peter A
Hong Kyung Sue
Hougaard David M
Howrigan Daniel P
Huang Hailiang
Hultman Christina M
Hwu Hai-Gwo
Hyman Steven E
Ikeda Masashi
Indonesia Schizophrenia Consortium
Iwata Nakao
Iyegbe Conrad
Jablensky Assen V
Joa Inge
Julià Antonio
Jönsson Erik G
Kahn René S
Kam-Thong Tony
Kamatani Yoichiro
Karachanak-Yankova Sena
Kebir Oussama
Keller Matthew C
Kelly Brian J
Kendler Kenneth S
Kennedy James L
Khrunin Andrey
Khusnutdinova Elza
Kim Minsoo
Kim Sung-Wan
Kirov George
Klovins Janis
Knowles James A
Kondratiev Nikolay
Konte Bettina
Koopmans Frank
Kraft Julia
Krebs Marie-Odile
Kubo Michiaki
Kusumawardhani Agung
Kuzelova-Ptackova Hana
Kučinskas Vaidutis
Kučinskiene Zita Ausrele
Kähler Anna K
Lam Max
Landi Stefano
Laurent-Levinson Claudine
Lazzeroni Laura C
Lee Jimmy
Lee Phil H
Legge Sophie E
Lehrer Douglas S
Lencer Rebecca
Lencz Todd
Lerer Bernard
Levinson Douglas F
Li Miaoxin
Li Qingqin S
Li Zhiqiang
Lieberman Jeffrey
Light Gregory A
Limborska Svetlana
Liu Chih-Min
Liu Jianjun
Loughland Carmel M
Lubinski Jan
Luykx Jurjen J
Lynham Amy
Lönnqvist Jouko
Macek Milan
Mackinnon Andrew
Magnusson Patrik KE
Magnusson Sigurdur
Maher Brion S
Maier Wolfgang
Malaspina Dolores
Malhotra Anil K
Malhotra Dheeraj
Mallet Jacques
Marder Stephen R
Marsal Sara
Martin Alicia R
Martorell Lourdes
Mattheisen Manuel
McCarley Robert W
McCarroll Steven A
McDonald Colm
McGrath John J
McIntosh Andrew
McQuillin Andrew
Medeiros Helena
Meier Sandra
Melegh Bela
Melle Ingrid
Menezes Paulo R
Mesholam-Gately Raquelle I
Metspalu Andres
Michie Patricia T
Milani Lili
Milanova Vihra
Mitjans Marina
Molden Espen
Molina Esther
Molto María Dolores
Mondelli Valeria
Moran Jennifer L
Moreno Carmen
Morgan Vera A
Morley Christopher P
Morris Derek W
Mors Ole
Mortensen Preben B
Mowry Bryan J
Muntané Gerard
Murphy Kieran C
Murray Robin M
Myin-Germeys Inez
Müller-Myhsok Bertram
Neale Benjamin M
Neil Amanda L
Nenadić Igor
Nestadt Gerald
Nikitina-Zake Liene
Nimgaonkar Vishwajit
Nordentoft Merete
Noto Cristiano
Nuechterlein Keith H
Nöthen Markus M
O'Brien Niamh Louise
O'Donovan Michael C
O'Neill F Anthony
Oh Sang-Yun
Olincy Ann
Ophoff Roel A
Ota Vanessa Kiyomi
Owen Michael J
Paciga Sara A
Palotie Aarno
Panagiotaropoulou Georgia
Pantelis Christos
Papadimitriou George N
Pardiñas Antonio F
Parellada Mara
Pato Carlos N
Pato Michele T
Paunio Tiina
Pellegrino Renata
Periyasamy Sathish
Perkins Diana O
Petryshen Tracey L
Pfuhlmann Bruno
Pietiläinen Olli
Pimm Jonathan
Pirinen Matti
Pocklington Andrew J
Porteous David
Posthuma Danielle
Powell John
PsychENCODE
Psychosis Endophenotypes International Consortium
Pulver Ann E
Qi Ting
Qin Shengying
Quattrone Diego
Quested Digby
Radant Allen D
Rampino Antonio
Rapaport Mark H
Rautanen Anna
Reichenberg Abraham
Richards Alexander L
Rietschel Marcella
Riley Brien P
Ripke Stephan
Rivera Margarita
Roe Cheryl
Roffman Joshua L
Roth Julian
Rothermundt Matthias
Roussos Panos
Rujescu Dan
Rutten Bart PF
Saka Meram C
Saker-Delye Safaa
Salomaa Veikko
Sanders Alan R
Sanjuan Julio
Santoro Marcos Leite
Savitz Adam
Schall Ulrich
Schizophrenia Working Group of the Psychiatric Genomics Consorti
Schulze Thomas G
Schwab Sibylle G
Scott Rodney J
Seidman Larry J
Serretti Alessandro
Sham Pak C
Sharp Sally Isabel
Shi Jianxin
Shi Yongyong
Sidorenko Julia
Siever Larry J
Sigurdsson Engilbert
Silverman Jeremy M
Sim Kang
Skarabis Nora
Slominsky Petr
Smoller Jordan W
So Hon-Cheong
Sobell Janet L
St Clair David
Stahl Eli A
Stain Helen J
Steen Nils Eiel
Stefansson Kari
Stefánsson Hreinn
Steixner-Kumar Agnes A
Stone William S
Straub Richard E
Streit Fabian
Strengman Eric
Stroup T Scott
Stögmann Elisabeth
Subramaniam Mythily
Sugar Catherine A
Sullivan Patrick F
Suvisaari Jaana
Svrakic Dragan M
Swerdlow Neal R
SynGO Consortium
Szatkiewicz Jin P
Söderman Erik
Ta Thi Minh Tam
Takahashi Atsushi
Terao Chikashi
Thibaut Florence
Toncheva Draga
Tooney Paul A
Torretta Silvia
Tosato Sarah
Trubetskoy Vassily
Tsuang Debby W
Tsuang Ming T
Tura Gian Battista
Turetsky Bruce I
Vaaler Arne
van Amelsvoort Therese
van Os Jim
van Winkel Ruud
Vassos Evangelos
Vawter Marquis P
Veijola Juha
Verhage Matthijs
Vilella Elisabet
Visscher Peter M
Voloudakis Georgios
Waddington John
Walter Henrik
Walters James TR
Wang Shi-Heng
Watanabe Kyoko
Waterreus Anna
Webb Bradley T
Weinberger Daniel R
Weiser Mark
Werge Thomas
Wildenauer Dieter B
Williams Nigel M
Witt Stephanie H
Wormley Brandon K
Wray Naomi R
Wu Jing Qin
Wu Yang
Xu Shuhua
Xu Zhida
Yang Jian
Yolken Robert
Yu Xin
Yue Weihua
Zai Clement C
Zeng Jian
Zhang Wen
Zhou Wei
Zhu Feng
Zimprich Fritz
Üçok Alp
Publication venue: NATURE PORTFOLIO
Publication date: 21/04/2022
Field of study

Schizophrenia has a heritability of 60-80%1, much of which is attributable to common risk alleles. Here, in a two-stage genome-wide association study of up to 76,755 individuals with schizophrenia and 243,649 control individuals, we report common variant associations at 287 distinct genomic loci. Associations were concentrated in genes that are expressed in excitatory and inhibitory neurons of the central nervous system, but not in other tissues or cell types. Using fine-mapping and functional genomic data, we identify 120 genes (106 protein-coding) that are likely to underpin associations at some of these loci, including 16 genes with credible causal non-synonymous or untranslated region variation. We also implicate fundamental processes related to neuronal function, including synaptic organization, differentiation and transmission. Fine-mapped candidates were enriched for genes associated with rare disruptive coding variants in people with schizophrenia, including the glutamate receptor subunit GRIN2A and transcription factor SP4, and were also enriched for genes implicated by such variants in neurodevelopmental disorders. We identify biological processes relevant to schizophrenia pathophysiology; show convergence of common and rare variant associations in schizophrenia and neurodevelopmental disorders; and provide a resource of prioritized genes and variants to advance mechanistic studies

UCL Discovery

Reutilização do conhecimento para aprendizado por reforço profundo.

Author: Glatt Ruben
Publication venue: 'Universidade de Sao Paulo, Agencia USP de Gestao da Informacao Academica (AGUIA)'
Publication date: 12/06/2019
Field of study

With the rise of Deep Learning the field of Artificial Intelligence (AI) Research has entered a new era. Together with an increasing amount of data and vastly improved computing capabilities, Machine Learning builds the backbone of AI, providing many of the tools and algorithms that drive development and applications. While we have already achieved many successes in the fields of image recognition, language processing, recommendation engines, robotics, or autonomous systems, most progress was achieved when the algorithms were focused on learning only a single task with little regard to effort and reusability. Since learning a new task from scratch often involves an expensive learning process, in this work, we are considering the use of previously acquired knowledge to speed up the learning of a new task. For that, we investigated the application of Transfer Learning methods for Deep Reinforcement Learning (DRL) agents and propose a novel framework for knowledge preservation and reuse. We show, that the knowledge transfer can make a big difference if the source knowledge is chosen carefully in a systematic approach. To get to this point, we provide an overview of existing literature of methods that realize knowledge transfer for DRL, a field which has been starting to appear frequently in the relevant literature only in the last two years. We then formulate the Case-based Reasoning methodology, which describes a framework for knowledge reuse in general terms, in Reinforcement Learning terminology to facilitate the adaption and communication between the respective communities. Building on this framework, we propose Deep Case-based Policy Inference (DECAF) and demonstrate in an experimental evaluation the usefulness of our approach for sequential task learning with knowledge preservation and reuse. Our results highlight the benefits of knowledge transfer while also making aware of the challenges that come with it. We consider the work in this area as an important step towards more stable general learning agents that are capable of dealing with the most complex tasks, which would be a key achievement towards Artificial General Intelligence.Com a evolução da Aprendizagem Profunda (Deep Learning), o campo da Inteligência Artificial (IA) entrou em uma nova era. Juntamente com uma quantidade crescente de dados e recursos computacionais cada vez mais aprimorados, o Aprendizado de Máquina estabelece a base para a IA moderna, fornecendo muitas das ferramentas e algoritmos que impulsionam seu desenvolvimento e aplicações. Apesar dos muitos sucessos nas áreas de reconhecimento de imagem, processamento de linguagem natural, sistemas de recomendação, robótica e sistemas autônomos, a maioria dos avanços foram feitos focando no aprendizado de apenas uma única tarefa, sem muita atenção aos esforços dispendidos e reusabilidade da solução. Como o aprendizado de uma nova tarefa geralmente envolve um processo de aprendizado despendioso, neste trabalho, estamos considerando o reúso de conhecimento para acelerar o aprendizado de uma nova tarefa. Para tanto, investigamos a aplicação dos métodos de Transferência de Aprendizado (Transfer Learning) para agentes de Aprendizado por Reforço profundo (Deep Reinforcement Learning - DRL) e propomos um novo arcabouço para preservação e reutilização de conhecimento. Mostramos que a transferência de conhecimento pode fazer uma grande diferença no aprendizado se a origem do conhecimento for escolhida cuidadosa e sistematicamente. Para chegar a este ponto, nós fornecemos uma visão geral da literatura existente de métodos que realizam a transferência de conhecimento para DRL, um campo que tem despontado com frequência na literatura relevante apenas nos últimos dois anos. Em seguida, formulamos a metodologia Raciocínio baseado em Casos (Case-based Reasoning), que descreve uma estrutura para reutilização do conhecimento em termos gerais, na terminologia de Aprendizado por Reforço, para facilitar a adaptação e a comunicação entre as respectivas comunidades. Com base nessa metodologia, propomos Deep Casebased Policy Inference (DECAF) e demonstramos, em uma avaliação experimental, a utilidade de nossa proposta para a aprendizagem sequencial de tarefas, com preservação e reutilização do conhecimento. Nossos resultados destacam os benefícios da transferência de conhecimento e, ao mesmo tempo, conscientizam os desafios que a acompanham. Consideramos o trabalho nesta área como um passo importante para agentes de aprendizagem mais estáveis, capazes de lidar com as tarefas mais complexas, o que seria um passo fundamental para a Inteligência Geral Artificial

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Biblioteca Digital de Teses e Dissertações

Deep learning architecture for gesture recognition

Author: Glatt Ruben
Publication venue: Universidade Estadual Paulista (UNESP)
Publication date
Field of study

O reconhecimento de atividade de visão de computador desempenha um papel importante na investigação para aplicações como interfaces humanas de computador, ambientes inteligentes, vigilância ou sistemas médicos. Neste trabalho, é proposto um sistema de reconhecimento de gestos com base em uma arquitetura de aprendizagem profunda. Ele é usado para analisar o desempenho quando treinado com os dados de entrada multi-modais em um conjunto de dados de linguagem de sinais italiana. A área de pesquisa subjacente é um campo chamado interação homem-máquina. Ele combina a pesquisa sobre interfaces naturais, reconhecimento de gestos e de atividade, aprendizagem de máquina e tecnologias de sensores que são usados para capturar a entrada do meio ambiente para processamento posterior. Essas áreas são introduzidas e os conceitos básicos são descritos. O ambiente de desenvolvimento para o pré-processamento de dados e algoritmos de aprendizagem de máquina programada em Python é descrito e as principais bibliotecas são discutidas. A coleta dos fluxos de dados é explicada e é descrito o conjunto de dados utilizado. A arquitetura proposta de aprendizagem consiste em dois passos. O pré-processamento dos dados de entrada e a arquitetura de aprendizagem. O pré-processamento é limitado a três estratégias diferentes, que são combinadas para oferecer seis diferentes perfis de préprocessamento. No segundo passo, um Deep Belief Network é introduzido e os seus componentes são explicados. Com esta definição, 294 experimentos são realizados com diferentes configurações. As variáveis que são alteradas são as definições de pré-processamento, a estrutura de camadas do modelo, a taxa de aprendizagem de pré-treino e a taxa de aprendizagem de afinação. A avaliação dessas experiências mostra que a abordagem de utilização de uma arquitetura ... (Resumo completo, clicar acesso eletrônico abaixo)Activity recognition from computer vision plays an important role in research towards applications like human computer interfaces, intelligent environments, surveillance or medical systems. In this work, a gesture recognition system based on a deep learning architecture is proposed. It is used to analyze the performance when trained with multi-modal input data on an Italian sign language dataset. The underlying research area is a field called human-machine interaction. It combines research on natural user interfaces, gesture and activity recognition, machine learning and sensor technologies, which are used to capture the environmental input for further processing. Those areas are introduced and the basic concepts are described. The development environment for preprocessing data and programming machine learning algorithms with Python is described and the main libraries are discussed. The gathering of the multi-modal data streams is explained and the used dataset is outlined. The proposed learning architecture consists of two steps. The preprocessing of the input data and the actual learning architecture. The preprocessing is limited to three different strategies, which are combined to offer six different preprocessing profiles. In the second step, a Deep Belief network is introduced and its components are explained. With this setup, 294 experiments are conducted with varying configuration settings. The variables that are altered are the preprocessing settings, the layer structure of the model, the pretraining and the fine-tune learning rate. The evaluation of these experiments show that the approach of using a deep learning architecture on an activity or gesture recognition task yields acceptable results, but has not yet reached a level of maturity, which would allow to use the developed models in serious applications

Improving Deep Reinforcement Learning with Knowledge Transfer

Author: Costa Anna
Glatt Ruben
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 12/02/2017
Field of study

Recent successes in applying Deep Learning techniques on Reinforcement Learning algorithms have led to a wave of breakthrough developments in agent theory and established the field of Deep Reinforcement Learning (DRL). While DRL has shown great results for single task learning, the multi-task case is still underrepresented in the available literature. This D.Sc. research proposal aims at extending DRL to the multi- task case by leveraging the power of Transfer Learning algorithms to improve the training time and results for multi-task learning. Our focus lies on defining a novel framework for scalable DRL agents that detects similarities between tasks and balances various TL techniques, like parameter initialization, policy or skill transfer

Association for the Advancement of Artificial Intelligence: AAAI Publications

Policy Reuse in Deep Reinforcement Learning

Author: Costa Anna
Glatt Ruben
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 12/02/2017
Field of study

Driven by recent developments in Artificial Intelligence research, a promising new technology for building intelligent agents has evolved. The approach is termed Deep Reinforcement Learning and combines the classic field of Reinforcement Learning (RL) with the representational power of modern Deep Learning approaches. It is very well suited for single task learning but needs a long time to learn any new task. To speed up this process, we propose to extend the concept to multi-task learning by adapting Policy Reuse, a Transfer Learning approach from classic RL, to use with Deep Q-Networks

Association for the Advancement of Artificial Intelligence: AAAI Publications

An Advising Framework for Multiagent Reinforcement Learning Systems

Author: Costa Anna
Glatt Ruben
Silva Felipe
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 12/02/2017
Field of study

Reinforcement Learning has long been employed to solve sequential decision-making problems with minimal input data. However, the classical approach requires a long time to learn a suitable policy, especially in Multiagent Systems. The teacher-student framework proposes to mitigate this problem by integrating an advising procedure in the learning process, in which an experienced agent (human or not) can advise a student to guide her exploration. However, the teacher is assumed to be an expert in the learning task. We here propose an advising framework where multiple agents advise each other while learning in a shared environment, and the advisor is not expected to necessarily act optimally. Our experiments in a simulated Robot Soccer environment show that the learning process is improved by incorporating this kind of advice

Association for the Advancement of Artificial Intelligence: AAAI Publications