Search CORE

3 research outputs found

ベイズ母数推定を組み込んだDeep-IRT

Author: Yuchen Cao
曹宇塵
Publication venue
Publication date: 30/08/2021
Field of study

テスト理論分野では，学習者のテスト(課題)への反応を基に，学習者の能力値を高精度に推定することが課題となっている。近年では，学習者の能力値を正しく推定するために，従来からテスト理論分野で用いられている項目反応理論（Item Response Theory:IRT）に深層学習手法を組み合わせたDeep-IRTが開発されている．既存研究ではDeep-IRTはIRTより学習者の能力値を高精度に推定することが示されている。しかし、Deep-IRTはデータ数が少ない場合に学習データに過学習してしまう問題がある。本論文では、少数データにおける過学習を避けるためにベイズ母数推定を組み込んだDeep-IRTを提案する。提案手法ではニューラルネットワークにおける重みとバイアスパラメータを変分推定法を用いてベイズ推定することでパラメータの過学習を避けることができる。評価実験では少数データにおいて提案手法が既存手法よりも学習者の能力値を正しく推定することを示した。さらに，提案手法は学習者の課題への反応を高精度に予測することを示した。電気通信大学202

Creative Repository of Electro-Communications

Automated Test Assembly for Handling Learner Cold-Start in Large-Scale Assessments

Author: Bourda Yolaine
Bruillard Éric
Popineau Fabrice
Vie Jill-Jênn
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2018
Field of study

International audienc

HAL-CentraleSupelec

Crossref

HAL-Rennes 1

Meta Automatic Curriculum Learning

Author: Hofmann Katja
Oudeyer Pierre-Yves
Portelas Rémy
Romac Clément
Publication venue: HAL CCSD
Publication date: 25/01/2021
Field of study

A major challenge in the Deep RL (DRL) community is to train agents able to generalize their control policy over situations never seen in training. Training on diverse tasks has been identified as a key ingredient for good generalization,which pushed researchers towards using rich procedural task generation systems controlled through complex continuous parameter spaces. In such complex taskspaces, it is essential to rely on some form of Automatic Curriculum Learning(ACL) to adapt the task sampling distribution to a given learning agent, instead of randomly sampling tasks, as many could end up being either trivial or unfeasible.Since it is hard to get prior knowledge on such task spaces, many ACL algorithms explore the task space to detect progress niches over time, a costly tabula-rasa process that needs to be performed for each new learning agents, although they might have similarities in their capabilities profiles. To address this limitation, we introduce the concept of Meta-ACL, and formalize it in the context of black-box RL learners, i.e. algorithms seeking to generalize curriculum generation toan (unknown) distribution of learners. In this work, we present AGAIN, a first in-stantiation of Meta-ACL, and showcase its benefits for curriculum generation overclassical ACL in multiple simulated environments including procedurally generated parkour environments with learners of varying morphologies

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server