Search CORE

51,988 research outputs found

Learning to deal with risk: what does reinforcement learning tell us about risk atittudes?.

Author: Burgos Albert
Publication venue
Publication date
Field of study

People are generally reluctant to accept risk. In particular, people overvaluate outcomes that are considered certain, relative to outcomes which are merely probable. At the same time, people is also more willing to accept bets when payoffs involve losses rather than gains. I consider how far adaptive learning can go on in explaining these phenomena. I report simulations in which adaptive learners of the kind studied in Roth & Erev (1995, 1998) and B6rgers & Sarin (1996, 1997) deal with a problem of iterated choice under risk where alternatives differ by a mean preserving spread. The simulations show that adaptive learning induce (on average) risk averse choices. This learning bias is stronger for gains than for losses. Also, risk averse choices are much more likely when one of the alternatives is a certain prospect. The implications of a learning interpretation of risk taking are examined.Iterated choice; reinforcement learning; risk attitudes;

Research Papers in Economics

Peningkatkan Prestasi Belajar Akuntansi Dengan Teknik Bimbingan Self Contracting and Self Reinforcement

Author: Yanuati E. S. (Endang)
Publication venue: State University of Semarang
Publication date: 01/06/2013
Field of study

The objective of the study was to improve students' learnig achievement to complete the Accounting cycle at service companies through the guidance techniques of self- contracting and self reinforcement on the 1st semester students of XI PE 3class in SMK Negeri 1 Pemalang in the academic year of 2011/2012. The previous assessments of students' learning results in completing the Accounting Cycle for service companies showed low average, it was only 65.40, whereas the minimum score was 75. The low achievement could be initiated by students' low motivation and activities. They were reluctant to study because of the domination of lecturing method. Thus, it was needed a more appropriate method to develop the exemplary, to give motivation, and also to give opportunity for students to be optimal in learning. The learning process through the guidance technique of self- contracting and self reinforcement could increase the students' achievement with the average of daily score was 75.98, it was higher than the KKM (minimum score) and the students' completeness was 77.50% which increased about 32.50

Neliti

Directory of Open Access Journals

Students' Schemata Activation in Extensive Reading at Stain Ponorogo

Author: Susilo dkk, Andi
Publication venue: State Islamic Institute of Ponorogo
Publication date: 01/01/2013
Field of study

This study was aimed at investigating the extent to which the lecturer employed strategy and occupied effective classroom language to assist students' schemata activation on Extensive Reading class at English DepartmentSTAIN Ponorogo. To meet with the objectives, qualitative case study formed the methodological basis of this present research involving an extensive reading lecturer as the research subject with one of her respective classes consisted of 32 students of fourth semester. The data were derived from lecturer's utterances (verbal) and body language (nonverbal). Those data were obtained from the transcripts of the recorded lecturer's utterances during two periods of meeting, and notetaking taken from observations and interviews. The results revealed that the lecturer used to employ questioning technique to activate students' schemata. Various strategies were predominantly occupied in lecturer's questioning behaviors to engage students in questionanswer activities, such as probing, redirecting and reinforcement. Generally, those strategies were observed not only to provide motivation and focus students' attention towards the topic being discussed, but also to give a wide chance of opportunity for them to recall their prior knowledge and ease them to predict the content of reading texts they were going to read. Besides, the language the lecturer used under this investigation was fairly fulfilled the requirements to be communicative as some communicative features of talks were utilized properly, such as referential questions, content feedback, and speech modification to optimize students' participation and performance in the process of reading

Neliti

Directory of Open Access Journals

RUMAH JURNAL IAIN PONOROGO

Meta Inverse Reinforcement Learning via Maximum Reward Sharing for Human Motion Analysis

Author: Burdick Joel W.
Li Kun
Publication venue
Publication date: 12/10/2017
Field of study

This work handles the inverse reinforcement learning (IRL) problem where only a small number of demonstrations are available from a demonstrator for each high-dimensional task, insufficient to estimate an accurate reward function. Observing that each demonstrator has an inherent reward for each state and the task-specific behaviors mainly depend on a small number of key states, we propose a meta IRL algorithm that first models the reward function for each task as a distribution conditioned on a baseline reward function shared by all tasks and dependent only on the demonstrator, and then finds the most likely reward function in the distribution that explains the task-specific behaviors. We test the method in a simulated environment on path planning tasks with limited demonstrations, and show that the accuracy of the learned reward function is significantly improved. We also apply the method to analyze the motion of a patient under rehabilitation.Comment: arXiv admin note: text overlap with arXiv:1707.0939

arXiv.org e-Print Archive

Grading Changes after a Writing Faculty Workshop

Author: Andrade Glenna M
Platania Judith
Publication venue: DOCS@RWU
Publication date: 01/01/2009
Field of study

After a workshop on student outcomes for the first-year writing course, the 28 faculty participants discussed the implications of “Development” for critical thinking. This case study of one college’s participatory exercise in improving writing found that although the RWU faculty lacked consensus on the definition, simply discussing topic of “Development” may have had the unintended effect of fewer A grades in the following semester. Unfortunately, the percentage of A grades ascended in the subsequent semesters to suggest that without reinforcement, faculty returned to grade inflation

DOCS@RWU

HELIN Digital Commons

Using rewards and penalties to obtain desired subject performance

Author: Allen R. W.
Cook M.
Jex H. R.
Stein A. C.
Publication venue
Publication date: 15/12/1981
Field of study

Operant conditioning procedures, specifically the use of negative reinforcement, in achieving stable learning behavior is described. The critical tracking test (CTT) a method of detecting human operator impairment was tested. A pass level is set for each subject, based on that subject's asymptotic skill level while sober. It is critical that complete training take place before the individualized pass level is set in order that the impairment can be detected. The results provide a more general basis for the application of reward/penalty structures in manual control research

NASA Technical Reports Server

On ‘common-sense ontology’:A comment on the paper by Frank Hindriks and Francesco Guala

Author: Hume
Lewis
Preston
ROBERT SUGDEN
Schelling
Sugden
Ullmann-Margalit
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 04/02/2015
Field of study

This note comments on Hindriks and Guala’s ‘unified theory of institutions’. One of the components that Hindriks and Guala seek to unify, and which they claim is unsatisfactory on its own, is the analysis of conventions that derives from the work of Lewis. I argue that the Lewisian approach provides simple and powerful explanations of many regularities in the social behaviour of humans and other animals. Those explanations can be seen as good social science even if, as Hindriks and Guala argue, they do not fit with common-sense ontology

Crossref

University of East Anglia digital repository

Reinforcement Learning in Investment Behavior

Author: Andrew Metrick
Brigitte Madrain
David Laibson
James Choi
Publication venue
Publication date
Field of study

Research Papers in Economics

Improving basic life support training for medical students

Author: Gadhvi K
Lami M
Nair P
Publication venue: 'Dove Medical Press Ltd.'
Publication date: 25/01/2016
Field of study

Mariam Lami, Pooja Nair, Karishma GadhviFaculty of Medicine, Imperial College, London, London, UKAbstract: Questions have been raised about basic life support (BLS) training in medical education. This article addresses the research evidence behind why BLS training is inadequate and suggests recommendations for improving BLS training for medical students.Keywords: medical education, basic life suppor

Directory of Open Access Journals

PubMed Central

Spiral - Imperial College Digital Repository