EFFECTS OF RESPONSE FREQUENCY CONSTRAINTS ON LEARNING IN A NON-STATIONARY MULTI-ARMED BANDIT TASK
An n-armed bandit task was used to investigate the trade-off between exploratory (choosing lesser-known options) and exploitive (choosing options with the greatest probability of reinforcement) human choice in a trial-and-error learning problem. In Experiment 1 a different probability of reinforceme...
Main Author: | |
---|---|
Format: | Others |
Published: |
OpenSIUC
2009
|
Subjects: | |
Online Access: | https://opensiuc.lib.siu.edu/dissertations/86 https://opensiuc.lib.siu.edu/cgi/viewcontent.cgi?article=1086&context=dissertations |