TY - JOUR PY - 2015// TI - Uncertainty and exploration in a restless bandit problem JO - Topics in cognitive science A1 - Speekenbrink, Maarten A1 - Konstantinidis, Emmanouil SP - 351 EP - 367 VL - 7 IS - 2 N2 - Decision making in noisy and changing environments requires a fine balance between exploiting knowledge about good courses of action and exploring the environment in order to improve upon this knowledge. We present an experiment on a restless bandit task in which participants made repeated choices between options for which the average rewards changed over time. Comparing a number of computational models of participants' behavior in this task, we find evidence that a substantial number of them balanced exploration and exploitation by considering the probability that an option offers the maximum reward out of all the available options.
Language: en
LA - en SN - 1756-8765 UR - http://dx.doi.org/10.1111/tops.12145 ID - ref1 ER -