===Bush and Mosteller Learning Curve===
This section is drawn mostly from Paul W. Glimcher's paper in PNAS.<ref>Glimcher, P. W. (2011). Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis. Proceedings of the National Academy of Sciences of the United States of America, 108(Suppl 3), 15647–54.</ref>
This very general idea of conditional learning was first mathematically formalized when Bush and Mosteller<ref>Bush RR, Mosteller F (1951) A mathematical model for simple learning. Psychol Rev 58:313–323</ref> proposed that the probability of Pavlov's<ref>Pavlov IP (1927) Conditioned Reflexes: An Investigation of the Physiological Activity of the Cerebral Cortex (Dover, New York).</ref> dog expressing the salivary response on sequential trials could be computed through an iterative equation where