TY - JOUR
T1 - Operant matching is a generic outcome of synaptic plasticity based on the covariance between reward and neural activity
AU - Loewenstein, Yonatan
AU - Seung, H. Sebastian
PY - 2006/10/10
Y1 - 2006/10/10
N2 - The probability of choosing an alternative in a long sequence of repeated choices is proportional to the total reward derived from that alternative, a phenomenon known as Herrnstein's matching law. This behavior is remarkably conserved across species and experimental conditions, but its underlying neural mechanisms still are unknown. Here, we propose a neural explanation of this empirical law of behavior. We hypothesize that there are forms of synaptic plasticity driven by the covariance between reward and neural activity and prove mathematically that matching is a generic outcome of such plasticity. Two hypothetical types of synaptic plasticity, embedded in decision-making neural network models, are shown to yield matching behavior in numerical simulations, in accord with our general theorem. We show how this class of models can be tested experimentally by making reward not only contingent on the choices of the subject but also directly contingent on fluctuations in neural activity. Maximization is shown to be a generic outcome of synaptic plasticity driven by the sum of the covariances between reward and all past neural activities.
AB - The probability of choosing an alternative in a long sequence of repeated choices is proportional to the total reward derived from that alternative, a phenomenon known as Herrnstein's matching law. This behavior is remarkably conserved across species and experimental conditions, but its underlying neural mechanisms still are unknown. Here, we propose a neural explanation of this empirical law of behavior. We hypothesize that there are forms of synaptic plasticity driven by the covariance between reward and neural activity and prove mathematically that matching is a generic outcome of such plasticity. Two hypothetical types of synaptic plasticity, embedded in decision-making neural network models, are shown to yield matching behavior in numerical simulations, in accord with our general theorem. We show how this class of models can be tested experimentally by making reward not only contingent on the choices of the subject but also directly contingent on fluctuations in neural activity. Maximization is shown to be a generic outcome of synaptic plasticity driven by the sum of the covariances between reward and all past neural activities.
KW - Decision making
KW - Neuroeconomics
KW - Rational choice theory
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=33750041626&partnerID=8YFLogxK
U2 - 10.1073/pnas.0505220103
DO - 10.1073/pnas.0505220103
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 17008410
AN - SCOPUS:33750041626
SN - 0027-8424
VL - 103
SP - 15224
EP - 15229
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 41
ER -