Abstract
Selecting the most advantageous actions in a changing environment is a central feature of adaptive behavior. The midbrain dopamine (DA) neurons along with the major targets of their projections, including dopaminoceptive neurons in the frontal cortex and basal ganglia, play a key role in this process. Here, we investigate the consequences of a selective genetic disruption of N-methyl-D-aspartate (NMDA) and metabotropic glutamate 5 (mGluR5) receptors in the DA system on adaptive choice behavior, in mice. We tested the effects of the mutation on performance in the probabilistic reinforcement learning and probability discounting tasks. In case of the probabilistic choice, both the loss of NMDA receptors in dopaminergic neurons or loss mGluR5 receptors in D1 receptor-expressing dopaminoceptive neurons reduced the probability of selecting the more rewarded alternative and lowered the likelihood of returning to the previously rewarded alternative (win-stay). When observed behavior was fitted to reinforcement learning models, we found that these two mutations were associated with reduced effect of expected outcome on choice, i.e. more random choices. None of the mutations affected probability discounting, which indicates that all animals had normal ability to assess probability. However, in both behavioral tasks animals with targeted loss of NMDA receptors in dopaminergic neurons or mGluR5 receptors in D1 neurons were significantly slower to perform choices. In conclusion, these results show that glutamate receptor-dependent signaling in the DA system is essential for the speed and accuracy of choices, but at the same time probably not critical for correct estimation of probable outcomes.