Primate Orbitofrontal Cortex Codes Information Relevant for Managing Explore-Exploit Tradeoffs. 2020

Vincent D Costa and Bruno B Averbeck
Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, Oregon 97239-3098; costav@ohsu.edu

Reinforcement learning (RL) refers to the behavioral process of learning to obtain reward and avoid punishment. An important component of RL is managing explore-exploit tradeoffs, which refers to the problem of choosing between exploiting options with known values and exploring unfamiliar options. We examined correlates of this tradeoff, as well as other RL related variables, in orbitofrontal cortex (OFC) while three male monkeys performed a three-armed bandit learning task. During the task, novel choice options periodically replaced familiar options. The values of the novel options were unknown, and the monkeys had to explore them to see if they were better than other currently available options. The identity of the chosen stimulus and the reward outcome were strongly encoded in the responses of single OFC neurons. These two variables define the states and state transitions in our model that are relevant to decision-making. The chosen value of the option and the relative value of exploring that option were encoded at intermediate levels. We also found that OFC value coding was stimulus specific, as opposed to coding value independent of the identity of the option. The location of the option and the value of the current environment were encoded at low levels. Therefore, we found encoding of the variables relevant to learning and managing explore-exploit tradeoffs in OFC. These results are consistent with findings in the ventral striatum and amygdala and show that this monosynaptically connected network plays an important role in learning based on the immediate and future consequences of choices.

SIGNIFICANCE STATEMENT

Orbitofrontal cortex (OFC) has been implicated in representing the expected values of choices. Here we extend these results and show that OFC also encodes information relevant to managing explore-exploit tradeoffs. Specifically, OFC encodes an exploration bonus, which characterizes the relative value of exploring novel choice options. OFC also strongly encodes the identity of the chosen stimulus, and reward outcomes, which are necessary for computing the value of novel and familiar options.
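The task structure described above (three options, with a novel option periodically replacing a familiar one) can be illustrated with a minimal simulation. This is only a sketch under stated assumptions and is not the authors' model, whose details are not given in the abstract. The Bernoulli rewards, the Beta(1, 1) value estimate, the count-based form of the exploration bonus, and every name and parameter (`simulate_novelty_bandit`, `swap_every`, `bonus_weight`) are hypothetical choices made for illustration.

```python
import random


def simulate_novelty_bandit(n_trials=200, swap_every=20, bonus_weight=0.5, seed=0):
    """Simulate a three-armed bandit with periodic novel options.

    All parameters are illustrative assumptions, not values from the study.
    Returns a list of (choice, reward, was_novel) tuples, one per trial.
    """
    rng = random.Random(seed)

    def new_option():
        # Hidden reward probability plus the agent's running counts.
        return {"p_reward": rng.random(), "wins": 0, "pulls": 0}

    def score(option):
        # Posterior mean reward under a uniform Beta(1, 1) prior.
        value = (option["wins"] + 1) / (option["pulls"] + 2)
        # Assumed exploration bonus that shrinks as the option is sampled.
        bonus = bonus_weight / (option["pulls"] + 1)
        return value + bonus

    options = [new_option() for _ in range(3)]
    history = []
    for trial in range(n_trials):
        # Periodically replace one randomly selected option with a novel one.
        if trial > 0 and trial % swap_every == 0:
            options[rng.randrange(3)] = new_option()

        # Greedy choice on estimated value plus exploration bonus.
        choice = max(range(3), key=lambda i: score(options[i]))
        was_novel = options[choice]["pulls"] == 0

        # Bernoulli reward drawn from the chosen option's hidden probability.
        reward = 1 if rng.random() < options[choice]["p_reward"] else 0
        options[choice]["wins"] += reward
        options[choice]["pulls"] += 1
        history.append((choice, reward, was_novel))
    return history


if __name__ == "__main__":
    history = simulate_novelty_bandit()
    n_explore = sum(was_novel for _, _, was_novel in history)
    print(f"{n_explore} of {len(history)} choices sampled a never-chosen option")
```

In this sketch the chosen option and the reward are the only observations that update the agent's estimates, which loosely mirrors the abstract's point that stimulus identity and reward outcome define the states and state transitions relevant to the decision. Setting `bonus_weight` to 0 removes the assumed bonus, so a novel option is then selected only when its prior value of 0.5 beats the familiar estimates.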

UI | MeSH Term | Description | Entries
D007858 | Learning | Relatively permanent change in behavior that is the result of past experience or practice. The concept includes the acquisition of knowledge. | Phenomenography
D008253 | Macaca mulatta | A species of the genus MACACA inhabiting India, China, and other parts of Asia. The species is used extensively in biomedical research and adapts very well to living with humans. | Chinese Rhesus Macaques,Macaca mulatta lasiota,Monkey, Rhesus,Rhesus Monkey,Rhesus Macaque,Chinese Rhesus Macaque,Macaca mulatta lasiotas,Macaque, Rhesus,Rhesus Macaque, Chinese,Rhesus Macaques,Rhesus Macaques, Chinese,Rhesus Monkeys
D008297 | Male | | Males
D009474 | Neurons | The basic cellular units of nervous tissue. Each neuron consists of a body, an axon, and dendrites. Their purpose is to receive, conduct, and transmit impulses in the NERVOUS SYSTEM. | Nerve Cells,Cell, Nerve,Cells, Nerve,Nerve Cell,Neuron
D011597 | Psychomotor Performance | The coordination of a sensory or ideational (cognitive) process and a motor activity. | Perceptual Motor Performance,Sensory Motor Performance,Visual Motor Coordination,Coordination, Visual Motor,Coordinations, Visual Motor,Motor Coordination, Visual,Motor Coordinations, Visual,Motor Performance, Perceptual,Motor Performance, Sensory,Motor Performances, Perceptual,Motor Performances, Sensory,Perceptual Motor Performances,Performance, Perceptual Motor,Performance, Psychomotor,Performance, Sensory Motor,Performances, Perceptual Motor,Performances, Psychomotor,Performances, Sensory Motor,Psychomotor Performances,Sensory Motor Performances,Visual Motor Coordinations
D011678 | Punishment | The application of an unpleasant stimulus or penalty for the purpose of eliminating or correcting undesirable behavior. | Punishments
D002755 | Choice Behavior | The act of making a selection among two or more alternatives, usually after a period of deliberation. | Approach Behavior,Approach Behaviors,Behavior, Approach,Behavior, Choice,Behaviors, Approach,Behaviors, Choice,Choice Behaviors
D003216 | Conditioning, Operant | Learning situations in which the sequence responses of the subject are instrumental in producing reinforcement. When the correct response occurs, which involves the selection from among a repertoire of responses, the subject is immediately reinforced. | Instrumental Learning,Learning, Instrumental,Operant Conditioning,Conditionings, Operant,Instrumental Learnings,Learnings, Instrumental,Operant Conditionings
D005106 | Exploratory Behavior | The tendency to explore or investigate a novel environment. It is considered a motivation not clearly distinguishable from curiosity. | Curiosity,Novelty-Seeking Behavior,Behavior, Exploratory,Behavior, Novelty-Seeking,Behaviors, Exploratory,Behaviors, Novelty-Seeking,Curiosities,Exploratory Behaviors,Novelty Seeking Behavior,Novelty-Seeking Behaviors
D000679 | Amygdala | Almond-shaped group of basal nuclei anterior to the INFERIOR HORN OF THE LATERAL VENTRICLE of the TEMPORAL LOBE. The amygdala is part of the limbic system. | Amygdaloid Body,Amygdaloid Nuclear Complex,Amygdaloid Nucleus,Archistriatum,Corpus Amygdaloideum,Intercalated Amygdaloid Nuclei,Massa Intercalata,Nucleus Amygdalae,Amygdalae, Nucleus,Amygdaloid Bodies,Amygdaloid Nuclear Complices,Amygdaloid Nuclei, Intercalated,Amygdaloid Nucleus, Intercalated,Amygdaloideum, Corpus,Amygdaloideums, Corpus,Archistriatums,Complex, Amygdaloid Nuclear,Complices, Amygdaloid Nuclear,Corpus Amygdaloideums,Intercalata, Massa,Intercalatas, Massa,Intercalated Amygdaloid Nucleus,Massa Intercalatas,Nuclear Complex, Amygdaloid,Nuclear Complices, Amygdaloid,Nuclei, Intercalated Amygdaloid,Nucleus, Amygdaloid,Nucleus, Intercalated Amygdaloid

Related Publications

Vincent D Costa and Bruno B Averbeck
July 2020, The Journal of neuroscience : the official journal of the Society for Neuroscience,