Basal Ganglia Activity Mirrors a Benefit of Action and Reward on Long-Lasting Event Memory

The expectation of reward is known to enhance a consolidation of long-term memory for events. We tested whether this effect is driven by positive valence or action requirements tied to expected reward. Using a functional magnetic resonance imaging (fMRI) paradigm in young adults, novel images predicted gain or loss outcomes, which in turn were either obtained or avoided by action or inaction. After 24 h, memory for these images reflected a benefit of action as well as a congruence of action requirements and valence, namely, action for reward and inaction for avoidance. fMRI responses in the hippocampus, a region known to be critical for long-term memory function, reflected the anticipation of inaction. In contrast, activity in the putamen mirrored the congruence of action requirement and valence, whereas other basal ganglia regions mirrored overall action benefits on long-lasting memory. The findings indicate a novel type of functional division between the hippocampus and the basal ganglia in the motivational regulation of long-term memory consolidation, which favors remembering events that are worth acting for.


Introduction
Episodes that are experienced at any given moment can usually be remembered a few minutes later, but only a fraction of episodes are consolidated as long-lasting memories. Why some episodes are consolidated and others forgotten is still not fully understood. Converging evidence points toward the importance of motivational valence (Shohamy and Adcock 2010;Lisman et al. 2011), because events that predict rewards are more likely to be remembered after long delay intervals (24 h and longer) than those associated with neutral outcomes (Wittmann et al. 2005(Wittmann et al. , 2011Adcock et al. 2006). However, recent studies indicate that valence representations are also closely coupled to action requirements (Crockett et al. 2009;Guitart-Masip et al. 2011Levita et al. 2012), suggesting that the explanatory power of valence in isolation could be limited. Therefore, instead of valence, memory performance may depend on whether rewards are obtained through action or inaction.
Motivational influences on episodic long-term memory are likely to be mediated by functional anatomical loops between the hippocampus and the dopaminergic neurons (Morris 2006;Hansen and Manahan-Vaughan 2014) in the substantia nigra/ventral tegmental area (SN/VTA; Lisman and Grace 2005;Lisman et al. 2011). Pharmacological studies in rats (O'Carroll et al. 2006) and aging humans ) have shown that dopamine can improve hippocampal-dependent memory consolidation. Likewise, reward anticipation (Wittmann et al. 2005(Wittmann et al. , 2011Adcock et al. 2006;Callan and Schweighofer 2008;Wolosin et al. 2012) boosts memory consolidation via coactivation (Wittmann et al. 2005) and increased functional connectivity (Adcock et al. 2006) of hippocampus and SN/VTA. In human studies of reward anticipation, the need to perform an impending action has been shown to dominate brain responses in the striatum and the SN/VTA and has a greater impact here than the anticipation of expected value . Together with previous observations (Tricomi et al. 2004;Zink et al. 2004), these findings show that motivational brain activity is strongly associated with the anticipated action needed to obtain a reward. Evidence for this intrinsic coupling of anticipated action and reward (Dayan et al. 2006) in dopaminergic circuitry ) is also seen following pharmacological manipulations .
These insights regarding the relationship between action and valence can provide new perspectives on the motivational regulation of memory consolidation. For example, asymmetry in memory persistence based on the congruence of action and reward could be adaptive in an evolutionary sense because an action that led to reward may be the most valuable information to retain, especially compared with rewards that are gained passively. Additionally, value representations themselves may be affected by an association with action (Schonberg et al. 2014), which also could enhance the memory benefit further. We hypothesized that the need to perform an impending action would modulate memory consolidation beyond any effects attributable to reward expectation per se. To address this, we adapted a design ) that orthogonalizes the anticipation of action and valence to the context of a memory encoding experiment while we acquired simultaneous fMRI data (Fig. 1). This allowed us to investigate whether action, reward valence, or an interaction of the two affected memory persistence. The design also allowed to contrast reward for action and identical reward for inaction. We were particularly interested in determining whether the ability to remember new information under the anticipatory influence of reward and action requirement would be determined by the hippocampus, by the basal ganglia and the SN/VTA (i.e., Chowdhury et al. 2012), or both. A behavioral follow-up study was conducted with a 30-min delay between encoding and retrieval in order to investigate the time course of the memory modulation.

Participants
Thirty-three healthy right-handed participants who provided informed consent took part in the study. All had normal or corrected-to-normal vision and no history of psychological or neurological illness. One participant was excluded for poor overall memory (overall corrected hit rate below 0) and 3 were excluded for signal dropout in the midbrain (detection of loss of inferior SN/VTA regions was guided by a priori anatomical masks), leaving 29 participants (12 male, 18-31 years, 26.3 years mean age). A behavioral follow-up study with the 30-min delay between sessions contained 30 subjects (13 males, 18-34 years, 22 years mean age). The study was approved by the Institute of Neurology (University College London) Research Ethics Committee. Participants received reimbursement for their participation.

Stimuli
Stimuli consisted of 480 grayscale images of cars, boats, motorbikes, and airplanes (320 used in the task and 160 used as lures for the memory task). The stimuli were presented in randomized order on a gray background using Cogent 2000 (www.vislab.ucl.ac. uk/cogent.php) running in MATLAB.

Experimental Protocol
Two experiments were conducted in 2 independent samples: 1 fMRI experiment and 1 behavioral follow-up. Each study consisted of 2 sessions. Session 1 (Fig. 1) consisted of 4 experimental runs during which functional MRI data were acquired (for the Figure 1. Experimental paradigm performed in the scanner. Subjects were presented 80 trials in each of 4 consecutive blocks. In each trial, a trial-unique image from 1 of 4 categories (randomized across participants) informed subjects about the 2 factors: Whether to respond by indicating the position of a circle with an either left or right button press or to omit a response entirely (Go or No-Go); whether they could win a reward or avoid a loss by performing a correct action or inaction (Win or Avoid).
On Win trials, subjects could win £1 if they performed correctly, or gain £0 if they performed incorrectly. On Avoid trials, subjects could gain £0 instead of losing £1 by performing correctly. The outcomes after a correct response were 70% probabilistic. To disassociate anticipation from actual performance, half of the trials ended after the cue image and were directly followed by a new cue image. Subjects were trained extensively with a different set of images that were not trial unique before entering the scanner. follow-up group, the experiment was conducted with no scanning, simply in front of a computer). Each run consisted of 80 trials. Each trial consisted of the 3 following events: Presentation of a trial-unique image (of 1 of 4 image categories: cars, boats, motorbikes, and airplanes) for 3000 ms; the display of a circle for 2000 ms on either the left or right side of the screen; the display of the outcome for 1000 ms represented by either a green arrow pointing up (win), a yellow horizontal bar (neutral), or a red arrow pointing down (loss). Each of the 3 events was followed by a fixation cross for a variable interval of 1000-2000 ms. Half of the trials were aborted after the fixation cross following the display of the image. This allows to decorrelate the encoding event (and anticipation of action and reward from) from the actual performance and outcome (however not performance and outcome from each other).
The image category instructed the participant whether on this trial it would be required to indicate the position of the circle with a keypress (Go) or omit the response entirely (No-Go). The Go response had to be entered in under 700 ms in order to be registered. The images also informed the participant about the potential valence of the outcome: win/neutral (Win condition) or neutral/loss (Avoid condition). In each trial, £1could either be won or lost. The outcome was probabilistic to 70% for correct responses (e.g., 30% of correct Win trials produced a neutral outcome) and deterministic for incorrect responses. The mapping of the image categories to the experimental conditions was counterbalanced across participants. Because wins and avoided losses were of equal magnitude in absolute terms for the Go and No-Go conditions, this design holds expected value constant within each action condition enabling us to separately probe the effect of action and outcome valence. The 4 conditions can be considered of high salience because they all signal a potentially affective outcome (Bromberg-Martin et al. 2010;Guitart-Masip et al. 2014). Note that due to the outcomes being probabilistic subjects faced the whole range of outcomes even at perfect performance.
Each of the 4 experimental runs contained 20 trials per condition (Go Win, Go Avoid, No-Go Win, and No-Go Avoid), half of which were aborted after the display of the image and did not require the Go/No-Go response.
Subjects were instructed that their performance across all stages of the experiment would affect their payment, resulting in a total outcome of up to £45. To ensure that subjects learned the meaning of the image categories, subjects completed 3 runs of training. The first run consisted of 5 trials in which subjects were asked to indicate the position of the circle with a button press. The second run was 1 block in which no trial was aborted after image display, and the outcome was accompanied by text providing feedback whether the response was correct. To ensure subjects were familiar with the manipulation by which half of the trials were aborted, another run of training consisted of a shortened version of an actual experimental run (∼30 min of total training time). During the training the images were not trial unique but a repeating subset, that was not used in the actual experimental runs. See Guitart-Masip et al. (2011) for a similar experimental paradigm in which fractal images were used instead of image categories to indicate the condition of the trial. Note that, in this paradigm, the instructions are semantically embedded in the memoranda (images of vehicles). Reactions can only be planned post perception and semantic classification of the stimulus. This ensures a lower bound of equal encoding across all conditions. Session 2 took place the following day for the fMRI group and after a 30-min break for the behavioral follow-up experiment group. Subjects were instructed in advance that there would be a second session but not what type of task the second session would include. Subjects were presented with all 80 images of each image category they saw in the experimental runs of Session 1 plus 40 images of each category they had not seen before (this included cue images of trials in which the response and outcome parts of the trial were skipped). For each image, subjects were asked to indicate whether they had seen the image in Session 1 or not. After that, subjects were asked to rate their confidence in their answer as either "very sure," "sure," or "uncertain." If subjects responded they saw the image in Session 1, the additional confidence option "remember" was available. Subjects were asked to use this option if they had a vivid recollection of the full context in which they saw the image. Furthermore, if subjects responded they saw the image in Session 1, they were also asked whether or not the image was directly followed by a circle. Images were presented in randomized order. There was no reaction time limit for any of the questions in Session 2 (session duration ∼50 min).

MRI Data Acquisition
The study was conducted at the Wellcome Trust Center for Neuroimaging at University College London using a 3-T Siemens Trio scanner equipped with a 32-channel Siemens head coil.

Functional MRI Data Acquisition
Functional scans used a gradient-echo sequence optimized for hippocampus coverage: TR ( per volume) = 2.975 s; TE = 37 ms; 2.3 mm isotropic image resolution; interleaved acquisition order; matrix size = 96 × 96; field of view = 192 × 222 mm. A total of 35 axial slices (−45°tilt) were sampled for partial brain coverage including the hippocampus and the striatum.
A field map was recorded using a double-echo FLASH sequence (matrix size = 64 × 64; 64 slices; spatial resolution: 3 × 3 × 2 mm; 1 mm gap; TE1 = 10 ms; TE2 = 12.46 ms; TR = 1020 ms) for distortion correction of the acquired functional images. Field maps were estimated from the phase difference between the images acquired at the short and long TE with the FieldMap toolbox for SPM8.
Participants' pulse and breathing were measured synchronized with MRI scanner slice pulses using the Spike2 data acquisition system (Cambridge Electronic Design Limited, Cambridge, UK). The cardiac pulse signal was measured using an pulse oximeter (Model 8600 F0, Nonin Medical, Inc., Plymouth, MN, USA) attached to the participants' left index finger. Thoracic movement was recorded using a pneumatic belt positioned around the abdomen.

Analysis of Behavioral Data
The data were analyzed using Matlab R2012b and SPSS 19. Response speed in Go trials was analyzed by testing the speed of correct Go responses between the 2 Go conditions (Go-Win vs. Go-Avoid) with a paired t-test. The number of correct Go/ No-Go responses per condition was analyzed with a 2 × 2 repeated-measures ANOVA with action (Go/No-Go) and valence (Win/Avoid) as factors. As there were not enough "remember" responses for an fMRI analysis, a memory score was calculated that weighted correct and incorrect responses according to their confidence (e.g., a correct recognition response with a "remember" confidence rating added 4 points to the total while an incorrect recognition response with a "sure" confidence rating subtracted 2 points from the total). The memory score was analyzed with a 2 × 2 repeated-measures ANOVA with action (Go/No-Go) and valence (Win/Avoid) as factors. For the group comparison, a between-subject factor was added. Post hoc t-tests were performed to further analyze these effects.

fMRI Data Preprocessing
Imaging data were analyzed with Statistical Parametric Mapping (SPM8; Wellcome Trust Centre for Neuroimaging, London, UK; http://www.fil.ion.ucl.ac.uk/spm). Images were bias-corrected for intensity inhomogeneities, realigned with the first volume (after discarding the first 6 dummy volumes) and unwarped, coregistered with the structural image, normalized to a standard echo-planar imaging template based on the Montreal Neurological Institute reference brain, resampled to 2 × 2 × 2 mm 3 voxels, and spatially smoothed (4-mm full-width at half-maximum). The magnetization transfer images were averaged across subjects after normalization in order to identify the SN/VTA and create a group-specific template (Duzel et al. 2009).

fMRI Data Analysis
The fMRI data were analyzed in a 2 × 2 [Action (Go/No-Go) × Valence (Win/Avoid)] design to mirror the behavioral analysis. For each participant, a statistical model was computed by applying a canonical hemodynamic response function combined with time and dispersion derivatives. The time series indicated the temporal position of: display of image for each of the 4 conditions (Go Win, Go Avoid, No-Go Win, and No-Go Avoid), the onset of the circles (for each Go and No-Go trials in which subjects correctly responded), display of outcomes for the 3 possible outcomes (win, neutral, and loss), and display of image for trials with an incorrect response. Note that this includes the trials that were aborted after the cue image. To mirror the behavioral analysis of the data, each of the 4 conditions indicating the cue onset was parametrically modulated by memory strength (ranging from "not seen before-very sure" to "seen before-remembered"). To remove physiological noise, a model was constructed to account for artifacts related to cardiac and respiratory phase and changes in respiratory volume using an in-house developed Matlab toolbox (Hutton et al. 2011). A total of 14 regressors for physiological noise reduction and 6 motion correction regressors estimated from the realignment procedure were entered as covariates of no interest ( physiological data were available for all but 2 participants, for whom just 6 motion correction regressors were entered). Contrast images were entered into two 2 × 2 ANOVAs (Action by Valence) for the second-level random-effects analysis, one containing the contrasts of the 4 conditions coding the image onset (cue activity) and the other containing the 4 parametric modulators of memory strength for each of those conditions (memory parameter). The main effect of memory was assessed by weighting all 4 conditions containing the memory parameter over baseline. The main effects of action (and its inverse for inaction) and valence were assessed for both second-level models. All these effects were assessed at the whole-brain FWE clustercorrected level after voxel-wise thresholding P < 0.001. The memory contrast (all memory parameters against baseline) was masked with bilateral anatomical masks of the hippocampus. Furthermore, we investigated the interaction of action and valence in the memory parameter using a small-volume correction for a bilateral anatomical mask including the hippocampus and the basal ganglia (after thresholding at P < 0.001 uncorrected) due to their role in memory and motivation. For all clusters of interest, the average activation over all voxels in the respective functional regions of interest (ROIs) was extracted using the MarsBaR toolbox (Brett et al. 2002) and analyzed using a 2 × 2 ANOVA (Action by Valence). Post hoc t-tests were used to investigate the precise differences across conditions that created the observed effects in the ANOVAs.
For all parts of the fMRI analysis, the WFU PickAtlas toolbox for SPM was used to identify the anatomical location of peak voxels (Lancaster et al. 2000;Maldjian et al. 2003). In the case of the SN/VTA, the anatomical mask was created based on the average magnetization transfer image of all subjects. The average activation over all voxels in the individual ROIs was extracted using the MarsBaR toolbox (Brett et al. 2002).

Results
During the scanning session, subjects' responses (Go or No-Go) after the cue were 97.2% correct and this did not differ across conditions (Fig. 2a, main effect of action: F 1,28 = 1.05, P > 0.3; main effect of valence: F 1,28 = 1.79, P > 0.1; interaction: F 1,28 = 1.25, P > 0.2). Correct Go responses were significantly [t (28) = 3.83, P = 0.001] faster in Go Win (445.43 ± 82.23 ms) than in Go Avoid trials (476.32 ± 102.07 ms), indicating that participants anticipated the valence associated with the different conditions along with the associated action requirements (Fig. 2b). As in previous versions of the task , the number of responses that were correct yet below the instructed 700 ms threshold was higher in the Go Win compared with the Go Avoid condition [t (28) = 2.92, P = 0.007]. The reaction time facilitation by reward valence demonstrates that subjects processed valence, even though valence information was not instrumentally essential in expressing the correct action requirements.

The Behavioral Effect of Action and Valence on Memory After 24 h and 30 min
Twenty-four-hour memory accuracy across conditions was assessed by calculating a memory score that weighted answers according to their confidence judgment and added or subtracted from the total depending on whether the answer was correct. A positive score means that correct answers were given more confidently and/or frequently than wrong answers. As shown in Figure 2c, memory score showed both a main effect of action (F 1,28 = 12.54, P = 0.001) and an interaction (F 1,28 = 4.7, P = 0.039) but no effect of valence (F 1,21 = 0.1, P > 0.9). These effects were driven by the Go versus No-Go difference between the Win conditions [t (28) = 3.65, P = 0.001] and the lack of a Go versus No-Go difference in the Avoid losing conditions [t (28) = 1.51, P > 0.1]. The Go Win condition also showed better memory compared with the No-Go Avoid condition [t (28) = 2.32, P = 0.028], and the No-Go Win condition showed significantly lower memory performance than the Go Avoid condition [t (28) = 3.1, P = 0.005]. This score also contained "Remember" responses, which showed a highly similar pattern [ Fig. 2d; main effect of action (F 1,28 = 4.89, P = 0.035) and an interaction (F 1,28 = 4.6, P = 0.041); Go Win vs. No-Go Win (t (28) = 2.79, P = 0.009); see Supplementary Materials for recognition memory]. Note that we found no connection between reaction time to the action cue and subsequent memory.
The behavioral experiment was repeated without fMRI scanning but now with a 30-min delay (instead of 24 h) in order to determine whether the effect of action on memory was already present even after a short delay. This would be compatible with a strong effect of action on encoding. When the experiment was repeated with a 30-min delay, the main effect of action did not replicate (though showed a trend-level effect, F 1,29 = 2.74, P = 0.11), and the interaction of action and valence did replicate (F 1,29 = 13.11, P = 0.001). Testing the effect of delay length on the effect of action or the interaction of action and valence yielded no significant results. Thus, although we do not see statistically significant group differences, these data indicate that a strong behavioral main effect of action observed after 24 h is not explained as a residue of a stronger encoding modulation by action visible after 30 min.

fMRI Analyses
To examine how the experimental conditions affect memory encoding at a neural level, we analyzed the blood oxygen level-dependent activity evoked by image onset (hereafter "cue activity"). The data were analyzed in a 2 × 2 ANOVA containing the factor of action (Go/No-Go) and valence (Win/Avoid). The same ANOVA was constructed with the parametric modulation of each of the 4 conditions, containing a 7-point subsequent memory rating ranging from "sure not seen before" to "remembered seen before." This activation parametrically reflected activity correlated with memory fate after 24 h (hereafter "memory parameter"). Note that these 2 measures are independent of each other and can be interpreted separately: While the cue activity describes the overall activity in a condition (like an intercept), the memory parameter describes how much the activity increases as a function of subsequent memory (like a slope).
The factorial design and the orthogonal nature of the contrasts allowed us to employ a functional ROI approach that extracts the beta estimates of all conditions from regions that have been independently selected. The fMRI analyses were conducted on statistical maps (thresholded at P < 0.001 uncorrected) that were corrected for multiple comparisons on the whole-brain level or for a bilateral anatomical ROI including the basal ganglia and the hippocampus together.

Neural Effects of Action, Inaction, and Valence
Examining the cue activity in the contrast of Go over No-Go (action) and its inverse (inaction) revealed large sets of significant clusters (see Supplementary Materials for full list). Most dominantly, the effect of action revealed large whole-brain-corrected clusters spanning the midbrain, the thalamus, and the entire striatum ( peaking at −14 −18 16, z = 7.34, k = 4718, Fig. 3). There were no whole-brain-corrected effects of valence anticipation. This activation pattern replicated previous findings . However, a positive effect of valence was present in the action cluster in the SN/VTA (Fig. 3c,d, F 1,28 = 4.81, P = 0.037). This result shows that the statistical power of the current study can uncover action-independent reward representations, which are dominated be action representations .

Modulation of Subsequent Memory Effects by Action and Valence
First, we identified brain regions which showed a main effect of subsequent memory (memory parameter averaged across all conditions). This analysis resulted in significant clusters in medial temporal and frontal lobes (see Supplementary Materials for full list). Memory parameters from each of the 4 experimental conditions were extracted from these clusters to investigate the effects of action and valence on memory. However, we found that within these clusters, memory parameters did not differ across conditions (all main effects and interactions not significant). Thus, in regions selected for a subsequent memory effect, the activity difference between subsequently recognized/remembered items and those items that were forgotten did not differ across action and valence conditions. As the hippocampus was of specific interest here due to its central role in memory (Vargha-Khadem et al. 1997;Eichenbaum et al. 2007;Ranganath 2010;Squire and Wixted 2011), we anatomically restricted the activation map showing a significant effect of memory parameter across conditions to bilateral hippocampi (note that this cluster remained significant whole-brain FWE-corrected). While this hippocampal cluster showed a positive memory parameter in every condition except No-Go Win, we again found no significant differences across action/valence conditions for the memory parameter within the hippocampus (see Supplementary Materials for details). While the hippocampus reflected subsequent memory, its overall activity levels showed increased activity in the No-Go conditions (see above for effect of inaction), suggesting that the hippocampal activity may not alone account for the observed behavioral memory performance differences across conditions. Therefore, we conducted a more specific analysis that directly targeted the interaction of the memory parameter with action and valence. On the basis of previous work on the relationship between memory and motivation (Haber and Knutson 2010;Guitart-Masip et al. 2011;Chowdhury et al. 2012), this analysis was restricted to the hippocampus and the basal ganglia. Investigating the interaction effect on the memory parameter in the hippocampus and basal ganglia revealed a significant cluster in the right putamen (Fig. 4, FWE small-volume correction for a priori anatomical ROI consisting of the hippocampus and basal ganglia peaking at 26 8 8, P = 0.037, z = 4.27, k = 32). Of course, the finding of a neural interaction effect does not mean that this signal is necessary to produce the behavioral results. For the memory parameter, both congruent conditions (Go Win and No-Go Avoid) were significantly increased compared with the incongruent conditions [Go Win vs. Go Avoid: t (28) = 2.12, P = 0.043; Go Win vs. No-Go Win: t (28) = 3.05, P = 0.005; No-Go Avoid vs. Go Avoid: t (28) = 2.58, P = 0.016; No-Go Avoid vs. No-Go Win: t (28) = 3.83, P < 0.001]. The cue activity in this cluster revealed a main effect of action (F 1,28 = 13.28, P < 0.001, Fig. 4b) and a trend of an interaction (F 1,28 = 4.1, P = 0.053).

Discussion
We show that images that predict a necessity to act were more likely to be recognized and remembered after a long retention interval (24 h) than those that predicted a necessity to omit action. This action benefit was particularly strong for images associated with reward but there also was a main effect of action across both the Win and Avoid conditions (Fig. 2c). Considering images associated with vivid recollection showed a highly similar pattern of an enhancement by action and the interaction of action and valence (remember responses to Go Win vs. No-Go Win, Fig. 2d).
It is well established that the ability to recognize events with high confidence and to experience vivid recollection is critically dependent on the hippocampus (Vargha-Khadem et al. 1997;Eichenbaum et al. 2007;Ranganath 2010;Squire and Wixted 2011). However, the observed dominant hippocampal activation for anticipated omission of action constitutes a striking dissociation between hippocampal activity at the time of encoding and behavioral memory. This dissociation and the fact that covariation of subsequent memory with hippocampal activation was constant across conditions make it unlikely that either the valence-independent memory benefit for anticipated action or the improved recollection for Go Win was driven by a hippocampal enhancement of encoding.
Taken together, the valence-independent enhancement of long-term memory in the Go conditions cannot be fully explained by hippocampal encoding processes per se. Rather, the long-term memory performance is compatible with an enhancement of memory consolidation independent of hippocampal activation during encoding . Indeed, a follow-up experiment using the same task design, but a short retention interval of 30 min, enables us to rule out the possibility that a benefit we observed after 24 h is a residue of a stronger immediate effect, which would likely reflect an encoding benefit.
Given the behavioral memory enhancement by action and the involvement of dopamine in long-term memory consolidation , we predicted that the activity in the SN/VTA, which contains dopaminergic neurons also projecting to the hippocampus (Lisman et al. 2011), would be related to that memory benefit. However, fMRI analyses on the basis of subsequent memory performance (using the memory parameter) did not implicate the SN/VTA. Instead, as in previous studies , activity in the SN/VTA as well as striatal activity were overall higher for a requirement to act (Fig. 3). Thus, while this overall effect of action in the SN/VTA and striatum mirrored the memory benefit of action, the functional relationship between this brain activity pattern and subsequent memory remained indirect.
In addition to a main effect of action, the behavioral memory performance after 24 h showed an interaction pattern of Pavlovian congruence , favoring Go Win (motor activation to obtain rewards) and No-Go Avoid (motor inhibition when faced with punishment), over the incongruent conditions (No-Go Win and Go Avoid). The only region in the basal ganglia that showed a neural interaction effect consistent with Pavlovian congruence was the right dorsal putamen. A cluster in this region predicted memory fate more in the congruent than the incongruent conditions (Fig. 4). Of course, the finding of a neural interaction effect does not mean that this signal is necessary to produce the behavioral results. To our knowledge, there is no direct projection between the dorsal parts of the putamen and the hippocampus (Friedman et al. 2002;Haber and Knutson 2010). Therefore, a memory-related modulation of activity was not predicted for this region. However, it is interesting to note that a decreased volume of the putamen has recently also been observed, albeit not predicted, in preclinical stages of dominantly inherited Alzheimer's disease (Cash et al. 2013), a disorder where memory problems are among the first symptoms. Also, studies in non-human primates show that bilateral lesions to the entorhinal and perirhinal cortices, important input/output structures of the hippocampus, lead to metabolic dysfunction in the basal ganglia including the putamen (Meguro et al. 1999). Taken together, our observations [also see Sadeh et al. (2011)] and the data regarding Alzheimer's disease point to the possibility that functions related to motor control in the putamen also influence the ability to recognize or recollect events in long-term memory. This adds to an emerging understanding of Pavlovian control in the striatum ) and a role for putamen to factor in the nature of an anticipated, instrumental action (Kurniawan et al. 2013). The results also expand the knowledge of striatal contributions to memory (Scimeca and Badre 2012) and suggest that there is clinical importance in investigating the interaction between memory and action.
Previous experiments have highlighted a benefit of reward expectation on long-term memory (Wittmann et al. 2005;Adcock et al. 2006), yet did not fully orthogonalize valence and action . Our behavioral results now show that on trials that were associated with a positive expectation of value (Win conditions), the need to act enhanced memory performance compared with the need to withhold action (Fig. 2). These observations highlight the importance of fully controlling modes of action and types of outcomes to understand the motivational regulation of long-term memory, particularly with regard to the role of the hippocampus. Further research might usefully determine how action and inaction relate to effort and the associated costs of acting. While on the one hand effortful actions may be discounted (Kurniawan et al. 2011), in other circumstances effort can be associated with increased evaluation (Zentall 2010).
Our data highlight a hippocampal role in instrumentally omitted actions (see Supplementary Materials for similar results obtained in previous studies). The neural network pattern of inaction (Fig. 3, also see Supplementary Material) is not consistent with the default network (Buckner et al. 2008), which one may expect if subjects were passively resting during No-Go. Indeed, the hippocampus has been associated with avoidance of potential threat (Bach et al. 2014;Bannerman et al. 2014), the expression of memory-dependent on behavioral inhibition (Bannerman et al. 2012), and inhibitory control of action for reward (Chudasama et al. 2012;Abela et al. 2013;Abela and Chudasama 2014). Another possibility which chimes well with recent accounts of hippocampal-basal ganglia interactions (Shohamy and Turk-Browne 2013) is that the hippocampus can modulate basal ganglia function on the basis of declarative memory (Pennartz et al. 2011). Another possibility is that hippocampally recalled task rules compete with action biases (Izquierdo et al. 2006;Eschenko and Mizumori 2007;Doll et al. 2009;Peyrache et al. 2009). Irrespective of which of these interpretations is true, these findings add to an emerging perspective of the broader impact the hippocampus has on cognition (Shohamy and Turk-Browne 2013) and its potentially antagonistic function to the basal ganglia under certain circumstances (Seger and Cincotta 2006;Lee et al. 2008;Foerde et al. 2013). In the current design, embedding the task in the semantic content of stimuli ensured the same level of stimulus-bound attention for each image. The requirement to perform an action in the Go-but not in the No-Go trials may have induced an additional attentional modulation directed toward the subsequent target detection task. While we cannot fully exclude the possibility that this additional action-related requirement had an effect on memory, we note evidence from levels-of-processing studies indicate that attention per se cannot improve memory if it is not directed toward the semantic content of a stimulus (Craik 2002). Likewise, an action-related attentional effect could not account for the interaction pattern observable in the behavioral and neural data that integrates action, valence, and memory.
The strong memory difference between action for reward and inaction for reward has implications for how we interact with information toward optimizing long-term learning. Indeed, the advantages of "active" over "passive" learning approaches have been of interest to educational psychologists (Michael 2006). While the "active" component in active learning approaches obviously exceeds the experimentally controlled and deliberately simple motor responses required here, our findings provide a neurocognitive framework that can account for the benefits of active learning. Most strikingly, we show that the omission of an action for reward is the least conducive to persistence of memory. Indeed, many educational approaches are likely to be suboptimal for memory consolidation because they are based on passive positive reinforcement.
In conclusion, we show a long-lasting modulation of recognition memory and recollection by the influence of anticipated action and the Pavlovian congruence between action and reward. This modulation is associated with complementary contributions of the basal ganglia and the hippocampus to action anticipation and the omission of action. How the 2 regions functionally interact to enable the observed modulation of memory consolidation remains to be established because previously reported functional interactions between the hippocampus and the SN/ VTA do not fully account for the behavioral effect while the putamen, a region without known anatomical connectivity to the hippocampus, does. Our findings therefore indicate a novel type of functional division between the hippocampus and basal ganglia in the motivational regulation of long-term memory consolidation, which favors remembering events that are worth acting for.
Funding This work was supported by an UCL Grand Challenge Studentship in Biomedicine to R.K., the Wellcome Trust (Senior Investigator Award 098362/Z/12/Z to R.J.D.), and DFG grant SFB 779 (A07) supporting E.D. The Wellcome Trust Centre for Neuroimaging is supported by core funding from the Wellcome Trust 091593/Z/10/Z. Funding to pay the Open Access publication charges for this article was provided by the The Wellcome Trust.