Memory for Dynamic Events When Event Boundaries Are Accentuated With Emotional Stimuli

Event boundaries are important moments throughout an ongoing activity that influence perception and memory. They allow people to parse continuous activities into meaningful events, encode the temporal sequence of events and bind event information together in episodic memory Thus, drawing attention to event boundaries may facilitate these important perceptual and encoding processes. In the current study, we used emotionally arousing stimuli to guide attention to event boundaries because this type of stimulus has been shown to influence perception and attention. We evaluated whether accentuating event boundaries with commercials improves memory and whether emotional stimuli further enhance this effect. A total of 97 participants watched a television episode in which we manipulated commercial break locations (boundary, non-boundary, no commercial) and the type of commercial (emotional, neutral) and then completed memory tasks. Overall, placing emotionally arousing commercials at event boundaries increased memory for the temporal order of events, but no other effects of accentuating event boundaries were observed. Thus, drawing attention to event boundaries—via emotionally charged commercials—increases the likelihood that people will perceive the change in events, update their mental model accordingly and better integrate temporal information from the just-encoded event.

Event boundaries are important moments throughout an ongoing activity that influence perception and memory. They allow people to parse continuous activities into meaningful events, encode the temporal sequence of events and bind event information together in episodic memory (DuBrow & Davachi, 2013). Thus, drawing attention to event boundaries may facilitate these important perceptual and encoding processes. In the current study, we used emotionally arousing stimuli to guide attention to event boundaries because this type of stimulus has been shown to influence perception and attention. We evaluated whether accentuating event boundaries with commercials improves memory and whether emotional stimuli further enhance this effect. A total of 97 participants watched a television episode in which we manipulated commercial break locations (boundary, non-boundary, no commercial) and the type of commercial (emotional, neutral) and then completed memory tasks. Overall, placing emotionally arousing commercials at event boundaries increased memory for the temporal order of events, but no other effects of accentuating event boundaries were observed. Thus, drawing attention to event boundaries-via emotionally charged commercials-increases the likelihood that people will perceive the change in events, update their mental model accordingly and better integrate temporal information from the just-encoded event.
At any given moment our perceptual system deals with an overwhelming amount of sensory input and yet we are able to readily experience and act upon our environment. Our perceptual system is able to manage and interpret this input, in part, because it breaks the activity down into individual events. As a simple everyday example, the act of driving to work can be broken down into several smaller sub-events such as opening your car door, starting the car's ignition, backing out of your garage, driving the route and parking your car in the company's parking lot. Although this series of events may be experienced as a seamless flow of activity, our perceptual system segments the activity into separate, meaningful units. The point at which one event ends and another event begins is referred to as an event boundary.
Previous work has demonstrated that while watching a video people tend to perceive remarkably similar event boundaries (Newtson, 1976), which are often associated with perceptual changes such as motion, body position and luminance (Cutting et al., 2012;Newtson et al., 1977) as well as the introduction of new characters, a change in goals, spatial location or time (Magliano et al., 2001). Further, perception of these event boundaries remains quite stable over time, such that people identify similar event boundaries when watching the same movie up to one year apart (Speer et al., 2003). While event boundary perception is fairly consistent within and across individuals, some individuals are more likely to identify normative event boundaries, whereas others are more likely to identify idiosyncratic boundaries. Importantly, these individual differences predict later memory performance such that more normative event segmentation is associated with better memory for the activity Flores et al., 2017;Gold et al., 2017;Kurby & Zacks, 2011;Sargent et al., 2013;Zacks et al., 2006). Thus, the perception of event boundaries is important for comprehension and memory (Boltz, 1992;DuBrow & Davachi, 2013;Ezzyat & Davachi, 2011;Schwan et al., 2000;Speer & Zacks, 2005).

Event Boundaries and Memory
Event boundaries serve as anchors by helping people segment complex activity into discrete, meaningful units (Kurby & Zacks, 2008). People tend to recall more actions (Schwan et al., 2000) and better recognize pictures related to event boundaries better than those related to nonboundaries (Newtson & Engquist, 1976). Further, removing scenes that contain event boundaries impairs memory for the activity more so than removing scenes from the middle of an ongoing event (Schwan & Garsoffky, 2004). Thus, the perception of boundaries is important for long-term memory.
Event boundary perception is also important for encoding the temporal sequence of an activity into episodic memory. To evaluate this relationship, Davachi and colleagues have evaluated the effect of event boundaries on temporal memory for text (Ezzyat & Davachi, 2011) and images (DuBrow & Davachi, 2013, 2014. Specifically, DuBrow and Davachi (2014) asked participants to encode a sequence of images of everyday objects and celebrity headshots and then tested their temporal memory by asking which of two objects was presented more recently. Critically, half of the image pairs came from the same event (or category, e.g., two celebrities) and half came from different events (e.g., a celebrity and an object), but all pairs had the same number of intervening items between them at encoding. They found that memory for temporal order was better for items from the same event than items from different events. Thus, a shared context is likely to help bind items together sequentially within an event. Similarly, memory for information from a previous event is worse after people encounter a spatial event boundary (i.e., walking through a doorway; Hornsby et al., 2016;Radvansky & Copeland, 2006). Thus, event boundaries signal a context change, or a new event, and trigger the binding of information from the just-encoded event. However, less binding occurs between information from different events (see also, DuBrow & Davachi, 2013, 2014Silva et al., 2019).
Given that event boundaries seem to help structure events in memory, perhaps drawing attention to them would promote this binding process and thus benefit memory? A few previous studies have addressed this question by manipulating an activity in such a way that emphasizes important event boundaries. For instance, Boltz (1992) found improved memory when event boundaries were emphasized through the use of commercial breaks. In this study, participants watched a television episode with commercial breaks inserted at event boundaries, at non-boundaries, or not at all. After watching the episode, participants completed several memory tasks. Boltz found that when event boundaries were emphasized, memory improved in comparison to the no commercial (control) condition, whereas when the events were interrupted by placing commercials in the middle of an event, memory suffered. Similarly, Gold et al. (2017) edited films to include visual and auditory cues such as the movie slowing down, arrows pointing to important objects, and a bell ringing at event boundaries or nonboundaries. They found that cueing event boundaries improved memory, particularly memory for the cued information, which is consistent with prior work (Boltz, 1992;Schwan et al., 2000). However, inconsistent with Boltz's findings, Gold et al. (2017) found that cueing non-boundaries also improved memory but to a lesser degree than cueing boundaries.
This research indicates that drawing attention to event boundaries improves overall memory for an event. It is possible that this boundary effect further increases the binding of information within the same event (Ezzyat & Davachi, 2011), thereby improving overall memory. However, these prior interventions did not evaluate temporal memory for order of information within the same event versus across different events.

Emotions and Attention
Another factor that has been shown to affect how individuals attend to and encode information is emotion. It is thought that our perceptual and attentional systems prioritize emotional information due to their importance for survival (e.g., Brosch et al., 2010;New et al., 2007;Seligman, 1971) because positive and negative arousing stimuli better capture a viewer's attention (e.g., Bannerman et al., 2009;Hansen & Hansen, 1988;Ohman et al., 2001). How, then, does emotional processing affect perception of event structure and later memory for a dynamic activity?
In the current study, we evaluated this question through the use of commercials that varied in emotional intensity and their effects on memory for a television episode. We made use of commercials because the goal of a commercial is to capture the viewer's attention to the target content (Teixeira et al., 2010). Emotion can help advertisers reach this goal because viewers tend to better remember commercials with emotional content (Mehta & Purvis, 2006). However, this previous work has not evaluated the effect that emotional content in commercials has on the neighboring information in a television episode. Thus, we have two competing hypotheses.
The first hypothesis is that highly arousing, emotional commercials will further draw viewers' attention to event boundaries, promote binding for information from the justencoded event and improve memory. If this hypothesis were true, then emphasizing event boundaries using emotionally arousing commercial breaks will result in better memory performance compared to neutral commercials or no commercials. The second hypothesis is that emotionally arousing commercial breaks will draw the viewers' attention away from the just-encoded event. Prior work has shown that an emotional stimulus disrupts consolidation of the preceding information resulting in poor memory for that information (Strange et al., 2003;Tulving, 1969). If this hypothesis were true, then emotionally arousing commercial breaks could result in worse memory performance than the neutral or no commercial breaks due to an interruption in encoding or consolidation. However, we must note a possible issue with using emotionally arousing commercials: Emotional content that is too arousing can have a detrimental effect on memory (Yerkes-Dodson law). Further, extremely arousing content may be driven by bottom-up processing (Bannerman et al., 2009;Ohman et al., 2001) and may shift attention away from semantic content towards perceptual details (Brewin et al., 1996;Sussman et al., 2016). Such changes in attention may alter how people segment and later remember information (Sherrill et al., 2019). Thus, in the current study, we were careful to select commercials with optimal arousal levels.

Current Study
The purpose of the current study is to investigate whether accentuating event boundaries improves memory Memory for Dynamic Events When Event Boundaries Are Accentuated With Emotional Stimuli Collabra: Psychology (i.e., boundary effect; conceptually replicating the effect reported by Boltz, 1992;Gold et al., 2017) and the degree to which emotionally arousing content enhances the boundary effect. Finally, to the extent that boundary accentuation and emotional arousal influence overall memory, we evaluated whether this improvement was a result of enhanced within-event binding. If so, we expected that memory for temporal order would be better for information within the same event as compared to information from different events (DuBrow & Davachi, 2014;Ezzyat & Davachi, 2011). For example, we expected that people would be more accurate at remembering that Axel told Magnus he is suspected of being a spy before Axel and Magnus discuss defecting to the U.S. because these subevents were from the same larger event ("spy discussion" event) compared to remembering that Axel and Magnus discuss defecting to the U.S. before Magnus met Jack at a café because these subevents were from different larger events ("spy discussion" event and "café" event).

Participants
There were 97 participants (Female = 45, Age M = 19.84, SD = 3.54) from a large Midwest university's undergraduate psychology research pool randomly assigned to one of five commercial conditions: Boundary-Emotional (n =20); Boundary-Neutral (n = 16); Non-Boundary-Emotional (n = 18); Non-Boundary-Neutral (n = 18); and No Commercial (n = 25). We conducted a power analyses using G*Power version 3.1.9.6, for a one-way ANOVA with a one-tailed hypothesis, set power of .95, alpha of .05 and an effect size of Cohen's f = 1.55 (the most conservative effect size estimate from Boltz, 1992) and found that a total sample size of 30 participants would be sufficient to detect such effects of interests. However, we recruited nearly 100 participants, which was more similar to the sample size reported in Boltz (1992). Participants were naïve to the purpose of the experiment. Study procedures were approved by the large Midwest university's Institutional Review Board and all participants received class credit for their participation.

Apparatus
The experiment was performed using Mac Minis, which have 2.5 GHz dual-core Intel Core i5 with 3MB L3 cache (Turbo Boost up to 3.1 GHz), 4 GB of 1600MHz DDR3 SDRAM, 500 GB 5400-rpm hard drive, and an Intel HD Graphics 4000. Stimuli were presented on 19-inch Dell E1914H monitors set to a refresh rate of 75 Hz and a resolution of 1024 x 768. The experiment was conducted using Experiment Builder 2.1.140 for Mac OS X. Participants listened to the experimental audio using Maxwell HP-100 headphones and made responses with Logitech K120 keyboards.

Television Episodes
As in Boltz (1992), the experimental stimuli used were episodes four and six from A Perfect Spy mini-series. 1 The duration of Episode 4 is 53:26 minutes, and the duration of Episode 6 is 56:12 minutes. Both episodes have frame rates of 25 Hz. The two episodes originally do not include any commercial breaks; however, the episodes were experimentally manipulated to present commercials at boundary and non-boundary locations. These locations have been identified using Boltz's (1992, p. 104-105) descriptions in Appendixes A and B. Each episode had 7 large events separated by a total of 6 event boundaries. The mean event length across both episodes was 454 seconds (range 40-870 seconds).

Commercials
Prior to the current study, a pilot study was conducted to select the commercials to be included in the main experiment. A separate group of 33 participants (Female = 17, Age M = 36.03, SD = 9.07) was recruited from Amazon Mechanical Turk and asked to rate the emotional intensity of a series of 47 commercials. These commercials were selected from YouTube and ranged in content, including both negative and positive content. We only assessed emotional intensity/ arousal, not valence. In this pilot study, the participants provided informed consent, completed several demographic questions, then watched the commercials and rated each commercial on a sliding scale from 0 to 7 (0 = no emotional intensity, 7 = extreme emotional intensity), and finally were debriefed.
The mean of each commercial emotional intensity rating was calculated to determine the most and the least emotionally intense commercials. The commercials with the highest mean intensity ratings (ranging from 4.45-6.12) were placed into the emotional condition, whereas commercials with the lowest mean intensity ratings (ranging from 0.41-1.38) were placed into the neutral condition. Then within each condition, commercials were randomly grouped together to create 2-2 ½ minute commercial breaks. There were a total of six emotional commercial breaks and six neutral commercial breaks. Examples of commercials used to create an emotional commercial break were anti-bullying PSAs, anti-drinking and driving PSAs, and animal abuse commercials. Examples for the neutral commercials were infomercials, lawyer advertisements, and shampoo commercials. The order of commercial breaks was randomized and then inserted at either the boundary or non-boundary locations throughout the television episode.

Filler Task
After viewing the entire episode, participants were asked Boltz (1992, p. 94): "A Perfect Spy miniseries was produced by BBC-TV Productions. The use of all filmed material in this experiment conformed to the specifications of the House Report on piracy and counterfeiting amendments (H.R. 97-495, pp. 8, 9)." 1

Memory for Dynamic Events When Event Boundaries Are Accentuated With Emotional Stimuli
Collabra: Psychology to type out as many country names as they could think of into an empty text field for two minutes. The purpose of this filler task was to distract the participants and clear working memory of the just watched episode content prior to any of the memory tests.

Recall Task
After the filler task, participants were asked to describe the events in the episode in as much detail as possible in the order that they had occurred. Participants typed their recall responses into an empty text field, and they were given five minutes to complete this task. The recall task score was measured by using Flores et al. (2017) normalized recall method, which is strongly related to hand-scored recall (r = 0.77) for shorter videos of everyday activities. For each participant, we calculated the total number of recall response words divided by the total number of clauses in the episode. The number of clauses were determined from the episode summaries in Appendices A and B from Boltz (1992). Episode 4 contained 81 clauses and Episode 6 contained 74 clauses.

Recognition Task
The recognition task consisted of 24 three-second video clips (12 from each episode) that were randomly ordered. These clips were taken from approximately every four minutes throughout the episode. On each trial, participants saw a ready check screen, and then one video clip was presented, followed by a response screen ( Figure 1). Participants had to decide whether the clip came from the episode they watched by selecting "Yes, I watched this" (pressing 'x') or from a different episode, which they had not seen, by selecting "No, I did not watch this" (pressing 'm'). For example, participants were presented with a three-second clip of Magnus and Jack walking in a park, a scene from Episode 4. After the clip was over, the response screen appeared, and they were asked to decide whether or not that scene came from the episode that they watched. If the participant had been assigned to the Episode 4 condition, they would press "x" for a correct response; if the participant had been assigned to the Episode 6 condition, they would press "m" for a correct response. After making their decision, participants saw a confidence rating scale and indicated how confident they were in their recognition response on a confidence scale ranging from Not Confident (1) to Very Confident (7).

Temporal Order Task
In this task, participants viewed 12 pairs of three-second video clips that all came from the episode they watched. Their task was to decide which clip occurred first (Video Clip 1 or Video Clip 2) in the episode (Figure 1). The video clip pairs were either from within the same event or across different, but contiguous events as determined by the event boundaries from Boltz's (1992, p. 104-105) Appendixes A and B. Each trial took approximately nine seconds, plus the response time: Video Clip 1 was presented followed by a three-second black screen, and then Video Clip 2 was presented. A response screen appeared asking which clip oc-  (1) to Very Confident (7).
Six trials consisted of two video clip pairings that come from within the same event (i.e., no intervening event boundary between the two clips, same event condition). The other six trials consisted of two video clip pairings that come from different, but adjacent events (i.e., intervening event boundary between the two clips, different event condition). These clips occurred approximately two minutes apart to ensure the difficulty of the same and different event pairs were roughly equivalent. Different event clips were taken from ±60 seconds from a boundary whereas same event clips were taken from ±60 seconds from the exact midpoint of that event. Figure 2 shows a schematic depiction of how these temporal order memory items were arranged throughout Episode 6.

Design
The design of this experiment is a 2 (Episode: Episode 4 vs. Episode 6) x 5 (Commercial Condition: Boundary-Emotional, Boundary-Neutral, Non-Boundary-Emotional, Non-Boundary-Neutral, and No Commercial) betweensubject factorial design. The Commercial Condition variable was created by merging two variables, Commercial Break Location (Boundary vs. Non-Boundary vs. No Commercial) and Commercial Type (Emotional vs. Neutral). The Emotional and Neutral levels of the Commercial Type variable were nested within the Boundary and Non-Boundary levels of the Commercial Break Location variable; however, these variables were not fully crossed given that there were no commercial breaks in the No Commercial level. Commercial Condition was used as a predictor in all of the memory task analyses reported in the results section. Each participant watched 1 of 2 episodes. For some participants, the episode was interrupted by six commercial breaks, which occurred either at boundary or non-boundary locations and consisted of either emotional or neutral commercials. For other participants, the episode played continuously without commercial breaks (see Figure 2 for a schematic of an episode, including boundaries, commercial breaks and temporal memory pairs).
The Commercial Break Location and Episode variables were all counterbalanced for a total of ten groups. The temporal order task also included a within-subject variable, Pair Type, which had different versus same event conditions. Memory for Dynamic Events When Event Boundaries Are Accentuated With Emotional Stimuli Collabra: Psychology

Figure 2. Schematic of Episode 6 including Locations of Event Boundaries, Commercial Break Placement and Locations of Temporal Order Memory Stimuli
The different event condition included two clips that were separated by a narrative event boundary (i.e., two clips from two different events), whereas the same event condition included two clips that were within the same narrative event boundaries (i.e., two clips from the same event). These event boundary locations were determined by Boltz's (1992, p. 104-105) descriptions in Appendixes A and B.
All Commercial Conditions responded to the same stimuli in the temporal order memory task. For all groups, the different event pairs were always separated by a narrative event boundary and the same event pairs were not. However, for two groups (Boundary-Emotional & Boundary-Neutral), the different event pairs were also always separated by commercial breaks, whereas the same event pairs were not. Whereas for two groups (Non-Boundary-Emotional & Non-Boundary-Neutral), the commercial breaks were not placed at narrative event boundaries but instead during the middle of an event. This resulted in some of the different and same event pairs being separated by a commercial break and others not (see Figure 2).

Procedure
The participants read instructions describing that they would watch a television episode and then complete several memory tasks. Then they began watching the selected television episode (4 or 6) from the mini-series A Perfect Spy. After watching the full episode, participants completed the 2-minute filler task, followed by the recall task (5 minutes), recognition task (24 trials), and then the temporal order task (12 trials). Following the completion of the memory tasks, participants then indicated whether they had previously seen the episode or not. Participants were then de-briefed and thanked for participating.

Results
All analyses were conducted in R version 4.0.3 (RStudio Team, 2020). Recall performance was modeled with the lm function, both Recognition (Old/New) and Temporal Order (Accuracy) were modeled with the glmer function using the lme4 package (Bates et al., 2015). Significance tests for these models were performed with Type III Wald Chi-square tests using the car package and Anova function (Fox & Weisberg, 2019). Post hoc comparisons were conducted with a Bonferroni correction using the emmeans package (Lenth, 2020). Confidence ratings for the Recognition and Temporal Order tasks were analyzed with cumulative link mixed models (CLMM) created using the ordinal package v2019.12-10 and the clmm function (Christensen, 2019). Significance tests for the CLMM models were performed with Type II Wald Chi-square tests using the RVAideMemoire package and Anova.clmm function (Hervé, 2020). Graphs were created using the ggplot2 package (Wickham, 2016). The confidence ratings graphs were also generated using the ggeffects package (Lüdecke, 2018) guided by Barlaz (2020).
The independent variables included in the analyses were effect coded: Commercial Condition levels were Boundary-Emotional (+1, 0, 0, 0), Boundary-Neutral (0, +1, 0, 0), Non-Boundary-Emotional (0, 0, +1, 0), Non-Boundary-Neutral (0, 0, 0, +1), and No Commercial (-1, -1, -1, -1), and the Pair Type levels were Different Event (+1) and Same Event (-1). The Episode variable was not of theoretical interest and did not significantly interact with the Commercial Condition variable in any analysis, thus, all of the reported analyses are collapsed across Episode. The data from   one participant was removed from all analyses due to failing to attend to the experimental tasks. All other data removed from cleaning procedures are explained in each of the task sections.

Recall Task
Recall performance was scored as the number of recall response words divided by the number of clauses and is plotted by Commercial Condition in Figure 3. A one-way between-subjects Type III Wald Chi-square test revealed no

Recognition Task
Signal Detection. Responses were removed if a participant's response time on a given recognition trial was less than 150 msec or greater than 10 seconds, which resulted in the removal of 21 recognition trials from the recognition task analyses (0.93% overall data). The signal detection analysis was performed with a mixed effects probit model (Wright & London, 2009). The observed responses "old" or "new" were the DV and the fixed effects were the 2 (Item Type; old vs. new) x 5 (Commercial Condition: Boundary-Emotional, Boundary-Neutral, Non-Boundary-Emotional, Non-Boundary-Neutral, and No Commercial) with Participant and Recognition Item included as random intercepts and Item Type by-Participant included as a random slope. Recognition Item was a categorical variable that uniquely identified each of the 24 Recognition task stimuli. Table 1 provides the parameter estimates from the Recognition Task mixed effects probit model. A Type III Wald Chi-square test was performed using the car package

. Recall Performance as a Function of Commercial Condition
Note: The bar graph shows recall performance for each level of Commercial Condition with 95% confidence interval error bars. Recall performance is scored as the total number of recall response words divided by the total number of possible clauses. and Anova function (Fox & Weisberg, 2019) to test overall bias (c) and sensitivity (d') and determine whether they differed across Commercial Condition. We found participants were sensitive at discriminating between new and old items (d' = 2.82, z = 24.18, p <.001) but no significant difference was detected for a bias (c = -0.17, z = -1.55, p = .122) to select new or old. Neither sensitivity (d'), χ2 (4) = 0.92, p = .921 (see Figure 4), nor bias, (c) χ2 (4) = 0.365, p = .985, differed significantly by Commercial Condition. These results indicate that participants were sensitive to determining whether a video clip was new or old, without a detected overall bias, and that no significant difference was observed across the Commercial Conditions.
Confidence Ratings. The recognition confidence ratings (1-7) were analyzed as ordinal data with a cumulative link mixed model (CLMM). Multiple confidence ratings per participant were collected. In the following analysis, only correct trials were included, which led to removing 11.4% of Memory for Dynamic Events When Event Boundaries Are Accentuated With Emotional Stimuli Collabra: Psychology   trials. The CLMM model included Commercial Condition (Boundary-Emotional, Boundary-Neutral, Non-Boundary-Emotional, Non-Boundary-Neutral, and No Commercial) as a fixed effect and Participant was treated at its intercept as a random effect. The model had a logit link function. Table 2 provides the parameter estimates from the Recognition Task Confidence Ratings CLMM. Figure 5 displays the predicted probability of response for each of the confidence ratings by the five Commercial Conditions from the Recognition Task CLMM. To test whether confidence ratings significantly differ by Commercial Condition, the Recognition Task CLMM was analyzed with a Type II Wald Chi-square test (Mangiafico, 2016), which resulted in no significant main effect of Commercial Condition, χ2 (4, N=97) = 4.38, p = .357. Table 3 provides the Type II Wald Chi-square test output for the levels of Commercial Condition estimated marginal means, standard errors, as well as the lower and upper confidence intervals.

Figure 4. Recognition Task -Sensitivity (d') as a Function of Commercial Condition
Note: The bar graph shows the recognition task sensitivity (d') for each Commercial Condition with 95% confidence interval error bars. All CIs are calculated from the overall sensitivity (Item Type) standard error.

Memory for Dynamic Events When Event Boundaries Are Accentuated With Emotional Stimuli
Collabra: Psychology

Temporal Order Task
On this task, participants saw two video clips and had to determine which clip occurred earlier in the watched episode. Each participant responded to 12 video clip pairings. These video clips were taken either from the same event or from two different events (for example, see Figure  2). Similar to the recognition task, responses on the temporal order task were removed if a participant's response time on a given trial was less than 150 msec or greater than 10 seconds, which resulted in the removal of 34 temporal order task trials from the temporal order task analyses (3% overall data). Further, data were removed if a participant's cumulative accuracy was less than chance (50%), which resulted in five participants being removed from the analyses (Boundary-Emotional, n = 1; Boundary-Neutral, n = 1; Non-Boundary-Emotional, n = 2; and No Commercial, n = 1). Therefore, the temporal order task analyses included the data of 92 participants and a total of 1045 observation data points.
Accuracy. Performance was analyzed using a mixed effects logistic regression because the dependent variable, accuracy, was binary (0 = incorrect response; 1 = correct response). The model had a fixed effects structure, which was a 2 (Pair Type: Same vs. Different Events) x 5 (Commercial Condition: Boundary-Emotional, Boundary-Neutral, Non-Boundary-Emotional, Non-Boundary-Neutral, and No Commercial) mixed design. Pair Type was within-subject and Commercial Condition was between-subject. Participant was treated at its intercept as a random effect in the model. Figure 6 displays the predicted accuracy on a given trial from the model for each Pair Type by the five Commercial Conditions in the temporal order task. Table 4 provides the parameter estimates from the Temporal Order Task Accuracy model. To test whether accuracy performance significantly differed by Pair Type, Commercial Condition, or their interaction, the temporal order mixed effects logistic model was analyzed with a Type III Wald Chi-square test using the car package and Anova function (Fox & Weisberg, 2019). Table 5 provides the Type III Wald Chi-square test output

Figure 5. Recognition Task -Predicted Probability of Confidence Rating Response as a Function of Commercial Condition
Note: The stacked bar plot provides the predicted probability of confidence rating response as a function of Commercial Condition. for the Pair Type by Commercial Condition estimated marginal means, standard errors, as well as the lower and upper confidence intervals. There was a significant main effect of Pair Type, χ2 (1, N=92) = 6.39, p = .012, indicating that accuracy was significantly higher for video clips within the same event than for those from different events. There was no significant main effect of Commercial Condition, χ2 (4, N=92) = 1.49, p = 0.829, nor a significant interaction of Pair Type x Commercial Condition, χ2 (4, N=92) = 6.79, p = 0.147. However, post-hoc comparisons with a Bonferroni correction (5 tests) were performed comparing the accuracy performance of the two levels of Pair Type for each Commercial Condition. Accuracy for Same Events (M = 2.00, SE = 0.302) was significantly higher than accuracy for Different Events (M = 0.835, SE = 0.226) in the Boundary-Emotional condition (p = .006). No other comparisons differed significantly.
Confidence Ratings. Similar to the confidence ratings from the recognition task, the confidence ratings (1-7) from the temporal order task were also analyzed as ordinal data  with a cumulative link mixed model (CLMM). Multiple confidence ratings per participant were collected. In the following analysis, only correct trials were included, which led to removing 22.8% of trials. The CLMM model had the same fixed and random effects structures as the mixed effects logistic model for the temporal order accuracy data, 2 (Pair Type: Same vs. Different Events) x 5 (Commercial Condition: Boundary-Emotional, Boundary-Neutral, Non-Boundary-Emotional, Non-Boundary-Neutral, and No Commercial) mixed design. Pair Type was within-subject and Commercial Condition was between-subject. Participant was treated at its intercept as a random effect in the model. The model had a logit link function. Table 6 provides the parameter estimates from the Temporal Order Task Confidence Ratings CLMM. Figure 7 displays the predicted probability of each confidence rating response for each of the five Commercial Conditions and for each Pair Type. To test whether confidence ratings significantly differed by Pair Type, Commercial Condition, or their interaction, the Temporal Order CLMM was analyzed with a Type II Wald Chi-square test (Mangiafico, 2016). Table 7 provides the Type II Wald Chi-square test output for the Pair Type by Commercial Condition estimated marginal means, standard errors, as well as the lower and upper confidence intervals. There was a significant main effect of Pair Type, χ2 (1, N=92) = 13.77, p <.001, indicating that confidence ratings were significantly higher for video clips within the same event than for those from different events. There was no significant main effect of Commercial Condition, χ2 (4, N=92) = 3.31, p = 0.507, nor a significant interaction of Pair Type x Commercial Condition, χ2 (4, N=92) = 2.24, p = 0.691.

Discussion
The main goals of the current experiment were to investigate whether accentuating event boundaries improves memory and whether emotionally arousing content enhances this effect. First, we evaluated the overall effect of accentuating event boundaries on memory accuracy. Surprisingly, we found no effect of commercials on recall or recognition memory performance. That is, commercial     episode did not improve memory compared to a control group, nor did commercials placed at non-boundaries impair memory, unlike the effects reported in Boltz (1992). Further, we found no significant effect of the commercial's emotional content on recall or recognition performance, regardless of whether these commercials were placed at boundaries or non-boundaries. Most importantly, though, we did find a significant boundary effect for temporal memory but only when the commercials contained emotionally arousing content (Boundary-Emotional condition). That is, participants were better able to remember the temporal order of two video clips when they came from the same event as compared to when they came from two different, but contiguous events. The observed performance difference between the same and different event conditions cannot be explained by the two video clips being closer or farther apart in time because all video clip pairs were approximately two minutes Memory for Dynamic Events When Event Boundaries Are Accentuated With Emotional Stimuli Collabra: Psychology apart (see Figure 2). Instead, it is possible that the emotional commercials disrupted consolidation of the information at the end of the preceding event (Strange et al., 2003;Tulving, 1969), which led to poor temporal order memory for events coming before and after the emotional commercial break. However, we did not find a similar effect of emotion on memory for temporal order in the Non-Boundary-Emotional condition, indicating that the combination of a narrative event boundary and emotional information produced this effect. Further, it does not seem as if the emotional commercials impaired memory in the Boundary-Emotional condition, but rather improved temporal memory for information within the same events relative to other conditions (see Figure 6).
Thus, a more likely possibility is that the emotional content made the narrative event boundaries more salient. Event boundaries serve as anchors in memory and help to bind subevents within a larger event (DuBrow & Davachi, 2013, 2014Ezzyat & Davachi, 2011). Emphasizing event boundaries with emotional commercials may have ensured that participants perceived the event boundaries, segmented the activity and sequentially bound information together from the preceding event.
Interestingly, prior work from Davachi and colleagues has observed the boundary effect in temporal memory without the use of boundary cues or emotional content. However, their prior work used written texts or object-face stimuli in which the events were considerably shorter (e.g., 2 seconds, DuBrow & Davachi, 2016; approximately 4-7 sentences, Ezzyat & Davachi, 2011) than those in the current study (approximately 7½ minutes). Moreover, the prior work used stimuli with more event boundaries and more temporal order memory pairs than we used in the current study. For instance, Dubrow and Davachi (2016) had 80 event boundaries and approximately 192 order memory pairs (16 series * 12 pairs). Each of our television episodes had 6 event boundaries and 12 order memory pairs. Thus, it is possible that more trials were needed to detect the same event effect within the conditions that experienced no commercials or neutral commercials at event boundaries. This is a testable hypothesis for future research.
A growing body of literature has shown that emotional stimuli are prioritized in working memory. However, emotional stimuli can either disrupt inter-item binding or enhance short-term processing (see Bennion et al., 2013 for review). Our study sought to evaluate these effects when one is asked to remember a continuous stream of on-going activity. We found that placing emotional commercials at narrative event boundaries (i.e., important changes in the plot of the television episode) may have drawn attention to and enhanced the processing of the just-encoded event. Future work should continue to investigate the specific role of emotion on the perception of ongoing events.
We should note that this study was not a direct replication of Boltz (1992). Instead, our goal was to extend her work by manipulating the content of the commercial breaks (emotional vs. neutral content). We attempted to conduct many aspects of the current study as similarly as possible to Boltz's original study based on the details provided in the manuscript-e.g., the specific television episodes and the types of memory measures. However, our failure to repli-cate her effects on recall and recognition may be due to a few methodological differences. First, Boltz's (1992) recall task allowed participants to write down what they could remember from the viewed episode for 20 minutes, whereas we only allowed them 5 minutes to type what they remembered, and Boltz's dependent measures were collected through paper-based methods whereas ours were computer-based. Additionally, while prior work has shown normalized word counts are highly correlated with hand scored-recall , this correlation is based on scoring of shorter videos of everyday activities as opposed to longer professionally filmed videos, like A Perfect Spy. Further, our recognition and temporal order stimuli were presumably different from those used in Boltz (1992) because her specific stimuli were not provided. Therefore, one possible reason that we failed to replicate Boltz's findings from the recognition and part of the temporal order tasks may be because different stimuli were used in the current study than Boltz (1992).

Limitations
One limitation is that the effect of emotional valence was not evaluated in this study. Both positive and negative emotionally intense commercial breaks were used because they were both rated as emotionally intense. However, most commercials contained negative events such as a deadly car crash due to drinking and driving, or the abuse of animals. Future research could distinguish whether memory is differentially affected by positive and negative valence of a commercial. Further, commercials appearing in higher rated programs tend to be remembered more than lower rated programs (Barclay et al., 1965), thus the commercials' attentional effects may vary depending on the viewer's interest in A Perfect Spy.

Conclusion
The current study found that accentuating event boundaries with emotionally arousing content increased memory for the sequence of events in the television episode. The link between event boundary perception and memory is exciting and provides a potential avenue for improving memory. Future interventions may be created to improve event boundary perception and thus improve memory. Such interventions could have widespread impact on students learning new material, employees trying to learn new skills, older adults who demonstrate age-related declines in memory, and many other populations that have difficulty learning and remembering new information. However, before investing time in developing these interventions, more research must be done to evaluate the most effective way to guide event boundary encoding, the mechanisms by which such manipulations affect memory and how long the effects persist.
Importantly, the current study provides a potential mechanism by which event segmentation interventions may improve memory for dynamic, everyday activities. Drawing attention to event boundaries-via emotionally charged commercials-increases the likelihood that people will perceive the change in events, update their mental model accordingly and better integrate information from Memory for Dynamic Events When Event Boundaries Are Accentuated With Emotional Stimuli Collabra: Psychology the just-encoded event. Thus, event boundaries trigger the binding of information from the just-encoded event and help to temporally organize the actions in episodic memory.

Contributions
Contributed to conception and design: HRB, JJP, JSR. Contributed to acquisition of data: JJP, JSR, HRB. Contributed to analysis and interpretation of data: JJP, HRB, JSR. Drafted and/or revised the article: JJP, JSR, HRB. Approved the submitted version for publication: JJP, JSR, HRB.