How does episodic memory develop in adolescence?

This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first 12 months after the full-issue publication date (see http://learnmem.cshlp.org/site/misc/terms.xhtml). After 12 months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

Abstract

Key areas of the episodic memory (EM) network demonstrate changing structure and volume during adolescence. EM is multifaceted and yet studies of EM thus far have largely examined single components, used different methods and have unsurprisingly yielded inconsistent results. The Treasure Hunt task is a single paradigm that allows parallel investigation of memory content, associative structure, and the impact of different retrieval support. Combining the cognitive and neurobiological accounts, we hypothesized that some elements of EM performance may decline in late adolescence owing to considerable restructuring of the hippocampus at this time. Using the Treasure Hunt task, we examined EM performance in 80 participants aged 10–17 yr. Results demonstrated a cubic trajectory with youngest and oldest participants performing worst. This was emphasized in associative memory, which aligns well with existing literature indicating hippocampal restructuring in later adolescence. It is proposed that memory development may follow a nonlinear path as children approach adulthood, but that future work is required to confirm and extend the trends demonstrated in this study.

Episodic memory (EM) describes the ability to encode, store, and retrieve representations of previously experienced episodes and their temporal-spatial context (Tulving 1972). EM development continues well into the third decade of life (Ruggiero et al. 2016); however, its developmental trajectory after the preschool years remains controversial, with some studies suggesting linear improvements (Ofen et al. 2007) and others no improvement (Picard et al. 2012) or a nonlinear pattern (Tulving 1985; Keresztes et al. 2017). While there has been some debate as to the “defining features” of EM (Cheke and Clayton 2013, 2015) most theorists agree that it is not a unitary ability, instead reflecting the combination of a number of contributing features. Given that many of these studies used different methods for testing EM, and that different tests may emphasize different features (Cheke and Clayton 2013, 2015), it is likely that empirical differences reflect the fact that different features of memory may develop differently during later childhood and adolescence (Picard et al. 2012).

The importance of understanding the developmental trajectory of EM in adolescence is highlighted in the close association between EM and other cognitive processes. EM is thought to support decision-making, particularly in the incorporation of memories into task- and goal-relevant responses (Murty et al. 2016); thus, immaturity of EM may influence the high levels of risk taking observed in adolescence. Adolescence also represents a period of vulnerability to the development of mental illness (Kessler et al. 2007). Evidence that deficits in EM have been linked to a number of mental health disorders such as depression (Goodwin 1997) and anxiety (Airaksinen et al. 2005) raises the possibility that individual differences in memory development during this period may influence this vulnerability. Finally, adolescence is a demanding time academically: During these school years, large quantities of knowledge must be acquired to be successful in exams, which have long-term impacts on individuals’ academic and professional future. It is therefore important to understand factors that may contribute to individual differences and challenges in learning and memory during this period.

Memory development in adolescence has attracted considerable research attention in recent years, with the majority of work conducted on developmental trajectories of brain areas within the memory network. EM relies on a distributed network of brain areas, including the medial temporal and superior parietal lobes and the prefrontal cortex (PFC) (Simons and Spiers 2003). Each area within the network, as well as the network itself, shows protracted maturation across adolescence.

Development of the memory network during adolescence

Structural changes in the PFC extend throughout adolescence into adulthood (Spear 2000) and may be nonlinear and multifaceted, with research providing evidence for a peak in gray matter volume at ∼11 yr (Giedd et al. 1999) followed by a decrease, while others demonstrate gradual cortical thinning from 7 yr of age (Ducharme et al. 2016; Sowell et al. 2004, 2007). This shift in trajectory of gray matter volume is thought to reflect protracted synaptogenesis, increasing capacity for higher cognitive functions (Huttenlocher and Dabholkar 1997), followed by synaptic pruning of obsolete connections to produce maximally efficient neural pathways (Huttenlocher 1979). According to this account, at peak gray matter volume, large numbers of obsolete connections might feasibly compromise cognitive efficiency. Indeed, there is some evidence that degree of cortical thinning during this period is associated with improved memory recall (Sowell et al. 2001) and this is linked with increased memory-related activity in PFC regions, particularly the dorsolateral PFC (Ofen et al. 2012).

Hippocampal volumes increase throughout childhood (e.g., Brown et al. 2012; Gilmore et al. 2012); however, investigations of its development through adolescence has produced inconsistent findings, with some indicating stable volume (e.g., Koolschijn and Crone 2013), some indicating increases (e.g., Dennison et al. 2013), and others decreases in hippocampal volume during the teenage years (Tamnes et al. 2013). More recent studies suggest a quadratic trajectory of development (e.g., Herting et al. 2018; Tamnes et al. 2018), which may explain some of the earlier inconsistencies. Further inconsistency in this literature may stem from variation in developmental trajectory between different hippocampal subregions, although these studies also show inconsistent findings, likely reflecting variations in sampling (cross-sectional, longitudinal, or accelerated longitudinal) and segmentation techniques. That being said, many of these studies indicate quadratic or cubic development during adolescence in specific subregions (DeMaster et al. 2014; Daugherty et al. 2017; Tamnes et al. 2018). Adding yet another level of complexity, there appear to be changes in the way in which the hippocampus is recruited during memory performance over the period spanning late childhood, adolescence, and early adulthood (DeMaster et al. 2014; Sastre et al. 2016). Finally, the frontal–temporal network, a crucial part of a functioning EM system in adults (Simons and Spiers 2003; Blumenfeld and Ranganath 2007), is also developing during adolescence (e.g., Sherman et al. 2014; Simmonds et al. 2014).

How these neurodevelopmental changes are reflected in memory performance is unclear, as demonstrated by the elaborate patchwork of studies that exist, individually examining aspects of the relationship of frontal or hippocampal structure and functioning in relation to measures of memory. To date, no research has specifically investigated the developmental trajectory of different component of EM within an integrated framework.

Behavioral changes in memory performance across adolescence

There is evidence for a nonlinear developmental trajectory in certain components of EM development. Lee et al. (2014) suggest that performance on associative memory during the middle childhood and adolescent period may be quadratic in nature. They showed that 8- to 9-yr-old children performed significantly more poorly in an item–color associative memory task than 9- to 11- and 13- to 15-yr-old children, but not the intermediate 11- to 13-yr-old. This performance, when controlling for age, was associated with the volume in the right hippocampus (particularly CA3/DG), which also demonstrated a nonlinear developmental pattern during this period, with highest volumes in the 11- to 13-yr-old children. In contrast, tasks that might be considered to preferentially rely on frontal processing for example, assessment of “remembered” as opposed to “familiar” memories show linear improvements between 8 and 24 yr and are associated with functional and structural development of the DLPFC but not with any measure of medial temporal lobe volume (Ofen et al. 2007). Interestingly, these authors note that their results may be “better described in a nonlinear function,” but this was not something they assessed.

The heterogeneity of previous data suggests that the trajectory of memory development seen may depend on the nature of memory assessed. Different tasks assessing different components of EM may produce different trajectories, likely reflecting development of different brain areas. In support of this, Keresztes et al. (2017) conducted a number of memory assessments in participants aged 6–14 and 18–27 and found linear improvements in some, such as source memory, which was correlated to “frontal maturity,” and quadratic development of others, such as associative recognition that were positively correlated with “hippocampal maturity.” Given that 14- to 18-yr-old were not assessed in this study, it is difficult to identify the age of “peak” performance. However, these findings suggest that memory tasks relying more on frontal function may be expected to show linear increases during this period, while those assessing more hippocampal-dependent processes are more likely to show nonlinear development.

The complication of puberty

Adolescence is made unique as a developmental period due to the transformational hormonal, psychological and physical effects of puberty. Pubertal status, independently of age, significantly influences subcortical volumes and is likely to be a key driver in the neural maturation in adolescence (Goddings et al. 2014). In their study, using 711 MRI scans from 275 individuals aged 7–20 yr, Goddings et al. (2014) estimated the volume of subcortical structures. They showed that pubertal development, as assessed by Tanner staging, and chronological age had both independent and interactive influences on volume for the hippocampus, amygdala and putamen in both sexes and the caudate in females. In keeping with this, the neurocognitive data suggests puberty-dependent results in cognition. Indeed, nonlinear development producing cognitive “dips” in later adolescence have been observed in other areas of cognition in a manner that was puberty-dependent. For example facial processing is impaired in older adolescence (McGivern et al. 2002) and puberty rather than age per se is thought to account for these changes (Blakemore 2008). As such in this study, analyses are presented with both the entire cohort and with only peripubescent and postpubescent participants. While this does not explicitly investigate the role of puberty (this is confounded with age in our sample), it allows clarification of developmental patterns when variation due to puberty is reduced.

In summary, areas throughout the EM network demonstrate protracted development throughout the adolescent period. These developments may be nonlinear, with gray matter volumes increasing to a peak and subsequently decreasing in a region-specific manner (Giedd 2004; Gogtay et al. 2004). This nonlinear neural development may be reflected in EM performance, depending on what component processes are challenged by the specific task used. However previously used tasks differ in more than just the type of memory they assess, and evidence for varying trajectories may be related to these “nontarget” differences. It is impossible to extrapolate general trends from such isolated studies, demonstrating the need to investigate the different components of EM within the same integrated framework to allow meaningful conclusions to be drawn.

Assessing the component processes of episodic memory

Different theorists have emphasized different component processes that underpin EM in development (Clayton et al. 2003). Clayton et al. (2003) define three criteria for behavioral demonstrations of EM in children and animals: content, structure, and flexibility. Since EM is spatio–temporal in nature, the content of the memory must include information as to what happened (“What”/item memory), where it happened (“Where”/spatial memory) and when it happened (“When”/temporal memory). However, it is not sufficient for all three of these informational elements to be present—they must be structured in an integrated fashion. Thus, the structure of the memory must be associative. Finally, they argue that the memory must be flexibly accessible to conscious recall, and not a mere response to external stimuli. These latter two features overlap significantly with Shing et al. (2010) two-component framework of EM as consisting of an “associative” and “strategic” component. The following section reviews these three components of content: structure (/association), and flexibility (/strategy), and developmental evidence.

Content: What, Where, and When

The content component of EM concerns remembering information about events (What), locations (Where), and times (When). In general, these can be translated as item memory, spatial memory and temporal memory.

Studies agree that item memory steadily increases with age up until the eighth year. Beyond this age, some studies show a continued increase (Riggins 2014), others an increase from six to nine and then a plateau (Picard et al. 2012), and others age invariance (Ghetti and Angelini 2008). These differences likely reflect the different stimuli used (e.g., words vs. pictures), and task difficulty. For example, Keresztes et al. (2017) showed a quadratic development of item recognition for faces, while Daugherty et al. (2017) showed no development for word memory over a similar period (six to 27 and eight to 25, respectively). Other studies have demonstrated different developmental trajectories depending on the level of retrieval support (see “Flexibility: Strategic Remembering and Retrieval Support”).

Spatial memory appears to be more consistent, with most studies showing linearly increasing ability when sampling between 1 and 20 yr (e.g., Bauer et al. 2012; Ruggiero et al. 2016) with the exception of one study showing evidence of age invariance after 4 yr (Sluzenski et al. 2006).

Temporal memory lags behind item and spatial memory in the early years of life (e.g., Hayne and Imuta 2011; Scarf et al. 2017). However, results on development trajectories after this point have been largely inconsistent. In studies assessing relative recency, some studies have indicated no improvement in memory for item recency between 4 and 18 yr (Brown 1973), while others demonstrated improvement between 5 and 12 yr with age on similar tasks (Mathews and Fozard 1970; Von Wright 1973). Others have argued that different types of temporal memory judgments (relative recency vs. temporal position) develop at different rates, with recency judgements being more easily made by younger children (Friedman 1991, 2013). Memory for temporal location may not be reliable until the age of six (Friedman 1991) but appears to be relatively age invariant beyond this point (Friedman et al. 2010).

All three content features (item, spatial and temporal memory) are thought to rely to various degrees on the medial temporal lobe, but may differ in the extent and nature of hippocampal involvement, with spatial and temporal memory being particularly hippocampal (Burgess et al. 2002; Palombo and Verfaellie 2017). Given this, in the current study we might predict a more nonlinear pattern of development in temporal and spatial memory compared with item memory.

Structure: association

Clayton et al. (2003) emphasize that EM must not merely contain information on item, space, and time, but that this information must be structured as a bound representation. This association of elements is reflected in the “association component” described by Shing et al. (2010).

A large amount of neuroimaging data implicates the hippocampus as being critical for the association of item, spatial and temporal information (e.g., Cheke et al. 2017; Davachi and Wagner 2002; Konkel and Cohen 2009) in order to create a unique episode, which can be differentiated from other similar episodes (Devito and Eichenbaum 2010; Ergorul and Eichenbaum 2004). Given the hypothesis that more hippocampal-dependent elements are more likely to show nonlinear development in the teenage years, what evidence is there of nonlinear development in associative memory in adolescence?

Associative memory can be assessed in many ways: Usually tasks require the association of two features or stimuli, which may either be arbitrarily combined (e.g., two unrelated words presented together), or may form a more coherent unit (e.g., face–name, or a word written in a colored ink). Item–location associative memory has been shown to improve between 4 and 8 yr (Bauer et al. 2012; Sluzenski et al. 2006) even when accounting for memory for the individual elements. While the evidence seems to consistently report developmental change in associative memory through late childhood and adolescence, some report linear improvements (Daugherty et al. 2017) while others indicate a quadratic developmental trajectory (Keresztes et al. 2017; Lee et al. 2014) and most studies agree that performance in associative memory tasks are linked with maturity of the hippocampal formation.

A number of developmental studies investigating the association between item, spatial, and temporal information have been conducted in recent years. This “What–Where–When” (WWW) memory has been shown to improve with age between 2 and 7 yr (e.g., Cheke and Clayton 2015; Hayne and Imuta 2011; Huttenlocher et al. 2016), but few of these controlled for memory for the individual elements, and none (to our knowledge) extend this investigation beyond the age of 7 yr (although see P Guo, E Carey, K Plaisted-Grant, et al., in prep, for an investigation in middle childhood). Due to the established reliance on hippocampal function, we hypothesize that item–location–time (“What–Where–When”) associative memory will demonstrate nonlinear (cubic) development in the 10–17 age range, with the youngest and oldest adolescents being outperformed by those of intermediate age.

Flexibility: strategic remembering and retrieval support

A major source of development in memory from birth to adulthood appears to be in the degree to which retrieval is rigidly dependent on cues from the environment (Gee and Pipe 1995; Usher and Neisser 1993). Memory retrieval can occur as a reflexive response to a familiar stimuli (recognition), in response to external cues that trigger the retrieval of a memory (cued recall) or spontaneously, in response to internally generated cues (free recall). The third and final component of Clayton et al.’s (2003) model of EM is flexibility; the idea that a memory representation must be accessible through self-generated retrieval mechanisms, and available for flexible use in decision-making. Reducing the amount of retrieval support in the form of cues is thought to increase the necessity of episodic recollection, reflecting in evidence that individuals are more likely to report “remembering” items that have been freely recalled as compared with those that have been cued (Tulving 1985; Yonelinas 2002).

Age-related differences during early and middle childhood are more pronounced in situations where less retrieval support is provided (e.g., Paz-Alonso et al. 2009; Cheke and Clayton 2015). Free recall requires more self-initiation and therefore puts higher demand on frontal executive compared with cued recall or recognition (Craik and McDowd 1987; Shing et al. 2010). This self-initiation forms part of what Shing et al. (2010) describe as the “strategic” component of memory, which is concerned with searching, selecting and organizing memory features. This facilitates purposeful encoding strategies, as well as being important for “source monitoring”—that is, remembering the context in which information was learned—both of which demonstrate protracted development (Pressley and Schneider 1997; Keresztes et al. 2017). Like executive functions, with which they overlap, these strategic processes are highly dependent on the prefrontal cortex and in particular the dorsolateral prefrontal cortex (Achim and Lepage 2005; Badre and Wagner 2007; Blumenfeld et al. 2011). Shing et al. (2010) suggest that the framework for strategic memory is established from 10 to 13yr of age but may undergo a “transition period” in which the benefits of strategy use fail to materialize. To our knowledge, there has not been a previous investigation of the impact of retrieval support on memory performance across the adolescent years. If peak gray matter in the PFC implies that frontal-dependent processes should demonstrate a “dip” very early in the adolescent period (around age 10), we hypothesize that performance advantage afforded by increased retrieval support should gradually—and linearly—decrease during the teenage years,

Assessing multiple elements of EM in a single paradigm: the Treasure Hunt task

From the review above, it is clear that when considering the development of EM, this cannot be seen as a unitary ability, but a multifaceted cognitive process. Studies using different methodologies to assess particular elements of EM demonstrate variance in developmental trajectory (Cheke and Clayton 2013), and comparing between studies, it is difficult to ascertain whether differences seen were due to task demands or other influences. To understand the relative development of different component factors, it is important to investigate these within a single paradigm.

The present study examines the developmental trajectory of EM using a variant of the “Treasure Hunt task” (Cheke et al. 2016), a computer-based task in which participants are presented with scenes and asked to hide objects around the scenes on different days. Following the hiding phase, participants are prompted to remember what they hid (identify previously seen items), where (identify locations used) and when (identify item order) as well as what-where-when combinations (identify the location an item was hidden during a particular time period) with different levels of retrieval support. The Treasure Hunt task enables assessment of individual item, place and time memory ability (content) as well as the ability to integrate these into a single representation (structure/association) within the same paradigm, based on the same encoding phase. In addition, while keeping the encoding constant, the Treasure Hunt task permits manipulation of retrieval support (contrasting recognition and cued-recall tasks) such that flexibility/strategy can also be investigated. Neuroimaging investigation of this task has indicated that the association of elements, rather than individual elements alone elicited activation within the hippocampus and angular gyrus (Cheke et al. 2017). Successful associative memory, but not item memory, was also associated with activity in the dorsolateral prefrontal cortex (DLPFC), with activity in this area during retrieval being associated positively with integrated memory performance, and activity at both encoding and retrieval negatively correlating with binding errors. As such, this task is able to assess multiple elements of memory, as defined from both from a psychological and neuroscientific perspective.

A number of further features make the Treasure Hunt task an attractive tool for measuring EM. Participants are responsible for generating their own associations by hiding items themselves during the encoding phase. This makes encoding closer to “real life” than the arbitrary associations presented in other paradigms and recall is prompted nonverbally using simplistic cues, reducing confounds pertaining to verbal ability. The task has also been validated across a wide age range from middle childhood to old age (Cheke 2016; Silva et al. 2019; P Guo, E Carey, K Plaisted-Grant, et al., in prep.).

In the present study, we investigate multiple components of EM using the Treasure Hunt task in 80 adolescents aged 10–17 yr. Based on previous behavioral data, we predict that some elements of memory will demonstrate linear improvement during this period, while others may demonstrate nonlinear (cubic) development. Given the heterogeneity of previous findings it is difficult to predict the precise pattern of nonlinear development; however, they may broadly tie with the average timing of lobe-specific neural maturity. Peak gray matter volume (GMV) in the frontal lobe has been suggested to be achieved at ∼11 yr (Giedd et al. 1999), whereas peak GMV in the temporal lobe (and the hippocampus) occurs at 17 yr. Following the account that suggests that this cubic trajectory reflects synaptogenesis followed by synaptic pruning of obsolete connections (Peter 1979), we suggest that peak GMV may be reflected in inefficient cognitive performance (McGivern et al. 2002), which may then be followed by improvements as pruning progresses. Based on these timings, we therefore predict that during the 10–17 period, we should see broadly linear increases in performance with age when demands are placed on more frontal processes—for example, the strategic retrieval required with reduced retrieval support (represented in our data by the “support benefit” variable)—while a nonlinear (cubic) pattern may be seen with increased demand on hippocampal functions; that is, spatial, temporal, and associative memory (here represented by the “Where,” “When,” and “What–Where–When” tasks). Adolescence is a period of change on multiple levels, one of which is pubertal status. In our sample we are unable to independently investigate age and puberty due to the high relatedness of these variables. Instead, we present the main analyses twice, once with the whole sample, and once with only the postpubescent participants, this allows investigation of whether age-related patterns are present when variation due to puberty is reduced, or whether they are reliant on pubertal change per se.

Results

To correct for oversampling of older participants (see Fig. 6 ), a fractional weighting variable was created based on the expected population proportion for each age group (in years: 12.5%) such that all age groups contributed equally to the analysis. Analyses were then conducted across all participants and again separately, considering only the postpubescent participants. In addition to the regression analyses quoted in the text, all analyses conducted can be seen in Table 1 .

Table 1.

Regression analysis r 2 , P, and BF values for all regressions conducted on participant performance against age in months

An external file that holds a picture, illustration, etc. Object name is LM053264Mec_TB1.jpg

An external file that holds a picture, illustration, etc. Object name is LM053264Mec_F6.jpg

Number of participants in each age group

Overall EM performance across age

A repeated-measures ANOVA with within-subject factors of Support (two levels: High Support and Low Support) and Task (four levels: What, Where, When, and WWW) against age in months as a covariate reveals a significant main effect of task (F_(3,70) = 19.124, P < 0.001, η 2 = 0.450), a main effect of support (F_(1,72) = 10.89, P = 0.002, η 2 = 0.131) and a task × age interaction (F_(1,70) = 6.183, P = 0.001, η 2 = 0.209). However, there was no main effect of age (F_(1,72) = 9.507, P = 0.003, η 2 = 0.117), support × age (F_(1,72) = 0.46, P = 0.83,η 2 = 0.001), support × task (F_(2,70) = 0.975, P = 0.409, η 2 = 0.040), or support × task × age interaction (F_(1,70) = 1.873, P = 0.142, η 2 = 0.074). Overall, performance on all four tasks differed significantly from one another, with the “What” task attracting the highest scores, followed by “Where,” followed by “WWW” and finally the “When” tasks were found the most difficult (see Fig. 1 ). Overall “What” scores were significantly higher than all other tasks (all Ps < 0.001), When scores were significantly lower than all other tasks (all Ps < 0.001), and Where scores were significantly higher than WWW scores (P < 0.001). All these analyses survived correction for multiple comparisons. Overall, High Support scores were significantly higher than Low Support scores (P < 0.001). Finally, High Support tasks attracted significantly higher scores in the When task (F_(1,78) = 8.376, P = 0.005) but not for any of the other individual tasks (WWW: F_(1,78) = 0.041, P = 0.840; What: F_(1,78) = 0.125, P = 0.725; Where: F_(1,78) = 1.322, P = 0.254). The “What” task showed a considerable ceiling effect (38% of cases achieving top score). As such, this task was converted into a binary variable (top score/nonstop scores). Nonparametric analysis revealed no impact of support on this task (Wilcoxen, W = −0.164, P = 0.869). Repeating the repeated measures ANOVA without the “What” task did not change the pattern of results (with the possible exception of bringing the Support × Task × Age interaction up to a nonsignificant trend F_(2,71) = 2.850, P = 0.064, η 2 = 0.074).

An external file that holds a picture, illustration, etc. Object name is LM053264Mec_F1.jpg

Mean What, Where, When, and WWW scores in the High and Low Support versions of the task.

Content

Regression analysis of the three content elements What (as a binary variable), Where, and When (as continuous variables) against age in months was performed, modeling the data against linear and cubic trajectories ( Fig. 2 ). “What” score did not show a significant binary logistic regression with age (What: all participants β(0.008) < 0.001, P = 0.971), but cubic models could not be assessed. Cubic and linear models were nonsignificant for “Where” and “When” scores suggesting age-invariant performance ( Table 1 ). A JZS Bayesian linear regression with default priors suggested that there was anecdotal (BF₀₁ = 2.57) and moderate (BF₀₁ = 7.14) evidence for accepting the null hypothesis of no change with age for Where and When, respectively ( Table 1 ).

An external file that holds a picture, illustration, etc. Object name is LM053264Mec_F2.jpg

Binary “What” score against age in years with fractional weighting in all participants (i) and postpubescent participants (ii).

Structure/association

Regression analysis on associative memory (Integrated WWW score) demonstrated a significant cubic trajectory (cubic regression: all participants r 2 = 0.091, P = 0.026). The linear model also demonstrated significance, perhaps capturing the early improvement in performance, and Bayesian analysis suggested this indicated “extreme evidence” (integrated WWW: linear regression: all participants r 2 = 0.056, P = 0.035, BF₁₀ = 137.46). However, these analyses did not survive adjustment for multiple comparisons (Sidak α = 0.01563).

To control for memory for the individual elements, a measure of structuring difficulty was created, by subtracting the integrated score from the averaged content (What, Where, and When) scores. Regression analysis on structure difficulty score showed a significant cubic and linear trajectory across all participants, which survived multiple comparison adjustment (linear regression: all participants: r 2 = 0.083, P = 0.010; cubic regression: all participants: r 2 = 0.122, P = 0.007). Bayesian analysis of the linear model suggested extreme evidence to support an association (BF₁₀ = 100.64). These results suggest the greatest difficulty with associating multiple components in the youngest and oldest participants, and that this was not driven by individual content features ( Table 1 ; Fig. 4 ).

An external file that holds a picture, illustration, etc. Object name is LM053264Mec_F4.jpg

Associative memory (WWW) (A), nonintegrated scores (averaged What Where When scores) (B), and the structuring difficulty score (C) as a function of age across all participants (i) and postpubescent participants (ii) modeled with linear and cubic regressions. (*) Significant model fit, (**) significant fit model that survives multiple comparisons.

Flexibility/strategy

The degree to which participants benefited from retrieval support was investigated by calculating an average High Support and an average Low Support score (averaged content + WWW scores in the HS and LS format, respectively). Both linear and cubic regressions of the High Support score were significant when considering all participants (High Support score: linear: all participants: r 2 = 0.062, P = 0.026; cubic: all participants: r 2 = 0.078, P = 0.045). Although neither survived adjustment for multiple comparisons (Sidak α = 0.01563), the JZS Bayesian analysis suggested that there was strong evidence for the linear model (BF₁₀ = 20.58). Cubic and linear regressions of the Low Support score were both nonsignificant and Bayesian analysis suggested that there was anecdotal evidence to accept the null hypothesis (BF₀₁ = 1.39). Support benefit—that is, the degree to which performance was improved in the High Support relative to the Low Support task—was then calculated as the difference between the High and Low Support scores and regression analysis was performed. The support benefit did not appear to be modeled by either linear or cubic models and Bayesian analysis suggested that there was strong evidence to accept the null hypothesis that performance did not change with age (BF₀₁ = 10.64) (see Table 1 ; Fig. 5 ).

An external file that holds a picture, illustration, etc. Object name is LM053264Mec_F5.jpg

Overall scores on the High Support tasks (A), Low Support tasks (B), and support benefit (C). Regressions performed on all participants (i) and and postpubescent participants (ii) modeled against cubic and linear regressions. (*) Significant model fit, (**) significant fit model that survives multiple comparisons.

Postpubescent data analysis

Given the considerable impact of puberty on brain development, it is important to consider pubertal status. However, the overlap between age and pubertal status in this sample is high, rendering it impossible to compare prepubescent and postpubescent data independently of age. Instead, the same analyses are repeated on only the postpubescent data. This maintains the age range of greatest interest (12–18; N = 53) while reducing the confounding influence of pubertal status

Content: postpubescent cohort only

Regression analysis of the three content elements What (as a binary variable), Where, and When (as continuous variables) against age in months was performed, modeling the data against linear and cubic trajectories in the postpuberty cohort. The binary logistic regression of the “What” score was strengthened but remained nonsignificant when considering only postpubescent participants [postpuberty only β(0.019) = 0.037, P = 0.057] ( Fig. 2 ).

Models for “Where” performance remained nonsignificant in the postpuberty analysis. “When” performance demonstrated a significant cubic model when considering only postpubescent individuals, accounting for 13% of the variance (see Fig. 3 ) (When: cubic regression: postpuberty only: r 2 = 0.132, P = 0.046). However, this did not survive the adjustment for multiple comparisons (Sidak α = 0.03125). A JZS Bayesian linear regression with default priors suggested there was moderate and anecdotal evidence to accept the null hypothesis for Where (BF₀₁ = 3.88) and When (BF₀₁ = 1.42), respectively.

An external file that holds a picture, illustration, etc. Object name is LM053264Mec_F3.jpg

Where (A) and When (B) performance as a function of age in months in all participants (i) and postpubescent participants (ii) modeled against linear and cubic regressions.

Structure/association: postpubescent cohort only

The significant cubic model observed in the regression analysis of associative memory against age was strengthened when only considering postpubescent participants, surviving the adjustment for multiple comparison (cubic regression: postpuberty r 2 = 0.180, P = 0.014; Sidak α = 0.01563) suggesting a significant increase in early years in association performance and subsequent decrease later in adolescence. The linear model lost significance (r 2 = 0.011, P = 0.495), but Bayesian analysis suggested there remained strong evidence for the model (BF₁₀ = 14.35). Regression analysis of the structuring difficulty score lost significance in both the linear and cubic models (linear regression: postpuberty: r 2 = 0.002, P = 0.780; cubic regression: postpuberty: r 2 = 0.076, P = 0.179) and Bayesian analysis suggested there was anecdotal evidence to accept the null hypothesis (BF₀₁ = 2.28) ( Table 1 ; Fig. 4 ).

Flexibility/strategy: postpubescent cohort only

When considering only the postpubescent cohort, regression analysis of the average High Support score against age in months strengthened the cubic model, accounting for 26% of the variance and withstanding the correction for multiple comparisons (cubic regression: postpuberty: r 2 = 0.260, P = 0.001; Sidak α = 0.01563), whereas the linear model lost significance but remained “extreme evidence” for model according to Bayesian analysis (High Support score: postpuberty: r 2 = 0.074, P = 0.067, BF₁₀ = 2300.89). Regression analyses against the Low Support score and support benefit remained nonsignificant (see Table 1 ; Fig. 5 ). Bayesian analysis suggested there was anecdotal evidence to support a linear model for the low support task (BF₁₀ = 2.99) and moderate evidence to support the null hypothesis for support benefit (BF₀₁ = 7.35).

Strategy

Participants were asked to report on what strategies they used in the task. All but two participants (female 120 mo, male 179 mo) reported using strategies to aid memory. ANOVA (IV: strategy; DV: age) performed with the data weighted by age group showed no difference in strategy type used with age (F_(2,77) = 0.304, P = 0.583). There was no association between strategy type and performance (all Fs < 1).

Discussion

This study aimed to investigate the developmental trajectory of different elements of EM in a cross-sectional sample of children aged 10–17. We found that while EM appears to show both linear and nonlinear features over this age range depending on the aspect being tested, it was in general better characterized by a cubic model (particularly when there was a high level of retrieval support). The results are broadly consistent with the mixed previous research demonstrating both linear and nonlinear development over the teenage years. Furthermore, these findings tie in well with neurobiological evidence of different developmental trajectories for different neural areas within the EM network. Broadly speaking, the tasks that were predicted to be more hippocampal-dependent, such as temporal and associative (WWW) memory, were more likely to demonstrate (or be better predicted by) a cubic trajectory, with a peak at ∼15–16 followed by a considerable dip in performance at around the age of 17. This timeline reflects some previous behavioral findings (Keresztes et al. 2017) as well as the suggested period of peak gray matter volume of the hippocampus (Giedd et al. 1999).