Executive functions (EFs) are a set of cognitive processes that refer to executive control, that is, the ability to set goals, make decisions, define strategies, and direct and regulate behavior to achieve specific goals (Diamond, Reference Diamond2013; Friedman & Miyake, Reference Friedman and Miyake2017). This definition has been widely accepted in the last decade, despite there are different definitions of EFs looking for operationalize and clarify this term any better (e.g., Lezak et al., Reference Lezak, Howieson, Bigler and Tranel2012; Zelazo et al., Reference Zelazo, Müller, Frye and Marcovitch2003).
In this paper, Miyake et al. model (Reference Miyake, Friedman, Emerson, Witzki, Howerter and Wager2000) and Diamond proposal (Reference Diamond2013) will be used as a background for the results interpretation. According to these models, three core skills make up EFs: Working memory, which involves the ability to maintain and manipulate information; inhibition or inhibitory control, important for ignoring distracting elements and suppressing attention or behavior directed at something irrelevant; and cognitive flexibility, which involves the ability to thinking creatively in order to achieve a goal (Diamond, Reference Diamond2013; Friedman & Miyake, Reference Friedman and Miyake2017). According to the theoretical model of Diamond (Reference Diamond2013), these three core skills are related to higher-order components of EFs, such as reasoning, problem solving, and planning. This set of processes that make up EFs relate to the ability to prioritize, self-monitor, and organize actions (Friedman & Miyake, Reference Friedman and Miyake2017).
Other models have been proposed and there are still some aspects to be better understood, such as the definition and the role of inhibition, and other possible skills, as fluency and dual-task coordination (Friedman & Miyake, Reference Friedman and Miyake2017). However, despite the Miyake et al. model (Reference Miyake, Friedman, Emerson, Witzki, Howerter and Wager2000) is not comprehensive (because there are other EF components), or it is not even a proposition about elementary processes (since these core skills were chosen to examine unity and diversity of EFs) (Friedman & Miyake, Reference Friedman and Miyake2017), that models with the three core skills, as well as the complex skills, have been widely accepted in the research in the area (Zelazo et al., Reference Zelazo, Blair and Willoughby2017) and can facilitate the operationalization of assessment and intervention.
Alterations or underdevelopment of EFs can lead to difficulties in self-regulation of various behaviors in various age groups. In adulthood, they are related to work productivity impairment, alcohol abuse, drug abuse, and motor vehicle collisions; in children, they may cause reading and writing difficulties, learning difficulties in mathematics, and disruptive behaviors (Clements & Sarama, Reference Clements, Sarama, Fritz and Räsänen2019; Meixner et al., Reference Meixner, Warner, Lensing, Schiefele and Elsner2019; Pope et al., Reference Pope, Ross and Stavrinos2016).
Given the relevance of EFs, studies have been conducted with promising results seeking to stimulate such functions, especially in children and adolescents. As examples we could mention studies that sought to verify the cognitive stimulation through dance classes, karate, martial arts, tennis lessons, meditation, digital games, football training program, educational robotics (Harwood et al., Reference Harwood, Lavidor and Rassovsky2017; Homer et al., Reference Homer, Plass, Raffaele, Ober and Ali2018; Kiani et al., Reference Kiani, Hadianfard and Mitchell2017; Kullina et al., Reference Kulinna, Stylianou, Dyson, Banville, Dryden and Colby2018).
One of the cognitive techniques for rehabilitation of EFs is the Goal Management Training approach (GMT), which was developed by Robertson in Reference Robertson1996 (Levine et al., Reference Levine, Robertson, Clare, Carter, Hong, Wilson and Stuss2000; Robertson, Reference Robertson1996) from theories about the functions of the prefrontal cortex based on Duncan’s goal neglect theory (Reference Duncan1986). The GMT approach follows a multi-module protocol in which strategies are trained while a cognitive activity is performed. Its main purpose is to train the individual to stop his or her automatic behavior in order to set goals and to hierarchically plan the steps to achieve them.
Thus, GMT studies follow a protocol of predefined stages with specific topics to be worked on in each session. The stages consist of: (a) Stop and ask: What am I doing?; (b) check the mental blackboard; (c) define the main task or goal to be met; (d) list the steps or stages required to meet a goal; (e) learn the steps; (f) do the steps; (g) check if you did what you planned (Robertson, Reference Robertson1996).
In addition, some studies were analyzed to development this protocol based on GMT (Grant et al., Reference Grant, Ponsford and Bennett2012; Levine et al., Reference Levine, Robertson, Clare, Carter, Hong, Wilson and Stuss2000; Tornås et al., Reference Tornås, Løvstad, Solbakk, Evans, Endestad, Hol, Schanke and Stubberud2016). Because it is a sample of adolescents, the use of metaphor “check the mental blackbord” to the training of working memory was changed to “check the mental tablet” and the presentations were done by Prezi program for give more movement and improve the interesting of the young sample.
The technique has also been applied to individuals with other conditions or age groups than adults with brain injury. A case study with a patient with schizophrenia sought to ecologically verify the benefits of GMT in meal preparation, laundry, and meeting preparation, noting significant improvements in the questionnaires answered by the caregiver (Levaux et al., Reference Levaux, Larøi, Malmedier, Offerlin-Meyer, Danion and van der Linden2012). Another recent study was conducted with three young college students without previous diagnosis, and revealed improvements after the intervention in their ability to set goals and plan the steps to achieve them (Rivera et al., Reference Rivera, Wood, James and Williams2019). Another study reported the case of an 11-year-old teenager with ADHD symptoms and complaints of executive dysfunctions in the social environment. The results suggested benefits of GMT in attention and executive functions, but when combined with another type of intervention (Nicoló, Reference Nicoló2015). There are also published studies with young university students with complaints of executive dysfunction in their social environment (Carstens, Reference Carstens2016) and children with traumatic brain injury (Krasny-Pacini et al., Reference Krasny-Pacini, Limond, Evans, Hiebel, Bendjelida and Chevignard2014).
Despite advances in the use of the GMT technique, only a still few studies with adolescents are reported, especially without specific diagnoses of neurodevelopmental injuries or disorders. It is essential to think of such a population that, despite the lack of diagnosis, may present with subclinical difficulties in executive functions. Since such functions, as previously described, predict physical health, substance dependence, personal finances, and criminal offending outcomes, their development is essential to the health, wealth, and public safety of a population (Moffitt et al., Reference Moffitt, Caspi, Taylor, Kokaua, Milne, Polanczyk and Poulton2010). Thus, executive function difficulties in adolescence can have important lifetime consequences (Moffitt et al., Reference Moffitt, Caspi, Taylor, Kokaua, Milne, Polanczyk and Poulton2010).
Thus, based on the literature suggesting gains with the application of the GMT technique and highlighting the importance of the development of executive functions in adolescence, the aims of this study were to develop a program based on GMT training and to investigate its effectiveness among adolescents with executive function deficits, through formal instruments and an ecological task. We expected greater gains in the experimental group, submitted to GMT, on formal EFs tests, as well as on an ecological task, in comparison to the active control group.
Method
Participants
Participants were recruited from private and public schools in Três Lagoas, state of Mato Grosso do Sul, Brazil. We contacted 7 coordinators and 35 teachers from 5 different schools to talk about all their 700 students. Students with executive functioning issues according to parents’ and teachers’ report were selected, as described below.
Participant selection criteria. The inclusion criteria were students between 11 and 17 years, from private and public schools in Três Lagoas, Brazil, with difficulties in executive functions, according to both teachers’ and parents’ reports, as explained above.
We thoroughly explained the concept of executive functions to the 7 coordinators and 35 teachers in an initial 2-hour meeting. They were given descriptions of EF based on examples that could be observed in the classroom environment.
Then teachers and coordinators were asked – based on those descriptions and coexistence in school context – to indicate all students between 11 and 17 years with executive functioning complaints (including working memory, inhibitory control, flexibility, and planning issues). Of the 700 students, 102 were indicated. Then students’ teachers and parents responded to the Executive Function, Regulation, and Delay Aversion Inventory (Inventário de Funções Executivas, Regulação e Aversão ao adiamento – IFERA), described in the Assessment Tools section. In the following stage of the study, only those participants who showed IFERA difficulty indices above the expected average were included, according to both teachers’ and parents’ reports. A total of 53 participants had scores higher than 80 for both the parent and teacher scales (the maximum scale score is 140 and the average score in Brazilian samples is 77.3 points, according to Trevisan, Reference Trevisan2014). The 53 participating students corresponded to 7.5% of the initial 700 students and 51% of the 102 students initially indicated by teachers.
Exclusion criteria. The 53 students with scores indicating difficulties above the expected average in IFERA, according to both teachers’ and parents’ reports, underwent a neuropsychological assessment to analyze the other exclusion criteria. Participants who had previous psychotic symptoms (1 participant), IQ below 70 (26 participants), or any type of brain injury as reported by the school or their guardians (1 participant) were excluded. This selection resulted in a final sample of 25 participants (corresponding to approximately 3.6% of the initial 700 students).
Thus, IQ below 70 was used as exclusion criteria to avoid that participants with difficult in comprehension and realization of activities participated of research. For that, we used the abbreviated measure of IQ, with subtests of vocabulary and matrix reasoning of WASI, as explained in Instruments. In addition, T-scores for vocabulary and matrix reasoning subtests were used in analyses of effect of this intervention to check if the GMT would produce improve in these variables.
We established no prior criteria regarding the origin of the difficulties, so that participants with a known or unknown etiology of their difficulties were considered for the study. The subjects were not submitted to any other neuropsychological intervention program before or during our intervention. Four participants had known etiologies, namely: Attention deficit hyperactivity disorder (2 participants), high-functioning autism (1 participant), and oppositional defiant disorder (1 participant), all with medical and neuropsychological diagnoses. Although these participants reported using medication for at least 2 years, the complaints persisted. The remaining 21 participants reported unknown etiology and no medication use to minimize the disorder.
Study Design
The 25 participants were first divided into 2 groups according to age group: G1 (11–14) and G2 (15–17). G1 was then randomly divided (by drawing names) into an experimental group (EG1), which was submitted to GMT intervention, and a control group (CG1) that underwent psychoeducation intervention. G2 was also divided via random draw into experimental (EG2) and control (CG2). Two participants in group EG1 had a diagnosis of attention deficit disorder, 1 participant in group EG2 had a diagnosis of autism spectrum disorder and 1 participant in CG1 had a diagnosis of oppositional defiant disorder. It is noteworthy that parents and teachers were unaware of the group to which the child was allocated (EG or CG). Figure 1 shows a flowchart with the research group division.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210320113522016-0534:S1138741621000172:S1138741621000172_fig1.png?pub-status=live)
Figure 1. Flowchart with Research Group Division
Assessment Tools
For IQ assessment, we used the Brazilian Portuguese version of the Wechsler Abbreviated Scale of Intelligence (WASI; Wechsler, Reference Wechsler1999) using T-scores for the variables vocabulary, matrix reasoning, and full scale IQ (test, evidence of validity and reliability in the Brazilian manual by Trentini et al., Reference Trentini, Yates and Heck2014). Such measures were used as exclusion criteria and also to measure possible changes throughout the three measures. To assess executive functions, we used neuropsychological tests, parent/teacher reports, and an ecological task, as described below.
For the selection of participants, we also applied the Executive Function, Regulation, and Delay Aversion Inventory (Inventário de Funções Executivas, Regulação e Aversão ao adiamento – IFERA, Trevisan, Reference Trevisan2014) to parents and teachers to assess their perceptions of students’ executive functioning in everyday and classroom environments. This inventory aims to assess executive functioning in an ecological manner and includes items that assess state regulation and delay aversion (some examples include “When you are in the middle of an activity, you often forget what you were doing” or “Do things without thinking first about what might happen”). Answers are on a 5-point likert scale ranging from never (1) to always (5) where responses with higher scores indicate greater reported difficulties. The measure used was the total raw score of the inventory. As problems with executive functioning are not expected to decrease with age, since the comparison is made with the expected average for age, the use of raw scores is justified despite the participants’ age difference. There are evidences of validity and reliability for Brazilian population (Trevisan, Reference Trevisan2014; Trevisan, et al., Reference Trevisan, Berberian, Dias, Alves and Seabrain press). The inventory has evidences of content validity, as assessed by experts with experience in clinical neuropsychology and psychometrics; evidences for confirmatory factor analysis, with acceptable five factors model; evidences for relation to other instruments (SNAP–IV and CHEXI); discrimination between children with and without Attentional-deficit/hyperactivity disorder; and good reliability indexes (Trevisan, Reference Trevisan2014; Trevisan et al., Reference Trevisan, Berberian, Dias, Alves and Seabrain press).
Among the neuropsychological tests, we used the Five Digit Test – FDT to assess participants’ cognitive flexibility and inhibitory control (test, evidence of validity and reliability in Sedó et al., Reference Sedó, de Paula and Malloy-Diniz2015). The test has evidences of validity, reliability and normative scores for Brazilian population and presents a numerical Stroop paradigm in four parts. Parts 1 and 2 assess automatic processes: The examinee is supposed to read Arabic numerals in a screen (1, 2, 3, 4, 5) and count asterisks (1–5 asterisks). In Part 3, the examinee should identify the amount of numbers (instead of reading the numbers). In Part 4, the task changes between the rules given on Part 1 and Part 3, depending on an explicit clue. There are two executive function measures: Inhibition (Part 3 minus Part 1) and flexibility (Part 4 minus Part 2), calculated in terms of score and time to task completion. Percentile was used for score measurements and, for time measurement, we converted the results to t score (Strauss et al., Reference Strauss, Sherman and Spreen2006).
To assess working memory we used the WISC–IV Digits subtest only in the backward digit sequence (test, Brazilian standards, evidence of validity and reliability in Wechsler, Reference Wechsler2013). In this test, the researcher read a sequence of numbers and the participant had to verbally reproduce these digits indirectly. Results were converted to t-score.
The Tower of London test – ToL (Brazilian standards, evidence of validity and reliability in Menezes et al., Reference Menezes, Dias, Seabra, Seabra and Dias2012) assessed participants’ planning ability. The test consists of a board with three vertical rods on which four different colored beads of the same size are placed. The task is to move the colored beads to reproduce the position shown in a stimulus book in as few moves as possible. Planning skill is considered a complex EF because the individual needs to use several basic EFs to solve the task, such as setting the goal and the steps to achieve it, staying active during task resolution – inhibiting distracting stimuli – and using cognitive flexibility when each card presented requires a new way of thinking about the goal. To measure the results of the Tower of London test we used the standard score according to the Brazilian norms (with a mean of 100 and a standard deviation of 15).
The ecological task to assess executive functioning consisted of a cooking activity, which was performed in a kitchen in the clinic where the research was conducted. The students were supposed to make a cookie (no baking required, only the proper mixing of ingredients). To do so, they were given a recipe without the “ingredients” section. Therefore, the ingredients were not described separately, but throughout the recipe execution steps. During the activity, all the necessary ingredients were available on the counter where students worked, as well as other unnecessary ingredients that were added as distractors. The participant was instructed to follow the recipe and make it as quickly as possible. The following variables were measured: total execution time, total errors (errors per action, such as using ingredients not needed or in a larger quantity; and errors of omission, such as not adding required ingredients to the recipe or using them in smaller quantities), number of actions (number of total actions taken by the participant from the beginning to end), and recipe verification (number of times the participant reviews the recipe). There are no normative data that quantitatively measure the gains of this intervention through this activity. For this reason, we analyzed changes in the number of times participants consulted the recipe (action considered indicative of working memory use), number of errors (considered indicative of inhibition), and time (management of actions, which was an analysis used in a previous study conducted by Levaux et al., Reference Levaux, Larøi, Malmedier, Offerlin-Meyer, Danion and van der Linden2012).
Intervention Tools
GMT technique. The experimental groups (EG1 and EG2) underwent 8 intervention sessions. The development of the activities was based on GMT principles (Levine et al., Reference Levine, Robertson, Clare, Carter, Hong, Wilson and Stuss2000). For the animations and visual examples we used PREZI computer presentations. The first session consisted of general information about the training and identification of situations that place high demands on EF in daily tasks. In the second session, participants worked on the concept of autopilot. The concept refers to behaviors that emerge naturally, automatically, without conscious control over them. In the third session, we explored the concept of working memory through examples of daily life using the “mental tablet” metaphor. In the fourth session, participants were instructed to set goals, keep them in their working memory (mental tablet) and resort to them whenever necessary. In the fifth session, we addressed the issue of decision making, which is necessary when we have two competing goals that have the same weight of obligation. In the sixth and seventh sessions, we explored complex goals, meaning in these sessions participants were trained to divide a complex task into several smaller tasks to cope with complex goals. Finally, in the eighth session, we proposed a complex ecological activity, with the objective of making participants apply the strategies learned in an Internet search in order to get a response for the task. The task consisted of a fictitious situation in which participants simulated purchases for their relatives with a certain amount of play money and with the help of the researcher. Internet searches were based on tips on what family members needed and liked. The tasks involved: 1) Finding a gift for a newly married older sister who loves blue shoes as well as cooking; 2) finding a gift for a brother who is very fond of chocolates but has started a diet and needs to practice sports to lose weight; 3) finding a gift to the twin brothers who have just come out of diapers and will go to their first day of kindergarten; 4) finding something for mom who likes photography and reading, but currently has no space to place more photos at home; 5) finding something for father who is a truck driver – therefore, who is almost never home – and loves his family.
In addition to group activities, participants were given homework assignments in each section. They were to report everyday situations in which session techniques were used. Such reports should be sent to the researcher’s cell phone via text messages, videos, or voice messages. Table 1 presents a summary of the sessions.
Table 1. Plan of the Sessions for GMT and Control Groups
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210320113522016-0534:S1138741621000172:S1138741621000172_tab1.png?pub-status=live)
Psychoeducation. Both control groups (CG1 and CG2) underwent a psychoeducational intervention that corresponded to the time of administration and the collective application of the GMT technique. All participants had access to computer-presented educational materials and group discussions on some age-appropriate topics. Thus, topics for discussions about everyday issues by adolescents were addressed, such as bullying, self-awareness, vandalism, inclusive education, technology overwhelm, gender equality, and teenage pregnancy. Similar to EGs, homework was assigned after each session. Participants were to report on the subjects that were covered in the sessions and send them to the researcher’s cell phone via text messages, videos, or voice messages. Table 1 presents a summary of the sessions.
Procedure
After approval of the project by the Research Ethics Committee of our University, and after authorization of responsible and consent of the adolescents using the informed consent forms, the baseline assessment was initially performed. Then, the G1E and G2E groups received GMT intervention in 8 sessions, 60-minute each, which were held in three weeks. Concomitantly, groups G1C and G2C underwent interventions of 8 psychoeducation sessions, 60-minute each. After the intervention period, all participants were reevaluated with the same measures used in the pre-intervention. After a period of 4 weeks of the intervention period, all participants went through the same follow-up process again. The evaluations and interventions were performed in a private psychological clinic in the city of Três Lagoas - MS.
Data Analysis
In the study design, there were a between-subject variable (group: EG and CG, with grouping by age) and a within-subject variable (assessment time: Pre-intervention, post-intervention, and follow-up). Our dependent variable were the measures in the tasks, as described in Instruments section.
Considering the small sample size (n = 25) and the arbitrary distribution of the data, we conducted a series of nonparametric mixed analysis of variance (ANOVA), based on rank estimates. This approach enables to compare between- and within-groups temporal effects, and their interaction even with small samples sizes or presence of outliers, due to its rank-based robust statistics (Noguchi et al., Reference Noguchi, Gel, Brunner and Konietschke2012). The ANOVAs were implemented through nparLD (nonparametric longitudinal designs; Noguchi et al., Reference Noguchi, Gel, Brunner and Konietschke2012) package, at the R statistical environment, which allow Fisher-liked hypothesis testing with an adjusted estimator for model´s degrees of freedom. In this analysis, we focused on the interaction between group and assessment time in order to verify if the EG would have a different progress than the CG.
Additionally, differences for the different measures were calculated, considering Difference 1 (post-test result minus pretest result) and Difference 2 (follow-up result minus pretest result). Due to the small number of subjects in the analyzes, we used Hegdes’ g to appropriate measure the effect size of the gain scores in order to compare the differences in EG with the differences in CG. Effect sizes were interpreted as follows up g from .20 = no effect; g from .20 to .30 = small effect; g from .40 to .70 = medium effect; g greater than or equal to .80 = large effect (Kline, Reference Kline2004). It is noteworthy that, in our analysis, negative g values were indicative that the difference was greater – in terms of raw scores – for EG than for CG.
Results
Table 2 summarizes the results obtained, with the values of Differences 1 and 2 for the control (C) and experimental (E) groups, and the effect size (g) of the comparison of such gain scores between the groups. The table also shows the results of the Assessment Time x Group Interaction obtained with mixed ANOVA (F-value, p-value).
Table 2. Results obtained, with the Values of Differences 1 (Post-test – Pre-test) and 2 (Follow-up – Pretest) and 2 for the Control (C) and Experimental (E) Groups, Effect Size (g) of the Comparison of the Gain Scores, and Results of the Time x Group Interaction after Mixed Non-Parametric ANOVA
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210320113522016-0534:S1138741621000172:S1138741621000172_tab2.png?pub-status=live)
Note. The cells in bold refer to significant values
Table 3 summarizes the results obtained.
Table 3. Summary of Non-parametric Anova Results (Interaction between Time and Group) and Size Effects for Differences 1 (Post-test – Pre-test) and 2 (Follow-up – Pretest)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210320113522016-0534:S1138741621000172:S1138741621000172_tab3.png?pub-status=live)
Note. Gray cells in effect sizes indicate measures in which CG gains were greater than EG gains
ANOVA results revealed significant effects of the intervention in terms of interaction between time and group in five of the 14 measures assessed. Analyzing the effect size of the differences, we observed medium or large effect sizes in several measures, namely, seven measures for Difference 1 and nine measures for Difference 2.
Regarding WASI scores, we observed a significant time-group interaction for the estimated IQ. The EG showed greater gains than the CG soon after the intervention and also at follow-up, with a large effect size for Difference 1 and medium effect size for Difference 2, according to Hedges’ g. These gains were leveraged especially by Matrix Reasoning scores in Difference 1, where the EG had a higher gain than the CG (3.85 versus .50, respectively), and by Vocabulary, in difference 2, where the EG had higher gain than the CG (.62 versus –.67, respectively). For the estimated IQ, the EG gain in Difference 2 was lower than in Difference 1, suggesting that changes soon after the intervention decreased over time. There was significant Time x Group for IQ and Matrix Reasoning, with higher gains for EG in both measures.
In the measurement of working memory, as assessed by the WISC scale in the backward digit sequence, we observed no significant interaction between assessment time and group. However, there was an average effect size in favor of the EG in the first difference, suggesting that, right after the intervention, the EG showed higher gains than the CG. On the other hand, the opposite occurred in the second difference, that is, the CG revealed higher gains. Thus, there seems to have been no significant long-term differences in this measure after the intervention.
The FDT revealed important results: Out of the eight measures, four had large effect sizes and three had medium effect sizes. There was also significant Time x Group interactions for time measures in flexibility and inhibition, and for score in flexibility. In terms of means, EG showed greater gains in flexibility (Difference 1) and inhibition scores (Differences 1 and 2), and CG exhibited greater gain in flexibility score (Difference 2). Still in relation to the means, the EG showed greater time gain (that is, participants took longer to respond) in all FDT measurements. It is noteworthy that, regarding time, there was significant interaction between time and group. That is, the EG started to exhibit a longer execution time, which was accompanied by more accurate performances, especially in flexibility.
The Tower of London, which measured planning ability, revealed no significant interaction between assessment time and group, but showed an average effect size in the Hedges’ analysis on Difference 2. The effect size had been insignificant for the Tower of London shortly after the intervention, which may suggest that participants may have continued to use planning strategies and, consequently, the effect size increased over time, detected at follow-up.
In the cooking activity, which aimed to verify executive functions in a real-life situation, we observed a significant interaction between assessment time and group for time measure in the execution of the recipe and number of times participants referred to the recipe. We observed that the EG decreased the number of recipe verification, which reduced the total time to perform the activity, that is, the performance has become more efficient. Moreover, regarding Hedges’ analysis, the effect sizes were insignificant or small for errors (per action or omission), but with a medium to large effect size for the other measures. That is, EG participants were more effective in memorizing the recipe before executing it, which reduced execution time. However, we observed no differences in CG in terms of behavior inhibition, measured by errors per action and of omission.
Discussion
The aim of the present study was to evaluate the effectiveness of Goal Management Training in adolescents with executive functioning issues through formal instruments and an ecological task. The results revealed gains for the experimental group in the IQ, flexibility, inhibition, and ecological task tests, regarding significant assessment Time x Group Interaction. Moreover, measurement of all tasks revealed medium or large effect sizes.
Thus, initially, after the intervention (Difference 1), by analyzing medium or large effect sizes, EG participants had greater gains in matrix reasoning skills, estimated IQ, longer time spent on flexibility and inhibition on FDT, higher flexibility scores on FDT, shorter performance time, and fewer verification at the recipe task. The analysis of Difference 2 revealed the maintenance of the gains after the intervention was completed. Considering medium or large effect sizes, the EG had greater gains in vocabulary, estimated IQ, longer time spent on flexibility and inhibition on FDT, higher inhibition scores on FDT, higher Tower of London score, shorter performance time, and fewer recipe consultations. It is noteworthy that only one measure revealed a relevant effect size in favor of the control group, namely, the flexibility score on FDT at Difference 2. In addition, there was significant interaction Time x Group for estimate IQ, reasoning, flexibility (time and score), inhibitory control (time), recipe verification and time in ecological task.
Therefore, overall, the results suggest that the 8-session GMT training was able to promote greater gains in these skills than the psychoeducation performed with the control group. Our findings were consistent with previous studies that reported greater benefits of inhibitory control, sustained attention (Novakovic-Agopian et al., Reference Novakovic-Agopian, Kornblith, Abrams, Burciaga-Rosales, Loya, D’Esposito and Chen2018), planning (Tornås et al., Reference Tornås, Løvstad, Solbakk, Evans, Endestad, Hol, Schanke and Stubberud2016), and time (measured by the Tower Test; Stubberud et al., Reference Stubberud, Langenbahn, Levine, Stanghelle and Schanke2014).
We observed important changes in various inhibition skills. Such changes probably occurred because inhibitory control ability is already trained in early GMT sessions, when the concept of “turning off the autopilot” in cognitive activities and day-to-day tasks is presented to participants. In the present study, it is noteworthy that higher EG gains in inhibition score measures were accompanied by longer execution time in the activity, which may suggest that participants began to think more critically in the face of unusual situations that required voluntary responses. That is, in this case, since the tasks were difficult and not automated, the increase in time can be considered a gain based on the intervention procedure used, which has as one of the key principles to turn off the autopilot (Levine et al., Reference Levine, Robertson, Clare, Carter, Hong, Wilson and Stuss2000). Several other studies also showed improvements in participants’ inhibition (Cuberos-Urbano et al., Reference Cuberos-Urbano, Caracuel, Valls-Serrano, García-Mochón, Gracey and Verdejo-García2018; Levaux et al., Reference Levaux, Larøi, Malmedier, Offerlin-Meyer, Danion and van der Linden2012).
Regarding cognitive flexibility, the present study revealed greater score gains in the EG only for Difference 1, while the CG revealed greater gains in Difference 2. Despite CG had greater score gains than EG in one of the differences (flexibility score at Difference 2), there was significant interaction between moment and group for flexibility scores and for flexibility time. That is, the score gain in the EG for Difference 1 was accompanied by longer execution time in both differences, which may suggest that participants began to think more critically in situations that required cognitive flexibility, as occurred in relation to inhibition. Gains in flexibility were found in some previous studies, but with some inconsistence. For example, although Cuberos-Urbano et al. (Reference Cuberos-Urbano, Caracuel, Valls-Serrano, García-Mochón, Gracey and Verdejo-García2018) showed benefits of GMT in the flexibility of its participants, some methodological limitations were observed in that study, such as the very small sample size of only 15 participants and the non-use of a control group. On the other hand, although the study performed by Tornås et al. (Reference Tornås, Løvstad, Solbakk, Evans, Endestad, Hol, Schanke and Stubberud2016) had a larger sample, it showed no improvement in flexibility skills. We hypothesize that, although the core of GMT is in fact inhibition, because this ability is fostered since the beginning of the intervention and because it is present in all GMT sessions, the flexibility was essential in our procedure to divide a complex task into smaller tasks is worked out, supporting the development of higher mental functions related to planning and problem-solving.
About working memory, we observed no significant differences in the results between the groups in the traditional digit test, in which there was no significant interaction between assessment time and group, with only small effect size in favor of the EG in Difference 1 and in favor of the CG in Difference 2. However, we observed a gain and significative effect in working memory proposed scores for the ecological activity in relation to the number of times participants needed refer to the recipe for its execution. Thus, the EG revealed a greater reduction than the CG in the number of recipe verification, both in Difference 1 (with a large effect size) and Difference 2 (with a medium effect size) and significant interactions between Time x Group. Thus, we can hypothesize that the techniques stimulated in the intervention, such as the use of the mental tablet, may have favored such performance in the EG compared to the CG.
The planning activity, assessed by the Tower of London test, is characterized as complex EF as it requires the joint use of core EFs (Diamond, Reference Diamond2013). In this task, although we observed no significant interaction between assessment time and group, and the effect size was non-significant in Difference 1, there was an average effect size in Difference 2 in favor of the EG. That is, EG participants had greater gain scores than the CG in the Tower of London test from pretest to follow-up. It is noteworthy that, according to Diamond’s (Reference Diamond2013) executive function theory, the so-called core executive functions – which include inhibition, working memory, and cognitive flexibility – when working together, enable the execution of complex executive functions, such as planning. Based on the results presented here, we can hypothesize that the change in effect size (from non-significant in Difference 1 to medium in Difference 2) is related to the complexity of the task, that is, it is possible that, because it is a more complex skill, planning is developed after the most basic skills.
It is important to note that the benefits generated by the more traditional measures (inhibition and flexibility) were not verified in the recipe ecological task. Thus, our findings are in line with the research by Carstens (Reference Carstens2016), who also sought to verify the use of the technique in young individuals only with complaints of executive dysfunction. It may be that in non-chronic individuals, behavioral changes may be milder and barely noticeable.
Besides the gains evidenced in relation to executive function measurements, we also highlight a significant interaction between assessment time and group for the estimated IQ. Moreover, the EG showed significantly greater gains than the CG. This result is interesting because, a priori, it is not expected that the intervention in EF significantly changes IQ. However, we highlight the important relationship between EF and intelligence that can be outlined, for example, from Primi’s (Reference Primi2002) research. According to that study, intelligence is related to the ability to find a goal and subsequently inhibit irrelevant stimuli. Thus, the ability to inhibit, which is fostered in GMT, would play an important role in the resolution of intelligence tests. Additionally, it is noteworthy that previous studies have also suggested the relationship between fluid intelligence and working memory, and that working memory stimulation, one of the components of EF, is associated with increased performance in fluid intelligence tests (Au et al., Reference Au, Buschkuehl, Duncan and Jaeggi2016). Thus, stimulating the basic components of EF can have an important effect on intelligence test performances.
In summary, the main results showed evidence of the effectiveness of GMT treatment for adolescents with complaints of executive dysfunction according to parents’ and teachers’ reports. However, limitations of this study should be highlighted. Firstly, our study sample was small, which generates some sample bias or may contribute to the absence of significant effects on ANOVA, although mean trends have been identified. The second limitation is that although the study was randomized in a second stage, it was not a double-blind study in its assessment and intervention phases. A third limitation is the short follow-up time for the study, as the 4-week period may not be sufficient to predict the prolonged gains of the intervention. Additionally, two participants in this research, who were randomly included in the experimental group, were using methylphenidate throughout the intervention process. This is an important variable in the stimulation process, as the gains from medication use could be enhanced with the GMT stimulation.
Thus, future studies could include more sessions and with longer duration, a larger number of participants, training tasks closer to activities of daily living that include executive functions, and a longer follow-up. It is noteworthy that, despite the differences in cognitive abilities measured here – related to working memory, inhibitory control, planning, and flexibility – there is no predictive benefit in the ability to make decisions in adolescents, as the socio-emotional aspects related to peer influence, emotional control, and the reward process were not the focus of this study. Since decision making has been treated as the product of cognitive and socio-emotional aspects, and its deficits associated with risky behavioral patterns in adolescence – such as high mortality rates, toxic abuse, and mental disorders – interventions that focus both on cognitive and socio-emotional skills could be beneficial to this population (Defoe et al., Reference Defoe, Dubas, Figner and van Aken2015; Moffitt et al., Reference Moffitt, Caspi, Taylor, Kokaua, Milne, Polanczyk and Poulton2010; Zelazo & Carlson, Reference Zelazo and Carlson2012).
Despite the limitations, the study allowed the preliminary development and investigation of an intervention proposal that aims to improve working memory, inhibitory control, flexibility, and planning skills in adolescents. The results, albeit modest, seem to indicate a promising path for further research in this area.