Development involves synergistic interplay among genotypes and the physical and sociocultural environments. These influences are interdependent, with reciprocal causal pathways and feedback, correlations, and interactions across levels of analysis. Until recently, the ability to study such transactions has been limited by technology. With the completion of the Human Genome Project (International Human Genome Sequencing Consortium, 2004), development of new methods of measuring brain architecture and functioning (Roalf & Gur, 2017), and the birth of new fields of science (e.g., epigenetics and proteomics), levels of analysis that previously existed in “the black box” can now be measured and developmental theories integrating genetic and sociocultural levels of analysis can be rigorously tested. Such research has the potential not only to advance our understanding of child development but also to improve the design and testing of interventions. Because individuals vary in their sensitivity to psychosocial interventions for reasons including their genotype, understanding genetic influences on intervention response is critical for an accurate judgment of efficacy. However, genetically informed intervention research is still rare and largely based on a handful of candidate genes (e.g., Bakermans-Kranenburg & van IJzendoorn, 2015). It is likely that psychosocial interventions have large effects on developmental psychopathology for some children but no significant effects for others. When we ignore developmental theory and consider only main effects of intervention, effectively averaging responses across all individuals in the sample, we may erroneously conclude that interventions have small and sometimes nonsignificant effects. A tailored and personalized medicine approach is needed to identify which interventions work for which people.
Further, genetically informed research as a whole still lags behind in the area of cultural and ethnic diversity, with most studies relying on samples of Northern European descent and extremely limited representation of Latino, Indigenous, and African or African American participants (Popejoy & Fullerton, 2016). Given that both genetic and environmental influences can differ across populations, genetics research based on participants from a narrow range of cultures and ethnicities cannot be assumed to generalize to humanity as a whole, and in particular, to racial and ethnic minorities who are often left out of research participation (Oquendo, Canino, Lehner, & Licinio, 2010).
We address these limitations by testing genetic moderation of the effects of a parenting-based intervention on internalizing symptoms in a racially/ethnically diverse sample of US children recruited based on economic and family risk, using a polygenic score based on an existing genome-wide association study intended to tap sensitivity to the environment (Keers et al., 2016). Participants took part in a randomized controlled trial of the Family Check-Up, a family-based intervention that incorporates motivational interviewing and has been successfully utilized to reduce child problem behavior across a wide range of socioeconomic and sociocultural groups (Dishion, Kavanagh, Schneiger, Nelson, & Kaufman, 2002; Dishion et al., 2008; Shaw, Dishion, Supplee, Gardner, & Arnds, 2006). We indexed genetic sensitivity to the environment using a polygenic score based on genetic variants that were associated with identical twin differences in internalizing symptoms in a previous genome-wide association study (Keers et al., 2016). Because it was derived from the prediction of identical twin differences, this score is an aggregation, not of small genetic main effects, but of variants associated with the magnitude of response to nonshared environmental factors across all environments the twins encounter, in essence acting as a genetic index of sensitivity to the environment broadly defined. Accordingly, we hypothesized that the Family Check-Up intervention would have stronger effects on internalizing psychopathology for children who were higher on the polygenic sensitivity score.
Theoretical Background
Developmental theories suggest that we cannot understand genetic influences on individual behavior and outcomes without simultaneous consideration of sociocultural influences, and conversely, that we cannot understand sociocultural influences without understanding genetic influences. Yet cultural and genetic fields of study are still very much distinct, with few collaborations between cultural and genetic scientists, and no established training programs that combine these theoretical perspectives and methods. Behavioral genetics focuses on how genotypes are related to phenotypes, with phenotypes defined as observable characteristics and behaviors that are the result of both genes and environments. Cultural psychologists often study relations between components of culture and individual phenotypes, with culture defined as a system of behaviors and cognitions shared by a community that informs values, goals, practices, traditions, and institutions; this knowledge is transmitted from one generation to the next (Cohen, 2009). The emerging field of cultural genomics combines these areas by studying the interplay of genomics, cultures, physical environments, and phenotypes (Causadias, Telzer, & Gonzales, 2018).
According to this framework, individual development is dynamically shaped by families and communities (e.g., race, religion, and social group), creating unique realities across cultures (Causadias et al., 2018). At the same time, person-level factors including genetic variation can influence the way individuals actively construct and interact with their own cultures and environments, with all of these factors mutually influencing each other over both developmental and evolutionary time. Genetic variation also enables humans to adapt to large geographic ranges and changes in the environment, with a genotype that may be disadvantageous in one environment potentially offering an advantage in another (e.g., the genetic mutation that causes sickle cell anemia is protective against malaria), and natural selection maintaining genetic variation when there is high variability in environments. Further, both the strength and the nature of genetic effects can vary considerably for individuals with similar genotypes based on factors including the physical environment and access to resources, social and cultural influences, and individual experiences (Shanahan & Hofer, 2005).
Transactions between genes and environments can take many forms (Shanahan & Hofer, 2005). Genetic influences on a phenotype may become a more important source of individual variation when socialization does not constrain expression of the phenotype (e.g., generational increases in the heritability of Swedish women's tobacco use; Kendler, Thornton, & Pedersen, 2000). Many heritable traits are not realized without environmental support, such that intelligence is on average both lower and less heritable in children growing up in extreme poverty in the United States (Turkheimer, Haley, Waldron, D'Onofrio, & Gottesman, 2003). Conversely, some genetic vulnerabilities may be mitigated by high levels of environmental support or enrichment even when they would manifest under typical conditions (Shanahan & Hofer, 2005), and some genetic variants may increase sensitivity to environmental stress (i.e., diathesis-stress; Rende & Plomin, 1992), or to both stressful and positive sociocultural factors as a whole (e.g., differential susceptibility; Ellis, Boyce, Belsky, Bakermans-Kranenburg, & van IJzendoorn, 2011), such that genetic risk manifests only as a function of the environment. Although behavioral genetics research has increasingly acknowledged the importance of gene–environment interplay and nonadditivity, research integrating insights from a cultural genomics perspective is still very limited.
The largest barrier to adopting a cultural genomics approach is study design and measurement. Behavior geneticists have adopted study designs with very large racially and ethnically homogeneous samples and extensive genotyping, but very limited phenotypic and environmental measurement. These behavior genetics designs are supported by outdated “medical models” that suggest phenotypes are directly caused by genetic mutations, and that this direct effect does not vary across physical and sociocultural environments. Conversely, cultural scientists have adopted study designs with small samples and rich environmental and phenotypic measurement, but no or very limited genotyping. These cultural designs are supported by models that suggest interplay across levels of analysis (Bronfenbrenner & Ceci, 1994), but the impact of children's own biology is discounted. These differences in study design and measurement, which stem from differences in perspective and the unique methodological challenges facing each field, have hindered interdisciplinary collaboration between cultural and behavior genetics scientists. For example, differences in allelic frequencies across cultural groups must be accounted for in genetics research to avoid spurious associations, making research with culturally diverse samples challenging. In addition, small genetic effects require large sample sizes for adequate power. However, with continuing advances in genotyping and genetic methodology, as well as the replication and extension of existing behavioral genetics research in more diverse samples with richer cultural measurement, the strengths of the two fields can be combined.
One major issue facing behavior genetics research today is the lack of racial and ethnic diversity in genetically informed studies, perhaps especially those taking a molecular genetics approach. Although cultural genomic theorists caution against using race and ethnicity as proxies for culture, they emphasize their importance as culturally linked social factors with the potential to shape individual and social-level experiences (e.g., identity formation and discrimination), and highlight the need to increase representation of racial and ethnic minorities in research (Causadias, 2013). Unfortunately, although matters have improved in recent years, molecular genetics research is still largely based on a narrow range of samples of European, and in particular, Northern European ancestry. A 2016 study found that 81% of the participants included in genome-wide association studies were of European ancestry, with much of the remaining 19% accounted for by participants of Asian ancestry living in Asian countries, and participants of African, Latino, and Native or Indigenous ancestry comprising less than 4% of the total (Popejoy & Fullerton, 2016). This lack of diversity in genetics research is a problem for several reasons. First, it is unfair to people of racial and ethnic minority groups themselves, as it will result in failure to detect and account for genetic risk factors that are more common or only present in these groups, or, just as important, differences in drug safety and treatment efficacy. Second, it is a wasted opportunity for elucidating genetic risk for disorders that do differ across populations, and runs the risk of filtering out novel variants that have strong effects on a phenotype but happen to be rare in European populations. Third, findings from samples of European and European American participants may not replicate in other racial and ethnic groups, and this nonreplication may be a sign of false-positive findings. As such, failure to address this bias not only will result in unequal distribution of the potential benefits of genetics research but also is detrimental to the quality of the research itself.
Improving the representation of racial and ethnic minorities in genetics research will require a concentrated effort to recruit and retain participants from backgrounds other than Northern European, including the use of culturally sensitive methods of recruitment and interaction with participants, as well as strategies such as oversampling or the use of multisite studies to recruit demographically representative samples of individuals from less well-represented groups (Oquendo et al., 2010). In addition, as noted earlier, accounting for genetic diversity across populations presents a statistical challenge for molecular genetics research. Addressing this challenge will require genotyping and imputation that are sensitive to variation in allelic frequencies and haplotypes across populations, as well as statistical methods of accounting for population stratification (Oquendo et al., 2010). The present Early Steps Multisite study was designed to recruit low-income racially/ethnically diverse families from urban, suburban, and rural regions of the United States, who also scored high on family or child risk factors, to better represent children at risk of developing psychopathology.
Developmental Cascades of Externalizing and Internalizing Psychopathology
Internalizing and externalizing symptoms are moderately to highly positively correlated across childhood, with support for a general shared genetic etiology, and more evidence of externalizing in childhood leading to later internalizing problems than vice versa. Specifically, McDonough-Caplan, Klein, and Beauchaine (2018) report that externalizing problems in childhood lead to comorbid internalizing/externalizing problems in adolescence, but internalizing problems in childhood do not increase risk of later externalizing problems. Similarly, Moilanen, Shaw, and Maxwell (2010) tested developmental cascades of internalizing and externalizing problems across childhood in low-income boys, controlling for moderate to high autoregressive associations. They reported that higher externalizing at ages 6 and 11 predicted higher internalizing at ages 8 and 12, respectively. There is also evidence of bidirectional influences between parental sensitivity and child psychopathology, with child externalizing problems affecting maternal sensitivity, and maternal sensitivity influencing later internalizing problems in females (Zvara, Sheppard, & Cox, 2018). Paternal sensitivity was reciprocally related to both internalizing and externalizing problems.
An intervention that focuses on increasing parents’ positive behavioral management, such as the Family Check-Up, is thought to decrease children's internalizing symptoms as well as externalizing problem behaviors because of shared risk factors, including the long-studied association between parenting and children's psychopathology. The present study focuses on children's internalizing symptoms, assessed with a structured clinical interview conducted in the home with the children in middle childhood. A recent meta-analysis concluded that harsh and psychological parental control predicted increases in internalizing symptoms, whereas authoritative parenting, autonomy granting, behavioral control, and warmth predicted decreases in internalizing symptoms across childhood, with some of these associations being bidirectional (Pinquart, 2017).
Parenting as a Target of Intervention
Perhaps the risk/protective factor that has received the most attention across cultures in the field of developmental psychopathology is parenting. Parenting is a proximal process impacting child psychopathology, mediating the effects of family socioeconomic adversity (Dodge, Pettit, & Bates, 1994), and parent psychopathology (Harold, Rice, Hay, Boivin, van Den Bree, & Thapar, 2011). Parenting, although partially genetically influenced, has been related to child behavior after accounting for genetic influences (Oliver, Trzaskowski, & Plomin, 2014). Research across cultures suggests that some factors, such as warm, positive parenting, are associated with positive child adjustment, whereas detached or abusive parenting is related to maladjustment (Smith, Knoble, Zerr, Dishion, & Stormshak, 2014). It is these factors that the Family Check-Up intervention aims to target, in a way that is sensitive to family and cultural differences.
The Family Check-Up intervention
The Family Check-Up was developed as an intervention framework that is flexible and adaptive to diverse cultural groups and is individually tailored to each family context (Dishion & Stormshak, 2007). The original purpose of the intervention was to reduce oppositional and aggressive behavior by improving positive parent management skills, especially during times of developmental transition (e.g., early adolescence or the “terrible twos”). However, it has also been shown to have positive effects on children's broader development, including increases in inhibitory control and verbal skills (Lunkenheimer et al., 2008) and decreases in symptoms of internalizing disorders (Shaw, Connell, Dishion, Wilson, & Gardner, 2009). The Family Check-Up has been tested in a series of randomized controlled trials from ages 2 to 18 years and found to be effective in the prevention of internalizing and externalizing problem behaviors (e.g., Connell & Dishion, 2017; Dishion et al., 2002, 2008; Stormshak et al., 2011; Stormshak, Fosco, & Dishion, 2010; Van Ryzin, Stormshak, & Dishion, 2012). The menu of service options fits within a variety of cultural frameworks, with a focus on contextual stressors and parental factors that may lead to the emergence and maintenance of child psychopathology based on coercive family processes (Patterson, Reid, & Dishion, 1992) and social learning theory.
In general practice, the Family Check-Up begins with an initial get-to-know-you interview (30–40 min), during which the family coach finds out about the parents’ strengths and challenges, their aspirations for their child, and their family values. This is followed by a brief home-based ecological assessment to formally assess caregiving and child and family functioning, the focus of which is tailored to the child's developmental status. Using data from the assessment and initial interview, the caregiver(s) and family coach then meet for a feedback session (1.25–1.5 hr), during which the family coach provides the parents with information on family and child functioning relative to normative data. In research studies such as the present one, the assessment precedes the initial interview to preserve blinding and to minimize potential bias resulting from the intervention group having a session prior to the assessment (i.e., both control and intervention families have the same assessment). Placing family stories at the core of family intervention is important for the majority of cultural groups in the United States (McGoldrick & Hardy, 2008), and during the initial interview, family coaches ask open-ended questions to foster a trusting relationship and give caregivers an opportunity to tell their family story. Caregivers are thus acknowledged as the respected authority on their children and their family, which is culturally congruent across diverse families. Within this collaborative framework, the family coach's questions illuminate contextual factors that contribute to children's mood and behavior problems (e.g., family roles or discrimination stress) and motivate the caregiver to change family management strategies. The feedback sessions also are adapted to focus on both parenting strengths and challenges within the cultural context. The extent to which the Family Check-Up is effective in improving parenting and children's adjustment across European American, African American, and Latino groups has been empirically evaluated, with participation in the Family Check-Up reducing antisocial behavior through reducing family conflict across all three groups (Smith et al., 2014).
Gene × Intervention interactions
There is wide variation in the efficacy of psychological preventive interventions such as the Family Check-Up, with response to treatment varying substantially between individuals, and genetics are likely to be one source of these individual differences. Understanding the role of genetics elucidates mechanisms underlying treatment response, and identifying genetic predictors of treatment response allows one to match treatment to individuals at the outset to improve outcomes, as in personalized medicine. Accordingly, a parenting-based intervention such as the Family Check-Up could work best with children who are genetically sensitive to the environment. Conversely, a more cognitively based intervention may work best with those who are less sensitive to contexts. In addition, randomizing participants to an intervention is one of the best ways to understand interactions between genetics and the physical and sociocultural environments. Testing gene–environment interaction in the context of random assignment to an intervention increases power because of the absence of gene–environment correlation (i.e., the extent to which the genetic score is correlated with the environmental exposure; Plomin, DeFries, & Loehlin, 1977). Gene–environment correlation is widespread, as individuals live in environments that are partially created by kin (passive gene–environment correlation). In addition, they evoke differential responses from others based on their heritable characteristics (evocative gene–environment correlation), and they niche pick, or actively seek out environments that match their heritable traits (active gene–environment correlation). Experimental designs with random assignment of children and families to treatment and control conditions eliminate confounding due to gene–environment correlation, affording stronger causal inference. To that end, Bakermans-Kranenburg and van IJzendoorn (2015) suggest that studies of genetic moderation of intervention effects have much higher statistical power than correlational studies, which require up to 13 times more participants to reach similar levels of power.
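As a generic illustration of what genetic moderation of an intervention effect means statistically (a textbook moderation model written under our own notation, not necessarily the model estimated in the present study), let $Y_i$ be the outcome for child $i$, $T_i$ an indicator of random assignment to the intervention, and $G_i$ the child's genetic score:

```latex
Y_i = \beta_0 + \beta_1 T_i + \beta_2 G_i + \beta_3 \,(T_i \times G_i) + \varepsilon_i
```

In this sketch, $\beta_3$ captures the Gene × Intervention interaction, that is, how much the intervention effect changes per unit of the genetic score. Because $T_i$ is randomly assigned, $T_i$ and $G_i$ are uncorrelated in expectation, which is the absence of gene–environment correlation described above.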
Molecular Behavior Genetics Methodology
Most of the human genome does not vary across individuals and thus does not contribute to individual differences. However, dbSNP (www.ncbi.nlm.nih.gov/projects/SNP/) contains over 12 million single nucleotide polymorphisms (SNPs) that vary across individuals, with some differences in allele (i.e., specific genetic variant) frequencies across cultural groups. With 23 paired chromosomes (1 from each parent), each individual has two copies of each strand of DNA, and thus carries 0, 1, or 2 copies of a particular allele at a SNP (e.g., AA, AG, or GG nucleotides make up the genotype at one SNP across individuals). Early molecular genetics research focused on examining relations between single genetic polymorphisms in candidate genes, chosen for their theoretical relevance, and outcomes such as psychopathology (see Bakermans-Kranenburg & van IJzendoorn, 2011; van IJzendoorn, Belsky, & Bakermans-Kranenburg, 2012, for meta-analyses). However, this approach has multiple limitations. Genetic influences of single common genetic variants on complex psychological traits are likely to be small, especially as psychopathology is biologically distal from an intra- or intercellular genetic effect. As a consequence, most candidate gene research has been underpowered to detect true effects. The lack of power, combined with multiple testing across studies and the file drawer effect (i.e., studies with nonsignificant findings are likely not published), has led to a high false-positive rate in the field as a whole (Ioannidis, 2005).
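The 0, 1, or 2 coding described above can be made concrete with a minimal sketch. The function name `allele_count` and the choice of G as the counted allele are illustrative assumptions, not part of any genotyping software.

```python
def allele_count(genotype: str, counted_allele: str) -> int:
    """Return how many copies (0, 1, or 2) of counted_allele a genotype carries."""
    if len(genotype) != 2:
        raise ValueError("expected a two-letter genotype such as 'AG'")
    return genotype.count(counted_allele)


# The three possible genotypes at the example SNP, counting the G allele
# (which allele is counted is an arbitrary choice made for illustration).
genotypes = ["AA", "AG", "GG"]
dosages = [allele_count(g, "G") for g in genotypes]
```

Each individual is thus represented at each SNP by a single number, which is what allows SNP effects to be estimated by regression and later summed into a score.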
To address the high false-positive rate, behavior genetics research has turned increasingly to the genome-wide association study, a hypothesis-free analysis of the additive predictive power of each individual SNP across the genome on a particular outcome (i.e., presence or absence of a diagnosis, or a composite of symptoms). Because so many statistical tests are performed in a genome-wide association study, the significance threshold is typically set at a stringent 5 × 10⁻⁸ (p < .00000005) to statistically correct for the number of tests, rather than the standard 5 × 10⁻² (p < .05). Because we expect very small effect sizes for individual SNPs, using a polygenic score to aggregate the small effects of SNPs identified in a discovery genome-wide association study has utility. Typically, results from a discovery genome-wide association study are used to identify a large group of SNPs based on their individual association with the outcome of interest, using discovery genome-wide association study cutoffs of p < .001, p < .01, p < .05, p < .1, and p < .5, or all SNPs studied. The so-called risk alleles are then weighted based on their effect size in the discovery genome-wide association study and summed to produce a quantitative polygenic score for use in an independent validation sample. However, like candidate gene research, the genome-wide association study has its limitations, some of which make it challenging to combine with a cultural approach. For example, a genome-wide association study is designed to detect small main effects of SNPs, without regard for interaction with the environment, whereas many models of gene–environment interplay (e.g., diathesis-stress and differential susceptibility) posit that some genetic variants are associated with an outcome only in particular environmental contexts. In addition, the hypothesis-free multiple testing and small effects lead to both high Type I and Type II error, and aggregation into polygenic scores mitigates but does not eliminate the false-positive problems. Thus, the findings from discovery genome-wide association studies should be taken as a first step on which other research can draw to more clearly elucidate biological and sociocultural pathways. In addition, although genome-wide association studies are designed for the detection of small additive effects, innovative research designs such as the prediction of identical twin differences (Keers et al., 2016) make it possible to address gene–environment interplay within a genome-wide association study framework.
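The weighting-and-summing step described above can be sketched as follows. This is a minimal illustration, not the pipeline used in any cited study: the SNP identifiers, effect sizes, and p values are invented, and the helper `polygenic_score` is hypothetical. The first line also shows why 5 × 10⁻⁸ is the conventional threshold, under the common assumption of roughly one million independent tests.

```python
from typing import Dict, NamedTuple

# Conventional genome-wide threshold: .05 corrected for about one million
# independent tests (the one-million figure is an assumed approximation).
genome_wide_alpha = 0.05 / 1_000_000


class SnpResult(NamedTuple):
    beta: float      # effect size of the counted allele in the discovery GWAS
    p_value: float   # p value of that SNP in the discovery GWAS


def polygenic_score(dosages: Dict[str, int],
                    discovery: Dict[str, SnpResult],
                    p_threshold: float) -> float:
    """Sum allele counts weighted by discovery effect sizes for SNPs below p_threshold."""
    score = 0.0
    for snp, result in discovery.items():
        # Only SNPs that pass the cutoff and were genotyped contribute.
        if result.p_value < p_threshold and snp in dosages:
            score += result.beta * dosages[snp]
    return score


# Invented discovery results and one child's allele counts (0, 1, or 2).
discovery = {
    "snp_a": SnpResult(beta=0.10, p_value=0.0005),
    "snp_b": SnpResult(beta=-0.05, p_value=0.03),
    "snp_c": SnpResult(beta=0.02, p_value=0.40),
}
child = {"snp_a": 2, "snp_b": 1, "snp_c": 0}

# One score per cutoff: more liberal cutoffs admit more SNPs.
scores = {p: polygenic_score(child, discovery, p) for p in (0.001, 0.05, 0.5)}
```

Computing the score at several cutoffs, as in the last line, mirrors the practice of reporting results across multiple thresholds rather than committing to a single one in advance.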
Empirical Support of Genetic Moderation of Intervention Effects
As with gene–environment interplay as a whole, most research testing genetic moderation of randomized intervention has been limited to studies of a few candidate genes (e.g., serotonin transporter gene, serotonin receptors, dopamine transporter gene, and dopamine receptors). One meta-analysis of more than 20 experiments supports genetic moderation of the effect of family-based intervention on externalizing problems, but findings for internalizing were more equivocal (Bakermans-Kranenburg & van IJzendoorn, 2015), and effect sizes are likely inflated by the combination of underpowered individual studies and publication bias. Nevertheless, to the extent that these findings prove replicable in adequately powered samples, genetic moderation of intervention is monumental in that it suggests that the effects of early intervention are likely underestimated or go undetected for children whose genotypes support higher environmental sensitivity.
To date, there are only two studies that consider moderation of intervention effects utilizing polygenic scores based on findings from a previous genome-wide association study rather than focusing on one or a few candidate genes (Keers et al., 2016; Musci et al., 2015, 2018). Musci et al. found a Polygenic Risk × Intervention interaction predicting age of first tobacco use and age of first marijuana use in a sample of 556 primarily African American individuals followed from sixth grade to age 18 (Musci et al., 2015, 2018). Specifically, Musci et al. report that a classroom-based behavioral intervention targeting aggressive and disruptive behavior was most strongly associated with later onset of smoking and marijuana use for individuals scoring high on a polygenic score based on 12,058 SNPs previously associated with smoking cessation and lower substance use (alcohol, marijuana, and tobacco) in adults.
Rather than focus on risk, Keers et al. (2016) for the first time formed a polygenic sensitivity to the environment score from SNPs that predicted identical twin differences in childhood emotional problems in a discovery genome-wide association study, and found that it moderated the effects of cognitive behavioral therapy on emotional disorders in children. The sample of twins was 93% White, which is representative of the population of the United Kingdom (Haworth, Davis, & Plomin, 2013). It is this research that we draw on in the present study, with the goal of forming a polygenic score that includes SNPs that differentiate individuals who are more or less sensitive to the physical and sociocultural environments.
Polygenic Sensitivity Score Based on Identical Twin Differences
Unlike other genome-wide association studies that aim to detect genetic variants associated with main effects on a phenotype, Keers et al. (2016) took advantage of the unique nature of an identical twin sample to pinpoint genetic variants associated with sensitivity to the environment itself. Because identical twins are genetically identical and share the same family environment, all differences between identical twins in a pair are due to environmental factors not shared between twin siblings, controlling for all genetic main effects. Identical twin differences can result not only from environmental main effects but also from interaction between genetic factors and the nonshared environment, such that genes that increase sensitivity to broad or specific environmental factors will also act to increase differences between identical twins whose experiences of those environments differ. Using this logic, Keers et al. (2016) predicted identical twin differences in emotional disorder symptoms at age 12 as the outcome of a discovery genome-wide association study, allowing for the identification of SNPs associated with sensitivity to the environment while simultaneously controlling for all other genetic and shared environmental effects.
The discovery sample included 1,026 identical twin pairs from the United Kingdom who participated in the Twins Early Development Study. At 12 years of age, the Strengths and Difficulties Questionnaire was given to parents and children, and a mean composite of the standardized emotional symptoms scale was used to index emotional problems. Identical twin differences in emotional problems were operationally defined as the absolute difference in scores between cotwins, after regressing out age, sex, and the twin pair's mean score on emotional symptoms. Linear regressions using the software PLINK were conducted, covarying the first 10 principal components representing population admixture, and p values were obtained for each SNP examined.
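The twin-difference phenotype described above can be illustrated with a minimal sketch. It assumes that the absolute cotwin difference is computed first and then residualized on age, sex, and the pair mean with ordinary least squares; the function name, variable names, and simulated data are hypothetical and are not taken from the discovery study.

```python
import numpy as np

def twin_difference_phenotype(score_a, score_b, age, sex):
    """Absolute cotwin difference with age, sex, and pair mean regressed out."""
    score_a = np.asarray(score_a, dtype=float)
    score_b = np.asarray(score_b, dtype=float)
    abs_diff = np.abs(score_a - score_b)
    pair_mean = (score_a + score_b) / 2.0
    # Design matrix: intercept, age, sex, and the pair's mean symptom score.
    X = np.column_stack([np.ones_like(abs_diff), age, sex, pair_mean])
    beta, *_ = np.linalg.lstsq(X, abs_diff, rcond=None)
    # The residual is the part of the difference not explained by the covariates.
    return abs_diff - X @ beta

# Simulated (hypothetical) data for 200 identical twin pairs.
rng = np.random.default_rng(0)
n_pairs = 200
age = rng.normal(12.0, 0.3, n_pairs)
sex = rng.integers(0, 2, n_pairs)
twin_a = rng.normal(0.0, 1.0, n_pairs)
twin_b = twin_a + rng.normal(0.0, 0.5, n_pairs)
phenotype = twin_difference_phenotype(twin_a, twin_b, age, sex)
```

Each pair contributes one residualized difference score, which would then serve as the outcome in the SNP-by-SNP regressions.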
Based on this discovery genome-wide association study, a polygenic sensitivity score was formed in a separate Twins Early Development Study sample of 1,406 individuals to examine polygenic moderation of parenting effects on children's emotional problems. As is standard practice, multiple significance thresholds were used to create eight polygenic sensitivity scores from a total of 155,019 SNPs (after linkage disequilibrium pruning, or dropping some SNPs because they were highly correlated with others) at p < .001 (n = 400 SNPs), p < .01 (n = 3,161), p < .05 (n = 13,632), p < .1 (n = 25,384), p < .2 (n = 46,752), p < .3 (n = 66,205), p < .4 (n = 84,025), and p < .5 (n = 100,111). Polygenic sensitivity moderated the influence of parenting on emotional problems, with five of the eight polygenic scores formed (p < .1 through p < .5) yielding significant interactions. For those with higher polygenic sensitivity, positive parenting was associated with fewer emotional problems and negative parenting was associated with more emotional problems. At lower levels of polygenic sensitivity, parenting was unrelated to emotional problems.
Next, with the Genes for Treatment sample, a polygenic sensitivity score was formed in the same fashion and was used to predict response to cognitive behavioral therapy in children with anxiety disorders (Keers et al., 2016). Genes for Treatment includes 973 children who met DSM-IV criteria for a primary diagnosis of an anxiety disorder and were genotyped. After linkage disequilibrium pruning, there were 72,375 SNPs in common between the Twins Early Development Study discovery sample and Genes for Treatment, so the polygenic sensitivity scores contained fewer SNPs. The polygenic scores at thresholds of p < .05 and above significantly moderated treatment response such that individual cognitive behavioral therapy (compared to group cognitive behavioral therapy or brief parent-led cognitive behavioral therapy) had a larger effect for children with higher polygenic sensitivity scores, although there was no moderation of overall treatment response.
The Present Study
We formed polygenic sensitivity scores based on Keers et al.’s (2016) discovery genome-wide association study results associating SNPs with identical twin differences in emotional problems, and then used these scores to examine genetic moderation of intervention effects in a culturally diverse US sample at high economic and family risk. The overarching goal was to examine whether a genetic index of environmental sensitivity moderated the effects of the Family Check-Up intervention on symptoms of internalizing psychopathology in middle childhood. The sample was the Early Steps Multisite Study, a large randomized controlled trial of the Family Check-Up in early childhood that was designed to reduce aggressive and oppositional behavior in early and middle childhood by increasing positive parenting (Dishion et al., 2014; Shaw et al., 2017). Children's psychopathology was assessed with a broad index of symptoms and diagnoses of internalizing disorders with a structured clinical interview. Advances in nosology suggest that psychopathology can be parsed into broad dimensions rather than disorder-specific categories (Beauchaine & Cicchetti, 2016), and findings from twin studies have suggested that a shared latent genetic factor accounts for comorbidity and co-occurrence within and between anxiety disorders and depression (Middeldorp, Cath, Van Dyck, & Boomsma, 2005). Thus, we initially tested genetic association using a sum of all symptoms across the internalizing diagnostic categories, rather than focusing on diagnoses or their symptom clusters as distinct analyses. Our first hypothesis was that children with higher polygenic sensitivity would have larger intervention effects.
Children who scored higher on polygenic sensitivity and were in the intervention condition would have fewer internalizing symptoms than those who had higher polygenic sensitivity and were randomly assigned to the control condition. We also tested Polygenic Score × Intervention effects on “presence versus absence of any internalizing diagnosis” as an outcome, but we did not expect significant effects given the low rates of diagnoses at age 10 years. Because our sample was at high economic and family risk, our second hypothesis was that those randomly assigned to the control group who scored higher on genetic sensitivity to the environment would have more internalizing symptoms than those lower on polygenic sensitivity.
Method
Participants in the Early Steps Multisite Study
Seven hundred and thirty-one culturally diverse, low-income families with 2-year-old children were recruited between 2002 and 2003 from the Special Supplemental Nutrition Program for Women, Infants, and Children in Eugene, Oregon (suburban), within and outside Charlottesville, Virginia (rural), and in metropolitan Pittsburgh, Pennsylvania (urban). Screening procedures were used to recruit families of toddlers at high risk for conduct problems. Recruitment risk criteria were defined as 1 SD above the mean on screening measures in at least two of the following three domains: (a) sociodemographic risk (low education achievement, defined as less than or equal to a mean of 2 years of post-high school education between parents, and low family income using the Special Supplemental Nutrition Program for Women, Infants, and Children criterion); (b) primary caregiver risk (maternal depression: Center for Epidemiological Studies on Depression Scale; Radloff, 1977; daily parenting challenges: Parenting Daily Hassles; Crnic & Greenberg, 1990; or self-report of substance or mental health diagnosis, or adolescent parent at birth of first child); and (c) toddler behavior problems (conduct problems: Eyberg Child Behavior Inventory; Robinson, Eyberg, & Ross, 1980; or high-conflict relationships with adults: Adult Child Relationship Scale; adapted from Pianta, Steinberg, & Rollins, 1995). Participation rates were high across the three sites (83.2% total [49% female]; 84% in Eugene [n = 271], 76% in Charlottesville [n = 188], and 88% in Pittsburgh [n = 272]). Primary caregivers (97% mothers) self-identified as belonging to the following ethnic groups: 13% Latino, 28% African American, 50% European American, 13% biracial, and 9% other groups (e.g., Native American or Asian American).
More than two-thirds of the families reported an annual income of less than $20,000, with 24% of primary caregivers having less than a high school education, 41% having a high school diploma or general education diploma (GED), and an additional 32% having 1–2 years of post-high school education. For more information about sample characteristics, see Dishion et al. (2008).
Families were randomly assigned to the control condition or the intervention condition after the baseline assessment at child age 2 years. Those in the control condition received Special Supplemental Nutrition Program for Women, Infants, and Children services as usual. Those in the intervention condition received services implementing the Family Check-Up. The Family Check-Up is composed of three sessions: assessment, where research staff and parents completed questionnaires about the child's behavior and family factors, and parents and children were videotaped while taking part in tasks that varied in terms of stress level (e.g., free play vs. clean-up task); initial interview, where intervention staff and parents discussed their child's strengths and challenges as well as aspirations the parents had for their child; and feedback, where intervention staff provided feedback to the parents based on the assessment and initial interview, and encouraged reflection on behavioral change and engagement in further intervention services. All families were recontacted at child age 3, 4, 5, 7.5, 8.5, 9.5, and 10.5 years for home-based assessments, with intervention families also being offered the same Family Check-Up services through age 10.5. In terms of engagement, 76% of families engaged at age 2, with over 90% of the families engaging in at least one Family Check-Up by child age 5. Families also were seen at their homes at youth age 14, with 81% participating.
Adolescents who were genotyped at age 14 years (n = 515, or 86.7% of the sample who participated in home visits at age 14) make up the sample for the current study. These adolescents were 50% female and belonged to the following racial/ethnic groups: 10% Latino, 30% African American, 48% European American, 5% Native American, Native Hawaiian or Pacific Islander, 1% Asian American, and 6% other race or unknown race. Selective attrition analyses revealed no significant differences between members of the initial sample who did not give a saliva sample for genotyping at age 14 and those who did give a saliva sample with respect to parental education (high school diploma vs. no high school diploma), χ²(1) = 0.40, p = .53; minority racial status (Black vs. non-Black), χ²(1) = 2.73, p = .10; gender (male vs. female), χ²(1) = 0.45, p = .50; intervention status (control vs. Family Check-Up), χ²(1) = 0.023, p = .88; study site (Pennsylvania vs. non-Pennsylvania), χ²(1) = 2.27, p = .13, and (Virginia vs. non-Virginia), χ²(1) = 1.02, p = .31; parental depression (assessed at child age 2 before the intervention), t(590) = –0.003, p = .998; child behavioral inhibition (assessed at child age 2), t(562) = –0.99, p = .32; and child conduct problems (assessed at child age 2), t(591) = –1.36, p = .17.
Procedure
The computerized self-report version of the National Institute of Mental Health Diagnostic Interview Schedule for Children-IV was administered to the child using a laptop computer at the age 10.5 years home visit (but not at other ages). Interviewers underwent several days of formal training with certified DISC-IV administrators. The interview took 30–45 min to complete with select modules, sometimes longer if many symptoms were endorsed.
During the age 14 home visit, participants provided saliva samples with Oragene kits for genotyping. RUCDR Infinite Biologics at Rutgers University extracted and normalized the DNA, and then genotyped the samples using the Affymetrix Axiom Biobank1 Array. Any SNP or individual with a missing data rate greater than or equal to 5% was removed (no participants met this criterion), and any SNP with a minor allele frequency less than 1% was removed. SNPs not in Hardy–Weinberg equilibrium at p < 10⁻⁶ were also removed. To reduce correlation among the SNPs, or linkage disequilibrium, we did not impute the data; we screened out regions of long-range linkage disequilibrium, as well as local linkage disequilibrium, using the software PLINK's sliding window procedure.
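The quality control thresholds above (missingness of 5% or more, minor allele frequency below 1%, Hardy–Weinberg p < 10⁻⁶) can be sketched as follows. This is an illustration only and does not reproduce the PLINK procedure: genotypes are assumed to be coded 0, 1, or 2 with missing calls as NaN, the Hardy–Weinberg check uses a 1-df chi-square approximation in place of an exact test, the order in which the filters are applied is an assumption, and all function names and toy data are hypothetical.

```python
import math
import numpy as np

def hwe_chi2_pvalue(n_aa, n_ab, n_bb):
    """p value from a 1-df chi-square test of Hardy-Weinberg proportions."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        return 1.0
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p
    expected = [n * p * p, 2 * n * p * q, n * q * q]
    if min(expected) == 0:  # monomorphic SNP: the test is undefined
        return 1.0
    chi2 = sum((o - e) ** 2 / e for o, e in zip((n_aa, n_ab, n_bb), expected))
    # Survival function of a chi-square variable with 1 degree of freedom.
    return math.erfc(math.sqrt(chi2 / 2.0))

def qc_filter(genotypes, max_missing=0.05, min_maf=0.01, hwe_alpha=1e-6):
    """Return boolean keep-masks for individuals (rows) and SNPs (columns)."""
    g = np.asarray(genotypes, dtype=float)
    keep_snp = np.isnan(g).mean(axis=0) < max_missing
    # Individual missingness is computed over SNPs that passed the first filter.
    keep_ind = np.isnan(g[:, keep_snp]).mean(axis=1) < max_missing
    g_kept = g[keep_ind]
    for j in np.flatnonzero(keep_snp):
        calls = g_kept[:, j]
        calls = calls[~np.isnan(calls)]
        if calls.size == 0:
            keep_snp[j] = False
            continue
        freq = calls.sum() / (2 * calls.size)
        maf = min(freq, 1.0 - freq)
        counts = [int((calls == k).sum()) for k in (0, 1, 2)]
        if maf < min_maf or hwe_chi2_pvalue(*counts) < hwe_alpha:
            keep_snp[j] = False
    return keep_ind, keep_snp

# Toy data: 200 individuals and four SNPs with known properties.
base = np.repeat([0.0, 1.0, 2.0], [50, 100, 50])  # exact Hardy-Weinberg proportions
with_missing = base.copy()
with_missing[:20] = np.nan                         # 10% missing calls
toy = np.column_stack([base, np.zeros(200), np.ones(200), with_missing])
keep_ind, keep_snp = qc_filter(toy)
```

In the toy data only the first SNP survives: the second is monomorphic, the third consists entirely of heterozygotes, and the fourth exceeds the missingness threshold.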
Measures
Diagnostic Interview Schedule for Children (Shaffer, Fisher, Lucas, Dulcan, & Schwab-Stone, 2000; Shaffer, Fisher, Lucas, & NIMH DISC Editorial Board, 1998)
The National Institute of Mental Health Diagnostic Interview Schedule for Children-IV is a structured psychiatric interview for children age 6 years and older. Child responses are Yes or No for most questions, and follow-up questions are determined by previous answers in the module. The interrater reliability (r = .93) and test–retest reliability (r = .64) of the past-year diagnoses have been well established. Furthermore, the Diagnostic Interview Schedule for Children-IV showed moderate validity when compared to diagnoses generated from symptom ratings made after a clinical-style interview (κ = .52). The following seven internalizing modules were selected for use based on age-relevant disorders that corresponded to the research foci: generalized anxiety disorder, separation anxiety disorder, social anxiety disorder, specific phobia, obsessive–compulsive disorder, major depression, and manic disorder. A sum score representing total number of symptoms across modules was computed with an α reliability of 0.83 and was used in primary analyses. Follow-up analyses used disorder-specific symptom counts.
Polygenic sensitivity to the environment score
We formed polygenic scores based on the Twins Early Development Study discovery genome-wide association study indexing genetic influences on environmental sensitivity by predicting identical twin differences in emotional problems (Keers et al., 2016). After quality control, 318,549 SNPs remained in our data, with 53,010 present in both the discovery sample summary statistics and our data. We then filtered out synonymous SNPs, resulting in 51,102 SNPs, and used PLINK's clumping procedure to account for nonindependence among the SNPs (threshold of r² = .1 and 250 kb), resulting in 36,246 independent SNPs. We formed polygenic sensitivity scores with p value thresholds of .001, .01, .05, and .10, unit-weighting each SNP. We chose not to form a score based on p < .50 because liberal scores often have high overlap with population admixture. The polygenic sensitivity scores contained 47 SNPs for p = .001; 503 SNPs for p = .01; 2,372 SNPs for p = .05; and 4,606 SNPs for p = .10.
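As an illustration of unit-weighted scoring at several p value thresholds, the sketch below computes each score as the proportion of sensitivity alleles among the SNPs that fall below a threshold. It assumes that clumping has already been done, that each genotype is coded as the count (0, 1, or 2) of the allele associated with larger twin differences in the discovery study, and that the threshold comparison is strict; the function name and the toy values are hypothetical.

```python
import numpy as np

def polygenic_scores(allele_counts, p_values, thresholds=(0.001, 0.01, 0.05, 0.10)):
    """Unit-weighted scores: proportion of sensitivity alleles per threshold."""
    counts = np.asarray(allele_counts, dtype=float)  # individuals x SNPs
    p_values = np.asarray(p_values, dtype=float)     # one discovery p value per SNP
    scores = {}
    for t in thresholds:
        selected = p_values < t
        n_selected = int(selected.sum())
        if n_selected == 0:
            scores[t] = np.full(counts.shape[0], np.nan)
            continue
        # Each SNP contributes equally; 2 alleles per SNP set the maximum.
        scores[t] = counts[:, selected].sum(axis=1) / (2.0 * n_selected)
    return scores

# Toy example: 3 individuals, 4 SNPs, hypothetical discovery p values.
toy_counts = [[2, 0, 1, 1],
              [0, 0, 0, 2],
              [1, 1, 2, 0]]
toy_p = [0.0005, 0.03, 0.008, 0.2]
scores = polygenic_scores(toy_counts, toy_p)
```

More liberal thresholds add SNPs to the score, so scores at adjacent thresholds share many SNPs and tend to be correlated with each other.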
Population admixture
We conducted a principal components analysis of all autosomal SNPs across all participants in the sample to represent population admixture using PLINK. We extracted the first 20 components, with the first component (PC1) having an eigenvalue of 28.84 and largely differentiating European American, Latino, and Native American groups from African American groups, with biracial participants falling in the middle. The second component (PC2) had an eigenvalue of 5.62 and largely differentiated non-Latino participants (European and African American) from Latino and Native American participants. The remaining components had eigenvalues ranging from 1.45 to 1.21 and were excluded from further analyses. Every participant had a score on each principal component and thus his or her genetic race/ethnicity was represented when controlling for population admixture.
Covariates
Covariates in all models were age in months (M = 128.560, SD = 3.362, mean centered prior to analysis), gender (females = 0, males = 1; M = 0.509, SD = 0.500), family monthly income (M = $2,456.110, SD = $1,594.626, Z-scored), study site location (Eugene and Charlottesville compared to Pittsburgh, indexed with two dummy codes), and the first two ancestry principal components, PC1 and PC2.
Statistical approach
We examined whether the interaction between intervention status and polygenic score was related to Diagnostic Interview Schedule for Children-IV symptoms and diagnoses using Mplus software version 7.4 (Muthén & Muthén, 1998–2015). Outcome variables were counts, but they were normally distributed and not zero inflated, and thus we used ordinary regression. Polygenic scores and the first two ancestry principal components (PC1 and PC2) were Z-scored prior to running models, and covariates were Z-scored or mean centered when appropriate. Initial models included the main effects of all covariates, two-way interactions between polygenic score and age, sex, and income (i.e., best practices for testing Gene × Environment interaction; Keller, 2014), the main effects of polygenic score and intervention, and a product term representing the interaction between polygenic score and intervention entered last. Main effects of all covariates were retained in final models regardless of significance, but two-way interactions between polygenic score and covariates were trimmed if they were nonsignificant.
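The structure of the final moderation model (covariate main effects, polygenic score, intervention status, and their product term) can be sketched with ordinary least squares. This is not the Mplus analysis: it assumes complete data rather than full information maximum likelihood estimation, and the function name, simulated data, and generating coefficients are hypothetical.

```python
import numpy as np

def fit_moderation(y, pgs, intervention, covariates):
    """OLS with a Polygenic Score x Intervention product term entered last."""
    pgs_z = (pgs - pgs.mean()) / pgs.std(ddof=1)  # Z-score the polygenic score
    X = np.column_stack([
        np.ones_like(pgs_z),   # intercept
        covariates,            # covariate main effects
        pgs_z,                 # polygenic score main effect
        intervention,          # 0 = control, 1 = intervention
        pgs_z * intervention,  # interaction (product) term
    ])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    dof = X.shape[0] - X.shape[1]
    sigma2 = resid @ resid / dof
    cov_beta = sigma2 * np.linalg.inv(X.T @ X)  # sampling covariance of beta
    return beta, cov_beta

# Simulated data in which the score predicts symptoms only in the control group.
rng = np.random.default_rng(42)
n = 500
pgs = rng.normal(size=n)
intervention = rng.integers(0, 2, size=n).astype(float)
age_c = rng.normal(size=n)  # mean-centered covariate
y = 20 + 0.5 * age_c + 2.0 * pgs - 1.5 * pgs * intervention + rng.normal(size=n)
beta, cov_beta = fit_moderation(y, pgs, intervention, age_c)
```

The last element of `beta` estimates the interaction, and the entries of `cov_beta` for the intervention and product terms are the quantities needed to probe the interaction.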
We ran four regression models examining Polygenic × Intervention interaction in relation to total symptoms for each of the four polygenic scores (p = .001, p = .01, p = .05, and p = .10). When interaction models were significant, we followed up by testing individual Diagnostic Interview Schedule for Children-IV symptom scales to see which carried the effect. Finally, we used logistic regression in Mplus with robust maximum likelihood estimation and Monte Carlo integration to examine whether significant interactions held when predicting diagnoses rather than symptoms (0 = no diagnosis, 1 = at least one diagnosis on any Diagnostic Interview Schedule for Children-IV scale). Regions of significance and simple slopes for all significant interactions were calculated using Preacher, Curran, and Bauer's (2006) online utility for probing interaction effects.
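The logic behind regions of significance can be sketched as follows. The group difference at a polygenic score value z is b_int + b_prod × z, with sampling variance var_int + 2z × cov + z² × var_prod, and the boundaries are the values of z at which this difference equals the critical value times its standard error. The code is not the online utility itself; the critical value of 1.96 and all numeric inputs are hypothetical and are not estimates from this study.

```python
import math

def conditional_effect(z, b_int, b_prod, var_int, var_prod, cov_int_prod):
    """Intervention effect and its standard error at polygenic score z."""
    estimate = b_int + b_prod * z
    se = math.sqrt(var_int + 2 * z * cov_int_prod + z * z * var_prod)
    return estimate, se

def regions_of_significance(b_int, b_prod, var_int, var_prod, cov_int_prod,
                            t_crit=1.96):
    """Moderator values where the intervention effect is exactly at t_crit."""
    # Solve (b_int + b_prod*z)^2 = t_crit^2 * Var(b_int + b_prod*z) for z.
    a = b_prod ** 2 - t_crit ** 2 * var_prod
    b = 2 * (b_int * b_prod - t_crit ** 2 * cov_int_prod)
    c = b_int ** 2 - t_crit ** 2 * var_int
    disc = b * b - 4 * a * c
    if a == 0 or disc < 0:
        return None  # no finite pair of boundaries
    root = math.sqrt(disc)
    return tuple(sorted(((-b - root) / (2 * a), (-b + root) / (2 * a))))

# Hypothetical estimates: no average group difference, negative interaction.
bounds = regions_of_significance(b_int=0.0, b_prod=-2.0, var_int=1.0,
                                 var_prod=0.25, cov_int_prod=0.0, t_crit=2.0)
lower, upper = bounds
effect, se = conditional_effect(upper, 0.0, -2.0, 1.0, 0.25, 0.0)
```

With these inputs the boundaries fall at about ±1.155 SD, and the groups differ significantly only outside them, which is the pattern reported when both an upper and a lower bound exist.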
Missing data
Full information maximum likelihood estimation was used to handle missing data. All 515 individuals in the genotyped sample had complete data for intervention status, polygenic score, PC1 and PC2, gender, and study site. Because data were complete for both variables used in the product term, concerns regarding centering decisions when using full information maximum likelihood estimation with product terms do not apply.
Results
Descriptive statistics and correlations
Descriptive statistics for study variables are presented in Table 1. Skew did not exceed ±2.00 and kurtosis did not exceed ±7.00 (Curran, West, & Finch, 1996) for all variables except PC2, which was transformed by adding a constant equal to 1 plus the absolute value of the minimum to ensure that all values were positive, then square root transforming prior to Z-scoring. Total symptoms ranged from 0 to 64 symptoms, out of a possible 88 symptoms. At the scale level, the maximum number of symptoms ranged from 6 for obsessive–compulsive disorder (out of a possible 7) to 21 for a major depressive episode (out of a possible 22). Diagnoses at the scale level were infrequent and are presented in Table 1 for descriptive purposes. Out of 418 children with Diagnostic Interview Schedule for Children-IV data, 100 had at least one diagnosis (23.9%). Of these 100 individuals, 74 had only one diagnosis (most commonly specific phobia, followed by obsessive–compulsive disorder), 21 had two diagnoses, 4 had three diagnoses, and 1 had four diagnoses.
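The transformation applied to PC2 can be sketched in a few lines. The sketch assumes that the added constant is 1 plus the absolute value of the minimum, so that the smallest shifted value is at least 1; the function names and example values are hypothetical.

```python
import numpy as np

def shift_sqrt(x):
    """Shift values to be positive, then take the square root."""
    x = np.asarray(x, dtype=float)
    shifted = x + 1.0 + abs(x.min())  # smallest shifted value is at least 1
    return np.sqrt(shifted)

def z_score(x):
    """Standardize to mean 0 and sample standard deviation 1."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std(ddof=1)

# Hypothetical right-skewed component scores.
raw = np.array([-3.0, 0.0, 1.0, 12.0])
rooted = shift_sqrt(raw)
pc2_z = z_score(rooted)
```

The square root compresses the long upper tail while preserving the rank order of participants on the component.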
Table 1. Descriptive statistics and diagnoses
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20230113060854726-0997:S095457941800127X:S095457941800127X_tab1.gif?pub-status=live)
Note: Intervention is coded 0 = control, 1 = intervention. Polygenic scores are calculated as proportion of environmental sensitivity alleles below a particular p threshold. Ancestry PC 1 and 2 are principal components accounting for genetic variation due to race/ethnicity.
Correlations among study variables were computed in Mplus using full information maximum likelihood estimation, with key correlations presented in Table 2, and others presented in text. Intervention status was uncorrelated with polygenic score, ancestry PCs, or other covariates, as expected based on random assignment, but also uncorrelated with total symptoms and all Diagnostic Interview Schedule for Children—IV symptom scales. Correlations between covariates and symptoms were modest and typically nonsignificant, although individuals with higher scores on PC1 (largely those with African American ancestry) showed significantly more symptoms of general anxiety (r = .11, p < .05), obsessive–compulsive disorder (r = .19, p < .01), and specific phobia (r = .12, p < .05). Males reported fewer social anxiety (r = –.11, p < .05) and specific phobia symptoms than females (r = –.19, p < .01), and older children reported fewer symptoms of separation anxiety (r = –.10, p < .05) and specific phobia (r = –.17, p < .01). As expected, all symptom scales were significantly and moderately to highly correlated with each other, although specific phobia symptoms were only modestly related to obsessive–compulsive, major depressive, and manic symptoms. Of note, polygenic scores across the different p thresholds were modestly to highly correlated with each other (with the exception of polygenic scores p = .10 and p = .001, which were not significantly correlated), and some polygenic scores were correlated with PC1 but not PC2 (see Table 2).
Table 2. Zero-order correlations
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20230113060854726-0997:S095457941800127X:S095457941800127X_tab2.gif?pub-status=live)
Note: Intervention is coded 0 = control, 1 = intervention. Ancestry PC 1 and 2 are principal components accounting for genetic variation due to race/ethnicity. Sep. anxiety, separation anxiety. Obsess. comp., obsessive–compulsive. Major dep., major depression. *p ≤ .05. **p ≤ .01.
Testing Polygenic Sensitivity × Intervention Status interactions
Results for regression models testing the first hypothesis examining polygenic sensitivity, intervention status, and their interaction in relation to total symptoms are presented in Table 3. All two-way interactions between polygenic scores and covariates predicting total symptoms were initially included, but were nonsignificant. Specifically, Polygenic p = .05 × Gender, p < .953; Polygenic p = .05 × Income, p < .216; Polygenic p = .05 × Age, p < .499; Polygenic p = .05 × PC1, p < .308; Polygenic p = .05 × PC2, p < .056; and Polygenic p = .05 × Study Site, p < .737 and p < .864 for the two dummy codes. Similarly, Polygenic p = .10 × Gender, p < .925; Polygenic p = .10 × Income, p < .114; Polygenic p = .10 × Age, p < .281; Polygenic p = .10 × PC1, p < .173; Polygenic p = .10 × PC2, p < .564; and Polygenic p = .10 × Study Site, p < .986 and p < .969 for the two dummy codes. Interactions with covariates were removed from final models because none were significant, and excluding them from analyses did not alter the significance of any model. Thus, we present results for models including only the main effect of covariates.
Table 3. Testing polygenic moderation of intervention effects on total symptoms
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20230113060854726-0997:S095457941800127X:S095457941800127X_tab3.gif?pub-status=live)
Note: PC1 and PC2, principal components 1 and 2, respectively.
Total symptoms
The p = .05 polygenic score significantly interacted with intervention status, such that the intervention effect was stronger at higher levels of polygenic sensitivity, and a similar but weaker interaction was found for the p = .10 score (see Table 3). These results support the first hypothesis. There were no significant effects for either the p = .001 or the p = .01 polygenic scores, and they were not considered further. Regions of significance for the interaction effect across different values of polygenic sensitivity, and simple slopes of polygenic score in the control and intervention groups, are presented in Table 4 for significant interactions, and Figure 1 depicts regions of significance and simple slopes for the p = .05 and p = .10 scores. For the p = .05 score, there was a significant main effect of polygenic sensitivity, such that for individuals in the control group, higher polygenic score was significantly associated with higher total symptom count. This result supports the second hypothesis. However, the simple slope of polygenic score in the intervention group was nonsignificant. Testing regions of significance for the polygenic score indicated that the control and intervention groups differed significantly from each other on total symptoms for values of the polygenic score greater than 0.493 SD above the mean or more than 2.019 SD below the mean. As only 10 individuals scored lower than 2.00 SD below the mean (6 in the control group, 4 in the intervention), the lower bound of the region of significance should be interpreted with caution.
Table 4. Regions of significance and simple slopes for significant interactions
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20230113060854726-0997:S095457941800127X:S095457941800127X_tab4.gif?pub-status=live)
Note: Polygenic sensitivity scores were Z-scored, so regions of significance are on a metric of standard deviations above and below the mean. The control and intervention groups are significantly different outside the upper and lower bounds of the region of significance for all models. *p ≤ .05. **p ≤ .01.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20230113060854726-0997:S095457941800127X:S095457941800127X_fig1g.gif?pub-status=live)
Figure 1. Polygenic Score × Intervention interaction. (a) Polygenic p = .05 score and (b) polygenic p = .10 score. Polygenic score is on a Z-score metric, and thus units on the x-axis correspond to standard deviations from the mean. The range of the polygenic scores observed in our sample is from –3.036 to 3.356 for the p = .05 score and from –3.291 to 2.763 for the p = .10 score. Vertical lines denote regions of significance. The control and intervention groups are significantly different from each other outside the vertical lines for the p = .05 score, and to the right of the vertical line for the p = .10 score.
Unlike the p = .05 score, the simple slope of the p = .10 polygenic score was not significant in either the intervention or the control group when examining total symptoms. However, the full interaction model indicated that these two slopes did differ significantly from each other, and testing regions of significance indicated that the control and intervention groups differed significantly from each other on symptom count for values of polygenic sensitivity higher than 0.660 SD above the mean.
Symptom scales
For the two polygenic scores that interacted significantly with intervention status in relation to total symptoms, we followed up by testing the interaction separately for each symptom scale (see Table 5 for regression results, and Table 4 for regions of significance and simple slopes). Findings were similar across the two scores, with the interaction between polygenic score and intervention status significantly associated with symptoms of general anxiety, obsessive–compulsive disorder, and mania. These results support the first hypothesis. In terms of regions of significance and simple slopes for the p = .05 score, significant findings at the scale level were largely similar to findings for total symptoms; however, for obsessive–compulsive symptoms, the control and intervention groups only differed significantly for individuals higher than 1.848 SD above the mean or lower than 1.015 SD below the mean on polygenic score. For the p = .10 score, the simple slope of polygenic score was significant in the control group for both general anxiety and obsessive–compulsive symptoms, such that a higher genetic score was associated with higher symptoms in the control group, with no significant association in the intervention group. This result supports the second hypothesis. For symptoms of mania, the simple slope of polygenic score was significant in the intervention but not the control group, such that polygenic score was associated with lower manic symptoms in the intervention group. However, regions of significance indicated that the control and intervention groups only differed significantly at values of polygenic score greater than 2.499 SD above the mean (three individuals in our sample, two in the intervention and one in the control), suggesting that results should be interpreted with caution.
Table 5. Testing polygenic moderation of intervention effects on symptom scales
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20230113060854726-0997:S095457941800127X:S095457941800127X_tab5.gif?pub-status=live)
Note: Child gender, age, monthly family income, and site were included as covariates in all models, but results are omitted to save space. No covariate significantly predicted symptom outcomes in any model. *p ≤ .05. **p ≤ .01.
Diagnoses
For the two polygenic scores that interacted significantly with intervention status to predict total symptoms, we used logistic regression in Mplus to examine whether polygenic sensitivity interacted with intervention status in relation to the presence versus absence of any diagnosis. The interaction between polygenic score and intervention status was not significantly related to diagnosis for either the p = .05 score (estimate = –.170, SE = .246, p = .489, odds ratio = 0.843) or the p = .10 score (estimate = .027, SE = .243, p = .912, odds ratio = 1.027).
Discussion
We argue that interdisciplinary approaches that capture the synergy between behavior genetics and cultural psychology under the realm of cultural genomics are needed to test contemporary developmental theories and elucidate risk and resilience for child psychopathology. Each of these fields on its own captures important linear influences, but modeling nonlinear coaction across levels of analysis is needed to represent the many feedback loops that exist within and across biological and environmental levels. Empirically, we tested an interaction between a genetic index representing sensitivity to the environment and an efficacious intervention that focused on parenting. The major finding was that polygenic sensitivity moderated the effects of the Family Check-Up intervention on children's symptoms of internalizing psychopathology in a culturally diverse, high economic and family risk sample. Specifically, children who were genetically sensitive to the environment and were randomly assigned to the intervention group had fewer symptoms of internalizing psychopathology than genetically sensitive children assigned to the control condition.
This finding of significant moderation of intervention effects is important because intervention status alone did not predict child-report Diagnostic Interview Schedule for Children-IV symptoms of internalizing psychopathology in this sample. Similarly, earlier in middle childhood (ages 7–8), investigators concluded that the Family Check-Up only demonstrated an indirect effect on the development of parent-reported internalizing symptoms by improving maternal depression during early childhood, with no direct effects (Reuben, Shaw, Brennan, Dishion, & Wilson, 2015). However, when genetic moderation was examined, we saw important effects for approximately 25% of the sample, with significant differences between the control and intervention groups beginning at 0.493 SD above the mean on polygenic sensitivity with the p = .05 score. Results support the theoretically based assertion that parenting-based interventions such as the Family Check-Up have large effects on internalizing psychopathology for environmentally sensitive children, but no significant effects for children who are less sensitive to the environment. By averaging across all individuals in the sample, traditional approaches that do not account for individual differences do a disservice to the field and to the population by concluding that interventions have small and sometimes nonsignificant effect sizes on everyone. The children who do not respond to traditional preventive interventions focused on parenting are in need of interventions tailored to their characteristics and circumstances, and this work can proceed once they are identified. Overall, these findings move us closer to using a tailored and personalized medicine approach when providing interventions for child psychopathology.
Despite theoretical support, only one other group has formed a polygenic sensitivity score based on genome-wide association study findings representing children's genetic predisposition to environmental sensitivity. Keers et al. (2016) conceptualized and empirically tested this polygenic sensitivity score utilizing two samples of children from the United Kingdom, the Twins Early Development Study sample of 12-year-old twin children, and the Genes for Treatment sample of children with diagnosed anxiety disorders. We replicated their findings and extended their generalizability to a culturally diverse sample of children in the United States using a parenting-based intervention. However, our results focus exclusively on 10-year-olds, and genetic association may vary by chronological age or developmental period. With our follow-up scale-level findings, we reported stronger associations with anxiety disorders and obsessive–compulsive disorder symptoms than depression symptoms. Depression generally has a low base rate in middle childhood, and early life anxiety creates vulnerability for later depressive disorders and impairment across the life span (Emmelkamp & Wittchen, 2009). Thus, it may be that genetic association with depression would be stronger in adolescence. This hypothesis should be empirically examined, as should association with developmental outcomes, such as growth in child psychopathology across childhood and adolescence.
Besides Keers et al. (2016), only one other group has tested moderation of intervention effects using any polygenic score based on a discovery genome-wide association study, with the extant literature largely relying on candidate gene approaches (Bakermans-Kranenburg & van IJzendoorn, 2015). Musci and colleagues formed the tobacco polygenic risk score and found that it moderated the effects of an elementary school-based intervention on age of first cigarette smoked (Musci et al., 2015) and age of first marijuana use (Musci et al., 2018). Besides the importance of identifying individuals for whom the intervention worked, experimental studies hold great advantages for testing genetic moderation of physical environment and sociocultural effects. Most important, the polygenic score and the measure of the environment are uncorrelated in experimental designs, ensuring independence between genetic influences and changes in the environment. In addition, experimental studies typically use state-of-the-art concurrent and proximal measures of the environment that reduce measurement error. Partially because of these strengths, the power to test genetic moderation is much higher using experimental designs. In one set of simulations, a moderation effect that required 1,300 participants to detect in a correlational study required only 100 participants in an experimental design to reach the same power (Bakermans-Kranenburg & van IJzendoorn, 2015). Thus, meaningful genetic effects can be obtained with traditional sample sizes that are powered to test moderation.
Theoretical implications of study findings
Our results are consistent with differential susceptibility theory (Belsky & Pluess, 2009; Ellis et al., 2011) in that children in the intervention group had significantly fewer internalizing symptoms than those in the control group if they were higher on polygenic sensitivity to the environment (Hypothesis 1). In addition, children in the control group with higher polygenic sensitivity had more internalizing symptoms than those also in the control group with lower polygenic sensitivity (Hypothesis 2). These findings support the theoretical assertion that differentially susceptible children do worse in high-risk environments (i.e., control group experiencing economic and family risk) and do better in protective or promotive environments (i.e., intervention group experiencing the same economic risk, but with improved parent mental health and parents utilizing more positive behavioral management after the intervention). Note that we could not compare the diathesis-stress (i.e., an interaction between a dispositional diathesis and environmental stress when predicting negative outcomes), differential susceptibility, and vantage sensitivity theories (i.e., an interaction between a dispositional diathesis and environmental support when predicting positive outcomes) because these models cannot be differentiated unless child outcomes are assessed across the full continuum from negative to positive (Clifford & Lemery-Chalfant, 2015). Our outcome was symptoms of child psychopathology, which does not capture positive adaptation or flourishing.
The overall finding of genetic differences in response to intervention may explain why past attempts to identify genetic variants linearly associated with child psychopathology have been largely unsuccessful. It also explains other patterns observed in the behavior genetics literature, such as the discrepancy between “too low” SNP heritability (i.e., the degree to which SNP variation accounts for phenotypic variation) compared to heritability estimates obtained from quantitative genetic approaches such as twin and adoption studies.
Polygenic sensitivity to the environment based on identical twin differences
It is standard practice when creating polygenic scores from genome-wide association study results to form the scores at multiple p value cutoffs and test association with all of these scores, as individual SNP effect sizes are unknown and are thought to be small. In line with our findings, Keers et al. (2016) also reported that the p < .05 and p < .10 polygenic sensitivity scores, but not the p < .01 or p < .001 scores, significantly moderated the effects of the cognitive behavioral therapy intervention on children's anxiety. These are not independent findings as these two scores have a high degree of overlap and are correlated, r = .73, p < .01, in our sample. The fact that two relatively conservative scores captured the important variance is encouraging as association with more liberal scores (e.g., p = .50) could substantially increase the chance of spurious results. It is likely that the use of identical twins in the discovery genome-wide association study led to scores that were relatively free of environmental contamination, as the identical twin difference design controls for all genetic (i.e., identical twins have identical genotypes) and family-level influences (e.g., family socioeconomic and sociocultural status) on the phenotypes (Lemery & Goldsmith, 1999).
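The practice of forming scores at multiple p value cutoffs can be sketched as a weighted sum of allele counts over the SNPs that pass each cutoff. This is a minimal illustration only: the SNP identifiers, weights, and p values below are hypothetical, and steps that real pipelines typically include (e.g., pruning correlated SNPs and quality control) are omitted.

```python
# Hypothetical discovery results: SNP id -> (effect weight, discovery p value).
DISCOVERY = {
    "snp_a": (0.20, 0.0005),
    "snp_b": (-0.10, 0.03),
    "snp_c": (0.30, 0.08),
    "snp_d": (0.50, 0.40),
}


def polygenic_score(allele_counts, discovery, p_cutoff):
    """Weighted sum of allele counts over SNPs with discovery p < p_cutoff.

    Returns the score and the number of SNPs that contributed to it.
    """
    score = 0.0
    n_snps = 0
    for snp, (weight, p_value) in discovery.items():
        if p_value < p_cutoff and snp in allele_counts:
            score += weight * allele_counts[snp]
            n_snps += 1
    return score, n_snps


def scores_by_cutoff(allele_counts, discovery,
                     cutoffs=(0.001, 0.01, 0.05, 0.10, 0.50)):
    """One score per cutoff; more liberal cutoffs include more SNPs."""
    return {c: polygenic_score(allele_counts, discovery, c) for c in cutoffs}


# Hypothetical child: number of effect alleles (0, 1, or 2) at each SNP.
child = {"snp_a": 2, "snp_b": 1, "snp_c": 0, "snp_d": 1}
for cutoff, (score, n_snps) in scores_by_cutoff(child, DISCOVERY).items():
    print(f"p < {cutoff}: score = {score:.2f} from {n_snps} SNPs")
```

Because each more liberal cutoff contains every SNP from the stricter ones, scores at adjacent cutoffs overlap by construction, which is why associations with neighboring scores are not independent tests.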
At the same time, this score was limited to identical twin differences in composited parent report and self-report of emotional problems in 12-year-olds growing up in the United Kingdom. It is likely that twin differences in emotional problems do not capture all of the genetic influences on environmental sensitivity. Thus, future work should examine genetic association with identical twins of different ages and with different phenotypes including externalizing psychopathology to help inform polygenic sensitivity to the environment.
Study limitations
In addition to many strengths, including the theoretically based design that included random assignment to an intervention, there are several limitations. First, our results do not generalize to everyone. We examined internalizing psychopathology only at age 10 years, the only age when the Diagnostic Interview Schedule for Children—IV interview was conducted with children. Genetic and environmental influences can vary in their relevance to development at different ages, and thus our findings might be specific to middle childhood (Dick et al., 2017). Although our sample had adequate representation of European Americans, African Americans, and Latinos, we had little representation of other groups such as Native Americans and Asian Americans. All families were recruited from Women, Infants, and Children nutrition supplement programs for low-income families. Families were additionally screened for family (e.g., maternal depression) and/or child risk (e.g., problem behaviors). Based on these restrictions, the results are most directly applicable to families with economic and family risk for child psychopathology. Second, the polygenic sensitivity score was formed based on a discovery genome-wide association study conducted on genotyped identical twins participating in the longitudinal Twins Early Development Study. The sample of twins was 93% White, which is representative of the population of the United Kingdom (Haworth et al., 2013). However, the extent to which SNPs identified in this largely White sample generalize to more diverse samples such as ours is unknown. Third, we relied on children's self-reports during a structured clinical interview to assess outcomes.
Although self-report can be potentially biased, we think that self-reported outcomes are more conservative than caregiver-reported outcomes when testing the effects of the Family Check-Up, as the primary focus of this intervention is on changing parenting (Van Ryzin et al., 2012). A fourth limitation is that our analysis conflated assignment to the intervention group with receipt of treatment. Each year, 25% to 35% of those assigned to the intervention did not engage in services (Dishion et al., 2014). Thus, true effect sizes are likely to be larger than reported here.
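The reasoning behind the fourth limitation can be made concrete with a simple rescaling of an assignment-based (intent-to-treat) effect by the engagement rate. The sketch below assumes that assignment has no effect on families who did not engage and that control families received no services; these assumptions, the function name, and the example effect size are illustrative and are not taken from the analyses reported here.

```python
def effect_among_engaged(itt_effect, engagement_rate):
    """Rescale an intent-to-treat effect to the families who engaged.

    Assumes no effect of assignment on non-engaging families and no
    access to the intervention in the control group.
    """
    if not 0.0 < engagement_rate <= 1.0:
        raise ValueError("engagement_rate must be in (0, 1]")
    return itt_effect / engagement_rate


# Hypothetical intent-to-treat effect of -0.30 SD on symptoms.
# Non-engagement of 25% to 35% corresponds to engagement of 0.75 to 0.65.
for rate in (0.75, 0.65):
    print(f"engagement {rate:.2f}: {effect_among_engaged(-0.30, rate):.2f} SD")
```

Under these assumptions, the effect among engaged families is roughly one third to one half larger in magnitude than the assignment-based estimate.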
Implications and future directions
Forming polygenic scores based on biological function
An important implication of the theoretical foundation and empirical findings from this study is that important SNPs should not be limited to those that show linear associations with phenotypes in a genome-wide association study. Through consortia and data harmonization, sample sizes for genome-wide association studies have increased from thousands of participants to tens of thousands and sometimes hundreds of thousands of participants, which ensures that we can identify SNPs with significant main effects, even if they have very small effect sizes. Nearly all of these participants are of European descent in an attempt to control for population stratification of genotypes. With these large samples, measurement of the phenotype is very limited, sometimes involving only a few self- or parent-reported items, or a single dichotomous diagnosis. Furthermore, the genome-wide association study model is elementary, including separate linear main effects of each SNP across the genome, with few covariates included. One could imagine that the relation between a single genetic variant and a complex psychological outcome such as a psychiatric diagnosis might be more complex and involve additional predictors or levels of analysis. Moreover, there is some limited evidence that when the genetically informed model goes beyond main effects and includes gene–environment correlation and interaction, some combinations yield moderate and large effects (Bakermans-Kranenburg & van IJzendoorn, 2015). The problem with ignoring the environmental and cultural levels of analysis in behavior genetics studies is that genetic association likely varies by components of the environment and culture, such that the same genotype can lead to different phenotypes in different physical and sociocultural contexts.
Bioinformatics provides methods of incorporating additional knowledge into the selection of genetic polymorphisms to assist in forming polygenic scores for use in genetic association analyses. Bioinformatic tools are available to help identify gene networks and biological pathways with known function in brain regions linked to phenotypes such as developmental psychopathology. One example is the use of gene set enrichment analysis to identify biologically meaningful gene sets from a list of initial SNPs (Holden, Deng, Wojnowski, & Kulle, 2008). The initial list of SNPs can come from a targeted microarray, from proteomics, or from a genome-wide association study with SNPs showing association with the phenotype, for example. Gene set enrichment analysis helps identify phenotype-relevant SNPs, and also identifies mechanisms of action by identifying corresponding biological pathways. Future studies in cultural genomics should use bioinformatics tools to select functionally meaningful genetic variants for further study.
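The basic logic of asking whether a pathway is overrepresented among selected genes can be sketched with a hypergeometric test. Note that this is over-representation analysis, a simpler relative of gene set enrichment analysis, which works from a ranked list rather than a fixed selection; it is shown only to illustrate the idea, and all counts in the example are hypothetical.

```python
from math import comb


def overrepresentation_p(n_universe, n_in_pathway, n_selected, n_overlap):
    """Probability of at least n_overlap pathway genes among n_selected genes.

    Models the selected genes as a random draw without replacement from a
    universe of n_universe genes, n_in_pathway of which belong to the pathway.
    """
    total = comb(n_universe, n_selected)
    upper = min(n_in_pathway, n_selected)
    tail = sum(
        comb(n_in_pathway, k) * comb(n_universe - n_in_pathway, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


# Hypothetical counts: 20,000 genes in total, 150 in a pathway of interest,
# 300 genes mapped from the selected SNPs, 9 of which fall in the pathway.
p = overrepresentation_p(20000, 150, 300, 9)
print(f"over-representation p = {p:.2e}")
```

A small p value indicates that the selected genes fall in the pathway more often than expected by chance, which is the kind of evidence used to nominate a biological pathway for further study.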
Genetically informative studies of culturally diverse groups
Biased sampling and the neglect of cultural contributions to health and development are threatening the generalizability of what we know about human behavior and social processes generally, and developmental psychopathology more specifically. Over 80% of research participants are Western, educated, industrialized, rich, democratic (WEIRD) people who make up only 12% of the world's population (Henrich, Heine, & Norenzayan, 2010). By turning scientific attention toward variable in traditions and lifeways (VITAL) peoples, we can begin to understand cultural and biological interplay and paint a more accurate picture of risk and resilience processes in development.
The study of genetic influences on social, behavioral, and health processes is similarly afflicted with a focus on WEIRD populations, threatening what we know about genetic contributions to development. In this field, findings from studies of these populations are considered human universals, while extensive cultural variation in these processes is systematically understudied. Furthermore, the field of behavior genetics has laws based largely on findings from studies of WEIRD populations. Turkheimer (2000) presented the three laws of behavior genetics, with Chabris, Lee, Cesarini, Benjamin, and Laibson (2015) adding a fourth law. The first law is that all human behavioral traits are affected by genetic variation. The fourth law of behavior genetics added by Chabris et al. (2015) expands this first law to state that human traits are influenced by many genetic variants of small effect (<1%) rather than a few genes of large effect. The second law is that the effect of being raised in the same family is smaller than the effect of genes, with twin studies of WEIRD samples yielding genetic effects around 50% with negligible effects of the shared environment, or aspects of the environment that create similarities between individuals (Polderman et al., 2015).
With truncated trait-relevant environmental differences present when studying WEIRD samples, these “laws” are not surprising. For example, when giving the presidential address at the biennial meetings of the Society for Research in Child Development, behavior geneticist Dr. Sandra Scarr made the shocking argument that parenting within the normal range does not matter for child development (Scarr, 1992). With research on WEIRD children, rearing environments are likely less variable and thus genetics accounts for a larger proportion of individual differences than does the environment and culture. However, when we expand the reach of behavior genetics studies to other cultural groups or populations, or we improve our models to better allow for individual differences, we see a different pattern.
Relevant to child psychopathology, a meta-analysis reported that additive genetic influences accounted for nearly half of the variance in self-reported Child Behavior Checklist symptoms of anxiety (48%) and depression (44%), with smaller shared environmental influences (12% and 14%, respectively) with samples made up of WEIRD children (Burt, 2009). Longitudinal studies also suggest that stability is largely genetically mediated, with some new genetic influence emerging in adolescence. These findings are in line with the laws of behavior genetics. In contrast, a recent study of Chinese twins who self-reported anxiety and depression on the Child Behavior Checklist longitudinally at approximately 11 and 14 years of age yielded unexpected findings (Zheng, Rijsdijk, Pingault, McMahon, & Unger, 2016). Heritability estimates were substantially lower (23% and 20%, respectively) at 11 years and decreased to negligible by 14 years. Shared environmental influences were similar at age 11 (20% and 27%, respectively) and increased substantially (57% and 60%, respectively) by 14 years. Stability was largely explained by the shared environment. In China, the transition to adolescence corresponds to environmental changes such that family and neighborhood experiences could have a larger impact on developmental psychopathology. It also could be that active gene–environment correlation occurs more frequently with WEIRD populations as children have more opportunity to select environments that match their genetic dispositions. Active gene–environment correlation would decrease fraternal twin correlations relative to identical twin correlations and thus increase estimates of genetic influence, and decrease estimates of shared environmental influence.
Other studies of WEIRD samples utilizing moderated heritability models also elucidate the hidden importance of the sociocultural environment for some traits and some individuals (children's temperament; Lemery-Chalfant, Kao, Swann, & Goldsmith, 2013).
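The point that lower fraternal twin correlations raise genetic estimates and lower shared environmental estimates can be illustrated with Falconer's formulas, a back-of-the-envelope approximation to the structural equation models used in the twin studies cited here. The twin correlations below are hypothetical and chosen only to show the direction of the change.

```python
def falconer_estimates(r_mz, r_dz):
    """Approximate variance components from twin correlations.

    h2: additive genetic, c2: shared environment, e2: nonshared environment
    (e2 also absorbs measurement error).
    """
    h2 = 2.0 * (r_mz - r_dz)
    c2 = 2.0 * r_dz - r_mz
    e2 = 1.0 - r_mz
    return {"h2": h2, "c2": c2, "e2": e2}


# Hypothetical correlations: same identical twin correlation, but a lower
# fraternal twin correlation in the second scenario.
baseline = falconer_estimates(r_mz=0.60, r_dz=0.45)
lower_dz = falconer_estimates(r_mz=0.60, r_dz=0.35)
for label, est in (("baseline", baseline), ("lower r_dz", lower_dz)):
    print(label, {name: round(value, 2) for name, value in est.items()})
```

Holding the identical twin correlation fixed, each drop in the fraternal twin correlation moves variance from the shared environment estimate to the genetic estimate, which is the pattern expected if active gene–environment correlation is more common in one population than another.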
Finally, the third law of behavior genetics states that a large portion of individual differences is due to neither genetics nor the family; that is, these differences reflect nonshared environmental influences that create differences rather than similarities between individuals (Turkheimer, 2000). Behavior geneticists have labeled the attempt to identify what aspects of the trait-relevant environment (e.g., differential treatment by parents and different classrooms) make up the nonshared environment “the Gloomy Prospect,” as most studies yielded null results (Plomin & Daniels, 1987). However, differential susceptibility theory indicates that one way genetics may influence health and development is by representing variability in sensitivity to the environment (Belsky & Pluess, 2009). Keers et al.’s (2016) polygenic sensitivity score based on SNPs that predicted identical twin differences in emotional problems suggests that genetic variants can be meaningfully related to identical twin differences. Thus, the difficulty in accounting for estimates of nonshared environmental influence found in twin studies is likely explained in part by environmental influences acting in concert with individual genetic predispositions over time, rather than the simple linear effect of any single environmental factor. With differential susceptibility theory to guide the identification of genetic variants associated with environmental sensitivity, there may be nothing gloomy about “the Gloomy Prospect” at all.
Thus, the laws for WEIRD peoples should be updated to incorporate developmental processes in VITAL peoples to elucidate risk and resilience for developmental psychopathology. The field of behavior genetics is young and stands to gain much by coupling with the field of cultural psychology in the new field of cultural genomics to better represent the full range of genetic and environmental trait-relevant variation in VITAL peoples. Meanwhile, cultural psychology can benefit from integrating genetic and physiological theory and analysis into cultural research, especially as an understanding of genetics research and methodology can provide some of the most compelling arguments against an overly simplistic, deterministic view of the role of genetics in shaping individual outcomes. In line with differential susceptibility theory, the extent to which risk and promotive factors contribute to phenotypes may be a function of a genetic predisposition to environmental sensitivity, or developmental plasticity. Few studies have tested this theory in minority groups.
There are racial–ethnic disparities in developmental outcomes for youth and families, with some initial evidence that genetics moderates the links, for example, between racial discrimination and the development of conduct problems (Brody et al., 2011) and criminal arrests (Schwartz & Beaver, 2011). We have an ethical obligation to pursue the public health priority of designing and testing multicultural interventions (Smith et al., 2014), and examining putative moderators of intervention efficacy.
Conclusion
We argue for an integration of cultural psychology and behavior genetics under contemporary developmental theories to bridge these perspectives and make developmental science more representative of local cultural conditions and more valid across cultures of the world. Based on our findings, it is likely that intervention effects on environmentally sensitive children are underestimated. We found that the parenting-based Family Check-Up intervention was effective in children with higher polygenic sensitivity to the environment, such that a significant difference in self-reported Diagnostic Interview Schedule for Children—IV symptoms emerged between the intervention and control groups for those 0.493 SD above the mean on polygenic sensitivity to the environment, or 25% of the sample. It is useful to understand which children benefit from parenting-based interventions and which children do not. Children with lower environmental sensitivity may benefit from an alternative intervention, perhaps a cognitive-based intervention. Similar to personalized medicine, it is time to understand individual differences in treatment response and individualize psychosocial interventions to reduce the burden of child psychopathology and maximize well-being for children growing up in a wide range of physical environments and cultures.