1. Introduction
Childbearing involves uncertainty. Imperfect control over conception, marital formation, and dissolution, changes in financial resources, and other mutable factors influence whether and when people have children [Heckman and Willis (Reference Heckman, Willis and Terleckyj1976)]. Because these factors can prevent people from perfectly predicting their future fertility, the number of children people say they intend to have frequently differs from the number of children they end up having [Bachrach and Morgan (Reference Bachrach and Morgan2013)]. In 1985, the Panel Study of Income Dynamics (PSID) asked several thousand people in the United States to state whether they intend to have a child in the future, and, if so, how many children they intend to have [Panel Study of Income Dynamics (2017)]. The PSID then tracked childbearing over the subsequent decades. Fifteen percent of women had a child after stating that they did not intend to do so, or did not have a child after stating that they intended to do so. Twenty-five percent had fewer or more children than they had stated was their intention. Many studies similarly document differences between stated intentions and actual fertility using longitudinal surveys administered in the United States and other countries.Footnote 1
In this paper, I compare the accuracy of two ways of using fertility intentions to predict the number of children that people will have. I use a sample of women aged 18–44 who had never given birth before 1985 and who subsequently had 0.99 children on average. In 1985, these women reported that they intend to have 1.44 children on average in the future, nearly half a child higher on average than their actual future fertility. Alternatively, for the same group of childless women in 1985, I calculate the share who reported that they intend to have at least one child in the future. Then, among women with one child in 1985, I calculate the share who reported that they intend to have at least one more child. I continue with these calculations at higher parities, which together yield a predicted distribution over total future number of children. The average of this distribution, 1.00, almost exactly matches the actual average of 0.99. Responses to the question “do you intend to have a baby at some time?” collected from a broad group of women better predict fertility for childless women than do these women's own responses to the question “how many do you intend to have?” If the goal is to predict average fertility, then the best way to ask people how many children they intend to have may not be to ask them “how many children do you intend to have?”
I consider the role of rational expectations in explaining this surprising finding. Again, because of uncertain future factors, people may not be able to perfectly predict fertility. A person with rational expectations knows the expected distribution of these factors, even though she doesn't know what her draw will be. Manski (Reference Manski1990) provides a framework for evaluating whether intentions are consistent with an assumption that they are formed using rational expectations. I show that intention to have a child is generally consistent with rational expectations, but intended number of children differs from subsequent fertility to a degree that suggests errors in the formation of intentions. However, I also show that the surprising pattern observed among PSID respondents does not require errors in the formation of intentions. Rather, the structure of intentions themselves can provide a particularly inaccurate prediction. If respondents' state their most likely number of children as their intended number of children, then actual fertility can differ from intended fertility in aggregate, even without any errors in the formation of intentions.
2. Panel Study of Income Dynamics
The PSID is a longitudinal survey of several thousand U.S. families, conducted annually between 1968 and 1996 and biannually starting in 1997. In 1985, the PSID recorded fertility intentions of women who were aged 44 or younger and of men whose wives were aged 44 or younger. The PSID asked two questions. First, “Looking to the future, do you intend to have a baby at some time?” The PSID permitted three responses: “Yes,” “No,” and “Maybe; Don't know.” Among respondents who stated that they intend to have a child in the future, the PSID further asked “How many do you intend to have?” The PSID then recorded actual childbearing over the following decades.Footnote 2
I use a sample 3,251 women aged 18–44 in 1985 who were present in the household and responded for themselves; who reported their current number of children; who were not pregnant; who reported the year of birth of every child they have had; who never adopted a child; who never had a twin or other multiple birth; and who answered the initial intentions question, “do you intend to have a baby at some time?” Column 1 of Table 1 provides basic demographic characteristics of this sample. These women were 32 years old on average when observed in 1985. Eighty-six percent were white, 66% were married, and 85% had completed high school (these calculations, and all others in the paper, are made using sampling weights). Eight percent of these women responded with uncertainty when asked whether they intend to have a child in the future. I exclude these women from all estimates that require a response of yes or no to this intentions question.
Table 1. Sample characteristics in 1985
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab1.png?pub-status=live)
Notes: The sample consists of women who were present in the household and responded for themselves; who reported their current number of children; who were not pregnant; who reported the year of birth of every child they have had; who never adopted a child; who never had a twin or other multiple birth; and who answered the initial intentions question, “do you intend to have a baby at some time?” The sample in columns 2 and 3 is further restricted to women who either respond “No” when asked whether they intend to have a child in the future, or who respond “Yes” and then also provide an intended number of children. See section 2. Standard errors are provided in parentheses. Data source: Panel Study of Income Dynamics.
I focus much of the analysis in this paper on a subsample of 394 women who had zero children by 1985; whose birth history continued to be recorded until at least age 40Footnote 3; and who either responded “No” when asked whether they intend to have a child in the future, or who responded “Yes” and then also provided an intended number of children. Columns 2 and 3 of Table 1 compare these women to the 202 women with zero children in 1985 who were not followed until age 40. The PSID continued to follow women who were, on average, older, more likely to be white, less likely to be married, and more likely to have completed high school. Except for the share that was white, these differences are all statistically significant at the 5% level.
The 394 women who were aged 18–44 and childless in 1985 (and who continued to be observed by the PSID until at least age 40) later had 0.99 children on average. Because completed fertility was not recorded for the 202 childless women who were not followed until age 40, I cannot directly determine whether the sample restriction biases completed fertility higher or lower. However, I estimate the direction of this bias by first regressing completed number of children on the demographic characteristics listed in Table 1, using the 394 women who were followed. Estimated coefficients from this regression, presented in Table 2, indicate that completed fertility is higher for women who were younger, white, married, or had graduated high school. Using the relationship between completed fertility and each demographic characteristic suggested by these estimated coefficients, I simulate fertility for the 202 women who were not followed. The average value of this simulation, 1.38 children, substantially exceeds the 0.99 children on average that were born to the women who were followed. This comparison suggests that attrition from the survey leads to an underestimate of future fertility.
Table 2. Regression of future fertility on demographic characteristics
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab2.png?pub-status=live)
Notes: The dependent variable is number of children after 1985. The sample consists of 394 women who were childless when observed in 1985. R 2 is 0.35. Standard errors are provided in parentheses. Statistical significance at the 10%, 5%, and 1% levels is denoted by *, **, and ***. The estimated constant is 3.20 with a standard error of 0.41, suggesting that 30-year old, nonwhite, unmarried, non-high school educated women are predicted to have 0.29 more children on average (3.20 − 0.097 × 30). See section 2. Data source: Panel Study of Income Dynamics.
3. Two ways of predicting fertility
I begin with the sample of 394 women who were aged 18–44 and childless in 1985, and who continued to be observed until at least age 40. These women went on to have 0.99 children on average: 47% remained childless, 17% had one child, 28% had two children, 8% had three children, and 1% had four children. Column 1 of Table 3 provides these shares, with standard errors in parentheses. In the rest of this section, I compare this actual fertility to two ways of predicting fertility for these same women using intentions reported in 1985.
Table 3. Actual and predicted number of children in the PSID
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab3.png?pub-status=live)
Notes: Column 1 provides the distribution of subsequent births to 394 women who were childless in 1985. Column 2 provides the distribution of intended number of births, stated by these same women in 1985. Column 3 provides a distribution predicted for these women using binary intentions collected in 1985. Bootstrap standard errors are provided in parentheses, calculated using 1,000 samples drawn with replacement from the original sample. See section 3.2.2. Data source: Panel Study of Income Dynamics.
3.1 Categorical intentions
The PSID asked these 394 women “do you intend to have a baby?” Thirty-six percent reported that they did not, a smaller share than the 47% who actually remained childless. The remaining 64% were then asked “how many [children] do you intend to have?” Column 2 of Table 3 provides the distribution of responses to this question. I label these intentions “categorical” because responses are integers between one and five children. Eight percent intended to have a single child, again a smaller share than actually achieved this family size. Conversely, 36% intended to have two children and 15% intended to have three children, larger shares than actually achieved these family sizes. The overall average number of intended children, 1.44, therefore substantially exceeds the average number of children actually born to these women. This finding—that actual fertility falls short of intended fertility—is consistent with the evidence in section 2 of an average decline in fertility intentions over time. As women age, they are more likely to revise their intended number of children downward than upward. It therefore follows that average intended number of children exceeds actual fertility.
3.2 Binary intentions
I construct an alternative predicted distribution for these same childless women using only responses to the intentions question “do you intend to have a baby?” Because I use responses of yes and no, I refer to this distribution as “binary.” [Ajzen and Fishbein (Reference Ajzen and Fishbein1980) refer to this intention as a “choice” intention.] Again, 36% of women report that they do not intend to have a child, so binary intentions predict that 36% of women will remain childless, just as categorical intentions do. However, at higher parities the distributions differ. To explain how I construct the predicted distribution using binary intentions and why it differs from the prediction using categorical intentions, I will first introduce a simple simulated population and discuss a related procedure, described by Udry et al. (Reference Udry, Bauman and Chase1973), on which the procedure I develop is based.
3.2.1 Binary intentions in a simulated population
Consider a population in which a childbearing career lasts for 3 years, from age 1 through age 3, and births only occur at ages 2 and 3. There are three types of women: type 1 gives birth when age 2 and age 3, type 2 gives birth when age 3, and type 3 never gives birth. The population is stable, so there are the same numbers of women at each age, and the distribution of women by type is the same at each age. Suppose that there are nine women in the population, one of each type observed at each age. Table 4 displays the cumulative number of children, by age and type, in this population (labeled population A).
Table 4. Simulated populations
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab4.png?pub-status=live)
Each birth occurs at the start of the year, after which each woman is asked to report her fertility intentions. All women have perfect control over conception and intentions exactly predict future fertility. Of the three women observed at age 1, two report that they intend to have a child in the future (only the type 3 woman intends to remain childless). Of the three women observed at age 2, only the type 1 woman intends to have a child in the future. No woman observed at age 3 intends to have a child in the future. No matter their age, all type 1 women intend to have two total children, all type 2 women intend to have one child, and all type 3 women intend to remain childless.
Consider just the six women observed when they are childless. The single type 1 woman will have two children, the two type 2 women will have one child, and the three type 3 women will remain childless. Therefore, 50% of childless women have zero children, 33% have one child, and 17% have two children. Categorical intentions are the same: the same type 1 woman intends to have two children, the same two type 2 women intend to have one child, and the same three type 3 women intend to remain childless.
Udry et al. (Reference Udry, Bauman and Chase1973) propose a procedure for using average binary intentions reported at each family size to predict final number of children. Again start with the six women in the simulated population observed when childless, three of whom (the type 1 and type 2 women) report that they intend to have a child in the future. Therefore, 50% of these childless women are predicted to remain childless, whereas the remaining 50% are predicted to have at least one child. Of the two women observed when they already have exactly one child, only one (the type 1 woman) intends to have another child. Therefore, 25% (one-half of 50%) are predicted to have just one child. Because no woman with two children intends to have a third child, the remaining 25% of women are predicted to have two children.
The distribution predicted using binary intentions accurately predicts that 50% of childless women will remain childless, but underpredicts the share that will have one child: 25%, compared to the actual 33%. The share predicted to have two children, 25%, exceeds the actual 17%. The mean of this binary prediction, 0.75 children, is therefore larger than the actual mean, 0.67 children. The age composition of people observed with one child explains this shortfall. Of the three women who are observed when childless but will go on to have at least one child, just the single type 1 woman then have a second child. Yet, among the two women observed when they have one child, half intend to have another child. Intention to have a third child is overrepresented among women who actually have one child, compared to women who have zero children but are predicted to have a first child.
Resolving this discrepancy requires considering not just parity (number of children a woman has), but also the ages at which each child was born. I predict fertility using binary intentions by adding consideration of these birth histories to the procedure of Udry et al. (Reference Udry, Bauman and Chase1973). For example, consider Angela, the type 1 woman observed at age 1. She reports that she intends to have a child, but this stated intention does not identify her type, and it is not yet revealed whether the child will be born when Angela is aged 2 or 3. I predict the timing of Angela's first birth using the birth histories of all women aged 3 who also had zero children as of age 1. There are two such women, one of whom first gave birth when aged 2 and one when aged 3. Because these women have completed childbearing and the population is stable, the timing of their first births predicts that there is a 50% likelihood that Angela will first give birth when aged 2 and a 50% likelihood she will instead first give birth when aged 3.
Suppose Angela first gives birth when aged 2. There is only one woman observed at age 2 who has just had her first birth (the same birth history as Angela up to this point), and this woman intends to have another child. Just one woman who has completed childbearing has a birth history in which the first of two births occurs at age 2, and this woman's second child was born when she was aged 3. Therefore, if Angela first gives birth when aged 2, she is predicted to also give birth when aged 3.
Suppose instead Angela has her first child when she is aged 3. There is again only one woman observed at age 3 who has just had her first birth, and this woman does not intend to have another child. Therefore, if Angela first gives birth when aged 3, she is predicted to nothave any more children. Angela therefore is predicted to have a 0% likelihood of having zero children, a 50% likelihood of having one child, and a 50% likelihood of having two children.
I repeat this procedure for each of the five other women who are observed when childless. Similar to Angela, the type 2 woman observed at age 1 intends to have a child, and is predicted to have an equal chance of having one or two total children. The type 2 woman observed at age 2 intends to have a child, which is predicted to be born when she is aged 3. All type 3 women do not intend to have a child, and are therefore predicted to have zero children. I then aggregate these six distributions, yielding a predicted distribution in which 50% of women have zero children, 33% have one child, and 17% have two children. This distribution exactly predicts actual childbearing.
Again, the procedure of Udry et al. (Reference Udry, Bauman and Chase1973) for using binary intentions overpredicts average fertility for childless women in population A. The rest of Table 4 simulates alternative populations. Type 4 women give birth when aged 2. Population B consists of one woman of each type 1, type 3, and type 4 observed at each age. Population C consists of one woman of each type observed at each age. The procedure of Udry et al. (Reference Udry, Bauman and Chase1973) underpredicts average fertility in population B (0.53 children per woman, compared to the actual 0.60), and exactly predicts the average of 0.71 children per woman in population C. In each case, though, the revised procedure that uses binary intentions and birth histories exactly predicts fertility. Aggregate binary intentions alone can underpredict, overpredict, or happen to accurately predict fertility, but additionally considering birth histories allows the procedure to always accurately predict fertility in stable populations in which intentions accurately reflect future behavior. Later in the paper, I consider reasons why predicted and actual fertility may differ in an actual population.
3.2.2 Binary intentions in the PSID
I now apply the procedure developed in section 3.2.1 to predict fertility for the 394 childless women observed by the PSID in 1985. Again, for a woman of a certain age who intends to have a child, the procedure predicts when this birth will occur using the birth histories of all women who have completed childbearing and who had the same birth history through that age. Because few women give birth after age 40, I construct this sample of women who have completed childbearing using all women aged 40 and older, even though I consider binary intentions reported through age 44. This cutoff at age 40 yields a larger group of women who have completed childbearing than would a higher cutoff, allowing more reliable predictions of the likelihood of giving birth at each age.
Column 3 of Table 3 provides this predicted distribution using binary intentions. Thirty-six percent of women indicate that they do not intend to have a baby and are predicted to have zero children, the same share as predicted using categorical intentions. Binary intentions predict that 35% of women will have exactly one child, a greater share than the 8% of women who state that they intend to have exactly one child. Again, the binary prediction is constructed by starting with each women's response to the binary intentions question, and then based on that response, simulating her future childbearing using the intentions of women at higher parities with the same birth history. The comparison with stated categorical intentions indicates that women with zero children are more likely to intend to have two or more total children than are women who have actually already had their first child. At higher parities, this comparison reverses: 36% of childless women intend to have two total children and 15% intend to have three total children, whereas binary intentions predict that just 23% and 4% of women will do so.
Figure 1 compares the distribution of actual fertility to the distributions predicted using categorical and binary intentions. Each point estimate is bounded by 83% confidence intervals, so that any two pairs of confidence intervals that do not overlap indicate that the difference between the two values is statistically significant at the 5% level. For example, 36% of childless women in 1985 state that they intend to remain childless, substantially less than the 47% who actually do so. Binary intentions also predict that 36% of women remain childless. The share of women with one child predicted using categorical intentions is substantially closer to the actual share than is the share predicted using binary intentions. However, at every family size between two and four children, the binary distribution more accurately predicts actual fertility, and the categorical distribution is statistically significantly different from the actual distribution.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_fig1.png?pub-status=live)
Figure 1. Actual and predicted final number of children, among currently childless women. Notes: Distribution from the first six rows of Table 3. Bootstrapped 83% confidence intervals calculated using 1,000 samples drawn with replacement from the original sample. See section 3.2.2. Data source: Panel Study of Income Dynamics.
This comparison between distributions is confirmed in the final two rows of Table 3. Both predicted distributions have an area of overlap of 0.81 in common with the actual distribution of children born. The mean of the categorical distribution, 1.44, is substantially higher than the actual mean of 0.99 because the categorical distribution underpredicts small families and overpredicts larger families. The binary distribution's overprediction of families with one child balances out its underprediction at all other family sizes, and the mean of the binary distribution, 1.00 children per women, nearly exactly matches the actual mean of 0.99 children. This difference between the means of the categorical and binary distributions is substantial and statistically significant, and suggests that binary intentions better predict average fertility.
4. Explaining differences between predicted and actual fertility
The PSID and many other surveys of intentions do not define for the respondent the meaning of intention. For example, the PSID asks “do you intend to have a baby?” If the respondent interprets this question as “do you want to have a baby?” then she would respond yes if she desires a child. If she instead interprets the question as “do you think you will have a baby?” then she would respond yes if her predicted likelihood of having a child is above some threshold, say 50%. I follow Juster (Reference Juster1966), Ajzen and Fishbein (Reference Ajzen and Fishbein1980), Manski (Reference Manski1990, Reference Manski2004), and other survey researchers and economists by assuming this second interpretation. The remaining discussion in sections 4 and 5 relies on this interpretation of intentions as a prediction of future behavior, not as a statement of preferences.
Consider a person asked to predict a future outcome. Uncertainty about the outcome occurs when at least one factor that influences the outcome is not known to a person at the time the prediction is recorded. A person with rational expectations is subject to this uncertainty but knows the distribution of these factors, and knows how every possible realization of the factors would influence the outcome. For example, some people struggle to conceive a child, but this infecundity is generally unknown at the start of a person's childbearing career. A person with rational expectations would know the probability of infecundity and would take this information into account when predicting future fertility.
This concept of rational expectations has been discussed in the context of expectations about future earnings [Dominitz (Reference Dominitz1998)], intention to work [Manski (Reference Manski1990)], and other predictions, whether elicited as an expectation, as an intention, or some other way. In the remainder of this paper, I similarly use the term rational expectations with regard to fertility intentions. Predictions formed using rational expectations do not necessarily average out to the true outcome. For example, if a group of people all accurately predict that they are likely but not guaranteed to have a child, they may all state that they intend to have a child, even though some will end up childless. Even when intentions are formed using rational expectations, there can still be differences between intended and actual fertility in aggregate.
aManski (Reference Manski1990) establishes a framework for evaluating whether stated intentions are consistent with rational expectations. Using a collection of binary (yes/no) intentions, this framework establishes testable bounds on observed behavior under the assumption that intentions are formed using rational expectations. The test cannot prove that survey respondents have rational expectations. Instead, it determines whether observed behavior is consistent with an assumption that intentions were formed using rational expectations. A wide variety of fertility behavior may fit this criterion.
In Appendix A, I use Manski's framework to demonstrate that yes/no responses to the first intentions question (“do you intend to have a baby?”) asked by the PSID in 1985 are generally consistent with rational expectations. In Appendix B, I extend Manski's framework to allow for yes/no/uncertain responses, and I find that stated intentions are again generally consistent with rational expectations. Only among women with three children in 1985 do stated intentions differ from subsequent fertility in a way that suggests errors in the formation of intentions. In Appendix C, I further extend Manski's framework to allow for categorical responses. I find that responses to the second intentions question (“how many [children] do you intend to have?”) are consistent with rational expectations for women who state that they intend to have zero or two children. For women who intend to have one, three, or four or more children, intended and actual number of children differ enough to suggest errors in the formation of intentions.
These errors could account for the relative inaccuracy of categorical intentions in predicting the number of children that PSID respondents will have. However, in the rest of this section I demonstrate that, even if categorical intentions are formed using rational expectations, binary intentions can still more accurately predict aggregate fertility. I begin with simulated populations 1 and 2 in Table 5. In population 1, 45% of people have a first child. In population 2, 55% of people have a first child. In both populations, 80% of people with one child have a second child, and nobody has a third child. These parity progression ratios shape the distribution of children across completed families. In population 1, 55% of families have zero children, 9% have one child, and 36% have two children. In population 2, 45% of families have zero children, 11% have one child, and 44% have two children.
Table 5. Additional simulated populations
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab5.png?pub-status=live)
Notes: The actual distribution of children across completed families in the third row is calculated from the parity progression ratios in the second row. Intentions are formed using rational expectations. For categorical intentions, a person indicates that she intends to have a certain number of children if she judges that number to be the most likely. For binary intentions, a person indicates that she intends to have a child if the probability of doing so is more than 50%. Half of population 3 consists of parents from population 1, the other half from population 2. See section 4.
In both populations, the most common number of children is zero. Assuming that a person's categorical intention equals the most likely category, these intentions predict that everyone in both populations will have zero children. On the contrary, binary intentions vary across the two populations. Assuming that a person intends to have a child if the probability of doing so is at least one-half, nobody in population 1 with zero children intends to have a child because only 45% of people actually do so. Therefore, in a predicted distribution constructed from binary intentions, nobody in population 1 progresses beyond zero children. In population 2, everybody with zero children intends to have a child, as does everybody with one child. Binary intentions predict that everybody in population 2 has two children.
In population 1, the actual distribution of children across completed families has a mean of 0.81, both predicted distributions have a mean of 0, and the overlap between the actual distribution and each predicted distribution is 0.55. The two ways of using intentions to predict fertility perform equally well. In population 2, the mean of the predicted distribution using categorical intentions, 0, is slightly closer to the mean of the actual distribution, 0.99, than is the mean of the predicted distribution using binary intentions, 2. The overlap between the actual distribution and predicted distribution using categorical intentions is 0.45, slightly larger than the 0.44 overlap between the actual distribution and predicted distribution using binary intentions. Categorical intentions predict fertility better than do binary intentions.
In simulated population 1 and in simulated population 2, everyone has the same binary intention, everyone has the same categorical intention, and both ways of using intentions to predict fertility yield distributions in which all probability is on a single number of children. This number of children can differ across the two predicted distributions, but, by assumption, categorical intentions select the most likely number of children. In any homogeneous population, such as populations 1 and 2, categorical intentions predict fertility at least as well as do binary intentions.
This comparison can reverse in a mixed population. In simulated population 3 in Table 5, half of people belong to population 1 and half to population 2. A particular person's membership in population 1 or 2 is known to the person but not the researcher. Fifty percent of people have zero children, 10% have one child, and 40% have two children. The mean number of children predicted using binary intentions in this mixed population, 1, more closely matches the actual mean, 0.9, than does the mean predicted using categorical intentions, 0. Similarly, the overlap between the actual distribution and distribution predicted using binary intentions is 0.9, which is greater than the 0.5 overlap between the actual distribution and distribution predicted using categorical intentions. As with PSID respondents, binary intentions can better predict the distribution of children across completed families than do categorical intentions in this simulated mixed population in which all intentions are consistent with rational expectations.
5. Discussion
A central purpose of surveyed intentions is to predict behavior [Manski (Reference Manski1990)]. To predict the number of children that a group of people will have, the most straightforward approach is to ask each person how many children she intends to have. However, responses to this question can provide a particularly poor prediction. Among a sample of childless women aged 18–44 when observed by the PSID in 1985, average intended number of future children is 1.44, yet these women only end up having 0.99 children on average. Alternatively, stated intentions to have or not have another child, collected from parents with various numbers of current children, can be used to calculate a predicted distribution of children across completed families. Using intentions provided by all women aged 18–44 when observed by the PSID in 1985, the average predicted number of children for childless women is 1.00, substantially closer to the actual realized average of 0.99.
The finding that binary intentions better predict average future fertility is not universal. At higher parities, categorical intentions can provide the better prediction. Panel a of Figure 2 repeats Figure 1, the distributions of actual fertility, fertility predicted using categorical intentions, and fertility predicted using binary intentions for childless women observed by the PSID in 1985. Again, as given the final two rows of Table 3, each predicted distribution has the same overlap with actual fertility, but binary intentions better predict the actual mean of 0.99 children. These values are repeated in the first two lines of Table 6. Panel b of Figure 2 repeats the calculations for women observed by the PSID with one child in 1985. As noted in Table 6, the categorical distribution better predicts actual fertility, as measured by both the overlap and mean of the distributions, although these differences are not statistically significant. As given in panels c and d of Figure 2 and the remaining rows of Table 6, both categorical and binary intentions nearly perfectly predict future fertility for women observed at higher parities.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_fig2.png?pub-status=live)
Figure 2. Actual and predicted number of children, by current number of children. Notes: Distributions generated according to methods described in section 3. Bootstrapped 83% confidence intervals calculated using 1,000 samples drawn with replacement from the original sample. See section 5. Data sources: Panel Study of Income Dynamics; British Household Panel Survey.
Table 6. Actual and predicted number of children in the PSID and BHPS, by parity
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab6.png?pub-status=live)
Notes: See notes to Table 3. PSID intentions recorded in 1985, BHPS intentions recorded in 1992. Bootstrap standard errors are provided in parentheses, calculated using 1,000 samples drawn with replacement from the original sample. See section 5. Data sources: Panel Study of Income Dynamics; British Household Panel Survey.
I repeat this comparison using another longitudinal survey that similarly records intentions. In 1992, the British Household Panel Survey (BHPS) asked “do you think you will have any (more) children?” and, if so, “how many (more) children do you think you will have?” [University of Essex Institute for Social and Economic Research (2010)]. The BHPS then recorded actual fertility over the following years. Panel e of Figure 2 presents the actual, categorical, and binary distributions for childless women observed by the BHPS in 1992. The binary distribution better predicts the shares of women who will actually have one child and two children, but the categorical distribution better predicts the share of women who will actually have three children. As given in Table 6, these two predicted distributions have similar overlap with actual fertility, but the binary distribution better predicts average actual number of children born. At higher parities, the two distributions perform about equally well.
Infecundity, inability to find a suitable partner, or other circumstances may stop a person who intends to have a child from actually doing so. Conversely, circumstances such as inadequate contraception may lead a person to have an unintended child. For these reasons, the share of people who intend to have a child can differ from the share that actually do so, even if intentions are formed using rational expectations. However, this paper finds that the number of children that women observed by the PSID actually have is substantially lower on average than the number they intended to have. Errors in the formation of these categorical intentions could explain this difference, as could the realization of future events. For example, the experience raising a first child may affect a parent's desire for a second child. This learning process may explain the relatively poor prediction of categorical intentions among childless women.
A more fundamental problem with using stated intentions to predict fertility lies in how intentions questions are commonly posed. Intentions questions customarily require the respondent to convert her judgment about the probability of future behavior into a binary or categorical response. Even when respondents have rational expectations, conversion from probabilities into intentions can have counterintuitive consequences. For example, more than half of people in simulated population 2 in Table 5 have at least one child, but the most common family size is zero. It is rational for a person in this population at the start of her childbearing career to state that she both intends to have a child in the future and intends to have zero children. Surveys such as the PSID do not allow for this combination of intentions, instead restricting intended number of children to be greater than or equal to one for respondents who intend to have a child.
Asking a respondent to report the probability with which she will engage in a behavior more precisely measures her prediction of the future than does a binary or categorical intention [Juster (Reference Juster1966), Manski (Reference Manski1990)]. Increasing numbers of surveys elicit these probabilities [Manski (Reference Manski2004)]. For example, the 1997 National Longitudinal Survey of Youth asked respondents aged 16–22 in 2001 to report their probabilities of having zero, one, two, and three or more children in the future [Bureau of Labor Statistics (2015)]. Among 456 women who reported a complete predicted probability distribution, summing these distributions yields an aggregate predicted distribution: 30% of women are predicted to have zero additional children, 24% to have one child, 33% to have two children, and 13% to have three or more children. By 2013, 34% of these women had given birth to zero additional children, 27% had one child, 25% had two children, and 13% had three or more children. The overlap of 0.92 between the two distributions demonstrates the accuracy of using reported probabilities to predict actual fertility. However, an additional 399 women, or 47% of the sample, reported probabilities that do not sum to one. Many survey respondents struggle to provide complete probability distributions.
The main finding in this paper—that binary intentions can predict average fertility better than categorical intentions—suggests an alternative way for surveys to elicit a probability distribution: because children are accumulated sequentially, a predicted probability distribution can be formed from a sequence of binary probabilities. For example, a survey would ask a respondent to report the probability that she will have a child. Regardless of her response, the survey would then ask the respondent to report the probability of having a second child, in the hypothetical case that she did in fact have a first child. Additional questions would elicit hypothetical probabilities upon reaching larger family sizes. This sequence of reported probabilities could then be used to calculate the person's predicted probability distribution. For example, the probability of having only one child equals the probability of having a first child multiplied by one minus the probability of having a second child conditional on having the first child. Such a sequence of conditional probability questions may provide a more complete prediction if respondents struggle to calculate their entire probability distribution all at once.
The comparison between binary and categorical fertility intentions in this paper requires several decades' worth of longitudinal survey data. Determining the most accurate way to ask probability questions about lifetime fertility will similarly require a longitudinal survey to ask probability questions multiple ways. However, a person's number of children is a quantity that is reached sequentially but is reversible: particularly in areas with poor health, child mortality may distort prediction of total number of children. Sequential quantities that are accumulated more rapidly and are nonreversible, such as months spent trying to conceive a child or weeks spent searching for a job, may allow for a quicker, more accurate test of relationship between predicted probabilities and observed behavior.
Financial support
This study was supported by the NICHD (T32 HD007339) as part of the University of Michigan Population Studies Center training program. The author gratefully acknowledges the use of the services and facilities of the Population Studies Center (funded by NICHD Center Grant R24 HD041028). The collection of data used in this study was partly supported by the National Institutes of Health under grant number R01 HD069609 and the National Science Foundation under award number 1157698. The opinions and conclusions expressed herein are solely the author's and do not represent the opinions or policy of these funders or any agency of the federal government.
Conflict of interest
The author declares that he has no conflict of interest.
Appendix A: A best-case analysis of binary (yes/no) intentions
In this section, I summarize Manski's (Reference Manski1990) framework for using surveyed intentions to estimate bounds on behavior, under the assumption that people have rational expectations. This framework applies to any question in which a survey respondent is asked to state whether she intends, expects, or anticipates engaging in a behavior. These directives may suggest different degrees of certainty a person needs to believe in order to provide a response of “yes.” Because I use survey data that record fertility intentions, I refer to intentions throughout this document.
Suppose a person is asked whether she intends to have a child in the future. Information, s, available to her at time of survey and information, z, that is realized later determine whether she has a child, y(s, z) = 1, or does not have a child, y(s, z) = 0. P(y|s) is the objective probability distribution of y conditional on s, P z|s is the objective probability distribution of z conditional on s, and P(y = 1|s) = P z[y(s, z) = 1|s]. The respondent does not know what the realization of z will be. However, under the assumption of rational expectations, she knows the distribution of z conditional on s, P z|s, and she knows the function, y(s, ⋅ ), that maps between every possible realization of z and her future childbearing. Whether she states that she intends to have a child, i = 1, or does not intend to have a child, i = 0, depends on the relative losses she associates with the two possible prediction errors (i = 0, y = 1; and i = 1, y = 0). If the probability of having a child is above some threshold, π, she states that she intends to have a child:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn1.png?pub-status=live)
A researcher may wish to predict whether the respondent will have a child. The researcher has access to some subset, x, of the full information, s, available to the respondent, and the researcher knows the respondent's stated intention, i. P s|xi is the probability distribution of s conditional on x and i. The probability that the respondent has a child, given her observable x and i, is $P( {y = 1\vert x, \;i} ) = \mathop \int \nolimits^ P( {y = 1\vert s} ) dP_s\vert xi$. Therefore, assuming that π is the same for everyone and known to the researcher,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn2.png?pub-status=live)
In order for inequality (A.2) to be defined, the probability of having a child must be higher among people who intend to have a child than among people who do not intend to have a child.
The probability of having a child is defined as:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn3.png?pub-status=live)
Inequality (A.2) and equation (A.3) can be combined to provide bounds on P(y = 1|x). Because probabilities are non-negative, P(y = 1|x, i = 0) ≥ 0. From inequality (A.2), π ≤ P(y = 1|x, i = 1). Therefore, P(y = 1|x) ≥ P(y = 1|x, i = 1)P(i = 1|x) ≥ πP(i = 1|x). Similarly, because probabilities cannot exceed one, P(y = 1|x, i = 1) ≤ 1. From inequality (A.2), P(y = 1|x, i = 0) ≤ π. Therefore, P(y = 1|x) ≤ πP(i = 0|x) + P(i = 1|x). Together, these two inequalities yield bounds on P(y = 1|x):
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn4.png?pub-status=live)
Stated intentions are consistent with rational expectations if the probability of having a child lies within this range.
Bounds (A.2) and (A.4) can be tested using survey data, under the assumption of random sampling. Testing the bounds further requires that future events, z, are not subject to aggregate shocks. For example, assume that intention to have a child is based entirely on ability to have a child, and women face a 10% chance of infecundity. If this risk of infecundity is independent across women, then P(y = 1|s) = P z[y(s, z) = 1|s] = 0.9. However, if infecundity instead affects or does not affect all women, then P(y = 1|s) equals 0.9 but P z[y(s, z) = 1|s] ∈ {0, 1}. The best-case analysis in this paper therefore evaluates whether stated intentions are consistent with rational expectations, assuming that each respondent states her best prediction, π is the same for everyone and known to the researcher, and there are no aggregate shocks.
I test bounds (A.2) and (A.4) using PSID data. In 1985, the Panel Study of Income Dynamics (PSID) asked, “Looking to the future, do you intend to have a baby at some time?” [Panel Study of Income Dynamics (2017)]. The PSID permitted three responses: “Yes,” “No,” and “Maybe; Don't Know.” I omit all respondents who chose the third option (in Appendix B, I extend Manski's framework to permit this uncertain option). The PSID then recorded all births over the subsequent decades.
Table A.1 presents estimates for women aged 18–44 when surveyed in 1985 who continued to be observed until at least age 40. I group women by the total number of children to whom they had given birth when surveyed in 1985, x. As given in column 1, among the 401 women with zero children,Footnote 4 66% stated that they intend to have a child in the future, and 34% stated that they do not intend to have a child. The share of women who subsequently had a child is 76% among women who stated that they intend to do so, and 16% among women who stated that they did not intend to do so. These shares provide bounds on the probability threshold, π, above which a woman states that she intends to have a child. That these bounds (and many of the bounds in the rest of Appendices A, B, and C) are wide indicates that, given observed intentions, an assumption that these intentions were formed using rational assumptions is not especially restrictive. Intentions formed using rational expectations can be consistent with a variety of actual fertility outcomes. Finally, the overall share of women who had a child after 1985, 56%, falls within the estimated bounds of 33% and 83%, calculated assuming a symmetric loss function of π = 0.5.
The remaining columns of Table A.1 similarly estimate bounds (A.2) and (A.4) among women with larger families in 1985. For women with zero, one, or two children, bounds (A.2) include π = 0.5. Assuming π = 0.5, bounds (A.4) all include the observed share of women who had a child after 1985. The assumption of a symmetric loss function is not required to reach this conclusion. Alternative assumed thresholds, such as π = 0.25, also satisfy bounds (A.2) and (A.4). Across 1,000 samples drawn with replacement, these findings generally hold: bounds (A.2) are defined for women in at least 95% of samples, and bounds (A.4) are satisfied at all parities in all samples. Even though some women have a child after stating they do not intend to do so, and vice versa, these estimates indicate that, under the best-case hypothesis in which the threshold is the same for all women and known to the researcher, stated intentions are generally consistent with rational expectations.
Appendix B: A best-case analysis of yes/no/uncertain intentions
In this section, I extend Manski's framework to estimate best-case probability bounds on behavior using intentions questions that permit three responses: yes, no, and uncertain. In the binary case, there is some threshold, π, on the probability of having a child, below which the respondent states an intention of no and above which the respondent states an intention of yes. Allowing for an uncertain response requires two thresholds on the probability of having a child: below the lower threshold, α, the respondent states an intention of no, i = 0; above the upper threshold, β, the respondent states intention of yes, i = 1; and between the two thresholds the respondent states an intention of uncertainty, i = 9:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn5.png?pub-status=live)
Therefore, the probability of having a child should be highest among people who state that they intend to have child, lowest among people who state that they do not intend to have a child, and between the two for people who express uncertainty:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn6.png?pub-status=live)
Again, the condition under which these bounds on α and β are defined is not especially strict, and just requires that the probability of having a child increases with stated intention to do so.
By definition,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn7.png?pub-status=live)
Inequality (B.2) and equation (B.3) can be combined to provide bounds on P(y = 1|x). Because probabilities are non-negative, P(y = 1|x, i = 0) ≥ 0. By inequality (B.2), P(y = 1|x, i = 0) ≥ α and P(y = 1|x, i = 9) ≥ β. Therefore, P(y = 1|x) ≥ αP(i = 9|x) + βP(i = 1|x). Similarly, because probabilities cannot exceed one, P(y = 1|x, i = 1) ≤ 1. By inequality (B.2), P(y = 1|x, i = 0) ≤ α and P(y = 1|x, i = 9) ≤ β. Therefore, P(y = 1|x) ≤ αP(i = 0|x) + βP(i = 9|x) + P(i = 1|x). Together, these two inequalities yield bounds on P(y = 1|x):
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn8.png?pub-status=live)
I test bounds (B.2) and (B.4) using PSID data. Table B.1 presents estimates of these bounds using the same sample and intention question as in Appendix A, but now including respondents whose stated intention is “Maybe; Don't Know” (i = 9). As given in column 1, among the 465 women with zero children in 1985, 29% stated that they do not intend to have a child in the future, 14% expressed uncertainty, and the rest stated that they intend to have a child. The share of women who subsequently had a child is 16% among women who stated that they did not intend to do so, 34% among women who expressed uncertainty, and 76% among women who stated that they intend to have a child. These shares provide bounds on the probability thresholds α and β. Finally, the overall share of women who had a child after 1985, 53%, falls within the estimated bounds of 34% and 75%, calculated assuming that α = 0.3 and β = 0.5.
The remaining columns of Table B.1 similarly estimate bounds (B.2) and (B.4) among women with larger families in 1985. All bounds on α are defined, as are bounds on β in all cases except for women with three children.Footnote 5 Assuming that α = 0.3 and β = 0.5, bounds (B.4) all contain the observed shares of women who had a child after 1985. These assumed thresholds imply that a woman provides an uncertain fertility intention when her predicted probability of having a child is between 30% and 50%. Unlike the assumption of π = 0.5 in Appendix A, this loss function is not symmetric and suggests a greater willingness to fail to have a child after stating an intention of yes, compared to having a child after stating an intention of no. However, as in Appendix A, alternative assumed thresholds would yield the same conclusion.
These estimates suggest that, even though stated intentions sometimes disagree with subsequent fertility, intentions are generally consistent with rational expectations. Across 1,000 samples drawn with replacement, these findings generally hold. Bounds (B.2) on α are defined in at least 93% of the samples at all parities. Bounds (B.2) on β are defined in at least 73% of the samples for all women except for those with three children. Bounds (B.4) contain the observed share of women who have a child for at least 99% of the samples at all parities.
Appendix C: A best-case analysis of categorical intentions
In this section, I extend Manski's framework to estimate best-case probability bounds on behavior using intentions questions that permit a single choice among multiple unordered options. The three possible intentions in Appendix B (no, uncertain, and yes) are ordered in the probability of the outcome. With more thresholds, the approach in Appendix B could be modified to permit additional ordered responses (such as no, unlikely, even chance, likely, and yes). When the response options are unordered in probability, it is no longer possible to identify thresholds, and another criterion must be used. I assume that respondents select the modal category. That is, when a person who can have at most m children is asked to state how many children she intends to have, she reports the number that she judges to be most likely:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn9.png?pub-status=live)
I assume that there is a single mode. Therefore, among respondents who state an intention to have q children, the probability of having q children should be greater than the probability of having any other number of children:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn10.png?pub-status=live)
By definition,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn11.png?pub-status=live)
Inequality (C.2) and equation (C.3) can be combined to provide bounds on P(y = q|x). Equation (C.3) can be rewritten as P(y = q|x) ≡ P(y = q|x, i = q)P(i = q|x) + P(y = q|x, i ≠ q)P(i ≠ q|x). Because probabilities are non-negative, P(y = q|x, i ≠ q) ≥ 0. By assumption, if a respondent states that she intends to have q children, then she views this as the most likely outcome. The probability of this outcome, P(y = q|x, i = q), should therefore be at least 1/(m + 1). Therefore:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqnU1.png?pub-status=live)
Similarly, because probabilities cannot exceed one, P(y = q|x, i = q) ≤ 1. By assumption, if a respondent states that she intends to have r ≠ q children, then she does not view q as the most likely outcome. The probability of having q children should be at most one-half: P(y = q|x, i ≠ q) ≤ 1/2. Therefore,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqnU2.png?pub-status=live)
Together, these two inequalities yield bounds on P(y = q|x):
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_eqn12.png?pub-status=live)
I test bounds (C.2) and (C.4) using PSID data. Among respondents who state that they intend to have a child in the future, the PSID further asked “How many do you intend to have?” For respondents who state that they do not intend to have a child in the future, I assign an intended number of children equal to zero. Therefore, as in Appendix A, the sample excludes women who are uncertain about whether they intend to have a child in the future. Table C.1 presents estimates for the 394 women who currently have x = 0 children. Each column of this table represents a number of intended children, q. As given in the first row, the most commonly intended numbers of children are zero and two. Few women intend to have or end up having more than four children, so I top-code the number of children at four.
The third through seventh rows of Table C.1 present statistics used to evaluate bounds (C.2). Among the 136 women who state that they intend to have zero children, 84% in fact do not have a child. This share is greater than the share of women who have any other number of children, satisfying bounds (C.2). Similarly, among women who state that they intend to have two children, the most common outcome is two children. However, among women who intend to have one, three, or four or more children, this intention exceeds the modal outcome by one child, violating bounds (C.2).
Among women who state that they intend to have zero children, 44% do not have a child. This share falls within the estimated bounds (C.4) of 7% and 67%. Similarly, for women with all other intended numbers of children, bounds (C.4) contain the observed share who end up with that number of children. Across 1,000 samples drawn with replacement, these findings generally hold. Bounds (C.2) are satisfied for women who intend to have zero or two children in at least 99% of samples. Bounds (C.4) are satisfied at all intended family sizes below 4, and in 79% of samples for women who intend to have four or more children.
Therefore, among women who state that they intend to have zero or two children, these intentions are consistent with the assumption of rational expectations. Among women who report intentions for another family size, bounds (C.2) are violated. This violation may indicate error in the formation of intentions, but could also be a consequence of some women using an alternative criterion rather than selecting the modal category. For example, a woman may feel social pressure to avoid stating intention to have a particular number of children, leading her to select a number other than her predicted mode.
Manski's framework for testing whether intentions are consistent with rational expectations could be further extended to consider alternative ways of eliciting intended number of children, such as allowing a respondent to report a minimum and maximum intended number of children. By forcing these two values to be equal, the PSID prevents a respondent from reporting a range of number of children that she is equally likely to have. The framework could also be applied to other fertility predictions, such as expected time to next birth. For example, annually between 1969 and 1972, the PSID asked respondents to report whether they expect to have a child in the future. Respondents who expect to have a child were then asked to indicate whether they expect the child to be born within the next year.
Table A.1. Consistency between yes/no fertility intentions and actual fertility
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab7.png?pub-status=live)
Table B.1. Consistency between yes/no/uncertain fertility intentions and actual fertility
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab8.png?pub-status=live)
Table C.1. Consistency between categorical fertility intentions and actual fertility
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220808031401589-0700:S2054089220000322:S2054089220000322_tab9.png?pub-status=live)