Hostname: page-component-745bb68f8f-f46jp Total loading time: 0 Render date: 2025-02-11T21:59:32.826Z Has data issue: false hasContentIssue false

Mortality decline, productivity increase, and positive feedback between schooling and retirement choices

Published online by Cambridge University Press:  03 March 2021

Zhipeng Cai
Affiliation:
Equity Research, Institutional Securities, Morgan Stanley Asia Limited, Hong Kong
Sau-Him Paul Lau*
Affiliation:
Faculty of Business and Economics, University of Hong Kong, Hong Kong
C. Y. Kelvin Yuen
Affiliation:
Department of Economics, Washington University in St. Louis, One Brookings Drive, St. Louis, MO63130, USA
*
*Corresponding author. E-mail: laushp@hku.hk

Abstract

The twentieth century has seen a phenomenal decline in mortality and an increase in productivity level. These two important events likely affect people's choices of schooling years and retirement age. We first show that in a standard life-cycle model, positive feedback exists between optimal schooling years and retirement age choices. We then evaluate the impact of a mortality or productivity shock on an endogenous variable (schooling years or retirement age) by decomposing it into the direct and indirect effects, where the indirect effect arises from feedback from the other endogenous variable. Finally, we extend the model by including the utility benefit of schooling and show that a negative correlation of schooling years and retirement age is possible. Apart from clarifying the apparently similar concepts of positive co-movement and positive feedback, our results have implications relevant to the economic demography literature.

Type
Research Paper
Copyright
Copyright © Université catholique de Louvain 2021

1. Introduction

The twentieth century has seen a phenomenal decline in mortality and an increase in productivity level. Based on the US data, the (projected) life expectancy at birth for an average person born in 2000 was 80.9 years, 26 years longer than those born a century ago.Footnote 1 Likewise, using real GDP per capita as rough estimates, the productivity level in the USA increased almost seven times from 1900 to 2000.Footnote 2 A similar magnitude of improvement in life expectancy and productivity is also observed in other developed economies. These two changes have led to a much higher level of expected lifetime wealth for younger generations.

Huge demographic and productivity changes, through the effect of expected lifetime wealth, are likely to influence economic decisions, chief among which are retirement and schooling choices. Panel (a) of Figure 1 shows the average retirement age by education group for different cohorts based on the US Census data.Footnote 3 It shows a positive relationship between schooling and retirement age for each cohort. For the 1950 cohort, those with more than college education on average retire 7 years later than those with less than high schooling education. The cross-cohort dynamics of average retirement age and average schooling years is seen in panel (b) of Figure 1, which shows the relationship between these two variables across different cohorts. From the 1930 to the 1950 birth cohorts, individuals become more educated and retire later. Compared to the 1930 cohort, the 1950 cohort has 2 more years of schooling on average, and retires 1.9 years later. On the other hand, from the 1920 to the 1930 cohorts, individuals become more educated but retire earlier. The patterns of cross-sectional data of schooling years and retirement age for different cohorts are consistent with those presented in other papers, such as Hazan (Reference Hazan2009, Figure 1) and Maestas and Zissimopoulos (Reference Maestas and Zissimopoulos2010, Figure 4).

Figure 1. Schooling and retirement age across different cohorts.

The impact of mortality decline and/or productivity increase on schooling years and/or retirement age has been widely studied in the literature. Bloom et al. (Reference Bloom, Canning and Moore2014) consider a life-cycle model with endogenous retirement age, as in Bloom et al. (Reference Bloom, Canning, Mansfield and Moore2007) and Kalemli-Ozcan and Weil (Reference Kalemli-Ozcan and Weil2010). They find that the optimal retirement age is delayed because of mortality decline, but is reduced by productivity increase. Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013) endogenize schooling duration, as in Heijdra and Romp (Reference Heijdra and Romp2009) and Cervellati and Sunde (Reference Cervellati and Sunde2013). They find that the optimal schooling duration rises over time because of either mortality decline or productivity increase. On the other hand, Boucekkine et al. (Reference Boucekkine, de la Croix and Licandro2002), Echevarria and Iza (Reference Echevarria and Iza2006), Sheshinski (Reference Sheshinski2009), and Sánchez-Romero et al. (Reference Sánchez-Romero, d'Albis and Prskawetz2016) consider both schooling and retirement choices, but they focus only on the effects of mortality changes and not those of productivity increase. Other researchers—such as Hazan (Reference Hazan2009), D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012), and Hansen and Lønstrup (Reference Hansen and Lønstrup2012)—also study similar topics.

One feature we observe from the above-mentioned papers is that while the core issues studied in these papers are similar, the results are quite diverse. For example, a mortality decline leads to a rise in retirement age in Bloom et al. (Reference Bloom, Canning and Moore2014) but may lead to a fall in retirement age in Kalemli-Ozcan and Weil (Reference Kalemli-Ozcan and Weil2010). Moreover, the assumptions made by the researchers are sometimes significantly different, making it difficult to compare the underlying reasons of the different results.

In this paper, we study the effects of mortality decline and productivity increase on optimal schooling years and retirement age. To understand the various results in the literature, and to reconcile them as far as possible, we mainly perform theoretical analysis on a life-cycle model based on factors most commonly used in the above papers.

We first examine a life-cycle model focusing only on the productivity-enhancing role of schooling. Starting with a careful analysis of the effect of mortality decline or productivity increase on schooling years or retirement age, we find it useful to understand the interaction of the direct effect (caused by the exogenous shock) and the indirect effect (caused by feedback from the other endogenous variable). A common feature determining the impact of either an exogenous mortality or productivity shock is the positive feedback between optimal schooling years and optimal retirement age, and we trace it to the underlying economic factors captured by the model. We then combine the positive feedback with the other key factor (the signs of the direct effects caused by the exogenous mortality or productivity shock), and provide further analysis. In particular, we show that when both direct effects are of the same sign, or when one direct effect is zero while the other non-zero, then a positive co-movement exists between schooling years and retirement age.

In conducting the analysis, we extend Ben-Porath's (Reference Ben-Porath1967) result to an environment in which both schooling years and retirement age are choice variables. Moreover, we show that a negative direct effect of a mortality decline on retirement age is a necessary (but not sufficient) condition for a negative total effect of a mortality decline on retirement age. This result implies that the lifetime human wealth channel [D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012)] is less likely to explain the decreasing retirement age trend if the schooling duration also responds to the mortality decline. These results are particularly relevant to the economic demography literature.

We obtain the above results in a baseline model focusing purely on the productivity-enhancing role of schooling (i.e., the Ben-Porath mechanism). There are two advantages in using this model: (a) the analysis, while rather tedious, is still manageable; and (b) the intuition of the results is very transparent. However, we observe a major disadvantage when we match the predictions of the model with the data. Even if we allow for various combinations of mortality and productivity shocks, the computational analysis suggests that the model cannot account for the negative correlation of schooling years and retirement age for the earlier cohorts of the twentieth century. These results are robust to various specifications and parameter values.

One direction to deal with this issue is to improve along the computational dimension. Including the social security system [as in Gruber and Wise (Reference Gruber, Wise, Gruber and Wise1999)] may also be helpful to explain the negative correlation of schooling years and retirement age.Footnote 4 While we believe it is valuable to pursue further computational analysis, we think that such an analysis does not fit well with the approach of this paper, which is mainly theoretical. Instead, we conduct further theoretical analysis by introducing an extra factor: the direct utility benefit of schooling, as in Bils and Klenow (Reference Bils and Klenow2000) and Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013). Using the framework of decomposition between the exogenous shocks and the endogenous feedback, we show that the correlation of schooling years and retirement age may be positive or negative in the extended model. In particular, we show that the extended model is able to explain the negative correlation of schooling years and retirement age, provided that the flow utility of schooling is in some intermediate range. Combining this with the results based on the simpler model, we conclude that when positive feedback is present, the co-movement of schooling years and retirement age is usually positive but may occasionally be negative. Positive feedback and positive co-movement are related but different concepts.

The paper is organized as follows. In section 2, we introduce a life-cycle model in which the sole benefit of schooling is its productivity-enhancing effect. In section 3, we provide analytical results regarding the impact of mortality and productivity shocks on schooling years and retirement age. We conduct computational analysis in section 4. Section 5 extends the model to incorporate the direct utility benefit of schooling so as to achieve a better match between the predictions of the extended model and the data. Section 6 concludes the study.

2. A life-cycle model with schooling and retirement choices

We consider a continuous-time life-cycle model with endogenous schooling years and retirement age. As in many papers in the literature, especially Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013) and Bloom et al. (Reference Bloom, Canning and Moore2014), mortality decline and productivity increase are taken as exogenous, and we only investigate the effects, not the causes, of these changes. Thus, we abstract from any health-enhancing expenditure [as in Chakraborty (Reference Chakraborty2004)] or any feedback of human capital accumulation on economic growth [as in Bils and Klenow (Reference Bils and Klenow2000)].

As in many existing papers, we ignore changes in infant and child mortality in our model. Assume that individuals in the model begin to make economic decisions at age N. Define “adult age” as the age measuring from age N. Lifetime uncertainty is present in the economic environment that we study. An individual of cohort b faces an age-specific mortality rate function μ(x; θ b), where x is her (adult) age and θ b is an index of the health level of this cohort.Footnote 5 The age-specific mortality rate function satisfies μ(x; θ b) ≥ 0 and lim xTμ(x; θ b) = ∞, where T is the maximum age in the model. Equivalently, lifetime uncertainty can be represented by the survival function

(1)$$l( x; \;\theta _b) = \exp \left[{-\int_0^x \mu ( t; \;\theta_b) {\rm d}t} \right], \;$$

which is the probability that a cohort b individual lives for at least x years. In the analysis performed in the subsequent sections, the survival functions l(x; θ b) of different cohorts shift over time to reflect mortality changes.

Individuals in the model make three kinds of decisions: the consumption path, studying vs. entering the labor market, and working vs. retirement. To maintain tractability, we follow many existing papers by focusing on the extensive margin of schooling and working decisions. Specifically, we assume that schooling is a discrete choice of either full-time study or no study and that working is a discrete choice of either full-time work or no work. We also assume that the decisions to work after quitting schooling and to retire after quitting work are irreversible. The assumption of not going back to the job market after quitting work in the past is consistent with the evidence in Costa (Reference Costa1998, p. 6) that retirement behavior in most cases is “a complete and permanent withdrawal from paid labor.” In this environment, an individual spends the first S years of her life in school, joins the labor market immediately after graduation, and retires at age R, where S and R are determined as follows.

As mentioned in the introduction, we first consider a model in which the sole benefit of schooling is its role to enhance the individual's productivity level. In the preference side, a cohort b individual values consumption and dislikes the reduction of leisure time when studying or working. She chooses the consumption path, S and R, to maximize her expected lifetime utility, which is given by

(2)$$\int_0^T {\exp } ( {-\rho x} ) l( {x; \;\theta_b} ) \displaystyle{{c{( x ) }^{1-{1 \over \sigma }}-1} \over {1-\displaystyle{1 \over \sigma }}}{\rm d}x-\int_0^R {\exp } ( {-\rho x} ) l( {x; \;\theta_b} ) \nu ( {x; \;\theta_b} ) {\rm d}x, \;$$

where ρ is the subjective discount rate, σ is the coefficient of intertemporal elasticity of substitution, c(x) is the level of consumption at age x, and ν(x; θ b) is the disutility of labor or study of a cohort b individual at age x, capturing her dissatisfaction of leisure reduction from work or study (with the normalization of zero utility corresponding to full leisure time during retirement).Footnote 6 We assume that ν(x; θ b) is non-decreasing in age and may shift down over time to reflect the “compression of morbidity” effect [Fries (Reference Fries1980); Bloom et al. (Reference Bloom, Canning, Mansfield and Moore2007)] of exogenous health improvement.

The flow budget constraint is as follows:

(3)$${a}^{\prime}( x) = \left\{{\matrix{ {[ {r + \mu ( x; \;\theta_b) } ] a( x) + \phi_bh( S) -c( x) } \cr {[ {r + \mu ( x; \;\theta_b) } ] a( x) -c( x) } \cr } } \right.\matrix{ {{\rm if\;}S < x \le R} \cr {{\rm if\;}x \le S{\rm \;or\;}x > R} \cr } , \;$$

where r is the real interest rate, a(x) is the level of financial asset at age x, h(S) is the human capital level of the individual, ϕ b is the index of productivity level of a cohort b individual, and the boundary conditions are

(4)$$a( 0) = 0, \;\quad a( T) \ge 0.$$

Under the above specification, the agent has no bequest motive, and a perfect annuity market exists to fully insure against mortality risk, similar to Yaari (Reference Yaari1965). Therefore, at each age x, the agent can lend or borrow in a perfect financial market with an effective (instantaneous) rate of return r + μ(t; θ b).

According to the budget constraint (3), when an individual works (after studying for S years), her wage rate is given by ϕ bh(S). One may think of this specification as consisting of three components, depending on the following: (a) the compensation to raw labor, which is normalized to be 1; (b) one's level of human capital h(S), which is a function of schooling duration;Footnote 7 and (c) an index ϕ b capturing the changing level of productivity of different cohorts, with a person from a more recent cohort benefiting from a higher value of ϕ b. As in Hazan (Reference Hazan2009) and Cervellati and Sunde (Reference Cervellati and Sunde2013), we assume that the return to schooling, $h^{^{\prime} }( S) /h( S)$, is positive but non-increasing in S.

We assume that no social security system exists in this model, as in Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013) and Bloom et al. (Reference Bloom, Canning and Moore2014). Therefore, the marginal benefit of delaying retirement age is the marginal utility of the extra labor income generated. On the other hand, the marginal cost is the disutility of labor.

We consider a model as similar as possible to those in the literature, especially to Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013) and Bloom et al. (Reference Bloom, Canning and Moore2014).Footnote 8 However, when these two models differ, we choose the assumptions with justification and as standard as possible. We highlight several major features of our model. In terms of labor-leisure choice, we focus on the extensive margin, instead of the intensive margin, in examining retirement age. Thus, we follow Bloom et al. (Reference Bloom, Canning and Moore2014) in specifying the disutility of labor function based on the discrete choice of labor instead of a utility function consisting of a continuous choice of leisure at any point in time. For the schooling duration choice, we follow Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013) to assume that the return to schooling is a decreasing function of schooling years. However, unlike their paper and that of Bils and Klenow (Reference Bils and Klenow2000), we mainly conduct our analysis (in sections 2–4) without relying on a term reflecting the direct utility benefit of schooling.Footnote 9 We believe that the interaction between schooling and retirement choices is most clearly illustrated in a model based only on the productivity-enhancing role of schooling without the direct utility benefit of schooling. In terms of the utility function of consumption, Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013) use a specification with a log utility function with a subsistence level, but Bloom et al. (Reference Bloom, Canning and Moore2014) assume a constant intertemporal elasticity of substitution (CIES) form. We choose the more general CIES specification. We find that our results (such as Propositions 2 and 3) depend on the value of the intertemporal elasticity of substitution (σ), which determines the relative importance of the income and substitution effects, but these two effects will cancel out when σ = 1 (the log case). Apart from these three key differences, a difference in the survival function is also assumed. We follow Bloom et al. (Reference Bloom, Canning and Moore2014) to use the more general non-rectangular survival function. This offers the advantage that the theoretical results hold more generally for different survival functions, and we can also use a realistic survival function in the quantitative analysis.

Since our focus is the impact of mortality decline and productivity increase, we only consider two cohort-specific shocks in the model: θ b and ϕ b. Individuals of different cohorts face different productivity levels (indexed by ϕ b). They also face different survival functions l(x; θ b), and different disutility of labor functions ν(x; θ b), with both functions indexed by θ b. In the remainder of this section, we obtain various choices of a representative individual of a particular cohort, with given θ b and ϕ b (see Appendix A section for a detailed analysis). First, conditional on a particular length of the schooling period and retirement age, we obtain the optimal consumption path of a cohort b individual, defined as c(x, S, R; θ b, ϕ b). We show that the (conditional) optimal consumption path is characterized by

(5)$$c( x, \;S, \;R; \;\theta _b, \;\phi _b) = \exp [ {\sigma ( r-\rho ) x} ] \phi _bc^n( 0, \;S, \;R; \;\theta _b) , \;$$

where

(6)$$c^n( {0, \;S, \;R; \;\theta_b} ) = \displaystyle{{c( {0, \;S, \;R; \;\theta_b, \;\phi_b} ) } \over {\phi _b}} = \displaystyle{{h( S) \int_S^R {\exp } ( {-rx} ) l( x; \;\theta _b) {\rm d}x} \over {\int_0^T {\exp } \{ {-[ {( {1-\sigma } ) r + \sigma \rho } ] x} \} l( x; \;\theta _b) {\rm d}x}}$$

is the initial consumption level normalized by the productivity level. It is clear from (6) that this normalized level is independent of ϕ b.

Second, conditional on the optimal consumption path in (5), we obtain the first-order conditions for the optimal schooling years and retirement age.Footnote 10 Conditional on retirement age (R), the optimal schooling years function, $\tilde{S}( R)$, is defined implicitly according toFootnote 11

(7)$$\phi _bh^{\prime}( {\tilde{S}( R ) } ) \left[{\int_{\tilde{S}( R ) }^R {\exp } ( {-rx} ) l( x ) {\rm d}x } \right] = \phi _b\exp ( {-r\tilde{S}( R ) } ) l( {\tilde{S}( R ) } ) h( {\tilde{S}( R ) } ) .$$

The left-hand side of (7) is the marginal benefit of continuing to study, which is measured by the expected present discounted value of the increases in labor income throughout the individual's working years from age $\tilde{S}( R)$ to age R, because of a higher level of human capital. The right-hand side of (7) is the marginal cost (the expected present discounted value of foregone labor income) of postponing the entry into the labor market at age $\tilde{S}( R)$.Footnote 12

Similarly, conditional on the schooling years (S), the optimal retirement age function, $\tilde{R}( S)$, is defined implicitly according toFootnote 13

(8)$$( \phi _b) ^{1-{1 \over \sigma }}\exp ( {-}r\tilde{R}( S) ) h( S) [ {c^n( 0, \;S, \;\tilde{R}( S) ) } ] ^{-{1 \over \sigma }} = \exp (\!\! - \!\!\rho \tilde{R}( S) ) \nu ( \tilde{R}( S) ) .$$

The left-hand side of (8) is the marginal benefit of delaying the retirement age, and the right-hand side is the corresponding marginal cost. Conditional on a given value of schooling duration, the productivity index ϕ b affects the marginal benefit through two channels. First, an increase in ϕ b leads to an upward shift of the consumption path and the resulting decrease in the marginal utility of consumption. Individuals thus demand more leisure, and retire earlier. Second, a rise in ϕ b causes increases in labor income at all ages. This rise in the price of leisure causes people to demand less leisure by retiring later. These two effects correspond to the income and substitution effects of a change in productivity level on retirement age.Footnote 14

The optimal choices of schooling years and retirement age, S* and R*, are the choices of S and R such that S* and R* satisfy both (7) and (8) simultaneously.Footnote 15 In other words,

(9)$$R^\ast = \tilde{R}( S^\ast ) , \;$$

and

(10)$$S^\ast = \tilde{S}( R^\ast ) .$$

Note that the productivity level ϕ b directly affects condition (8) for the optimal retirement age, but not condition (7) for the optimal schooling years after the cancellation of the common term. However, as will be seen more clearly later, it does affect S* indirectly through R*.

3. Impact of a mortality or productivity shock

This paper examines the impact on optimal schooling years and retirement age of two exogenous changes: mortality decline and productivity increase. Both analytical and computational approaches are useful in a complementary way to understand this behavior. In this section, we derive the comparative static results analytically. We focus on the impact of one exogenous shock at a time, since sharper analytical results are easier to obtain in the absence of the other shock. In section 4, we will conduct computational analysis regarding the impact of both shocks simultaneously. The analytical results of this section turn out to be not only interesting on their own but also helpful in interpreting the computational results.

In Appendix A, we re-write the first-order conditions (7) and (8), evaluated at the optimal choices of S = S* and R = R*, as (A.10) and (A.11). Based on (A.12) and (A.13), we show that when only a mortality decline exists, its impacts on S* and R* are given by

(11)$$\displaystyle{{\partial S^\ast } \over {\partial \theta _b}} = \displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta _b) } \over {\partial \theta _b}} + \displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta _b) } \over {\partial R}}\displaystyle{{\partial R^\ast } \over {\partial \theta _b}}, \;$$

and

(12)$$\displaystyle{{\partial R^\ast } \over {\partial \theta _b}} = \displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) } \over {\partial \theta _b}} + \displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) } \over {\partial S}}\displaystyle{{\partial S^\ast } \over {\partial \theta _b}}, \;$$

where $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \theta _b$ and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b$ are given in (A.14) and (A.15), and $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R$ and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S$, representing the interaction between the two endogenous variables, are given by

(13)$$\displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta _b) } \over {\partial R}} = \displaystyle{{\displaystyle{{\exp ( {-}rR^\ast ) l( R^\ast ; \;\theta _b) } \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}}} \over {\displaystyle{{2{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }}-\mu ( S^\ast ; \;\theta _b) -r}}, \;$$

and

(14)$$\displaystyle{{\partial \tilde{R}( S^\ast ; \;\phi _b, \;\theta _b) } \over {\partial S}} = \displaystyle{{\displaystyle{{{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}} \over {r-\rho + \displaystyle{1 \over \sigma }\displaystyle{{\exp ( {-}rR^\ast ) l( R^\ast ; \;\theta _b) } \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}} + \displaystyle{1 \over {\nu ( R^\ast ; \;\theta _b) }}\displaystyle{{\partial \nu ( R^\ast ; \;\theta _b) } \over {\partial x}}}}.$$

According to (11) and (12), a mortality change affects a particular endogenous variable (S* or R*) both directly and indirectly. The underlying reasons of (11) can be traced back to (7) or (A.10). Since a mortality change affects the optimal choice of schooling years (S*) through the survival function l(.; θ b), the direct effect is captured by the term $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \theta _b$, evaluated at the original retirement age. Moreover, the retirement age (R*) appears in (7), and this term is affected by a mortality change and may affect schooling years; thus, the indirect effect is represented by the product of ∂R*/∂θ b and $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R$. The interpretation of (12) is similar to that in (11), except that the roles of schooling years and retirement age are interchanged.

Similarly, when only a productivity increase occurs, its impacts on S* and R* are given by

(15)$$\displaystyle{{\partial S^\ast } \over {\partial \phi _b}} = \displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta _b) } \over {\partial R}}\displaystyle{{\partial R^\ast } \over {\partial \phi _b}}, \;$$

and

(16)$$\displaystyle{{\partial R^\ast } \over {\partial \phi _b}} = \displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) } \over {\partial \phi _b}} + \displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) } \over {\partial S}}\displaystyle{{\partial S^\ast } \over {\partial \phi _b}}, \;$$

where the various terms on these two equations are given in (13), (14), and (A.16).

The interpretation of (15) and (16) is essentially the same as that of (11) and (12). The only exception is that the productivity level does not affect the optimal choice of schooling years, since it appears equally on both sides of (7) and can be cancelled out. As a result, only an indirect effect can be found in (15).

In either the system of (11) and (12) or that of (15) and (16), the total effect (the sum of the direct and indirect effects) of an exogenous shock on the two endogenous variables can be obtained by solving the two relevant equations simultaneously. Using ψ to represent either θ b or ϕ b, we can solve each of the above two systems as

(17)$$\displaystyle{{\partial S^\ast } \over {\partial \psi }} = M\left[{\displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta_b) } \over {\partial \psi }} + \left({\displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta_b) } \over {\partial R}}} \right)\displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta_b, \;\phi_b) } \over {\partial \psi }}} \right], \;$$

and

(18)$$\displaystyle{{\partial R^\ast } \over {\partial \psi }} = M\left[{\displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta_b, \;\phi_b) } \over {\partial \psi }} + \left({\displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta_b, \;\phi_b) } \over {\partial S}}} \right)\displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta_b) } \over {\partial \psi }}} \right], \;$$

where

(19)$$M = \left({\displaystyle{1 \over {1-\displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta_b) } \over {\partial R}}\displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta_b, \;\phi_b) } \over {\partial S}}}}} \right)> 0, \;$$

because of (A.9).Footnote 16

3.1 Positive feedback between the two endogenous variables

We observe from (17) and (18) the similarities as well as differences for the two systems: ψ = θ b or ψ = ϕ b. In sections 3.2 and 3.3, we will consider their differences by studying each of them individually.Footnote 17 Before that, we first focus on the common elements of these two systems of equations, which are given by the terms $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R$ in (13) and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S$ in (14). These common elements, which concern the interaction between optimal schooling years and retirement age, exhibit interesting properties, as given in the following proposition. In all the propositions in this paper, we assume that the second-order conditions (A.7) to (A.9) hold for a meaningful maximization problem. (We have verified that they are satisfied computationally in our analysis in section 4.)

Proposition 1. For the life-cycle model given by (1) to (4),

(a) $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R > 0$: anticipating that an exogenous shock will shift up (respectively down) the retirement age function, the agent changes the schooling years in the same direction; and

(b) $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S > 0$: a rise (respectively fall) in schooling years leads to a subsequent change of retirement age in the same direction.

Proof. See Appendix A.

The intuition of Proposition 1 is as follows. We observe from the first-order condition (7) that the retirement age affects the marginal benefit, but not the marginal cost, of increasing schooling years. When the retirement age rises (say, in response to an exogenous shock), it shifts up the marginal benefit schedule because of a longer duration to reap the benefit of education. With an unchanged marginal cost schedule, the increase in retirement age induces the optimal schooling years to move in the same direction, as given in Proposition 1(a).

According to the first-order condition (8), a change in schooling years has two effects on the marginal benefit of continuing working: a term related to human capital function and another related to the normalized consumption level. We show from (6) and (7) that at the optimal schooling years, the effect on normalized consumption level becomes zero.Footnote 18 As a result, only one effect is related to the rate of return of accumulating human capital, as given in (14). Since the rate of return is positive in the relevant region, agents with more schooling will retire later, as given in Proposition 1(b).

According to Proposition 1, $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R > 0$, and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S > 0.$ The changes in the two endogenous variables S* and R* (caused by a particular exogenous shock, e.g.) reinforce each other.Footnote 19 Positive feedback exists, and this contrasts with the other possibility of negative feedback in which the two derivatives are of opposite signs.Footnote 20

In the process of showing the presence of positive feedback in the above system, our analysis also contributes to the literature about the Ben-Porath mechanism [such as Ben-Porath (Reference Ben-Porath1967); and Hazan (Reference Hazan2009)]. We show in Proposition 1(a) that, in anticipating, for example, an increase in retirement age in the future because of an exogenous shock, the individual chooses longer schooling years to receive the higher benefit of human capital accumulation. Proposition 1(a) extends Ben-Porath's (Reference Ben-Porath1967) result, which is proved assuming the retirement age is fixed, to an environment in which both schooling years and retirement age are choice variables.Footnote 21

3.2 Combining endogenous feedback and exogenous productivity increase

We first examine how an increase in the productivity parameter (ϕ b), other things being equal, affects optimal schooling years and retirement age. The analysis is simpler in this case because productivity increase has no direct effect on schooling years $( \partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \phi _b = 0)$. The results are summarized in the following proposition:

Proposition 2. Consider the life-cycle model given by (1) to (4). If

(20)$$0 < \sigma < 1, \;$$

thenS*/∂ϕ b < 0, andR*/∂ϕ b < 0.

Proof. See Appendix A.

Proposition 2 states that an increase in the productivity parameter (ϕ b) causes schooling years and retirement age to decrease, leading to a positive co-movement of these two variables in this model. The intuition of Proposition 2 is as follows. As observed in (17) and (18), which are obtained from solving (11) and (12) simultaneously, the impact of an exogenous shock (ϕ b) on S* and R* can be expressed as the sum of two terms: the exogenous shock component ($\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \psi$ in (17) or $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \psi$ in (18)) and the endogenous feedback component ($( \partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R) \,( \partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \psi )$ in (17) or $( \partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S) \,( \partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \psi )$ in (18)). Proposition 1, which concerns the feedback (or interaction) term, shows that both $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R$ and $\partial \tilde{R}( S^\ast ; \;\phi _b, \;\theta _b) /\partial S$ are positive.Footnote 22 We call this feature the positive endogenous effect. Together with second-order condition (A.9), we see from (17) and (18), with ψ = ϕ b, that when $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \phi _b = 0$ (the direct effect of the productivity shock on schooling years is zero), each of the two total effects (∂S*/∂ϕ b or ∂R*/∂ϕ b) is positively related to $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \phi _b$ (the direct effect of the productivity shock on retirement age). When 0 < σ < 1 according to (20), the income effect dominates the substitution effect, leading to negative exogenous effect (i.e., $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \phi _b < 0$). Combining the negative exogenous effect and the positive endogenous effect leads to the negative total effect for the productivity shock in Proposition 2. Since the total effects for the productivity shock on S* and R* are negative, a positive co-movement of these two variables occurs after a productivity shock.

For the sake of completeness, we summarize in the following proposition the remaining two cases about the value of σ. The proof, which is only slightly different from that of Proposition 2, is omitted.Footnote 23

Proposition 3. (a) When σ = 1, an increase in productivity level has no effect on both S* and R*.

(b) When σ > 1, an increase in productivity level leads to increases in both S* and R*.

3.3 Combining endogenous feedback and exogenous mortality decline

We now study how a change in the mortality parameter (θ b), other things being equal, affects optimal schooling years and retirement age.

Solving (11) and (12) simultaneously gives (17) and (18), with ψ = θ b. As in section 3.2, it is helpful to decompose the total effect of a mortality decline on schooling years or retirement age into the exogenous (shock) and endogenous (feedback) components. According to Proposition 1, the effects caused by the feedback terms are positive. On the other hand, the exogenous components are given by the two direct effects caused by the mortality decline: $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \theta _b$ and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b$. We can easily see from (17) and (18) that the total effect on either schooling years or retirement age (∂S*/∂θ b or ∂R*/∂θ b) is a linear combination of these two direct effects.

We first examine these two direct effects separately, and then combine them to obtain the total effect.

3.3.1 The direct effect of a mortality decline on school years

Based on (A.14) as well as (A.7) and (A.18) in Appendix A, a positive value of $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \theta _b$ is equivalent to

(21)$$\displaystyle{{\int _{S^\ast }^{R^\ast } \exp ( {-rx} ) l( {x; \;\theta_b} ) \left[{\int_{S^\ast }^x \left({-\displaystyle{{\partial \mu ( {t; \;\theta_b} ) } \over {\partial \theta_b}}} \right){\rm d}t} \right]{\rm d}x} \over {\int _{S^\ast }^{R^\ast } \exp ( {-rx} ) l( {x; \;\theta_b} ) {\rm d}x}} > 0.$$

According to (7), a mortality decline affects optimal schooling years (S*) directly through a higher future income stream (by increasing the survival probabilities from S* to R*) in the marginal benefit schedule and a foregone current income (by increasing the survival probability at age S*) in the marginal cost schedule. The mortality rate at the current age only affects future survival probabilities but not past survival probabilities; see also (1) and (A.17). Since the survival probabilities of age R* and above do not appear in (7), the effect of a change in θ b on μ(.; θ b) for ages after R* is irrelevant. Moreover, the analysis in (A.18) shows that the effects of a change in θ b on μ(t; θ b) for t ≤ S* on the marginal benefit and marginal cost schedules exactly cancel out. Thus, the direct effect of a change in θ b on optimal schooling years depends only on its impact on μ(t; θ b) for t ∈ [S*, R*].Footnote 24 Equation (21) has a useful interpretation that the linear combination of the effects of a change in θ b on the survival probabilities from age S* to age R*, which only appear in the marginal benefit schedule, is positive.

A more intuitive interpretation can further be obtained in the special case that the mortality decline process causes decreases in the mortality rates of the working years from S* to R*. In this case,

(22)$$\displaystyle{{-\partial \mu ( {t; \;\theta_b} ) } \over {\partial \theta _b}} > 0, \;\forall t\in [ {S^\ast , \;R^\ast } ] .$$

Since (22) is a sufficient condition for (21), we can easily see that (21) is satisfied when a change in θ b decreases mortality rates during working years. The observed mortality decline in the twentieth century usually reduces mortality rates for most (but not necessarily all) ages. Thus, −∂μ(t; θ b)/∂θ b may be negative for some t, and (22) may not hold. However, the above arguments suggest that, based on the linear-combination interpretation described in the previous paragraph, (21) likely holds for many empirically relevant mortality decline processes.Footnote 25

3.3.2 The direct effect of a mortality decline on retirement age

We observe from (8) that a mortality decline affects the normalized consumption level on the marginal benefit schedule and the disutility of labor term on the marginal cost schedule. According to (A.19) in Appendix A, the effect of a mortality decline on the consumption level can be decomposed into two effects, which are called the lifetime human wealth effect and the years-to-consume effect, following (23) of D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012).Footnote 26 Combining (A.15) and (A.19), we show that the sign of $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b$ is the same as that of

(23)$$\eqalign{& \displaystyle{1 \over \sigma }\displaystyle{{\int _0^T \exp \{ {-[ {( {1-\sigma } ) r + \sigma \rho } ] x} \} l( {x; \;\theta_b} ) \left[{\int_0^x \left({-\displaystyle{{\partial \mu ( {t; \;\theta_b} ) } \over {\partial \theta_b}}} \right){\rm d}t} \right]{\rm d}x} \over {\int _0^T \exp \{ {-[ {( {1-\sigma } ) r + \sigma \rho } ] x} \} l( {x; \;\theta_b} ) {\rm d}x}} + \displaystyle{{\left({-\displaystyle{{\partial \nu ( {R^\ast ; \;\theta_b} ) } \over {\partial \theta_b}}} \right)} \over {\nu ( {R^\ast ; \;\theta_b} ) }} \cr & \cr & \quad -\displaystyle{1 \over \sigma }\displaystyle{{\int _{S^\ast }^{R^\ast } \exp ( {-rx} ) l( {x; \;\theta_b} ) \left[{\int_0^x \left({-\displaystyle{{\partial \mu ( {t; \;\theta_b} ) } \over {\partial \theta_b}}} \right){\rm d}t} \right]{\rm d}x} \over {\int _{S^\ast }^{R^\ast } \exp ( {-rx} ) l( {x; \;\theta_b} ) {\rm d}x}}.} $$

Summing up the above analysis, there are three components in the direct effect of a mortality decline on retirement age: the lifetime human wealth effect (by shifting down the marginal benefit schedule), the years-to-consume effect (by shifting up the marginal benefit schedule), and the compression of morbidity effect (by shifting down the marginal cost schedule). If the sum of the years-to-consume and compression of morbidity effects is at least as large as the lifetime human wealth effect, then (23) is non-negative, and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b \ge 0$. In the main model considered by D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012), in which only the lifetime human wealth and years-to-consume effects exist, they argue that when a mortality decline concentrates on old ages, the lifetime human wealth effect is absent, and the years-to-consume effect is present, resulting in a delay in retirement. On the other hand, if a mortality decline concentrates on younger ages, then the lifetime human wealth effect may dominate, leading to earlier retirement. Thus, a mortality decline that simultaneously affects mortality rates at different ages will generally have an ambiguous effect on retirement age. In the life-cycle model described in section 2, with schooling years being endogenously determined and with an additional compression of morbidity effect, the various effects examined in D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012) are also relevant, leading to the same conclusion that the direct effect of a general mortality decline process on retirement age is usually ambiguous.

3.3.3 The total effects on the endogenous variables

As shown in the above discussions, one direct effect of a mortality decline $( \partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \theta _b)$ is positive, and the sign of the other $( \partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b)$ is ambiguous. The possible different signs of the two direct effects of mortality decline lead to a more complicated (but also more interesting) analysis. Combining these two direct effects, we now obtain results regarding the sign of the total effect of a mortality decline on the two endogenous variables. The results are presented in the following two propositions.

Proposition 4. Consider the life-cycle model given by (1) to (4). Assume that the direct effect of a mortality decline on schooling years is positive. If a mortality decline has a positive direct effect on retirement age ( $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b > 0$), thenS*/∂θ b > 0, andR*/∂θ b > 0.

According to Proposition 4, if both of the direct effects are positive, then a mortality decline leads to a positive total effect on either S* or R*. The intuition can be seen from (17) and (18). Since the two feedback terms, $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R$ and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S$, are positive according to Proposition 1, we can easily conclude that when the two direct effects are positive, the total effect of a mortality decline on either S* or R* must be positive. As a result, a positive co-movement of schooling years and retirement age occurs.

The next proposition, which is complementary to Proposition 4, focuses on the conditions leading to a negative total effect of a mortality decline on retirement age (∂R*/∂θ b < 0).

Proposition 5. Consider the life-cycle model given by (1) to (4). Assume that the direct effect of a mortality decline on schooling years is positive. A necessary but not sufficient condition forR*/∂θ b < 0 is a negative direct effect of a mortality decline on retirement age $( \partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b < 0)$.

The intuition of Proposition 5 is as follows. In a life-cycle model incorporating both schooling and retirement choices, the effect of a mortality decline on retirement age is given by (18), with ψ = θ b. We observe that the two direct effects ($\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \theta _b$ and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b$) can be important in determining the sign of the total effect ∂R*/∂θ b. Since a mortality decline has a positive direct effect on schooling years ($\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \theta _b > 0$) when (21) holds, and longer schooling duration induces higher retirement age $( \partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S > 0)$ according to Proposition 1(b), the indirect effect (corresponding to the endogenous change in schooling years) of mortality decline is positive in this case. As a result, the direct effect $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b$ has to be strongly negative for the mortality decline to have an overall negative effect on retirement age.

The above results are relevant to the economic demography literature. In particular, some researchers [such as Kalemli-Ozcan and Weil (Reference Kalemli-Ozcan and Weil2010); D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012); and Bloom et al. (Reference Bloom, Canning and Moore2014)] examine whether the total effect of a mortality decline on retirement age is positive or negative. According to conventional wisdom in this literature, when people are expected to live longer, they tend to delay their retirement so as to earn more resources for their post-retirement days. Empirically, however, the average retirement age trend over time is more complicated than the monotonic increasing relationship predicted by conventional theory. As documented in, for example, Costa (Reference Costa1998, Figure 3.1), the labor force participation rates of US men aged 65 and over declined from over 60% in 1900 to around 20% at the 1990s. Interestingly, the downward trend of the labor force participation rates of men aged 65 and above seemed to reverse during the 1990s [Maestas and Zissimopoulos (Reference Maestas and Zissimopoulos2010), Figure 4].Footnote 27 The evidence in Costa (Reference Costa1998) and Maestas and Zissimopoulos (Reference Maestas and Zissimopoulos2010) is also consistent with the negative relationship between schooling years and retirement age for cohorts before 1930 and the positive relationship between these same factors for cohorts after 1930, as observed in panel (b) of Figure 1.

Different reasons to explain the decreasing trend of retirement age for the cohorts born in the late nineteenth and early twentieth centuries in developed countries have been offered in the literature. Kalemli-Ozcan and Weil (Reference Kalemli-Ozcan and Weil2010) focus on a decrease in uncertainty about the age at death, and show that a mortality decline may lead to early retirement if this uncertainty effect is very strong. On the other hand, D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012) find that a mortality decline may lead to early retirement if the lifetime human wealth effect dominates the years-to-consume effect, and they clarify that this condition is more likely to hold if the mortality decline concentrates on younger ages. Apart from these demographic factors, Gruber and Wise (Reference Gruber and Wise1998, Reference Gruber, Wise, Gruber and Wise1999) examine the role of generous benefits provided by the social security system. Costa (Reference Costa1998) emphasizes the wealth effect associated with sustained economic growth. Bloom et al. (Reference Bloom, Canning and Moore2014) follow-up on this idea and combine the mortality decline and increasing wealth to explain the declining retirement age in the twentieth century.

Our analysis has a direct contribution to the above debate. The analysis of D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012) implies that a necessary and sufficient condition for a mortality decline leading to an early retirement age in a life-cycle model with exogenous schooling years and the compression of morbidity effect is that the lifetime human wealth effect dominates the sum of the years-to-consume and compression of morbidity effects. In terms of the notations of this paper, the condition is equivalent to a negative value of (23).Footnote 28 According to Proposition 5, this negative value, which implies that $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b < 0$, only becomes a necessary condition for a mortality decline leading to earlier retirement when schooling duration is endogenous and (21) holds. This result implies that the lifetime human wealth channel emphasized in D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012) is less likely to explain the declining trend of retirement age in an economic environment in which schooling years also respond to mortality decline. As argued earlier, the direct effect of a mortality decline on schooling years is likely positive (i.e., (21) holds). In this case, even if the necessary condition of a negative value of the direct effect of a mortality decline on retirement age (i.e., (23) is negative) holds, a mortality decline may not be sufficient to generate a negative total effect on retirement age.

4. Quantitative analysis

In this section, we conduct quantitative analysis to examine the impact of mortality decline and productivity increase on schooling years and retirement age. We first conduct the analysis based on the baseline model, and then perform sensitivity analysis.

4.1 Specifications of the baseline model

As far as possible, the specifications in our baseline model are those commonly used in the literature. We use the Gompertz-Makeham specification for the survival function, as in Heijdra and Romp (Reference Heijdra and Romp2009) and Bloom et al. (Reference Bloom, Canning and Moore2014). The Gompertz-Makeham survival function, which involves three parameters, is given as

(24)$$l( x; \;\theta _b) = l^{GM}( x; \;\theta _b) = \exp \left\{{-\mu_{b, 0}x-\displaystyle{{\mu_{b, 1}} \over {\mu_{b, 2}}}[ {\exp ( \mu_{b, 2}x) -1} ] } \right\}, \;$$

where μ b,i (i = 0, 1, 2) is related to the mortality parameter θ b defined before, as follows:

(25)$$\mu _{b, i} = \mu _i( \theta _b) .$$

The corresponding age-specific mortality rate function is given by $\mu ^{GM}( x; \;\theta _b) = {-}{1 \over {l^{GM}( x; \;\theta _b) }}{{\partial l^{GM}( x; \;\theta _b) } \over {\partial x}} = \mu _{b, 0} + \mu _{b, 1}\exp ( \mu _{b, 2}x)$.Footnote 29 Note that the coefficients are cohort-specific.

Following the specification of the disutility of labor function in Bloom et al. (Reference Bloom, Canning and Moore2014), the disutility of labor or study function of cohort b individuals is assumed to be proportional to that cohort's age-specific mortality rate function:

(26)$$\nu ( x; \;\theta _b) = \delta \mu ^{GM}( x; \;\theta _b) = \delta [ {\mu_{b, 0} + \mu_{b, 1}\exp ( \mu_{b, 2}x) } ] , \;$$

where δ > 0.

For the human capital function, we assume

(27)$$h( S) = \exp ( {\gamma S^\lambda } ) , \;$$

where γ > 0 and 0 < λ ≤ 1. This functional form is consistent with those in the literature, such as Cervellati and Sunde (Reference Cervellati and Sunde2013) and Cai and Lau (Reference Cai and Lau2017). According to this specification, the rate of return to schooling is

(28)$$\displaystyle{{{h}^{\prime}( S) } \over {h( S) }} = \gamma \lambda S^{\lambda -1}, \;$$

which is a decreasing function of S when λ is strictly less than 1.

Finally, we assume a constant growth rate in productivity, as in Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013) and Bloom et al. (Reference Bloom, Canning and Moore2014). Normalizing the productivity index for the 1900 cohort as 1, this index for cohort b is given by

(29)$$\phi _b = \exp [ {g( b-1900) } ] , \;$$

where g (g > 0) is the growth rate of the productivity level.

4.2 Calibration

The values of the parameters in the baseline model are chosen to match those in the literature as far as possible. (We will also use other parameter values in the sensitivity analysis.) In particular, we choose ρ = 0.03, r = 0.03, and g = 0.0127, following Bloom et al. (Reference Bloom, Canning and Moore2014). We set σ, the intertemporal elasticity of substitution, to be 0.6. Following Boucekkine et al. (Reference Boucekkine, de la Croix and Licandro2003), we assume that N = 10.Footnote 30 Finally, we assume that T = 100, which corresponds to a maximum biological age (T + N) of 110.

To estimate the parameters of the survival functions, we minimize the sum of squared residuals between the survival probability based on the Gompertz-Makeham specification (24) and the data of US men from the Berkeley Mortality Database. For each cohort, we first transform the data to obtain the survival probabilities conditional on reaching age N and then choose the three survival parameters (μ b,0, μ b,1, and μ b,2) to minimize

(30)$$SSR^l( {\mu_{b, 0}, \;\mu_{b, 1}, \;\mu_{b, 2}} ) = \mathop \sum \limits_{x = 1}^T \left[{\exp \left\{{-\mu_{b, 0}x-\displaystyle{{\mu_{b, 1}} \over {\mu_{b, 2}}}[ {\exp ( \mu_{b, 2}x) -1} ] } \right\}-\displaystyle{{l_{N + x}^{data} } \over {l_N^{data} }}} \right]^2, \;$$

where $l_x^{data}$ is the survival probability data up to age x. Figure 2 shows the comparison of the survival probability data and that of the estimated Gompertz-Makeham model, for the beginning and ending years (1900 and 2000) as well as the mid-point (1950) of the US data in the Berkeley Mortality Database. We observe that the fit is good in every case. The fits in other years, which are not shown, are also good.

Figure 2. Survival probability: data and estimated Gompertz-Makeham survival functions.

For the estimation of parameters γ and λ in the human capital function h(S), we also use the non-linear least-squares method. Specifically, we choose γ and λ to minimize

(31)$$SSR^S( \gamma , \;\lambda ) = \mathop \sum \limits_{x = 0}^{15} \{ {S_{1900 + 5x}^\ast ( \gamma , \;\lambda ) -[ {S_{1900 + 5x}^{data} -( N-6) } ] } \} ^2, \;$$

where $S_b^\ast ( \gamma , \;\lambda )$ denotes the optimal schooling years of cohort b individuals calculated from the model, and $S_b^{data}$ is from the schooling years data set (for US men) used in Goldin and Katz (Reference Goldin and Katz2008), which starts in 1876 and ends in 1975.Footnote 31 There are only 16 observations, for the data in the 5-year interval, covered in both the Berkeley Mortality Database and Goldin and Katz (Reference Goldin and Katz2008). According to the estimation result, $\hat{\gamma } = 0.0482, \;\hat{\lambda } = 0.948$, and the root mean squared error (RMSE) is 0.302, which is reasonably small. The estimated schooling data fits the actual data reasonably well, as seen in Figure 3.

Figure 3. Schooling years.

Finally, we calibrate parameter δ of the disutility function such that the optimal retirement age for the 1900 cohort is 65.Footnote 32 The parameter values of the baseline case are summarized in Table 1.

Table 1. Parameters of the baseline model

4.3 Effects of mortality decline and productivity increase

In the quantitative analysis, we focus on 21 cohorts of US men, starting from 1900 and increasing every 5 years up to 2000. For each cohort, we use the specifications and parameters discussed above, together with the survival probability data of the corresponding cohort. The results of the baseline case are shown in Figure 4, with the optimal retirement ages given in the upper panel and the optimal schooling years given in the lower panel.

Figure 4. Baseline results.

The optimal retirement age rises in early cohorts (from an imposed value of 65 in 1900) until reaching the peak of 69.0 for the 1950 cohort and falls gradually afterward to 67.9 in 2000. We observe from Figure 4 that the rate of increase in retirement age from 1900 to 1950 is larger in magnitude than the rate of decrease from 1950 to 2000. The path of optimal schooling years, on the other hand, shows a rather different profile compared with that of the retirement age. The optimal schooling years series starts from 8.22 years for the 1900 cohort, increases all the way up to 13.71 years for the 1990 cohort, and then decreases slightly to 13.67 years for the 2000 cohort. We also observe from Figure 4 that the rate of increase in schooling years was quite high in the earlier decades, but the increase slowed down in the middle of the century.

To understand the trend of the optimal retirement age and schooling years paths, we perform decomposition exercises by isolating the effects of the two factors individually. Specifically, when analyzing the effect of mortality decline only, we shut down the productivity increase channel by using g = 0 in (29). On the other hand, when analyzing the effect of productivity increase only, we use μ b,j = μ 1900,j (j = 0, 1, 2) in (24) and (26). In other words, we switch off the mortality decline channel by assuming that the survival functions in later decades are the same as the 1900 version. Note that we keep other parameters as those in Table 1 in these decomposition exercises.

We first look at the effect on optimal retirement age in the upper panel of Figure 4. When the productivity level is held constant, the retirement age increases monotonically from 65 to 78.1 in response to reductions in mortality rates. We also observe that the retirement age increases at a decreasing rate, implying that the mortality effect becomes weaker over time. This reflects the well-known fact that the rate of mortality decline in developed countries slowed down in the second half of the twentieth century. As observed in Figure 2, the shifting out of the survival functions from 1950 to 2000 is smaller than that from 1900 to 1950. On the other hand, the optimal retirement age decreases from 65 to 53.0 if only a productivity increase (and thus increased lifetime wealth) occurs. Note that the change in retirement age in response to a productivity increase is almost linear, unlike the effect of a mortality decline.

We observe similar effects on optimal schooling years. When only a mortality decline occurs, optimal schooling years increase monotonically from 8.22 to 18.45. On the other hand, when only a productivity increase occurs, the corresponding figure decreases from 8.22 years to 5.29. Again, we observe that the schooling years path is concave when only a mortality decline occurs, and is rather linear when only a productivity increase occurs.

The two results on retirement age (the positive effect of mortality decline and the negative effect of productivity increase) are consistent with those in Bloom et al. (Reference Bloom, Canning and Moore2014). In this sense, their qualitative results, which are developed for a model with exogenous retirement age, are also relevant when the retirement age is endogenously determined. On the other hand, we find that introducing the choice of schooling years brings new qualitative and quantitative results when we consider the effects of simultaneous changes in mortality and productivity. Bloom et al. (Reference Bloom, Canning and Moore2014) show that the magnitude of the decrease in optimal retirement age in response to the productivity increase is about twice that of the increase in retirement age in response to the mortality decline. As a result, the optimal retirement age decreases from 1901 to 1951 as well as from 1951 to 1996 [Bloom et al. (Reference Bloom, Canning and Moore2014), Table 3]. Our analysis, however, shows that the optimal retirement age increases in the first half of the century and then decreases in the second half. The intuition of this difference can be traced to the concave shape of the path of schooling years, which is caused by the substantial decline of mortality in the first half of the century and the decreasing rate of return of human capital formation. Bloom et al. (Reference Bloom, Canning and Moore2014) do not consider schooling choice, and thus, any indirect effect through schooling years on the retirement age is not captured.Footnote 33

Regarding the impact on schooling years, the positive effect of mortality decline is consistent with that of Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013). On the other hand, the negative effect of productivity increase on schooling years is different from the positive effect in Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013). We will re-examine this issue in section 5.

4.4 Sensitivity analysis

We then perform the sensitivity analysis regarding the parameters listed in Table 1. The results of the sensitivity analysis are reported in a Supplementary Appendix available upon request.

We summarize the results briefly. The computational results are robust with respect to the interest rate (r), the subjective discount rate (ρ), the age when individuals begin making economic decisions (N), and the maximum age in the model (T), at least when they are within some relevant ranges. They are less robust with respect to the growth rate of productivity (g) and the intertemporal elasticity of substitution (σ) individually, but we also find that simultaneous changes in g and σ can lead to retirement age and schooling years paths close to the baseline case.

For each of the analyzed cases, we also perform decomposition exercises by focusing on one exogenous change (mortality decline or productivity increase) only. We find that in each case, the optimal retirement age and schooling years always change monotonically, and these two variables always move in the same direction.

5. An extension: including the direct utility benefit of schooling

In the previous sections, we have used a life-cycle model focusing only on the productivity-enhancing role of schooling. While the relative simplicity of the model allows us to obtain useful results and transparent intuition, we encounter a drawback when we match its predictions with the data. Even if we allow for various combinations of mortality and productivity shocks, explaining the negative correlation between schooling years and retirement age for those born before 1930 is difficult. Additionally, the effect of a productivity increase on schooling is negative according to the model, which is contradictory to the result in Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013) and the conjecture of many researchers.

One way to fix the above inadequacy is to change some specifications or parameter values for the computational analysis. For example, we may consider other specifications of the compression of morbidity term to see whether the direct effect of mortality decline on retirement age is positive or negative. Relaxing the assumption of the constant rate of productivity increase to allow for different rates of productivity improvement at different sub-periods may also be helpful. We leave these channels to be explored in future work focusing on quantitative analysis.Footnote 34 To maintain the theoretical focus of this paper, we decide not to take this route, but instead to extend our model and perform further theoretical analysis.

In this section, we address these issues by adding one ingredient to the model in section 2, and we consider only the effect of productivity increase.Footnote 35 In the extended model, we make two changes. First, we do not consider mortality decline in this section, and simply use the notations μ(x) and l(x) to represent these cohort-invariant functions. Second, instead of the objective function (2), we assume that the individual maximizesFootnote 36

(32)$$ \eqalignb{& {\int_0^T {\exp } ( -\!\!\rho x ) l ( x ) \displaystyle{{c{ ( x ) }^{1-{1 \over \sigma }}-1} \over {1-\displaystyle{1 \over \sigma }}}{\rm d}x + \int_0^S {\exp } ( -\!\!\rho x ) l ( x ) \zeta {\rm d}x } \cr& \qquad \qquad \quad -\int_0^R {\exp ( -\!\!\rho x ) } l ( x ) \nu ( x ) {\rm d}x, \;} $$

where ζ is the flow utility of schooling, and is assumed to be non-negative. This modified objective function incorporates the direct utility benefit of schooling, as in Bils and Klenow (Reference Bils and Klenow2000) and Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013). The other aspects of the model captured by (1), (3), and (4) remain the same.

We show that the first-order conditions are now given by (8) and

$$\phi _bh^{\prime}( {\tilde{S}( R) } ) \left[{\int_{ \tilde{S}( R ) }^R {\exp ( {-}rx) } l( x) {\rm d}x } \right] + \eta ( {\tilde{S}( R) , \;R; \;\phi_b} ) $$
(33)$$ = \phi _b\exp ( {-r\tilde{S}( R) } ) l( {\tilde{S}( R) } ) h( {\tilde{S}( R) } ) , \;$$

where

(34)$$\eta ( {\tilde{S}( R) , \;R; \;\phi_b} ) = \displaystyle{{\exp ( {-\rho \tilde{S}( R) } ) l( {\tilde{S}( R) } ) \zeta } \over {{( \phi _b) }^{-{1 \over \sigma }}{[ {c^n( {0, \;\tilde{S}( R) , \;R} ) } ] }^{-{1 \over \sigma }}}}.$$

(Detailed derivations for this model are given in Supplementary Appendix available upon request.) The first-order condition (8) for retirement age is unaffected by the introduction of the direct utility of schooling term. On the other hand, compared with (7) of the model in section 2, we notice that the marginal cost of extending schooling years, given by the right-hand side term of (33), remains the same, but the marginal benefit now consists of two terms. The first term on the left-hand side of (33) represents the productivity-enhancing effect, which is the same as in (7). Additionally, the individual is assumed to have direct utility from schooling. The marginal direct utility from schooling is given by $\exp ( {-\rho \tilde{S}( R) } ) l( {\tilde{S}( R) } ) \zeta$. Converting this term into monetary units (by dividing it by the marginal utility of consumption) leads to the second term on the left-hand side of (33).

5.1 Positive feedback

Similar to the analysis in section 3, we first obtain the relations between the two endogenous variables. We show that

(35)$$\displaystyle{{\partial \tilde{S}( R^\ast ; \;\phi _b) } \over {\partial R}} = \displaystyle{{\phi _b{h}^{\prime}( S^\ast ) \exp ( {-}rR^\ast ) l( R^\ast ) + \displaystyle{1 \over \sigma }\eta ( S^\ast , \;R^\ast ; \;\phi _b) \left[{\displaystyle{{\exp ( {-}rR^\ast ) l( R^\ast ) } \over {\int_{S^\ast }^{R^\ast } \exp ( {-}rx) l( x) {\rm d}x}}} \right]} \over {\Delta _S}}, \;$$

and that

(36)$$\displaystyle{{\partial \tilde{R}( S^\ast ; \;\phi _b) } \over {\partial S}} = \displaystyle{{\displaystyle{{{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{1 \over \sigma }\displaystyle{1 \over {c^n( 0, \;S^\ast , \;R^\ast ) }}\displaystyle{{\partial c^n( 0, \;S^\ast , \;R^\ast ) } \over {\partial S}}} \over {r-\rho + \displaystyle{1 \over \sigma }\displaystyle{{\exp ( {-}rR^\ast ) l( R^\ast ) } \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x) {\rm d}x}} + \displaystyle{1 \over {\nu ( R^\ast ) }}\displaystyle{{\partial \nu ( R^\ast ) } \over {\partial x}}}}, \;$$

where

(37)$$\eqalign{& \Delta _S = \phi _b\exp ( {-}rS^\ast ) l( S^\ast ) h( S^\ast ) \left[{\displaystyle{{2{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }}-\mu ( S^\ast ) -r} \right]\cr & \quad + \eta ( S^\ast , \;R^\ast ; \;\phi _b) \left[{\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }} + \displaystyle{1 \over \sigma }\displaystyle{{\eta ( S^\ast , \;R^\ast ; \;\phi_b) } \over {\phi_bh( S^\ast ) \int_{S^\ast }^{R^\ast } \exp ( {-}rx) l( x) {\rm d}x}} + \mu ( S^\ast ) + \rho } \right]> 0.} $$

The intuition of various terms in (35) and (36) is as follows. A change in retirement age (R) affects the first-order condition (33) of schooling years through both the productivity-enhancing and direct utility benefit terms. First, it affects the duration that the individual can reap the benefit of schooling. This is represented by the first term on the numerator of (35). We label this the Ben-Porath effect. Second, it affects the direct utility term (through the marginal utility of consumption), as given in the second term of the numerator of (35). We label it the direct utility of schooling effect. Note that when ζ = 0, this effect disappears. On the other hand, a change in schooling years (S) affects the first-order condition (8) of retirement age in two ways. First, it affects the wage rate through the term h (S*)/h(S*) related to the human capital formation. This is given in the first term on the numerator of (36). We label it the return to schooling effect. Second, a change in schooling years affects the individual's lifetime wealth, as captured by the level of normalized consumption at age 0. This is given in the second term on the numerator of (36). Moreover, we show that $( {-}1/c^n( 0, \;S^\ast , \;R^\ast ) ) \,\,( \partial c^n( 0, \;S^\ast , \,\;R^\ast ) /\partial S) \, = \, \eta ( S^\ast , \;R^\ast ; \,\;\phi _b ) / [ \phi _bh( S^\ast ) \int _{S^\ast }^{R^\ast } \,\exp ( {-}rx) l( x) {\rm d}x ]$ and that it is positive when ζ is positive. We label it the consumption level effect. On the other hand, for the model in section 2, (1/c n(0, S*, R*)) (∂c n(0, S*, R*)/∂S) = 0 because of (7).Footnote 37 As a result, the consumption level effect disappears in that model.

We conclude from (35) and (36) that $\partial \tilde{S}( R^\ast ; \;\phi _b) /\partial R > 0$ and that $\partial \tilde{R}( S^\ast ; \;\phi _b) /\partial S > 0$. Thus, positive feedback between them continues to hold in the presence of the direct utility benefit of schooling.

5.2 Positive feedback (but possibly negative co-movement) between schooling and retirement choices

To examine the total effect of productivity increase on schooling years or retirement age, we first consider the two direct effects. For this model, we show that $\partial \tilde{R}( S^\ast ; \;\phi _b) /\partial \phi _b$ is the same as (A.16) of the model in section 2, but

(38)$$\displaystyle{{\partial \tilde{S}( R^\ast ; \;\phi _b) } \over {\partial \phi _b}} = \displaystyle{{-\left({1-\displaystyle{1 \over \sigma }} \right)\displaystyle{1 \over {\phi _b}}\eta ( S^\ast , \;R^\ast ; \;\phi _b) } \over {\Delta _S}}.$$

If 0 < σ < 1 according to (20), then one direct effect is positive $( \partial \tilde{S}( R^\ast ; \;\phi _b) /\partial \phi _b > 0)$ and the other negative $( \partial \tilde{R}( S^\ast ; \;\phi _b) /\partial \phi _b < 0)$. In contrast to the model in section 2, the two direct effects are opposite in sign in the extended model. Combining this feature with positive feedback between the two endogenous variables (positive endogenous effect), the total effects of the productivity increase on schooling years and retirement age may be of the same sign or of different signs, as given in the following proposition:

Proposition 6. Consider the life-cycle model with the direct utility benefit of schooling and an exogenous process of productivity increase, as given by (1), (3), (4), and (32), where θ b is cohort-invariant. If (20) holds, then

(a)S*/∂ϕ b < 0 andR*/∂ϕ b < 0 if

(39)$$0 < \zeta < \left[{\displaystyle{{\exp [ {-\rho ( R^\ast{-}S^\ast ) } ] \displaystyle{{l( R^\ast ) } \over {l( S^\ast ) }}\displaystyle{{{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}} \over {r-\rho + \displaystyle{1 \over {\nu ( R^\ast ) }}\displaystyle{{\partial \nu ( R^\ast ) } \over {\partial x}}}}} \right]\nu ( R^\ast ) ; \;$$

(b)S*/∂ϕ b > 0 andR*/∂ϕ b < 0 if

$$\left[{\displaystyle{{\exp [ {-\rho ( R^\ast{-}S^\ast ) } ] \displaystyle{{l( R^\ast ) } \over {l( S^\ast ) }}\displaystyle{{{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}} \over {r-\rho + \displaystyle{1 \over {\nu ( R^\ast ) }}\displaystyle{{\partial \nu ( R^\ast ) } \over {\partial x}}}}} \right]\nu ( R^\ast ) < \zeta $$
(40)$$< \left[{\displaystyle{{\exp [ {( r-\rho ) ( R^\ast{-}S^\ast ) } ] \left[{\displaystyle{{2{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }}-\mu ( S^\ast ) -r} \right]} \over {\displaystyle{{{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }}-\mu ( S^\ast ) -\rho }}} \right]\nu ( R^\ast ) ; \;$$

and

(c)S*/∂ϕ b > 0 andR*/∂ϕ b > 0 if

(41)$$\zeta > \left[{\displaystyle{{\exp [ {( r-\rho ) ( R^\ast{-}S^\ast ) } ] \left[{\displaystyle{{2{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }}-\mu ( S^\ast ) -r} \right]} \over {\displaystyle{{{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }}-\mu ( S^\ast ) -\rho }}} \right]\nu ( R^\ast ) .$$

Proposition 6 specifies the conditions that lead to the three possible outcomes. These outcomes are represented graphically in Figure 5.Footnote 38 The interpretations of the conditions are as follows. When the extent of the utility benefit of schooling (ζ) is small and (39) holds, then the direct effect of productivity increase on schooling years is not strong enough. Thus, the indirect effect dominates and the total effect (∂S*/∂ϕ b) is still negative. Similarly, ∂R*/∂ϕ b is negative. A productivity increase causes a positive co-movement of S* and R*. The model in section 2, with ζ = 0, could be treated as a limiting case of case (a). When ζ is large enough, the direct effect dominates and the total effect ∂S*/∂ϕ b turns positive. Furthermore, we find that when ζ is larger than the first threshold in (40) but not the second one, we have the results ∂S*/∂ϕ b > 0 and ∂R*/∂ϕ b < 0. (The two thresholds correspond to the upward-sloping dotted lines in Figure 5.) In contrast to case (a), there is a negative co-movement of S* and R* in case (b). When ζ is substantially large to pass the higher threshold, we have ∂S*/∂ϕ b > 0 and ∂R*/∂ϕ b > 0, as in case (c).Footnote 39 The co-movement of S* and R* becomes positive again.

Figure 5. The impact of productivity increase for the extended model.

The results in Proposition 6 imply that the extended model is able to explain the negative co-movement of schooling years and retirement age, provided that the flow utility of schooling is in the intermediate range given by (40). Moreover, they have relevance for the results in Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013). In their paper, retirement age is assumed to be exogenous. We can see from (17) and (38) that we only need ζ > 0 to guarantee that $\partial \tilde{S}( R^\ast ; \;\phi _b) /\partial \phi _b > 0$ in that environment. On the other hand, when retirement age is also a choice variable, we need parameter ζ to be not only positive, but also larger than the lower threshold in (40) to explain the positive total effect of productivity increase on schooling years (∂S*/∂ϕ b > 0).

6. Conclusion

Mortality decline and productivity increase are two major forces affecting the expected lifetime wealth of different cohorts over the twentieth century. Many researchers [such as Boucekkine et al. (Reference Boucekkine, de la Croix and Licandro2002); Kalemli-Ozcan and Weil (Reference Kalemli-Ozcan and Weil2010); Cervellati and Sunde (Reference Cervellati and Sunde2013); Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013); and Bloom et al. (Reference Bloom, Canning and Moore2014)] have studied the impact of these two events on schooling years and/or retirement age in developed economies. However, the results obtained in these papers are sometimes quite diverse, perhaps because of the different assumptions used.

This paper studies the impact of mortality decline and productivity increase in a life-cycle model with both schooling and retirement choices. Unlike many researchers who perform quantitative analysis to explain the empirical patterns, our primary goal is to obtain insight based on the analytical results of a model with features commonly used in existing papers. We believe this approach is more effective in understanding and reconciling different existing results.

There are four main contributions in this paper. The first two contributions, which are particularly relevant to the economic demography literature, are obtained from the life-cycle model focusing only on the productivity-enhancing role of schooling (sections 2 and 3). First, we show in Proposition 1 that the optimal retirement age increases in response to a rise in schooling years and that optimal schooling duration also rises in response to an (anticipated) increase in retirement age. A by-product of our analysis is that Proposition 1 extends Ben-Porath's (Reference Ben-Porath1967) result to an environment in which both schooling years and retirement age are choice variables. Second, we show in Proposition 5 that a negative direct effect of mortality decline on retirement age (i.e., when (23) is negative) is only a necessary condition for a mortality decline to affect retirement age negatively when schooling duration is endogenous. On the other hand, this condition is the necessary and sufficient condition when schooling duration is exogenous. This result implies that the lifetime human wealth channel suggested by D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012) is less likely to explain the decreasing trend of retirement age when the schooling duration also responds to the mortality decline.

Our third contribution is based on a useful framework to decompose the effect of the mortality decline or productivity increase on schooling years or retirement age in terms of the direct (shock) and indirect (feedback) components, as in (17) and (18). Using this framework, we clarify that positive feedback and positive co-movement are different, but somewhat related, concepts. Based on Propositions 2–4, one can conclude that when the two direct effects are of the same sign or when one direct effect is zero while the other is non-zero, then a positive co-movement exists between the two endogenous variables. On the other hand, Proposition 6 shows that when the two direct effects are of opposite signs, then the result is generally ambiguous, with a negative co-movement of them being possible under condition (40). To summarize, when positive feedback is present, positive co-movement between schooling years and retirement age is usually observed, but negative co-movement between them is also possible depending on the signs of the direct and indirect effects and their relative magnitude.

The fourth contribution is related to the prediction power of the life-cycle model to explain the co-movement of schooling years and retirement age. In particular, we pay attention to the negative co-movement of these two variables for cohorts born before 1930 (see panel (b) of Figure 1). Even though quantitative analysis is not the major focus of this paper, our theoretical and computational results are still helpful in suggesting that a standard life-cycle model focusing only on the productivity-enhancing role of schooling may not be enough to explain the negative co-movement of schooling years and retirement age no matter how to combine the two exogenous shocks. On the other hand, if we extend the baseline model to incorporate the direct utility benefit of schooling (as in section 5), then the extended model can explain the negative correlation of schooling years and retirement age when the flow utility of schooling is in some intermediate range. The main contribution of this paper is simply to point out relevant theoretical results. For the more challenging task of examining whether incorporating this factor can quantitatively explain both the negative co-movement of schooling years and retirement age for cohorts born before 1930 and the positive co-movement of these two variables for cohorts born after 1930, we leave this to future study.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/dem.2020.33

Acknowledgements

We are grateful to an Associate Editor, three referees, seminar/conference participants at Hitotsubashi University, Monash University, University of Tasmania, the European Society for Population Economics (İzmir, Turkey), and the International Conference on Pension, Insurance and Saving (Lisbon, Portugal) for helpful comments and suggestions. We also thank Diego Restuccia for providing the schooling years data, and thank the Research Grants Council of Hong Kong (Project No. 17500917) for financial support.

Appendix A

We derive the first-order and second-order conditions of the main model in section A.1 and provide a detailed analysis of some comparative static exercises in section A.2. Furthermore, the proofs of Propositions 1 and 2 are given in sections A.3 and A.4, respectively.

A.1 First-order and second-order conditions

The intertemporal budget constraint at age 0 is given by

(A.1)$$\int_0^T {\exp } ( {-}rx) l( x; \;\theta _b) c( x, \;S, \;R; \;\theta _b, \;\phi _b) {\rm d}x = \int_S^R {\exp } ( {-}rx) l( x; \;\theta _b) \phi _bh( S) {\rm d}x.$$

We first obtain the individual's optimal consumption path, conditional on schooling years and retirement age. Using standard techniques of dynamic optimization, it is straightforward to obtain

$$c( {x, \;S, \;R; \;\theta_b, \;\phi_b} ) = \exp [ {\sigma ( {r-\rho } ) x} ] c( {0, \;S, \;R; \;\theta_b, \;\phi_b} ) .$$

Substituting this equation into (A.1) and simplifying gives

(A.2)$$c( {0, \;S, \;R; \;\theta_b, \;\phi_b} ) = \displaystyle{{\phi _bh( S) \int _S^R \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x} \over {\int _0^T \exp \{ {-[ {( 1-\sigma ) r + \sigma \rho } ] x} \} l( x; \;\theta _b) {\rm d}x}}.$$

It is easy to see that (6) follows from (A.2).

Differentiating (A.1) with respect to R and S, respectively, we obtain

$$\int_0^T {\exp ( {-}rx) } l( x; \;\theta _b) \displaystyle{{\partial c( x, \;S, \;R; \;\theta _b, \;\phi _b) } \over {\partial S}}{\rm d}x$$
(A.3)$$ = \phi _b\left[{h^{\prime}( S) \int_S^R {\exp ( {-}rx) } l( x; \;\theta_b) {\rm d}x-\exp ( {-}rS) l( S; \;\theta_b) h( S) } \right], \;$$

and

(A.4)$$\int_0^T {\exp ( {-}rx) } l( x; \;\theta _b) \displaystyle{{\partial c( {x, \;S, \;R; \;\theta_b, \;\phi_b} ) } \over {\partial R}}{\rm d}x = \exp ( {-}rR) l( R; \;\theta _b) \phi _bh( S) .$$

Conditional on the optimal consumption path (5), we now obtain the first-order necessary conditions for optimal schooling years and retirement age. Substitute (5) into (2) to express the objective function in terms of S and R only. Denote it by

$$U_b( S, \;R) = & \int_0^T {\exp ( \! -\rho x) } l( x; \;\theta _b) \displaystyle{{c{( {x, \;S, \;R; \;\theta_b, \;\phi_b} ) }^{1-{1 \over \sigma }}-1} \over {1-\displaystyle{1 \over \sigma }}}{\rm d}x$$
$$-\int_{0 }^R {\exp } ( \!-\rho x) l( x; \;\theta _b) \nu ( x; \;\theta _b) {\rm d}x.$$

Differentiating U b(S, R) with respect to S and using (A.3) to simplify, we obtain

$$\displaystyle{{\partial U_b( S, \;R) } \over {\partial S}}$$
(A.5)$$\eqalign{ = ( \phi _b) ^{1-{1 \over \sigma }}c^n( 0, \;S, \;R; \;\theta _b) ^{- {1 \over \sigma }}& \left[{h^{\prime}( S) \int_S^R {\exp } ( {-}rx) l( x; \;\theta_b) {\rm d}x-\exp ( {-}rS) l( S; \;\theta_b) h( S) } \right]\cr \equiv a_1F( {S, \;R} ) , \;} $$

where $a_1 = ( \phi _b) ^{1-{1 \over \sigma }}c^n( 0, \;S, \;R; \;\theta _b) ^{-{1 \over \sigma }}$ and F(S, R) is the remaining term. Since a 1 is non-zero, the first-order condition for schooling is given by F(S, R) = 0, or equivalently, (7).

Differentiating U b(S, R) with respect to R and using (A.4) leads to

(A.6)$$\eqalign{\displaystyle{{\partial U_b( S, \;R) } \over {\partial R}} \,= \;& l( R; \;\theta _b) \left[{{( \phi_b) }^{1-{1 \over \sigma }}& \exp ( {-}rR) h( S) c^n{( 0, \;S, \;R; \;\theta_b) }^{-{1 \over \sigma }}-\exp (\! -\rho R) \nu ( R; \;\theta_b) } \right]\cr \equiv a_2G( S, \;R) , \;} $$

where a 2 = l(R;θ b) and G(S, R) is the remaining term. Since a 2 is non-zero, the first-order condition for retirement age is given by G(S, R) = 0, or equivalently, (8).

The corresponding second-order sufficient conditions, evaluated at the optimal choices, are: (a) ∂2U b(S*, R*)/∂S 2 < 0, which is equivalent to

(A.7)$$\displaystyle{{2{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }}-\mu ( S^\ast ; \;\theta _b) -r > 0, \;$$

(b) ∂2U b(S*, R*)/∂R 2 < 0, which is equivalent to

(A.8)$$r-\rho + \displaystyle{1 \over \sigma }\displaystyle{{\exp ( {-}rR^\ast ) l( R^\ast ; \;\theta _b) } \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}} + \displaystyle{1 \over {\nu ( R^\ast ; \;\theta _b) }}\displaystyle{{\partial \nu ( R^\ast ; \;\theta _b) } \over {\partial x}} > 0, \;$$

and (c) $\displaystyle{{\partial ^2U_b( S^\ast , \;R^\ast ) } \over {\partial S^2}}{{\partial ^2U_b( S^\ast , \;R^\ast ) } \over {\partial R^2}}-\left[{{{\partial^2U_b( S^\ast , \;R^\ast ) } \over {\partial S\partial R}}} \right]^2 > 0$, which is equivalent toFootnote 40

(A.9)$$1-( \partial S^\ast{/}\partial R) \,( \partial R^\ast{/}\partial S) > 0.$$

A.2 Comparative static analysis

Combining (7) to (10), the optimal choices S* and R* are related by

(A.10)$$h^{\prime}( S^\ast ) \left[{\int_{S^\ast }^{R^\ast } {\exp } ( {-}rx) l( x; \;\theta_b) {\rm d}x} \right] = \exp ( {-}rS^\ast ) l( S^\ast ; \;\theta _b) h( S^\ast ) , \;$$

and

(A.11)$$( \phi _b) ^{1-{1 \over \sigma }}\exp ( {-}rR^\ast ) h( S^\ast ) [ {c^n( 0, \;S^\ast , \;R^\ast ; \;\theta_b) } ] ^{-{1 \over \sigma }} = \exp ( -\rho R^\ast ) \nu ( R^\ast ; \;\theta _b) .$$

We differentiate (A.10) totally to obtain

$$\left[{\displaystyle{{2{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{{h}^{\prime \prime}( S^\ast ) } \over {{h}^{\prime}( S^\ast ) }}-\mu ( S^\ast ; \;\theta_b) -r} \right]{\rm d}S^\ast $$
(A.12)$$ = \displaystyle{{\exp ( {-}rR^\ast ) l( R^\ast ; \;\theta _b) } \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}}{\rm d}R^\ast{ + } \left[{\displaystyle{{\int_{S^\ast }^{R^\ast } \exp ( {-}rx) \displaystyle{{\partial l( x; \;\theta_b) } \over {\partial \theta_b}}{\rm d}x} \over {\int_{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta_b) {\rm d}x}}-\displaystyle{{\displaystyle{{\partial l( S^\ast ; \;\theta_b) } \over {\partial \theta_b}}} \over {l( S^\ast ; \;\theta_b) }}} \right]{\rm d}\theta _b, \;$$

and differentiate (A.11) totally to obtain

$$\left[{r-\rho + \displaystyle{1 \over \sigma }\displaystyle{1 \over {c^n( 0, \;S^\ast , \;R^\ast ; \;\theta_b) }}\displaystyle{{\partial c^n( 0, \;S^\ast , \;R^\ast ; \;\theta_b) } \over {\partial R}} + \displaystyle{1 \over {\nu ( R^\ast ; \;\theta_b) }}\displaystyle{{\partial \nu ( R^\ast ; \;\theta_b) } \over {\partial x}}} \right]{\rm d}R^\ast $$
$$ = \left[{\displaystyle{{{h}^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{1 \over \sigma }\displaystyle{1 \over {c^n( 0, \;S^\ast , \;R^\ast ; \;\theta_b) }}\displaystyle{{\partial c^n( 0, \;S^\ast , \;R^\ast ; \;\theta_b) } \over {\partial S}}} \right]{\rm d}S^\ast{ + } \left({1-\displaystyle{1 \over \sigma }} \right)\displaystyle{1 \over {\phi _b}}{\rm d}\phi _b$$
(A.13)$$-\left[{\displaystyle{1 \over \sigma }\displaystyle{1 \over {c^n( 0, \;S^\ast , \;R^\ast ; \;\theta_b) }}\displaystyle{{\partial c^n( 0, \;S^\ast , \;R^\ast ; \;\theta_b) } \over {\partial \theta_b}} + \displaystyle{1 \over {\nu ( R^\ast ; \;\theta_b) }}\displaystyle{{\partial \nu ( R^\ast ; \;\theta_b) } \over {\partial \theta_b}}} \right]{\rm d}\theta _b.$$

Straightforward manipulation of (A.12) and (A.13) leads to (11), (12), (15), and (16) in the main text, where the various terms of these equations are given by (13) and (14),

(A.14)$$\displaystyle{{\partial \tilde{S}( R^\ast ; \;\theta _b) } \over {\partial \theta _b}} = \displaystyle{{\displaystyle{{\int_{S^\ast }^{R^\ast } {\exp } ( {-rx} ) \displaystyle{{\partial l( x; \;\theta _b) } \over {\partial \theta _b}}{\rm d}x} \over {\int_{S^\ast }^{R^\ast } {\exp } ( {-rx} ) l( x; \;\theta _b) {\rm d}x}}-\displaystyle{{\displaystyle{{\partial l( S^\ast ; \;\theta _b) } \over {\partial \theta _b}}} \over {l( S^\ast ; \;\theta _b) }}} \over {\displaystyle{{2h^{\prime}( S^\ast ) } \over {h( S^\ast ) }}-\displaystyle{{h^{{\prime}{\prime}}( S^\ast ) } \over {h^{\prime}( S^\ast ) }}-\mu ( S^\ast ; \;\theta _b) -r}}, \;$$
(A.15)$$\displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) } \over { \partial \theta _b}} = \displaystyle{{-\displaystyle{1 \over \sigma }\displaystyle{1 \over {c^n( 0, \;S^\ast , \;R^\ast ; \;\theta _b) }}\displaystyle{{c^n( 0, \;S^\ast , \;R^\ast ; \;\theta _b) } \over {\partial \theta _b}}-\displaystyle{1 \over {\nu ( R^\ast ; \;\theta _b) }}\displaystyle{{\partial \nu ( R^\ast ; \;\theta _b) } \over {\partial \theta _b}}} \over {r-\rho + \displaystyle{1 \over \sigma }\displaystyle{{\exp ( {-}rR^\ast ) l( R^\ast ; \;\theta _b) } \over {\int_{S^\ast }^{R^\ast } {\exp ( {-}rx) } l( x; \;\theta _b) {\rm d}x}} + \displaystyle{{1 } \over {\nu ( R^\ast ; \;\theta _b) }}\displaystyle{{\partial \nu ( R^\ast ; \;\theta _b) } \over {\partial x}}}}, \;$$

and

(A.16)$$\displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) } \over {\partial \phi _b}} = \displaystyle{{\left({1-\displaystyle{1 \over \sigma }} \right)\displaystyle{1 \over {\phi _b}}} \over {r-\rho + \displaystyle{1 \over \sigma }\displaystyle{{\exp ( {-}rR^\ast ) l( R^\ast ; \;\theta _b) } \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}} + \displaystyle{1 \over {\nu ( R^\ast ; \;\theta _b) }}\displaystyle{{\partial \nu ( R^\ast ; \;\theta _b) } \over {\partial x}}}}.$$

The following analysis is useful for the impact of mortality decline in section 3.3. Because of (1), we have

(A.17)$$\displaystyle{{\partial l( {x; \;\theta_b} ) } \over {\partial \theta _b}} = l( {x; \;\theta_b} ) \int _0^x \left({-\displaystyle{{\partial \mu ( {t; \;\theta_b} ) } \over {\partial \theta_b}}} \right){\rm d}t.$$

Using (A.17), it can be shown that

$$\displaystyle{{\int _{S^\ast }^{R^\ast } \exp ( {-}rx) \displaystyle{{\partial l( x; \;\theta _b) } \over {\partial \theta _b}}{\rm d}x} \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}}-\displaystyle{{\displaystyle{{\partial l( S^\ast ; \;\theta _b) } \over {\partial \theta _b}}} \over {l( S^\ast ; \;\theta _b) }}$$
$$ = \displaystyle{{\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) \int _0^x \left({-\displaystyle{{\partial \mu ( t; \;\theta_b) } \over {\partial \theta_b}}} \right){\rm d}t{\rm d}x} \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}}-\int _0^{S^\ast } \left({-\displaystyle{{\partial \mu ( t; \;\theta_b) } \over {\partial \theta_b}}} \right){\rm d}t$$
$$ = \displaystyle{{\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) \left[{\int_0^{S^\ast } \left({-\displaystyle{{\partial \mu ( t; \;\theta_b) } \over {\partial \theta_b}}} \right){\rm d}t + \int_{S^\ast }^x \left({-\displaystyle{{\partial \mu ( t; \;\theta_b) } \over {\partial \theta_b}}} \right){\rm d}t} \right]{\rm d}x} \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}}-\int _0^{S^\ast } \left({-\displaystyle{{\partial \mu ( t; \;\theta_b) } \over {\partial \theta_b}}} \right){\rm d}t$$
(A.18)$$ = \displaystyle{{\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) \int _{S^\ast }^x \left({-\displaystyle{{\partial \mu ( t; \;\theta_b) } \over {\partial \theta_b}}} \right){\rm d}t{\rm d}x} \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}}.$$

We also differentiate c n(0, S*, R*;θ b) in (6) with respect to θ b and use (A.17) to obtain

$$\displaystyle{1 \over {c^n( 0, \;S^\ast , \;R^\ast ; \;\theta _b) }}\displaystyle{{\partial c^n( 0, \;S^\ast , \;R^\ast ; \;\theta _b) } \over {\partial \theta _b}} = \displaystyle{{\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) \left[{\int_0^x \left({-\displaystyle{{\partial \mu ( t; \;\theta_b) } \over {\partial \theta_b}}} \right){\rm d}t} \right]{\rm d}x} \over {\int _{S^\ast }^{R^\ast } \exp ( {-}rx) l( x; \;\theta _b) {\rm d}x}}$$
(A.19)$$-\displaystyle{{\int _0^T \exp \{ {-[ {( 1-\sigma ) r + \sigma \rho } ] x} \} l( x; \;\theta _b) \left[{\int_0^x \left({-\displaystyle{{\partial \mu ( t; \;\theta_b) } \over {\partial \theta_b}}} \right){\rm d}t} \right]{\rm d}x} \over {\int _0^T \exp \{ {-[ {( 1-\sigma ) r + \sigma \rho } ] x} \} l( {x; \;\theta_b} ) {\rm d}x}}.$$

A.3 Proof of Proposition 1

It is straightforward to conclude that the denominator of (13) is positive because of (A.7), and the numerator of (13) is positive. Therefore, $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R > 0.$ This proves (a).

Similarly, the denominator of (14) is positive because of (A.8). On the other hand, the return to schooling h (S*)/h(S*) is positive under standard assumptions and thus, the numerator of (14) is positive. Therefore, $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S > 0.$ This proves (b).

A.4 Proof of Proposition 2

We observe from (17) and (18) that when ψ = ϕ b, each of the two total effects (∂S*/∂ϕ b or ∂R*/∂ϕ b) depends on $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \phi _b$ only, since the other direct effect $( \partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \phi _b)$ is zero. When (20) holds, it is easy to conclude from (A.8) and (A.16) that

(A.20)$$\displaystyle{{\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) } \over {\partial \phi _b}} < 0.$$

According to Proposition 1(a), $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R$ is positive. Moreover, according to (A.9), $1-( \partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R) \,( \partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S) > 0$. Combining the above results, we conclude from (17) and (18) that ∂S*/∂ϕ b and ∂R*/∂ϕ b are of the same sign as that of the direct effect $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \phi _b$, which is negative according to (A.20).

Footnotes

1 The life expectancy data is from the Berkeley Mortality Database (http://www.demog.berkeley.edu/~bmd/).

2 The GDP per capita data is from the Maddison Project (http://www.ggdc.net/maddison/maddison-project/data.htm).

3 We use the harmonized Basic Monthly Current Population Survey data from IPUMS [Flood et al. (Reference Flood, King, Rodgers, Ruggles and Warren2018)]. Detailed data construction and definitions are given in a Supplementary Appendix, which is available upon request.

4 Note that we do not analyze social security in this paper. The detailed features of the social security system (in terms of the payroll tax rate, the level and coverage of pension benefit, eligibility age, etc.) differ substantially among countries and for different sub-periods of the twentieth century. Theoretical results are likely to be less sharp in this more complicated environment. Since focusing on positive feedback between schooling and retirement choices and obtaining its implications are the key concerns of this paper, we decide not to include social security.

5 The health index θ b is defined such that an increase in θ b is interpreted as health improvement (or mortality decline). Thus, ∂μ(t; θ b)/∂θ b is always non-positive because an increase in θ b leads to a downward shift (or at least no shift) of the age-specific mortality rate function μ(t; θ b).

6 In principle, the disutility of labor and disutility of study are different concepts. However, modeling them separately would lead to an extra term, the difference of disutility of study and the disutility of labor at age S*, in the first-order condition (7), where S* is the optimal level of schooling years. We do not take this approach because of two reasons. First, these two terms are not directly observable, and considering the predictions of the model based on them may not be desirable. Second, both terms are likely to be quite small when the individual is young (such as at the optimal age S*), and equation (7) holds as a good approximation in this case.

7 In this model, human capital is accumulated only through formal schooling, following Bils and Klenow (Reference Bils and Klenow2000) and Hazan (Reference Hazan2009). On the other hand, human capital is also accumulated through on-the-job training in Manuelli et al. (Reference Manuelli, Seshadri and Shin2012).

8 The similarities in these two papers are as follows. They both assume a perfect capital market. In their quantitative analyses, they assume a constant growth rate of productivity, and the interest rate equals the rate of pure discounting.

9 The absence of such a term in our model corresponds to ζ = 0 in (9) of Bils and Klenow (Reference Bils and Klenow2000) and β = 0 in (1) of Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013). Note that in section 5, we will extend the model to incorporate the direct utility benefit of schooling.

10 To avoid an unnecessarily lengthy expression, we do not specify the dependence of relevant functions on θ b and ϕ b in (7) to (10), since we focus on optimal choices for a given cohort in this section. When we consider the comparative static results in later sections (with θ b and/or ϕ b changing), the dependence of relevant functions on θ b and ϕ b will be specified explicitly.

11 We could replace R in (7) by R e, where R e is the anticipated retirement age, if we want to emphasize the role of anticipated retirement age in the optimal schooling years function. We then need to further impose that the actual and anticipated values of retirement age are equal (R* = R e) at equilibrium. On the other hand, our simplification by using $\tilde{S}( R)$ instead of $\tilde{S}( R^e)$ is consistent with the interpretation that the individual makes schooling and retirement choices simultaneously in this model. Since no element of time inconsistency exists in our model, both specifications give the same results.

12 If we include the direct cost of education in the model, then it would become a second component of the marginal cost term. Since the presence or absence of the direct cost of education does not affect our main results (Propositions 1–6), we follow other researchers [such as Hazan (Reference Hazan2009); Cervellati and Sunde (Reference Cervellati and Sunde2013); and Restuccia and Vandenbroucke (Reference Restuccia and Vandenbroucke2013)] and do not include this factor.

13 According to (A.6) in Appendix A, when the retirement age (R) increases, the marginal cost is given by the expected disutility term l(R)ν(R), which is then discounted back to age 0 as exp ( − ρR)l(R)ν(R). On the other hand, the marginal benefit is given by the discounted expected increase in labor income, which is exp ( − rR)l(R)ϕ bh(S). This is multiplied by the marginal utility $[ {\phi_bc^n( 0, \;S, \;R) } ] ^{-{1 \over \sigma }}$ to convert it to utility units at age 0. Equation (8) is obtained after cancelling the common term l(R).

14 Note that the term $( \phi _b) ^{1-{1 \over \sigma }}$ in (8) comes from these two effects. One component, ϕ b, comes from the effect of a change in the productivity level on the opportunity cost of delaying retirement, which is labor income ϕ bh(S), and is associated with the substitution effect. The other component, $( \phi _b) ^{-{1 \over \sigma }}$, comes from the marginal utility of initial consumption level (${[ {\phi_bc^n( 0, \;S, \;\tilde{R}( S) ) } ] } ^{-{1 \over \sigma }}$), and is associated with the income effect.

15 An alternative way to express the first-order conditions is obtained by substituting (9) and (10) into (7) and (8). We do this when we conduct comparative static analysis in section 3. However, we keep the definitions of $\tilde{S}( R)$ in (7) and $\tilde{R}( S)$ in (8), because these two terms are particularly useful in interpreting the results in section 3.

16 Note that the solution of (15) and (16) is also given by (17) and (18) with ψ = ϕ b, once we recognize that $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \phi _b = 0$.

17 We decide to consider these two systems separately because the underlying economic reasons are different for the two cases.

18 In section 5, we will further comment on this point for the extended model.

19 As will be seen in Proposition 6, positive values of $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial R$ and $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S$ do not necessarily lead to the positive co-movement of the two endogenous variables. We need to consider this endogenous interaction component together with another component: the signs of the direct effects of the exogenous shocks.

20 Positive feedback is perhaps easiest to understand in a dynamic setting, in which the responses occur sequentially. For example, according to Vietorisz and Harrison (Reference Vietorisz and Harrison1973, p. 369), “positive feedback arises when the induced effect—after completion of the cycle—has the same sign as the original effect and thus reinforces it.” While we do not emphasize the dynamic process of the interaction of the two endogenous variables in our model, we posit that the characterization of positive feedback is appropriate because Proposition 1 suggests that the positive response of one endogenous variable reinforces the movement of the other and vice versa. The emphasis of “mutually reinforcing elements” is also found in the study of positive feedback by Arthur (Reference Arthur1990, p. 99).

21 Ben-Porath (Reference Ben-Porath1967) is interested to know why an individual engages more in human capital investment at a young age. His analysis focuses on an individual of a particular cohort, and the assumption of a fixed retirement age (of people of the same cohort) is reasonable. On the other hand, we study how mortality decline and productivity increase, by changing the expected lifetime wealth, may affect the life-cycle choices (including schooling) of individuals of different cohorts. In this context, the assumption of an unchanged retirement age (for different cohorts) is less desirable, and it is better to allow both schooling years and retirement age to be endogenously determined.

22 The term $\partial \tilde{R}( S^\ast ; \;\phi _b, \;\theta _b) /\partial S$ is unimportant for the proof of Proposition 2 because $\partial \tilde{S}( R^\ast ; \;\theta _b) /\partial \phi _b = 0$, but is important generally; see, for example, the analysis of the effect of mortality decline in section 3.3.

23 The ingredients of the proof of Proposition 3 are very similar to those of Proposition 2. The only difference is that $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \phi _b = 0$ (resp. >0) when σ = 1 (resp. >1).

24 Cai and Lau (Reference Cai and Lau2017, section 3) provide proof of this result in a model with endogenous schooling years but exogenous retirement age.

25 We have verified that (21) holds computationally for the USA from 1900 to 2000.

26 Note that D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012) focus on mortality decline at an arbitrary age to show that mortality reductions at different ages have systematically different effects on retirement age. On the other hand, our specification allows mortality changes occurring at all ages, and we use a change in θ b to capture this more general mortality change. However, both (23) of D'Albis et al. (Reference D'Albis, Lau and Sánchez-Romero2012) and (A.19) have similar economic interpretations.

27 Note that people who retire in the 1990s correspond roughly to various cohorts born in the 1920s and 1930s.

28 The point can be seen from (18) with ψ = θ b. In a model with exogenous schooling years, $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial S = 0$. Thus, a necessary and sufficient condition for ∂R*/∂θ b < 0 is a negative value of the direct effect $\partial \tilde{R}( S^\ast ; \;\theta _b, \;\phi _b) /\partial \theta _b$.

29 Note that μ GM(x; θ b) does not tend to infinity for finite x and thus is different from the convenient assumption of a finite maximum age (T) in the theoretical model. However, this discrepancy does not pose any practical problem in our computational analysis, because we assume T = 110 − N or T = 115 − N, and the estimated values of (l GM(x + N; θ b)/l GM(N; θ b)) are effectively zero for x + N > 110 (see Figure 2).

30 Note than Bloom et al. (Reference Bloom, Canning and Moore2014) assume N = 20, because they focus only on retirement.

31 We use the data, kindly provided by Diego Restuccia, in the 5-year interval to minimize computational time. Our results are essentially the same when we use the annual data.

32 Bloom et al. (Reference Bloom, Canning and Moore2014) also calibrate the disutility parameter of their model such that the optimal retirement age for the 1900 cohort is 65. Note that we present the numerical values of various variables in actual age for easier comparison. For example, we add N to the calculated values of R* (expressed in model age). We also add N − 6 to the calculated values of S*, corresponding to the assumption that children go to school at age 6.

33 At first glance, it appears that including schooling choice in the life-cycle model represents a detrimental move if our main objective is to explain the decrease in retirement age for the cohorts born in the first half of the twentieth century. However, we think that schooling years, together with retirement age, should be modeled as choice variables in studying the impact of mortality decline and productivity increase. This paper adopts this approach, and explains the decreasing trend of retirement age, even though it mainly examines other issues. As will be shown in section 5, introducing the utility benefit of schooling in addition to its productivity-enhancing role helps explain the decrease in retirement age in a model with both schooling and retirement choices.

34 Another possibility is to include social security and changes in social security benefits over time, which have become particularly relevant for advanced countries in the last few decades. See, for example, Gruber and Wise (Reference Gruber and Wise1998, Reference Gruber, Wise, Gruber and Wise1999).

35 Some researchers, such as Hansen and Lønstrup (Reference Hansen and Lønstrup2012) and Sánchez-Romero et al. (Reference Sánchez-Romero, d'Albis and Prskawetz2016), have focused on using mortality changes to explain the negative co-movement of schooling years and retirement age. While the analysis in section 3.3 has pointed out that this is possible by specifying various factors in (23) such that the direct effect of a mortality decline on retirement age is strongly negative, we believe that this may not be the best approach, partly because of our computational results that the direct effect of a mortality decline on retirement age is always positive. Moreover, such an approach does not explain the positive effect of a productivity increase on schooling years. Our proposed method deals with both issues. Cai (Reference Cai2017, Chapter 3) also examines similar issues.

36 Objective functions (2) and (32) can be represented as two special cases of a more general objective function:

$$\int_0^T {\exp } ( -\rho x) l( x; \;\theta _b) {{c{( x) }^{1-{1 \over \sigma }}-1} \over {1-{1 \over \sigma }}}{\rm d}x + \zeta \int_0^S {\exp } ( -\rho x) l( x; \;\theta _b) {\rm d}x-\int_0^R {\exp } ( -\rho x) l( x; \;\theta _b) \nu ( x; \;\theta _b) {\rm d}x.$$

When ζ = 0, this function becomes (2). When ∂l(x; θ b)/∂θ b = ∂ν(x; θ b)/∂θ b = 0, it becomes (32).

37 The intuition is as follows. When ζ = 0, one objective of the individual is to maximize lifetime wealth. Thus, (1/c n(0, S*, R*)) (∂c n(0, S*, R*)/∂S) = 0. When ζ > 0, a non-pecuniary effect occurs, and thus, at the optimal choice, “too much” human capital is accumulated when compared with the model with ζ = 0. Thus, (1/c n(0, S*, R*))(∂c n(0, S*, R*)/∂S) < 0 for the extended model.

38 Note that ζ only affects (33) through (34), but does not affect (8). As a result, different values of ζ in the three cases in Figure 5 are reflected in different positions of $\tilde{S}( R^\ast ; \;\phi _2)$, but the position of $\tilde{R}( S^\ast ; \;\phi _2)$ remains unchanged.

39 In case (c), the direct effect $\partial \tilde{R}( S^\ast ; \;\phi _b) /\partial \phi _b$ is negative, but the total effect ∂R*/∂ϕ b is positive, because ζ is very large and the indirect effect (through the endogenous change in schooling years) is very strong. We believe this case is not empirically important, but we list all three cases in Proposition 6 for the sake of completeness.

40 It can be seen from (A.5) and (A.6) that the first-order condition for S is defined by ∂U b/∂S = a 1F(S, R) = 0, and that for R is defined by ∂U b/∂R = a 2G(S, R) = 0. Therefore, at the optimal choices S* and R*, ∂2U b/∂S 2 = a 1(∂F/∂S), ∂2U b/∂R 2 = a 2(∂G/∂R), and (∂2U b/∂SR) = a 1(∂F/∂R) = a 2(∂G/∂S). Thus, (∂2U b/∂S 2)(∂2U b/∂R 2) − [∂2U b/∂SR]2 > 0 is equivalent to 1 − ((∂F/∂R)/(∂F/∂S))((∂G/∂S)/(∂G/∂R)) > 0, which can be simplified to (A.9), because (∂S*/∂R) = −((∂F/∂R)/(∂F/∂S)) and (∂R*/∂S) = −((∂G/∂S)/(∂G/∂R)). Alternatively, we can differentiate U b(S, R) directly to obtain an expression similar to (A.7) and (A.8). That expression can be shown to be equivalent to (A.9), after using (13) and (14). We prefer (A.9) as it is directly useful for subsequent analysis.

References

Arthur, W. B. (1990) Positive feedbacks in the economy. Scientific American 262(2), 9299.CrossRefGoogle Scholar
Ben-Porath, Y. (1967) The production of human capital and the life cycle of earnings. Journal of Political Economy 75, 352365.CrossRefGoogle Scholar
Bils, M. and Klenow, P. J. (2000) Does schooling cause growth? American Economic Review 90, 11601183.CrossRefGoogle Scholar
Bloom, D. E., Canning, D., Mansfield, R. K. and Moore, M. (2007) Demographic change, social security systems, and savings. Journal of Monetary Economics 54, 92114.CrossRefGoogle ScholarPubMed
Bloom, D. E., Canning, D. and Moore, M. (2014) Optimal retirement with increasing longevity. Scandinavian Journal of Economics 116, 838858.CrossRefGoogle ScholarPubMed
Boucekkine, R., de la Croix, D. and Licandro, O. (2002) Vintage human capital, demographic trends and endogenous growth. Journal of Economic Theory 104, 340375.CrossRefGoogle Scholar
Boucekkine, R., de la Croix, D. and Licandro, O. (2003) Early mortality declines at the dawn of modern growth. Scandinavian Journal of Economics 105, 401418.CrossRefGoogle Scholar
Cai, Z. (2017) Essays on demographic changes and life-cycle behavior. PhD thesis, University of Hong Kong.Google Scholar
Cai, Z. and Lau, S.-H. P. (2017) Impact of mortality reductions on years of schooling and expected lifetime labor supply. Journal of Mathematical Economics 72, 134144.CrossRefGoogle Scholar
Cervellati, M. and Sunde, U. (2013) Life expectancy, schooling, and lifetime labor supply: theory and evidence revisited. Econometrica 81, 20552086.Google Scholar
Chakraborty, S. (2004) Endogenous lifetime and economic growth. Journal of Economic Theory 116, 119137.CrossRefGoogle Scholar
Costa, D. L. (1998) The Evolution of Retirement. Chicago: University of Chicago Press.CrossRefGoogle Scholar
D'Albis, H., Lau, S.-H. P. and Sánchez-Romero, M. (2012) Mortality transition and differential incentives for early retirement. Journal of Economic Theory 147, 261283.CrossRefGoogle Scholar
Echevarria, C. A. and Iza, A. (2006) Life expectancy, human capital, social security and growth. Journal of Public Economics 90, 23232349.CrossRefGoogle Scholar
Flood, S., King, M., Rodgers, R, Ruggles, S and Warren, J. R. (2018) Integrated Public Use Microdata Series, Current Population Survey: Version 6.0 [dataset]. Minneapolis, MN: IPUMS. https://doi.org/10.18128/D030.V6.0.CrossRefGoogle Scholar
Fries, J. F. (1980) Aging, natural death, and the compression of morbidity. New England Journal of Medicine 303(3), 130135.CrossRefGoogle ScholarPubMed
Goldin, C. and Katz, L. F. (2008) The Race Between Education and Technology. Cambridge, MA: Harvard University Press.Google Scholar
Gruber, J. and Wise, D. A. (1998) Social security and retirement: an international comparison. American Economic Review Papers and Proceedings 88(2), 158163.Google Scholar
Gruber, J., and Wise, D. A. (1999) Introduction and summary. In Gruber, J. and Wise, D. A. (eds.), Social Security and Retirement around the World. Chicago: University of Chicago Press, pp. 135.Google Scholar
Hansen, C. W. and Lønstrup, L. (2012) Can higher life expectancy induce more schooling and earlier retirement? Journal of Population Economics 25, 12491264.CrossRefGoogle Scholar
Hazan, M. (2009) Longevity and lifetime labor supply: evidence and implications. Econometrica 77, 18291863.Google Scholar
Heijdra, B. J. and Romp, W. E. (2009) Human capital formation and macroeconomic performance in an ageing small open economy. Journal of Economic Dynamics and Control 33, 725744.CrossRefGoogle Scholar
Kalemli-Ozcan, S. and Weil, D. (2010) Mortality change, the uncertainty effect, and retirement. Journal of Economic Growth 15, 6591.CrossRefGoogle ScholarPubMed
Maestas, N. and Zissimopoulos, J. (2010) How longer work lives ease the crunch of population aging. Journal of Economic Perspectives 24(1), 139160.CrossRefGoogle ScholarPubMed
Manuelli, R. E., Seshadri, A. and Shin, Y. (2012) Lifetime labor supply and human capital investment. Working paper, Washington University St. Louis.CrossRefGoogle Scholar
Restuccia, D. and Vandenbroucke, G. (2013) A century of human capital and hours. Economic Inquiry 51, 18491866.CrossRefGoogle Scholar
Sánchez-Romero, M., d'Albis, H. and Prskawetz, A. (2016) Education, lifetime labor supply, and longevity improvements. Journal of Economic Dynamics and Control 73, 118141.CrossRefGoogle Scholar
Sheshinski, E. (2009) Uncertain longevity and investment in education. Working Paper No. 2784, CESifo.CrossRefGoogle Scholar
Vietorisz, T. and Harrison, B. (1973) Labor market segmentation: positive feedback and divergent development. American Economic Review 63(2), 366376.Google Scholar
Yaari, M. E. (1965) Uncertain lifetime, life insurance, and the theory of the consumer. Review of Economic Studies 32, 137150.CrossRefGoogle Scholar
Figure 0

Figure 1. Schooling and retirement age across different cohorts.

Figure 1

Figure 2. Survival probability: data and estimated Gompertz-Makeham survival functions.

Figure 2

Figure 3. Schooling years.

Figure 3

Table 1. Parameters of the baseline model

Figure 4

Figure 4. Baseline results.

Figure 5

Figure 5. The impact of productivity increase for the extended model.

Supplementary material: PDF

Cai et al. supplementary material

Appendix

Download Cai et al. supplementary material(PDF)
PDF 311 KB