Hostname: page-component-745bb68f8f-v2bm5 Total loading time: 0 Render date: 2025-02-11T07:46:18.904Z Has data issue: false hasContentIssue false

Social security, income inequality and growth*

Published online by Cambridge University Press:  18 May 2011

GILLES LE GARREC
Affiliation:
Sciences Po – OFCE, 69 quai d'Orsay, 75340 Paris Cedex 07, France (e-mail: gilles.legarrec@ofce.sciences-po.fr)
Rights & Permissions [Opens in a new window]

Abstract

In most industrial countries, public pension systems redistribute from workers to retired people, not from high-income to low-income earners. They are close actuarial fairness. However, they are not all equivalent. In particular, some pension benefits are linked to full lifetime average earnings, while others are only linked to partial earnings history. In the latter case, we then show in this article that an actuarially fair pay-as-you-go pension system can both reduce lifetime income inequality and enhance economic growth. We also shed light on the dilemma between inequality and economic growth in retirement systems: greater progressivity results in less lifetime inequlity but also less growth.

Type
Articles
Copyright
Copyright © Cambridge University Press 2011

1 Introduction

For several years, industrial countries have simultaneously experienced an increase in their life expectancy and weakness of their fertility, two trends that characterize the aging process of population. Therefore, with an unchanged age of retirement, the ratio of pensioners to workers (the dependency ratio) should reach in France, for example, 70.1% in 2040, whereas it was 35.8% in 1990. The debate on the financing of our public retirement systems is crucially related to the dependency ratio. Indeed, these systems are financed on a pay-as-you-go (PAYG) basis, i.e. pension benefits are paid through contributions of contemporary workers. Hence, they must cope with the increasingly larger number of pensioners compared to the number of contributors. Changes are unavoidable. If we want to guarantee in the near future the current level of benefits within the same system, it will be necessary either to increase the contribution rate or the length of contribution (by delaying the age of retirement). However, this financing problem calls into question the role of PAYG retirement systems in our societies. For instance, by evaluating the real pre-tax return on non-financial corporate capital at 9.3%Footnote 1 and the growth rate over the same period (1960 to 1995) at 2.6%, Feldstein (Reference Feldstein1995a, Reference Feldsteinb, Reference Feldstein1996) unequivocally advocates the privatisation of retirement systems and the change to fully funded systems. He thus assesses the potential present-value gain to nearly $20 trillion for the United States.

Faced with these financial arguments, it is often sustained (see Cutler, Reference Cutler and Feldstein1998) that PAYG retirement systems are essential instruments for fighting poverty that privatization would end. One can indeed observe that most pension benefit formulas are progressive, especially in Anglo-Saxon countries where pensions are weakly related to earnings (see OECD, 2007). However, appearances can be misleading. When considering the American benefit formula, all empirical studies focusing on the redistributive aspect of the retirement system stressed that the progressivity of the system is only apparent because the formula does not take into account specificities related to gender, life expectancy or institutional features (Burkhauser and Walick, Reference Burkhauser and Walick1981; Garrett, Reference Garrett1995; Gustman and Steinmeier, Reference Gustman and Steinmeier2001; Coronado et al., Reference Coronado, Fullerton and Glass1999, Reference Coronado, Fullerton and Glass2000; Brown et al., Reference Brown, Coronado and Fullerton2006). First, the redistribution within the system is carried out from men towards women. Second, redistribution within the system is to the advantage of people who live longer and, as noted by Deaton and Paxton (Reference Deaton and Paxton1998, Reference Deaton and Paxton1999), differences in life expectancies are strongly related to social inequality: high-income earners live longer than low-income earners. Third, as argued by Lindbeck and Persson (Reference Lindbeck and Persson2003) and Bozio and Piketty (Reference Bozio and Piketty2008), institutional features such as linking pensions to the best or last years tend to favor those with steep age-earnings profiles, i.e. again high-income earners. When considering these elements, Gustman and Steinmeier (Reference Gustman and Steinmeier2001) show that retirement system returns are almost identical whatever the household earnings. In the same line, Coronado et al. (Reference Coronado, Fullerton and Glass2000) and Brown et al. (Reference Brown, Coronado and Fullerton2006) show that the U.S. Social Security has no impact on the GINI index measuring income inequality. As the American pension benefit formula is one of the most progressive (see OECD, 2007), most retirement systems in the industrial world appear, in fact, close to actuarial fairnessFootnote 2 (see Stahlberg, Reference Stahlberg and Persson1990, for the Swedish system).

In this article, focusing on the age-earnings profile feature highlighted by Lindbeck and Persson (Reference Lindbeck and Persson2003) and Bozio and Piketty (Reference Bozio and Piketty2008), we analyze the extent to which public PAYG retirement systems can reduce income inequality whereas most of them are actuarially fair. In particular, we investigate the relation between pension benefits and earnings history. In some countries, the earnings-related part of pension benefits is linked to full lifetime average earnings. In others like Greece, Spain, Sweden (before the 1994 legislation), France and Norway it is linked to the best or last years. As pointed out by Lindbeck and Persson (Reference Lindbeck and Persson2003) and Bozio and Piketty (Reference Bozio and Piketty2008), the way pension benefits are calculated when considering heterogenous work histories and age-earnings profiles can have important consequences in terms of income redistribution, even if the system as a whole stays actuarially fair. To study the impact of the calculation of pension benefits on income redistribution, the construction of a framework, which accurately adresses the main features of work history and how earnings are determined, is necessary.

On one hand, earnings are strongly related to human capital (e.g. Mincer, Reference Mincer1997, Neal and Rosen, Reference Neal, Rosen, Atkinson and Bourguignon2000). As learning activities and human capital formation are concentrated at young ages, a work history can be summarized by two different periods. During the first period, people invest time to be trained, including higher education and job training. This first period is characterized by relatively low earnings. In the second period, they benefit from their human capital investment and then earn more. Age-earnings profiles are thus increased, except for high-school dropouts whose age-earnings profiles are almost flat (e.g., Andolfatto et al., Reference Andolfatto, Ferrall and Gomme2000). Lillard (Reference Lillard1977) then highlighted that the age-earnings profile of a worker is increased with his learning ability and the time he has spent in training. In order to replicate these facts, the Ben-Porath (Reference Ben-Porath1967) human capital model has been widely used (e.g., Mincer, Reference Mincer1997, Neal and Rosen, Reference Neal, Rosen, Atkinson and Bourguignon2000). In this model, individuals maximize the present value of their lifetime earnings by allocating their time between training activities and work. It predicts accurately that individuals with higher abilities will invest more in human capital and therefore will have steeper age-earnings profiles than their counterparts with lower abilities.

On the other hand, earnings are very significantly linked to the earnings of their parents (Bowles and Gintis, Reference Bowles and Gintis2002; d'Addio, Reference d'Addio2007). The easiest way to replicate this fact in the Ben-Porath (Reference Ben-Porath1967) model consists in assuming that children inherit part of the human capital of their parents. However, as shown by Huggett et al. (Reference Huggett, Ventura and Yaron2006), when individuals differ only in their initial human capital endowment, the model generates a counterfactual pattern regarding the US earnings distribution. By contrast, they show that the US earnings distribution can be replicated quite well when considering differences in learning ability across individuals. This suggests that intergenerational earnings persistence is based on the inheritability of learning ability within families. Supporting such a view, education appears as a major contributor to intergenerational earnings mobility and educational differences tend to persist across generations (d'Addio, Reference d'Addio2007). However, as shown by Bowles and Gintis (Reference Bowles and Gintis2002), it does not imply that intergenerational earnings are based only on genetic transmission. Learning ability also reflects non-cognitive personality traits such as, a taste for learning at school, which can be influenced by the family background as much as by the genes.

The rest of the paper is organized as follows. In Section 2, we present the basic assumptions related to the age-earnings profiles and the calculation of pension benefits. In Section 3, we analyze optimal behaviors of individuals and firms considering the basic assumptions. Following Huggett et al. (Reference Huggett, Ventura and Yaron2006), we assume, in particular, that individuals differ only in their learning abilities. In Section 4, we specify the equilibrium features when retirement systems are actuarially fair. In Section 5, we then show that actuarially fair retirement systems, depending on the calculation of pension benefits, can both enhance economic growth and reduce lifetime income inequality. We also shed light on the dilemma between income inequality and economic growth in retirement systems: greater progressivity results in less lifetime income inequality but also less growth. In the last section, we briefly conclude.

2 Age-earnings profile and pension benefits: basic assumptions

2.1 Human capital and age-earnings profiles

The model is a version of the Ben-Porath model (Reference Ben-Porath1967) . Individuals live for three periods: they are young, adult and old. When young, they go to school. During this period, which corresponds to primary and secondary education (compulsory schooling), individuals born in t−1 learn basic knowledge represented by the average knowledge \mathop {\overline{Z} }\nolimits_{t \minus \setnum{1}} of the contemporary working generation. In addition, they can choose to make an effort e t−1 in learning (where e t−1=0 or 1) to pass the final secondary school examination, qualifying for university entrance. In the second period, those who have made the effort can then complement their basic knowledge by pursuing training during a period h t instead of entering directly the labor marketFootnote 3. At the end of their complementary training, their human capital is characterized by:

(1)
Z_{t}^{s} \equals Bh_{t}^{\delta } \mathop {\overline{Z} }\nolimits_{t \minus \setnum{1}} \comma \quad B \gt 0\comma \quad \delta \gt 0\comma

where δ represents the return to complementary training in terms of human capital.

Skilled workers, those who have completed their training before entering the labor market, are thus characterized by a first period h t with no earnings. Afterwards, they earn Z tsw t, where w t is the wage rate per unit of effective labor. Earnings of skilled workers W ts over their whole active period are thus:

(2)
W_{t}^{s} \equals \lpar 1 \minus h_{t} \rpar Z_{t}^{s} w_{t} \comma

and are then characterized by a steep profile. By contrast, unskilled workers are characterized by the basic human capital during all their working period:

(3)
Z_{t}^{u} \equals \mathop {\overline{Z} }\nolimits_{t \minus \setnum{1}}\! \comma

and are then characterized by flat age-earnings profiles:

(4)
W_{t}^{u} \equals Z_{t}^{u} w_{t}.

In a simple and efficient way, we then specify an economy in the line of Lillard (Reference Lillard1977) and Andolfatto et al. (Reference Andolfatto, Ferrall and Gomme2000) where age-earnings profiles of workers increased with the time spent in training and where high-school dropouts have flat age-earnings profiles.

2.2 Pension benefits

The calculation of pension benefits is specific to each country, and can sometimes be very complex. Nevertheless, it has been summarized (see Casamatta et al., Reference Casamatta, Cremer and Pestieau2000; Docquier and Paddison, Reference Docquier and Paddison2003; Sommacal, Reference Sommacal2006) by considering two different parts, a redistributive part (the Beveridgean part), characterized by a basic flat-rate benefit, and an insurance part (the Bismarckian part) characterized by earnings-related benefits. The latter is not necessarily proportional to all contributions and then based on full lifetime average earnings (see OECD, 2007). It is particularly the case in Greece and Spain where benefits are only linked to final salary. It also used to be the case in Sweden before the 1994 legislation. In France, before the Balladur reform of 1993, earnings-related benefits were linked to the ten best years, then gradually to the 25 best years after the reform. In the United States, the 35 best years are considered to calculate the benefits, 20 in Norway.

Let us define \mathop {\tilde{W}}\nolimits_{t}^{\, i} , i=s, u, as the representative earnings on which benefits are based. It does not matter which period is used to calculate the unskilled representative earnings because the age-earnings profile is consistently flat. It follows that:

(5)
\mathop {\tilde{W}}\nolimits_{t}^{\, u} \equals W_{t}^{\, u}.

For the skilled workers, if the reference earnings \mathop {\tilde{W}}\nolimits_{t}^{\, s} corresponds to full lifetime average earnings, we have then W ts, otherwise, if it corresponds to the best or last years, we have Z tsw t. The representative earnings of skilled workers can then be specified as

(6)
\mathop {\tilde{W}}\nolimits_{t}^{\, s} \equals \mu Z_{t}^{s} w_{t} \plus \lpar 1 \minus \mu \rpar W_{t}^{\, s} \comma

where μ=0 if pension benefits are linked to full lifetime average earnings or μ=1 if they are linked to the best or last years.

Assuming that the basic flat-rate benefit \bar{p}_{t \plus \setnum{1}} is based on the contemporary wage of unskilled workersFootnote 4, the calculation of pension benefits for any worker in t is then given by

(7)
p_{t \plus \setnum{1}} \equals \theta _{t \plus \setnum{1}} \mathop {\tilde{W}}\nolimits_{t} \plus \nu _{t \plus \setnum{1}} W_{t \plus \setnum{1}}^{u} \comma

where νt+1 represents the size of the flat-rate component of pension benefits and θt+1 the size of the earnings-related component.

As noted in the introduction, most retirement systems of industrialized economies are close to actuarial fairness. In terms of the retirement system implicit return, i.e. the ratio of the pension benefits received by an individual to the amount of his contributions, this means that

(8)
{{p_{t}^{u} } \over {\tau W_{t \minus \setnum{1}}^{u} }} \approx {{p_{t}^{s} } \over {\tau W_{t \minus \setnum{1}}^{s} }}\comma

where τ is the public pension system contribution rate and p ti is the pension in t of a worker of type i in t−1, i=u, s. If (p tuW t−1u)>(p tsW t−1s), then the retirement system is fiscally to the advantage of low-income earners. In this case, the system is progressive. In the opposite case, (p tuW t−1u)<(p tsW t−1s), it is regressive. By definition, pure contributory retirement systems, i.e., with pension benefits proportional to all contributions and based on full lifetime average earnings, are actuarially fair. It is the case in our framework if \bar{p} \equals 0 and μ=0.

3 Optimal behaviors

3.1 Individuals

Preferences of an individual of type x born in t−1 are described by the following utility function:

(9)
U_{x} \equals {\rm ln}\, c_{t} \plus \beta\, {\rm ln}\, d_{t \plus \setnum{1}} \plus \varepsilon\, {\rm ln}{\kern 1pt} \lpar 1 \minus x\rpar {\kern 1pt} e_{t \minus \setnum{1}} \comma

where c t and d t+1 are, respectively, his consumption when adult and when old, and β is the discount factor; ε ln(1−x) is the utility cost of schooling effort, where x∊[0, 1] represents learning ability. In this setting, a talented child characterized by x=0 endures no cost in making the effort. By contrast, a lazy or untalented child characterized by x=1 will endure an infinite cost and will then always choose not to make the effort, i.e. e t−1=0. As explained in the introduction, x can be considered as an inherited (perfectly here) trait that represents family background and genetic transmission. For simplicity, we will assume that x is uniformly distributed over the population.

During the second life period, individuals consume a part of their disposable income, and save such as

(10)
c_{t} \plus s_{t} \equals W_{t} \lpar 1 \minus \tau \rpar,

where s t is the savings.

In the third life period, individuals are retired. They get back the savings lent to firms with interest, receive their pension from the public retirement system and consume their wealth. The budget constraint is then

(11)
d_{t \plus \setnum{1}} \equals R_{t \plus \setnum{1}} s_{t} \plus p_{t \plus \setnum{1}} \comma

where R t+1 is the real interest factor.

Let \rmOmega _{t}^{i} \equals W_{t}^{i} \lpar 1 \minus \tau \rpar \plus {\textstyle{{p_{t \plus \setnum{1}}^{i} } \over {R_{t \plus \setnum{1}} }}} be the lifetime income of a worker of type i, i=u, s. Considering the reference earnings (6) of a skilled worker in the calculation of his pension benefits, an individual who has chosen to make the effort at school will maximize his lifetime income by spending the following time in training during his second life period:

(12)
h_{t} \equals {\rm inf} \left\{ {h^{\setnum{0}} \left[ {1 \plus {\mu \over {1 \minus \tau \plus \lpar 1 \minus \mu \rpar {\textstyle{{\theta _{t \plus \setnum{1}} } \over {R_{t \plus \setnum{1}} }}}}}{{\theta _{t \plus \setnum{1}} } \over {R_{t \plus \setnum{1}} }}} \right] ; \ 1} \right\}\comma

where h 0=δ/(1+δ) is the training length with no retirement system.

Proposition 1.Linking pension benefits to partial earnings history generates an incentive to be trained longer.

Retirement systems whose pension benefits are based, even partially, on the best or last years generate an incentive for longer training. Initially, the lengthening of training has a negative effect on income. During this period individuals have indeed no earning capacity. However, they earn more afterwards. In addition, as pensions are linked to the best or last years they also benefit, all things being equal, from an increase in their benefits. Following equation (12), individuals who undertake training may find it profitable to be trained longer as an investment in their pension benefits. Note that this incentive disappears completely if pension benefits are based on full lifetime average earnings (\mathop {\tilde{W}}\nolimits_{t}^{s} \equals W_{t}^{s} ) or if the system is totally flat-rate (θt+1=0). Note also that this incentive is weaker as the interest rate increases. Indeed, the higher the interest rate, the lower the present actuarial value of pension benefits.

To summarize, the incentive to be trained longer, generated by the retirement system, is due to the interaction of two factors:

  • pension benefits are linked to the best or last years

  • training results in steeper age-earnings profiles

The utility maximization of an individual subject to budgetary constraints (10) and (11) leads to the following saving function:

(13)
s_{t} \equals {\beta \over {1 \plus \beta }}W_{t}\lpar 1 \minus \tau \rpar \minus {1 \over {1 \plus \beta }}{{p_{t \plus \setnum{1}} } \over {R_{t \plus \setnum{1}} }}.

By reducing simultaneously the disposable income and the need for a future income, a retirement system reduces private savings. This result holds irrespective of the calculation of pension benefits and their financing.

Last, an individual will choose to make the effort at school if the opportunity of complementary training entails a monetary benefit higher than the utility cost associated with the effort, i.e. if (1+β) ln Ωts+ε ln(1−x)⩾(1+β) ln Ωtu. Given the uniform distribution of the parameter x over the population, the proportion of individuals x t* who choose to be trained in t (and then to make the schooling effort in t−1) to become skilled workers is defined by:

(14)
x_{t}\hskip -2.5pt{\ast} \equals 1 \minus \mathop {\left( {I_{t} } \right)}\nolimits^{ \minus \lpar \setnum{1} \plus \beta \rpar \sol \varepsilon } \comma

where I ttstu represents the lifetime income inequality between skilled and unskilled workers in t. Following (14), the higher this inequality, the larger the proportion of individuals incited to be trained: {\textstyle{{{\rm d}x_{t}{ \ast } } \over {{\rm d}I_{t} }}} \gt 0.

3.2 Firms

We consider a competitive sector characterized by a representative firm producing a good, which can be either consumed or invested, according to a Cobb–Douglas technology with constant return to scale:

(15)
Y_{t} \equals F\left( {K_{t} \comma L_{t}^{u} \comma L_{t}^{s} } \right) \equals AK_{t}^{\alpha}\!\! \mathop {\left( {Z_{t}^{u} L_{t}^{u} \plus \left( {1 \minus h_{t} } \right)Z_{t}^{s} L_{t}^{s} } \right)}\nolimits^{\setnum{1} \minus \alpha } \comma {\rm \ }0 \lt \alpha \lt 1\comma

where Y t is the output, K t the capital physical stock, L ti the number of workers of type i in t, i=u, s, and A the total factor productivity. Assuming for simplicity as in Docquier and Paddison (Reference Docquier and Paddison2003), that skilled and unskilled labors are perfect substitutesFootnote 5, L t=Z tuL tu+(1−h t)Z tsL ts represents the labor supply in efficiency units.

Denoting per capita efficient capital by k_{t} \equals {\textstyle{{K_{t} } \over {L_{t} }}} and assuming a total capital depreciation, the optimal conditions resulting from the maximization of the profit are:

(16)
R_{t} \equals A\alpha k_{t}^{\alpha \minus \setnum{1}} \comma
(17)
w_{t} \equals A\left( {1 \minus \alpha } \right)k_{t}^{\alpha }.

Before studying the impact of retirement systems and the calculation of pension benefits on growth and income inequality, we have to characterize the equilibrium and its properties.

4 Equilibrium

The economy is composed of four markets corresponding to the unskilled labor, the skilled labor, the physical capital and the good. In a closed-economy setting, the general equilibrium can be obtained by considering only the clearing of three markets, as according to the Walras law, the fourth is necessarily cleared. In our case, we consider the clearing of the following markets:

  • unskilled labor:

    (18)
    L_{t}^{u} \equals \left( {1 \minus x_{t}{\hskip -3pt \ast } } \right)N_{t \minus \setnum{1}}.
  • skilled labor:

    (19)
    L_{t}^{s} \equals x_{t}{\hskip -2.5pt \ast } N_{t \minus \setnum{1}}.
  • physical capital:

    (20)
    K_{t \plus \setnum{1}} \equals N_{t \minus \setnum{1}} \left[ {x_{t}{\hskip -2.5pt \ast } s_{t}^{s} \plus \left( {1 \minus x_{t}{\hskip -2.5pt \ast } } \right)s_{t}^{u} } \right].

4.1 PAYG social security and the capital accumulation

Retirement systems have PAYG features, i.e. within a period, pension benefits are financed by contributions of workers of the same period. In other words, retirement systems transfer workers' income towards pensioners. Since workers are either skilled or unskilled, the social security balanced budget is defined as follows:

(21)
L_{t \minus \setnum{1}}^{u} p_{t}^{u} \plus L_{t \minus \setnum{1}}^{s} p_{t}^{s} \equals \tau \left[ {L_{t}^{u} W_{t}^{u} \plus L_{t}^{s} W_{t}^{s} } \right].

Assume that in each period population is growing at a constant rate n:

(22)
N_{t} \equals \lpar 1 \plus n\rpar N_{t \minus \setnum{1}}.

Since at date t there is a proportion x t* and 1−x t* of respectively skilled and unskilled workers as specified in equations (18) and (19), the balanced budget of the retirement system (21), with equations (1)–(7) and (22), is rewritten as:

(23)
\theta _{t} \equals {{\left[ {\lpar 1 \plus n\rpar \tau \left[ {x_{t}{\hskip -2.5pt \ast } \left( {1 \minus h_{t} } \right)Bh_{t}^{\delta } \plus 1 \minus x_{t}{\hskip -2.5pt \ast } } \right] \minus \nu _{t} } \right]\left( {x_{t \minus \setnum{1}}{\hskip -13pt \ast }\hskip 10pt Bh_{t \minus \setnum{1}}^{\delta } \plus 1 \minus x_{t \minus \setnum{1}}{\hskip -13pt \ast }\hskip 8pt } \right)} \over {x_{t \minus \setnum{1}}{\hskip -13pt \ast }\hskip 9pt \left[ {\mu \plus \lpar 1 \minus \mu \rpar \left( {1 \minus h_{t \minus \setnum{1}} } \right)} \right]Bh_{t \minus \setnum{1}}^{\delta } \plus 1 \minus x_{t \minus \setnum{1}}{ {\hskip -13pt \ast }\hskip 10pt } }}{{w_{t} } \over {w_{t \minus \setnum{1}} }}.

As is obvious, a decreasing population growth, which corresponds to an aging population in the model, is associated with lower pension benefits, everything else being equal. Considering the social security balanced budget (23) and the capital market clearing (20), with equations (1)–(7), (13), (16), (17) and (22), the dynamics of capital accumulation in the model can be expressed as

(24)
\eqalign{ \tab k_{t \plus \setnum{1}} \left[ {x_{t \plus \setnum{1}}{\hskip -13pt \ast }\hskip 9pt \left( {1 \minus h_{t \plus \setnum{1}} } \right)Bh_{t \plus \setnum{1}}^{\delta } \plus 1 \minus x_{t \plus \setnum{1}}{\hskip -13pt \ast }\hskip 9pt } \right] \cr \tab \quad \equals {{A\alpha \beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar } \over {\left[ {\alpha \lpar 1 \plus \beta \rpar \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n\rpar }}{{x_{t}{\hskip -2.5pt \ast } \left( {1 \minus h_{t} } \right)Bh_{t}^{\delta } \plus 1 \minus x_{t}{\hskip -2.5pt \ast } } \over {x_{t}{\hskip -2.5pt \ast } Bh_{t}^{\delta } \plus 1 \minus x_{t}{\hskip -2.5pt \ast } }}k_{t}^{\alpha }. \cr}

As retirement systems reduce private savings (equation 13), all things being equal, PAYG retirement systems are harmful to the accumulation of physical capital: {\textstyle{{\partial {\textstyle{{1 \minus \tau } \over {\alpha \left( {1 \plus \beta } \right) \plus \tau \left( {1 \minus \alpha } \right)}}}} \over {\partial \tau }}} \lt 0 (equation 24). In addition, as x* and h are both forward-looking variables, their specification is crucial to determine the dynamic properties of the model and the convergence towards its steady-state (balanced growth) path.

4.2 Human capital and actuarial fairness

As noted in the introduction and characterized by equation (8), most retirement systems in the industrial world are close to actuarial fairness. It is obviously the case if pension benefits are based on full lifetime average earnings (\bar{p} \equals 0 and μ=0). It is less obvious that it is also possible if pension benefits are linked only to partial earnings history.

Proposition 2.Retirement systems whose pensions are linked to the best or last years can be actuarially fair if they include a flat-rate component indexed on the unskilled earnings, \bar{p}_{t} \equals \nu _{t} W_{t}^{u} , such as \nu _{t} \equals \tilde{\nu }_{t} \equals {\textstyle{{\lpar 1 \plus n\rpar Bh_{t \minus \setnum{1}}^{\setnum{1} \plus \delta } } \over {Bh_{t \minus \setnum{1}}^{\delta } \minus 1}}}\tau {\textstyle{{x_{t}{\hskip -2.5pt \ast } \lpar 1 \minus h_{t} \rpar Bh_{t}^{\delta } \plus 1 \minus x_{t}{\hskip -2.5pt \ast } } \over {x_{t \minus \setnum{1}}{\hskip -9pt \ast }\hskip 7pt \lpar 1 \minus h_{t \minus \setnum{1}} \rpar Bh_{t \minus \setnum{1}}^{\delta } \plus 1 \minus x_{t \minus \setnum{1}}{\hskip -9pt \ast }\hskip 10pt }}}.

If \nu _{t} \gt \tilde{\nu }_{t} , the retirement system is fiscally to the advantage of low-income earners, {\textstyle{{p_{t}^{u} } \over {\tau W_{t \minus \setnum{1}}^{u} }}} \gt {\textstyle{{p_{t}^{s} } \over {\tau W_{t \minus \setnum{1}}^{s} }}}, and is then progressive. In the opposite case, \nu _{t} \lt \tilde{\nu }_{t} , it is regressive. This feature is easily understandable. On one hand, the flat-rate part of pension benefits is clearly to the advantage of low-income earners: they receive as much as high-income earners whereas they have contributed less. A flat-rate system is obviously progressive. On the other hand, the pension part that is linked to the best or last years, characterized by θt, favors high-income earners as they have steeper age-earnings profiles, as explained by Lindbeck and Persson (Reference Lindbeck and Persson2003) and Bozio and Piketty (Reference Bozio and Piketty2008) . If there is no flat-rate part then the system is regressive. Therefore, there is only one combination of the flat-rate and earnings-related parts, the one specified in Proposition 2, that characterizes actuarial fairness.

Consider an actuarially fair retirement system, i.e. either μ=0 and \mathop {\bar{p}}\nolimits_{t \plus \setnum{1}} \equals 0 or μ=1 and \nu _{t \plus \setnum{1}} \equals \tilde{\nu }_{t \plus \setnum{1}} . In that case, the lifetime income inequality I_{t} \equals {\textstyle{{\rmOmega _{t}^{s} } \over {\rmOmega _{t}^{u} }}} becomes I_{t} \equals {\textstyle{{W_{t}^{s} } \over {W_{t}^{u} }}}. With equations (1)–(4), the proportion of skilled workers in t defined by (14) becomes

(25)
x_{t}{\hskip -2.5pt \ast } \equals 1 \minus {1 \over {\mathop {\left[ {\left( {1 \minus h_{t} } \right)Bh_{t}^{\delta } } \right]}\nolimits^{\lpar \setnum{1} \plus \beta \rpar \sol \varepsilon } }}.

In this configuration, the choice for a young individual to make the effort at school in t−1 to become a skilled worker in t only depends on his learning ability and the length of the training he anticipates to complete. As h 0 corresponds to max{(1−h)Bhδ}, we can deduce from (25) that any increase in the training length compared to the basic level h 0 will lead to a decrease in the skilled workers proportion: \mathop {\left. {{\textstyle{{\partial x_{t}{\hskip -2.5pt \ast } } \over {\partial h_{t} }}}} \right\vert}\nolimits_{h_{t} \geqslant h^{\setnum{0}} }\! \leqslant 0. Following Proposition 1, we can then expect that actuarially fair retirement systems whose pension benefits are based on partial earnings history reduce the proportion of skilled workers.

When the retirement system is purely contributory, i.e. if μ=0 and \bar{p}_{t \plus \setnum{1}} \equals 0, it has no impact on the training length in the second period of life (equation 14). By contrast, as characterized by equation (12), linking pension benefits to partial earnings history generates an incentive to be trained longer, which depends crucially on the actualized Bismarckian component {\textstyle{{\theta _{t \plus \setnum{1}} } \over {R_{t \plus \setnum{1}} }}}. Using equations (16), (17) and (23), (24), we have {\textstyle{{\theta _{t \plus \setnum{1}} } \over {R_{t \plus \setnum{1}} }}} \equals \left[ {{\textstyle{{Bh_{t}^{\delta } \lpar 1 \minus h_{t} \rpar \minus 1} \over {Bh_{t}^{\delta } \minus 1}}}} \right]{\textstyle{{\beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar \tau } \over {\alpha \lpar 1 \plus \beta \rpar \plus \tau \lpar 1 \minus \alpha \rpar }}}. Thus, the training length according to the social security features can be summarized as:

(26)
h_{t} \equals \left\{ {\matrix{ {h^{\setnum{0}} \comma }\hfill \tab {{\rm if\ }\mu \equals 0{\rm \ and\ }\bar{p}_{t \plus \setnum{1}} \equals 0\comma }\hfill \cr {h^{\setnum{0}} \left[ {1 \plus {\textstyle{{\beta \lpar 1 \minus \alpha \rpar \tau } \over {\alpha \lpar 1 \plus \beta \rpar \plus \tau \lpar 1 \minus \alpha \rpar }}}\ {\textstyle{{Bh_{t}^{\delta } \left( {1 \minus h_{t} } \right) \minus 1} \over {Bh_{t}^{\delta } \minus 1}}}} \right]\comma } \tab {{\rm if\ }\mu \equals 1{\rm \ and\ }\nu _{t \plus \setnum{1}} \equals \tilde{\nu }_{t \plus \setnum{1}}.} \cr} } \right.

If μ=1 and \nu _{t \plus \setnum{1}} \equals \tilde{\nu }_{t \plus \setnum{1}} , we deduce from (26) that \mathop {\rm lim}\limits_{h \to h^{\setnum{0}} } {\rm RHS} \gt h^{\setnum{0}} and \mathop {\rm lim}\limits_{h \to \setnum{1}} {\rm RHS} \lt h^{\setnum{0}} . In this case, the training is expressed as a function h t=h(τ) such as h 0h(τ)⩽1. In the case of a pure contributory system, as the latter has no impact on the training length, we will note conveniently that h t=h(μτ), where μ=0, i.e. h(0)=h 0. Thereafter, as the skill choice depends only on the training length (25), it can also be expressed as x t*=x*(μτ), where μ=0 or μ=1. In that case, x t*=x*(0) corresponds to an unchanged proportion of skilled workers in t compared to a situation with no retirement system: x{ \ast } \lpar 0\rpar \equals 1 \minus {\textstyle{1 \over {\mathop {\left[ {\left( {1 \minus h^{\setnum{0}} } \right)Bh^{\setnum{0}\delta } } \right]}\nolimits^{{\textstyle{{\setnum{1} \plus \beta } \over \varepsilon }}} }}}.

4.3 Dynamic properties

As underlined by equations (25) and (26), human capital variables are in their steady-state values independently of the calculation of pension benefits. Therefore, considering an actuarially fair retirement system, the physical capital accumulation dynamics (24) can be rewritten as:

(27)
k_{t \plus \setnum{1}} \equals {{A\alpha \beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar } \over {\left[ {\alpha \lpar 1 \plus \beta \rpar \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n\rpar }}{1 \over {x{ \ast } \lpar \mu \tau \rpar Bh^{\delta } \lpar \mu \tau \rpar \plus 1 \minus x{ \ast } \lpar \mu \tau \rpar }}k_{t}^{\alpha }.

Since α<1, given k 0>0, the model has good dynamic properties and converges to its steady-state (balanced growth) path characterized by h(μτ), x*(μτ) and k \equals \mathop {\left[ {{\textstyle{{A\alpha \beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar } \over {\left[ {\alpha \lpar 1 \plus \beta \rpar \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n\rpar }}}\ {\textstyle{1 \over {x{ \ast } \lpar \mu \tau \rpar Bh^{\delta } \lpar \mu \tau \rpar \plus 1 \minus x{ \ast } \lpar \mu \tau \rpar }}}} \right]}\nolimits^{\setnum{1}\sol \lpar \setnum{1} \minus \alpha \rpar} , where μ=0 or μ=1.

As the convergence is verified, the impact of retirement systems and the calculation of pension benefits on growth and income inequality can now be discussed.

5 Social security, income inequality and growth

On the balanced growth path, we deduce from the labor market clearing relations (18) and (19), as well as from equations (1), (3), (15) and (22), the economic growth rate g:

(28)
1 \plus g \equals {Y \over {Y_{ \minus \setnum{1}} }} \equals \lpar 1 \plus n\rpar {{\bar{Z}} \over {\mathop {\bar{Z}}\nolimits_{ \minus \setnum{1}} }} \equals \lpar 1 \plus n\rpar \left[ {x{ \ast } Bh^{\delta } \plus 1 \minus x{ \ast } } \right].

In addition to the population growth, equation (28) stresses that economic growth depends on the rate of knowledge accumulation which is driven by both the proportion of skilled workers in the economy and the length of training.

A pure contributory system (ν=0 and μ=0) has no impact on the training length (equation 14). As a consequence, considering equation (25), it has also no impact on the proportion of skilled workers. Indeed, in that case, as pension benefits are proportional to all contributions the retirement system can no longer alter the skill choice. Pure contributory systems are then characterized by an unchanged investment in human capital, i.e. h 0 and x* (0), and it follows that:

Proposition 3.Pure contributory retirement systems have no impact on either economic growth or on lifetime income inequality.

Consider alternatively retirement systems whose pensions are linked to the best or last years, i.e. μ=1. If they are actuarially fair, following equation (26), the training is determined by h \equals h^{\setnum{0}} \left[ {1 \plus {\textstyle{{\beta \lpar 1 \minus \alpha \rpar \tau } \over {\alpha \lpar 1 \plus \beta \rpar \plus \tau \lpar 1 \minus \alpha \rpar }}}\ {\textstyle{{Bh^{\delta } \lpar 1 \minus h\rpar \minus 1} \over {Bh^{\delta } \minus 1}}}} \right], where \mathop {\rm lim}\limits_{h \to h^{\setnum{0}} } {\rm RHS} \gt h^{\setnum{0}} and \mathop {\rm lim}\limits_{h \to \setnum{1}} {\rm RHS} \lt h^{\setnum{0}} . This equation thus defines a relation between the training and the contribution rate such as h=h(τ)<1 and {\textstyle{{\partial h} \over {\partial \tau }}} \gt 0. In addition, as the skill choice is specified by x{ \ast } \equals 1 \minus {\textstyle{1 \over {\mathop {\left[ {\lpar 1 \minus h\rpar Bh^{\delta } } \right]}\nolimits^{\lpar\setnum{1} \plus \beta\rpar \sol \varepsilon } }}}, it follows that:

Proposition 4.Compared with no retirement system or with pure contributory systems, actuarially fair retirement systems whose pensions are linked to the best or last years enhance economic growth (at least if the contribution rate is sufficiently low) and lower lifetime income inequality.

At first glance, the result with regard to income inequality appears counter-intuitive. Indeed, the reduction in lifetime income inequality follows the lengthening of the skilled workers' training. This lengthening raises the difference between earnings of the skilled workers and those of the unskilled ones. Such retirement systems then increase earnings inequality. However, from a life cycle perspective, with no retirement system or with a pure contributory system (Proposition 1), individuals who decide to undertake training choose the duration h 0, which maximizes their lifetime income; h 0 thus maximizes the lifetime income inequality between skilled and unskilled workers. A lengthening of the training thus raises lifetime income inequality when h<h 0. Conversely, when h>h 0, a lengthening of the training reduces lifetime income inequality because we move away from the individually optimal training length. Therefore, even if the retirement system does not carry out transfers from high-income to low-income earners, we know from Proposition 1 that such an earnings-related pension benefit formula generates an incentive for longer training. Skilled workers are then encouraged to train themselves more than their individually optimal level. Consequently, actuarially fair retirement systems, whose pensions are linked to the best or last years, reduce lifetime income inequality compared to a situation with no retirement system (or purely contributory).

As such systems reduce lifetime income inequality, from equation (14) it follows that they reduce the proportion of skilled workers in the economy. This has a negative effect on economic growth. On the other hand, they incite skilled workers to train longer. Since the latter effect dominates the former (at least for a sufficiently low size of the system) we can stress a positive impact of PAYG retirement systems on economic growth as empirically reported by Sala-i-Martin (Reference Sala-i-Martin1996) and Zhang and Zhang (Reference Zhang and Zhang2004) . We then extend results obtained by Zhang (Reference Zhang1995), Sala-i-Martin (Reference Sala-i-Martin1996), Kemnitz and Wigger (Reference Kemnitz and Wigger2000), Le Garrec (Reference Le2001) and Zhang and Zhang (Reference Zhang and Zhang2003) with identical ability individuals. Initiated by the lengthening of training, our mechanism is directly related to Kemnitz and Wigger (Reference Kemnitz and Wigger2000) and Le Garrec (Reference Le2001) Footnote 6. It also explains the difference with Docquier and Paddison (Reference Docquier and Paddison2003) . By assuming a fixed training length, they indeed show that retirement systems based on partial earnings history can enhance growth, but only if they are fully funded.

In the current framework, actuarially fair retirement systems whose pensions are linked to the best or last years appear to be to the advantage of low-income earners, not through direct fiscal redistribution, but by enhancing growth. However, are these systems and their features socially optimal? If adopting the view of low-income earners, should we not integrate greater progressivity? and with which consequence for growth?

Proposition 5.For given contribution rates, compared to actuarially fair PAYG retirement systems whose pensions are linked to the best or last years, any marginal increase (decrease) of the flat-rate component reduces (increases) both economic growth and lifetime income inequality.

The flat-rate versus partial earnings-related structure sheds light on the dilemma between income inequality and economic growth in retirement systems. In order to reduce (more) the level of lifetime income inequality, retirement systems must be more progressive. Nevertheless, it is harmful for growth. Indeed, an increase in the flat-rate part, which sustains a reduction of lifetime income inequality, also means, for a given contribution rate, a reduction of the earnings-related part and thus a reduction of the incentive to be trained longer. If a retirement system gets close to a pure flat-rate system, which is the case of Anglo-Saxon countries, it has no impact on the training length (Proposition 1). Consequently, such a system results in a reduction of economic growth while it also reduces lifetime income inequality significantly. Conversely, a pure earnings-related system where benefits are linked to the best or last years boosts economic growth but is regressive. The choice of the intragenerational redistribution degree consists of a lifetime income inequality/growth trade-off. In such a context, if policymakers want to favor future generations, they will opt for more growth and then for greater regressivity. By contrast, if they want to favor contemporary low-income earners, they will opt for greater progressivity.

6 Conclusion

In 1950, life expectancy at birth in Western Europe was 68 years. Nowadays, it is 80 years and should reach 85 years in 2050 (United Nations, 2009). The downside of this trend is the serious threat that is hanging over the financing of our public retirement systems. Financed on a PAYG basis, they must cope with the increasingly larger number of pensioners compared to the number of contributors. Changes are unavoidable and are of major importance in OECD countries. In 2005, the payment of pension benefits represented 38% of all their public social expenditures. As a matter of fact, retirement systems are the major program of industrial countries' redistributive policies and their importance should still grow with the aging of their population.

Claiming the broad inefficiency of PAYG retirement systems (being accused of low return and of distorting individual behaviors), some economists as Feldstein (Reference Feldstein1995a, Reference Feldsteinb, Reference Feldstein1996) stress that these financial difficulties give opportunities to move to fully funded systems. However, replacing conventional PAYG systems by financial – or funded – defined contribution (FDC) systems involves such a large cost of transition that it appears socially and politically difficult to implement such a reform in western democracies. That is why in recent years strong focus has been put on non-financial- or notional- defined contribution (NDC) systems as legislated in Sweden in 1994. As described by Palmer (Reference Palmer, Holzmann and Palmer2006), NDC systems are PAYG systems that mimic FDC systems. Individual contributions are noted on individual accounts. Accounts are credited with a rate of return that reflects demographic and productivity changes.

By basing benefits on individual accounts, NDC systems have undoubtedly desirable features in terms of transparency. However, as existing retirement systems (except in Anglo-Saxon countries) are already close to actuarial fairness, we cannot expect from NDC systems a significant decrease in negative incentive effects. In many respects, introducing a NDC system largely involves moving from a defined benefit to a defined contribution system aiming at the contribution rates' stabilization. From this perspective, one should note that this objective can be achieved similarly within the scope of more conventional defined benefit systems, as seen in the point system in France. In that case, the unit of pension rights is earnings points and not euros and can be adjusted according to demographic and productivity changes as in a NDC system. As stressed by Börsch-Supan (Reference Börsch-Supan, Holzmann and Palmer2006), cleverly designed conventional retirement systems can often do the same job as NDC systems. As shown in this article, they can even do better.

In particular, a conventional retirement system can allow pension benefits to be linked only to partial earnings history as observed in Greece, Spain, France, Norway and in Sweden before the 1994 legislation. In that case, associated with the appropriate flat-rate benefits, the latter can be actuarially fair as a NDC system while enhancing economic growth by promoting the accumulation of human capital. It does better in this matter than a NDC system which has no impact on economic growth.

Furthermore, an actuarially fair retirement system whose pensions are linked to the best or last years lowers lifetime income inequality, whereas a NDC system does not. More generally moving to a NDC system, by nature purely contributory, definitively closes the debate on the progressivity of the retirement system, which is an important one in democracy. Of course, compared to any actuarially fair system, greater progressivity results in negative incentive effects that lead to less economic growth. However, even if the current choice is actuarial fairness, at least a conventional system still allows one to ask whether greater progressivity involving less lifetime inequality is worth the cost in terms of economic growth. To contribute to the debate, as the latter can be enhanced through more investment in human capital, it is worth noting that the negative effect on economic growth associated with greater progressivity could be counterbalanced by any relevant educational policy. Besides, as suggested by our results, a properly designed progressive system with pension benefits linked to the best or last years could significantly decrease lifetime income inequality while being neutral in terms of economic growth.

Appendix A – Proof of propositions

Proposition 1. Following (12), h^{\setnum{0}} \equals {\textstyle{\delta \over {1 \plus \delta }}} \lt 1.

If \mathop {\tilde{W}}\nolimits_{t}^{s} \equals W_{t}^{s} {\rm \ \lpar }\mu \equals 0{\rm \rpar } or θt+1=0, we still have according to (12) h t=h 0.

By contrast, if μ=1 and θt+1>0, {\textstyle{\delta \over {1 \plus \delta }}}\, {\textstyle{\mu \over {1 \minus \tau \plus \lpar 1 \minus \mu \rpar {\textstyle{{\theta _{t \plus \setnum{1}} } \over {R_{t \plus \setnum{1}} }}}}}}\, {\textstyle{{\theta _{t \plus \setnum{1}} } \over {R_{t \plus \setnum{1}} }}} \gt 0 and it follows that h t>h 0.

Proposition 2. Assuming that μ=1, actuarial fairness defined by (8) entails that:

{{\theta _{t} Bh_{t \minus \setnum{1}}^{\delta } \mathop {\overline{Z} }\nolimits_{t \minus \setnum{2}} w_{t \minus \setnum{1}} \plus \nu _{t} \mathop {\overline{Z} }\nolimits_{t \minus \setnum{1}} w_{t} } \over {\tau \left( {1 \minus h_{t \minus \setnum{1}} } \right)Bh_{t \minus \setnum{1}}^{\delta } \mathop {\overline{Z} }\nolimits_{t \minus \setnum{2}} w_{t \minus \setnum{1}} }} \equals {{\theta _{t} \mathop {\overline{Z} }\nolimits_{t \minus \setnum{2}} w_{t \minus \setnum{1}} \plus \nu _{t} \mathop {\overline{Z} }\nolimits_{t \minus \setnum{1}} w_{t} } \over {\tau \mathop {\overline{Z} }\nolimits_{t \minus \setnum{2}} w_{t \minus \setnum{1}} }}

and then:

{{\theta _{t} Bh_{t \minus \setnum{1}}^{\delta } w_{t \minus \setnum{1}} \plus \nu _{t} \left( {x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 10pt Bh_{t \minus \setnum{1}}^{\delta } \plus 1 \minus x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 8pt } \right)w_{t} } \over {\tau \left( {1 \minus h_{t \minus \setnum{1}} } \right)Bh_{t \minus \setnum{1}}^{\delta } w_{t \minus \setnum{1}} }} \equals {{\theta _{t} w_{t \minus \setnum{1}} \plus \nu _{t} \left( {x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 10pt Bh_{t \minus \setnum{1}}^{\delta } \plus 1 \minus x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 8pt } \right)w_{t} } \over {\tau w_{t \minus \setnum{1}} }}.

Introducing (23) with μ=1 entails that:

\eqalign{ \tab {{\left[ {\lpar 1 \plus n\rpar \tau \left[ {x_{t}\hskip -2.5pt{ \ast } \left( {1 \minus h_{t} } \right)Bh_{t}^{\delta } \plus 1 \minus x_{t}\hskip -2.5pt{ \ast } } \right] \minus \nu _{t} } \right]Bh_{t \minus \setnum{1}}^{\delta } \plus \nu _{t} \left( {x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 10pt Bh_{t \minus \setnum{1}}^{\delta } \plus 1 \minus x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 8pt  } \right)} \over {\left( {1 \minus h_{t \minus \setnum{1}} } \right)Bh_{t \minus \setnum{1}}^{\delta } }} \cr \tab \equals \left[ {\lpar 1 \plus n\rpar \tau \left[ {x_{t}\hskip -2.5pt{ \ast } \left( {1 \minus h_{t} } \right)Bh_{t}^{\delta } \plus 1 \minus x_{t}\hskip -2.5pt{ \ast } } \right] \minus \nu _{t} } \right] \plus \nu _{t} \left( {x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 10pt Bh_{t \minus \setnum{1}}^{\delta } \plus 1 \minus x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 8pt  } \right). \cr}

Actuarial fairness with μ=1 is then obtained if:

\nu _{t} \equals {{\left( {1 \plus n} \right)Bh_{t \minus \setnum{1}}^{\setnum{1} \plus \delta } } \over {Bh_{t \minus \setnum{1}}^{\delta } \minus 1}}\tau {{x_{t}\hskip -2.5pt{ \ast } \left( {1 \minus h_{t} } \right)Bh_{t}^{\delta } \plus 1 \minus x_{t}\hskip -2.5pt{ \ast } } \over {x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 8pt  \left( {1 \minus h_{t \minus \setnum{1}} } \right)Bh_{t \minus \setnum{1}}^{\delta } \plus 1 \minus x_{t \minus \setnum{1}}\hskip -13pt{ \ast }\hskip 10pt  }}.

Proposition 4. From equation (28) we derive {\textstyle{{{\rm d}\left( {1 \plus g} \right)} \over {{\rm d}\tau }}} \equals \left( {1 \plus n} \right)\left[ {Bh^{\delta } \minus 1} \right]{\textstyle{{{\rm d}x{ \ast } } \over {{\rm d}\tau }}} \plus x{ \ast } \delta Bh^{\delta \minus \setnum{1}} {\textstyle{{{\rm d}h} \over {{\rm d}\tau }}}.

From equation (25) we derive {\textstyle{{{\rm d}x{ \ast } } \over {{\rm d}\tau }}} \equals {\textstyle{{1 \plus \beta } \over \varepsilon }}\mathop {\left[ {\lpar 1 \minus h\rpar Bh^{\delta } } \right]}\nolimits^{ \minus \lpar \setnum{1} \plus \beta \plus \varepsilon \rpar \sol \varepsilon } {\textstyle{{\partial \left[ {\left( {1 \minus h} \right)Bh^{\delta } } \right]} \over {\partial \tau }}}. Close to τ=0, we show that \mathop {\left. {{\textstyle{{\partial \left[ {\left( {1 \minus h} \right)Bh^{\delta } } \right]} \over {\partial \tau }}}} \right\vert}\nolimits_{\tau \equals \setnum{0}} \equals 0 and then \mathop {\left. {{\textstyle{{{\rm d}x{ \ast } } \over {{\rm d}\tau }}}} \right\vert}\nolimits_{\tau \equals \setnum{0}} \equals 0. It follows \mathop {\left. {{\textstyle{{{\rm d}\left( {1 \plus g} \right)} \over {{\rm d}\tau }}}} \right\vert}\nolimits_{\tau \equals \setnum{0}} \equals x{ \ast } \delta Bh^{\delta \minus \setnum{1}} {\textstyle{{{\rm d}h} \over {{\rm d}\tau }}}.

From (26), and since hh 0, we show that {\rm sign}\left[ {{\textstyle{{{\rm d}h} \over {{\rm d}\tau }}}} \right] \equals {\rm sign}\left[ {{\textstyle{{\partial {\textstyle{\tau \over {\left[ {\alpha \left( {1 \plus \beta } \right) \plus \tau \left( {1 \minus \alpha } \right)} \right]}}}} \over {\partial \tau }}}} \right], where{\textstyle{{\partial {\textstyle{\tau \over {\left[ {\alpha \left( {1 \plus \beta } \right) \plus \tau \left( {1 \minus \alpha } \right)} \right]}}}} \over {\partial \tau }}} \equals {\textstyle{{\alpha \left( {1 \plus \beta } \right)} \over {\mathop {\left[ {\alpha \left( {1 \plus \beta } \right) \plus \tau \left( {1 \minus \alpha } \right)} \right]}\nolimits^{\setnum{2}} }}} \gt 0.

We then show that: \mathop {\left. {{\textstyle{{{\rm d}\left( {1 \plus g} \right)} \over {{\rm d}\tau }}}} \right\vert}\nolimits_{\tau \equals \setnum{0}} \gt 0.

from (14) we derive {\textstyle{{{\rm d}I} \over {{\rm d}\tau }}} \equals {\textstyle{\varepsilon \over {1 \plus \beta }}}I^{\lpar {\setnum{1} \plus \beta \plus \varepsilon \rpar \sol \varepsilon }} {\textstyle{{{\rm d}x{ \ast } } \over {{\rm d}\tau }}}. It follows that {\rm sign}\left[ {{\textstyle{{{\rm d}I} \over {{\rm d}\tau }}}} \right] \equals {\rm sign}\left[ {{\textstyle{{{\rm d}x{ \ast } } \over {{\rm d}\tau }}}} \right].

We know from equation (25) that {\textstyle{{{\rm d}x{ \ast } } \over {{\rm d}\tau }}} \equals {\textstyle{{1 \plus \beta } \over \varepsilon }}\mathop {\left[ {\lpar 1 \minus h\rpar Bh^{\delta } } \right]}\nolimits^{ \minus \lpar \setnum{1} \plus \beta \plus \varepsilon \rpar \sol \varepsilon } {\textstyle{{\partial \left[ {\left( {1 \minus h} \right)Bh^{\delta } } \right]} \over {\partial \tau }}}. Ash^{\setnum{0}} \equals {\rm arg}{\kern 1pt} {\rm max}\left\{ {\lpar 1 \minus h\rpar Bh^{\delta } } \right\}, it follows that \mathop {\left. {{\textstyle{{\partial \left[ {\left( {1 \minus h} \right)Bh^{\delta } } \right]} \over {\partial h}}}} \right\vert}\nolimits_{h\geqslant h^{\setnum{0}} } \leqslant 0.

Knowing from (26) that hh 0 and {\rm sign}\left[ {{\textstyle{{{\rm d}h} \over {{\rm d}\tau }}}} \right] \gt 0, we then have {\textstyle{{{\rm d}x{ \ast } } \over {{\rm d}\tau }}}\leqslant 0 and then: {\textstyle{{{\rm d}I} \over {{\rm d}\tau }}}\leqslant 0.

Proposition 5. For μ=1, with equations (12), (16), (17), (23) and with \nu \equals \tilde{\nu } (Proposition 2), we show that training is determined by:

(29)
h \equals h^{\setnum{0}} \plus {\delta \over {\lpar 1 \plus \delta \rpar \lpar 1 \minus \tau \rpar }}{{\lpar 1 \plus n\rpar \tau \left[ {x{ \ast } \lpar 1 \minus h\rpar Bh^{\delta } \plus 1 \minus x{ \ast } } \right] \minus \nu } \over R}.

The proportion of skilled workers is determined by:

(30)
\matrix{ {{{\varepsilon {\rm log}\lpar 1 \minus x{ \ast } \rpar } \over {1 \plus \beta }} \equals \tab } \cr { \tab {\rm log}\left\{ {1 \minus \tau \plus {{\lpar 1 \plus n\rpar \tau \left[ {x{ \ast } \lpar 1 \minus h\rpar Bh^{\delta } \plus 1 \minus x{ \ast } } \right] \minus \nu } \over R} \plus {{\nu \left[ {x{ \ast } Bh^{\delta } \plus 1 \minus x{ \ast } } \right]} \over R}} \right\}} \cr { \tab \minus {\rm log}\left\{ {\lpar 1 \minus h\rpar Bh^{\delta } \lpar 1 \minus \tau \rpar \plus {{\lpar 1 \plus n\rpar \tau \left[ {x{ \ast } \lpar 1 \minus h\rpar Bh^{\delta } \plus 1 \minus x{ \ast } } \right] \minus \nu } \over R}Bh^{\delta } \plus {{\nu \left[ {x{ \ast } Bh^{\delta } \plus 1 \minus x{ \ast } } \right]} \over R}} \right\}.} \cr}

The interest rate is

(31)
R \equals {{\lpar 1 \plus \beta \rpar \alpha \plus \tau \lpar 1 \minus \alpha \rpar } \over {\beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar }}\lpar 1 \plus n\rpar \left[ {x{ \ast } Bh^{\delta } \plus 1 \minus x{ \ast } } \right].

Equation (29) can be rewritten as

(32)
{{\lpar 1 \plus n\rpar \tau \left[ {x{ \ast } \lpar 1 \minus h\rpar Bh^{\delta } \plus 1 \minus x{ \ast } } \right] \minus \nu } \over R} \equals {{\lpar 1 \plus \delta \rpar \lpar 1 \minus \tau \rpar } \over \delta }\lpar h \minus h^{\setnum{0}} \rpar.

By introducing this last equation in the skilled workers equation (2), we obtain:

(33)
\matrix{ {{{\varepsilon {\rm log}\lpar 1 \minus x{ \ast } \rpar } \over {1 \plus \beta }} \equals \tab } \cr { \tab {\rm log}\left\{ {1 \minus \tau \plus {{\lpar 1 \plus \delta \rpar \lpar 1 \minus \tau \rpar } \over \delta }\lpar h \minus h^{\setnum{0}} \rpar \plus \nu {{\beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar } \over {\left[ {\lpar 1 \plus \beta \rpar \alpha \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n\rpar }}} \right\}} \cr { \tab \minus {\rm log}\left\{ {\lpar 1 \minus h\rpar Bh^{\delta } \lpar 1 \minus \tau \rpar \plus {{\lpar 1 \plus \delta \rpar \lpar 1 \minus \tau \rpar } \over \delta }\lpar h \minus h^{\setnum{0}} \rpar Bh^{\delta } \plus \nu {{\beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar } \over {\left[ {\lpar 1 \plus \beta \rpar \alpha \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n\rpar }}} \right\}.} \cr}

Let be {\textstyle{{\varepsilon {\rm log}\left( {1 \minus x{ \ast } } \right)} \over {1 \plus \beta }}} \equals {\rm log}\left\{ Y \right\} \minus {\rm log}\left\{ Z \right\}, where Y \equals 1 \minus \tau \plus {\textstyle{{\lpar 1 \plus \delta \rpar \lpar 1 \minus \tau \rpar } \over \delta }}\lpar h \minus h^{\setnum{0}} \rpar \plus&#x003C;$&#x003E; &#x003C;$&#x003E; \nu {\textstyle{{\beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar } \over {\left[ {\lpar 1 \plus \beta \rpar \alpha \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n\rpar }}} and Z \equals \lpar 1 \minus h\rpar Bh^{\delta } \lpar 1 \minus \tau \rpar \plus {\textstyle{{\lpar 1 \plus \delta \rpar \lpar 1 \minus \tau \rpar } \over \delta }} \lpar h \minus h^{\setnum{0}} \rpar Bh^{\delta } \plus&#x003C;$&#x003E; &#x003C;$&#x003E; \vskip 3pt\nu {\textstyle{{\beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar } \over {\left[ {\lpar 1 \plus \beta \rpar \alpha \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n\rpar }}}.

We then have:

(34)
{{ \minus \varepsilon } \over {\lpar 1 \plus \beta \rpar \left( {1 \minus x{ \ast } } \right)}}dx{ \ast } \equals {{Y_{h} dh \plus Y_{\nu } d\nu } \over Y} \minus {{Z_{h} dh \plus Z_{\nu } d\nu } \over Z}\comma

where Y_{h} \equals {\textstyle{{\lpar 1 \plus \delta \rpar \lpar 1 \minus \tau \rpar } \over \delta }}\geqslant 0, Z_{h} \equals Y_{h} Bh^{\delta } \equals {\textstyle{{\lpar 1 \plus \delta \rpar \lpar 1 \minus \tau \rpar } \over \delta }}Bh^{\delta } \geqslant 0 and Y_{\nu } \equals Z_{\nu } \equals&#x003C;$&#x003E; &#x003C;$&#x003E; {\textstyle{{\beta \lpar 1 \minus \alpha \rpar \lpar 1 \minus \tau \rpar } \over {\left[ {\lpar 1 \plus \beta \rpar \alpha \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n \rpar }}}\geqslant 0.

From an initial equilibrium \nu \equals \tilde{\nu }, Z \equals \lpar 1 \minus h\rpar Bh^{\delta } Y, we have

  • {\textstyle{{Y_{h} } \over Y}} \minus {\textstyle{{Z_{h} } \over Z}} \equals {\textstyle{{Y_{h} } \over Y}}\left( {1 \minus {\textstyle{1 \over {1 \minus h}}}} \right) \leqslant 0 \Rightarrow \mathop {\left. {{\textstyle{{\partial x{ \ast } } \over {\partial h}}}} \right\vert}\nolimits_{\nu \equals \tilde{\nu }} \geqslant 0.

  • {\textstyle{{Y_{\nu } } \over Y}} \minus {\textstyle{{Z_{\nu } } \over Z}} \equals {\textstyle{{Y_{\nu } } \over Y}}\left( {1 \minus {\textstyle{1 \over {\left( {1 \minus h} \right)Bh^{\delta } }}}} \right)\geqslant 0 \Rightarrow \mathop {\left. {{\textstyle{{\partial x{ \ast } } \over {\partial \nu }}}} \right\vert}\nolimits_{\nu \equals \tilde{\nu }} \leqslant 0.

The form of the differential (34) is then

(i)
{\rm d}x{ \ast } \equals a{\rm d}h \minus b{\rm d}\nu \comma

where a⩾0 and b⩾0.

By replacing the interest rate equation (31) in the formation equation (29), we have

(35)
h \equals h^{\setnum{0}} \plus {{\beta \lpar 1 \minus \alpha \rpar \delta \left[ {\lpar 1 \plus n\rpar \tau \left[ {x{ \ast } \lpar 1 \minus h\rpar Bh^{\delta } \plus 1 \minus x{ \ast } } \right] \minus \nu } \right]} \over {\lpar 1 \plus \delta \rpar \left[ {\lpar 1 \plus \beta \rpar \alpha \plus \tau \lpar 1 \minus \alpha \rpar } \right]\lpar 1 \plus n\rpar \left[ {x{ \ast } Bh^{\delta } \plus 1 \minus x{ \ast } } \right]}}.

Differentiating this equation, we get:

(ii)
{\rm d}h \equals \minus c{\rm d}h \minus e{\rm d}\nu \plus f\, \left[ {\nu \minus \tilde{\nu }} \right]{\rm d}x{ \ast } \comma

where c⩾0, e⩾0 and f⩾0.

We then conclude, with (i) and (ii), for an initial equilibrium \nu \equals \tilde{\nu }, {\textstyle{{{\rm d}h} \over {{\rm d}\nu }}}\leqslant 0 and {\textstyle{{{\rm d}x{ \ast } } \over {{\rm d}\nu }}}\leqslant 0:

{{{\rm d}g} \over {{\rm d}\nu }}\leqslant 0\;{\rm and}\;{{{\rm d}I} \over {{\rm d}\nu }}\leqslant 0.

Footnotes

1 This return combines profits before all federal, state and local taxes with the net interest paid. The method of calculation is described in Feldstein et al. (Reference Feldstein, Dicks-Mireaux and Poterba1983) .

2 Or, as noted by Lindbeck and Persson (Reference Lindbeck and Persson2003), quasi-actuarial fairness since the growth rate which corresponds to the retirement system return is lower than the interest rate.

3 In that case training is a full-time activity that can be assimilated to higher education. We could have assumed alternatively that training is a part-time activity without qualitatively changing the results (see Le Garrec, Reference Le2005).

4 It is designed to ensure that pensioners achieve some minimum standard of living.

5 Assuming alternatively that they are imperfect substitutes would not qualitatively alter the results. First, it would not change the training length as defined in equation (12) at all. By introducing a wage premium for human capital, it would change the skill choice. Nevertheless, it would still allow one to express the latter as a function of the training length in the case of actuarial fairness. The forthcoming propositions would then not be affected.

6 In Zhang (Reference Zhang1995) and Zhang and Zhang (Reference Zhang and Zhang2003), PAYG retirement systems result in more growth by reducing fertility of altruistic parents who consequently invest more in the education of their children. In Sala-i-Martin (Reference Sala-i-Martin1996), old workers are associated with negative externalities in the average stock of human capital. By inducing earlier retirement, PAYG retirement systems then stimulate growth.

References

Andolfatto, D., Ferrall, C. and Gomme, P. (2000) Life-cycle learning, earning, income and wealth, mimeo, November.Google Scholar
Ben-Porath, Y. (1967) The production of human capital and the life cycle of earnings. Journal of Political Economy, 75: 352365.CrossRefGoogle Scholar
Börsch-Supan, A. (2006) What Are NDC Systems? What Do They Bring to Reform Strategies?. In Holzmann, R. and Palmer, E. (eds), Pension Reform: Issues and Prospects for Non-Financial Defined Contribution (NDC) Schemes, chapter 3, The World Bank.Google Scholar
Bozio, A. and Piketty, T. (2008) Pour un nouveau système de retraite: Des comptes individuels de cotisations financés par répartition, Collection du CEPREMAP, éditions Rue d'Ulm.Google Scholar
Bowles, S. and Gintis, H. (2002) The inheritance of inequality. Journal of Economic Perspective, 16(3): 330.CrossRefGoogle Scholar
Brown, J., Coronado, J. L. and Fullerton, D. (2006) The progressivity of social security, NBER Papers on Retirement Research Center Projects, NB06-10, December.Google Scholar
Burkhauser, R. and Walick, J. (1981) Disentangling the annuity from the redistributive aspects of social security in United States. Review of Income and Wealth, 28: 401422.CrossRefGoogle Scholar
Casamatta, G., Cremer, H. and Pestieau, P. (2000) The political economy of social security. Scandinavian Journal of Economics, 102(3): 503522.CrossRefGoogle Scholar
Coronado, J. L., Fullerton, D. and Glass, T. (1999) Distributional impacts of proposed changes to the social security system, NBER WP 6989, March.Google Scholar
Coronado, J. L., Fullerton, D. and Glass, T. (2000) The progressivity of social security, NBER WP 7520, February.CrossRefGoogle Scholar
Cutler, D. (1998) Comment on Gustman A. and Steinmeier T. In Feldstein, M. (ed.), Privatizing Social Security. Chicago: University of Chicago Press for NBER, 357361.Google Scholar
d'Addio, A. C. (2007) Intergenerational transmission of disadvantage: mobility or immobility across generations?, OECD Social, Employment and Migration Working Papers 52, OECD Publishing.Google Scholar
Deaton, A. and Paxton, C. (1998) Aging and inequality in income and health. American Economic Review, 88(2): 248253.Google Scholar
Deaton, A. and Paxton, C. (1999) Mortality, education, income and inequality among american cohort, NBER WP 7140, May.CrossRefGoogle Scholar
Docquier, F. and Paddison, O. (2003) Social security benefit rules, growth and inequality. Journal of Macroeconomics, 25: 4771.CrossRefGoogle Scholar
Feldstein, M. (1995 a) Fiscal policies, capital formation, and capitalism. European Economic Review, 39: 399420.CrossRefGoogle Scholar
Feldstein, M. (1995 b) Would privatizing social security raise economic welfare?, NBER WP 5281.CrossRefGoogle Scholar
Feldstein, M. (1996) The missing piece in policy analysis: social security reform. American Economic Review, 86(2): 114.Google Scholar
Feldstein, M., Dicks-Mireaux, L. and Poterba, J. (1983) The effective taxe rate and the pretax rate of return. Journal of Public Economics, 21(2): 129158.CrossRefGoogle Scholar
Garrett, D. (1995) The effects of differential mortality rates on the progressivity of social security. Economic Inquiry, 33(3): 457475.CrossRefGoogle Scholar
Gustman, A. and Steinmeier, T. (2001) How effective is redistribution under the social security benefit formula?. Journal of Public Economics, 82: 128.CrossRefGoogle Scholar
Huggett, M., Ventura, G. and Yaron, A. (2006) Human capital and earnings distribution dynamics. Journal of Monetary Economics, 53(2): 265290.CrossRefGoogle Scholar
Kemnitz, A. and Wigger, B. (2000) Growth and social security: the role of human capital. European Journal of Political Economy, 16: 673683.CrossRefGoogle Scholar
Le, Garrec (2001) Systèmes de retraite par répartition, mode de calcul des droits à pension et croissance. Louvain Economic Review, 67(4): 357380.Google Scholar
Le, Garrec (2005) Social security, inequality and growth, WP No. 2005-22, OFCE/Sciences Po, December.Google Scholar
Lillard, L. (1977) Inequality: earnings vs. human capital. American Economic Review, 67(2): 4253.Google Scholar
Lindbeck, A. and Persson, M. (2003) The gains from pension reform. Journal of Economic Literature, 41(1): 74–112.CrossRefGoogle Scholar
Mincer, J. (1997) The production of human capital and the life cycle of earnings: variations on a theme. Journal of Labor Economics, 15(1) Part 2: S26S47.CrossRefGoogle Scholar
Neal, D. and Rosen, S. (2000) Theories of the distribution of earnings. In Atkinson, A. and Bourguignon, F. (eds), Handbook of Income Distribution, Amsterdam: Elsevier.Google Scholar
OECD (2007) Pensions at a glance: public policies across OECD countries.Google Scholar
Palmer, E. (2006) What is NDC? In Holzmann, R. and Palmer, E. (eds), Pension reform: issues and prospects for non-financial defined contribution (NDC) schemes, chapter 2, The World Bank.Google Scholar
Sala-i-Martin, X. (1996) A positive theory of social security. Journal of Economic Growth, June, 1(2): 277304.CrossRefGoogle Scholar
Sommacal, A. (2006) Pension systems and intragenerational redistribution when labor supply is endogenous. Oxford Economic Papers, 58(3): 379406.CrossRefGoogle Scholar
Stahlberg, A.-C. (1990) Life cycle income redistribution of the public sector: inter- and intra-generational effects. In Persson, I (ed.), Generating Equality in the Welfare State. Oslo: Norvegian University Press, pp. 97–121.Google Scholar
United Nations (2009) World Population Prospects: The 2008 Revision.Google Scholar
Zhang, J. (1995) Social security and endogenous growth. Journal of Public Economics, 58: 185213.CrossRefGoogle Scholar
Zhang, J. and Zhang, J. (2003) Long-run effects of unfunded social security with earnings-dependent benefits. Journal of Economic Dynamics and Control, 28: 617641.CrossRefGoogle Scholar
Zhang, J. and Zhang, J. (2004) How does social security affect economic growth? Evidence from cross-country data. Journal of Population Economics, 17: 473500.CrossRefGoogle Scholar