Descriptive analysis
Figure 1 shows the distributions of expected and actual retirement ages and the expectation error by SAH status for the sample where retirement ages are observed. The error is defined as
Expectation error = Actual retirement age – Expected retirement age. Because of the small number of observations in the lowest health category (SAH = 1), categories 1 and 2 have been combined in the figure. The distribution of expected retirement ages is concentrated on values 60, 63, and 65, which are sort of “focal points.” Those with the poorest health expect to retire earlier, and among them 60 years is the mode rather than 63 years as among those with a better health status. Explanations for the concentration of retirement at certain ages include social norms, certain ages as default options, and reference dependence (planned retirement income as a reference point) [
33]. In the Finnish case, age 63 is understandable as the lowest retirement age in the post-2005 pension system and 65 as the “traditional” official retirement age in the old system. Age 60 may be related to some public sector pensions or just to the use of a round number as an approximate retirement age.
Actual retirement ages are concentrated in
Figure 1 on the range 63–65, with a clear mode at 63 years, except for those with lowest health status. In our data the argument of expectations as “most probable retirement ages” [
26] was not fully supported, as the modes were not systematically closer to the expected retirement ages than the means. This may be related to the relatively small number of observations for some expected retirement times and the high concentration of retirements at age 63. The distribution of expectation errors is skewed: people are more likely to work longer than they expect, but the mode is still at zero error. Those with poor health expect to retire early and often do, which means that both expected and actual retirement ages are low. However, this does not mean that there is no room for making errors. In fact, the variances of both retirement ages and expectation errors are negatively related to SAH (Panel A of
Table II). One interpretation of this is that for people with low SAH the timing of retirement is more unpredictable.
Table II presents the unconditional connection of health and retirement in slightly aggregated form (but in our statistical analysis we treat the variables as continuous). The expectation error is classified into three groups: retirement before expected age (negative error), retirement at expected age (zero error), and retirement after expected age (positive error). Again, SAH categories 1 and 2 are combined. Panel A shows the connection of the error with health in the sample where censored retirement times are excluded. Among those with the poorest health, 46% retire later than they had expected, and 23% earlier than expected. Of those with the best SAH status, 45% work longer than they had anticipated, and 18% retire earlier than they had expected. The differences in working longer than expected between the health statuses are, therefore, relatively small. The main difference between the health levels comes from those with poor health retiring earlier than expected and, therefore, more often making negative expectation errors.
In Panel B of
Table II, we include into the group “After expected age” also those who have not yet retired but who have passed their expected retirement age. Panel B also shows the share of those who did not reach their expected retirement age by the end of the follow-up period and had not yet retired. For them we cannot tell when retirement happens compared to the expected retirement age. This group is clearly more common among those with better health. Comparison of the two panels shows that there may be a selection problem in the health–retirement relationship because retirement age is more likely to be observed for those with poor health.
The table shows that it is most difficult for persons with poor health (SAH 1 to 3) to estimate their timing of retirement. This group includes both persons who retire earlier than they expected and have negative errors and also persons who underestimate their future ability or willingness to continue to work and, hence, make positive forecast errors. The discrepancy between the expectations and realizations may also be related to being in a position where one has to continue working for financial reasons or because of a rejected disability pension application. Positive forecast errors are common also in the other health groups, but perhaps for different reasons. If good health is positively correlated with socioeconomic position, those with good health may have more opportunities to choose the timing of retirement, and they may also have more enjoyable working conditions.
Estimation results
We have estimated regression models where actual retirement age, expected retirement age, and expectation error are regressed on SAH and the controls. Since the same variables are used in all of the models, the estimates of the expectation error model are simply the differences between the coefficients of the actual and expected retirement age models. The baseline results are shown in
Table III.
The estimates concerning the health variables are collected in
Table IV. The estimates are presented there so that they tell the connection of one standard deviation change in a health variable with months of retirement age, expected retirement age, and expectation error. The baseline results in the first row of Panel A of
Table IV (based on the first line of
Table III) show that health is clearly correlated with both actual and expected retirement ages, when these are regressed on health and controls. Health has a stronger connection with expected retirement age, which produces a negative connection between health and expectation error. In terms of months, the results imply that one standard deviation improvement in health (0.87) is associated with two months later retirement, 3.6 months later expected retirement, and 1.6 months smaller expectation errors. However, as the descriptive evidence in the previous section showed, these results may have been influenced by sample selection, which leaves out many of those with good health who are working longer than they expected.
Among the other variables (reported in
Table III), age is positively related to expected and actual retirement age. Gender, education, and earnings have no effect on the results. The pension accrual rate is positively related to expected retirement age, which supports the incentive role of the age-related pension accrual. Those who are already in partial retirement or whose spouse is retired tend to have lower actual retirement ages, but these variables have a weak relationship to expected retirement age. Public sector pensions are positively related to both actual and expected retirement age. There is evidence that in the smallest plants (reference group under 10 employees) employees expect to retire later, but the relationship of plant size to actual retirement age is not as clear. Finally, the indicator for the 2008 survey has a negative and significant coefficient for actual retirement age. This may be related to the pension reform, which has made the lower limit 63 years a clear focal point in retirement behavior. This is so even though average retirement age has increased partly through restrictions on some of the early exit routes (average observed retirement age in our data is 62.8 for those in the 2003 survey and 63.2 for those in the 2008 survey). The focal point has been observed also in other studies [
34].
As discussed above, there are potential sources of bias. We give attention to biases arising from measurement error, endogeneity, and sample selection. The results of the health variables are presented in different panels of
Table IV.
First, we address the issue of measurement error in health. It can be argued that the indicators for chronic illness and pains and the number of days of sickness absence are more objective measures of health even when they are self-reported. Therefore, they themselves may contain some measurement error. We first use them as health variables instead of SAH. We use both self-reported absences in the past 12 months and register information on absences for which sickness allowance has been paid during the calendar year preceding the survey. Since the variables are measured differently (SAH from 1 to 5, pains and chronic illness as binary indicators, and absences as days), we report the results in terms of one standard deviation change in each health variable. Rows 2–4 of Panel A in
Table IV summarize the coefficients of the health variables scaled in this way and are expressed in months. Naturally, SAH has a sign which is different from the signs of the other health variables, as a higher value of SAH means better health, whereas the reverse is true for the other variables. Generally, SAH and sickness absences as explanatory variables give fairly similar results, whereas the other health variables give lower values for the relationships. The main difference between self-reported and register-based sickness absences is that the former has a stronger relationship with the expected retirement age. The result that SAH overstates the relationship compared to objective health measures is in line with the results of some earlier studies [
8]. On the other hand, the number of sickness absence days may be a better summary measure of health than the binary indicators, and it gives results similar to those with SAH.
An alternative way to proceed is to use these other health measures as instruments for SAH. This is the so-called multiple indicator solution [
23]. The true health status is unobserved, and when SAH is used as a proxy for health the measurement error in SAH is part of the error term of the model. Assume that there are two health indicators that have uncorrelated measurement errors, which are ignorable in the sense that if true health could be controlled, the indicators would have no explanatory power for retirement. Under these assumptions, one health indicator is a valid instrument for another. Panel B of
Table IV shows the associations of one standard deviation change in SAH with the retirement variables when indicators for pains and chronic illness are used as alternative instruments. The point estimates of the coefficient of health increase (in absolute value) in instrumental variable estimation, which is consistent with attenuation caused by measurement error. Using sickness absences (either self-reported or register-based) as an instrument produced very highest estimates in absolute value (not reported in the table).
The under-identification and weak identification tests showed that the instruments are relevant; that is, sufficiently correlated with SAH. When at least two instruments are used, overidentification can be tested; that is, that the set of instruments is not correlated with the error term. We tried using different pairs of two instruments at the same time. Hansen’s J test led to the conclusion that the instruments are valid in all of the three equations only if pains and chronic illness are used together as instruments. In this case, the estimate for the relationship of one standard deviation improvement in health with actual retirement age is 3.5 months, and with expected retirement age 3.8 months. When the number of absences was used together with either of the other instruments, the J-statistic was high in the equation for expected retirement age, and the null hypothesis of no correlation with the errors was rejected. Sickness absence seems to be a problematic variable as an instrument and it may be correlated with the measurement error in SAH, as both are summary measures of health.
Second, we have examined the role of health shocks. They are health-related events in the follow-up period that are more exogenous because they happen after the survey. Health measured in the survey may not play a big role if there are later severe health events. We measure the health shock by the average number of annual sickness absence days between the survey year and retirement, using register information on absences for which the Social Insurance Institution has paid sickness allowance. Use of absences may be problematic, since there may be latent health symptoms that affect both SAH and later sickness absences. To reduce this problem, we concentrate on initially healthy individuals. These are defined as those who have not had absences with sickness allowance during a two-year period (survey year or the year before), have answered in the survey that they have no chronic illnesses, and have a good health status (SAH equal to 4 or 5). These restrictions leave 466 observations, among which the average number of annual sickness absence days in the follow-up period has mean 4.96 and standard deviation 14.54. The correlation of the shock measure with SAH is negative and low in absolute value (–0.05), so the problem of latent health effects should not be serious. Even this measure is subject to justification bias, if people who would like to retire early, for example because of dissatisfaction with their work, take sickness absences to justify the need for early retirement even when their health is not poor. However, obtaining sickness allowance requires a medical certificate, and the allowance is paid only after 10 days of sickness. Therefore, just low job satisfaction without any observed medical symptoms is not sufficient.
Figure 2 shows the unconditional relationship between the shock and actual retirement age. Panel C of
Table IV shows that in the retirement age model the health shock is significant with a negative coefficient. An adverse health shock, measured as a one standard deviation increase in absences, is associated with 2.7 months earlier retirement. This is probably closest to a causal effect that we can get with our data. The estimate is close to the ordinary least squares (OLS) estimates, with absences as the health variable. One interpretation of this result is that the endogeneity of health is not a serious problem, at least when the number of absence days is the health measure. The shock is naturally also negatively related to the expectation error; that is, adverse health shocks lead to earlier retirement than expected. However, the future shocks should not be related to the (earlier) expectations. This is confirmed by the estimations, which supports the assumption that the health shock measure is not related to latent health that is likely to influence retirement expectations.
Third, we have investigated the sample selection issue. We expect that there is a selection bias toward zero in the coefficient of health in the retirement age equation when only observed retirement ages are used, since those with better health are overrepresented among those who have not yet retired during the observation period. For the expected retirement age there is no selection effect, as it is observed for all. The individuals with unobserved retirement times can be treated as censored observations. In practice, the censoring limit—that is, the highest age at which a non-retired individual is observed—is age at the end of the follow-up period in 2013. The limit, therefore, varies across individuals. We estimate the censored regression model using interval regression [
23]. When the retirement age is observed, we have point data (actual retirement age), whereas for the censored observations we know that retirement happens in the interval from age at the end of 2013 to infinity.
The results in Panel D of
Table IV show that censored regression indeed produces higher estimates. One standard deviation change in SAH is now associated with 3.2 months later retirement. As the expected retirement age is observed for all, the censored regression naturally produces the same estimates for the expected retirement age equation as would OLS. The difference in this case between Panels A and D is caused by the larger sample in Panel D. The estimate for health in the expectation errors equation is no longer the simple difference between the estimates in the retirement age and expected retirement age equations, since the models are nonlinear. When censoring is taken into account, health is no longer significantly related to the errors. The negative relationship between SAH and expectation errors in the baseline OLS estimates seems to be caused by more common censoring of retirement ages among healthy individuals.
Finally, as another estimation, where censoring is taken into account, we have used interval regression using information on the age intervals when retirements happen, rather than retirement age as a continuous variable. Using ordered models (ordered probit or logit) for retiring before, at, or after expected retirement age would not take into account the fact that individuals are observed at different ages and at different distances from their expected or actual retirement. In interval regression we can take this information into account. Specifically, retirement can happen in the following cases: in the interval between age at time of survey and expected retirement age; at expected retirement age (point data); in the interval between expected retirement age and age at the end of the follow-up in 2013; or retirement age can be right-censored. This estimation, therefore, combines information on both expected and actual retirement. Since the values for the limits of the intervals are used, the estimation produces association of health with actual retirement age. According to the results (Panel E of
Table IV), one standard deviation increase in SAH is associated with 3.6 months later retirement. Since the intervals are based on expected retirement ages, the estimation does not produce results on expected retirement ages or expectation errors.
Robustness analyses
We have carried out various robustness analyses, which we briefly describe without presenting the results in a table. So far, we have used SAH as a cardinal measure. An alternative is to use indicators for different levels of health. We used indicators for SAH levels 3, 4, and 5, with levels 1 and 2 combined as the reference. The coefficients of the indicators had an increasing pattern in the models for actual and expected retirement age and a decreasing (negative) one in the model for expectation error. Cardinality implies certain restrictions on the coefficients of the indicators: coefficients of adjacent indicators increase by the same amount and there are multiplicity constraints; for example, the coefficient of the fourth indicator is twice the coefficient of the second one. These constraints were accepted in the estimations with the indicators for health levels. Therefore, the use of a cardinal health measure is justified.
To investigate the heterogeneity of the health–retirement relationships, we tried including interactions of the health variable with education (indicators for secondary and tertiary education), gender, and income. The only statistically significant result from this exercise was that the interaction of the indicator for tertiary education and health was positive and significant in the model for actual retirement age. In the expected retirement age equation, even this interaction was insignificant. However, in the censored regression model none of the interactions were significant.
We have examined the sensitivity of the results to the choice of the sample. Since there may be disability retirements among those below age 60, we tried including in the estimations also those in the 2008 survey who were 50 to 54 years old at the end of 2008 and who, therefore, did not reach age 60 by the end of the follow-up. The baseline results were close to those reported above, as one standard deviation change in SAH was associated with 2.4 months later retirement. The similarity of the results is natural, since only 19 additional observations with observed retirement age were included. However, at the same time the extension of the sample added 547 new censored observations. Censored regression with this larger sample resulted in a somewhat higher estimate for SAH than in the baseline model. However, it is difficult to tell whether this was due to the larger sample size or the higher share of censored observations. As another robustness check we estimated the model using only the 2003 survey. Censoring should be a smaller problem in this group. On the other hand, the number of observations in the OLS estimations dropped to 792. The estimate for the health effect dropped and became insignificant, whereas the relationship between SAH and expected retirement age remained almost the same as with the larger sample. In the censored regression model, the health effect on retirement age was again significant. A one standard deviation change in SAH was associated with three months later retirement. This is close to what we obtained in the baseline estimations. Again, it is hard to tell whether the differences in some of the results are due to less censoring or smaller sample size.
A limitation of our analysis is that we deal with full-time retirement only. This restriction is dictated by the survey, which does not ask about expected part-time retirement age. We can still speculate how part-time retirement might affect the results. Had the survey respondents been asked about both expected part-time and full-time retirement age, it is possible that we would find a stronger relationship between health and expected retirement, as those with poor health would be more likely to plan to take part-time retirement. For actual retirements we can examine how part-time retirement affects the relationship between health and retirement. To do this we dropped from the data those who were already in part-time retirement during the surveys, added those who were observed to retire part-time but not full-time, and defined the actual retirement age according to part-time retirement for those for whom both part-time and later full-time retirement were observed. The analysis of our baseline model with this sample showed that one standard deviation increase in SAH was associated with 2.4 months later retirement, which indicates 0.4 months stronger association than in panel A of
Table 2. It is difficult to say how part-time retirement would affect the connection of expectation errors and health if both expected and actual retirement age were affected. Including more flexible retirement channels would possibly lead to more negative expectation errors (retirement earlier than expected) for those with poor health.