Introduction

In an era of self-directed retirement plans, in which individuals and households are called more and more to make their own saving and investment retirement decisions, building retirement wealth proves to be one of the biggest financial challenges of adult life. In the United States (U.S.), the focus of the current work, the May 2020 report on the economic well-being of households in 2019 (Board of Governors of the Federal Reserve System, 2020, p. 4) stated that, “About one fourth of non-retired adults does not have any retirement savings, 60 percent think their retirement savings are not on track, while nearly 6 in 10 non-retirees with self-directed retirement accounts express discomfort in making investment decisions with their accounts.”

Retirement decision-making is a complex process and is studied extensively across different demographic groups. The demographic of this article’s interest is that of gender. By gender, the paper refers to the individual’s gender, male or female. According to the 2021 Federal Reserve System report about the economic well-being of U.S. households (Board of Governors of the Federal Reserve System, 2022a, p. 81), “Among those non-retirees with self-directed savings,” males appear more “comfortable managing their retirement savings” compared to females.

This does not come as a surprise. It is interesting to note that, historically, public policies around retirement programs were originally established to address poverty in retirement, and in particular, retirement income and poverty across gender lines (Meyer, 1990, p. 554). In the extensive review of Embrey and Fox (1997, pp. 33–34), women appear to take fewer risks in their investment than men, and this behavior systematically holds them back in their retirement wealth accumulation.

The situation seems more worrisome for single women. Levine et al. (1999) found gender differences in anticipated retirement income by comparing unmarried men and unmarried women. While controlling for the position in the life cycle stage, and education, Schmidt and Sevak (2006) reported large differences in observed wealth in the U.S. between single-female headed households and married couples, as well as evidence that the wealth holdings of single females are significantly lower than those of single men. Fisher (2010) investigated gender differences in personal saving behaviors among single person households, and found that single women with low risk tolerance were less likely to save, and when they did save, were less likely to choose assets that have greater growth over time, leaving them financially unprepared for retirement. This paper does not attempt to deify financial risk. On the contrary, caution is needed with financial risk and the volatility of returns. However, the work espouses that retirement investment risk-taking contributes to long-run return. For instance, Choi et al. (2004, p. 83) demonstrated that even automatic enrollment to 401 k(s) “at least temporarily anchors participants at a low savings rate and in a conservative investment vehicle.”

Furthermore, as a point of differentiation, the current paper studies the role of gender in risk-taking retirement strategies using extensive data for a period spanning from 1989 to 2019. Adding the time dimension to the analysis, contributes a unique perspective to the ongoing research and policy discussion since an analysis over time has not been conducted before. Further, the focus is on single (never married) individuals. The singles’ sample selection secures the most direct test of gender differences as set in the gender literature, (i.e., in Embrey & Fox, 1997). The research question for this paper is: After controlling for a series of observable factors, what has happened to single women’s and men’s retirement risk-taking from 1989 to 2019?

To answer the question, the paper utilized U.S. retirement data from the Survey of Consumer Finances (Board of Governors of the Federal Reserve System (2022b) from 1989 to 2019 consisting of 11 cross sections of data. Econometrically, the study conducted difference-in-means tests across gender in each of the Survey of Consumer Finances (SCF) waves, and further pooled the 11 cross-sections incorporating a set of control and time variables into a regression framework. In addition, other factors affecting retirement investments are covered. Next, the SCF, as well as the SCF retirement assets in the analysis are presented, the regression framework is developed, followed by the empirical results, findings and a policy discussion including future directions of the research.

Factors that Impact Retirement Investments

Besides gender, which is the focus of the work, a factor frequently regarded as affecting retirement investment is that of financial literacy. Lusardi and Mitchell (2014, p. 6) defined financial literacy as “people’s ability to process economic information and make informed decisions about planning, wealth accumulation, debt, and pensions.” Lusardi and Mitchell (2011) reported a clear interrelationship between financial knowledge and planning. Individuals who displayed higher financial literacy were more likely to plan and plan successfully.

Another factor is that of risk tolerance, which Grable (2000, p. 625) determined as “the maximum amount of uncertainty that someone is willing to accept when making a financial decision.” The premise is that investors who tolerate higher risk tend to obtain higher returns over the long run. Yao et al. (2004) documented that households with very low risk-tolerance were unlikely to invest in stocks, and thus had greater difficulty achieving adequate retirement assets.

Another factor is age. In the majority of findings, the life-cycle theory of asset allocation prevails with individuals becoming more risk averse as they reach retirement, given that older individuals have less time to recover losses than younger ones, or since the need for fixed income increases. Säve-Söderbergh (2012) reported a hump-shaped relationship between risk taking and age with the youngest and oldest investors taking less risk than middle-aged investors. Nonetheless, for Yuh and Hanna (1997) it was younger people, in the 21–30 group, who took more risks in their retirement than people in the 31–40 and 41–50 age groups. According to Yuh and Hanna (1997, p. 3), the rationale is that people in the latter groups “tend to have short-term goals such as housing or college education for their children” and this made them “more conservative in their retirement investments.”

Lastly, behavioral factors have been gaining more and more ground in explaining retirement investments. Benartzi and Thaler (2001) investigated diversification strategies in defined contribution savings plans, and found evidence that participants made decisions that seemed to be based on naive (or confused) notions of diversification. Agnew (2006) provided evidence that some individuals were prone to behavioral biases, including allocation biases and a participation bias in individuals’ 401(k) investments. Lastly, Foster (2017) reported that a lack of trust in retirement providers and a myopic view of pensions might explain young people’s systematic retirement under-saving.

Survey of Consumer Finances

As already mentioned, the database utilized in this paper is the SCF (Board of Governors of the Federal Reserve System, 2022b). The SCF is a triennial interview survey of a nationally representative sample of U.S. families, sponsored by the Board of Governors of the Federal Reserve System with the cooperation of the U.S. Department of the Treasury. The SCF sample is not an equal-probability design. Thus, it is noted beforehand that this paper’s statistics, such as means and medians as well as more complex statistics related to regressions, are sample weighted. The weight used is given in the SCF variable X42001 (or × 42,001 in the latest SCF versions). Moreover, the paper used information from the first implicate of the five available implicates of the public data set.

The SCF is well regarded in the literature. For instance, Bajtelsmit et al. (1999, p. 4) characterized SCF as “the best available source of individual household wealth data collected in the U.S.” For this paper, the SCF provided an elaborate compilation of retirement assets not easily available. More specifically, by retirement assets (ra), the paper, in line with Bhutta et al. (2020, p. 33), refers to: “401(k), 403(b), thrift savings accounts from current or past jobs” and “other current job plans from which loans or withdrawals can be made”, as well as to individual retirement accounts (IRAs), Roth accounts, and Keogh accounts.

Table 1 presents the weighted mean retirement assets of single women and single men for each wave from 1989 to 2019, as well as weighted difference-in-means tests. From Table 1, it is deduced that in 10 of 11 SCF waves, single men possessed a higher dollar value in retirement assets than single women. Further, the difference was significant at the 1% level for the 1998 and 2004 SCF waves. Moreover, the difference was significant at the 5% level for the 2019 SCF, and marginally significant (10% level) for the 1995 and 2007 SCF waves.

Table 1 Weighted mean retirement assets & weighted difference-in-means tests of single women and single men from 1989 to 2019

Subsequently, this paper attempted to build risky retirement assets (rra). First, information was sought in the only investment question available in the SCF. It is noted that this question has several variations from SCF wave to SCF wave. For instance, in the 1989 SCF (Board of Governors of the Federal Reserve System, 1999, p. 88), the investment question is as follows: “How is the money in (this/these) account(s) invested? Is most of it in CDs or other bank accounts, most of it in stocks, most of it in bonds or similar assets, or what?”.

By CDs, the reference was to certificates of deposit. Following this question, the respondents of the 1989 SCF were given the following options to select from: 1 = CDs/bank accounts, money market; 2 = stocks, mutual funds; 3 = bonds/similar assets, T- bills, Treasury notes; 4 = combination of 1, 2, and 3, mixed/diversified, 5 = combination of 2 and 3; 6 = combination of 1 and 2; -7 = other; and 0 = inappropriate. Continuing with the SCF waves, the available options offered to the respondents increased, however, the qualitative character of the available responses remained.

To proxy riskiness in this framework, the paper consulted other approaches in the literature. In particular, the works by Yuh and Hanna (1997), Jianakoplos and Bernasek (1998), and Chang et al. (2018) were studied. All these authors worked with assigning riskiness weights to each option. For instance, for the 1989 SCF, for retirement assets invested in “stock; mutual funds”, Yuh and Hanna (1997, p. 8) assigned a riskiness weight of 1.0, while for retirement assets invested in CDs/bank accounts, money market or bonds/similar assets, T-bills, Treasury notes, a weight of 0.0 was assigned. Similarly, the authors assigned a riskiness weight riskiness of 0.33 to the retirement assets invested in the first combination, and a 0.5 weight to both the second and third combinations. All research sources consulted followed similar courses of action.

In this paper’s approach, weights were likewise assigned, quantifying a pool of risky retirement assets (rra) for each SCF wave. We acknowledge that this methodology is subject to a degree of arbitrariness but given the information available, we deem the approach rational, and in accordance with the theory of a diversified portfolio (Markowitz, 1952, p. 77). The paper’s assigned weights for each SCF wave are available upon request.

Lastly, the paper built the risky share of retirement plans or risky retirement ratio (rrr), as defined by the ratio of total retirement assets (ra) invested in risky retirement assets (rra). Table 2, presents the weighted mean rrr of single women and men from 1989 to 2019. In nine SCF waves, single men had higher risky retirement ratios than single women. Furthermore, the differences in risky retirement shares were significant at the 1% level in the 2001 and 2016 SCF, at the 5% level in the 2004 SCF, marginally significant (10% level) in the 2013 SCF and very slightly beyond the 10% level. in the 1992 SCF and the 2007 SCF. Thus, it could be stated that significant and marginally significant differences were documented in 4 (almost 6) of the 11 waves under study. Interestingly, in the latest 2019 SCF, female and male ratios converged. This might be random, but it could also be the beginning of a newer pattern, and it is worth continuing this investigation.

Table 2 Weighted mean risky retirement ratios & weighted difference- in-means tests for single women and single men from 1989 to 2019

A Regression Framework in the SCF

Next, the paper used a regression model to explain rrr with the help of a series of factors. The suggested framework for every respondent k, is given in Eq. (1).

$$\begin{aligned}{rrr}_{{\text{k}}}=&\;{\upbeta }_{0}+{\upbeta }_{1}*{{\text{ln}}wealth}_{{\text{k}}}+{\upbeta }_{2}*{kids}_{{\text{k}}}+{\upbeta }_{3}*{homeowner}_{{\text{k}}}\\&+\sum_{{\text{i}}=1}^{{\text{i}}=10}{\upgamma }_{{\text{i}}}{age}_{{\text{ik}}}+\sum_{{\text{j}}=1}^{{\text{j}}=4}{\updelta }_{{\text{j}}}{race}_{{\text{jk}}}+\sum_{{\text{q}}=1}^{{\text{q}}=4}{\uplambda }_{{\text{q}}}{tolerance}_{{\text{qk}}}+{{\text{u}}}_{{\text{k}}}\end{aligned}$$
(1)

By wealth, the liquid net worth of the respondents was assumed, and the paper followed the methodology of Friend and Blume (1975, pp. 906–907), who presumed indivisibility and transactability of assets. The paper also followed the Federal Reserve System algorithm (Board of Governors of the Federal Reserve System, 2022b) for the net worth computation. For the rest of the explanatory variables, kids referred to the number of people 18 years or younger in the household, and homeowner was a dummy variable indicating whether or not the respondent was homeowner. It is noted that the paper followed the SCF documentation (Bhutta et al., 2020, p. 8, footnote 13) since “a family was considered a homeowner if at least one person in the family owns at least some part of the family’s primary residence.”

Next, age was a set of dummy variables indicating into which of 10 age categories the respondent fell, and race was a set of dummy variables indicating into which of four race categories the respondent fell. Lastly, tolerance indicated which of four statements of financial risk came closest to the amount of financial risk that the respondent was willing to take when saving or making investments: 1. “take substantial financial risks expecting to earn substantial returns”, 2. “take above average financial risks expecting to earn above average returns”, 3.“take average financial risks expecting to earn average returns”, 4. “not willing to take any financial risks.”

In Tables 3 and 4, the weighted mean values of all the independent variables previously discussed for single women and single men, respectively, and for all the SCF waves from 1989 to 2019 are presented. A very interesting finding arose from the observation of the mean responses of the risk tolerance question. In particular, although the financial risk tolerance question was a hypothetical question, single women’s higher pre-disposition towards selecting “No Risk” as a response in their intended investments, and similarly, single men’s higher pre-disposition towards selecting “Above Average” and “Substantial Risk” options were noteworthy.

Table 3 Weighted descriptive statistics for single women 1989–2019
Table 4 Weighted descriptive statistics for single men 1989–2019

Next, given that the focus of the paper is on the time period analysis, the model of Eq. (1) was used in a pool created by combining the 11 waves from 1989 to 2019. Each wave was treated as a random sample. Thus, the resulting pool was treated as an independently pooled cross-section. Following Wooldridge, (2013, p. 449), this facilitated an increased sample size and more precise estimators and test statistics with more power. Further, to reflect the likelihood that the population may have different distributions in different time periods, the intercept in Eq. (1) was allowed to vary across each SCF period. To accomplish this, year dummy variables were included for all but one periods, with the earliest wave of 1989 serving as the base.

The analysis focused on the pattern of coefficients for the year dummy variables, since it could help answer whether, after controlling for a series of factors, risk-taking in retirement assets for women and men changed over time. Lastly, since rrr could only take values between zero and one, a maximum likelihood Tobit regression procedure was used for the estimations, which allowed for both an upper and lower bound.

Empirical Results

Tobit Regression I

For the purposes of the data analysis, Stata/SE 17.0 was used. Table 5 presents the regression results after running the Tobit model separately for single women and for single men. Regarding the Tobit specification, it is noteworthy that the conditional mean of the dependent variable is: \(\Phi ({\varvec{x}}\widehat{{\varvec{\beta}}}/\widehat{\sigma }){\varvec{x}}\widehat{{\varvec{\beta}}}+\widehat{\sigma }\varphi ({\varvec{x}}\widehat{{\varvec{\beta}}}/\widehat{\sigma })\) as described by Greene (2003, Chapter 19, p. 849). Thus, the marginal effects of the explanatory variables (both continuous and discrete) required some adjustment in their computation. To account for this, the paper followed Cameron et al. (2010, p. 529).

Table 5 Weighted tobit regression results run separately for single women and single men in pooled data from 1989 to 2019

Next, a Wald test adjusted for the use of sample weights was used to examine whether the estimated equations between single women and single men were significantly different. The adjusted Wald test revealed that the estimated equations for single women and single men were significantly different (chi2(28) = 41.90 and Prob > chi2 = 0.030).

From Table 5, it was deduced that for single women, the signs of the coefficients for the year dummy variables showed mixed results, since in six out of ten SCF waves, an increase in female retirement risk-taking was observed in comparison with the base year of 1989, while in four out of ten SCF waves, a decrease in female retirement risk-taking was observed with respect to the base year. It was noteworthy that arithmetically, particularly for the SCF waves showing a decrease in female retirement risk-taking the coefficients were very small and insignificant.

It is stressed that since the analysis controlled for a series of factors, the coefficients on y95, y98, y01, y10, y16, and y19 represented increases in risk-taking in retirement strategies for single women for reasons that were not captured in the explanatory variables. Also, all year dummy variables were individually insignificant, but jointly significant. For single men, the results revealed a clearer pattern. The coefficients on the year dummy variables, showed a consistently positive trend in retirement risk-taking over time. Further, almost all year dummy variables were highly significant (p < 0.05 or p < 0.01), both individually and jointly.

With respect to wealth, other things being equal, the effect of additional wealth on retirement risk-taking was positive for single women but negative for single men. However, the numerical affect was very close to zero and insignificant for both groups. With respect to kids, the presence of one dependent increased retirement risk-taking in single women but the effect was not significant. On the contrary, the presence of one dependent, holding all other factors fixed, significantly decreased retirement risk-taking in single men.

With respect to homeownership, being a homeowner, holding everything else constant, increased retirement risk-taking for both groups. Nonetheless, the corresponding effect was marginally significant (10% level) for single men and insignificant for single women. With respect to race, other things being equal, black single women took fewer retirement risks than white single women, but the effect was not significant. Black single men took fewer retirement risks than white single men, but the effect in this group was marginally significant (10% level). With respect to age, for both single women and single men, all age groups took lower retirement investment risk than the base group of less than 25. This result seemed in accordance with those researchers who argued that it was younger investors who were more prone to retirement risk-taking than the older investors (e.g., Yuh & Hanna, 1997, p. 3).

Tobit Regression II

Table 6 presents the regression results after running the Tobit regression jointly for single women and single men in the pooled SCF data from 1989 to 2019. Here the dummy variable female was included to capture the gender effect, and at the same time, female was interacted with each time variable. By adding the interactions female*y92, female*y95, …, the relationship between rrr and female was allowed to vary in each year.

Table 6 Weighted tobit regression results run jointly for single women and single men in pooled data from 1989 to 2019

With regards to the gender effect results, in 1989 (the base year), other things being equal, a single woman took 14.5% more retirement risk than a single man. Nonetheless, the coefficient for female was insignificant. For the SCF waves after 1989, the gender*time interaction terms were analyzed. Thus, for instance, in 1992, the gap in retirement risk-taking was (0.145—0.364) or -0.219 or -21.9%, that is, holding all other factors constant, being a single women led to taking 21.9% less retirement risk than a single man with respect to the base year of 1989. Similarly calculated, a gender gap of + 5% in 1998, -17.4% in 2001, -8.8% in 2004, -3.7% in 2007, -1.5% in 2010, -6.2% in 2013, -7.2% in 2016, and + 3.8% in 2019 were recorded. The interaction terms were statistically significant in 1992 and marginally significant (10% level) in 2001, but insignificant in the other SCF waves.

To reiterate, every wave was treated as a random sample. Thus, the computed retirement risk-taking gender gap in every wave was dependent on the macro and micro economic environment of each period. Nonetheless, for the analysis with regards to the base year of 1989, it was deduced that in eight out of 10 SCF waves, being female led to lower retirement risk-taking than being male. Lastly, it is noteworthy that, over time, the gender gap dropped algebraically in magnitude.

Conclusion: Policies & Future Directions

This final part of the paper concludes with a more general discussion highlighting policies in the literature that may lead to an increase in individuals’ and households’ retirement savings and investment, regardless of gender. As Thaler and Benartzi (2004, p. 165) commented, saving for retirement requires self-control. Further, households are prone to mistakes, and especially some low-saving households would especially welcome aid in making saving decisions. Thus, the use of prescriptive savings programs committing individuals to allocations towards retirement savings can significantly increase retirement wealth. Moreover, Benartzi and Thaler (2013) recommended four essential ingredients to any comprehensive plan to facilitate adequate retirement saving: availability, automatic enrollment, automatic investment, and automatic escalation.

Other researchers studied the effect of informational interventions. For instance, Goldin et al. (2017) studied how variation in the choice environment, e.g. by nudging individuals towards higher or lower contribution rates, could affect retirement savings decisions. For other researchers, comprehension becomes central. Borrowing concepts from domains such as educational psychology, several researchers tested whether retirement products could be simplified and comprehension could be improved.

Clark et al. (2019) found that among workers participating in a retirement plan, individuals who received an informational flyer increased their contributions in the months following the intervention relative to the control group. Lastly, McGowan and Lunn (2020) studied the influence of explanatory diagrams on people’s understanding of how pensions work, and they found that although diagrams did not necessarily improve comprehension of pension products, they nonetheless supported decision-making by facilitating causal thinking.

Finally, financial literacy proves pivotal. Lusardi and Mitchell (2023, p. 148) reported findings that in the U.S., 30–40% of wealth inequality near retirement could be accounted for by financial literacy. Accordingly, they concluded that “financial literacy matters”, and it matters a great deal. Almenberg and Dreber (2015) showed that when controlling for basic financial literacy, the estimated gender gap in stock market participation diminished. In the same spirit, Van Rooij et al. (2011) showed that financial literacy affected financial decision-making, and that those with high literacy were more likely to invest in stocks.

Consequently, targeting financial education to specific groups of the population that are more ailing and less sophisticated proves imperative. For sure, building education takes time. However, a continuing education pattern, helping both women and men to first identify their saving and retirement needs, second to make informed investment decisions, and third to monitor their effort, could prove beneficial.

Regarding future directions of the current research, an interesting pattern to explore would be to account for household income in the regression framework and the analysis. There is extensive literature available (Sullivan & Meschede, 2016; Tamborini & Kim, 2020) that documents that there is a material income gap on the basis of gender and race that could be expected to affect both retirement assets, and subsequently retirement risk-taking. Therefore, significant differences in gender and race with respect to risky retirement ratios could be more related to labor market failures, for instance wage discrimination, and occupational segregation within the context of a Blinder-Oaxaca decomposition. Thus, the current research could extend policy implications to address income inequality. In particular, acknowledging a reasonable correlation between wealth and income, the analysis could use first, wealth versus income variables, followed by wealth and income variables, and/or wealth versus the income ratio (that is wealth divided by income) to also take into account potential considerations of heteroscedasticity.