1 Introduction

Ever since the work of Keynes (1936) and Katona (1951), economists have widely acknowledged the importance of subjective (psychological) determinants of economic behaviour. In an effort to properly quantify economic sentiment, US researchers have focused mainly on the University of Michigan consumer surveys and the corresponding indicators of consumer sentiment (see Curtin (2019) for details). On the other hand, European scholars and practitioners have greatly benefited from the Joint Harmonized EU Programme of Business and Consumer Surveys (European Commission 2021). Its wide sectoral coverage has stimulated researchers from various fields of economics to assess the predictive characteristics of sectoral sentiment measures. Perhaps the most widely utilized European indicator of this sort is the Economic Sentiment Indicator (ESI hereinafter). It is a composite leading indicator derived as a weighted average of five sectoral confidence indicators (for the industrial sector, retail trade, construction, services, and the consumer sector). Although several attempts have been made to enhance its methodological grounds and improve its forecasting accuracy (Gelper and Croux 2009; Sorić et al. 2016; Lukač and Čižmešija 2021), the ESI has stood the test of time, and the European Commission has not altered its quantification procedure due to the high opportunity costs of such revisions.

On the basis of the above studies, it seems safe to conclude that the phenomenon of economic sentiment has been thoroughly examined from a methodological viewpoint. However, we are still quite puzzled about the generating process of economic sentiment. How persistent is the stated phenomenon? Do major economic and political events represent mere transitory shocks to economic sentiment, or is the ESI generated through a long-memory process? Answers to these questions lie in the domain of fractional unit root testing. Namely, the development of fractional unit root tests allows us to quantify the degree of persistence of economic sentiment. These concepts have been widely utilized to test the persistence of various sorts of political sentiment and approval rates (Box-Steffensmeier and Smith 1998; Lebo et al. 2000; Box-Steffensmeier and Tomlinson 2000). They have even been employed for persistence analysis of macroeconomic fundamentals such as inflation (Gadea and Mayoral 2006; Kumar and Okimoto 2007), exchange rates (Cheung 1993; Cheung and Lai 2001), and stock market indices (Caporale et al. 2021). However, the literature is completely silent on the issue of economic sentiment persistence. This paper aims to fill that gap in the literature by examining ESI data for all EU economies.

Our results indicate that the ESI largely exhibits the characteristics of a long-memory process. We postulate that this finding has very important policy implications. Namely, it is widely acknowledged that the ESI is a leading indicator of aggregate economic activity (Gayer 2005; Van Aarle and Kappler 2012; Sorić et al. 2013) and stock market returns (Fernandes et al. 2013). The established high persistence of economic sentiment signifies that extreme events such as high uncertainty and global economic crises have secular effects on the ESI, not purely transitory ones. Relatedly, one must bear in mind that market economies are not necessarily able to self-adjust to new equilibria after a sizable shock (Dimand 2005). In such circumstances, not only do such large negative shocks directly weaken the economy, but they also indirectly aggravate the already gloomy economic outlook by corroding economic sentiment. From the viewpoint of the Lucas (1976) critique, the rational expectations hypothesis (REH) postulates that any type of economic policy should be futile since rational agents should be able to anticipate economic shocks, so economic sentiment is expected to be highly correlated with the business cycle itself. However, the REH has been firmly empirically disapproved (Acemoglu and Scott 1994; Sorić et al. 2020a, b), so the persistence of economic sentiment enables it to act as a separate driver of the real economy, apart from economic fundamentals. In that sense, our findings motivate a revival of Keynesian-type, timely, well-thought and well-communicated countercyclical policies that should prevent such a deterioration of economic sentiment in response to any kind of large economic shock.

Our second contribution lies in examining the potential structural shift in the persistence of economic sentiment. Namely, the global financial crisis marked the end of the Great Moderation era (Bean 2010). During the mentioned period of secularly reduced macroeconomic volatility, the ESI has firmly been established as a short-term predictor of economic activity (Čižmešija and Škrinjarić 2021). In this paper, we find evidence that the 2008 recession did not disturb the long-memory properties of economic sentiment. The ESI’s persistence has remained approximately the same since 2008, accentuating the role of economic sentiment in governing business cycles and shifting policy-makers’ attention to the longevity of sentiment shocks.

Even when we account for endogenously determined multiple structural breaks in the generating process of economic sentiment, the results do not change much. The ESI indeed exhibits long-memory characteristics.

Finally, we find that new EU member states exhibit only marginally more intensive persistence of economic sentiment than the core EU economies. This result obviously stems from the considerably higher macroeconomic volatility and uncertainty surrounding the former countries. It seems safe to conclude that the economic sentiment of the EU as a whole (regardless of its composition) exhibits long-memory characteristics.

The remainder of the manuscript is organized as follows. Section 2 discusses the rare empirical tests of sentiment persistence. Section 3 addresses data and methodological issues. The fourth section presents the obtained results, while the concluding section provides some policy implications and directions for future research.

2 Literature review

In the 1970s, economists became more occupied with the statistical properties of economic time series, trying to cope with central macroeconomic issues—namely, understanding business cycles and growth (Snowdon and Vane 2005). From a purely statistical standpoint, the initial equation in that sense is the basic first-order autoregressive model:

$$ y_{t} = c_{t} + \phi y_{t - 1} + \varepsilon_{t} $$
(1)

where \({c}_{t}\) is a constant, often referred to as the ‘drift’ of the dependent variable, \(\phi \) is the autoregressive parameter standing alongside the lagged dependent variable, and \({\varepsilon }_{t}\) is a white noise error term. In case \(0<\phi <1\), any shock to \({\varepsilon }_{t}\) would result in a transitory effect on the variable of interest. However, if \(\phi =1\) and the process evolves as a random walk, a certain shock to \({\varepsilon }_{t}\) generates permanent, long-run effects.

These statistical properties gave economists the notion that they could discriminate between impulse (transitory) and persistent (propagation) economic mechanisms. In a very influential paper, Nelson and Plosser (1982) found that the US real GNP is as persistent as a random walk, so the changes that we perceive in output are in fact permanent. This finding, later confirmed by Campbell and Mankiw (1987), Stock and Watson (1988) and Durlauf (1989) challenged conventional wisdom, as it suggested that the economic forces determining the trend are the same as those causing the cycle (Snowdon and Vane 2005). The view that long-term growth and short-run fluctuations are not distinct occurrences gave rise to the neoclassical counterrevolution and opened a window to a different way of understanding business cycles.

Pursuing a random walk model, the present value of national output is in fact the summation of unrelated random past events that permanently shift the process from its deterministically defined mean or trend (Silverberg and Verspagen 2003). One might presume that something similar occurs when examining associated variables in the same manner. As a commonly acknowledged leading indicator of economic activity, economic sentiment might also be a long-memory process. For instance, an already established view is that the time dependence of voting sentiment arises from people’s prior life experiences and even sluggishly evolving social norms (Deboef 2000). If economic sentiment also exhibits high persistence, the odds are high that agents’ economic attitude is enduringly influenced by significant events, therefore explaining considerable fractions of output fluctuations.

Apart from a better understanding of how the process of peoples’ economic sentiment evolves, there is another benefit of knowing its true memory characteristics. Long memory specifically means that the process can be drawn away to a new equilibrium either as a consequence of a new turn of events or by targeted economic policy measures. Loosely speaking, the former might be interpreted as leaving things to chance, while the latter might be interpreted as taking the matter into one’s own hands. Therefore, if economic sentiment is a highly persistent process, it serves as an argument for interfering in economic matters. In this case, prudently conducted economic policy might be crucial to achieve stability or general well-being, but policy-makers could also obtain an indication of the preferable direction and design of policy measures. Given the scarcity of behavioural economics literature on this issue, we are motivated to discover the generating process of economic sentiment series.

In fact, most formal statistical studies of sentiment persistence do not come from economics but from political science—which is somewhat strange, as economists tend to see themselves as methodologically dominant over other social scientists (Fourcade et al. 2015). One of the earliest contributions in this respect was provided by Box-Steffensmeier and Smith (1996), who applied fractional integration tests on US survey data to reveal that partisanship equilibrates rather slowly. Its response to major societal shocks lasts for several years. The same line of argumentation and methodological framework was continued by Box-Steffensmeier and Smith (1998) with a comparison of the persistence of partisanship to that of similar political phenomena. In this sense, Box-Steffensmeier and Smith (1998) provided evidence that US partisanship exhibits higher persistence than consumer confidence and the presidential approval rate, while Box-Steffensmeier et al. (1998) found that macropartisanship equilibrates more slowly than macroideology. Perhaps the most comprehensive study of this sort was done by Lebo et al. (2000), who identified patterns of long memory in a rich US set of commonly utilized political and economic variables, including macropartisanship, consumer sentiment, and several of its components. It is also worthwhile to note that Box-Steffensmeier and Tomlinson (2000) found rather mixed results in a similar empirical exercise. They concluded that both the US congressional approval rate and economic expectations (both survey-based measures) can be either integrated of order one or fractionally integrated.

Along with the research on the US, similar literature was growing for democracies in Europe. Byers et al. (1997, 2000) pioneered the discovery that UK macropartisanship exhibits fractional dynamics (with an integration order of approximately 0.75). Later, Clarke and Lebo (2003) and Davidson (2005) empirically confirmed this finding by identifying the long-memory characteristics of British government support. Something similar has also been detected in German, Danish and French multiparty systems (Hellwig 2007; Ohmura, 2021), further confirming the robustness of this result.

On the other hand, to the best of the authors’ knowledge, the long-memory properties of the ESI have not been analysed and discussed in the context of a series of fractional integration tests. However, the literature does consider several financial variables that can be related to the concept of economic sentiment. As Bachmann et al. (2013: 243) pointed out, stock market indices are often viewed as forward-looking variables, just like the ESI. In this context, it is worth noting the work of Schadner (2021), who studied a set of US and European stock market sentiment indicators based on implied volatility measures and investor surveys. He used multifractal detrended fluctuation analysis to observe that stock market sentiment is universally antipersistent, meaning that investors tend to abruptly overreact to news. The observed antipersistence is even larger for the European market than for the US. Further analysis reveals that fear (implying negative news) and greed (positive news) have diverging effects on persistence. The greater the fear, the stronger is the overreaction of market sentiment. On the other hand, greed drives sentiment closer to a random walk process. Other contributions in this field deal specifically with stock market indices. A noteworthy contribution in this respect was provided by Caporale et al. (2016), who employed parametric and semiparametric fractional integration tests on two Ukrainian stock market indices. Their results firmly point to the conclusion that stock markets exhibit long-memory properties. However, the magnitude of the observed persistence is time varying. It considerably increased during the global financial crisis, making the market more predictable and more vulnerable to shocks. Very similar results were also provided by Caporale et al. (2018). They extracted a measure of market fear from the VIX and found that it does not show signs of long memory in the growth phases of the cycle. Nevertheless, bearish markets induce persistence.Footnote 1 Observing the variables’ statistical features in the system composed of investor sentiment and returns in Germany, Lux (2012) found that medium-run sentiment is more persistent than short-run sentiment, which is characterized by abrupt mood changes.

To some degree, our research is also related to the work of Ghonghadze and Lux (2012). Using similar survey-based data, these authors estimated a canonical opinion dynamics model for selected EU countries and compared its forecasting performance to that of its time series counterpart, Autoregressive Fractionally Integrated Moving Average (ARFIMA) model. A natural follow-up is to investigate in detail the characteristics of the EU-wide economic sentiment process with a set of different fractional integration estimation methods.

3 Data and methodology

Let us first briefly present the methodological foundation of the ESI, the focal variable in our study. The input for quantifying the ESI is gathered from the EU Business and Consumer Surveys. These surveys are conducted each month in all EU member states and several other European countries using a fully harmonized methodological framework (European Commission 2021). They assess a wide variety of microeconomic (e.g., households’ financial position and consumption patterns) and macroeconomic phenomena (e.g., inflation, unemployment and the general economic climate in the country) via nationally representative samples. Survey questions are commonly qualitative, so respondents are offered answers on an ordinal scale (assessing whether the variable at hand increased (will increase) or decreased (will decrease) in the observed past (future) period. The obtained answers are quantified in the form of response balances (i.e., differences between positive and negative answers to each survey question). This concept enables the European Commission to calculate different composite indicators of economic sentiment. For brevity, we introduce only the concept of the ESI.

The ESI is conceptualized as a weighted average of 15 seasonally adjusted survey response balances from five separate economic sectors. From the industrial sector, it comprises production expectations, order books, and stocks of finished products (the latter with an inverted sign). Regarding the services sector, it assesses responses on the overall business climate and the recent and expected evolution of demand. Third, it utilizes response balances on the expected evolution of four variables from the consumer sector: the financial situation of households, the general economic situation, unemployment (with an inverted sign) and savings, all for the horizon of the next 12 months. With regard to the construction sector, the ESI comprises the response balances of current order books and employment expectations. Last, two variables from the retail trade sector are also included: current and future business conditions and stocks (the latter with an inverted sign).

The computation of the ESI comprises three methodological steps. First, each of the 15 individual ESI components is standardized over a frozen period to avoid data revisions with every additional monthly observation. Second, each component is attached to its corresponding weight, aiming to mimic the overall structure of the economy and enable tracking with GDP as a reference variable. The current weights are as follows: industry 40%, services 30%, consumers 20%, construction 5%, and retail trade 5%. In the application of the stated weighting scheme, the total weight of each sector is divided by the number of assessed balances from that sector. For example, each of the two retail trade balances is attached a weight of 2.5% (summing to a total weight of 5% for the retail sector). For easier interpretation, the weighted average is finally scaled to have a long-term mean of 100 and a standard deviation of 10.

We assess monthly ESI data for 27 EU member states, the United Kingdom, and the EU and euro area (EA) aggregate. The analysed time period is chosen based on the introduction of tendency surveys in individual economies (1985M01–2021M06 at most). The examined periods for each assessed country, along with the descriptive statistics of the underlying series, are shown in the Appendix.

One of the starting points in almost any time series analysis is testing for unit root and assessing the stationarity property. Stationarity implies a process with short memory that absorbs all shocks and avoids behaviour change. On the other hand, nonstationarity implies long-memory persistence that permanently alters behaviour after the shock. The most popular tests to determine stationarity are the augmented Dickey–Fuller (ADF) test, the Phillips–Perron (PP) test and the Kwiatkowski–Phillips–Schmidt–Shin (KPSS) test. These tests start by assuming a simple AR(1) process \({y}_{t}={c}_{t}+\phi {y}_{t-1}+{\varepsilon }_{t}\) as in Eq. (1). The model can be altered in many ways, such as by dropping the constant term and adding a trend line (depending upon the time series’ properties). Unit root tests are essentially based on the parameter \(\phi \), testing whether it is significantly different from unity or if the first-differenced time series is stationary. However, just exploring \(\phi \) does not necessarily provide enough evidence to assess long-memory persistence.

Fractional integration is a specific strand of the literature that explores different angles of the long-memory property. The main idea is to abandon the binary world of stationarity I(0) versus unit root I(1) and explore differencing with any real value. Fractionally integrated series may naturally occur as a result of aggregation (Robinson 1978, Granger 1980 and Leipus et al. 2013) or changes in the original series, such as regime switching or structural shocks (Diebold and Inoue 2001 and Gil-Alana 2008). Granger (1980) studies the aggregation effect and provides evidence that long-memory series result from (cross-sectional) aggregation of heterogeneous autoregressive processes. After individual short-memory series are averaged, the average may exhibit long-memory effects. Diebold and Inoue (2001) demonstrate how regime-switching behaviour can be confused for the long-memory property. However, several economic phenomena, such as real exchange rates (Gil-Alana 2000) and inflation (Gadea and Mayoral 2006), indeed have strong persistence.

The theoretical foundation for using fractionally integrated analysis lies in the possible aggregation effect. The ESI indicator is by its construction a (cross-sectional) weighted average of individual consumers’ and managers’ responses. It comprises 15 series weighted according to sector importance. Although individual series may exhibit some persistence, their aggregate can be a series with long memory. With all this in mind, we depart from the usual dichotomy of stationarity vs. nonstationarity within the \(ARMA(p,q)\) framework and explore the possibility of differencing with any real number d. The \(ARFIMA\left( {p,d,q} \right)\) setup builds upon this idea and introduces the following model:

$$ {\Phi }\left( L \right)\left( {1 - L} \right)^{d} y_{t} = {\Theta }\left( L \right)\varepsilon_{t} $$
(2)

where \(L\) is the lag operator and \({\Phi }\left( L \right) = 1 - \phi_{1} L - \phi_{2} L^{2} - \phi_{3} L^{3} - \ldots \phi_{p} L^{p}\), \({\Theta }\left( L \right) = 1 + \theta_{1} L + \theta_{2} L^{2} + \ldots + \theta_{q} L^{q}\), and \(\phi_{1} , \phi_{2} , \ldots ,\phi_{p} , \theta_{1} , \theta_{2} , \ldots \theta_{q}\) are parameters. \(\left( {1 - L} \right)^{d}\) is the fractionally integrated operator defined with

$$ \left( {1 - L} \right)^{d} = \mathop \sum \limits_{k = 0}^{\infty } \frac{{{\Gamma }\left( {k - d} \right)}}{{{\Gamma }\left( { - {\text{k}}} \right){\Gamma }\left( {k + 1} \right)}}L^{k} $$
(3)

where \({\Gamma }\left( . \right)\) is the gamma function. Parameter d determines long-run dependencies (stationarity), while ARMA components cover short-run dependencies. Granger and Joyeux (1980) demonstrate that if all roots of \({\Phi }\left( L \right)\) and \({\Theta }\left( L \right)\) lie outside the unit circle and if \(- \,0.5 < d < 0.5\), then process \(y_{t}\) is stationary and invertible. However, if \(d \ge 0.5\), then \(y_{t}\) is nonstationary, and the variance is not finite. If d = 0, then \(y_{t}\) is a short-memory process with exponential descent in the autocorrelation function. If \(0 < d < 0.5\), then \(y_{t}\) is still a short memory process with autocorrelations that die out according to a hyperbolic function. If \(0.5 \le d < 1\), \(y_{t}\) is a nonstationary mean-reverting process, and shocks do not have a permanent effect (they may be long-lasting but not permanent). If \(d \ge 1\), then \(y_{t} \) is nonstationary, and shocks have a permanent effect. These four categories (\(d = 0\), \(0 < d < 0.5\), \(0.5 \le d < 1\), and \(d \ge 1\)) depict more colourful behaviour than the conventional unit root (UR) tests.

Estimating the parameters in (2) is not as easy as in (1). We can first estimate parameter d and then, conditional on this, re-estimate all the other parameters. Some techniques allow for all parameters to be estimated at the same time, and some skip estimation in the AR and MA parts. We explore several estimation techniques in Stata software: the Geweke and Porter-Hudak method (GPH hereinafter; gphudak procedure in Stata), a modified form of GPH introduced by Phillips (1999a, 1999b) (MGPH hereinafter; modlpr procedure in Stata), and Robinson’s (1995) estimate (ROB hereinafter; roblpr procedure in Stata).

The GPH method (Geweke and Porter-Hudak 1983) is a seminonparametric framework that estimates parameter d in the ARFIMA(p,d,q) model without estimating the AR and MA parts. It also allows testing for long memory by assuming values for d. The estimation base is a spectral regression model:

$$ \log \left( {I\left( {\lambda_{j} } \right)} \right) = c - d\log \left| {1 - e^{{i\lambda_{j} }} } \right|^{2} + \eta_{j} , j = 1 \ldots ,n $$
(4)

where fundamental frequencies \(\lambda_{j} = 2\pi j/T,{ }T = 1,2, \ldots T - 1\), \(n \ll T\), \(\eta_{j}\) is an i.i.d. error term with mean and variance equal to \(\left( {0,\sigma_{\eta }^{2} } \right)\), and \(c\) and \(d\) are parameters. The discrete Fourier transformation of variable \(y_{t}\) equals:

$$ w\left( {\lambda_{j} } \right) = \frac{1}{{\sqrt {2\pi n} }}\mathop \sum \limits_{t = 1}^{n} y_{t} e^{{it\lambda_{j} }} $$
(5)

and \(I\left( {\lambda_{j} } \right)\) from (4) is given as \(I\left( {\lambda_{j} } \right) = w\left( {\lambda_{j} } \right)w\left( {\lambda_{j} } \right)^{*}\). After the periodogram of \(y_{t}\) is calculated, the parameter d is estimated from (4) with ordinary least squares. The estimator equals:

$$ \hat{d} = \frac{{\mathop \sum \nolimits_{j = 1}^{T} I\left( {\lambda_{j} } \right)\log \left| {1 - e^{{i\lambda_{j} }} } \right|}}{{2\mathop \sum \nolimits_{j = 1}^{T} \log^{2} \left| {1 - e^{{i\lambda_{j} }} } \right|}}. $$
(6)

The variable part of the estimation formula determines the number of fundamental frequencies that are included in Formula (6). If T is too small, then d is estimated on a small sample, but if T is too large, then medium- and high-frequency components of the spectrum can influence the estimate. The usual approach is to use T that equals \(T^{0.5} = n\).

Phillips (1999a, b) analyses the GPH method and concludes that it has low performance for differencing parameters larger than one. He modifies the log periodogram estimator in (5) by including an exact representation of the discrete Fourier transformation of \(y_{t}\) in the unit root case:

$$ w^{\prime}\left( {\lambda_{j} } \right) = \frac{{w\left( {\lambda_{j} } \right)}}{{1 - e^{{i\lambda_{j} }} }} - \frac{{e^{{i\lambda_{j} }} }}{{1 - e^{{i\lambda_{j} }} }}\frac{{y_{n} }}{{\sqrt {2\pi n} }} $$
(7)

with the modification

$$ v\left( {\lambda_{j} } \right) = w^{\prime}\left( {\lambda_{j} } \right) + \frac{{e^{{i\lambda_{j} }} }}{{1 - e^{{i\lambda_{j} }} }}\frac{{y_{n} }}{{\sqrt {2\pi n} }} $$
(8)

and periodogram ordinates with property \(I_{v} \left( {\lambda_{j} } \right) = v\left( {\lambda_{j} } \right) v\left( {\lambda_{j} } \right)^{*}\). After the GPH method is modified, the corresponding test is consistently against both hypotheses \(d < 1\) and \(d > 1\). The MGPH method also inherits the problem of selecting the number of fundamental ordinates, so the same recommendation remains: choose T that equals \(T^{0.5} = n\).

Robinson (1995) builds upon the GPH approach and the Whittle function in the frequency domain (Dahlhaus 1989) and suggests a semiparametric estimator for d. The Stata procedure roblpr (details in Baum 2000) covers the estimation of the parameter d in a more general situation having G series at the same time and allowing common parameters. The procedure simplifies to a univariate case with G = 1. The periodogram for variable \(y_{t}^{\left( g \right)} ,{ }g = 1,2, \ldots ,G\) equals:

$$ I_{g} \left( {\lambda_{j} } \right) = \frac{1}{2\pi n}\left| {\mathop \sum \limits_{t = 1}^{n} y_{t}^{\left( g \right)} e^{{it\lambda_{j} }} } \right|^{2} $$
(9)

and the vector of corresponding differencing parameters \({\varvec{d}}\) is estimated with the equation

$$ \left[ {\begin{array}{*{20}c} {\hat{\user2{c}}} \\ {\hat{\user2{d}}} \\ \end{array} } \right] = vec{ }\left\{ {\user2{Y^{\prime}Z}\left( {\user2{Z^{\prime}Z}} \right)^{ - 1} } \right\} $$
(10)

where \(X_{g,j} = \log I_{g} \left( {\lambda_{j} } \right)\), \({\varvec{c}} = \left( {c_{1} , \ldots c_{G} } \right){^{\prime}}\), \({\varvec{d}} = \left( {d_{1} , \ldots d_{G} } \right){^{\prime}}\), \({\varvec{Z}} = \left( {{\varvec{Z}}_{1} , \ldots {\varvec{Z}}_{T} } \right){^{\prime}}\) \({\varvec{Z}}_{j} = \left( {1, - 2\log \lambda_{j} } \right){^{\prime}}\), and \({\varvec{X}}_{g} = \left( {X_{g,1} , \ldots X_{g,T} } \right){^{\prime}}\). The number of fundamental ordinates T has the same properties as in the GPH and MGPH procedures.

4 Empirical results

The starting point of our analysis is conducting the conventional unit root tests (ADF, PP, and KPSS), the results of which appear in Table 1. Most countries exhibit concordant results with all UR tests yielding the same conclusion, while some countries exhibit mixed evidence. Some p values are very close to the 5% significance level, casting doubt on whether the true differencing parameter is equal to some other value different from zero or one.

Table 1 Unit root test results

The results of the previously discussed three fractional unit root tests on the ESI series for individual EU member states (plus the UK) are presented in Table 2. All parameters are estimated in Stata software. According to the documentation for the Stata procedures gphudak (Baum and Wiggins 2006), modlpr (Baum and Wiggins 2000a) and roblpr (Baum and Wiggins 2000b), we estimate the differencing parameter for both the first-differenced and the original series and compare the results for robustness. If the differencing parameter for \({y}_{t}\) in gphudak is larger than 0.5, we report the estimated parameter for the differenced series adjusted for differencing (\(\widehat{d}+1\)); otherwise, we report the parameter for the nontransformed series \(\widehat{d}\). In this way, all the estimated parameters are comparable, and we can compute an average or any other statistical indicator.

Table 2 Fractional unit root test results

The purpose of all utilized fractional unit root approaches is to estimate the integration order (d) for every EU member state. To ease interpretation, the total set of results is further summarized by grouping the analysed countries into several clusters, as presented in Table 3. For the first group of countries, the ESI’s integration order turned out to be equal to zero according to all three estimation approaches, implying that the ESI is a stationary short-memory process. In other words, in the Netherlands, the ESI evolves with no particular long-term trend; it exhibits solely short-term fluctuations. The second cluster, with a slightly higher persistence degree (predominantly \(0<d<0.5\)), comprises just one country as well. Only in Denmark does the ESI exhibit the tendency of (slowly) returning to the former trend so that any shock to the ESI has a merely transitory effect. On the other hand, in the third cluster, the ESI shows predominantly persistent properties (predominantly \(0.5\le d<1\)), whereas in the fourth country group, the ESI has a unit root (\(d=1\)) with respect to all three conducted approaches.

Table 3 Country clusters according to statistical properties of ESI

In that case, a shock to the ESI has a permanent effect and persists in every future period. The last group of countries shows mixed evidence of the ESI having mean-reverting properties according to one estimation method but also characteristics of a fully persistent process according to other estimates.

Although one might initially expect the ESI’s integration order to take very different and opposing values depending on the country, the results reveal that there are very few cases of countries with d being zero or smaller than 0.5. This means that the ESI does not evolve as a stationary process in the vast majority of the observing countries but rather takes on different forms of nonstationarity. With the ESI exhibiting properties of either full or prevailing persistence in more than half of the EU member states, we can conclude that sentiment generally takes on characteristics of a true long-memory process in Europe. In a way, this finding can be related to the ‘new modesty hypothesis’. The notion that agents’ assessments of ‘normal’ economic growth secularly diminish over time in European economies. These assessments are not governed by transitory shocks such as specific turmoil episodes or rising uncertainty but are highly persistent and predominantly of a long-term character (Sorić et al. 2020a, b). The high persistence of the ESI found in this paper can also be indirectly linked to the results of Gelper and Croux (2009), who reveal that the ESI’s information content can considerably add to macroeconomic predictions at longer forecast horizons. In a sense, both our results and those of Gelper and Croux (2009) move the ESI away from the commonly accepted perception that soft (survey) indicators are merely short-run economic barometers. The established long-memory process of the ESI presents evidence that there are possible long-term effects as well.

In socioeconomic terms, either high or full persistence implies excessive sensitivity of European agents to economic, political and other events affecting their financial situation, employment status or overall well-being. Since the 1980s, Europe has gone through a range of occurrences that must have had some psychological effects. For example, there is evidence that after the early 1980s recessions, unemployment in OECD Europe remained at tenaciously high levels (unlike in the US), so the long-term repercussions have been much more persistent in Europe (Snowdon and Vane 2005). Afterwards, Europe experienced a series of political and administrative disruptions, such as the introduction of the common currency in 1999, waves of EU and EA expansions and terrorist attacks. Finally, a relatively long period of only mild recessions was followed by the most severe economic collapse after the Great Depression, creating the atmosphere of ‘the Great Panic’ (Bean 2010).

When we look more thoroughly at how countries are clustered, it is easy to find the common political, geographical and economic denominator for countries within groups characterized by high or full persistence. These two clusters contain almost all new EU member states (10 out of 13), with every new member state (NMS) showing at least some evidence of ESI persistence. A similar conclusion can also be drawn from Table 2, which presents that the ESI’s median integration order is higher for the NMSs than the old member states (OMSs). Such results are not surprising given the turbulent recent history of the Slavic and Baltic countries, where severe political dissolutions (of the Soviet Union, Czechoslovakia, Yugoslavia) were followed relatively quickly by membership in European integrations. At the same time, these peripheral and less developed countries proved to be fairly vulnerable to the world economic crisis.

At this point, it would be worthwhile to dig deeper and look for an explanation of agents’ built-up sensitivity in the field of psychology and social learning theory. Proposed by the influential cognitive and social psychologist Bandura (1977), social learning theory suggests that some new behaviour patterns could be gained simply by observing and imitating other people in the environment. In our context, agents’ behavioural response to an unusual event might be intensified or prolonged as a result of their perceiving others’ emotional reactions and acquiring others’ behaviour. Economic agents might, for example, feel excess pessimism and panic during a stressful event, being exposed to others’ overreactions or even sensationalist media news. The key point of this theory is that individuals do not just passively absorb information, but cognition, environment and behaviour all mutually interact, which is described by the concept known as reciprocal determinism (Bandura 1977; Grusec 1992). Lux (2009) indicates that survey-based data (such as the ESI) are in fact most likely the result of a social process of opinion formation among agents.

Moreover, Garner (1981) emphasizes the principle that new behavioural patterns require mutual reinforcement and exchange of information among peers, so a uniform mass reaction happens over a much longer time horizon. This is clearly a feature of persistent processes. Therefore, if social learning makes people more perceptive of different events and people’s response is a prolonged and idle process, then the long and profoundly integrated collective pessimism and alertness of European agents comes as no surprise.

Building on the results thus far, our next intention is to inspect whether the established high persistence of the ESI is primarily driven by economic fundamentals. We do this by assessing whether the level of ESI persistence has changed considerably in the wake of the Great Recession.

The period prior to 2008 is known in the literature as the Great Moderation era. The term was initially proposed by Stock and Watson (2002), and it denotes three decades of low inflation and interest rates, overall steady macroeconomic aggregates and low-volatility business cycles with mild occasional recessions (Blanchard and Simon 2001; González Cabanillas and Ruscher 2008). In the same period, the Business and Consumer Surveys matured and started to be widely used for short-term forecasting—assuming that the low volatility of economic fundamentals would last. However, although stable, the fundamentals apparently were not sound, so the global economy collapsed soon after Lehman Brothers’ demise.

The Great Recession marked the end of the Great Moderation era (Bean 2010) and induced immense macroeconomic volatility. It started with a financial breakdown that soon grew into an economic crisis and finally manifested as a sovereign debt crisis. A series of unfortunate economic developments after 2008 were claimed to have become ‘deeply embedded in heads of economic agents’ (Adamowicz and Walczyk 2013), and one might suspect that this sequence of events acted as a strong stimulus for a further intensification of the long-memory properties of economic sentiment.

Since the crisis had spilled over from the US to most European countries by 2008, this year serves as an exogenously determined breakpoint. Specifically, September 2008 is taken as a breakpoint due to the bankruptcy of Lehman Brothers, which triggered a wide series of negative economic shocks around the globe. Again, the same three fractional unit root tests are employed, and the results are presented in Tables 4 and 5.

Table 4 Integration order in OMS in periods before and after the Great Recession
Table 5 Integration order in NMS in periods before and after the Great Recession

The estimated values of d do not change dramatically, but there is evidence of growth in d after the 2008 economic breakdown. Specifically, out of the 27 observed countries (Croatian data are partly unavailable), there are 15 cases with growing d according to the GPH, 16 cases for the MGPH estimates, and 15 cases according to the ROB approach. Furthermore, in 11 analysed countries (6 OMSs and 5 NMSs), the integration order increases after 2008 according to all the employed tests. Further indicating a growing integration order, the median d rises for both OMSs and NMSs given the results of all three approaches. In this sense, the ESI displays very similar properties in the solid majority of EU countries.

After recognizing that the global financial crisis may have been a game-changing event for the generating process of economic sentiment, we opt for the nonparametric Wilcoxon signed-rank test to test if the differences in the average ESI persistence before and after 2008 are statistically significant. This test is chosen because it is a nonparametric version of the dependent samples t test, allowing for data dependence in the time domain (which is obviously a feature of any macroeconomic time series).

A negative entry in the second column of Table 6 means that the median of the obtained integration order for a particular method is higher after the global financial crisis than during the period of Great Moderation. As the presented p values suggest, these differences are not statistically significant at conventional levels.

Table 6 Wilcoxon test results

Although our descriptive calculations in Tables 4 and 5 favour such a premise, the Wilcoxon test results reveal that the increment in the ESI’s persistence after 2008 is not statistically significant.

This finding has to be interpreted in the proper context. Namely, our calculations in Tables 2, 3, 4, 5 reveal that the ESI exhibits almost remarkable persistence, accentuating the role of economic sentiment in governing business cycles and prolonging sentiment shocks. The results presented in Table 6 could also be related to the generally found decline in the persistence of macroeconomic fundamentals (Kumar and Okimoto 2007) in recent decades. Since our estimates diverge from such findings, we are keen to interpret them as a sign that economic sentiment indeed constitutes an autonomous driving force of the macroeconomy, quite independent of economic fundamentals (Starr 2010).

Finally, Tables 4 and 5 reveal that for literally all specifications, the NMSs exhibit slightly higher persistence than the OMSs. However, this descriptive finding is not corroborated via the Mann‒Whitney nonparametric test for independent samples. A negative entry in the second column of Table 7 means that the median of the obtained integration order for a particular method is higher for NMSs than for OMSs. A glance at Table 7 reveals that the differences in the median integration orders are mostly not significant at conventional levels.

Table 7 Mann–Whitney test results

5 Conclusion

Although the ESI’s short-term predictive accuracy is widely established in the literature, not much has been said so far about its persistence. This paper offers an initial contribution in this respect by assessing a battery of different fractional unit root tests on the ESI series for individual EU member states. Depending on the method at hand and the examined country, our results strongly point to characteristics of a long-memory process, finding that ESI shocks mostly either fade away very slowly or even manifest permanently.

Furthermore, we recognize the 2008 global crisis and the underlying end of the Great Moderation era as a potential turning point in the generating process of economic sentiment. The subsequent cascade of extreme economic events (global financial crisis, the European sovereign debt crisis, Brexit, and the recent COVID-19 pandemic) has triggered exceptionally high macroeconomic volatility, but the effect of these events on ESI persistence is only marginal. Regardless of the chosen subperiod, sentiment shocks seem to be deeply imprinted in agents’ collective memory and their assessments of current and future macroeconomic trajectories.

The finding that sentiment is a persistent process might certainly have policy implications. It is very important to know how long people’s reaction to different occurrences in the environment lingers, especially when some of those events are a direct repercussion of government decisions (e.g., adoption of the euro, degree of policy cyclicity, ways of combatting COVID-19). Namely, if agents’ response to a policy change is short-lived, then the positive effects of successful government actions fade rather quickly. On the other hand, if a policy error is made, there is ‘only’ a chance of creating a V-shaped crisis in sentiment, which is likely to be reflected in a transient downward phase of the business cycle. In this case, the government could pursue an experimental policy-making approach, being less anxious about potentially causing a permanent deterioration in general welfare.

However, if economic sentiment persists, there is no room for tentative policy try-outs. Improperly chosen instruments and negligently implemented measures would likely result in long-lasting pessimism, while the final outcome might be a recession with a U- or even L-shaped recovery. As the ESI proved to be highly persistent, we believe that the main implication for European policy-makers is that they have a growing responsibility to identify adequate policy instruments and to adopt general policy of a character that the general public will embrace.

Several decades after its introduction, the ESI remains an intriguing topic for both academics and practitioners. Future work is certainly needed to fully grasp the functioning mechanisms and the generating process of economic sentiment. A potentially useful step in this sense would be to provide a sectoral analysis and assess the persistence of each of the ESI’s components (the five sectoral confidence indicators) individually. In this way, we might be able to scrutinize whether the longevity of sentiment shocks is heterogeneous across economic sectors or whether all types of sentiment shocks manifest in a similar manner.