1 Introduction

Many decisions involving risk are not made by individuals themselves, but are delegated to others (e.g., business managers, financial or legal advisors). Then the question arises to what extent the decision maker considers hisFootnote 1 client’s risk preferences. Both his responsibility and whether and how he is held accountable for his decisions and their outcomes by his client will affect his actions (Batteux et al., 2019; Bolton et al., 2015; Pahlke et al., 2015). This study focuses on different types and degrees of accountability (Lerner & Tetlock, 1999; Tetlock, 1985) to reach a better understanding of interpersonal risk-taking. Accountability types refer to what a decision maker is held accountable for (Patil et al., 2014): His actions (process accountability) or his actions’ outcomes (outcome accountability). With degrees of accountability, we refer to the consequences for the decision maker, which may range from mere criticism to serious material or immaterial consequences (Lerner & Tetlock, 1999). We consider clients’ evaluations, a way of holding decision makers accountable that is present in many situations and that is subjective in nature. Even though an evaluation made by a client has the potential of giving a decision maker an incentive to act in her best interest, it is prone to outcome bias, i.e., to the client holding the decision maker accountable for events he is not responsible for (Baron & Hershey, 1988; Brownback & Kuhn, 2019; Gurdal et al., 2013). In order to investigate both the determinants of outcome bias in evaluations of delegated risk choices and the effects that these evaluations have on the respective choices, we conducted three laboratory experiments.

The experiments model a setting in which a client authorizes a decision maker to choose between risky alternatives on her behalf. Before choosing, the decision maker receives information about which alternative his client would prefer. After observing the outcome, the client subjectively evaluates the decision maker. While having this basic setting in common, the three experiments test for the effects of different types and degrees of accountability in that they manipulate the information available to clients when evaluating decision makers as well as the consequences which evaluations have for decision makers. In Experiment 1, we benchmark outcome bias in evaluations and how evaluations affect interpersonal risk-taking. The setting is such that clients are informed about both the decision maker’s risk choice and the outcome of the decision before they make an evaluation. Evaluations are mere feedback: They do not have any material consequences for decision makers. In Experiment 2 as well as in Experiment 3, we alter the degree of accountability by introducing monetary consequences of evaluations for decision makers. In Experiment 3, we additionally change the information available to the client in that the client does not observe both the decision and the outcome, but the outcome only.

A common characteristic of delegated risky decision making in the real world is an information environment where individuals can observe outcomes of other individuals in their domain, which allows them to make peer comparisons. Peer comparisons may affect delegated risk-taking directly, but also indirectly through their effect on clients’ evaluations, if these become more biased towards outcomes. To test for such an indirect effect of peer comparisons, we contrast, in all three experiments, a baseline situation where clients can observe their own outcomes only with a situation where they additionally receive relative outcome information.

This study integrates and complements both research on interpersonal risk-taking (e.g., Pahlke et al., 2012, 2015; Batteux et al., 2019, 2020; Polman & Wu, 2020) and on outcome bias in subjective evaluations of risk-taking decisions (e.g., Baron & Hershey, 1988; Gurdal et al., 2013; König-Kersting et al., 2021). With respect to clients’ evaluations, results of Experiments 1 and 2 show that while evaluations are sensitive to both decisions and outcomes in a situation where they represent mere feedback, they are more strongly biased towards outcomes and reflect almost no decision accountability when they have monetary consequences for decisions makers. Results of Experiment 2 further indicate that, when evaluations have monetary consequences for decision makers, outcome bias increases when an evaluator receives information about others’ outcomes that allow her to draw peer comparisons. With respect to delegated risk-taking, results from Experiment 1 show that evaluations represent a form of accountability that has the potential to increase the likelihood of delegated risk choices being aligned with clients’ risk preferences. However, results of Experiment 2 provide evidence for dysfunctional consequences of outcome bias in evaluations for interpersonal risk-taking: When clients’ evaluations become increasingly biased towards outcomes, decision makers more likely make choices that are not in their clients’ interest. Finally, results of Experiment 3 show that clients do not hold decision makers accountable for their decisions, but focus on outcomes only, when decisions are unobservable, even though they can make a probabilistic inference from the observed outcome to the decision.

Our study models a variety of real-world relationships. Examples are delegated investment management, where professional fund managers seek for positive evaluations to attract more funds, or financial risk management, where executives expect to be held accountable by their shareholders for their risk-management decisions. Other examples refer to relationships between financial, legal, or medical advisors and their clients. Furthermore, our experimental manipulations of the information environment model important characteristics of practical situations. While in some situations of delegated risk-taking a client can observe the risk choices made on her behalf, in many situations the actions taken remain hidden to her. Also, outcomes of risk-taking decisions can be frequently compared to the outcomes of comparable decisions made by others: Investment portfolios’ returns can be compared against the market, and professionals receive awards or star ratings based on such comparisons. Corporate managers are not only held accountable for absolute performance, but are also evaluated against industry averages.

By integrating the two research streams on interpersonal risk-taking and outcome bias, our study makes contributions to theory and has implications for practice. We complement preceding studies of outcome bias in that we systematically manipulate both the information environment and the consequences which evaluations have for the decision maker. Furthermore, we provide evidence about how decision makers react when they are confronted with outcome bias in the evaluations they receive. Such evidence is crucial for deepening our understanding of the role of subjective evaluations in agency relationships. We conclude from our finding that increases in outcome bias are associated with increases in misaligned risk choices that a client’s own behavior may contribute to a conflict of interest between her and the decision maker. Our study has practical implications in that it informs practitioners about determinants of outcome bias that relate to their information environment and in that it sheds light on potential dysfunctional effects of subjective evaluations. Even though such effects have been investigated in other contexts (e.g., Bol, 2011; Prendergast & Topel, 1993), our study is among the first to address delegated risk-taking. Our findings suggest that evaluators, when they hold decision makers accountable, should keep in mind the causal relationship between risk-taking decisions and outcomes. Otherwise, they may give misleading feedback that contributes to decision makers taking unwanted risks, or being overly conservative. In the following we first review both streams of literature to which we contribute, then we point out research gaps and position our study.

1.1 Risk-taking for others and accountability

When individuals make risky decisions on behalf of others instead of their own, depending on the perspectives they take (Tunney & Ziegler, 2015), they may act according to their own risk preference, or may reveal a shift in behavior towards taking more or less risk (Batteux et al., 2019, 2020; Füllbrunn et al., 2022; Polman & Wu, 2020).Footnote 2 Evidence is inconclusive: Studies have observed a significant shift towards more risky choices (e.g., Pollmann et al., 2014; Rigoli et al., 2018; Sun et al., 2017), but also shifts to being more cautious with other people’s money (e.g., Chakravarty et al., 2011; Eriksen et al., 2020). Prior research has also investigated determinants of risk-taking for others such as the decision frame (Batteux et al., 2019), responsibility (e.g., Bolton et al., 2015; Füllbrunn & Luhan, 2020), or social distance (e.g., Batteux et al., 2017; Montinari & Rancan, 2018; Zhang et al., 2017).

As the objective of our study is to make a connection between interpersonal risk-taking decisions and the evaluations of such decisions, our study focuses on how accountability affects risk-taking.Footnote 3 Accountability refers to the implicit or explicit expectation of a decision maker to be held accountable by others for his actions (Lerner & Tetlock, 1994, 1999). It can only arise if individuals are able to link an observed outcome to these actions, or to observe the actions themselves. Then, it implies that the decision maker receives feedback from others, is called to justify his actions and/or their outcomes, or faces consequences such as rewards or punishments. Studies addressing the effect of accountability on interpersonal risk-taking find evidence for shifts in risk-taking behavior with increased accountability (Lefebvre & Vieider, 2014; Pahlke et al., 2012; Weigold & Schlenker, 1991). When accountability implies monetary consequences, the decision maker tends to focus on his reward prospects when making his decisions. In case of a discretionary reward, this implies trying to anticipate the judgements underlying the rewarding decision. Even though discretionary rewards have been investigated in the context of interpersonal risk-taking (de Oliveira et al., 2017; Gurdal et al., 2013; König-Kersting et al., 2021; Pollmann et al., 2014), few studies have addressed the effects of such rewarding decisions on decision makers’ behaviors.Footnote 4

1.2 Outcome bias

Investigations of outcome bias (building on Fischhoff, 1975) cover a wide range of fields including not only financial, but also ethical, judicial or medical decisions (e.g., Arkes et al., 1981; Kamin & Rachlinski, 1995; Gino et al., 2010). Baron and Hershey (Baron & Hershey, 1988, 1992) were the first to conceptualize the effects of outcomes on evaluations. According to their definition of outcome bias, an evaluation is biased towards the outcome whenever the client has all relevant information to evaluate the decision, observes the decision itself, and still uses the outcome for the evaluation. In contrast, when the decision is unobservable, the client rationally uses the outcome to make an inference about the information not available to her. Now, the evaluation reflects outcome bias when it underweights or even contradicts the inference. Studies analyzing determinants of outcome bias have found that outcome bias is sensitive to social comparisons (Sezer et al., 2016), to comparisons between actual outcomes and potential outcomes of foregone alternatives (Baron & Hershey, 1988; van Dijk & Zeelenberg, 2005; Seta et al., 2015), to prior expectations (Schkade & Kilbourne, 1991), or to perceptions of the controllability of outcomes (Ghosh, 2005).

Our study is related to investigations of outcome bias in the evaluations and/or discretionary rewards clients give to decision makers who make risky choices on their behalf. In their seminal study on outcome bias, Baron and Hershey (1988) find that clients (hypothetically) blame decision makers both for choosing a risky alternative after a bad outcome and for choosing a safe alternative after observing a better outcome of a risky option. In a delegated portfolio management setting, both Asparouhova et al. (2015) and Anufriev et al. (2019) find that fund managers attract more funds from investors when they have been successful in the recent past, an effect that is consistent with outcome bias in investors’ evaluations of fund managers. De Oliveira et al. (2017) and König-Kersting et al. (2021) investigate discretionary rewarding decisions. Rewards that increase in outcomes are not only consistent with outcome bias, but also with distributive fairness preferences, as clients may simply share their wealth with their decision makers. The results of both de Oliveira et al. (2017) and König-Kersting et al. (2021) are supportive for outcome biases to actually affect clients’ decisions beyond distributive fairness, though. In Gurdal et al. (2013), clients both make evaluations and grant discretionary rewards; results show that both are biased towards outcomes.

1.3 The present research

This study asks two research questions: What determines outcome bias in subjective evaluations? And: What are the effects of outcome biases in evaluations on interpersonal risk-taking? With respect to the first research question, our study narrows a research gap in that we conduct a series of experiments in which we vary the information environment of the relationship between client and decision maker as well as the consequences evaluations have for the decision maker. With respect to the second research question, our analysis breaks new ground. A key construct behind our research questions is accountability, because both require an analysis of the relative emphasis that clients place on the decision process versus the outcome in their evaluations (Lerner & Tetlock, 1999; Patil et al., 2014). Process versus outcome accountability refers to outcome bias because the less a client focuses on the decision and the more she focuses on the outcome, the stronger is her potential outcome bias. But types accountability are also crucial for connecting evaluations with risk-taking decisions, as a decision maker will try to anticipate what he will be held accountable for (König-Kersting et al., 2021). In the following, we discuss four aspects of our study’s setting which are of particular importance for investigating our research questions. In doing so, we position our study in the literature.

Knowledge about clients’ preferences

Few studies on interpersonal risk-taking have investigated situations in which a decision maker has information about his client’s risk preference (Bolton et al., 2015; Kling et al., 2022; König-Kersting et al., 2021). If the decision maker has no such information, there is only a minimum degree of accountability, as a client cannot legitimately hold the decision maker accountable for his decision when the decision maker had no information about the client’s preferences in the first place. We thus consider a situation where the decision maker is informed about his client’s risk preference before he makes a risk choice on her behalf.

Observability of decisions

In analyzing how decision makers react to financial incentives or competitive pressure (e.g., Andersson et al., 2020; Kirchler et al., 2018; Sheedy et al., 2019), previous studies have considered mechanisms that are exogenous to the client-decision maker relationship. In our study accountability is endogenous, as it comes from the client’s subjective evaluation. When the client observes both the outcome and the decision, it is easy for her to hold the decision maker accountable. In contrast, when the decision remains hidden to the client, decision accountability can only arise from a probabilistic inference about the decision. The latter situation has high external validity in that delegated risk-taking in reality is usually subject to some information asymmetry between decision maker and client. Contrasting the two situations thus can provide important insights.

Information about other outcomes

When decision makers can observe others’ outcomes, their behavior is affected in a way that is consistent with an activation of competitive preferences (e.g., Dijk et al., 2014; Kirchler et al., 2018; Lindskog et al., 2022; Wang, 2017). We are interested in the effect of such information on the evaluations clients give to decision makers and thus contrast a situation in which clients receive information about other clients’ outcomes with a situation where there is no such relative outcome information. We have three reasons for doing so. First, observing others’ outcomes is typical for real-life situations of delegated risk-taking. Second, we expect the manipulation to provide insights into our first research question addressing the determinants of outcome bias because we expect relative outcome information being present to increase outcome bias in evaluations. Third, a manipulation that increases outcome bias in evaluations helps to provide insights into our second research question about how outcome bias in evaluations affect interpersonal risk-taking.

Monetary consequences of evaluations

With few exceptions, prior studies have modeled the relationship between client and decision maker such that the client rewards, but does not evaluate, the decision maker.Footnote 5 Our series of experiments allow us to investigate outcome bias in evaluations and how (potentially biased) evaluations affect delegated risk-taking decisions with monetary consequences of evaluations being both present and absent.

2 Method

2.1 Basic experimental design and procedures

All experiments share the same basic design. There was a pre-stage and a main stage. The function of the pre-stage was to elicit risk preferences such that in the main stage each decision maker could be informed about his client’s preference as revealed from her decision in the pre-stage. In the pre-stage, each participant chose between two alternatives with equal expected outcomes but different levels of risk. Potential outcomes were 10, 20, or 30 experimental currency units (ecu, 4 ecu = 1 EUR) for both alternatives, but probabilities differed: For the low-risk alternative, the probabilities were 5%, 90%, and 5% for the low, medium, and high outcome, respectively, whereas the respective probabilities were 30%, 40%, and 30% for the high-risk alternative. Participants were told that their choices would play a role in the second part of the study, without giving any specific information. At the beginning of the main stage, participants were assigned the role of either client or decision maker. Assignment of roles was random but subject to the group of clients having an equal number of participants preferring the high risk and low risk alternative in the pre-stage, respectively. Dyads were matched for a single round in a stranger design such that each decision maker faced each client exactly once. In each round, the decision maker chose between two alternatives with equal expected values but different risk levels. The alternatives were similar to those of part one, but outcomes ranged from 0 to 40 ecu; Fig. 1 shows probability distributions.

Fig. 1
figure 1

Outcome distributions of the low risk and high risk alternatives. The Figure shows outcome probabilities for the two alternatives available to decision makers in the main stage of the experiments

Before making his decision, the decision maker was informed whether his client had preferred low risk or high risk in the pre-stage. A client could not revise this implicit goal communicated to the decision maker. After the risk choice, the outcome was generated by the computer system and presented to the client, potentially with additional information depending on the specific design of the experiment. Then, the client evaluated the decision maker on a scale from 1 to 7; no specifications of evaluations were given except information about 1 being the worst and 7 the best evaluation.Footnote 6 The round ended with the evaluation being communicated to the decision maker, and with client and decision maker receiving a summary report of the round. Over all experiments and treatments, a client received a payment that was equal to the outcome of her own risk choice in the pre-stage plus the outcome from a randomly selected round of the main stage. A decision maker also received the outcome of his risk choice in the pre-stage; the second part of his compensation varied across experiments.

The presentation of experiments in this paper is not equal to the sequence in which they were conducted: We started with Experiments 2 and 3, Experiment 1 was conducted last. Participants were recruited from the authors’ universities’ student subject pools. Experiments 2 and 3 were run in a university computer lab, whereas Experiment 1 was run online.Footnote 7 All sessions were run with 12 clients and 12 decision makers interacting over 12 rounds in a stranger design. Each experiment had three sessions. Sample sizes were derived from prior research investigating similar settings (Bolton et al., 2015; de Oliveira et al., 2017; König-Kersting et al., 2021).

2.2 Experimental manipulations

The settings modeled in the three experiments address different types and degrees of accountability. First, we manipulate the information environment to investigate whether and how clients’ evaluations reflect decision versus outcome accountability. We contrast a situation in which a client can observe both the outcome of the risky decision and the decision itself (Experiments 1 and 2) with a situation where the decision is unobservable (Experiment 3). When the decision and its outcome are observable, a client can hold the decision maker directly accountable for both. Then, any reliance on the outcome in the evaluation indicates outcome bias (Baron & Hershey, 1988). If instead the decision is unobservable to the client, she can hold the decision maker accountable for the decision only indirectly based on her inference from the observed outcome to the hidden decision. Then, feedback becomes ambiguous because a decision maker cannot clearly identify whether the decision or the outcome drives the evaluation.

Second, we vary the degree of accountability in that we contrast a situation in which evaluations have no material consequences (Experiment 1) with a situation where evaluations determine decision makers’ financial compensations (Experiment 2 and 3). With such monetary consequences, we expect evaluations to have stronger effects on delegated risk-taking than without. As part of Experiment 1, we also test a Baseline condition where clients do not evaluate decision makers. Decision makers’ compensations vary across experiments: They receive a fixed payment for the second stage of Experiment 1, whereas their pay for the second stage of both Experiments 2 and 3 depends on the evaluations they receive from their clients. More specifically, the rank which a decision maker’s average evaluation has among the average evaluations of all decision makers in his session is determined, and pay is rank-dependent with 80 ecu for rank one, 60 for rank two, 40 for rank three, 20 for rank four, 10 for ranks five through eight, and zero for all other ranks.

In all three experiments, two treatment conditions are tested between subjects which differ with respect to the information clients receive about outcomes: With relative outcome information being present (ROI present), each client learns her own outcome and is additionally provided with a ranking list of the outcomes of all clients in her peer group, including her own; she does not receive information about other decision makers’ decisions, though. With relative outcome information being absent (ROI absent), no such additional information is presented. Figure 2 illustrates the experimental design.

Fig. 2
figure 2

Illustration of the experimental Design

3 Results

3.1 Experiment 1: Benchmarking outcome bias and interpersonal risk-taking

Experiment 1 investigates subjective evaluations and their potential effects on risk-taking decisions in a setting where clients can observe both the outcome and the decision and where evaluations have no monetary consequences for the decision maker, whose compensation for the main stage of the experiment is fixed. The experiment not only tests two conditions with relative outcome information (ROI) being either absent or present, but also a Baseline condition were clients do not evaluate decision makers. The experiment provides a benchmark: Clients observe both outcomes and risk-taking decisions, and thus any reliance on the outcome in the evaluation indicates outcome bias (Baron & Hershey, 1988). Furthermore, the degree of accountability is low in that evaluations have no material consequences for decision makers so that they cannot imply any financial incentives for risky or cautious shifts in decisions; if they still affect delegated risk-taking, it is the feedback conveyed by the evaluation itself that affects behavior.

Data

In total, 214 subjects participated in the experiment, 72 participants each in the ROI absent and the ROI present conditions, and 70 participants in the Baseline condition.Footnote 8

Evaluations

Panel A of Table 1 presents data on evaluations after an aligned (ALIGNED = 1) versus misaligned decision, after observing an outcome below (OUTCOME < 20) versus above the mean, and overall. EVAL is the average evaluation a client makes over all rounds. The data show that evaluation levels exceed the medium category (4), indicating leniency (Bol, 2011; Prendergast & Topel, 1993). Both decisions and their outcomes drive evaluations: Clients not only give significantly higher evaluations after observing an aligned as opposed to a misaligned decision (ROI absent: 5.17 vs. 4.02; ROI present: 5.25 vs. 4.16; p < 0.01 in either case),Footnote 9 but evaluations are also clearly higher after above than below average outcomes (ROI absent: 5.63 vs. 3.91; ROI present: 5.85 vs. 4.14; p < 0.001 in either case). The latter result documents outcome bias in evaluations: Even though clients can observe decisions, they hold decision makers accountable for the outcomes of their decisions. Finally, the data show that the treatment manipulation − relative outcome information being absent versus present − has no effect on evaluations, as neither evaluation levels (4.82 vs. 4.97) nor distances (between evaluations after aligned vs. misaligned decisions: 5.17 − 4.02 = 1.15 vs. 5.25 − 4.16 = 1.09; between evaluations after high vs. low outcomes: 5.63 − 3.91 = 1.72 vs. 5.85 − 4.14 = 1.71) differ (substantially) across conditions. We conclude that in Experiment 1, clients hold decision makers accountable for both their decisions and these decisions’ outcomes, and that providing clients with relative outcome information has no effect on evaluations.

Table 1 Experiment 1: Evaluations and risk-taking decisions

Risk-taking decisions

Panel B of Table 1 shows frequency data on risk choices for the two conditions in which clients’ make evaluations, and for the Baseline condition where there are no evaluations. As we expect both the client’s and the decision maker’s risk preferences to influence the decision, we disaggregate the data by preference matches versus mismatches; a match (MATCH = 1) occurs when client and decision maker preferred the same alternative in the pre-stage of the experiment; otherwise, there is a mismatch. Aligned choices are consistently more frequent when risk preferences match, with the differences in frequencies being significant (p < 0.01 in all cases) except for the ROI present condition; here the frequency is equally low for both cases (MATCH = 1 and MATCH = 0). In comparing the data between the Baseline and the ROI absent condition, we can give a first answer to the question whether subjective evaluations represent an effective means to hold decision makers accountable for their risk-taking. The answer is yes as the data show that the frequency of an aligned decision (ALIGNED = 1) is significantly higher in the ROI absent than in the Baseline condition (75.7% vs. 66.7%, z70 = 3.30, p < 0.001). However, once ROI is present, the frequency of an aligned decision drops to the level of the Baseline condition, and the difference in frequencies between ROI being absent versus present is significant (75.7% vs. 66.0%, z70 = 3.21, p = 0.001). That is, even though peer comparisons do not affect clients’ evaluations, they affect decision makers’ risk choices.

Additional analyses (untabulated) show that the effect represents a risky shift in behavior, as decision makers considerably more frequently switch to the high risk option when ROI is present relative to absent. This observation is consistent with prior findings about peer comparisons affecting risk-taking (e.g., Dijk et al., 2014; Kirchler et al., 2018), but it occurs in a situation where decision makers do not act on behalf of themselves, but on behalf of clients.

3.2 Experiment 2: Effect of outcome bias in evaluations on delegated risk-taking

Experiment 2 increases the degree of accountability compared to Experiment 1 in that we introduce monetary consequences of evaluations. Again, two conditions are tested contrasting an information environment in which clients can make peer comparisons (ROI present) or not (ROI absent).

Data

In total, 144 subjects participated in the experiment, 72 (36 clients and 36 decision makers) in each condition (ROI absent and ROI present).

Evaluations

Table 2 shows descriptive data on evaluations, which convey that (now with monetary consequences) clients no longer hold decision makers accountable for their risk choices, but only for the outcomes of their choices instead: Evaluations are only slightly better after an aligned relative to a misaligned decision (ROI absent: 4.72 vs. 4.34; ROI present: 4.42 vs. 4.28), but are clearly better after an above average relative to a below average outcome (ROI absent: 5.47 vs. 3.69; ROI present: 5.69 vs. 3.05; p < 0.001 in either case). This gives clear support to outcome bias driving evaluations. Testing for the effect of ROI present relative to absent, we see that evaluations become more extreme, as the distance between evaluations after high vs. low outcomes is significantly larger (ROI absent: 5.47 − 3.69 = 1.78 vs. ROI present: 5.69 − 3.05 = 2.64; z70 =  − 2.90, p = 0.003). This indicates that providing clients with relative outcome information increases outcome bias.Footnote 10

Table 2 Experiment 2: Evaluations

Risk-taking decisions

Table 3 shows frequency data on risk choices in Panel A, and results of fixed-effects probit regressions in Panel B. The data in Panel A show that, as in Experiment 1, aligned choices are consistently more frequent when risk preferences match. While the difference is relatively small and not significant when relative outcome information is absent (ROI absent: 78.6% vs. 70.2%), the frequency of an aligned choice drops sharply when preferences do not match instead of match with ROI present (73.0% vs. 38.6%, z35 = 4.71, p < 0.001). The overall difference in frequencies of aligned decisions with ROI being present versus absent is significant, too (76.1% vs. 56.5%, z70 = 5.37, p < 0.001). In combination with the evaluation data, this result is consistent with outcome bias in evaluations affecting risk-taking decisions: If clients do not hold decision makers accountable for making an aligned risk choice, but increasingly focus on outcomes instead, we can expect decision makers to feel less committed to their clients’ risk preferences, and especially so when there is a preference mismatch.

Table 3 Experiment 2: Risk-taking decisions

To gain further insights, we analyze how evaluations affect decisions through the feedback that the decision maker receives from evaluations. To do so, we align the outcome scale (0 − 40 points) with the evaluation scale (1 − 7) by mapping the outcome into one of seven intervals (0 − 5, 6 − 11, 12 − 17, 18 − 22, 23 − 28, 29 − 34, 35 − 40), and then define a FEEDBACK measure that indicates how strongly the client accounts for the decision in her evaluation. FEEDBACK is equal to the evaluation minus the outcome interval after an aligned decision and equal to the outcome interval minus the evaluation after a misaligned decision. A positive value of FEEDBACK can be interpreted as the client rewarding an aligned and punishing a misaligned choice, respectively, whereas a negative value indicates that an aligned choice is discouraged and a misaligned choice encouraged, respectively.Footnote 11

Panel B of Table 3 shows results of fixed-effects probit regressions estimating the likelihood of an aligned choice in the two treatment conditions, respectively, and a model pooling the data. All three models show a significant effect of FEEDBACK: When decision makers experience that clients actually hold them accountable for their risk choices, their propensity to make an aligned choice significantly increases. A preference match has a positive effect on the likelihood to make an aligned choice when ROI is present, but the effect is not significant when ROI is absent. The pooled model further shows that ROI being present implies a negative shift in the likelihood of an aligned choice, and that FEEDBACK has a significantly stronger effect on behavior with ROI being present relative to absent; that is, holding a decision maker accountable for his risk choice has a stronger effect when ROI is present. The results indicate that outcome bias in evaluations mediates the effect of ROI being present relative to absent on delegated risk-taking. We formally test for the mediation following the three-step procedure suggested in Baron and Kenny (1986). In the first step, we test whether the average FEEDBACK significantly differs between the two treatment conditions and find that this is the case (ROI absent vs. ROI present: 0.560 vs. 0.176, z70 = 3.79, p < 0.001). In the second step, we estimate the pooled probit model of Table 3, Panel B, without including FEEDBACK (results untabulated), while the third step includes FEEDBACK and is thus equivalent to the pooled model displayed in Panel B of Table 3. The regression results show that the mediation is partial, but not total, as the indicator variable for ROI being present keeps having a significantly negative coefficient in the pooled regression model: Risk-taking decisions are less frequently aligned when ROI is present, both because of a direct effect of such information available, and because of the mediation.

3.3 Experiment 3: Exploring whether evaluations reflect Bayesian inferences

Experiment 3 varies the form of accountability: Now the decision is hidden to the client, who can only make an inference from the observed outcome to the unobserved action. As there is a low risk and a high risk alternative, the Bayesian inference from outcome to the decision is such that, when the client prefers high risk, both high and low outcomes are indicative of the decision maker having made an aligned decision, whereas, when the client prefers low risk, the same outcomes indicate a misaligned choice. More precisely, outcomes below 16 and above 24 are less likely for the low risk alternative and more likely for the high risk alternative, whereas outcomes within the two levels are more likely for low than for high risk (see Fig. 1). Applying Bayes’ rule, a client who prefers low risk (high risk) infers from an outcome between 16 and 24 that the decision was more likely aligned than misaligned (more likely misaligned than aligned), whereas an outcome above 24 or below 16 indicates a misaligned (an aligned) choice.

Data

As in Experiments 1 and 2, the assignment of participants’ roles was such that the group of clients should have an equal number of decision makers preferring the high risk and low risk alternative. However, this did not work out in one session of the condition with ROI present, where only five individuals preferred the low-risk alternative in the pre-stage. Data come from 143 student participants, 72 in the ROI absent and 71 in the ROI present condition.Footnote 12

Evaluations

Panel A of Table 4 presents evaluation data by clients’ risk preferences and by outcome ranges. If evaluations reflected Bayesian inferences, clients preferring low risk would respond to outcomes close to the mean with the highest evaluations, whereas clients preferring high risk would do the opposite. In stark contrast, the data reveal that evaluation patterns are almost identical for the two groups: Irrespective of a client’s risk preference, average evaluations are significantly higher for medium than for low as well as for high than for medium outcomes. This gives clear support to outcome accountability, but almost no decision accountability reflected in evaluations. The data further show a stronger outcome bias for ROI being present than absent, as evaluations are a little lower for low and considerably higher for high outcomes; a corresponding test of differences in evaluations after high vs. medium outcomes is significant (data pooled for all clients, ROI absent: 5.72 − 4.58 = 1.14 vs. ROI present: 6.41 − 4.39 = 2.02, z63 =  − 3.43, p < 0.001), whereas a test of differences in evaluations after medium vs. low outcomes is not (ROI absent: 4.58 − 2.47 = 2.11 vs. ROI present: 4.39 − 2.23 = 2.16).

Table 4 Experiment 3: Evaluations and risk-taking decisions

Risk-taking decisions

Given that we find hardly any decision accountability in clients’ evaluations, we have no reason to expect evaluations to contribute to risk-taking decisions being more aligned with clients’ risk preferences. Panel B of Table 4 presents frequency data on risk choices. The pattern differs from what we saw in Experiment 1, as the frequency of an aligned decision (ALIGNED = 1) slightly increases from ROI absent (62.5%) to ROI present (66.9%). The difference is insignificant, though. A closer look at the data (untabulated) shows that clients who preferred low risk in the pre-stage of the experiment receive aligned choices in 70.4% of the cases in the ROI absent condition, but only 54.4% of the cases in the ROI present condition, a pattern that is very similar to what we observed in Experiment 1. In contrast, the data for clients who preferred high risk clearly contradict our previous findings. Overall, the evidence from Experiment 2 is inconclusive with respect to delegated risk-taking; we comment on this finding in the discussion of our results.

4 Discussion

This study complements prior research on the determinants of outcome bias and extends research on interpersonal risk-taking by investigating how delegated risk-taking is affected by subjective evaluations which decision makers receive from their clients. We first discuss the study’s contributions along its research questions, then we address practical implications and limitations.

Determinants of outcome bias

In all three experiments, clients’ evaluations are biased towards outcomes and reflect only limited or even no decision accountability. Results show considerable differences across experiments and conditions, though. The differences we observe support and complement explanations for outcome bias and its determinants given in the literature. Baron and Hershey (1988) argue that a potential driver of outcome bias is overgeneralization: Evaluators believe that outcomes always matter and that higher outcomes are always indicative of better decisions. Overgeneralization leads to attribution errors in that evaluators will erroneously attribute the outcome not to external conditions, but to the decision maker (Heider, 1958; Ross, 1977). Our observation that providing clients with information about other clients’ outcomes increases outcome biases is consistent with this explanation, as prior research has successfully used such a manipulation to increase outcome salience (e.g., Kirchler et al., 2018; Schoenberg & Haruvy, 2012), and, ceteris paribus, the relative weight a client gives to the outcome in her evaluation can be expected to increase when the outcome becomes more salient. Our results on the effects of relative outcome information furthermore complement previous findings showing that outcome bias increases when evaluators are confronted with outcomes of foregone alternatives (e.g., Seta et al., 2015). While relative outcome information in our study is unlikely to activate regret (as clients cannot observe outcomes of foregone alternatives), it still increases outcome salience and represents an additional source of outcome bias instead. Our finding also corresponds to evidence about actual risk-taking behavior being affected by relative outcome information (e.g., Kirchler et al., 2018, 2020). Moreover, our result that outcome bias is more pronounced when evaluations have monetary consequences for decision makers is consistent with the explanation of outcome bias proposed by Gurdal et al. (2013). The authors argue that a client who finds herself in an unfamiliar situation may refer to a similar situation salient to her and will make her rewarding decision as if she were in this similar situation. Likely, in our setting the similar situation is one in which better (worse) outcomes are indicative of better (worse) decisions. Then, the client will be biased towards outcomes, and the effect can be expected to more likely occur when evaluations affect decision makers’ compensations than in a situation where there is no such link. The same logic applies to the effect of relative outcome information: In real world situations of risky decision making, the outcomes of peers are often affected by the same external factors, and peer comparisons help to filter such factors out and thus improve the accuracy of the evaluation of the decision. However, the setting of all experiments is such that other clients’ outcomes are uncorrelated and thus uninformative even if the decision is unobservable. Comparing the results of Experiments 1 and 2 further suggests that there is an interaction effect of relative outcome information and monetary consequences of evaluations, as providing clients with relative outcome information significantly increases outcome bias in Experiment 2, whereas there is no such effect in Experiment 1.Footnote 13

Effects of subjective evaluations on delegated risk-taking

Our work also makes contributions to the literature on interpersonal risk-taking, which has not only stressed the importance of different forms of accountability, but also of the decision versus outcome dimensions of accountability. When a decision maker acts as a perfect surrogate, he accepts his client’s risk preference to guide his decision. If instead he fails to take his client’s perspective, his choice may reflect his own risk preference so that it will be misaligned whenever risk preferences do not match, or he may feel unable to make any reasonable forecast of how his decisions affect evaluations and thus more or less randomizes his choice. Our first finding from Experiment 1 is that evaluations represent a form of accountability that has the potential to increase the likelihood of delegated risk choices being aligned with clients’ risk preferences. Our second and main finding from Experiment 2 however is that outcome bias in evaluations, reflecting a lower degree of decision accountability and a higher degree of outcome accountability, has dysfunctional consequences for interpersonal risk-taking, as increases in outcome bias are associated with risk choices becoming less frequently aligned with clients’ risk preferences. Our interpretation of this finding is that it is a client’s own behavior that contributes to a conflict of interest between her and the decision maker. However, even though the mediation analysis from Experiment 2 demonstrates that the feedback a client gives to a decision maker with her evaluation affects the likelihood of an aligned choice in subsequent rounds, the results from Experiments 1 and 2 indicate that relative outcome information also has a direct effect, as the mediation is only partial in Experiment 2 and as we find an increase in the frequency of misaligned decisions with relative outcome information present even though outcome bias does not increase in Experiment 1.

Experiment 3 provides evidence on evaluations and risk-taking decisions when clients cannot observe the risk choice. Here, for evaluations to affect decision makers’ behaviors, they have to be able to identify the degree of decision versus outcome accountability in the evaluation, which is difficult under hidden action. For example, with an observable decision, when a decision maker gets a negative evaluation after a low outcome even though his choice was aligned, he can conclude that it was the outcome that led the client’s evaluation. In contrast, when the choice is unknown to the client, the decision maker cannot be sure about what the client’s evaluation implies. More generally, to infer an outcome bias from evaluations under information asymmetry requires the decision maker to re-enact the client’s belief revision, which he cannot easily do without knowing the client’s beliefs. Evidence on delegated risk-taking from Experiment 3 indicates that even though decisions are more frequently misaligned when the client’s and the decision maker’s risk preferences do not match, providing clients with the opportunity to make peer comparisons does not increase the frequency of misaligned decisions.Footnote 14

Practical implications, limitations and future directions

Our findings have at least two practical implications. First, they may help to design management control systems in hierarchical organizations (e.g., Merchant & van der Stede, 2017). Such systems serve to guide subordinates such that they contribute to successfully carrying out an organization’s plans, and holding subordinates accountable for their actions and the outcomes of these actions is a core mechanism of management control. Our findings suggest that superiors, in their evaluations of subordinates, may not sufficiently take into account that performance results from the combination of effort, strategy, and factors outside a subordinate’s control. Our setting models risk-taking as a specification of strategy, and we find that participants acting in the role of evaluating superiors fail to consistently account for the subordinate’s strategy in their evaluations. Furthermore, we observe that subjective evaluations are sensitive to other outcomes which are in no way informative about the quality of the decision that has been made. As such it could be beneficial to deliberately exclude such information from evaluations to make them less prone to outcome bias.

Second, it is often argued that ill-designed financial reward systems are a root cause for investment professionals to not act in the interests of their clients (e.g., Chevalier & Ellison, 1997), for managers to take excessive risks (e.g., Bebchuk & Spamann, 2010; Fahlenbrach & Stulz, 2011), or even for managers to act overly conservative (e.g., Gormley & Matsa, 2016; Wiseman & Gomez-Mejia, 1998). However, if clients and shareholders showed outcome bias in their evaluations of these managers in the way we observe in our experiments, their evaluations might contribute to managers’ risk-taking decisions being misaligned with the objectives initially communicated to them. Even though this distortion in incentives might be equally problematic as ill-designed financial incentives, it is significantly more difficult to address.

Our research is subject to several limitations which narrow its contributions, but also provide future research opportunities. The most important limitation of our study is likely that delegation of risk-taking is not at the discretion of the client, but compulsory. There is thus no way of knowing whether and why clients would delegate the risk-taking decision, and observing a misaligned choice is not necessarily a problem if the client preferred delegating the decision in the first place.Footnote 15 There are two potential reasons why a client would delegate the risk-taking decision even though the decision manager has—as in our setting—no superior information about risky prospectsFootnote 16: First, the client may be unsure about her own preferences or future decisions. Then, delegating the risk choice and observing outcomes allow her to learn, and an evaluation may reflect the client updating her preferences. Second, risk preferences maybe affected by emotional reactions to risk rather than cognitive evaluations (Loewenstein et al., 2001). Delegating risk-taking then implies that the risk choice is de-emotionalized (Batteux et al., 2020), and the client may evaluate the decision maker being well aware of this effect of delegation. If clients delegate risk-taking decisions for these potential reasons, we have to be careful with our interpretation of outcome bias in clients’ evaluations contributing to delegated risk-taking decisions being misaligned with the objectives of clients, as clients may be fully satisfied with the risk-taking decisions made on their behalf.

Another limitation of our findings is that in our setting, not only clients received relative outcome information, but also decision makers. Results from Experiment 1 support the conjecture that being able to make peer comparisons does not only affect clients’ evaluations, but also directly decision makers’ behaviors. Such a direct effect has been observed before (e.g., Dijk et al., 2014; Kirchler et al., 2018, 2020). While our experimental design does not allow us to clearly disentangle the indirect and direct effects of relative outcome information, modifications of the design would achieve this goal. Finally, we modeled the situation such that the decision maker chooses between two alternatives only. While such a design is standard in investigations of outcome bias, it does not allow to observe gradual shifts towards more or less risky alternatives.

5 Conclusion

This study reports findings from three experiments investigating determinants of outcome bias in clients’ subjective evaluations, and the effects these evaluations have on interpersonal risk choices. We provide insights into the determinants of outcome bias and present evidence for outcome bias in evaluations contributing to decision makers making risk choices that are not aligned with their client’s risk preferences. Our findings contribute to explaining the behavior of decision makers and their clients in various contexts, and have practical implications in that they shed light on potential dysfunctional effects of subjective evaluations. Fund managers not only have incentive contracts that may induce them to take risks, but, potentially equally importantly, have career incentives that depend on the subjective evaluations they receive from their superiors and private investors. Managers in companies make operating, strategic or financial decisions affecting risk, and the subjective evaluations they receive from their board members, shareholders or other stakeholders contribute to explaining managers’ risk-taking behavior. Advisors and other service agents who have significant influence on their clients’ decisions are likely to anticipate biases in their clients’ evaluations, with the potential results that their recommendations are not in their clients’ best interest. There have been numerous examples stressing the role of ill-designed financial incentives in explaining dysfunctional behaviors of, e.g., investment professionals, corporate managers, financial or legal advisors, or sales representatives. In an admittedly broad view, this study provides a complementary explanation for control failures in that it focuses on the bounded rationality of those who make subjective evaluations of decision makers. When evaluators focus on outcomes in a way that they seem to forget about the causal relationship between the actions a decision maker takes and the probabilities of alternative outcomes from such actions, they may implicitly motivate their decision makers to make decisions that are not aligned with the objectives that were communicated to them in the first place. Such decisions may not necessarily represent excessive risk-taking, but also overly conservative behavior; in either case, though, conflicts of interest would likely not be mitigated, but aggravated.