1 Introduction

Suppose your monthly income is made up of a fixed salary component, y, and a bonus, c. The bonus partly depends on factors outside your control (e.g., market demand for the product you are selling) such that it is obtained only with probability \(0<p<1\). Further suppose you are planning a trip to Galápagos Islands next month and need to book a hotel right now as otherwise everything will be booked out. If you can afford a five-star hotel only in case the bonus is actually paid and cancellation is sufficiently expensive, you will end up booking a four-star hotel (which you can afford from the fixed salary), although you might have preferred the better rated accommodation. This describes a situation of a temporal context, i.e., some decisions must be made before the resolution of uncertainty, and the example illustrates an instance where a sure outcomes may yield different utility than the risky ones. For such intermediate decisions, the minimal outcome of an uncertain choice alternative plays a special role as it is a sure payoff that can instantly be incorporated into current wealth holdings and, thus, is available for the execution of part of a plan before uncertainty resolves (e.g., for immediate consumption purposes). Indeed, it is the common practice of most banks to finance loans by mainly considering the sure salary component as basis for an individual’s credibility to pay back a mortgage on a property.

We propose a model of decision making under uncertainty called chance theory (CT); it incorporates a differential treatment of sure versus uncertain outcomes. As the sure component of the salary in the above example can lead to higher utility than the risky bonus, CT evaluates the corresponding lottery \((1-p,y;p,y+c)\) by

$$\begin{aligned} CT(1-p,y;p,y+c)=v(y)+pu(c), \end{aligned}$$

where v is called the utility for wealth and u is referred to as utility for chance.Footnote 1

In agreement with its prominent role, the sure component of an act has special value under CT. All other outcomes of an act are seen as potential extra payoffs which may be available in addition to the sure component. These genuinely uncertain increments for improvement cannot immediately be used, except for planning conditional on the resolution of uncertainty. Consequently, the chances are of a different value to the decision maker.Footnote 2

Note that the minimal outcome of an act is not a reference point in CT. In reference-dependent models like prospect theory (PT; Kahneman & Tversky, 1979; Tversky & Kahneman, 1992) a reference point is formulated for a given choice situation and all alternatives are evaluated with respect to this reference point; indeed, the recent PT-foundation for risk in Werner and Zank (2019) shows how the reference point of PT emerges endogenously from preferences and proves that it must be unique and common to all lotteries. By contrast, in CT, as each alternative may have a different minimal outcome, it is evaluated with respect to that alternative’s own minimal outcome. In this sense CT is also different to behavior where the choice among two lotteries depends on the best of the minimum outcomes of those lotteries, as suggested in the recent study of Baillon et al. (2020) or in other existing reference-dependent models which feature gains and losses (e.g., Sugden, 2003; Bleichrodt, 2009; Schmidt et al., 2008). At best, one may regard the reference-dependence in CT as being lottery dependent.Footnote 3

Having two separate utility function for risky (u) and riskless outcomes (v) in CT may be desirable from a methodological point of view. In expected utility (EU) both outcome types are evaluated using the same von Neumann-Morgenstern utility. As already noted by Savage (1954) and Luce and Raiffa (1957), the latter utility does not necessarily measure strength of preference for sure outcomes as it reflects both attitude towards wealth and risk attitude without the possibility to identify the two components separately. A concave utility in EU may, therefore, merely result from decreasing marginal utility of wealth and need not reflect an intrinsic aversion towards risk (Dyer & Sarin, 1982). Indeed, the main motivation for the dual theory of Yaari (1987) was two develop a model where risk aversion and decreasing marginal utility of wealth are separated. In the dual theory an agent can be risk averse although marginal utility of wealth is constant. In CT this separation is in some sense more general since v is not required to be linear. As under CT risk attitudes in the strong sense of Rothschild and Stiglitz (1970) are captured entirely by the curvature of u, whereas attitude towards wealth are reflected solely through the curvature of v, the separation of both concepts is achieved exclusively in terms of attitudes towards outcomes and without invoking separate measures that capture attitudes towards probability such as in Yaari (1987), Quiggin (1982) or Kahneman and Tversky (1979). Extensions of CT that incorporate probability weighting are feasible, and descriptively they may be desirable; we do not explore them here.

The aspect we seek to capture is the separate perception of risky versus riskless outcomes, so we focus only on the utility scale. Indeed, empirical evidence suggests that there may be a fundamental difference between riskless and risky utility. The common consequence and common ratio effect of Allais (1953) rely on the existence of safe options. If these options are moved slightly away from certainty, the violation rate of expected utility (EU) is dramatically reduced (Cohen & Jaffray, 1988; Conlisk, 1989). In general, EU seems to perform rather well when only risky options over common outcomes are considered (Camerer, 1992; Harless & Camerer, 1994; Hey & Orme, 1994; Starmer, 2000). More recently, Andreoni and Sprenger (2012), Andreoni and Harbaugh (2010) and Callen et al. (2014) also find evidence for a disproportionate preference for certainty. They argue for so-called u-v models that also impose a different utility function for risky options (u) than the utility for riskless options (v).

Existing u-v models (Fishburn, 1980; Schmidt, 1998; Bleichrodt & Schmidt, 2002; Diecidue et al., 2004; or Neilson, 1992) can be criticized, however, both from a normative and from descriptive perspective. These models either imply violations of (transparent) stochastic dominance or of transitivity, which appers to rule them out as normative models. Violations of transparent stochastic dominance are observed extremely rarely in empirical studies (Loomes & Sugden, 1998; Carbone & Hey, 2000; Hey, 2001). Likewise, the intransitive preference cycles predicted by the model of Bleichrodt and Schmidt (2002) are opposite to those reported in the experimental literature (Starmer & Sugden, 1989). Consequently, also the descriptive validity of existing u-v models can be questioned. By contrast, CT is consistent with transitivity and stochastic dominance and can therefore be regarded as a model that respects many normative desiderata, especially for the context of temporal uncertainty.

Regarding optimal planning in a temporal decision making context, the early examples and discussions in Markowitz (1959, Chapter 10 & 11), Mossin (1969), or Spence and Zeckhauser (1972) indicate that, even if a decision maker acts perfectly rational, unresolved uncertainty may make it impossible to have intermediate decisions made such that they turn out to be efficient ex post. Under EU this implies an increased aversion towards risk, a property that is also captured by CT: consistency with stochastic dominance implies that CT-decision makers always disprefer a lottery to receiving its expected value for sure, i.e., they always display weak risk aversion. For non-temporal settings, such globally consistent behavior may, therefore, be somewhat restrictive. As in real life by far most risky decisions are made in a temporal context, people may be set to give specific attention to the worst outcome and do so even in non-temporal settings. In view of the evidence on different evaluations of risky and riskless options, we think that CT may be a descriptively viable model also in such contexts. As we show below, some mileage in accommodating descriptive phenomena is gained by CT’s built-in certainty effect, which can, for instance, explain the EU-paradoxes of Allais (1953) and Rabin (2000).

The distinctive role of the minimal outcome of a lottery, often referred to as security level, is not unique to CT. Lopes (1987) showed empirically that decision makers focus on the security level when deciding between uncertain alternatives. Such evidence motivated the development of models by Gilboa (1988), Jaffray (1988), and Cohen (1992), and more recently Diecidue van de Ven (2008), which propose that the utility function in EU should depend on the security level. These models, however, do not achieve a separation of riskless and risky utility.

A complementary approach to u-v models, such as CT, is to give extra weight to the minimal outcome without a separation of riskless and risky utility. A focus on the worst outcome also appears in social welfare analysis (Rawls, 1971) and in finance (Roy, 1952); a similar pessimistic outlook reappears in the context of ambiguity aversion (Gilboa & Schmeidler, 1989), where the maximum of the smallest EU-values over a set of priors is decisive for choice behavior. As a consequence, the empirically found ambiguity seeking for unlikely gains or for likely losses (Kilka & Weber, 2001; Abdellaoui et al., 2005; Baillon & Bleichrodt, 2015; Dimmock et al., 2016; Trautmann & Wakker, 2018) cannot be accommodated. To accommodate the latter, it was suggested to include the opposite, an ambiguity seeking multiple prior evaluation, and invoke a parameter that weights ambiguity aversion and ambiguity seeking in some proportion, leading to the \(\alpha\)-Max-Min-model of Ghirardato et al. (2004). Closer to our setting are, however, the models that make adjustments to EU by keeping “the best and worst in mind” (Chateauneuf et al., 2007; Webb & Zank, 2011; Webb, 2015). In these latter theories attention is, in addition to the minimal outcome, also given to the best outcome of a lottery, while intermediate outcome carry a reduced weight in the evaluation of alternatives (see also Lopes, 1987). Our version of CT does not permit special attention to the best outcome, as an extra dimension to also account for optimistic considerations beyond the degree of freedom that we allow for the worst case outcome (e.g., the four-fold pattern of risk attitude in Tversky & Kahneman, 1992) is not incorporated. That said, the tools presented in this paper could in principle be used to develop extensions of CT that incorporate such potential concerns. While we have not progresed in this direction, we note that an approach to separate the best outcome and the worst outcome from the remaining outcomes of an act, keeping the former are evaluated by the same utility, was presented in Alon (2014).Footnote 4

The next section introduces a simple version of CT and shows how the model can explain the afore-mentioned EU-paradoxes. Implications of CT for wealth and risk attitudes are presented in Sect. 3, where also a simple application to intertemporal decision making is considered. Subsequently, CT for the general case of uncertainty, which includes risk as special case, is formally introduced (Sect. 4). Section 5 provides a behavioral foundation for CT and Sect. 6 concludes. Proofs are contained in the Appendix.

2 Basic properties of chance theory

This paper presents general preference foundations for CT under uncertainty. The approach we choose is such that the proposed preference foundations also apply for the case of choice under risk, i.e., the special case of uncertainty with known probabilities for events. We derive the model using the standard principles of completeness, transitivity, monotonicity and continuity in outcomes, which are supplemented by two additive separability conditions. The first of these demands that a minimal outcome of an act and the chances to improve upon it are valued separately. The second implies that chances for improvements are evaluated by an EU-like functional. To obtain first insights into the components governing CT and the predictions of the model, this section considers the special case of an individual choosing among risky alternatives, i.e., finite probability distributions (lotteries) over monetary outcomes.

To use formal notation, as indication of a weak (strict) preference for a lottery over a second one, we employ the traditional symbol \(\succcurlyeq\) ( \(\succ\)); indifference is indicated by \(\sim\), with the reversed preference symbols, \(\preccurlyeq\) and \(\prec\), defined as usual. Next we show that the interpretation of outcomes as a sure payoff plus risky chances together with basic rationality principles impose some restrictions on the utility functions in CT. Consider again the example from the introduction, where the lottery \((1-p,y;p,y+c)\) is evaluated by

$$\begin{aligned} CT(1-p,y;p,y+c)=v(y)+pu(c), \end{aligned}$$

with \(v:\mathbb {R}\rightarrow \mathbb {R}\) and \(u:\mathbb {R}_{+}\rightarrow \mathbb {R}_{+}\) such that \(u(0)=0\), are strictly increasing and continuous functions. As CT-preferences are required to accord with the property of monotonicity in outcomes, this means that an improvement in an outcome of a lottery leads to a strictly preferred lottery. Clearly, an increment of \(\varepsilon >0\) in c leads to a preferred lottery as u is strictly increasing. Alternatively, an increment of \(0<\varepsilon <c\) in y leads to a lottery with a higher sure payoff \(y^{\prime }=y+\varepsilon\) and simultaneously reduced chance, \(c-\varepsilon\), i.e., we obtain the lottery \((1-p,y^{\prime };p,y^{\prime }+c-\varepsilon )\). This being a preferred lottery, CT requires that

$$\begin{aligned} v(y^{\prime })+pu(c-\varepsilon )>v(y)+pu(c) \end{aligned}$$

or, equivalently,

$$\begin{aligned} v(y+\varepsilon )-v(y)>p[u(c)-u(c-\varepsilon )]. \end{aligned}$$

Without imposing bounds on the admissible values for the monetary outcomes, the latter inequality needs to hold for all real y and positive c as well as all values of \(\varepsilon \in (0,c)\) and all probabilities \(p\in (0,1)\). Taking limits when p approaches 1 leads to

$$\begin{aligned} v(y+\varepsilon )-v(y)\ge u(c)-u(c-\varepsilon ) \end{aligned}$$
(1)

for all \(y\in \mathbb {R},\varepsilon >0,\) and \(c-\varepsilon \ge 0\).

Condition (1) is rather strong as it demands that the minimal slope of v is at least as large as the maximal slope of u. For instance, choosing v too concave or u too convex (e.g., simple power-functions) would violate (1). These implications are, however, also driven by the availability of all real values as domain for outcomes and the richness of the probability interval that is naturally given in decision under risk. As we show in Sect. 4, for settings of uncertainty, where the state space may be less rich (e.g., we have finitely many states of nature or there exist a least likely event with positive subjective probability), the analog of condition (1) does not apply in such a restrictive manner.

We now formally define CT under risk for lotteries with more than two outcomes. In CT each lottery \(P=(p_{1},y;p_{2},y+c_{2};\ldots ;p_{l},y+c_{l})\), with real-valued outcomes y and \(y+c_{i},c_{i}\ge 0,i=2,\ldots ,l\) for some \(l\ge 1\) and probabilities \(p_{i},i=1,\ldots ,l\), which are nonnegative and sum to 1,Footnote 5 is evaluated by

$$\begin{aligned} CT(P)=v(y)+\sum _{i=2}^{l}p_{i}u(c_{i}). \end{aligned}$$
(2)

with strictly increasing and continuous utility functions \(v:\mathbb {R} \rightarrow \mathbb {R}\) and \(u:\mathbb {R}_{+}\rightarrow \mathbb {R}_{+}\) with \(u(0)=0\), such that

$$\begin{aligned} v(y+\varepsilon )-v(y)\ge u(c)-u(c-\varepsilon ) \end{aligned}$$
(3)

for all \(y\in \mathbb {R},\varepsilon >0,\) and \(c-\varepsilon \ge 0\). Condition (3) has rather specific behavioral implications, two of which are discussed in the next subsections.

2.1 Chance theory implies risk averse behavior

As it turns out, for decision under risk, a CT-individual always displays weak risk aversion. That is, she prefers the expected value of a risky lottery to the lottery itself. To illustrate this, consider the choice between \(\$50\) for sure and a flip of a fair coin between \(\$0\) and \(\$100\), regarded as a 50:50-lottery. Weak risk aversion demands that \(v(50)\ge v(0)+0.5u(100)\). Since \(u(0)=0\), the right hand side of this inequality can be reformulated as \(v(0)+0.5[u(100)-u(50)]+0.5[u(50)-u(0)]\), and rearranging terms yields the inequality

$$\begin{aligned} v(50)-v(0)\ge 0.5[u(100)-u(50)]+0.5[u(50)-u(0)]. \end{aligned}$$

According to condition (1), \(v(50)-v(0)\) is at least as large as each of the utility differences on the right hand side of the latter inequality, such that under CT the sure \(\$50\) is indeed weakly preferred to the lottery. We believe that this form of risk aversion is reasonable particular in the presence of temporal risk. Note that temporal risk also implies a specific degree of risk aversion for EU (Mossin, 1969; Spence & Zeckhauser, 1972) which, assuming a timeless-temporal setting, can be interpreted as aversion to delayed resolution of uncertainty (Kreps & Porteus, 1978).

2.2 Chance theory can accomodate EU-paradoxes

In this subsection we show how CT is that it can accommodate the classical paradoxes of Allais (1953) as well as the paradox of Rabin (2000). That said, by providing an arguably limited modification to the classical model of EU, we cannot obtain a model that achieves full descriptive generality. For instance, the common ratio example of Problems 7&8 of Kahneman and Tversky (1979), which involve only risky alternatives, cannot be accommodated by CT.Footnote 6 Also, CT, like prospect theory, is unable to accommodate a reversal of preferences resulting from the replacement of common gains by common losses (e.g., Brooks & Zank, 2005, p.313). By treating sure outcomes different to risky ones, CT can accommodate reversals of preferences that result from a so-called certainty effect (Kahneman & Tversky, 1979, p.265) or a similar shift of probability mass across different outcomes: when splitting the probability of an event that gives the lowest outcome into lottery-subevents with distinct outcomes, different utility functions for those outcomes are invoked. We illustrate this descriptive feature of CT for the original the common consequence example of Allais (1953):

$$\begin{aligned} \begin{aligned}\text {A}:\quad 100\% \text { chance of } \$1 \text { million} \quad versus \quad \text {B}: \quad&10\% \text { chance of } \$5 \text { million} \\&89\% \text { chance of } \$1 \text { million} \\&\;\;1\% \text { chance of } \$0 \\ \text {A}^{*}:\quad 11\% \text { chance of } \$1 \text { million} \quad versus \quad \text {B}^{*}:\quad&10\% \text { chance of } \$5 \text { million} \\ \! \!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\! 89\% \text { chance of } \$0 \text { million}\quad versus \quad&90\% \text { chance of } \$0 \end{aligned} \end{aligned}$$

In this example many people choose A in the first choice problem and B\(^{\text {*}}\) in the second one, whereas EU demands that subjects either choose A and A\(^{\text {*}}\) or B and B\(^{\text {*}}\) (or they are always indifferent). The commonly observed choices, in terms of CT, yield \(v(1)>v(0)+0.89u(1)+0.1u(10)\) for the first pair and \(v(0)+0.11u(1)<v(0)+0.1u(5)\) for the second. One can demonstrate that both inequalities can be satisfied simultaneously. For instance, subtracting the second inequality from the first one yields \(v(1)-v(0)-0.11u(1)>0.89u(1)\), which implies \(v(1)-v(0)>u(1)-u(0)\) since \(u(0)=0\). This latter inequality, in a weak form, is always satisfied in CT due to Inequality (1). Hence, CT is either consistent with the implications under EU or it implies a violation in the typical direction. A violation in the opposite direction is excluded under CT.

An example for the common ratio effect is given by the following two lottery pairs (Kahneman & Tversky, 1979):

$$\begin{aligned} \begin{matrix}{}\text {C}: &{} 100\% \text { chance of } \$3000 &{} versus &{} \text {D}: &{}89\% \text { chance of } \$4000\\ &{} &{} &{} &{} \! \! \! \! \! \! \! \! \! \! 20\% \text { chance of } \$0 \\ \text {C}^{*}: &{} 25\% \text { chance of } \$3000 &{} versus &{} \text {D}^{*}: &{}20\% \text { chance of } \$4000\\ &{} \! \! \! \! \! \! \! \! \!75\% \text { chance of } \$0 &{} versus &{} &{}\! \! \! \! \! \! \! \! \! \!90\% \text { chance of } \$0\end{matrix} \end{aligned}$$

EU demands that subjects choose either C and C\(^{\text {*}}\) or D and D\(^{\text {*}}\) (abstracting away the case of joint indifference), whereas there is evidence that many people prefer C and D\(^{\text {*}}\). For this choice pattern CT yields \(v(3000)>v(0)+0.8u(4000)\) and \(v(0)+0.25u(3000)<v(0)+0.2u(4000)\). Rearranging the second inequality to \(u(3000)<0.8u(4000)\) and subtracting it from the first one, gives \(v(3000)-v(0)>u(3000)-u(0)\) which, in weak form, is implied by (1). Therefore, CT is either consistent with EU or it implies the typical common ratio effect pattern of choices; a preference for D and C\(^{\text {*}}\) is excluded.

A further paradox challenging the descriptive validity of EU has been put forward by Rabin (2000). Rabin argues that many people would reject a coin flip where they either win \(\$11\) or lose \(\$10\) and they would do so at all initial wealth levels. Rabin’s calibration theorem shows that this small-stake risk aversion implies unrealistic high degrees of risk aversion for larger stakes. In CT, rejecting the coin flip at initial wealth W implies \(v(W)>v(W-10)+0.5u(21)\). From condition (1) we know that \(v(W)-v(W-10)\ge u(10)\). Therefore, a sufficient condition for resolving the Rabin paradox is \(u(10)>0.5u(21)\), e.g., a slight concavity of u at small stakes, a condition which is not imposing restrictions for choice behavior at higher stakes.

3 Attitudes towards wealth and risk

This section considers risk attitude properties of CT. To ensure comparability with existing results under EU, we continue to assume CT for decision under risk. For simplicity and expositional convenience, we also assume that the wealth utility v and the chance utility u are twice continuously differentiable. In particular, this means that condition (3) can be rewritten as

$$\begin{aligned} \inf _{y\in \mathbb {R}}v^{\prime }(y)\ge \sup _{c\in \mathbb {R}_{+}}u^{\prime }(c), \end{aligned}$$

reiterating the potentially strong implications on behavior for CT-preferences under risk, if the domain of outcomes is unrestricted. Next, we explore various implications for risk attitudes, such as weak and strong risk aversion, attitude towards wealth, and prudence.

3.1 Weak and strong risk aversion

Using the example of Sect. 2 we provided some intuition for the claim that a CT-decision maker exhibits weak risk aversion, i.e., prefers the expected value of a lottery to the lottery itself. The next proposition shows that this result holds in general.

Proposition 1

A preference relation which can be represented by CT always displays weak risk aversion.\(\square\)

A stronger concept of risk aversion has been proposed by Rothschild and Stiglitz (1970), who define risk aversion as aversion to mean-preserving spreads. For a lottery \(P=(p_{1},y;p_{2},y+c_{2};\ldots ;p_{l},y+c_{l})\) with strictly positive probabilities \(p_{1},\ldots ,p_{l}\), a mean-preserving spread is given by \(P^{\prime }=(p_{1},y;\ldots ;p_{k},y+c_{k}+\delta /p_{k};\ldots ;p_{k^{\prime }},y+c_{k^{\prime }}-\delta /p_{k^{\prime }};\ldots ;p_{l},y+c_{l})\) whenever \(c_{k}>c_{k^{\prime }}\) and \(\delta >0\) (such that \(c_{k^{\prime }}-\delta \ge 0\)). Strong risk aversion demands \(P\succ P^{\prime }\) whenever \(P^{\prime }\) is a mean-preserving spread of P. In EU weak and strong risk aversion are equivalent and both are implied by a concave utility function for outcomes. The next result shows that strong risk aversion in CT is equivalent to having a strictly concave chance utility function u. This indicates that strong risk attitude is independent of attitude towards wealth, which has to be captured entirely by the curvature of v (cf., next subsection).

Proposition 2

A preference relation which can be represented by CT displays strong risk aversion if and only if u is strictly concave. \(\square\)

Although CT-preferences that satisfy strong risk aversion imply that u is strictly concave and also that weak risk aversion holds, the indifference sets of lotteries have convex kinks at certainty. Therefore, such CT-preferences exhibit first-order risk aversion as defined by Segal and Spivak (1990). This means that a CT-investor requires a strictly positive premium in order to invest in a risky asset. If the CT-investor is strongly risk averse, he will reduce his investment share if the risk in the asset increases through a zero mean risk on the chance component. This follows from the concavity of u. Clearly, the investor who has invested a share of his wealth into a risky asset will also reduce his share of investment if a zero mean risk added to the asset affects the worst outcome. In that case the statement follows from inequality (3)rather than from the concavity of u. Section 3.4 will discuss investment and savings behavior in a dynamic setting. For that it is important to understand attitudes towards wealth, which are discussed next.

3.2 Attitudes towards wealth

On its own, the curvature of v does not influence whether a CT-individual dislikes or likes mean-preserving spreads in chances, but it does determine how weak risk aversion changes when initial wealth increases. We present this result for decision under risk, and remark that the properties derived below remain valid for the more general case of decision under uncertainty. A subject exhibits constant (decreasing, respectively, increasing) absolute risk aversion if

$$\begin{aligned} (1,w)\sim & {} P=(p_{1},y;p_{2},y+c_{2};\ldots ;p_{l},y+c_{l}) \\&\text {implies} \\ (1,w+x)\sim & {} (\prec ,\succ )(p_{1},y+x;p_{2},y+x+c_{2};\ldots ;p_{l},y+x+c_{l}), \end{aligned}$$

for all outcomes w, non-degenerate lotteries P, and payoffs \(x>0\). Substitution of CT gives

$$\begin{aligned} v(w)=v(y)+\sum _{i=2}^{l}p_{i}u(c_{i})\Rightarrow v(w+x)=(<,>)v(y+x)+\sum _{i=2}^{l}p_{i}u(c_{i}), \end{aligned}$$

which, after subtraction of the former equation from the latter equation (respectively, inequality) and cancellation of common terms, yields

$$\begin{aligned} v(w+x)-v(w)=(<,>)v(y+x)-v(y). \end{aligned}$$

Since by monotonicity \(w>y\), this immediately implies the following result.

Proposition 3

A preference relation which can be represented by CT displays constant (decreasing, increasing) absolute risk aversion if and only if v is linear (concave, convex).\(\square\)

Having seen how risk attitude is decomposed into sensitivity to absolute changes in wealth (captured by v) and sensitivity to chance (captured by u) the next subsection looks at comparative risk attitudes.

3.3 Comparative risk attitudes

An individual A is defined to be more risk averse than another individual B if A always dislikes mean-preserving spreads to a greater extent than B does. In EU this is equivalent to A having for all lotteries a lower certainty equivalent than B. In CT matters are somewhat more complex as we have to distinguish whether a mean-preserving spread changes the value of worst outcome within a lottery or not. In the latter case only u does influence the comparative risk aversion between A and B whereas in the former case both u and v are involved.

As a preparation we formally recall the classical definition of a certainty equivalent of a lottery, which we supplement with the analog notion of a (conditional) chance equivalent. For a lottery \(P=(p_{1},y;p_{2},y+c_{2};\ldots ;p_{l},y+c_{l})\) and a preference relation \(\succcurlyeq\), the certainty equivalent, CE(P), is defined as the outcome that, when obtained with certainty, is indifferent to the lottery P, i.e., \((1,CE(P))\sim P\). Similarly, given P with minimal outcome y, we define the conditional chance equivalent CCE(P) as the chance which, when obtained with probability \(1-\mu\) in addition to y or otherwise obtaining just y, is indifferent to the lottery that gives P with probability \(1-\mu\) or otherwise y for all \(\mu \in (0,1)\), i.e., \((\mu ,y;1-\mu ,CCE(P))\sim (\mu ,y;1-\mu ,P)\) for any \(\mu \in (0,1)\).

For CT-preferences the CCE is independent of the value \(\mu \in (0,1)\). This allows us to define comparative risk aversion of the preferences of individuals A and B, \(\succcurlyeq _{A}\) and \(\succcurlyeq _{B}\), as follows: A is more risk averse than B if we have \(CE_{A}(P)<CE_{B}(P)\) and \(CCE_{A}(P)<CCE_{B}(P)\) for all lotteries P.

Proposition 4

Consider two preference relations \(\succcurlyeq _{A}\) and \(\succcurlyeq _{B}\) which can each be represented by CT. Then A is more risk averse than B if and only if \(u_{A}^{\prime \prime }(c)/u_{A}^{\prime }(c)>u_{B}^{\prime \prime }(c)/u_{B}^{\prime }(c)\) and \(v_{A}^{\prime }(y)/u_{A}(c)>v_{B}^{\prime }(y)/u_{B}(c)\) for all \(c>0,y\in \mathbb {R}\).\(\square\)

Proposition 4 indicates that in order to compare risk attitudes of two CT-decision makers we require a comparison of the absolute degree of curvature of their chance utilities and, further, a comparison of the changes in the wealth utility relative to chance utility. Clearly, if both decision makers share the same utility for wealth, then A is more risk averse than B if and only if \(u_{A}^{\prime \prime }(c)/u_{A}^{\prime }(c)>u_{B}^{\prime \prime }(c)/u_{B}^{\prime }(c)\) for all chances \(c>0\). This implies that \(u_{A}\) is a concave transformation of \(u_{B}\), as required by the first inequality in Proposition 4. Alternatively, if both decision makers share the same chance utility, then the second inequality in Proposition 4 reduces to \(v_{A}^{\prime }(y)>v_{B}^{\prime }(y)\). This means that individual A appreciates changes in wealth more that individual B does. This, together with Inequality (3) implies that A displays more risk aversion than B does. In general, however, a comparison of risk behavior requires a comparison of tradeoffs for chance and a comparison of tradeoffs between wealth and chance for both individuals.

3.4 Prudence

According to Kimball (1990) an agent is prudent if adding an insurable zero-mean risk to his future wealth raises his optimal saving. This precautionary saving can be analyzed in a simple model with two periods, 0 and 1, and a discounted utility for period 1. Let \(W_{0}\) be the deterministic wealth in period 0 whereas wealth in period 1 is given by the random variable \(\tilde{W}_{1}\). With expected utility theory preferences, optimal savings S of a weak risk averse agent are determined by maximizing

$$\begin{aligned} U(W_{0}-S)+\beta \mathbb {E}[U(\tilde{W}_{1}+\rho S)], \end{aligned}$$

where \(0<\beta <1\) is the time discount factor and \(\rho =1+r\) for the interest rate \(r\ge 0\), \(\mathbb {E}\) is the expectation operator and U is a standard von-Neumann-Morgenstern utility. If \(\underline{W}_{1}\) denotes the possible minimum wealth in period 1, the equivalent expression under CT is

$$\begin{aligned} v(W_{0}-S)+\beta \{v(\underline{W}_{1}+\rho S)+\mathbb {E}[u(\tilde{W}_{1}- \underline{W}_{1})]\}, \end{aligned}$$

where the difference \(\tilde{W}_{1}-\underline{W}_{1}\) means that payoff \(\underline{W}_{1}\) is subtracted from each outcome of the variable \(\tilde{W} _{1}\) to ensure domain restrictions for the chance utility u. As the term \(\mathbb {E}[u(\tilde{W}_{1}-\underline{W}_{1})]\) is independent of S the optimal level of saving (\(S^{*}\)) corresponds to the solution of the problem

$$\begin{aligned} \max \{v(W_{0}-S)+\beta v(\underline{W}_{1}+\rho S)\}, \end{aligned}$$

subject to the constraint that \(0\le S\le W_{0}\). Therefore, the chance utility u has no influence on the optimal saving \(S^{*}\), hence \(S^{*}\) is completely determined by the curvature of v. It could be an interior solution (\(S^{*}\in (0,W_{0})\), e.g., for strictly concave v) or a corner solution (\(S^{*}\in \{0,W_{0}\}\), e.g., if v is convex). For all cases, if the riskiness of \(\tilde{W}_{1}\) increases without changing \(\underline{W}_{1}\), optimal savings remain unchanged and the individual will proceed with the original consumption plan.

If the riskiness of \(\tilde{W}_{1}\) increases such that \(\underline{W}_{1}\) is affected but changes are sufficiently small, the corner solutions to the optimal savings problem will, in generic cases,Footnote 7 not be affected, hence, leaving the agent with the original consumption plan. For interior solutions, the direction in which \(\underline{W}_{1}\) is affected will lead to adjustments in the optimal savings \(S^{*}\) in the opposite direction. As a result the agent will adjust the consumption plan accordingly; for instance, a reduction in \(\underline{W}_{1}\) calls for an increment in \(S^{*}\), hence the agent exhibits prudence. We summarize this result formally:

Proposition 5

Consider the two period consumption model of this section and an agent with CT-preferences. The following statements hold:

  1. (i)

    The (curvature of the) chance utility u has no effect on optimal savings.

  2. (ii)

    If the agent exhibits prudence, then locally around optimal saving \(S^{*}\) the wealth utility v exhibits decreasing absolute risk aversion, i.e., v is concave at \(S^{*}\).

  3. (iii)

    If optimal saving \(S^{*}\) is unique and takes value in \(\{0,W_{0}\}\), then adding a small zero-mean risk to future wealth, has no effect on the agents consumption plan.\(\square\)

3.5 Chance theory is not an extension of EU

As indicated above, CT usually exhibits a preference for certainty, thereby allowing the model to accommodate the Allais paradox to EU. This type of behavior is well-documented in the experimental literature and cannot be accommodated by standard EU-preferences. Many alternatives to EU that accommodate the certainty effect and allow for EU-preferences as special case do not separate riskless and risky utility. Instead, they introduce probability weighting as an additional means to model risk behavior such as in rank-dependent utility (Quiggin, 1982) or cumulative prospect theory (Tversky & Kahneman, 1992). The tools that we employ to obtain this separation in CT could, in principle, be generalized to include non-linear probability weighting also in CT. Since CT has even in the absence of probability weighting many descriptively desireable properties, we did not explore such additional flexibility here.

We think it is important to show that CT is an alternative that has little overlap with EU. In fact, CT and EU only have expected value preferences in common. To see this, note that for a lottery \(P=(p_{1},y;p_{2},y+c_{2}; \ldots ;p_{l},y+c_{l})\) the EU-expression (assuming von-Neumann-Morgenstern utility index U over final wealth positions) is

$$\begin{aligned} EU(P)=\sum _{i=1}^{l}p_{i}U(y+c_{i}), \end{aligned}$$

while P has CT-value

$$\begin{aligned} CT(P)=\sum _{i=1}^{l}p_{i}[v(y)+u(c_{i})]. \end{aligned}$$

Thus, if \(U(y+c)=v(y)+u(c)\) these values are the same. However, such a utility function U depends on the lottery’s smallest outcome, a dependency that is not available in EU. In particular, for \(y=0\) we obtain \(U(c)=u(c)\) for all \(c\ge 0\), which implies that \(U(y+c)-U(c)=v(y)\) for all \(y\in \mathbb {R}\) and \(c\ge 0\) and, in turn, linearity of U and hence of u and of v. We conclude that both EU and CT represent the same preferences if and only if \(U(y+c)=\alpha (y+c)+\beta\) for some positive \(\alpha\) and a real number \(\beta\).

The analysis in this subsection shows that CT is not an extension to EU but it is an alternative that intersects with EU only in the case of risk neutrality. This reminds of some similarity to mean-variance preferences often employed in finance, which agree with EU-preferences only if the von-Neumann-Morgenstern utility is a quadratic function. The risk aversion in CT seems of a stronger form than the combination of mean and variance of a lottery due to the very specific appreciation of increments in sure wealth in CT.

4 A formal outline for uncertainty

Our preceeding analyzes were exploring the case of decision under risk to obtain a direct comparison with behavior for EU-preferences. For more generality, it is custom to present preference foundations for decision under uncertainty as the results then also apply to the special case of risk, the special case of uncertainty when probabilities for events are known (for instance, by applying the procedure of Köbberling & Wakker, 2003, Section 5.3). Hence, we consider a finite set of states, \(\mathcal {S}=\{1,\ldots ,n\}\), for a natural number \(n\ge 3\), and \(\mathcal {A }=2^{\mathcal {S}}\) is the algebra of subsets of \(\mathcal {S}\).Footnote 8 Elements \(E\in \mathcal {A}\) are called events. An act f assigns to each state a real valued outcome. The set of acts \(\mathcal {F}\) can be identified with the Cartesian product space \(\mathbb {R}^{n}\), and hence, we write \(f=(f_{1},\ldots ,f_{n})\), where \(f_{s}\) is short for f(s), \(s\in \mathcal {S }\).

In CT it is assumed that the decision maker focuses on the worst outcome when choosing between acts. Suppose act f has its worst outcome \(f_{m}\) in state m (as shortcut for ‘minimum’ and the dependence of m on the act f is left implicit in this notation). Then \(f_{m}\) can be regarded as the risk-free wealth of the decision maker when choosing f (or the sure part of the act that can instantly be integrated into current wealth holdings) and \(f_{t}-f_{m},t\ne m\) are the possible improvements in the other states (for which the integration into current wealth is prevented by uncertainty that needs to be resolved). Therefore each act is interpreted as offering a sure outcome and separately a chance for improvements; for short, chances. CT is based on preference conditions such that a decision maker evaluates risk-free wealth with a utility function v and possible increments with a subjective expected utility like evaluation based on a utility function u for chances. Specifically, Chance Theory holds if all acts f are evaluated by

$$\begin{aligned} CT(f)=v(f_{m})+\sum _{t\in \mathcal {S}}\pi _{t}u(f_{t}-f_{m}), \end{aligned}$$
(4)

where state m is the state with the lowest outcome of f, \(\pi _{t},t\in \mathcal {S}\), are the (positive) subjective probabilities of the decision maker generated by a probability measure \(\mathcal {P}\) over events, the (riskless) utility for wealth, \(v:\mathbb {R\rightarrow }\) \(\mathbb {R}\), is strictly increasing and continuous, and the (risky) utility for chances, \(u:\mathbb {R}_{+}\mathbb {\rightarrow }\) \(\mathbb {R}_{+}\), is strictly increasing and continuous with \(u(0)=0\). Further, as we demand that CT is increasing in each outcome, v and u, are related as follows: for all \(y\in \mathbb {R},c>\varepsilon >0\)

$$\begin{aligned} v(y+\varepsilon )-v(y)>\max _{s\in \mathcal {S}}\{ \sum _{t\ne s}\pi _{t}\}[u(c)-u(c-\varepsilon )]. \end{aligned}$$
(5)

Under CT the subjective probabilities are uniquely determined and the utility functions are jointly cardinal with location constraints applying to u (i.e., v and u can be replaced by \(\tilde{v}=\alpha v+\beta\) and \(\tilde{u}=\alpha u\) whenever \(\alpha >0\) and \(\beta \in \mathbb {R}\)).

We have argued for why it is plausible that a decision maker focuses on the worst outcome and evaluates possible increments separately and with a different utility function. A similar segregation procedure is informally discussed in the editing phase of prospect theory (Kahneman & Tversky, 1979). For acts which have only two strictly positive outcomes (\(f_{1}\) and \(f_{2}\)), the minimal gain (say \(f_{1}\)) is segregated in prospect theory and the representation becomes \(V(f_{1})+\pi _{2}(V(f_{2})-V(f_{1}))\), where V denotes the prospect theory value function. Our representation differs by the fact that we obtain two separate utility functions v and u and that incremental gains are evaluated directly by u and not by their value difference. Of course a further difference is that in CT the worst consequence is always segregated, even if some outcomes in the considered act are negative. However, outcomes in our framework are interpreted as potential final wealth positions and not in terms of gains and losses relative to a reference point as in prospect theory. Due to limited liability, final wealth should become negative only in exceptional cases.

5 Preference foundation

We consider a preference relation \(\succcurlyeq\) on the set of acts. As usual, \(f\succcurlyeq g\) means that act f is weakly preferred to act g. The symbols \(\succ\) and \(\sim\) denote strict preference and indifference, respectively. The preference relation \(\succcurlyeq\) is a weak order if it is complete (\(f\succcurlyeq g\) or \(g\succcurlyeq f\) for any acts fg) and transitive. A functional \(V:\mathcal {F}\rightarrow \mathbb {R}\) represents the preference relation \(\succcurlyeq\) if for all \(f,g\in \mathcal {F}\) we have \(f\succcurlyeq g\Leftrightarrow V(f)\ge V(g)\).

To simplify the exposition, we use \(f_{E}g\) for an act that agrees with the act f on event E and with the act g on the complement \(E^{c}\). Also, we use \(h_{s}f\) instead of \(h_{\{s\}}f\) for any state \(s\in \mathcal {S}\). Sometimes we identify constant acts with the corresponding outcome. We may thus write \(f_{E}x\) for an act that agrees with f on E and gives outcome x for states \(s\in E^{c}\). Similarly, we write \(x_{E}f\) for an act that agrees with f on \(E^{c}\) and gives outcome x for states \(s\in E\).

We recall some standard properties for the preference \(\succcurlyeq\) and the corresponding result for a representation V. The preference relation \(\succcurlyeq\) on \(\mathcal {F}\) satisfies (strong) monotonicity if \(f\succ g\) whenever \(f_{s}\ge g_{s}\) for all states \(s\in \mathcal {S}\) with a strict inequality for at least one state. By employing this condition we ensure that the subjective probabilities, derived later, are positive. This follows because monotonicity excludes null states, that is, states where the preference is independent of the magnitude of outcomes. Formally, a state s is null if \(x_{s}f\sim y_{s}f\) for all acts f and all outcomes xy.

The continuity condition defined here is continuity with respect to the Euclidean topology on \(\mathbb {R}^{n}\): \(\succcurlyeq\) satisfies continuity if for any act f the sets \(\{g\in\) \(\mathcal {F}|g\succcurlyeq f\}\) and \(\{g\in \mathcal {F}|g\preccurlyeq f\}\) are closed subsets of \(\mathbb {R}^{n}\). Following Debreu (1954) we know that the preference relation \(\succcurlyeq\) is a continuous monotonic weak order if and only if there exists a representation \(V:\mathcal {F}\rightarrow \mathbb {R}\) for \(\succcurlyeq\) that is continuous and strictly increasing in each argument. Further, this representation is continuously ordinal, that is, V is unique up to strictly increasing continuous transformations.

To obtain foundations for CT, our goal is to restrict the general representation V by requiring a form of additive separability, more restrictive than traditional additive separability (Debreu, 1960). To present the required properties we require further notation. By \(\mathcal {F}_{m}:=\{f\in\) \(\mathcal {F}|f_{t}\ge f_{m},t=1,\ldots ,n\}\) we denote the set of acts that have the worst possible outcome in state \(m\in \mathcal {S}\). One can view acts from \(\mathcal {F}_{m}\) as offering a sure component in state m and (a distribution of) chances \(\tilde{c}=(c_{1}\ldots ,c_{n})\) with \(c_{m}=0\). Thus, for \(f\in \mathcal {F}_{m}\) we can write \(f=(m:f_{m}; \tilde{c}^{f})\), where \(\tilde{c}^{f}=(f_{1}-f_{m},\ldots ,f_{n}-f_{m})\).

To simplify the exposition of the next preference condition we write

$$\begin{aligned} (m:y;\tilde{c}):=(y+c_{1},\ldots ,y+c_{m-1},y,y+c_{m+1},\ldots ,y+c_{n}) \end{aligned}$$

for an act that has its minimal outcome y in state m and outcome \(c_{s}+y\) in all other states \(s\not =m\). With this notation in mind, we propose a strengthening of the triple cancellation property, which has already been used to obtain additive separability (e.g., Wakker, 1989).Footnote 9 The preference relation \(\succcurlyeq\) satisfies consistent worst outcome segregation (WOS) if for all \(m,m^{\prime }\in \mathcal {S}\):

$$\begin{aligned} (m:w;\tilde{c})\sim (m:x;\tilde{d}) \quad &, \quad (m:y;\tilde{c})\sim (m:z;\tilde{d}) \\ \text {and }(m^{\prime }:w;\tilde{c}^{\prime })\sim (m^{\prime }:x;\tilde{d}^{\prime })\quad &\Rightarrow \quad (m^{\prime }:y;\tilde{c}^{\prime })\sim (m^{\prime }:z;\tilde{d}^{\prime }). \end{aligned}$$

WOS entails three important requirements. First, it demands a segregation of the worst outcome from all other possible outcomes; special attention is given to the worst outcome and separately to changes from that worst outcome within an act. Second, when \(m=m^{\prime }\), the condition demands a condition akin of triple cancellation. The latter has frequently been used in the derivation of additively separable representations when the objects of choice are two dimensional. In our case one can interpret state m as the first dimension and the combination of residual states as the second dimension.

The third requirement of WOS is a consistency requirement for equivalent worst outcome tradeoffs. The first two indifferences, \((m:w;\tilde{c})\sim (m:x;\tilde{d})\) and \((m:y;\tilde{c})\sim (m:z;\tilde{d})\), indicate that obtaining w instead of x as worst outcomes is equivalent to obtaining y instead of z as worst outcomes when the preference difference for the potential improvements captured in \(\tilde{c}\) and \(\tilde{d}\) that can be obtained in states other than m are equalized. Similarly, the latter two indifferences in WOS indicate that trading off the worst outcome w for x is equivalent to trading off the worst outcome y for z given the preference difference for the potential improvements in \(\tilde{c}^{\prime }\) and \(\tilde{d}^{\prime }\) that can be obtained states other than \(m^{\prime }\). The consistency requirement says that such equivalent worst outcome tradeoffs are independent of the states where they are observed. This consistency requirement is similar in spirit to the tradeoff consistency property recently proposed by Alon (2014), where consistency of tradeoffs is required for worst outcomes and, different to us, also for best outcomes. The difference here is that we focus exclusively on the worst outcomes and, additionally, equivalent tradeoffs are measured conditional on improvements from the worst outcomes. Combining WOS with the rationality principles of weak order and monotonicity, and requiring continuity we obtain the following result.

Lemma 6

Assume that the preference relation \(\succcurlyeq\) on \(\mathcal {F}\) is a continuous monotonic weak order that satisfies consistent worst outcome segregation. Then, there exists continuous strongly monotonic functions \(v:\mathbb {R}\rightarrow \mathbb {R},U_{-m}:\mathbb {R}_{+}^{n-m}\times \{0\} \times \mathbb {R}_{+}^{m-1}\rightarrow \mathbb {R}_{+}\) with \(U_{-m}(0,\ldots ,0)=0,m=1,\ldots ,n\), such that on each set \(\mathcal {F}_{m}\) the preference \(\succcurlyeq\) is represented by

$$\begin{aligned} W(f)=v(f_{m})+U_{-m}(\tilde{c}^{f}). \end{aligned}$$

Further, for \(\alpha >0\) and real-valued \(\beta ,\) the function v can be replaced by \(\alpha v+\beta\) whenever \(U_{-m}\) is replaced by \(\alpha U_{-m}\), \(m\in \{1,\ldots ,n\}\).\(\square\)

The uniqueness results regarding the utility function v in Lemma 6 state that it is a cardinal function, yet a replacement of v by a cardinally transformed utility necessitates adjustment of \(U_{-m}\) by a corresponding scaled transformation for each state m. Thus, except for the latter restriction, which is due to the requirement that \(U_{-m}\) takes value 0 for the distribution of zero chances, we have joint cardinality of the corresponding functions as is typical for additive representations. However, in contrast to results on general additive representations (Wakker, 1989), the function v is independent of the state m that gives the minimum outcome within an act. Moreover, we do have a family of (binary) additive representations, one for each \(\mathcal {F}_{m},m=1,\ldots ,n\), that share the common wealth utility v.

Next we are concerned with preference conditions that allow for additive separability of the function \(U_{-m},m=1,\ldots ,n\). It turns out that our conditions are stronger, as they require proportionality of the resulting additively separable functions which, in turn, allows for the identification of subjective probabilities for states. We require that the preference relation \(\succcurlyeq\) satisfies tradeoff consistency for chances (TCC):

$$\begin{aligned} \text {if }(f_{m}+a)_{E}f\sim & {} (f_{m}+b)_{E}g\text { and }(f_{m}+c)_{E}f\sim (f_{m}+d)_{E}g \\ \text {then }(f_{m}^{\prime }+a)_{E^{\prime }}f^{\prime }\sim & {} (f_{m}^{\prime }+b)_{E^{\prime }}g^{\prime }\text { implies }(f_{m}^{\prime }+c)_{E^{\prime }}f^{\prime }\sim (f_{m}^{\prime }+d)_{E^{\prime }}g^{\prime }, \end{aligned}$$

whenever all acts are from the same set \(\mathcal {F}_{m},m\in \{1,\ldots ,n\}\), \(m\notin E,E^{\prime }\) and the latter are non-null events.

The general tool of tradeoff consistency has been advanced in Köbberling and Wakker (2003) and applies to outcomes. That way, foundations for subjective EU (Savage, 1954), Choquet EU (Schmeidler, 1989), or modern prospect theory (Tversky & Kahneman, 1992) have been obtained. Here we invoke the principle in a segregated manner only for chances. As a result TCC is meaningless if there are exactly two states of nature. Our assumption of at least three states of nature, paired with monotonicity, circumvents this case.Footnote 10 We obtain the following result.

Theorem 7

The following two statements are equivalent for a preference relation \(\succcurlyeq\) on \(\mathcal {F}\).

  1. (i)

    The preference relation \(\succcurlyeq\) is represented by CT, with (positive) subjective probabilities \(\pi _{s},s=1,\ldots ,n,\) and strictly increasing continuous functions \(v:\mathbb {R}\rightarrow \mathbb {R}\) and \(u:\mathbb {R}_{+}\rightarrow \mathbb {R}_{+}\), with \(u(0)=0\) such that for all \(y\in \mathbb {R},x>\varepsilon >0\) Eq. (5) holds.

  2. (ii)

    The preference relation satisfies weak order, monotonicity, continuity, worst outcome separability and tradeoff consistency for chance.

Further, the subjective probabilities are uniquely determined and vu can be replaced by \(\tilde{v}=\alpha v+\beta\) and \(\tilde{u}=\alpha u\) whenever \(\alpha >0\) and \(\beta \in \mathbb {R}\).\(\square\)

The proof of Theorem 7 is provided in the Appendix.

6 Further discussion and conclusion

In this section we discuss some merits and shortcomings of CT, before we then conclude. As the minimal outcome plays a crucial role in CT, the important question arises how it is determined in real-life choice situations. One could argue that there is always a minimal chance that a meteorite will strike the earth such that immediate death is the worst outcome for all lotteries. In this case CT would basically reduce to EU. There is, however, empirical evidence that people ignore such rare events when making decisions (Sjöberg, 1999, 2000; Stone et al., 1994). In line with this evidence also the editing phase of prospect theory assumes that consequences with very low probabilities are ignored. Also for, e.g., banks it seems unlikely that they take into account such events when deciding upon a loan. When applying CT we therefore propose that decision makers evaluate only events which are relevant for the given choice situation in some sense in “short-term”. According to Gennaioli and Shleifer (2010) such decision makers are called local thinkers (see also Bordalo et al., 2012 for an alternative analysis of such behavior). An alternative approach would be the introduction of thresholds such that the minimal outcome only enters the utility evaluation if its cumulative probability exceeds a certain minimum. This procedure is formalized in Schmidt and Zimper (2007) and could also be integrated into CT.

One can ask why we need another theory of decision making under uncertainty given the large number of alternatives to EU already proposed in the literature. CT is motivated by (i) the fact that certain outcomes may yield higher utility than risky ones in a temporal setting, (ii) the theoretical desirability to separate riskless and risky utility, (iii) recent evidence in favor for a disproportionate preference for certainty in decisions under risk, which can also explain the paradoxes of Allais and Rabin. In view of this, two classes of models seem to be particularly relevant for comparison with CT, previous u-v models and (cumulative) prospect theory. As argued in the introduction, previous u-v models violate transparent stochastic dominance or transitivity and are, therefore, unsatisfactory from both a theoretical and an empirical point of view. One major contribution of CT is the fact that it overcomes these problems. As prospect theory explains a preference for certainty by probability weighting, it does not provide a separation of riskless and risky utility as CT. Prospect theory has often been criticized for being too general as it does not have a theory which determines the location of the reference point (Pesendorfer, 2006; Fudenberg, 2006).Footnote 11 If one is entirely free to choose the location of the reference point, many different variants of behavior can be accommodated such that no concrete implications for economic problems can be derived. In this sense CT is much less general, hence, easier to falsify. We regard it as a strength of CT that it makes rather specific behavioral predictions resulting from relatively weak assumptions. In our view the risk aversion implied by CT is particularly convincing in the presence of temporal risk as the decision maker can plan efficiently only with the minimal value of future wealth. Given this role of minimal outcomes, CT could well be regarded as a normative theory of “interim” decision making under uncertainty since basic rationality requirements, such as transitivity and monotonicity, are also satisfied. Depending on how uncertainty unfolds, this interim decision usually will not be optimal ex-post, in particular as CT demands a “pessimistic approach” to planning that results from the imposed global weak risk aversion. The upshot of such pessimism is that, ex-post, one may have room for pleasant surprises.

7 Appendix: Proofs

Proof of Proposition  1:

We have to show that for all lotteries \(P=(p_{1},x_{1};\ldots ;p_{l},x_{l})\) obtaining the expected value of the lottery, \(EV(P)=\sum _{i=1}^{l}p_{i}x_{i}\), for sure is weakly preferred to the lottery P. That is, \((1,EV(P))\succcurlyeq P\) for all lotteries P. To this aim, let \(x_{m}\) be the smallest outcome of some arbitrary lottery \(P=(p_{1},x_{1};\ldots ;p_{l},x_{l})\). Substitution of CT into the preceding preference gives

$$\begin{aligned} v(EV(P))\ge v(x_{m})+\sum _{i=1}^{l}p_{i}u(x_{i}-x_{m}), \end{aligned}$$

which is equivalent to

$$\begin{aligned} v(EV(P))-v(x_{m})\ge \sum _{i=1}^{l}p_{i}u(x_{i}-x_{m}). \end{aligned}$$
(6)

Observe that for any \(k\in \mathbb {N}\) the following identity holds

$$\begin{aligned} v(EV(P))-v(x_{m})=\sum _{j=1}^{k}\{v[EV(P)-\frac{j-1}{k}(EV(P)-x_{m})]-v[EV(P)-\frac{j}{k}(EV(P)-x_{m})]\}. \end{aligned}$$

By setting

$$\begin{aligned} \Delta v_{k}^{+}:=\min _{j\in \{1,\ldots ,k\}}\{ \frac{v[EV(P)-\frac{j-1}{k}(EV(P)-x_{m})]-v[EV(P)-\frac{j}{k}(EV(P)-x_{m})]}{\frac{1}{k}[EV(P)-x_{m}]}\} \end{aligned}$$

we obtain the following inequality

$$\begin{aligned} v(EV(P))-v(x_{m})&\ge \sum _{j=1}^{k}\frac{\Delta v_{k}^{+}}{k}[EV(P)-x_{m}] \\&= {} \Delta v_{k}^{+}[EV(P)-x_{m}]. \end{aligned}$$

Let \(y\in [x_{m},EV(P)]\) be an outcome where the slope of v is smallest. Such an outcome exists as we assumed that v is continuously differentiable over \(\mathbb {R}\). Then, there exists \(\delta >0\) such that \(\Delta v_{k}^{+}\ge [v(y+\varepsilon )-v(y)]/\varepsilon\) for all \(0<\varepsilon <\delta\). Note that the latter inequality is strict, unless v is linear on \([x_{m},EV(P)]\). It then follows that

$$\begin{aligned} v(EV(P))-v(x_{m})\ge \frac{v(y+\varepsilon )-v(y)}{\varepsilon } [EV(P)-x_{m}] \end{aligned}$$
(7)

for all \(0<\varepsilon <\delta\) with a strict inequality unless v is linear on \([x_{m},EV(P)]\).

Next, observe that for any \(k_{i}\in \mathbb {N}\), \(i=1,\ldots ,l\), the following identity holds

$$\begin{aligned} \sum _{i=1}^{l}p_{i}u(x_{i}-x_{m})=\sum _{i=1}^{l}p_{i}\sum _{j_{i}=1}^{k_{i}}\{u[x_{i}-\frac{j_{i}-1}{k_{i}}(x_{i}-x_{m})]-u[x_{i}-\frac{j_{i}}{k_{i}}(x_{i}-x_{m})]\}. \end{aligned}$$

By setting for each \(i=1,\ldots ,l\)

$$\begin{aligned} \Delta u_{k_{i}}^{-}:=\max _{j_{i}\in \{1,\ldots ,k_{i}\}}\{ \frac{u[x_{i}-\frac{j_{i}-1}{k_{i}}(x_{i}-x_{m})]-u[x_{i}-\frac{j_{i}}{k_{i}}(x_{i}-x_{m})]}{\frac{1}{k_{i}}(x_{i}-x_{m})}\} \end{aligned}$$

we obtain the inequality

$$\begin{aligned} \sum _{i=1}^{l}p_{i}u(x_{i}-x_{m})\le \sum _{i=1}^{l}p_{i}\Delta u_{k_{i}}^{-}(x_{i}-x_{m}). \end{aligned}$$

Further, by setting

$$\begin{aligned} \Delta u_{P}^{-}:=\max _{i\in \{1,\ldots ,l\}}\{ \Delta u_{k_{i}}^{-}\} \end{aligned}$$

we obtain

$$\begin{aligned} \sum _{i=1}^{l}p_{i}u(x_{i}-x_{m})&\le \Delta u_{P}^{-}\sum _{i=1}^{l}p_{i}(x_{i}-x_{m}) \\&= \Delta u_{P}^{-}[EV(P)-x_{m}]. \end{aligned}$$

Let \(c\in [0,\max _{i\in \{1,\ldots ,l\}}(x_{i}-x_{m})]\) be an outcome where the slope of u is largest.Footnote 12 Such an outcome exists as u is continuously differentiable. Then, there exists \(\delta ^{\prime }>0\) such that \(\Delta u_{P}^{-}\le [u(c)-u(c-\varepsilon )]/\varepsilon\) for all \(0<\varepsilon <\delta ^{\prime }\), with a strict inequality unless u is linear on \([0,\max _{i\in \{1,\ldots ,l\}}\{x_{i}-x_{m}\}]\). It then follows that

$$\begin{aligned} \sum _{i=1}^{l}p_{i}u(x_{i}-x_{m})\le \frac{u(c)-u(c-\varepsilon )}{ \varepsilon }[EV(P)-x_{m}] \end{aligned}$$
(8)

for all \(0<\varepsilon <\delta ^{\prime }\) with a strict inequality unless u is linear on \([0,\max _{i\in \{1,\ldots ,l\}}(x_{i}-x_{m})]\).

Let \(\delta ^{*}=\min \{ \delta ,\delta ^{\prime }\}\). Then, the Inequalities (7) and (8) above hold for all \(0<\varepsilon <\delta ^{*}\). Recall that Inequality (3) in the main text also holds, i.e.,

$$\begin{aligned} v(y^{\prime }+\varepsilon ^{\prime })-v(y^{\prime })\ge u(c^{\prime })-u(c^{\prime }-\varepsilon ^{\prime }) \end{aligned}$$

for all \(y^{\prime }\in \mathbb {R},c^{\prime }>\varepsilon ^{\prime }>0\), and thus, it holds in particular for \(y^{\prime }=y,c^{\prime }=c\) and \(\varepsilon ^{\prime }=\varepsilon\) for all \(0<\varepsilon <\delta ^{*}\). Therefore,

$$\begin{aligned} \frac{v(y+\varepsilon )-v(y)}{\varepsilon }[EV(P)-x_{m}]\ge \frac{u(c)-u(c-\varepsilon )}{\varepsilon }[EV(P)-x_{m}] \end{aligned}$$

holds for all \(0<\varepsilon <\delta ^{*}\). Invoking Inequality (7) for the term on the left and Inequality (8) for the term on the right implies Inequality (6) as desired. Further, Inequality (6) is strict unless u is linear on \([0,\max _{i\in \{1,\ldots ,l\}}(x_{i}-x_{m})]\) and v is linear on \([x_{m},EV(P)]\).

As P was an arbitrary non-degenerate lottery, it follows that CT implies weak risk aversion for decision under risk. This completes the proof of Proposition 1.\(\square\)

Proof of Proposition  2:

We have to show that strict concavity of u under CT is equivalent to \(P=(p_{1},x_{1}; \ldots ;p_{l},x_{l})\succ P^{\prime }=(p_{1},x_{1};\ldots ;p_{k},x_{k}+\delta /p_{k};\ldots ;p_{k^{\prime }},x_{k^{\prime }}-\delta /p_{k^{\prime }};\ldots ;p_{l},x_{l})\) whenever \(x_{k}>x_{k^{\prime }}\) and \(\delta >0\).

Step (i): Suppose that \(x_{k^{\prime }}-\delta /p_{k^{\prime }}\) is not the minimal outcome of \(P^{\prime }\). Then P and \(P^{\prime }\) have the same minimal outcome. In this case CT reduces to an EU-like expression. By standard results that apply to EU, it follows that u is must be strictly concave. Conversely, strict concavity of u is sufficient for \(P\succ P^{\prime }\) if \(x_{k^{\prime }}-\delta /p_{k^{\prime }}\) is not the minimal outcome of \(P^{\prime }\).

Step (ii): Suppose now that \(x_{k^{\prime }}\) is the minimal outcome of P. Then \(x_{k^{\prime }}-\delta /p_{k^{\prime }}\) is the minimal outcome of \(P^{\prime }\). We have to show that strict concavity of u implies that \(P\succ P^{\prime }\). After substitution of CT the latter strict preference is equivalent to

$$\begin{aligned} v(x_{k^{\prime }})+\sum _{i=1}^{l}p_{i}u(x_{i}-x_{k^{\prime }})> & {} v(x_{k^{\prime }}-\delta /p_{k^{\prime }})+\sum _{\begin{array}{c} i=1 \\ i\ne k,k^{\prime } \end{array}}^{l}p_{i}u(x_{i}-x_{k^{\prime }}+\delta /p_{k^{\prime }})+ \\&p_{k}u(x_{k}+\delta /p_{k}-x_{k^{\prime }}+\delta /p_{k^{\prime }})+p_{k^{\prime }}u(0). \nonumber \end{aligned}$$
(9)

As u is strictly concave we have

$$\begin{aligned} p_{k}u(x_{k}+\delta /p_{k}-x_{k^{\prime }}+\delta /p_{k^{\prime }})+p_{k^{\prime }}u(0)<p_{k}u(x_{k}-x_{k^{\prime }}+\delta /p_{k})+p_{k^{\prime }}u(\delta /p_{k^{\prime }}). \end{aligned}$$

Therefore it remains to show that

$$\begin{aligned} v(x_{k^{\prime }})+\sum _{i=1}^{l}p_{i}u(x_{i}-x_{k^{\prime }})>v(x_{k^{\prime }}-\delta /p_{k^{\prime }})+\sum _{i=1}^{l}p_{i}u(x_{i}-x_{k^{\prime }}+\delta /p_{k^{\prime }}). \end{aligned}$$
(10)

Rearranging yields

$$\begin{aligned} v(x_{k^{\prime }})-v(x_{k^{\prime }}-\delta /p_{k^{\prime }})> & {} \sum _{i=1}^{l}p_{i}[u(x_{i}-x_{k^{\prime }}+\delta /p_{k^{\prime }})-u(x_{i}-x_{k^{\prime }})]. \nonumber \\\Leftrightarrow & {} \nonumber \\ \sum _{i=1}^{l}p_{i}[v(x_{k^{\prime }})-v(x_{k^{\prime }}-\delta /p_{k^{\prime }})]> & {} \sum _{i=1}^{l}p_{i}[u(x_{i}-x_{k^{\prime }}+\delta /p_{k^{\prime }})-u(x_{i}-x_{k^{\prime }})] \end{aligned}$$
(11)

From condition (3) we know that the utility of wealth difference on the left side exceeds each chance utility difference on the right hand side of the preceding inequality. Therefore, \(P\succ P^{\prime }\) follows whenever \(x_{k^{\prime }}\) is the minimal outcome of P.

Step (iii): It remains to consider the case that \(x_{k^{\prime }}\) is not the minimal outcome of P but \(x_{k^{\prime }}-\delta /p_{k^{\prime }}\) is the minimal outcome of \(P^{\prime }\). Suppose that \(x_{m}\) is the minimal outcome of P. We have \(x_{k^{\prime }}>x_{m}\ge x_{k^{\prime }}-\delta /p_{k^{\prime }}\). If \(x_{m}=x_{k^{\prime }}-\delta /p_{k^{\prime }}\), then \(P\succ P^{\prime }\) follows from strict concavity of u using arguments similar to those in Step (i). Alternatively, if \(x_{k^{\prime }}>x_{m}>x_{k^{\prime }}-\delta /p_{k^{\prime }}\), the mean-preserving spread from P to \(P^{\prime }\) can be obtained from two successive mean-preserving spreads follows. Define \(\delta ^{\prime }\) such that \(x_{m}=x_{k^{\prime }}+\delta ^{\prime }/p_{k^{\prime }}\) and set \(P^{\prime \prime }=(p_{1},x_{1};\ldots ;p_{k},x_{k}+\delta /p_{k};\ldots ;p_{k^{\prime }},x_{k^{\prime }}-\delta /p_{k^{\prime }};\ldots ;p_{l},x_{l})\). By construction \(P^{\prime \prime }\) is a mean-preserving spread of P and both lotteries have the same minimal outcome. Thus, \(P\succ P^{\prime \prime }\) follows as u is concave. Further, \(P^{\prime }\) is a mean-preserving spread of \(P^{\prime \prime }\) with minimal outcome \(x_{k^{\prime }}+\delta ^{\prime }/p_{k^{\prime }}\) and \(x_{k^{\prime }}+\delta ^{\prime }/p_{k^{\prime }}-\delta /p_{k^{\prime }}\). By Step (ii) it then follows that \(P^{\prime \prime }\succ P^{\prime }\). Thus, \(P\succ P^{\prime \prime }\), \(P^{\prime \prime }\succ P^{\prime }\) and transitivity imply \(P\succ P^{\prime }\). Thus, Steps (i)–(iii), complete the proof of Proposition 2. \(\square\)

Proof of Proposition 3:

The proof follows immediately from the arguments provided preceding the proposition in the main text. \(\square\)

Proof of Proposition 4:

Let \(P=(p_{1},x_{1};\ldots ;p_{l},x_{l})\) be a non-degenerate lottery with worst outcome \(x_{m}\). Expressing the definition of \(CCE_{j}(P)\), or \(CCE_{j}\) for short, (\(j=A,B\)), in terms of CT yields: \(v_{j}(x_{m})+(1-\lambda )u_{j}(CCE_{j}-x_{m})=v_{j}(x_{m})+(1-\lambda )\sum _{i=1}^{l}p_{i}u_{j}(x_{i}-x_{m})\) which implies \(u_{j}(CCE_{j}-x_{m})= \sum _{i=1}^{l}p_{i}u_{j}(x_{i}-x_{m}).\) As the \(CCE_{j}\) is defined purely by the component of CT based on u, which is an EU-like form, we know from Pratt (1964) that we have \(CCE_{A}-x_{m}<CCE_{B}-x_{m}\), i.e., \(CCE_{A}<CCE_{B}\) if and only if \(u_{A}^{\prime \prime }(c)/u_{A}^{\prime }(c)>u_{B}^{\prime \prime }(c)/u_{B}^{\prime }(c)\) for all \(c\in \mathbb {R}_{++}\).

Expressing the definition of \(CE_{j}(P)\), or \(CE_{j}\) for short, (\(j=A,B\)), in terms of CT yields: \(v_{j}(CE_{j})=v_{j}(x_{m})+\sum _{i}p_{i}u_{j}(x_{i}-x_{m})\). This implies

$$\begin{aligned} \frac{v_{A}(CE_{A})-v_{A}(x_{m})}{\sum _{i=1}^{l}p_{i}u_{A}(x_{i}-x_{m})}=1= \frac{v_{B}(CE_{B})-v_{B}(x_{m})}{\sum _{i=1}^{l}p_{i}u_{B}(x_{i}-x_{m})}, \end{aligned}$$

which can be expressed as

$$\begin{aligned} \frac{\int _{x_{m}}^{CE_{A}}v_{A}^{\prime }(y)dy}{u_{A}(CCE_{A}-x_{m})}=\frac{ \int _{x_{m}}^{CE_{B}}v_{B}^{\prime }(y)dy}{u_{B}(CCE_{B}-x_{m})}. \end{aligned}$$
(12)

First we assume that \(u_{A}^{\prime \prime }(c)/u_{A}^{\prime }(c)>u_{B}^{\prime \prime }(c)/u_{B}^{\prime }(c)\) and \(v_{A}^{\prime }(y)/u_{A}(c)>v_{B}^{\prime }(y)/u_{B}(c)\) for all \(y\in \mathbb {R}\) and all \(c>0\) holds. We prove that \(CE_{A}<CE_{B}\). From \(u_{A}^{\prime \prime }(c)/u_{A}^{\prime }(c)>u_{B}^{\prime \prime }(c)/u_{B}^{\prime }(c)\) for all \(c>0\) it follows (see above) that \(CCE_{A}<CCE_{B}\). Consider \(C_{A}\) defined by

$$\begin{aligned} \frac{\int _{x_{m}}^{C_{A}}v_{A}^{\prime }(y)dy}{u_{A}(CCE_{B}-x_{m})}=1= \frac{\int _{x_{m}}^{CE_{B}}v_{B}^{\prime }(y)dy}{u_{B}(CCE_{B}-x_{m})}. \end{aligned}$$
(13)

Since \(v_{A}^{\prime }(y)/u_{A}(CCE_{B}-x_{m})>v_{B}^{\prime }(y)/u_{B}(CCE_{B}-x_{m})\) for all \(y\in \mathbb {R}\), we must have \(C_{A}<CE_{B}\). As

$$\begin{aligned} \frac{\int _{x_{m}}^{C_{A}}v_{A}^{\prime }(y)dy}{u_{A}(CCE_{B}-x_{m})}=1= \frac{\int _{x_{m}}^{CE_{A}}v_{A}^{\prime }(y)dy}{u_{A}(CCE_{A}-x_{m})} \end{aligned}$$
(14)

and \(u_{A}(CCE_{A}-x_{m})<u_{A}(CCE_{B}-x_{m})\) we must have \(CE_{A}<C_{A}<CE_{B}\). Thus, \(CE_{A}<CE_{B}\) follows, as desired.

Next, we assume that \(CE_{A}(\tilde{P})<CE_{B}(\tilde{P})\) for all nondegenerate lotteries \(\tilde{P}\) and prove that \(v_{A}^{\prime }(y)/u_{A}(c)>v_{B}^{\prime }(y)/u_{B}(c)\) for all \(c>0,y\in \mathbb {R}\) (as \(u_{A}^{\prime \prime }(c)/u_{A}^{\prime }(c)>u_{B}^{\prime \prime }(c)/u_{B}^{\prime }(c)\) has already been shown). For an arbitrary probability \(0<p<1\) and arbitrary \(c>0,y\in \mathbb {R}\), consider the lottery \(P=(1-p,y;p,y+c)\). Obviously we have \(v_{A}(CE_{A}(P))=v_{A}(y)+pu_{A}(c)\) which yields

$$\begin{aligned} \frac{v_{A}(CE_{A}(P))-v_{A}(y)}{pu_{A}(c)}=1. \end{aligned}$$
(15)

Analogously we get

$$\begin{aligned} \frac{v_{B}(CE_{B}(P))-v_{B}(y)}{pu_{B}(c)}=1. \end{aligned}$$
(16)

This implies

$$\begin{aligned} \frac{\int _{y}^{CE_{A}}v_{A}^{\prime }(z)dz}{pu_{A}(c)}=1=\frac{ \int _{y}^{CE_{B}}v_{B}^{\prime }(z)dz}{pu_{B}(c)}. \end{aligned}$$
(17)

Suppose now that p converges to zero. This implies that \(CE_{A}\) and \(CE_{B}\) converge to y. This follows from the fact that, in CT, the term that depends only on u is continuous in probabilities away from 1. Now \(CE_{A}<CE_{B}\) can only hold if \(v_{A}^{\prime }(y)/u_{A}(c)>v_{B}^{\prime }(y)/u_{B}(c)\). As this argument is valid arbitrary \(c>0,y\in \mathbb {R}\), it is valid for all \(c>0,y\in \mathbb {R}\).

This completes the proof of Proposition 4.\(\square\)

Proof of Proposition 5:

The proof follows immediately from the arguments provided preceding the proposition in the main text. \(\square\)

Proof of Lemma 6:

First we note that by Debreu (1954) the preference conditions imply the existence of a strongly monotonic continuous function \(W:\mathbb {R}^{n}\rightarrow \mathbb {R}^{n}\) that represents the preference \(\succcurlyeq\) on \(\mathcal {F}\). Obviously, W also represents \(\succcurlyeq\) on each set \(\mathcal {F}_{m},m=1,\ldots ,n\).

Let us fix an arbitrary state \(m\in \mathcal {S}\). Without loss of generality let \(m=1\). Continuity and monotonicity imply that for each act \(f\in \mathcal {F}_{1}\) there exists a unique outcome \(x^{f}\) such that \(f\sim (f_{1},x^{f},\ldots ,x^{f})\). Next we restrict the analysis to the set of quasi binary acts \(\{f\in \mathcal {F}_{1}|(f_{1},x^{f},\ldots ,x^{f})\}\). In our notation for acts (i.e., \(f=(1:f_{1};f-f_{1})\)) this set is isomorphic to the two dimensional set \(F_{1}:=\{f\in \mathcal {F} _{1}|(1:f_{1};x^{f}-f_{1})\}\) (\(\cong \mathbb {R}\times \mathbb {R}_{+}\)). The restriction of the preference \(\succcurlyeq\) to \(F_{1}\), which for simplicity we also denote \(\succcurlyeq\), inherits weak order, continuity and monotonicity from \(\succcurlyeq\) on \(\mathcal {F}_{1}\). Additionally, it satisfies the triple cancellation condition formulated with indifferences on \(F_{1}\). In the presence of weak order, monotonicity and continuity, and the structural richness that we have in \(F_{1}\), the indifference version of triple cancellation is equivalent to the preference version of the triple cancellation (see Köbberling & Wakker, 2003, for a similar argument showing that their indifference version of tradeoff consistency is equivalent to the preference version of tradeoff consistency). Hence, by Corollary 3.6 and Remark 3.7 of Wakker (1993a) it follows that there exists (jointly cardinal) continuous and strictly increasing functions \(V_{1}:\mathbb {R}\rightarrow \mathbb {R}\) and \(U_{-1}:\mathbb {R}_{+}\rightarrow \mathbb {R}\) such that \(\succcurlyeq\) on \(F_{1}\) is represented by

$$\begin{aligned} W_{1}(f)=V_{1}(f_{1})+U_{-1}(x^{f}-f_{1}). \end{aligned}$$

Note that continuity and monotonicity imply that \(U_{-1}(0)=0\). This means that, for \(\alpha >0\) and \(\beta\) real, \(V_{1}\) can be replaced by \(\alpha V_{1}+\beta\) whenever \(U_{-1}\) is replaced by \(\alpha U_{-1}\). Next we extend \(U_{-1}\) from \(\mathbb {R}_{+}\) to \(\{0\} \times \mathbb {R}_{+}^{n-1}\). Using the indifference \(f\sim (1:f_{1};0,x^{f}-f_{1},\ldots ,x^{f}-f_{1})\) we can define \(U_{-1}:\{0\} \times \mathbb {R}_{+}^{n-1}\rightarrow \mathbb {R}\) through

$$\begin{aligned} U_{-1}(0,f-f_{1}):=U_{-1}(0,x^{f}-f_{1},\ldots ,x^{f}-f_{1})\equiv U_{-1}(0,x^{f}-f_{1})\text {.} \end{aligned}$$

This way continuity and strong monotonicity of \(U_{-1}\) is inherited through \(x^{f}\). Then, for \(f,g\in \mathcal {F}_{1}\), we have

$$\begin{aligned} f\succcurlyeq g & \Leftrightarrow (f_{1},x^{f}-f_{1})\succcurlyeq (g_{1},x^{g}-g_{1}) \\ &\Leftrightarrow V_{1}(f_{1})+U_{-1}(x^{f}-f_{1})\ge V_{1}(g_{1})+U_{-1}(x^{g}-g_{1}) \\ &\Leftrightarrow V_{1}(f_{1})+U_{-1}(f-f_{1})\ge V_{1}(g_{1})+U_{-1}(g-g_{1}), \end{aligned}$$

demonstrating that \(W_{1}(f)=V_{1}(f_{1})+U_{-1}(f-f_{1})\) represents \(\succcurlyeq\) on \(\mathcal {F}_{1}\). Obviously, the uniqueness results for \(V_{1}\) and \(U_{-1}\) are maintained through this extension of the representation.

In the preceding analysis we have fixed state \(m=1\). The proof for any arbitrary state \(m\in \mathcal {S}\backslash \{1\}\) is completely analogous. Hence, we can conclude that for each state \(m\in \mathcal {S}\) there exists strongly monotonic and continuous functions \(V_{m}:\mathbb {R}\rightarrow \mathbb {R},U_{-m}:\{0\} \times \mathbb {R}_{+}^{n-1}\rightarrow \mathbb {R}\) such that on each set \(\mathcal {F}_{m}\) the preference \(\succcurlyeq\) is represented by

$$\begin{aligned} W_{m}(f)=V_{m}(f_{m})+U_{-m}(f-f_{m}). \end{aligned}$$

The functions \(V_{m},U_{-m}\) satisfy the corresponding uniqueness results for the representation \(W_{m}\) on \(\mathcal {F}_{m},\) for all \(m\in \mathcal {S}\).

Next we apply once more WOS. We take two arbitrary but distinct states \(m,m^{\prime }\in \mathcal {S}\). Locally, in a small neighborhood, we can find for all wx, yz and \(f,g,f^{\prime },g^{\prime }\), such that the following three indifferences hold: \((m:w;f)\sim (m:x;g)\), \((m:y;f)\sim (m:z;g)\) and \((m^{\prime }:w;f^{\prime })\sim (m^{\prime }:x;g^{\prime })\) and all acts are from \(\mathcal {F}_{m}\cap \mathcal {F}_{m^{\prime }}\). This statement requires \(|\mathcal {S}|>2\). By WOS it follows that \((m^{\prime }:y;f^{\prime })\sim (m^{\prime }:z;g^{\prime })\) with these acts being from \(\mathcal {F}_{m}\cap \mathcal {F}_{m^{\prime }}\). As both \(W_{m}\) and \(W_{m^{\prime }}\) represent preferences on \(\mathcal {F}_{m}\cap \mathcal {F}_{m^{\prime }}\) these functions must be ordinal transformations of each other. Further, substitution of \(W_{m}\) in the former two indifferences and taking differences of the resulting equations and cancelling common terms, implies \(V_{m}(w)-V_{m}(x)=V_{m}(y)-V_{m}(z)\). Similarly, substitution of \(W_{m^{\prime }}\) in the latter two indifferences, implies \(V_{m^{\prime }}(w)-V_{m^{\prime }}(x)=V_{m^{\prime }}(y)-V_{m^{\prime }}(z)\). As, wx and yz were arbitrary, it follows that \(V_{m}\) and \(V_{m^{\prime }}\) are (first locally and by continuity also globally) proportional. As \(W_{m}=V_{m}\) and \(W_{m^{\prime }}=V_{m^{\prime }}\) for all constant acts, and the latter are included in \(\mathcal {F}_{m}\cap \mathcal {F}_{m^{\prime }}\), it follows that \(W_{m}\) and \(W_{m^{\prime }}\) are, actually, cardinally related. Hence, we can choose them identical on the set of common acts \(\mathcal {F}_{m}\cap \mathcal {F}_{m^{\prime }}\). In particular this means that \(U_{-m}=U_{-m^{\prime }}\) on \((\mathbb {R}_{+}^{n-m}\times \{0\} \times \mathbb {R}_{+}^{m-1})\cap (\mathbb {R}_{+}^{n-m^{\prime }}\times \{0\} \times \mathbb {R}_{+}^{m^{\prime }-1})\). As m and \(m^{\prime }\) were arbitrary chosen, we can set \(v:=V_{s}\) for all \(s\in \mathcal {S}\). That is

$$\begin{aligned} W_{s}(f)=v(f_{s})+U_{-s}(f-f_{s}) \end{aligned}$$

holds on each set \(\mathcal {F}_{s}\). Further, uniqueness results are maintained for v and \(U_{-s},s\in \mathcal {S}\). This completes the proof of Lemma 6.\(\square\)

Proof of Theorem 7:

First we prove that statement (ii) implies statement (i). Monotonicity of \(\succcurlyeq\) follows from the strong monotonicity of W. Continuity of \(\succcurlyeq\) follows from the continuity of W. The fact that W is representing \(\succcurlyeq\) on \(\mathcal {F}\) implies that \(\succcurlyeq\) is a weak order. Notice that on each set \(\mathcal {F}_{s}\) the act f is evaluated by \(W(f)=v(f_{s})+U(\tilde{c}^{f}),\) where \(U(\tilde{c}^{f})=\sum _{t\in \mathcal {S}\backslash \{s\}}\pi _{t}u(f_{t}-f_{s})\), which is an additively separable representation of \(\succcurlyeq\) on \(\mathcal {F}_{s}\) and has v independent of s. Hence, substitution of W for the indifferences in the definition of WOS immediately shows that \(\succcurlyeq\) satisfies WOS. To derive tradeoff consistency for chance assume that the acts in the following indifferences are all from the same set \(\mathcal {F}_{m}\) for some state \(m\in \mathcal {S}\), and that

$$\begin{aligned} (f_{m}+a)_{E}f & \sim (f_{m}+b)_{E}g, \\ (f_{m}+c)_{E}f & \sim (f_{m}+d)_{E}g \\ \text {and }(f_{m}^{\prime }+a)_{E^{\prime }}f^{\prime }&\sim (f_{m}^{\prime }+b)_{E^{\prime }}g^{\prime }\text { hold} \\ \text {but }(f_{m}^{\prime }+c)_{E^{\prime }}f^{\prime }&\succ (f_{m}^{\prime }+d)_{E^{\prime }}g^{\prime } \end{aligned}$$

for some non-null events \(E,E^{\prime }\) which do not include state m. Substitution of W into the first two indifferences implies that

$$\begin{aligned} v(f_{m})+\sum _{r\in \mathcal {S}\backslash [E\cup \{m\}]}\pi _{r}u(f_{r}-f_{m})+\sum _{t\in E}\pi _{t}u(a)&= v(f_{m})+\sum _{r\in \mathcal {S}\backslash [E\cup \{m\}]}\pi _{r}u(g_{r}-f_{m})+\sum _{t\in E}\pi _{t}u(b) \\&\text {and} \\ v(f_{m})+\sum _{r\in \mathcal {S}\backslash [E\cup \{m\}]}\pi _{r}u(f_{r}-f_{m})+\sum _{t\in E}\pi _{t}u(c)&= v(f_{m})+\sum _{r\in \mathcal {S}\backslash [E\cup \{m\}]}\pi _{r}u(g_{r}-f_{m})+\sum _{t\in E}\pi _{t}u(d) \end{aligned}$$

hold. Subtracting the second equation from the first and cancelling common terms gives:

$$\begin{aligned} \sum _{t\in E}\pi _{t}[u(a)-u(b)]=\sum _{t\in E}\pi _{t}[u(c)-u(d)] \end{aligned}$$

or

$$\begin{aligned} u(a)-u(b)=u(c)-u(d) \end{aligned}$$

as the probabilities \(\pi _{t},t\in E\) is positive.

Substitution of W into the third indifference and the latter preference gives, by using similar calculations,

$$\begin{aligned} u(a)-u(b)<u(c)-u(d), \end{aligned}$$

resulting in a contradiction.

If in the previous analysis, instead of \((f_{m}^{\prime }+c)_{E^{\prime }}f^{\prime }\succ (f_{m}^{\prime }+d)_{E^{\prime }}g^{\prime }\), we assume \((f_{m}^{\prime }+c)_{E^{\prime }}f^{\prime }\prec (f_{m}^{\prime }+d)_{E^{\prime }}g^{\prime }\), a similar contradiction (i.e., \(u(a)-u(b)=u(c)-u(d)\) and \(u(a)-u(b)>u(c)-u(d)\)) is obtained. Hence, \((f_{m}^{\prime }+c)_{E^{\prime }}f^{\prime }\sim (f_{m}^{\prime }+d)_{E^{\prime }}g^{\prime }\) must hold. As \(m\in \mathcal {S}\) was arbitrary, it follows that tradeoff consistency for chance holds on each set \(\mathcal {F}_{s}\). Hence, it holds on \(\mathcal {F}\).

Finally, the property that for all \(y\in \mathbb {R},c>\varepsilon >0\)

$$\begin{aligned} v(y+\varepsilon )-v(y)>u(c)-u(c-\varepsilon ), \end{aligned}$$

holds follows from the strong monotonicity property of W and Lemma 6.

Next we assume statement (ii) and derive statement (i). The assumptions of Lemma 6 are satisfied, hence, there exists continuous strongly monotonic functions \(v:\mathbb {R}\rightarrow \mathbb {R},U_{-s}:\mathbb {R}_{+}^{n-s}\times \{0\} \times \mathbb {R}_{+}^{s-1}\) with \(U_{-s}(0,\ldots ,0)=0,\) \(s\in \mathcal {S}\), such that the preference \(\succcurlyeq\) is represented by

$$\begin{aligned} W(f)=v(f_{s})+U_{-s}(f-f_{s}) \end{aligned}$$

for \(f\in \mathcal {F}_{s}\). Further, for \(\alpha >0\) and real-valued \(\beta ,\) the function v can be replaced by \(\alpha v+\beta\) whenever \(U_{-s}\) is replaced by \(\alpha U_{-s}\), \(s\in \{1,\ldots ,n\}\).

Take an arbitrary state \(m\in \mathcal {S}\). Tradeoff consistency for chance implies that if

$$\begin{aligned} (f_{m}+a)_{t}f & \sim (f_{m}+b)_{t}g, \\ (f_{m}+c)_{t}f & \sim (f_{m}+d)_{t}g \\ \text {and }(f_{m}^{\prime }+a)_{t^{\prime }}f^{\prime } & \sim (f_{m}^{\prime }+b)_{t^{\prime }}g^{\prime }\text { hold,} \end{aligned}$$

then \((f_{m}^{\prime }+c)_{t^{\prime }}f^{\prime }\sim (f_{m}^{\prime }+d)_{t^{\prime }}g^{\prime }\) follows, provided that \(t,t^{\prime }\ne m\) and all acts involved are from the set \(\mathcal {F}_{m}\). Substituting \(W=v+U_{-m}\) we obtain that on \(\mathbb {R}_{+}^{n-m}\times \{0\}\times \mathbb {R}_{+}^{m-1}\) the equalities

$$\begin{aligned} U_{-m}((f_{m}+a)_{t}f-(f_{m},\ldots ,f_{m}))&= U_{-m}((f_{m}+b)_{t}g-(f_{m},\ldots ,f_{m})) \\ U_{-m}((f_{m}+c)_{t}f-(f_{m},\ldots ,f_{m}))& = U_{-m}((f_{m}+d)_{t}g-(f_{m},\ldots ,f_{m})) \\ \text {and }U_{-m}((f_{m}^{\prime }+a)_{t^{\prime }}f^{\prime }-(f_{m}^{\prime },\ldots ,f_{m}^{\prime }))&= U_{-m}((f_{m}^{\prime }+b)_{t^{\prime }}g^{\prime }-(f_{m}^{\prime },\ldots ,f_{m}^{\prime })) \\ \text {imply }U_{-m}((f_{m}^{\prime }+c)_{t^{\prime }}f^{\prime }-(f_{m}^{\prime },\ldots ,f_{m}^{\prime }))&= U_{-m}((f_{m}^{\prime }+d)_{t^{\prime }}g^{\prime }-(f_{m}^{\prime },\ldots ,f_{m}^{\prime })). \end{aligned}$$

This condition is analogous to tradeoff consistency (see Köbberling & Wakker, 2003) for the function \(U_{-m}\) (representing a continuous monotonic preference) on \(\mathbb {R}_{+}^{n-m}\times \{0\}\times \mathbb {R}_{+}^{m-1}\).

Following Köbberling and Wakker (2003, Corollary 10) this implies that there exist positive numbers \(\rho _{-m,s},s\in \mathcal {S}\backslash \{m\}\) and a continuous strictly increasing utility function \(u_{-m}:\mathbb {R}_{+}\rightarrow \mathbb {R}\) such that

$$\begin{aligned} U_{-m}(f-f_{m})=\phi _{-m}[\sum _{s\in \mathcal {S}\backslash \{m\}}\rho _{-m,s}u_{-m}(f_{s}-f_{m})]\text {,} \end{aligned}$$

where \(\phi _{-m}\) is a continuous and strictly increasing transformation of a subjective EU-like representation. Further, \(U_{-m}(0)=0\) implies \(\phi _{-m}[u_{-m}(0)]=0\).

Recall that \(m\in \mathcal {S}\) was arbitrary chosen. Hence, we conclude that for each state \(s\in \mathcal {S}\) there exists a strictly increasing and continuous function \(u_{-s}:\mathbb {R}_{+}\rightarrow \mathbb {R}\) and a strictly increasing and continuous function \(\phi _{-s}:u_{-s}[\mathbb {R}_{+}]\rightarrow \mathbb {R}\) with \(\phi _{-s}[u_{-s}(0)]=0\), and positive numbers \(\rho _{-s,t},t\in \mathcal {S}\backslash \{s\}\) such that \(W_{s}(f):=v(f_{s})+\phi _{-s}[\sum _{t\in \mathcal {S}\backslash \{s\}}\rho _{-s,t}u_{-s}(f_{t}-f_{s})]\) represents the preference on the set of acts \(\mathcal {F}_{s}\).

Take any two distinct states \(s,s^{\prime }\in \mathcal {S}\) and consider the restriction of \(\succcurlyeq\) on the set of acts \(\mathcal {F}_{\{s,s^{\prime }\}}:=\mathcal {F}_{s}\cap \mathcal {F}_{s^{\prime }}\). On this set both \(W_{s}\) and \(W_{s^{\prime }}\) represent \(\succcurlyeq\). Uniqueness results imply that \(\phi _{-s}=\phi _{-s^{\prime }}\) and \(u_{-s^{\prime }}=u_{-s}\) (as both have v in common) and that \(\rho _{-s,t}=\rho _{-s^{\prime },t}=:\rho _{t}\) for all \(t\in \mathcal {S}\backslash \{s,s^{\prime }\}\). As \(s,s^{\prime }\in \mathcal {S}\) were chosen arbitrary it follows that the positive numbers \(\rho _{-s,t}\) are independent of \(s\in \mathcal {S}\), such that we have n positive numbers \(\rho _{t},t\in \mathcal {S}\). Similarly this holds for \(\phi _{-s}\) and \(u_{-s}\); we write \(\tilde{\phi }\) and \(\tilde{u}\), respectively, instead. Further, continuity at 0 implies \(\tilde{u}(0)=0\) and, hence, \(\tilde{\phi }:\mathbb {R}_{+}\rightarrow \mathbb {R}\) with \(\tilde{\phi }(0)=0\).

Now we set \(\rho =\sum _{t\in \mathcal {S}}\rho _{t}\) and define

$$\begin{aligned} \pi _{t}:=\frac{\rho _{t}}{\rho }\text { for each }t\in \mathcal {S} \end{aligned}$$

and obtain a subjective probabilities for the states in \(\mathcal {S}\). Finally, we define \(u:=\rho \tilde{u}\) and adjust \(\tilde{\phi }\) as \(\phi :=\tilde{\phi }[\frac{(\cdot )}{\rho }]\) to maintain the same domain for \(\tilde{\phi }\circ \tilde{u}\) for \(\phi \circ u\).

Hence, we have shown that the representations W of \(\succcurlyeq\) on \(\mathcal {F}\) is of the form

$$\begin{aligned} W(f)=v(f_{s})+\phi [\sum _{t\in \mathcal {S}\backslash \{s\}}\pi _{t}u(f_{t}-f_{s})]\text { for }f\in \mathcal {F}_{s},s\in \mathcal {S}. \end{aligned}$$

Now we apply again tradeoff consistency for chances to show that \(\phi\) is linear. As all functions in the preceding representation are continuous, it follows that, locally, we can always find outcomes \(y\ne x\) and \(y^{\prime },x^{\prime }\), and nonnegative \(c,d,c^{\prime }\) such that the following three equations hold:

$$\begin{aligned} v(y)+\phi [\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}u(c)]&= v(x)+\phi [\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}u(d)], \\ v(y)+\phi [\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}u(d)]&= v(x)+\phi [\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}u(c^{\prime })] \end{aligned}$$

and

$$\begin{aligned} v(y^{\prime })+\phi [\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}u(c)]=v(x^{\prime })+\phi [\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}u(d)] \end{aligned}$$

hold. As \(y\ne x\) it follows that \(c,d,c^{\prime }\) are distinct chance outcomes and, hence, \(y^{\prime }\ne x^{\prime }\). Written in terms of preferences, the last three equations are equivalent to

$$\begin{aligned} (y,y+c,\ldots ,y+c)&\sim (x,x+d,\ldots ,x+d), \\ (y,y+d,\ldots ,y+d) &\sim (x,x+c^{\prime },\ldots ,x+c^{\prime }) \end{aligned}$$

and

$$\begin{aligned} (y^{\prime },y^{\prime },y^{\prime }+c,\ldots ,y^{\prime }+c)\sim (x^{\prime },x^{\prime },x^{\prime }+d,\ldots ,x^{\prime }+d) \end{aligned}$$

respectively. Applying tradeoff consistency for chances means that the following indifference also holds

$$\begin{aligned} (y^{\prime },y^{\prime },y^{\prime }+d,\ldots ,y^{\prime }+d)\sim (x^{\prime },x^{\prime },x^{\prime }+c^{\prime },\ldots ,x^{\prime }+c^{\prime }) \end{aligned}$$

or, by substituting the representation W, equivalently, that the next equality is satisfied

$$\begin{aligned} v(y^{\prime })+\phi [\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}u(d)]=v(x^{\prime })+\phi [\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}u(c^{\prime })]. \end{aligned}$$

Combining the four preceding equations by taking differences, we obtain

$$\begin{aligned} \phi [\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}u(c)]-\phi [\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}u(d)] & = \phi [\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}u(d)]-\phi [\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}u(c^{\prime })], \\ \phi [\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}u(c)]-\phi [\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}u(d)] &= \phi [\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}u(d)]-\phi [\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}u(c^{\prime })]. \end{aligned}$$

By setting \(\pi _{\mathcal {S}\backslash \{1\}}:=\sum _{t\in \mathcal {S}\backslash \{1\}}\pi _{t}>0\) and \(\pi _{\mathcal {S}\backslash \{1,2\}}:=\sum _{t\in \mathcal {S}\backslash \{1,2\}}\pi _{t}>0\), and noting that \(\pi _{\mathcal {S}\backslash \{1\}}\ne \pi _{\mathcal {S}\backslash \{1,2\}}\) we obtain

$$\begin{aligned} \phi [\pi _{\mathcal {S}\backslash \{1\}}u(c)]-\phi [\pi _{\mathcal {S}\backslash \{1\}}u(d)] &= \phi [\pi _{\mathcal {S} \backslash \{1\}}u(d)]-\phi [\pi _{\mathcal {S}\backslash \{1\}}u(c^{\prime })], \\ \phi [\pi _{\mathcal {S}\backslash \{1,2\}}u(c)]-\phi [\pi _{\mathcal {S}\backslash \{1,2\}}u(d)] &= \phi [\pi _{\mathcal {S}\backslash \{1,2\}}u(d)]-\phi [\pi _{\mathcal {S}\backslash \{1,2\}}u(c^{\prime })]. \end{aligned}$$

This means that, locally, for all chances \(c,d,c^{\prime }\) whenever one of the preceding equation hold then also the other equation is satisfied. This implies, that locally \(\phi [\pi _{\mathcal {S}\backslash \{1\}}u(\cdot )]\) and \(\phi [\pi _{\mathcal {S}\backslash \{1,2\}}u(\cdot )]\) are proportional. As \(\pi _{\mathcal {S}\backslash \{1\}}\ne \pi _{\mathcal {S}\backslash \{1,2\}}\) and both probabilities are positive, this proportionality implies, locally, that \(\phi\) is linear. As all functions are continuous, local linearity implies global linearity. Thus, our representation W of \(\succcurlyeq\) on \(\mathcal {F}\) can be written in the form

$$\begin{aligned} W(f)=v(f_{s})+\sum _{t\in \mathcal {S}\backslash \{s\}}\pi _{t}u(f_{t}-f_{s}) \text { for }f\in \mathcal {F}_{s},s\in \mathcal {S}. \end{aligned}$$

Hence, we have derived statement (i) of the theorem.

By construction the probabilities \(\pi _{s},s\in \mathcal {S}\) are uniquely determined. By construction, for positive \(\alpha\) and some constant \(\beta\), we can replace v and u by \(\alpha v+\beta\) and \(\alpha u\), respectively. That there is no further flexibility in the choice of these functions follows from Lemma 6. This concludes the proof of Theorem 7.\(\square\)