Abstract
In this paper we propose a consensus group decision making scheme under model uncertainty consisting of an iterative two-stage procedure based on the concept of Fréchet barycenter. Each stage consists of two steps: the agents first update their position in the opinion metric space adopting a local barycenter characterized by the agents’ immediate interactions and then a moderator makes a proposal in terms of a global barycenter, checking for consensus at each stage. In cases of large heterogeneous groups, the procedure can be complemented by an auxiliary initial homogenization stage, consisting of a clustering procedure in opinion space, leading to large homogeneous groups for which the aforementioned procedure will be applied. The scheme is illustrated in examples motivated from environmental economics.
Similar content being viewed by others
Notes
A particular example of this result in the special case where the opinion space is the space of probability measures on \({{\mathbb {R}}}\) appeared in previous work of some of the contributors of the present work (Petracou et al. 2022); here this result is extended to the case of more general opinion spaces.
\(\Delta ^{N-1}\) denotes the N-dimensional unit simplex, i.e. \(\Delta ^{N-1}:= \left\{ x\in {{\mathbb {R}}}^{N}\,\,: \,\, \sum _{i=1}^N x_i = 1, \,\,\, x_i \ge 0, \,\, \forall i\right\}\)
which may affect the probability of acceptance of a proposal by an agent, depending on the acceptance or not of the proposal by other agents in the same clique.
Note here, that by agents we may either mean individual agents, or groups of agents, for example the clusters in opinion space M obtained by the clustering procedure proposed in Section ?? (in which case each agent is identified with a cluster, so that \(N=K\)).
This is a reasonable assumption since an agent’s opinion is likely to be more affected by her/his immediate dependencies and/or pressure/interest groups.
For instance, the large group could be the general population of a country, whereas the clusters may correspond to tendencies within the country. As another example we may consider a group of N consumers which is further clustered into K homogeneous subgroups in terms of preferences.
References
Agueh M, Carlier G (2011) Barycenters in the Wasserstein space. SIAM J Math Anal 43(2):904–924
Amirkhani A, Barshooi AH (2022) Consensus in multi-agent systems: a review. Artif Intell Rev 55(5):3897–3935
Arnaudon M, Barbaresco F, Yang L (2012) Medians and means in Riemannian geometry: existence, uniqueness and computation. In Matrix Information Geometry, Springer, pp 169–197
Arnaudon M, Miclo L (2014) Means in complete manifolds: uniqueness and approximation. ESAIM Probab Stat 18:185–206
Baltas I, Dopierala L, Kolodziejczyk K, Szczepański M, Weber G-W, Yannacopoulos AN (2022) Optimal management of defined contribution pension funds under the effect of inflation, mortality and uncertainty. Eur J Oper Res 298(3):1162–1174
Bansal R, Yaron A (2004) Risks for the long run: a potential resolution of asset pricing puzzles. J Financ 59(4):1481–1509
Basili M, Chateauneuf A (2020) Aggregation of experts’ opinions and conditional consensus opinion by the Steiner point. Int J Approx Reason 123:17–25
Bauschke HH, Combettes PL (2017) Correction to: convex analysis and monotone operator theory in Hilbert spaces. Springer, Berlin
Bhamra HS, Uppal R (2014) Asset prices with heterogeneity in preferences and beliefs. Rev Financ Stud 27(2):519–580
Bhatia R, Jain T, Lim Y (2019) On the Bures-Wasserstein distance between positive definite matrices. Expo Math 37(2):165–191
Bigot J (2013) Fréchet means of curves for signal averaging and application to ECG data analysis. Ann Appl Stat 2384–2401
Bishop AN, Doucet A (2021) Network consensus in the Wasserstein metric space of probability measures. SIAM J Control Optim 59(5):3261–3277
Brown DJ (1975) Aggregation of preferences. Q J Econ 89(3):456–469
Das SK, Pervin M, Roy S.K, Weber GW (2021) Multi-objective solid transportation-location problem with variable carbon emission in inventory management: a hybrid approach. Ann Oper Res 1–27
Dasgupta P (2008) Discounting climate change. J Risk Uncertain 37(2):141–169
DeGroot MH (1974) Reaching a consensus. J Am Stat Assoc 69(345):118–121
Ebert S, Wei W, Zhou XY (2020) Weighted discounting-on group diversity, time-inconsistency, and consequences for investment. J Econ Theory 189:105089
Fréchet M (1948) Les éléments aléatoires de nature quelconque dans un espace distancié. Ann Inst Henri Poincaré 10(4):215–310
Gajdos T, Tallon J-M, Vergnaud J-C (2008) Representation and aggregation of preferences under uncertainty. J Econ Theory 141(1):68–99
Gollier C (2002) Discounting an uncertain future. J Public Econ 85(2):149–166
Gollier C (2013) Pricing the planet’s future: the economics of discounting in an uncertain world. Princeton University Press
Gollier C, Koundouri P, Pantelidis T (2008) Declining discount rates: economic justifications and implications for long-run policy. Econ Policy 23(56):758–795
Gollier C, Zeckhauser R (2005) Aggregation of heterogeneous time preferences. J Polit Econ 113(4):878–896
Groom B, Hepburn C, Koundouri P, Pearce D (2005) Declining discount rates: the long and the short of it. Environ Resource Econ 32:445–493
Groom B, Koundouri P, Panopoulou E, Pantelidis T (2007) Discounting the distant future: How much does model selection affect the certainty equivalent rate? J Appl Economet 22(3):641–656
Gupta M (2017) Consensus building process in group decision making-an adaptive procedure based on group dynamics. IEEE Trans Fuzzy Syst 26(4):1923–1933
Heal G (2009) Climate economics: a meta-review and some suggestions for future research. Rev Environ Econ Policy 3(1):4–21
Heal GM, Millner A (2014) Agreeing to disagree on climate policy. Proc Natl Acad Sci 111(10):3695–3698
Hepburn C, Koundouri P, Panopoulou E, Pantelidis T (2009) Social discounting under uncertainty: a cross-country comparison. J Environ Econ Manag 57(2):140–150
Hepburn CJ, Koundouri P (2007) Recent advances in discounting: implications for forest economics. J For Econ 13(2–3):169–189
Herrera-Viedma E, Cabrerizo FJ, Kacprzyk J, Pedrycz W (2014) A review of soft consensus models in a fuzzy environment. Inf Fus 17:4–13
Jackson MO, Yariv L (2014) Present bias and collective dynamic choice in the lab. Am Econ Rev 104(12):4184–4204
Jørgensen S, Martín-Herrán G, Zaccour G (2010) Dynamic games in the economics and management of pollution. Environ Model Assess 15:433–467
Kossioris G, Plexousakis M, Xepapadeas A, de Zeeuw A, Mäler K-G (2008) Feedback Nash equilibria for non-linear differential games in pollution control. J Econ Dyn Control 32(4):1312–1331
Koundouri P (2009) Introduction to the special issue on:‘discounting the long-run future and sustainable development’. Economics (e-journal)
Kropat E, Weber GW (2018) Fuzzy target-environment networks and fuzzy-regression approaches. Numer Algebra Control Optim 8(2):135–155
Li Y, Kou G, Li G, Peng Y (2022) Consensus reaching process in large-scale group decision making based on bounded confidence and social network. Eur J Oper Res 303(2):790–802
Lichtendahl KC Jr, Grushka-Cockayne Y, Winkler RL (2013) Is it better to average probabilities or quantiles? Manag Sci 59(7):1594–1611
Maccheroni F, Marinacci M, Rustichini A (2006) Ambiguity aversion, robustness, and the variational representation of preferences. Econometrica 74(6):1447–1498
May KO (1954) Intransitivity, utility, and the aggregation of preference patterns. Econometr J Econometr Soc 1–13
Nordhaus WD (2007) A review of the Stern review on the economics of climate change. J Econ Literat 45(3):686–702
Özmen A, Kropat E, Weber G-W (2017) Robust optimization in spline regression models for multi-model regulatory networks under polyhedral uncertainty. Optimization 66(12):2135–2155
Özmen A, Weber GW, Batmaz İ, Kropat E (2011) Rcmars: robustification of CMARS with different scenarios under polyhedral uncertainty set. Commun Nonlinear Sci Numer Simul 16(12):4780–4787
Özmen A, Zinchenko Y, Weber G-W (2023) Robust multivariate adaptive regression splines under cross-polytope uncertainty: an application in a natural gas market. Ann Oper Res 324(1–2):1337–1367
Panaretos VM, Zemel Y (2019) Statistical aspects of Wasserstein distances. Ann Rev Stat Appl 6:405–431
Papayiannis GI, Domazakis GN, Drivaliaris D, Koukoulas S, Tsekrekos AE, Yannacopoulos AN (2021) On clustering uncertain and structured data with Wasserstein barycenters and a geodesic criterion for the number of clusters. J Stat Comput Simul 91(13):2569–2594
Papayiannis GI, Psarakis S, Yannacopoulos AN (2023) Modelling of functional profiles and explainable shape shifts detection: an approach combining the notion of the Fréchet mean with the shape-invariant model. Mathematics 11(21):4466
Papayiannis GI, Yannacopoulos AN (2018) Convex risk measures for the aggregation of multiple information sources and applications in insurance. Scand Actuar J 2018(9):792–822
Papayiannis GI, Yannacopoulos AN (2018) A learning algorithm for source aggregation. Math Methods Appl Sci 41(3):1033–1039
Paul A, Pervin M, Roy SK, Maculan N, Weber G-W (2022) A green inventory model with the effect of carbon taxation. Ann Oper Res 309(1):233–248
Pérez IJ, Cabrerizo FJ, Alonso S, Dong Y, Chiclana F, Herrera-Viedma E (2018) On dynamic consensus processes in group decision making problems. Inf Sci 459:20–35
Petracou EV, Xepapadeas A, Yannacopoulos AN (2022) Decision making under model uncertainty: Fréchet-Wasserstein mean preferences. Manag Sci 68(2):1195–1211
Savku E, Weber G-W (2022) Stochastic differential games for optimal investment problems in a Markov regime-switching jump-diffusion market. Ann Oper Res 312(2):1171–1196
Skiadas C (1997) Conditioning and aggregation of preferences. Econometr J Econometr Soc 347–367
Srivastava A, Klassen EP (2016) Functional and shape data analysis, vol 1. Springer, Berlin
Stern N, Stern NH (2007) The economics of climate change: the Stern review. Cambridge University Press, Cambridge
Steyer L, Stöcker A, Greven S (2023) Elastic analysis of irregularly or sparsely sampled curves. Biometrics 79(3):2103–2115
Urena R, Kou G, Dong Y, Chiclana F, Herrera-Viedma E (2019) A review on trust propagation and opinion dynamics in social networks and group decision making frameworks. Inf Sci 478:461–475
Vincke P (1982) Aggregation of preferences: a review. Eur J Oper Res 9(1):17–22
Weber G-W, Defterli O, Gök SZA, Kropat E (2011) Modeling, inference and optimization of regulatory networks based on time series data. Eur J Oper Res 211(1):1–14
Weitzman ML (2007) A review of the Stern review on the economics of climate change. J Econ Lit 45(3):703–724
Zhang R, Huang J, Xu Y, Herrera-Viedma E (2023) Consensus models with aggregation operators for minimum quadratic cost in group decision making. Appl Intell 53(2):1370–1390
Acknowledgements
Professor Phoebe Koundouri wishes to acknowledge “Funding from the European Research Council (ERC) under the ERC Synergy Grant Water-Futures (Grant agreement No. 951424)”
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This paper is dedicated to Professor A. Xepapadeas on the occasion of his retirement from Athens University of Economics and Business, with friendship and admiration.
Appendices
Appendix A: Proof of Proposition 3.3
The proof uses a duality argument. To simplify the exposition we assume that M is compact (or that we focus our attention on a compact subset of M). We define the function \(L: M \times {{\mathbb {R}}}_{+} \times {{\mathbb {R}}}_{+}^{N+2} \rightarrow {{\mathbb {R}}}\) by
and note that
The variables \(\mu _{00},\mu _0, \mu _1, \ldots , \mu _N\) play the role of Lagrange multipliers. Hence problem (5) can be written as
where we used the compact notation \(\mu =(\mu _{00}, \mu _0, \mu _1, \ldots , \mu _N)'\). Using the minimax theorem we can exchange the order of the maximum and the minimum to obtain
The function
is called the dual function and as seen by (17) can help characterize the solution of the primal problem (5). We now proceed to the calculation of the dual function \(D(\mu )\). By rearranging L as
we can see that the only interesting (non-generate) case corresponds to \(\epsilon _1=\epsilon _2=...=\epsilon _N\) with \(s>0\) (i.e. \(\mu _{00}=0\)) and \(s=\epsilon _{\min }^2\) (i.e. \(\mu _0, \mu _1,\mu _2,...,\mu _N>0\)). This leads to the dual function
In this case, the set of Lagrange multipliers \(\mu _1,\mu _2,...,\mu _N\) can be realized as weights since they belong to \(\Delta ^{N-1}\). From now on let us denote \(w=(\mu _1,\mu _2,...,\mu _N)'\) and express
The minimum over \(x \in M\) is realized at the Fréchet barycenter of \({{\mathbb {M}}}\), with weight vector w (as above). This implies that we should look for the solutions of problem (5) among the Fréchet barycenters of \({{\mathbb {M}}}\). It remains to find the appropriate weights for the barycenter. According to the dual formulation (17) the weights which are related to \(\mu >0\) are obtained as the solution of the dual problem
where \(V_{{{\mathbb {M}}}}(w)\) is the Fréchet variance of \({{\mathbb {M}}}\) when choosing w as the weight vector. Hence, we conclude that the solution of (5) is a Fréchet barycenter with weights chosen so as to maximize the corresponding weighted Fréchet variance.
Appendix B: Proof of Proposition 3.6
The proof proceeds in several steps:
Step 1 We note that problem (7) is equivalent to the minimization problem
where \(\psi _i=-\ln \phi _i\). This is easy to see, as maximizing P is equivalent to minimizing \(-\ln P\). Note that the functions \(\psi _i\) are increasing.
Step 2 Let \(\partial \phi\) be the subdifferential of a (convex) function \(\psi\). The first order condition for \({\bar{x}}\) to be a local minimizer of \(\psi\) is that \(0 \in \partial \psi ({\bar{x}})\). Applying Assumption 3.5, we assume that for each \(i=1, \ldots , N\) we can express \(x_i=x_i(\theta _i)\) for some \(\theta _i \in H\), and similarly, any point \(x \in X\) can be expressed as \(x=x(\theta )\) for some \(\theta \in H\). To simplify notation we will not revert to the \(\theta\) parametric notation but keep our initial notation in terms of x. We apply the first order condition to \(\psi (x):=\sum _{i=1}^{N}\psi _i\left( d^2(x,x_i)\right)\), which in parametric form becomes
with the latter used as a simplification of the notation. The problem of minimizing over \(x \in X\) is then transferred to minimizing over the parameter \(\theta \in H\). This yields
where for the second equality we have used the subdifferential calculus (which holds since \(\psi _i\) and \(d^2(x,x_i)\) are continuous). We now apply the subdifferential rule for composite functions (see e.g. Corollary 6.72 in Bauschke and Combettes (2017)). This yields that for each \(i=1,\ldots , N\) it holds that \(z \in \partial \psi _i\left( d^2(\theta ,x_i)\right) (y)\) if there exists \(\alpha _i \in \partial \psi _i \left( d^2(y,x_i)\right)\) and \(w_i \in \partial d^2(\cdot , x_i) (y)\) such that \(z=\alpha _i w_i\). To simplify the exposition let us assume that \(\psi _i\) is \(C^1\), so that the subdifferential of \(\psi _i\) is a singleton consisting of a single value, that of the derivative \(\psi _i'\left( d^2(y,x_i)\right) >0\), with its positivity guaranteed by the fact that \(\psi\) is increasing. Condition (24) together with the above subdifferential rule implies that
where \({\bar{w}}_i=\psi _i'\left( d^2(\cdot , x_i)\right) (\theta ^*) >0\). Defining \(w_{i}=\frac{{\bar{w}}_i}{\sum _{i=k}^{N} {\bar{w}}_{k}} > 0\), and dividing (25) by \(\sum _{k=1}^{N} {\bar{w}}_{k}\) we obtain
where \(w=(w_1, \ldots , w_N)\in \Delta ^{N-1}\). From (26) we conclude that \(\theta ^*\) is the solution of the minimization problem
i.e. \(x^*=x(\theta ^*)\) corresponds to a Fréchet barycenter for the choice of weights \(w=(w_1, \ldots , w_N) \in \Delta ^{N-1}\).
Step 3 It remains to show that such a choice of weights is feasible. The weights \(w=(w_1, \ldots , w_N)\) must be such that if \(x^*=x(\theta ^*)\) is the solution of (26) then it must hold that
Since \(x^*=x(\theta ^*)\) is a function of the weights vector \(w \in \Delta ^{N-1}\), we can interpret (28) as a system of equations for w, the solution of which in \(\Delta ^{N-1}\) will characterize the appropriate weights for the barycenter at which consensus can be reached. Defining the function \(w \rightarrow g(w):=x(\theta ^*)\) where \(\theta ^*\) is the solution of problem (26), we can rewrite (28) as
If the functions \(w \mapsto \psi _i'\left( d^2(g(w),x_i)\right)\) are continuous then a solution to (29) in \(\Delta ^{N-1}\) is guaranteed by Brouwer’s fixed point theorem. It thus remains to show the continuity of the functions \(w \mapsto \psi _i'\left( d^2(g(w),x_i)\right)\). Since \(\psi _i'\) are continuous it suffices to show the continuity of the function \(w \rightarrow g(w):=x(\theta ^*)\) where \(\theta ^*\) is the solution of problem (26).
Step 4 We will use the following definitions: Let \(\left( w^{(n)}\right)\) be a sequence in \(\Delta ^{N-1}\) and define the sequence of functionals \(\psi ^{(n)}: M \rightarrow {{\mathbb {R}}}\), by
If needed (which is not required here) we may express this in terms of the parametric representation \(x=x(\theta )\), as a functional on the space of parameters H.
Step 5 We claim the following: Consider a sequence \(\left( w^{(n)}\right) \subset \Delta ^{N-1}\), such that \(w^{(n)} \rightarrow w\) in \(\Delta ^{N-1}\), and a sequence \(\left( x^{(n)}\right) \subset M\) such that \(x^{(n)} \rightarrow x\) in M. Then, \(\psi ^{(n)}\left( x^{(n)}\right) \rightarrow \psi (x)\) in \({{\mathbb {R}}}\), where \(\psi (x)=\sum _{i=1}^{N} w_{i} d^2(x,x_i)\). To see that, we calculate
with the first term tending to 0 since \(w^{(n)}\rightarrow w\) and the second term tending to 0 since \(x^{(n)} \rightarrow x\).
Step 6 We also claim the following: Consider the sequence \(\left( x_{*}^{(n)}\right) \subset M\) of minimizers of the sequence of functionals \(\left( \psi ^{(n)}\right)\) (i.e. for every \(n \in {{\mathbb {N}}}\), \(x^{(n)}=\arg \min _{x} \psi ^{(n)}(x)\)). Then, if \(x^{(n)} \rightarrow x\) in M, it follows that \(x =\arg \min _{x \in M} \psi (x)\). To show this, consider any \(z \in M\) and a sequence \(\left( z^{(n)}\right) \subset M\) such that \(z^{(n)}\rightarrow z\) in M. Then,
hence, by the fact that z is arbitrary, \(x = \arg \min _{x \in M} \psi (x)\).
Step 7. We are now in position to show the continuity of the barycenter with respect to the weights, i.e. the continuity of the map \(w \rightarrow g(w):=x\left( \theta ^*\right)\) where \(\theta ^*\) is the solution of problem (26). Let \((w^{(n)} \subset \Delta ^{N-1}\) be a sequence of weights and for every \(n \in {{\mathbb {N}}}\), let \(x_{*}^{(n)}\) be the barycenter of \({{\mathbb {M}}}\) for the weight vector \(w^{(n)}\), i.e. the minimizer for the functional \(\psi ^{(n)}\). Assume that the sequence \((x^{(n)})\) (or a subsequence) has a limit \(x \in M\). This can be achieved by a compactness argument (or weak compactness). Then, by step 6, x is the minimizer of \(\psi\), hence the barycenter for the weight vector w. Hence, the map g is continuous. This concludes the proof.
Appendix C: Proof of the Claim in Example 3.7
We recall (see e.g. Bhatia et al. (2019)) that between two measures \(P_i \sim N(\mu _i, S_{i})\), \(i=1,2\), the Wasserstein distance \(W_{2}^{2}(P_1,P_2)\), admits the closed form
Moreover, given a set of probability measures \({{\mathbb {M}}}\) consisting of Gaussian measures \(P_i\), \(i=1, \ldots , M\), and a weight vector \((w_1, \ldots , w_{K})\), the corresponding Wasserstein barycenter \(P_{B}\) is a Gaussian measure \(P_{B} \sim N(\mu , S)\) with \(\mu = \sum _{i=1}^{K} w_{i} \mu _{i}\), and S being a matrix that satisfies the equation
where the notation \(A \# B\) is used to denote the geometric mean between two positive definite symmetric matrices given by
Without loss of generality we will assume that \(\mu _k =0\), \(k=1,\ldots , K\) (else simply center the measures). We will also consider problem (7) on \({{{\mathcal {N}}}}({{\mathbb {R}}}^{d}) \subset {{{\mathcal {P}}}}({{\mathbb {R}}}^{d})\), the subset of Gaussian measures on \({{\mathbb {R}}}^d\). With the above information problem (7) can be expressed as
where \(\psi _k =\ln \phi _k\) and \(g_{k}(S):= \left( S_{k}^{1/2}S S_{k}^{1/2}\right) ^{1/2}\). Problem (32) is an optimization problem on the set of positive definite symmetric matrices. It can be treated by considering the Fréchet derivative of the functional in (32) with respect to S. Using the rules of Fréchet differentiation and assuming sufficient smoothness for the functions \(\psi _{k}\) we have that for any deviation \(S+ \epsilon Z\) from the matrix S the action of the Fréchet derivative \(D\Psi (S)\) on any matrix Z yields
where we use the simplified notation
Moreover, define the quantities
where the positivity of \(\Lambda _k\) is guaranteed by the properties of the functions \(\psi _{k}\). Following Bhatia et al. (2019), we can compute
so that (33) yields (using the linearity of trace) that
The first order condition for the solution of (32) is \([D\Psi (S)]Z=0\), for all possible perturbations Z of the covariance matrix S. Upon defining
the first order condition becomes
which implies that the solution of (32) corresponds to a Gaussian measure with covariance matrix S such that
i.e. \(P^{*}\) is the barycenter of \({{\mathbb {M}}}\) with a selection of weights \(w_{k}\), endogenously obtained by the preferences on the agents towards their anchor point (in other words their bargaining power).
Note that Eq. (34), although formally the same as Eq. (31) has a fundamental difference from (31). In (34) the coefficients \(w_{k}=w_{k}(S)\), i.e. are depending on S, whereas in (31) the coefficients \(w_{k}\) are constants. It remains to show that Eq. (34) admits a solution. To show that we define the operator T, by \(S \mapsto T(S):=\sum _{k=1}^{K} w_{k} \left( S^{1/2} S_{k} S^{1/2}\right) ^{1/2}\). It can be shown that this operator maps the closed convex set \({{{\mathcal {K}}}}=\left\{ S \in {{\mathbb {R}}}_{+}^{d \times d} \,\, \mid \,\, c_1 I \le S \le c_2 I\right\}\), where \(c_1,c_2 \ge 0\) and by \(\le\) we denote the natural ordering \(S_1 \le S_2 \,\, \Longleftrightarrow S_1 -S_2 \ge 0\) (meaning \(S_1-S_2\) positive definite) onto itself. The set \({{{\mathcal {K}}}}\) is convex, and the map T is continuous, so by the Brouwer fixed point theorem T has a fixed point, therefore (34) admits a solution.
Appendix D: Extensions to the Evolutionary Algorithm
The evolutionary scheme presented in this section can be further extended.
1.1 Appendix D.1 A Two Stage Scheme Involving a Clustering Step for Opinion Homogenization and Group Formation
As indicated by the numerical experiments in Sect. 4.3, large degree of inhomogeneity of the positions of the agents in opinion space, especially in cases of large groups, may lead to delay in the convergence to consensus. In certain cases (i.e. in cases of emergency etc), such delays may be unwanted. A way to avoid situations like this is to find ways of grouping the N agents in K as far as possible homogeneous subgroups,Footnote 6 each one characterized by a representative opinion \({\bar{x}}_{k}\), \(k=1,\ldots , K\), and then performing the consensus procedure described in Sect. 4.2 among them.
To this end we propose the following version of the celebrated K-means clustering algorithm. We consider the opinions of the large groups as elements \(\{x_1, \ldots , x_{N}\}\) of the opinion metric space (M, d). The idea is that like opinions will form clusters in this metric space. Upon being able to identify these clusters, we may form a coarse graining of the group into sub-groups of opinions, which can be treated as homogeneous groups given the level of coarse graining. Mathematically, this corresponds to breaking the large group G into k subgroups \(G_{i}\), \(i=1,\ldots , k\), such that \(G = \bigcup _{i=1}^{k} G_{i}\), and \(G_{i} \cap G_{j} = \emptyset\) for \(i \ne j\), with the opinions \(x_{\ell } \in G_{i}\) being as homogeneous as possible. As discussed above, homogeneity of a subgroup will be understood in terms of the Fréchet function of the subgroup, whereas a relevant measure for the center of the group will be the Fréchet barycenter of the subgroup. This scheme can be applied for any relevant metrization of the opinion space M (see e.g. examples in previous section), for the case of the Wasserstein space see Papayiannis et al. (2021). The proposed clustering algorithm to be implemented in the opinion space is summarized in Algorithm 2.
At the convergence of the algorithm, K clusters of opinions are determined, centered at the points \({\bar{x}}_{k}\), \(k=1, \ldots , K\) in opinion space (M, d). Each of these clusters can be understood as a more or less “homogeneous” group of agents in terms of opinions. Denoting the groups by \(G_{k}\), \(k=1, \ldots , K\), we expect our clustering algorithm to perform well in segregating the general group of agents G into subgroups if the Fréchet variance of each subgroup \(V_{k}:= \min _{z \in M} F_{G_{k}}(z)\) is comparatively low. Recall that the Fréchet variance of a subset \(G_{k} \subset M\) can be also understood as an indicator of its homogeneity.
Note that the above algorithm can be expressed in terms of an optimization problem of the form
where
In other words, the elements \(a_{ij}\) provide information as to the membership of the point i to the cluster j, taking the value 1 if i belongs to cluster j and 0 otherwise. The K-means algorithm solves this problem by the following two-step procedure iterating Steps A and B till convergence:
-
A.
Given the centers \({\bar{x}}_{j}\), calculate \(a_{ij}\) solving the minimization problem (36). This generates a membership matrix \(A=(a_{ij}) \in {{\mathbb {R}}}^{N \times K}\) containing binary entries, with each column k of A denoting the composition of the group \(G_{k}\).
-
B.
Given the solution for \(a_{ij}\) from Step A, the new centers are determined by solving (35). Note that this step breaks down into K decoupled problems, each one involving the minimization of the Fréchet function for each \(G_{k}\), or equivalently finding the Fréchet mean of the group, which is recognized as the center of the corresponding cluster. The objective’s value at the minimum will then be the sum of the Fréchet variances of the clusters \(\sum _{k=1}^{K} V_{G_{k}}\).
We close this section by summing up the two-stage group decision process to the following steps:
-
1.
Collect and map all opinions of the group as points \(z_i\), \(i \in G\) into the appropriate opinion space \({\mathcal {M}}\).
-
2.
Perform a clustering procedure in opinion space as proposed in Algorithm 2 to form K groups in opinion space.
-
3.
Identify the group agreement point as a barycenter of the set of opinions \({{\mathbb {M}}}=\{z_{B,1}, \ldots , z_{B,K}\}\), i.e. as a barycenter of barycenters.
1.2 Appendix D.2 Updating the Discounting Parameter
Possible extensions of the scheme proposed in Sect. 4.2 where the sensitivity parameter \(r_i\) could be time-varying can be conceived. For example a possible evolution scheme for the parameter could be described as
considering \(L_i\) as the loss function of the agent i depending on the states of the current opinion set \({\mathbb {M}}(t)\), i.e. indicating the loss (under the assumption that \(L_i\ge 0\)) taking into account the time t and the level of homogeneity in the opinion set \({\mathbb {M}}(t)\). For instance, a possible choice could be the subjective rule
where \(\rho _i > 0\) expresses the agent’s preferences concerning a fast resolution of the problem while \(\epsilon _i>0\) denotes the agents desire to deviate from her/his anchor preferences \(\mu _i(0)\). In this setting, the preferences concerning the time upon which a consensus should be reached governs the determination of the general time-preferences parameter as t grows.
Appendix E: On the Choice of Metrics and the Calculation of the Fréchet Mean
1.1 Appendix E.1 Choice of Metric and the Fréchet Mean
The choice of metric depends on the opinion space. For instance, in the Example 3.2 concerning the social discount rate and the valuation of future uncertain costs, a possible choice of metric space could be determined in thew following way. There are two important components of the opinion space (a) the yield curve characterizing the discount rate and (b) the probability distribution of the future risk to be evaluated. Then, the natural choice of the opinion space is a Cartesian product of two metric spaces \(M=M_1 \times M_2\), where \(M_1\) corresponds to the metric space of possible yield curves and \(M_2\) corresponds to the metric space of possible probability models for the future risk. Let us consider each one separately.
1.1.1 The Space of Discount Rate Curves
As we denoted, \(M_1\) is a space of curves \(f: [0,T] \rightarrow {{\mathbb {R}}}_{+}\), where T is the time horizon in question. Not any curve is a representation of a suitable term structure model, so we restrict ourselves to parametric families of curves, that are consistent with economic reasoning. One possible choice for \(M_1\) could be the parametric family of curves \({{{\mathcal {R}}}}\). A suitable example can be the set defined in (4). To simplify the notation, we will denote all relevant parameters as \(\theta \in \Theta\), where \(\Theta\) is a suitable parameter space (e.g. \((y_1,\phi )\) in the case of \({{{\mathcal {R}}}}\) in (4)). We will denote any element of \(M_1\) as \(\Phi (\cdot ; \theta )\), meaning a function \(t \mapsto f(t)=\Phi (t; \theta )\), parameterized by some \(\theta \in \Theta\) and a suitable function \(\Phi : [0,T]\times \Theta \rightarrow {{\mathbb {R}}}\). By definition we can identify any element \(f \in M_1={{{\mathcal {R}}}} \subset L^{2}(0,T)\) by an element \(\theta \in \Theta \subset {{\mathbb {R}}}^{d}\). We will denote by \(\Phi ^{-1}\) the mapping \({{{\mathcal {R}}}} \rightarrow \Theta\), that assigns to any \(f \in {{{\mathcal {R}}}}\) a relevant \(\theta \in \Theta\), such that \(f(\cdot )=\Phi (\cdot ; \theta )\). We do not necessarily require this mapping to be single valued (but we require \(\Phi\) to satisfy this property).
As stated above, \(M_1={{{\mathcal {R}}}}\) is not a linear space, since linear combinations of functions \(f \in {{{\mathcal {R}}}}\) are not necessarily elements of \({{{\mathcal {R}}}}\). There are various choices for the metric on \(M_1\). One possible choice of metric on \(M_1={{{\mathcal {R}}}}\) could be in terms of a suitable metric for \(\Theta \subset {{\mathbb {R}}}^{d}\), i.e.
where \(\theta _i\) are such that \(f_{i}(\cdot )=\Phi (\cdot ; \theta _i)\), \(i=1,2\), and \(d_{\Theta }\) is a suitable metric for \(\Theta \subset {{\mathbb {R}}}^{d}\) (a possible choice being an \(\ell _{p}\) metric, e.g. the Euclidean metric). The nonlinear nature of the transformation \(\Phi\), turns \(d_{M_1}\) into a metric on \(M_1\) which is not directly related to a metric derivable from a norm on \(L^{2}(0,T)\), eventhough it may be related to a norm in the parameter space \(\Theta\).
Another possible choice could be a metric compatible with the vector space in which the nonlinear set \({{{\mathcal {R}}}}\) is naturally embedded, a choice for which could be the Hilbert space \(L^{2}(0,T)\). Hence a suitable metric for \(M_1\) could be as follows: For any two \(f,f' \in M_1= {{{\mathcal {R}}}}\) there exists a pair \(\theta , \theta ' \in \Theta\) such that f can be identified by \(t \mapsto \Phi (t; \theta )\) and \(f'\) can be identified as \(t \mapsto \Phi (t; \theta ')\). Then we may define the metric
Note that while this metric formally coincides with the \(L^2\) norm, the space \((M_1, d)\) is not a vector space on account of the nonlinearity of \(M_1\), eventhough it is endowed with a metric compatible with the norm of the vector space in which \(M_1\) is embedded in. This metric is typically used for spaces of curves of a given parametric representation, i.e. for spaces of curves of the general form \({{{\mathcal {R}}}}:=\{ t \mapsto \Phi (t; \theta ) \,\,: \,\, t \in [0,T], \,\, \theta \in \Theta \}\), as for example in the shape invariant model which is commonly used in functional data analysis (see e.g. Bigot (2013), or Papayiannis et al. (2023) and references therein). Other choices are of course possible, for alternative choices we refer the reader to Srivastava and Klassen (2016), where a detailed discussion of related issues is provided, see also Steyer et al. (2023) for recent applications.
For the numerical illustrations presented in this paper, we opted for the choice (39), rather than (40), mainly for two reasons: (a) Since often decision making concerning scientific issues is model based, consensus upon a model is naturally reduced to consensus on the parameters of the model and (b) the consensus path in the finite parameter space is easier to illustrate and visualize. On the other hand, experiments were also performed using the alternative metric (40), and the results were qualitatively and quantitatively similar.
For any of the above choices, the Fréchet mean in \(M_1\) is now defined as follows:
where \({\hat{d}}\) is defined as in (40). It is important to note that as defined, \(f_{B} \in {{{\mathcal {R}}}}\), and is not merely an element of the embedding space of \({{{\mathcal {R}}}}\), which is \(L^2(0,T)\). Moreover, we must stress that the averaging obtained in (41) is a nonlinear averaging of the yield curves \(f_{i} \in {{{\mathcal {R}}}}\), and not the standard linear averaging \(f_{A}=\sum _{i=1}^{N} w_{i} f_{i}\) in the embedding space \(L^{2}(0,T)\), with \(f_{A}\) typically having the property \(f_{A} \not \in {{{\mathcal {R}}}}\), hence being not acceptable as a suitable yield curve.
1.1.2 The Space of Probability Models for the Future Risk
We now consider the space of possible distributions of future risks. This a space of probability distributions on \({{\mathbb {R}}}^d\), so a suitable choice of opinion space would be \(M_2= {{{\mathcal {P}}}}({{\mathbb {R}}}^d)\), the space of probability measures on \({{\mathbb {R}}}^d\). This is again a space that does not admit a vector space structure. A suitable metrization is in terms of the Wasserstein metric presented in Example 3.1. In the case where the risks can be represented by a single random variable (i.e. in terms of their pecuniar value only) we can consider distributions on \({{\mathbb {R}}}\). These can be represented as distribution functions F, hence \(M_2\) can be identified with the space of distribution functions. This is clearly not a vector space, but may be embedded on a suitable vector space of functions (e.g. measurable functions). As it turns out, it is not the actual distribution function which is important in the metrization of \({{{\mathcal {P}}}}({{\mathbb {R}}})\), but rather its generalized inverse, the quantile function \(Q:=F^{-1}\). For \(M_2={{{\mathcal {P}}}}({{\mathbb {R}}})\), the Wasserstein metric can be conveniently expressed in terms of quantiles of the distributions; for any two \(P, P' \in {{{\mathcal {P}}}}({{\mathbb {R}}})\) with quantiles \(Q=F^{-1}, Q'=F^{' -1}\), respectively, the chosen Wasserstein metric would be
The corresponding Fréchet mean is well defined and unique (if at least one of the distributions is absolutely continuous with respect to the Lebesgue measure see Agueh and Carlier (2011)) so that
which leads to a representation for \(P_{B}\) represented by a distribution function \(F_{B}\) defined by the quantile averaging scheme
The representation (42) is clearly a nonlinear representation for the mean, unlike the linear mean
which is commonly used in model averaging or consensus formation.
In case where the future risk is not represented in a satisfactory way by a single number, we can consider \(M_2\) as the space of probability measures on \({{\mathbb {R}}}^d\), denoted by \({{{\mathcal {P}}}}({{\mathbb {R}}}^d)\). This is not a vector space also, and can be metrized again using the Wasserstein metric (defined in Example 3.1). In this setup there are in general no closed form solutions for the Wasserstein metric. However, the Wasserstein barycenter, is well defined and unique if at least one of the measures \(P_i\) is absolutely continuous with respect to the Lebesgue measure on \({{\mathbb {R}}}^{d}\), so that
In the special case of Location-Scatter families, where each probability measure is parameterized as \(P_{i}=LS(\mu _i, S_{i})\), where \(\mu _i \in {{\mathbb {R}}}^{d}\), \(S_{i} \in M^{d\times d}_{+}\) the Wasserstein distance between any two measures \(P_1,P_2\) is given by (30), and the corresponding Wasserstein barycenter is a probability measure from the same family
with covariance matrix satisfying the equation
Clearly this is far from the standard linear averaging on the vector space of in which distribution functions are naturally embedded.
The full opinion space in the social discount factor example is \(M_{1}\times M_{2}\), endowed with the metric
As mentioned above, the Fréchet mean is not always uniquely defined. In the examples used here, i.e. the Wasserstein case where at least one of the probability measures are absolutely continuous with respect to the Lebesgue measure, or for certain examples of manifolds, the uniqueness of the Fréchet mean has been established (see e.g. Arnaudon et al. (2012), Arnaudon and Miclo (2014), Agueh and Carlier (2011)). However, the proposed methodology is not directly affected if the chosen opinion space is such that the uniqueness of the Fréchet mean is not guaranteed. In such cases, a local minimizer for the Fréchet functional could serve very well as a local consensus point, and the proposed scheme could easily be applied for the determination of local consensus points. The situation can be considered analogous to the situation concerning Nash equilibria, which may not be unique, but nonetheless even local Nash equilibria can be a useful concept in decision making.
1.2 Appendix E.2 Algorithms for Determining the Fréchet Barycenter
The algorithmic determination of the Fréchet barycenter depends crucially on the choice of the relevant metric space setup. We comment here for the choice of metric space, for the example of the consensus concerning the social discount rate, using the same notation as in the previous subsection.
For the \(M_1\) part we use the finite dimensional representation of the relevant set of curves \({{{\mathcal {R}}}}\), in order to calculate the Fréchet barycenter. In particular, given a set of yield curves \(\{f_1, \ldots , f_N\} \subset {{{\mathcal {R}}}}\), we use their parameterization in terms of the parameters \(\{\theta _1, \ldots , \theta _N\} \subset \Theta\), and express the (infinite dimensional) optimization problem
in terms of the finite dimensional optimization problem
which then characterizes \(f_{B}\) in terms of \(f_{B}= \Phi (\cdot , \theta _{B})\). Problem (46) is treated using standard finite dimensional techniques, e.g. first order optimization methods, or standard higher order methods, e.g. quasi-Newton methods. For the numerical experiments presented here we have used a mixture of gradient based methods and quasi-Newton methods for the study of problem (46) leading to the determination of the Fréchet mean.
For the \(M_2\) part, in the case where the loss distribution is one dimensional, the determination of the barycenter comes directly from the representation (42). The only demanding task from the point of view of computation in (42) is the determination of the relevant quantiles. However, for the application in mind, a suitable class of distributions is the class of generalized extreme value distributions as in (14), for which the quantiles are available in closed form analytically, and for which the quantile averaging (42) reduces to simple parameter averaging. In the case where the loss distribution is multi dimensional, we may restrict our attention to the class of Location Scatter family, and imply the representation of the Wasserstein barycenter in terms of the representation (43), with \(S_{B}\) given by (44). Then, the only computationally demanding part is the determination of \(S_{B}\), in terms of the solution of the matrix Eq. (44). This can be easily performed numerically in terms of the fixed point scheme
which quickly converges to the required value of the covariance matrix \(S_{B}\), corresponding to the Wasserstein barycenter.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Koundouri, P., Papayiannis, G.I., Petracou, E.V. et al. Consensus Group Decision Making Under Model Uncertainty with a View Towards Environmental Policy Making. Environ Resource Econ (2024). https://doi.org/10.1007/s10640-024-00846-1
Accepted:
Published:
DOI: https://doi.org/10.1007/s10640-024-00846-1