1 Introduction

Voting is a ubiquitous method for making decisions with a large number of applications, such as electing political representatives, deciding how to split a public budget between projects, or choosing which services (restaurants, hotels, etc) to recommend to new users based on past user experiences. As such, it has been at the epicenter of research within multiple disciplines including political sciences, economics and computer science [1]. The most prominent question in this research agenda is to identify the best voting rule to use to collectively aggregate the preferences of agents over alternative options into a single winning alternative, with most of the earlier literature focusing on axiomatic properties that good voting rules should have. An alternative way to tackle this question that has been proposed in computer science is through the distortion framework [2] which allows to compare different voting rules based on how well they approximate the optimal choice as measured in terms of a social objective function like the utilitarian social welfare.

Since its inception in 2006 by [3], the distortion framework has been applied to several utilitarian social choice settings (e.g., [4,5,6]). The lion’s share of previous work has focused on centralized models with a single pool of agents whose preferences are directly given as input to a voting rule, which thus can utilize all the given information at once to make a decision. However, there are many applications with multiple pools of agents which make independent local decisions that can be thought of as recommendations for the final decision. To give a concrete example, in many political election systems, the citizens are partitioned into districts based on geographic or other criteria, and vote within their districts to propose the candidate (party) they would like to be selected as the winner.

Inspired by situations like the one described above, [7] initiated the study of the distortion of mechanisms in a distributed single-winner setting where a set of n agents with normalized cardinal values (summing up to 1) over a set of m alternatives are partitioned into k disjoint districts. The authors focused on deterministic mechanisms of the form Plurality-of-f, which first choose a representative alternative for each district according to some rule f, by holding a local election with the agents of the district as the voters, and then picking the winner to be the alternative that is representative of the most districts (i.e., using the Plurality rule). [7] considered mechanisms for which the rule f can be cardinal or ordinal, i.e., it may use the actual numerical information about the preferences of the agents within the districts or just consistent rankings. The authors showed that, when the districts are symmetric (that is, each of them contains the same number of agents), the distortion of a cardinal mechanism, namely Plurality-of-Range-Voting is O(km), and provided an asymptotically matching lower bound of \(\Omega (km)\) on the distortion of any Plurality-of-f mechanism. For ordinal mechanisms, they showed that Plurality-of-Plurality achieves a distortion of \(O(km^2)\), and that this is asymptotically best among all ordinal Plurality-of-f mechanisms.

1.1 Revisiting the Distortion of Distributed Voting

A first observation about the results of [7] is that there is a-priori no reason to restrict our attention to only mechanisms in the class Plurality-of-f, as using other over-districts rules could potentially lead to better distortion. Indeed, follow-up work considered distributed social choice settings with metric preferences [8,9,10,11] without such restrictions on the over-districts rule. In addition, all of the previous work on these settings only considered deterministic mechanisms that use deterministic in-district and over-districts rules. Randomization has proven out to be a very useful tool in achieving better (expected) distortion bounds in the centralized setting (see [4, 12]), so it is only natural to consider randomized mechanisms in the distributed setting as well. Finally, an important question is how the distortion bounds are affected in case the participants act selfishly, and whether there are strategyproof mechanisms with good distortion bounds. This question has been considered in the centralized setting [12,13,14,15] and also in the distributed metric setting [9]; we consider it in the context of the normalized setting of [7] as well.

1.2 Our Contributions

We consider the class of all mechanisms for distributed voting in the setting of [7]. In particular, we consider the \(f_{\text {over}}\)-of-\(f_{\text {in}}\) class of mechanisms, where \(f_{\text {in}}\) is an in-district rule that takes as input the preferences of the agents within each district and outputs a representative alternative for the district, while \(f_{\text {over}}\) is a rule that takes as input the representative alternatives of all districts and chooses one of them as the overall winner. We consider several different cases depending on the nature of \(f_{\text {over}}\) and \(f_{\text {in}}\) (deterministic or randomized), and the type of information they can utilize (cardinal or ordinal). We show the following results; see Table 1 for an overview.

Table 1 An overview of our results. Specific details can be found in the appropriate sections

Deterministic Mechanisms

When \(f_{\text {over}}\) and \(f_{\text {in}}\) are both deterministic and the districts are symmetric, we show that the best possible distortion is \(\Theta (km)\) when the valuation functions of the agents are accessible (cardinal mechanisms), and is \(\Theta (km^2)\) when only ordinal information about the preferences of the agents is available (ordinal mechanisms). The upper bounds were shown by [7] and here we provide asymptotically tight lower bounds. These results show that for general, unstructured (normalized) valuations, employing different over-district rules in fact does not result in improvements on the distortion. We present these results in Section 3.

Randomized Mechanisms

In Section 4, we consider for the first time the distortion of randomized mechanisms in distributed voting. We first prove a simple composition theorem, which shows that using an in-district rule with known distortion \(\delta \) in the centralized setting and then selecting the winner uniformly at random from the set of representatives, defines a distributed mechanism with distortion \(O(k \delta )\). Using this, complemented with new lower bounds, we show that the best possible distortion for cardinal unanimous mechanisms is \(\Theta (k)\); in fact, this is true even when the districts are asymmetric and when \(f_{\text {over}}\) is randomized but \(f_{\text {in}}\) is deterministic.

For ordinal mechanisms, we consider two cases: (a) mechanisms that use deterministic in-district rules \(f_{\text {in}}\), and (b) fully-randomized mechanisms, where both \(f_{\text {over}}\) and \(f_{\text {in}}\) are randomized rules. For (a), we show that the best possible distortion is \(\Theta (km^2)\). The upper bound follows from the bound on Plurality-of-Plurality proven in [7]; here, we provide an asymptotically matching lower bound assuming a natural universal tie-breaking rule. For (b), we prove a simple but very interesting result: For a well-studied class of randomized centralized voting rules called point-voting schemes (e.g., see [16, 17]), there exists a distributed implementation so that there is no effect on the induced probability distribution, even for asymmetric districts. Simply put, using such rules it is possible to escape the ill effects of districts in terms of the distortion, even when the districts are asymmetric. From this result, it follows that there exists a distributed implementation of a well-known mechanism of [4] that achieves distortion \(O(\sqrt{m\log {m}})\), almost matching the best possible lower bound of \(\Omega (\sqrt{m})\).

Strategyproof Mechanisms

For strategyproof mechanisms, which are resilient to possible strategic manipulations, we show that a best-possible distortion bound of \(\Theta (nm)\) for deterministic mechanisms (and more generally mechanisms with a deterministic in-district rule) is easy to achieve by a variation of a dictatorship rule. For randomized mechanisms, since point-voting schemes are strategyproof, the bound \(O(\sqrt{m\log {m}})\) carries over to this class as well. Results about deterministic strategyproof mechanisms are presented in Section 3, and about randomized strategyproof mechanisms in Section 4. Interestingly, these bounds match the best possible ones for strategyproof centralized mechanisms.

Experiments

Finally, in Section 5, we perform indicative experiments using real-world data and synthetic data to evaluate the effect of distributed decision making to the distortion in settings closer to practice. The main conclusions of our experimental results mirror that of our theoretical results in Sections 3 and 4.

1.3 Further Related Work

The distortion literature is by now rather extensive, including topics such as single-winner voting [4,5,6, 18], multi-winner voting [19, 20], matching problems [21, 22], and participatory budgeting [23]. Generally speaking, to avoid unbounded distortion when the cardinal preferences are highly unstructured, most works can be categorized as either studying a normalized utilitarian setting (e.g., [3, 4, 12, 21, 23]) or a metric preference setting (e.g., [5, 6, 18, 20, 24]). Some more recent works have also studied the interplay between information and distortion [22, 25,26,27,28,29,30], and there have also been several works on strategyproofness in the context of distortion [12,13,14,15, 21]. We refer the reader to the survey of [2] for a detailed overview of the related literature.

Besides the aforementioned works on distributed voting, [31] studied a related two-stage setting in which the voters participate in a central election, but the candidates themselves come from local elections within the political parties’ electorates. Beyond distortion, in the context of district-based elections, there have also been other works that have considered the degree of deviation from proportional representation (e.g., see [32] and references therein), and some works that have studied the complexity of manipulation (e.g., see [33,34,35,36]).

2 Preliminaries

An instance I of our problem is given by a tuple \(I = (N,A,\textbf{v},D)\). There is a set N of n agents (or voters) that have preferences over a set A of m alternatives (or candidates). The preferences of each agent \(i \in N\) are captured by a valuation function \(v_i:A\rightarrow \mathbb {R}_{\ge 0}\) that maps every alternative \(a \in A\) to a real non-negative value \(v_i(a) = v_{ia}\). Following previous work, we assume that the valuation functions are normalized such that \(\sum _{a \in A} v_{ia} = 1\) for every \(i \in N\) (unit-sum assumption). Let \(\textbf{v} = (v_i)_{i \in N}\) be the valuation profile consisting of the valuation functions of all agents. The agents are also partitioned into a set D of k disjoint districts.

For every district \(d \in D\), let \(N_d\) be the set of agents it contains, such that \(\bigcup _{d \in D}N_d = N\). In the symmetric case, each district d contains exactly \(\lambda = n/k\) agents. In the asymmetric case, each district d contains a number \(n_d\) of agents. All our lower bounds follow by instances consisting of symmetric districts, whereas our upper bounds in Section 4 hold for asymmetric districts.

2.1 Mechanisms

Our goal is to choose an alternative to satisfy several criteria of interest. This choice must be done using a distributed mechanism that uses an in-district voting rule \(f_{\text {in}}\) and an over-districts voting rule \(f_{\text {over}}\) to implement the following two independent steps:

  • Step 1: For each district d, choose a representative alternative \(a_d \in A\) by holding a local election based on \(f_{\text {in}}\).

  • Step 2: Choose a district representative as the winner based on \(f_{\text {over}}\) by considering the districts as voters and their representatives as the candidates they approve.

For simplicity we refer to such mechanisms as \(f_{{over}}\)-of-\(f_{{in}}\). Different choices of \(f_{\text {in}}\) and \(f_{\text {over}}\) lead to different distributed mechanisms. Note that the in-district rule can in general use various types of information about the preferences of the agents. For instance, it may be able to use exact cardinal information about the valuation functions, or only ordinal information that is induced by the values (i.e., rankings of alternatives that are consistent to the values of the agents for them). In the latter case, we will use \(\sigma _i\) to denote the preference ranking of agent \(i \in N\) so that \(\sigma _i(a)\) is the rank of alternative \(a\in A\) in the ranking of i, and \(\sigma _i(a) < \sigma _i(b)\) if \(v_i(a) \ge v_i(b)\); let \(\varvec{\sigma }=(\sigma _i)_{i \in N}\) be the ordinal profile consisting of the preference rankings of all agents. To be concise in the definitions below, let \(\varvec{\delta }(I)\) be the information about the preferences of the agents in instance \(I = (N,A,\textbf{v},D)\) that is used by a mechanism; that is, \(\varvec{\delta }(I) = \textbf{v}\) in case of cardinal information, or \(\varvec{\delta }(I) = \varvec{\sigma }\) in case of ordinal information.

We will focus on different classes of distributed mechanisms depending on the available information about the preferences of the agents at the district level (cardinal or ordinal), and also on whether their decision is deterministic or randomized (that is, they choose the district representatives or final winner based on probability distributions).

2.2 Social Welfare and Distortion

Given an instance I, the social welfare of an alternative \(a \in A\) is the total value that the agents have for a, that is, \(\text {SW}(a|I) = \sum _{i \in N} v_{ia}\). So, the expected social welfare achieved by a randomized distributed mechanism M that chooses alternative \(a \in A\) as the winner w with probability \(\Pr _M[w=a]\) is

$$\mathbb {E}[\text {SW}(M(I))] = \sum _{a \in A} \Pr _M[w=a] \cdot \text {SW}(a|I).$$

The efficiency of a distributed mechanism is measured by the notion of distortion. The distortion of a distributed mechanism M is the worst-case ratio (over all possible instances with n agents, m alternatives, and k districts) of the maximum social welfare achieved by any alternative over the (expected) social welfare of the alternative chosen by the mechanism as the winner w, that is,

$$\begin{aligned} \text {dist}(M) = \sup _{I} \frac{\max _{a \in A}\text {SW}(a|I)}{\mathbb {E}[\text {SW}(M(\varvec{\delta }(I))]}. \end{aligned}$$

Clearly, \(\text {dist}(M) \ge 1\). When the denominator in the definition of the distortion tends to 0, we will say that the distortion is infinite or unbounded. Our goal is to identify the best possible distributed mechanisms in terms of distortion.

2.3 Strategyproofness

Another important property that we would like our mechanisms to satisfy is that of strategyproofness. A strategyproof mechanism makes decisions such that providing false information never leads to the selection of an alternative that an agent prefers over the alternative chosen when the agent provides truthful information. In particular, for any instance I, it must be the case that \(v_i(M(\varvec{\delta }(I))) \ge v_i(M(\varvec{\delta }(I')))\) for any agent \(i \in N\), where \(I'\) is the instance obtained when only agent i reports information different than that in I.

2.4 Some Useful Observations and Properties

Before we present our technical results, let us briefly discuss some useful properties.

Locality of Distributed Mechanisms

First, observe that any distributed mechanism \(f_{\text {over}}\)-of-\(f_{\text {in}}\) satisfies a locality property in the following sense. A district d (that is, the preferences of a number of agents) appears in different instances if in each of these instances there is a district with the same number of agents and the same information about their preferences as in d (depending on what is required by the mechanism). Since the information is the same, the in-district rule \(f_{\text {in}}\) must decide the same alternative as the representative of the district in all these instances. Similarly, in all instances where the mechanism has decided the same set of district representatives, the over-districts rule \(f_{\text {over}}\) must decide the same final winner.

Distortion of Distributed vs Centralized

Another useful observation is that the distortion of a distributed mechanism \(f_{\text {over}}\)-of-\(f_{\text {in}}\) is at least as much as the distortion of the in-district centralized voting rule \(f_{\text {in}}\). Indeed, when \(k=1\), there is only one representative alternative chosen by \(f_{\text {in}}\), and thus this alternative must be chosen as the winner by \(f_{\text {over}}\); this is also true for instances with \(k \ge 2\) districts which are all copies of one district. Consequently, the distortion of \(f_{\text {in}}\) is a lower bound on the distortion of \(f_{\text {over}}\)-of-\(f_{\text {in}}\).

Strategyproofness

Observe that for a distributed mechanism \(f_{\text {over}}\)-of-\(f_{\text {in}}\) to be strategyproof it is necessary that the in-district rule \(f_{\text {in}}\) is strategyproof. This again follows by how the mechanism would work in instances with a single district, in which case the over-districts rule \(f_{\text {over}}\) does not play any role in the selection of the final winner.

Unanimity

A few of our results will require the in-district rules \(f_{in}\) to be unanimous. Unanimity stipulates that if all of the agents have the same alternative as the top preference, that alternative must be selected (with probability 1). Unanimity is a very natural property of “reasonable” voting rules, especially deterministic ones. For randomized rules, there might be reasons to consider non-unanimous choices, e.g., see [13, 16].

3 Deterministic Mechanisms

We start with deterministic distributed mechanisms and focus explicitly on the case of symmetric districts in this section (that is, the size of each district is \(\lambda \)). When full information about the valuations of the agents is known at the district level, [7] showed that the mechanism Plurality-of-Range-Voting, which chooses the representative of each district to be the alternative with maximum social welfare for the agents in the district, has distortion O(km). We show that this mechanism is asymptotically best possible over all possible deterministic distributed mechanisms that use unanimous in-district rules (but may not use Plurality as the over-districts rule).

Theorem 3.1

The distortion of any deterministic distributed mechanism with a unanimous in-district rule is \(\Omega (km)\).

Proof

Let M be some deterministic distributed mechanism with a unanimous in-district rule. Without loss of generality, whenever there are k distinct district representatives \(\{a_1,\ldots ,a_k\}\), we assume that M chooses \(a_1\) as the overall winner. Let \(\varepsilon > 0\) be some positive infinitesimal and consider the following instance with k districts \(\{d_1, \ldots , d_k\}\) and \(m > k\) alternatives:

  • In district \(d_1\), all agents have value \(1/m + \varepsilon \) for alternative \(a_1\), and value \(1/m - \varepsilon /(m-1)\) for any other alternative.

  • For any \(\ell \in \{2,\ldots ,k\}\), in district \(d_\ell \), all agents have value \(1/2 + \varepsilon \) for alternative \(a_\ell \), value \(1/2 -\varepsilon \) for alternative x, and value 0 for any other alternative.

Since the in-district rule is unanimous, the district representatives are alternatives \(\{a_1, \ldots , a_k\}\), and the overall winner is thus \(a_1\). The social welfare of alternative \(a_1\) is approximately \(\lambda /m\), whereas the social welfare of alternative x is approximately \(k\cdot \lambda /2\), leading to distortion \(\Omega (km)\).\(\square \)

When only ordinal information about the preferences of the agents is available, [7] showed that Plurality-of-Plurality, which chooses the favorite alternative of most of the agents in a district as its representative and then the alternative that represents the most districts as the winner, has distortion \(O(km^2)\). We show that this mechanism is asymptotically best possible among all ordinal distributed mechanisms (without any restrictions), thus improving upon the result of [7] who showed that Plurality-of-Plurality is best possible only within the class of mechanisms they studied.

We first prove an easy but important lemma showing that when only ordinal information is available, to achieve finite distortion, it is necessary the representative of each district to be some alternative that is the favorite of at least one agent in the district.

Lemma 3.2

The representative of any district must be some top-ranked alternative, otherwise the distortion is infinite.

Proof

Let d be a district and let T be the set of top-ranked alternatives. Suppose that the representative of d is chosen to be some alternative \(x \not \in T\). Then, in any instance consisting of copies of d, the winner must be x. However, the valuation profile might be such that all agents have value 1 for their favorite alternative and 0 for any other alternative. Consequently, the social welfare of x might be 0, whereas the social welfare of any top-ranked alternative is positive, leading to infinite distortion.

We say that a district is divided if its \(\lambda \) agents are partitioned into m/2 equal-sized sets such that all the \(2\lambda /m\) agents in each set rank the same alternative first and different sets of agents have different top-ranked alternatives. By Lemma 3.2, the representative of such a district must be one of the top-ranked alternatives. The following lemma shows that choosing the representative of a divided district as the winner is, under some circumstances, a bad choice.

Lemma 3.3

Suppose that some alternative \(y_1\) is chosen as the winner by a deterministic ordinal distributed mechanism when the set of representatives is \(\{y_1, \ldots , y_k\}\). If there exists a divided district that is represented by \(y_1\), then there are \(k-1\) districts with representatives \(y_2,\ldots ,y_k\), and altogether these k districts define an instance such that the distortion of the mechanism is \(\Omega (k m^2)\).

Proof

Let M be a deterministic ordinal distributed mechanism that selects \(y_1\) as the winner when the set of representatives is \(\{y_1, \ldots , y_k\}\), and let d be the divided district that is represented by \(y_1\). Consider the following k districts:

  • The first district is a copy of d.

  • For every \(\ell \in \{2,\ldots ,k\}\), the \(\ell \)-th district is such that all agents therein rank \(y_\ell \) first, \(x \not \in \{y_1,\ldots ,y_k\}\) second, and then all other alternatives. By Lemma 3.2, M must choose \(y_\ell \) as the representative of the \(\ell \)-th district, as this is the only top-ranked alternative.

So, indeed the set of representatives is \(\{y_1,\ldots ,y_k\}\) and M chooses \(y_1\) as the winner by assumption. One possible valuation profile is the following:

  • In the first, divided district, the \(2\lambda /m\) agents that rank \(y_1\) first have value 1/m for all alternatives, and the remaining agents all have value 1 for their favorite alternative.

  • For every \(\ell \in \{2,\ldots ,k\}\), all agents in the \(\ell \)-th district have value 1/2 for their two favorite alternatives (\(y_\ell \) and x).

Consequently, the social welfare of \(y_1\) is \(\lambda /m^2\) whereas the social welfare of x is approximately \(k\cdot \lambda /2\), and thus the distortion is \(\Omega (k m^2)\).\(\square \)

Lemma 3.3 shows that deterministic ordinal distributed mechanisms with distortion \(o(km^2)\) must not output the representative of a divided district as the winner when it is given a set of districts with different representatives. However, as we show in the proof of the next theorem, there are instances where such a choice is inevitable, and thus the distortion is \(\Omega (km^2)\).

Theorem 3.4

The distortion of any deterministic ordinal distributed mechanism is \(\Omega (km^2)\).

Proof

Let M be a deterministic ordinal distributed mechanism. We focus on instances with k districts and sets of alternatives \(A \cup B \cup C \cup \{x\}\), where \(A = \{a_1,\ldots ,a_k\}\), \(B = \{b_1, \ldots , b_{m/2+k-1}\}\), and \(C = \{c_1,\ldots ,c_{m-2k}\}\). Without loss of generality, suppose that when the district representatives are \(\{a_1,\ldots ,a_k\}\), M chooses \(a_1\) as the overall winner.

Let \(d_1\) be a divided district with set of top-ranked alternatives \(\{a_1,b_1,\ldots ,b_{m/2-1}\}\). By Lemma 3.3, if \(a_1\) is the representative of \(d_1\), then there exists an instance such that the distortion of M is \(\Omega (km^2)\). So, suppose that the representative of \(d_1\) is some other top-ranked alternative, say \(b_1\). Again by Lemma 3.3, if \(b_1\) is chosen as the winner whenever she is part of a representative set consisting of k distinct alternatives, then the distortion of M would be \(\Omega (km^2)\). So, let us assume that when the district representatives are \(\{b_1,a_2,\ldots ,a_k\}\), the winner is an alternative different than \(b_1\), say \(a_2\).

We can now repeat this argument step by step for each alternative \(a_\ell \), \(\ell \in \{2,\ldots ,k\}\). In particular, let \(d_\ell \) be a divided district with top-ranked alternatives \(\{a_\ell ,b_\ell ,\ldots ,b_{m/2+\ell -2}\}\) (note that alternatives \(b_1,\ldots ,b_{\ell -1}\) do not appear as top-ranked alternatives in \(d_\ell \)). By Lemma 3.3, if \(a_\ell \) is the representative of \(d_\ell \) then the distortion of M is \(\Omega (km^2)\), so the representative is some other alternative from the set \(\{b_\ell ,\ldots ,b_{m/2+\ell -2}\}\), say \(b_\ell \). Again by Lemma 3.3, if \(b_\ell \) is chosen as the winner whenever she is part of a representative set consisting of k distinct alternatives, then the distortion of M would be \(\Omega (km^2)\). So, when the district representatives are \(\{b_1,\ldots ,b_\ell , a_{\ell +1},\ldots ,a_k\}\), the winner is an alternative not in \(\{b_1,\ldots ,b_\ell \}\), say \(a_\ell \).

The last step of this repeated argument leads to the lower bound of \(\Omega (km^2)\): We have reached an instance with set of representatives \(\{b_1,\ldots ,b_k\}\) all of whom are representative of some divided district, and thus no matter who of them is chosen as the winner, by Lemma 3.3 there exists an instance that includes the corresponding divided district and \(k-1\) unanimous districts (like in the proof of the lemma) such that the distortion is \(\Omega (km^2)\).\(\square \)

Finally, let us discuss the case of deterministic strategyproof distributed mechanisms. [14] showed that the distortion of any deterministic centralized strategyproof voting rule (including those that have access to the valuation functions) is \(\Theta (nm)\). From the discussion Section 2.4, we directly obtain a lower bound of \(\Omega (nm)\) for the distributed setting as well. A tight upper bound is also not hard to derive by considering the straightforward First-of-First mechanism which works as follows:

  • For each district d, choose the favorite alternative of the first agent therein as the representative.

  • Choose the representative of the first district as the winner.

Theorem 3.5

First-of-First is strategyproof and achieves an asymptotically best possible distortion of \(\Theta (nm)\) within the class of deterministic strategyproof distributed mechanisms.

Proof

The mechanism is clearly strategyproof since the winner is the favorite alternative of the first agent of the first district who acts as a dictator. Since the winner is ranked first by an agent, the social welfare of the mechanism is at least 1/m. The maximum possible social welfare is n, and thus the distortion is O(nm).\(\square \)

4 Randomized Mechanisms

We start our discussion on randomized distributed mechanisms by analyzing a general class of mechanisms that we call Uniform-of-\(\delta \) -Approximate. A mechanism M in this class works as follows:

  • For each district d, M chooses the representative \(a_d\) according to some centralized voting rule \(f_{\text {in}}\) that has distortion at most \(\delta \).

  • M chooses the winner uniformly at random from the set of representatives.

Picking the winner uniformly at random from the representatives that have been selected seems to be the most natural choice as there is not much information about the preferences of the agents in the districts, and essentially all we can do is assign higher proportional probability to an alternative that is representative of more districts. We have the following result.

Theorem 4.1

The distortion of any Uniform-of-\(\delta \) -Approximate mechanism is \(O(k\delta )\).

Proof

Consider an arbitrary instance. Let o be the optimal alternative, \(a_d\) the representative of district d, and w the final winner. Denote by \(\text {SW}_d(x)\) the social welfare of alternative x only from the agents in d; clearly, \(\text {SW}(x) = \sum _{d \in D}\text {SW}_d(x)\). The expected social welfare of the mechanism is

$$\begin{aligned} \mathbb {E}[\text {SW}(M)]&= \sum _{a \in A} \Pr [w=a] \cdot \text {SW}(a) \\&= \frac{1}{k} \sum _{a \in A} \left( \sum _{d \in D} \Pr [a_d = a] \right) \text {SW}(a) \\&= \frac{1}{k} \sum _{d \in D} \sum _{a \in A} \Pr [a_d = a] \cdot \text {SW}(a) \\&= \frac{1}{k} \sum _{d \in D} \mathbb {E}[\text {SW}_d(a_d)] \\&\ge \frac{1}{k} \sum _{d \in D} \mathbb {E}[\text {SW}_d(a_d)] \end{aligned}$$

Since \(a_d\) is chosen based on a voting rule with distortion at most \(\delta \), we have that \(\mathbb {E}[\text {SW}(a_d)] \ge \frac{1}{\delta } \cdot \text {SW}_d(o)\). Combining this together with the fact that \(\text {SW}(o) = \sum _{d \in D} \text {SW}_d(o)\), and using the linearity of expectation, we obtain

$$\begin{aligned} \mathbb {E}[\text {SW}(M)]&\ge \frac{1}{k} \sum _{d \in D} \mathbb {E}[\text {SW}_d(a_d)] \\&\ge \frac{1}{k} \sum _{d \in D} \frac{1}{\delta } \cdot \text {SW}_d(o) \\&= \frac{1}{k\delta } \cdot \text {SW}(o). \end{aligned}$$

Hence, the distortion of the mechanism is at most \(k\delta \).\(\square \)

Theorem 4.1 is a simple composition theorem, analogous to the one presented by [8] for the metric setting. Based on it, we can define randomized distributed mechanisms with proven distortion guarantees by appropriately choosing the in-district rule. Before we continue, observe that the sizes of the districts do not appear in the proof of Theorem 4.1, and thus the distortion of any Uniform-of-\(\delta \) -Approximate mechanism is \(O(k\delta )\) even if the districts are asymmetric. So, the distortion of the mechanism depends on the number of agents only if the distortion \(\delta \) of the in-district rule depends on the number of agents.

If cardinal information is available at the district level, by using Range-Voting with \(\delta =1\) as the in-district rule, we obtain the following.

Corollary 4.2

The distortion of Uniform-of-Range-Voting is O(k).

If only ordinal information about the preferences of the agents is given at the district level, then we can use Plurality with \(\delta = O(m^2)\) and the randomized rule Stable-Lottery mechanism of [12] with \(\delta = O(\sqrt{m})\) as the in-district rule to obtain the following results.

Corollary 4.3

The distortion of Uniform-of-Plurality is \(O(k m^2)\).

Corollary 4.4

The distortion of Uniform-of-Stable-Lottery is \(O(k \sqrt{m})\).

An important question to ask next is under what circumstances the aforementioned upper bounds of Corollaries 4.2, 4.3 and 4.4 are tight. First, we show that Uniform-of-Range-Voting is the best among mechanisms with unanimous in-district rules which may even use cardinal information.

Theorem 4.5

The distortion of any randomized distributed mechanism with a unanimous in-district rule is \(\Omega (k)\).

Proof

Let \(\varepsilon > 0\) be a positive infinitesimal. Consider an instance with the following k symmetric districts: For any \(\ell \in [k]\), in district \(d_\ell \), all \(\lambda \) agents therein have value \(1/2+\varepsilon \) for alternative \(a_\ell \), \(1/2-\varepsilon \) for alternative x, and 0 for any other alternative. Since, the in-district rule is unanimous, the representative of district \(d_\ell \) must be \(a_\ell \) with probability 1. Hence, no matter what the probability of choosing a district representative as the winner is, the expected social welfare of the mechanism is \(\lambda \cdot (1/2+\varepsilon )\). However, the social welfare of alternative x is \(k\cdot \lambda \cdot (1/2 - \varepsilon )\), and thus the distortion is \(\Omega (k)\).\(\square \)

If we consider non-unanimous in-district rules, but require the in-district rule to be deterministic, then we can show a weaker lower bound of \(\Omega (\sqrt{k})\); notice that the theorem also implies the same bound for fully deterministic distributed mechanisms without unanimous in-district rules.

Theorem 4.6

The distortion of any randomized distributed mechanism with a deterministic in-district rule is \(\Omega (\sqrt{k})\).

Proof

Consider a district \(d_\ell \) in which all agents have value 1/2 for alternative \(a_\ell \), value \(1/(2\sqrt{k})\) for each alternative in \(\{b_1,\ldots ,b_{\sqrt{k}}\}\), and 0 for any other alternative. If the representative of this district is not \(a_\ell \), then in instances consisting of copies of this district, the distortion is at least \(\sqrt{k}\); in particular, it is at least that much if some alternative in \(\{b_1,\ldots ,b_{\sqrt{k}}\}\) is chosen and infinite if any other alternative is chosen. So, suppose that the representative of \(d_\ell \) is \(a_\ell \).

Next, consider an instance with k symmetric districts \(d_1, \ldots , d_k\). By the above discussion, for any \(\ell \in [k]\), the representative of \(d_\ell \) is alternative \(a_\ell \) with social welfare \(\lambda /2\) (note that only the agents of \(d_\ell \) have positive value, equal to 1/2, for \(a_\ell \)). Hence, no matter which district representative is chosen as the winner (or the probability distribution over the representatives), the (expected) social welfare of the mechanism is \(\lambda /2\). In contrast, the social welfare of any alternative in \(\{b_1,\ldots ,b_{\sqrt{k}}\}\) is \(k \cdot \lambda /(2\sqrt{k}) = \sqrt{k}\cdot \lambda /2\), and thus the distortion is \(\sqrt{k}\).\(\square \)

Next, we show that Uniform-of-Plurality is the best possible among ordinal randomized distributed mechanisms with deterministic in-district rules, assuming an arbitrary but fixed ordering of the alternatives. This is quite surprising, as it shows that randomization over the districts is not better than just choosing an arbitrary alternative that is representative of the most districts (i.e., not better than Plurality-of-Plurality).

Theorem 4.7

The distortion of any ordinal distributed mechanism with a deterministic in-district rule is \(\Omega (km^2)\), when there exists an arbitrary but fixed tie-breaking ordering of the alternatives.

Proof

Without loss of generality, suppose that the tie-breaking ordering of the alternatives is \(a_1 \succ \ldots \succ a_k \succ b_1 \succ \ldots \succ b_{m/2-1} \succ x \succ c_1 \succ \ldots \succ c_{m/2-k}\); the naming of the alternatives is arbitrary but is assumed to be known and can be exploited. For simplicity, for any set of alternatives X, denote by [X] an arbitrary ordering of the alternatives in X.

Consider an instance with k symmetric districts such that in district \(d_\ell \) there is a set of \(2\lambda /m\) agents with preference ordering \(a_\ell \succ x \succ [A \setminus \{a_\ell ,x\}]\), a set of \(2\lambda /m\) agents with preference ordering \(b_1 \succ x \succ [A \setminus \{b_1,x\}]\), \(\ldots \), and a set of \(2\lambda /m\) agents with preference ordering \(b_{m/2-1} \succ x \succ [A \setminus \{b_{m/2-1},x\}]\). By Lemma 3.2, the representative of \(d_\ell \) must be one of the top-ranked alternatives (otherwise the distortion of the mechanism would be infinite). Since \(a_\ell \) is ranked above the other alternatives in the tie-breaking ordering, she is chosen as the representative of \(d_\ell \). Hence, the set of representatives is \(\{a_1, \ldots , a_k\}\), and the winner is chosen according to some probability distribution over this set.

The valuation profile may be such that the \(2\lambda /m\) agents in district \(d_\ell \) that rank \(a_\ell \) first have value 1/m for all alternatives, while all other agents in \(d_\ell \) have value 1/2 for their two favorite alternatives. Consequently, the social welfare of alternative \(a_\ell \) is \(2\lambda /m^2\), and thus the social welfare of the mechanism is also this much, no matter the probability distribution over the district representatives. In contrast, the social welfare of x is approximately \(k\lambda /2\), leading to a distortion of \(\Omega (km^2)\).\(\square \)

When randomization at the district level can be leveraged by ordinal distributed mechanisms, then we achieve distortion much better than what is implied by Corollary 4.4, while also achieving strategyproofness. In particular, there are several centralized voting rules that can be implemented as distributed mechanisms, in the sense that they define the same probability distribution over the alternatives. One such important class of voting rules is that of point-voting schemes, which is part of a larger class of strategyproof mechanisms [16, 17, 37] and includes rules with almost best possible distortion guarantees [4, 12].

4.1 Point-Voting Schemes

A point-voting scheme chooses an agent uniformly at random and then outputs her t-th favorite alternative with probability \(p_t\), where \(p_1\ge \ldots \ge p_m \ge 0\) and \(\sum _{t = 1}^m p_t = 1\). Hence, the probability according to which the point-voting scheme using the probability vector \(\textbf{p}=(p_1,\ldots ,p_m)\) chooses alternative \(a\in A\) as the winner w is \(\Pr [w=a] = \frac{1}{n} \sum _{i \in N} p_{\sigma _i(a)}\), where \(\sigma _i(a)\) is the position that i ranks a in her preference ranking \(\sigma \).

There are many point-voting schemes of interest. For every positional scoring rule using the scoring vector \(\textbf{s} = (s_1,\ldots ,s_m)\), we can define a point-voting scheme \(f(\textbf{s})\) by normalizing the scoring vector, that is, define \(p_t = s_t / \left( \sum _{j \in [m]} s_j \right) \) for every \(t \in [m]\) so that the winning probability of alternative a is

$$\begin{aligned} \Pr [w=a]&= \frac{1}{n} \sum _{i \in N} \frac{s_{\sigma _i(a)}}{\sum _{j \in [m]} s_j} \\&= \frac{\sum _{i \in N}s_{\sigma _i(a)}}{n \cdot \sum _{j \in [m]} s_j}. \end{aligned}$$

Another important point-voting scheme is the rule that chooses each alternative uniformly at random; in this case, we have \(p_t = 1/m\) for every \(t \in [m]\) so that \(\Pr [w=a] = \frac{1}{n} \sum _{i \in N} \frac{1}{m} = \frac{1}{m}\).

For any point-voting scheme f that uses a probability vector \(\textbf{p}\), we consider the distributed mechanism Proportional-of-f -Point-Voting, which works as follows:

  • For every district d, choose the representative \(a_d\) to be alternative \(a \in A\) with probability \(\frac{1}{{n_d}} \sum _{i \in N_d} p_{\sigma _i(a)}\).

  • Choose the winner to be the representative of district d with probability \(n_d/n\).

Theorem 4.8

Proportional-of-f -Point-Voting defines the same probability distribution as the point-voting scheme f.

Proof

The probability that alternative a is chosen as the winner by Proportional-of-f -Point-Voting is

$$\begin{aligned} \Pr [w = a]&= \sum _{d \in D} \Pr [w = a_d] \cdot \Pr [a_d = a] \\&= \sum _{d \in D} \frac{n_d}{n} \cdot \frac{1}{n_d} \sum _{i \in N_d} p_{\sigma _i(a)} \\&= \frac{1}{n} \sum _{i \in N} p_{\sigma _i(a)}, \end{aligned}$$

that is, Proportional-of-f -Point-Voting chooses a with the same probability as f.\(\square \)

Theorem 4.8 shows that Proportional-of-f -Point-Voting achieves the same distortion bound as the point-voting scheme f it uses as the in-district rule, and also that it inherits its strategyproofness property. This is extremely useful, as there are centralized voting rules that are based on point-voting schemes and achieve almost the best possible distortion.

[4] considered a voting rule that is a convex combination of two point-voting schemes: With probability 1/2 choose an alternative uniformly at random, and with probability 1/2 run the point-voting scheme defined by normalizing the harmonic scoring rule \(H = (1, 1/2, \ldots , 1/m)\). We will refer to this mechanism as BCHLPS. [4] showed that this voting rule has distortion \(O(\sqrt{m \log {m}})\). An important property of point-voting schemes is that any rule that is a convex combination of point-voting schemes is also a point-voting scheme. The following lemma is similar to lemmas proved before in the literature (e.g., see [13, 17]); we provide a proof for completeness.

Lemma 4.9

Let \(f_1,\ldots ,f_\kappa \) be point-voting schemes defined by the probability vectors \(\textbf{p}_1, \ldots , \textbf{p}_\kappa \). For any non-negative numbers \(q_1,\ldots ,q_\kappa \) such that \(\sum _{j \in [\kappa ]} q_j = 1\), the voting rule f that chooses the outcome of \(f_j\) with probability \(q_j\) is a point-voting scheme.

Proof

Let \(\sigma \) be an arbitrary preference profile. For any \(j \in [\kappa ]\), denote the t-th coordinate of \(\textbf{p}_j\) as \(p_{j,t}\), and let \(P_j(a) = \Pr [a = f_j(\sigma )]\) be the probability of choosing a as the winner according to point-voting scheme \(f_j\). Then, the voting rule f chooses alternative a as the winner w with probability

$$\begin{aligned} \Pr [w=a]&= \sum _{j \in [\kappa ]} q_j \cdot P_j(a) \\&= \sum _{j \in [\kappa ]} q_j \cdot \left( \frac{1}{n} \sum _{i \in N} p_{j,\sigma _i(a)} \right) \\&= \frac{1}{n} \sum _{i \in N} \sum _{j \in [\kappa ]} q_j \cdot p_{j,\sigma _i(a)}. \end{aligned}$$

Hence, f is a point-voting scheme defined by the probability vector \(\textbf{p}\) with \(p_t = \sum _{j \in [\kappa ]} q_j \cdot p_{j,t}\).\(\square \)

Consequently, by Theorem 4.8 and Lemma 4.9, we can construct a randomized ordinal distributed mechanism based on the point-voting scheme of [4] that achieves the same distortion bound and is strategyproof.

Corollary 4.10

There exists a randomized ordinal strategyproof distributed mechanism with distortion \(O(\sqrt{m \log {m}})\).

This distortion bound is almost best possible as the lower bound of \(\Omega (\sqrt{m})\) for randomized centralized rules holds trivially for distributed mechanisms by considering single-district instances.

5 Experiments

In this section, we perform experiments with real and synthetic datasets, aiming to identify patterns in the distortion of several well-known voting rules and examine whether these support our theoretical findings. It is well-documented in the literature (e.g., see [4, 7]) that when working with real or realistic preferences, it often is the case that the distortions bounds are small numbers quite close to 1. In this sense, our goal is not primarily to demonstrate the distortion bounds themselves, but rather the dependence of these bounds on the distributed decision-making process, in particular the number of districts, as well as the use of randomization. We perform two main experiments, one with real-world preferences and valuation data, and one with synthetic data. All our experiments are with symmetric districts.

5.1 Experiments with the Jester Dataset

For our first experiment, we use the Jester Joke Dataset [38]. The dataset contains ratings for 100 different jokes in the range \([-10,10]\), provided by 70000 users. We chose to work with this dataset as it has also been employed by [4] in the context of centralized distortion bounds, and also by [7] for the distortion of deterministic distributed mechanisms that use plurality as the over-district rule.

Following the methodology developed in these works, we construct inputs consisting of ratings for the 8 most-rated jokes. In particular, we perform 1000 random runs in which we sample 100 users from the set of all (approximately 50, 000) users that have provided rankings for all eight jokes, and then partition them into k equal-sized districts uniformly at random, for \(k \in \{1, 2, 5, 10, 20, 25\}\). Clearly, the case of \(k=1\) corresponds to the centralized setting and will be used as a reference point. We interpret the ratings of the jokes as cardinal valuations: to be consistent with our setting (and with the experiments of [4, 7]), we add 10 to each user’s rating vector, to ensure that the values are positive and then apply the unit-sum normalization. For these inputs, we compute the average distortion of a set of 20 voting rules over the 1000 runs of the experiment. In particular, we consider distributed mechanisms \(f_\text {over}\)-of-\(f_\text {in}\), where for \(f_\text {over}\) we use Plurality or Uniform, whereas for \(f_\text {in}\) we have:

Deterministic Rules

We use simple voting scoring rules, namely Plurality (PL), Veto, Borda and Harmonic, as well as Range-Voting (RV), which in the case of \(k=1\) finds the optimal alternative.

Randomized Rules

Here we use several natural point-voting schemes with probability vectors that are proportional to the aforementioned scoring rules (recall the definition from Section 4), namely

  • Proportional to Plurality Score (PropPL);

  • Proportional to Borda Score (PropBorda);

  • Proportional to Veto Score (PropVeto);

  • Proportional to Harmonic Score (PropHarmonic).

We also use the rule of [4] (we refer to it as BCHLPS in the following); recall that this is a point-voting scheme that with probability 1/2 selects an alternative at random and with probability 1/2 runs the PropHarmonic rule defined above. As established in Corollary 4.10 (and the discussion before the statement of the corollary), this is best possible in terms of the worst-case distortion.

Table 2 Distortion bounds of various voting rules based on valuations defined by the provided scores of the Jester dataset and random district partitions

The results of our experiments can be seen in Table 2. In the table we only present the results where as \(f_{\text {over}}\), we used Plurality for deterministic rules and Uniform for randomized rules. This is in accordance to our approach in the theoretical results in previous sections. The bounds for the cases not shown are quite similar, and slightly larger in general. For each of the randomized rules, we perform 300 runs and calculate their expected social welfare, which we then use to calculate the distortion.

From the results of Table 2 we observe that, as expected, the existence of multiple districts has an adverse effect on the distortion of deterministic mechanisms, which becomes worse compared to the centralized case \(k=1\). For these rules, we can also observe that the distortion generally increases as k increases. In contrast, the distortion of randomized rules remains virtually unchanged for any value of k. This is in complete accordance with our theoretical findings, where we established that these rules induce the same probability distribution.

Another crucial observation is that, in terms of the absolute distortion numbers, randomization does not seem to help; if anything, it makes the distortion bounds worse! This can be justified by the fact that real-world instances like those from the Jester dataset display a large degree of homogeneity, which results in the simple deterministic rules performing quite well. On the other hand, randomization often leads to suboptimal choices even on such “well-behaved” instances, demeaning the distortion bounds on average. Surprisingly, among ordinal voting rules, Borda seems to perform best across the board even though the theoretical distortion of Borda is in fact unbounded.

Table 3 Distortion bounds of various voting rules based on valuations defined according to several probability distributions and random district partitions

5.2 Experiments with Synthetic Datasets

We also perform experiments with datasets that are generated from probability distributions. In particular, and to be consistent with the Jester experiment presented above, we create instances with 100 agents and 8 alternatives, by first drawing the values of the agents from a certain distribution, and then constructing the induced ordinal preference profile from those values. We use the following distributions:

  • Uniform distribution in [1, 100]. This is the simplest case, where all possible values are equally likely.

  • Beta distribution with \(\alpha = 1/10\) and \(\beta = 1/10\). This distribution has a symmetric convex pdf function centered around a mean of 1/2, assigning higher probabilities to values very close to 1 or 0.

  • Exponential distribution with exponent 4, i.e., the pdf is \(f(x)=4e^{4}\) for \(x \ge 0\) and \(f(x)=0\) otherwise. This distribution generates values close to 0 with high probability, and as the values increase, the probability of them being generated decreases exponentially.

For the rest of the experiment, we perform similar steps as in the case of the Jester dataset: We normalize the values to sum up to 1, and run the set of mechanisms described above. For each randomized mechanism we now perform 150 individual runs and calculate its expected welfare. We calculate the average distortions over 500 runs of the experiment for k symmetric districts, where \(k \in \{1, 2, 5, 20, 25\}\). Note that the number of runs and the number of district sizes is slightly smaller in this experiment, because it is more computationally intensive (as we need to calculate bounds for 3 different distributions). Again, we use Plurality as \(f_{\text {over}}\) for deterministic and Uniform for randomized mechanisms; the results for the other cases were similar and are not reported.

The results can be found in Table 3. Similarly to the Jester experiment, it is evident that the distortion of the deterministic mechanisms becomes worse for \(k \ge 2\), whereas it remains pretty much the same for randomized mechanisms. Again, we observe that randomization results in worse distortion bounds overall, and that Borda performs best among deterministic mechanisms. Interestingly, contrary to the Jester dataset, here we do not see a clear pattern of the distortion increasing as k increases for deterministic mechanisms (other than the jump from \(k=1\) to \(k=2\)). This is probably due to the fact that the synthetic instances are highly homogeneous, and with uniform random district partitions, the districts end up being quite uniform, regardless of their number and size.

The role of Unit-Sum

We remark here that normalizing the values to sum up to 1 effectively makes the Uniform and Exponential distributions pretty similar, and this is reflected in the results. To get a sense of the effect of normalization, we also ran the experiments without it. We observe that the distortions for the exponential distribution are now larger than those of the uniform distribution. In general, the distortion bounds still lie in the range [1.03, 1.15] for all distributions, but their average values (over all documented distortion bounds) are larger for all distributions except Uniform. It is also the case that for the Beta distribution, the bounds of deterministic mechanisms are much closer to those of randomized ones. The distortion of randomized mechanisms is still almost the same for any number of districts.

6 Open Problems

From our results, an interesting technical challenge is to remove the requirement for a consistent tie-breaking ordering from the statement of Theorem 4.7. Similarly, we could attempt to remove unanimity from the lower bound of Theorem 3.1; although unanimity is usually pretty natural, removing it would make the theorem stronger. More interestingly, our result about point-voting schemes in Theorem 4.8 crucially does not depend on the normalization of the valuations, and hence also could be applied verbatim to the metric distributed social choice setting studied by [8], where randomized mechanisms have never been considered; this seems like a natural starting point for such an investigation. Another direction could be to investigate whether using different rules for choosing the representatives of different districts (rather than using the same rule for all districts) could lead to improved distortion bounds.