1 Introduction

Committee selection rules (a.k.a. multiwinner voting rules) have received a considerable amount of attention recently due to their broad applications in social choice, multi-agent systems, recommendation systems, etc. [31, 39, 40, 49]. How efficiently winning candidates with respect to a committee selection rule can be calculated is one of the most significant criteria to evaluate the applicability of these rules. Many rules, such as STV, Bloc, k-Borda, approval voting, satisfaction approval voting, admit polynomial-time algorithms for computing winners [3, 28]. However, there are also salient committee selection rules with respect to which winners are NP-hard to compute. Among them are particularly the minimax approval voting (MAV) [11, 50], the proportional approval voting (PAV) [3, 43, 70], and the Chamberlin–Courant’s approval voting (CCAV) [18, 61]. Nevertheless, as these rules possess their own merits in many other aspects [2, 33, 43, 44, 49], researchers investigated the parameterized complexity of winner determination problems under these rules, with the hope of procuring as many fixed-parameter tractability results as possible. While many realistic parameters have been considered in the literature so far, there are still many relevant but underappreciated parameters. This paper aims to take a further step towards breaking the complexity barrier against the applicability of the above-mentioned three approval-based rules by extensively expanding the set of meaningful parameters leading to FPT-algorithms for the winner determination problems. Notably, in addition to many traditional parameters, we also study several structural parameters of the incidence graphs of approval-based elections. The incidence graph of an approval-based election is a bipartite graph whose vertex set is the candidate set union the voter set, and there is an edge between a candidate and a voter if and only if this voter approves the candidate. To date, less is known about whether some structural parameters of the incidence graphs such as the treewidth and the size of a maximum matching lead to some FPT-algorithms. These parameters are intimately connected to several important single parameters. For example, they are lower bounds of both the number of candidates m and the number of voters n, and hence any FPT-algorithm with respect to these two structural parameters directly carry over to m and n.

Organization. In Sect. 2, we give definitions and notations used in the paper. Then, we elaborate on related works and summarize our main results in Sect. 3. Our concrete results are encapsulated in Sects. 46. Specifically, Sect. 4 studies single parameters, Sect. 5 explores combinations of the single parameters occurring in Sect. 4 and Sect. 6 focuses on the aforementioned structural parameters. For an overview of our main results, we refer to Table 1. We complete the paper by recapping our contributions and laying out several promising avenues for future research in Sect. 7.

2 Preliminaries

In this section, we give essential notions related to our study.

2.1 Elections

We study approval-based multiwinner voting. In this setting, an election is a tuple \(E=(C, V)\) where C is a set of candidates and V is a multiset of votes. Each vote of V is cast by a voter and is defined as a subset of C. In this paper, we interchangeably use the terms vote and voter. We say that a vote v approves a candidate c if \(c\in v\). For each candidate cV(c) denotes the set of votes in V approving c. Let k be a nonnegative integer. A k-set is a set of cardinality k. A committee (respectively, k-committee) is a subset (respectively, k-subset) of candidates. A k-committee selection rule maps each election (CV) and every nonnegative integer k such that \(k\le |C|\) to a collection of k-committees of C, winning k-committees of (CV) under this rule.

The Hamming distance between two sets v and \(v'\) is defined as

$$\begin{aligned} \mathscr{H}(v,v')=|v\setminus v'|+|v'\setminus v|. \end{aligned}$$

We study the following k-committee selection rules.

MAV:

In general, MAV selects k-committees as close as possible to every vote, where the closeness is measured by the Hamming distance. Precisely, the MAV score of a committee \(w\) with respect to (CV) is \({\textsf {MAV}}(V,w)=\max _{v\in V}\mathscr{H}(v,w)\). MAV selects k-committees with the minimum MAV score.

CCAV:

A vote v is satisfied with a committee \(w\) if and only if at least one of v’s approved candidates is contained in \(w\), i.e., \(v\cap w\ne \emptyset\). Candidates in \(v\cap w\) are regarded as representatives of v in \(w\). The CCAV score of a committee \(w\) with respect to (CV) is \({\textsf {CCAV}}(V,w)=|\{v\in V :v\cap w\ne \emptyset \}|\). CCAV selects k-committees with the maximum CCAV score.Footnote 1

PAV:

The PAV score of a committee \(w\) with respect to an election (CV) is \({\textsf {PAV}}(V,w)=\sum _{v\in V, v\cap w\ne \emptyset } \sum _{i=1}^{|v\cap w|}\frac{1}{i}\). PAV selects k-committees with the maximum PAV score.

Note that by the above definitions, \({\textsf {CCAV}}(V,\emptyset )={\textsf {PAV}}(V,\emptyset )=0\) holds.

For each \(\tau \in \{\text {MAV, CCAV, PAV}\}\), we study the following problem.

\(\tau {\textsc {-Multiwinner}}\)

Input:

An election \(E=(C, V)\), an integer \(k\le |C|\), and a rational number \(d\).

Question:

Is there a k-committee \(w\subseteq C\) such that \(\tau (V, w)\le d\) for \(\tau =\text {MAV}\), and \(\tau (V,w)\ge d\) for \(\tau \in \{\text {CCAV, PAV}\}\)?

Throughout this paper, we study the following single parameters and consistently use the corresponding notations given below.

  • \(m=|C|\).

  • \(n=|V|\).

  • \(d\): the threshold score of a desired committee.

  • k: size of winning committees.

  • \(\overline{k}=m-k\).

  • \(\triangle _{\text {V}}=\max _{v\in V}|v|\) is the maximum number of candidates a vote approves.

  • \(\triangle _{\text {C}}=\max _{c\in C}|V(c)|\) is the maximum number of votes approving a candidate in common.

The first four parameters are natural and have been substantially covered in the literature. (See Sect. 3 for the details.) The study of \(\overline{k}\) is motivated by the observation that in many real-life decision-making scenarios winners are picked by eliminating a small number of losers, or more relevantly, many decision-making processes are directly designed to select losers other than selecting winners. The last two parameters \(\triangle _{\text {V}}\) and \(\triangle _{\text {C}}\) have been either explicitly or implicitly studied in the literature. In many real-world applications voters are allowed to approve only a few candidates. In some other scenarios, voters are cognitively limited or time constrained so that they are only able to evaluate a small number of candidates. In these cases, \(\triangle _{\text {V}}\) is relatively small. Regarding the parameter \(\triangle _{\text {C}}\), observe that it is no greater than n. So, it is small whenever n is small. Moreover, as our goal is to provide a landscape of the parameterized complexity of \(\tau {\textsc {-Multiwinner}}\) as complete as possible, it is theoretically significant to study this parameter so as that we can offer results for combined parameters including \(\triangle _{\text {C}}\).

2.2 Graphs

We assume the reader is familiar with the basic concepts of graph theory, and refer to [12, 73] for notions in graph theory used but not defined in the paper. We only reiterate some basic notions below. A graph is a pair (NA) where N is a set of vertices and A is a set of edges over N. Each edge is defined as an unordered pair of vertices, and between every two vertices there can be at most one edge. A multigraph is a generalization of a graph where between every two vertices there may exist multiple edges, and there may exist loops on vertices. A hypergraph is a generalization of a graph where every edge is a subset of vertices. A multihypergraph is a generalization of a hypergraph so that there may exist multiple edges consisting of the same vertices.

A matching of a graph \(G=(N, A)\) is a subset M of A such that no two edges in M share a common vertex. A vertex v is saturated by M if v is a vertex in some edge in M. A maximum matching of G is a matching with the maximum cardinality among all matching of G. We use \(\alpha (G)\) to denote the size of a maximum matching of G.

2.3 Parameterized complexity

A parameterized problem is a subset of \(\varSigma ^*\times {\mathbb {N}}\), where \(\varSigma\) is a finite alphabet. A parameterized problem can be either fixed-parameter tractable (FPT) or fixed-parameter intractable. In particular, a parameterized problem is FPT if there is an algorithm which correctly determines for each instance \((I, \kappa )\) of the problem whether \((I, \kappa )\) is a Yes-instance in time \(O(f(\kappa )\cdot |I|^{O(1)})\), where f is a computable function and \(|I|\) is the size of I. Fixed-parameter intractable problems are further classified into many classes including W[1]-hard, W[2]-hard, etc. For greater details on parameterized complexity theory, we refer to [22, 25, 27].

3 Related works and our contributions

In this section, we discuss some important related works and outline our main contributions.

3.1 Single parameters

\(\tau {\textsc {-Multiwinner}}\) has many natural parameters inherent in its definition, say, the seven single parameters listed at the end of Sect. 2.1. All these seven parameters except \(\overline{k}\) have been explicitly or implicitly investigated in the literature prior to our work.

First, it is easy to see that MAV-Multiwinner, CCAV-Multiwinner, and PAV-Multiwinner are FPT with respect to the parameter m. Misra, Nabeel, and Singh [57] proved that MAV-Multiwinner is FPT with respect to the parameters \(d\) and n, but becomes W[2]-hard when parameterized by k. Betzler, Slinko, and Uhlmann [4] proved that CCAV-Multiwinner is FPT with respect to the parameter n, but turned out to be W[2]-hard when k is the parameter. Moreover, they considered a dual parameter \(R=n-d\). They proved that CCAV-Multiwinner is NP-hard even when \(R=0\), but presented an FPT-algorithm with respect to the combined parameter \(k+R\).Footnote 2 Aziz et al. [3] proved that PAV-Multiwinner is W[1]-hard with respect to k even if every voter approves two candidates.

We complement these results as follows. First, we close some gaps and improve an FPT-algorithm. Concretely, we propose an FPT-algorithm for PAV-Multiwinner when parameterized by n. We also observe that CCAV-Multiwinner is equivalent to the Partial Hitting Set problem, and as a consequence of a previous result for the Partial Hitting Set problem, CCAV-Multiwinner can be solved in \(O^*(2^{O(d)})\) time or in \(O^*(2^{O(n)})\) time.Footnote 3 It should be noted that our new observation-based algorithm substantially improves the previous best FPT-algorithm for CCAV-Multiwinner parameterized by n studied in [4] which runs in time \(O^*(n^n)\). Second, we study the parameter \(\overline{k}=m-k\), the number of nonwinning candidates. We show that MAV-Multiwinner, CCAV-Multiwinner, and PAV-Multiwinner are all W[1]-hard with respect to this parameter, even when every voter approves two candidates. Third, from previous results by other researchers, we achieve numerous dichotomy results with respect to the two natural parameters \(\triangle _{\text {V}}\) and \(\triangle _{\text {C}}\). It is known that MAV-Multiwinner, CCAV-Multiwinner, and PAV-Multiwinner are already NP-hard when \(\triangle _{\text {V}}=2\) and \(\triangle _{\text {C}}=3\) [3, 50, 62]. We prove that MAV-Multiwinner and CCAV-Multiwinner become polynomial-time solvable if \(\triangle _{\text {V}}\le 1\) or \(\triangle _{\text {C}}\le 2\), and PAV-Multiwinner becomes polynomial-time solvable if \(\min \{\triangle _{\text {V}}, \triangle _{\text {C}}\}=1\) or \(\triangle _{\text {V}}=\triangle _{\text {C}}=2\).

3.2 Combined parameters

Obviously, if a problem is FPT with respect to a parameter \(\kappa\), it is FPT with respect to any combined parameter including \(\kappa\). As except PAV-Multiwinner with respect to \(d\) whose fixed-parameter tractability is open, MAV-Multiwinner, CCAV-Multiwinner, and PAV-Multiwinner are FPT with respect to mn, and \(d\), it only makes sense to study combinations of other parameters. As \(k+\overline{k}=m\), \(\tau {\textsc {-Multiwinner}}\) for \(\tau \in \{\text {MAV}, \text {CCAV}, \text {PAV}\}\) is FPT with respect to \(k+\overline{k}\). For MAV and CCAV, the remaining combinations of two single parameters are \(k+\triangle _{\text {V}}\), \(k+\triangle _{\text {C}}\), \(\overline{k}+\triangle _{\text {V}}\), and \(\overline{k}+\triangle _{\text {C}}\). We establish many FPT-results with respect to these combined parameters. Concretely, we obtain FPT-results for MAV-Multiwinner and CCAV-Multiwinner when parameterized by \(k+\triangle _{\text {C}}\) and \(\overline{k}+\triangle _{\text {C}}\). However, as MAV-Multiwinner, CCAV-Multiwinner, and PAV-Multiwinner are W[1]-hard with respect to \(\overline{k}\) (even when every vote approves two candidates), they are W[1]-hard when parameterized by \(\overline{k}+\triangle _{\text {V}}\). For the parameter \(k+\triangle _{\text {V}}\), we develop an FPT-algorithm for MAV-Multiwinner, but we show that CCAV-Multiwinner is W[1]-hard even when every vote approves two candidates. Concerning PAV, a reduction by Aziz et al. [3] implies that PAV-Multiwinner is W[1]-hard with respect to \(k+\triangle _{\text {V}}\). We are unable to prove the FPT-membership of PAV-Multiwinner with respect to \(d\), but we show that combining \(d\) and \(\triangle _{\text {V}}\) leads to an FPT-result.

We would like to point out that Misra, Nabeel, and Singh [57] studied kernelization of MAV-Multiwinner with respect to the combined parameters \(d+m\) and \(n+k\), and showed that these problems do not admit any polynomial kernels unless \(\textsf {CoNP}\subseteq \textsf {NP}/\textsf {Poly}\)Footnote 4. In addition, Liu and Guo [53] studied the combined parameter \(k+n\) for some generalizations of MAV-Multiwinner. Betzler, Slinko, and Uhlmann [4] proved that CCAV-Multiwinner is W[2]-hard with respect to the combined parameter of \(R=n-d\) and k. Though not explicitly stated, their reduction actually implies that CCAV-Multiwinner does not admit any polynomial kernel with respect to both n and m, unless the polynomial hierarchy collapses to the third level.Footnote 5

3.3 Structural parameters

Heretofore, the most widely studied structural parameters in the setting of multiwinner voting are based on various concepts of restricted preference domains such as single-peaked preferences and single-crossing preferences (see, e.g., [20]). Particularly, these parameters measure how far an election is away from a specific domain of preferences. In this paper, we study two structural parameters of incidence graphs of elections. Recall that the incidence graph of an election \(E=(C, V)\) is a bipartite graph \(G_E\) with vertex set \(C\cup V\) so that there is an edge between a candidate \(c\in C\) and a vote \(v\in V\) if and only if \(c\in v\). We prove that CCAV-Multiwinner is FPT with respect to treewidth of incidence graphs, and MAV-Multiwinner and PAV-Multiwinner are FPT if we combine the treewidth and the parameter k. When parameterized by the size of maximum matchings of incidence graphs, we have FPT-algorithms for all three rules.

3.4 Other related works

In addition to the extensive effort made from the angle of parameterized complexity, much exploration on the complexity of \(\tau {\textsc {-Multiwinner}}\) restricted to preference domains has been pursued over the past few years. Betzler, Slinko, and Uhlmann [4] developed polynomial-time algorithms for CCAV-Multiwinner in the single-peaked domain. This algorithm was subsequently extended to an FPT-algorithm with respect to the parameter single-peaked width by Cornaz, Galand, and Spanjaard [20]. Yu, Chan, and Elkind [81] studied the domain of single-peaked on trees, and obtained both polynomial-time algorithms and NP-hardness results for many variants of CCAV. One of their results [81, Theorem 4.1] is in essence an FPT-algorithm for CCAV-Multiwinner with respect to the combined parameter \(m+k+\lambda\), where \(\lambda\) is the number of leaves of the underlying tree. A follow-up paper by Peters and Elkind [59] addressed several open questions left in [81]. Later, Elkind and Lackner [29] proposed 13 different restricted domains of dichotomous preferences, and obtained several polynomial-time algorithms and FPT-algorithms for \(\tau {\textsc {-Multiwinner}}\) for \(\tau\) being CCAV, MAV, and PAV with respect to the parameter \(\triangle _{\text {V}}\), and the combined parameter \(k+d\), when the given elections fall into certain specific categories of their proposed domains. Liu and Guo [53] presented polynomial-time algorithms for MAV-Multiwinner in two of the domains proposed by Elkind and Lackner. Peters and Lackner [60] proposed the domain of single-peaked on a circle, an appealing generalization of the single-peaked domain. They provided a polynomial-time algorithm for CCAV-Multiwinner in this domain. Peters [58] observed a relationship between voting problems restricted to the single-peaked domain and totally unimodular integer linear programming. In view of this observation and the fact that totally unimodular integer linear programming is polynomial-time solvable [64], a variety of polynomial-time solvability results were obtained, including one for PAV-Multiwinner.Footnote 6 Skowron et al. [68] complemented these results by showing that CCAV-Multiwinner is polynomial-time solvable restricted to the single-crossing domain. Clearwater, Puppe, and Slinko [19] then extended this result to the domain of single-crossing on trees. Yang [78] expanded several domains of Elkind and Lackner [29] to directed tree-embedded domains, and derived a number of polynomial-time algorithms for MAV-Multiwinner, CCAV-Multiwinner, and PAV-Multiwinner restricted to these domains.

As traditional approaches tackling NP-hard problems, approximation and heuristic algorithms for \(\tau {\textsc {-Multiwinner}}\) where \(\tau \in \{\text {MAV}, \text {CCAV}, \text {PAV}\}\) have also been perpetually reported over the past few years [16, 23, 35, 65, 66].

We would like to mention that the (parameterized) complexity of \(\tau {\textsc {-Multiwinner}}\) for numerous ranking-based multiwinner voting rules \(\tau\) has been considerably studied in the literature, too [4, 32, 42]. On top of that, the (parameterized) complexity of winner determination problems for several variants of multiwinner voting rules has been explored recently [13, 75].

Finally, we remark that, besides winner determination problems, investigating the complexity of many strategic voting problems, such as manipulation, control, and bribery, has gained increasing interest in recent years as well [14, 34, 76, 77]. We refer to [48] for a comprehensive survey of approval-based multiwinner voting where many other important issues not discussed above have been greatly elaborated on.

Table 1 A summary of the parameterized complexity of \(\tau {\textsc {-Multiwinner}}\)

4 Single parameters

In this section, we investigate some predominant single parameters.

4.1 Parameters \(d\)n, and \(\overline{k}\)

We start with an FPT-algorithm for PAV-Multiwinner with respect to n. Before presenting our algorithm, let us recall the main idea of an FPT-algorithm for MAV-Multiwinner with respect to n [57]. First, the candidates are partitioned into at most \(2^n\) subsets, each consisting of all candidates approved by exactly the same votes. Then, what matters for solving the problem is only how many candidates from each subset are contained in a desired k-committee. Based on this observation, MAV-Multiwinner can be reduced to integer linear programming (ILP) by assigning to each subset defined above an integer variable. As the number of variables is bounded by \(2^n\), Lenstra’s theorem [51] provides an FPT-algorithm for MAV-Multiwinner with respect to n. Although the FPT-algorithm with respect to n for CCAV-Multiwinner derived by Betzler, Slinko, and Uhlmann [4] is not ILP-based, it is easy to see that CCAV-Multiwinner also admits a similar ILP formulation. Unfortunately, this framework does not apply to PAV-Multiwinner. The reason is that the objective in this case is a nonlinear function. To overcome this obstacle, we resort to an FPT-framework proposed by Bredereck et al. [15] for the Mixed Integer Programming With Simple Piecewise Linear Transformations problem. In fact, we need only a special case of the problem defined below. For a vector \(x\in {\mathbb {Z}}^p\) and an integer \(i\in [p]\), we use \(x_i\) to denote the i-th component of x.

A real-valued function f is concave if for every x and y such that \(x<y\) and every \(\lambda\) such that \(0\le \lambda \le 1\) it holds that

$$\begin{aligned} f(\lambda \cdot x + (1-\lambda ) \cdot y)\ge \lambda \cdot f(x)+ (1-\lambda ) \cdot f(y). \end{aligned}$$

Intuitively, a function is concave if for every xy, and z such that \(x\le y\le z\), the point (yf(y)) is not below the straight line determined by the two points (xf(x)) and (zf(z)). A piecewise linear concave function is a piecewise linear function that is concave.

Integer Programming With Simple Piecewise Linear Transformations (IPWSPLT)

Input:

A collection \(\{f_{i,j} :i\in [p], j\in [q]\}\) of \(p\cdot q\) piecewise linear concave functions, and a vector \(b\in {\mathbb {Z}}^p\).

Question:

Is there a vector \(x\in {\mathbb {Z}}^q\) such that for every \(i\in [p]\) it holds that

 

\(\sum _{j=1}^q f_{i,j}(x_j)\le b_i?\)      (1)

The original problem MIPWSPLT studied by Bredereck et al. [15] is more general in that it allows the existence of additional variables which may take nonintegral values, and allows the occurrences of both piecewise linear concave functions and piecewise linear convex functions simultaneously.

Lemma 1

([15]) IPWSPLT can be solved in time \(O({\textsf {poly} }(|I|, t) \cdot q^{2.5q+o(q)})\), where \(|I|\) is the number of bits encoding the input, and t is the maximum number of pieces per function.

We note that Lemma 1 still holds if in the definition of IPWSPLT the less than sign is replaced with the greater than sign or the equal sign in (1) for several \(i\in [p]\) [15].

Now we are ready to present our FPT-algorithm for PAV-Multiwinner with respect to n. We provide indeed an algorithm for a more general problem called Annotated PAV-Multiwinner. In this problem, we are given an election (CV), a subset \(C'\subseteq C\) of candidates, an integer k such that \(|C'|\le k\le |C|\), and a number \(d\), and the question is whether there is a k-committee \(w\subseteq C\) such that \(C'\subseteq w\subseteq C\) and PAV\((V, w)\ge d\). Clearly, PAV-Multiwinner is a special case of Annotated PAV-Multiwinner where \(C'=\emptyset\). This generalization is afterward exploited in an algorithm presented in Sect. 6.

Fig. 1
figure 1

An illustration of the piecewise linear concave function f in the proof of Theorem 1

Theorem 1

Annotated PAV-Multiwinner is FPT with respect to n.

Proof

We prove Theorem 1 by reducing Annotated PAV-Multiwinner to IPWSPLT.

Let \(I=(E, C', k, d)\) be an instance of Annotated PAV-Multiwinner, where \(E=(C, V)\) and \(C'\subseteq C\). Let \(n=|V|\) be the number of votes. In what follows, we construct an integer programming formulation of Annotated PAV-Multiwinner, prove the correctness of the reduction, and prove that the formulation is an instance of IPWSPLT. We create two types of variables. First, we create a variable \(x_v\) for each vote \(v\in V\), indicating the number of v’s approved candidates in a desired k-committee. Second, for each \(U\subseteq V\), let \(C_U\) be the set of candidates approved by all votes in U but disapproved by votes from \(V\setminus U\), i.e., \(C_U=\{c\in C :V(c)=U\}\). We create a variable \(x_U\) for each \(U\subseteq V\) which indicates the number of candidates from \(C_U\) that are supposed to be in a desired k-committee. The constraints are as follows.

  1. (1)

    for each variable \(x_U\)\(U\subseteq V\), it holds that \(|C'\cap C_U|\le x_U\le |C_U|\).

  2. (2)

    For each \(v\in V\), we have \(x_v=\sum _{v\in U\subseteq V} x_U\).

  3. (3)

    As we aim to select a committee of cardinality k, we have \(\sum _{U\subseteq V}x_U=k\).

  4. (4)

    For the last constraint, we need to define a piecewise linear concave function \(f: {\mathbb {R}}_{\ge 0}\rightarrow {\mathbb {R}}_{\ge 0}\) as follows. First, \(f(0)=0\). Second, for each positive integer x, we define \(f(x)=\sum _{i=1}^x\frac{1}{i}\). Third, for each real x such that \(y< x< y+1\) and y is a nonnegative integer, we define

    $$\begin{aligned} f(x)=f(y)+(x-y)\cdot (f(y+1)-f(y)). \end{aligned}$$

    Fig. 1 illustrates the function f. The last constraint is then \(\sum _{v\in V} f(x_v)\ge d\), which ensures that the desired k-committee has PAV score at least \(d\).

Now we show the correctness of the reduction, i.e., we show that the given instance I is a Yes-instance if and only if the above integer programming has a feasible solution.

\((\Rightarrow )\) Assume that I is a Yes-instance, i.e., there exists a k-committee \(w\) so that \(C'\subseteq w\subseteq C\) and \({\textsf {PAV}}(V,w)\ge d\). We assign to the variables constructed above the corresponding values, i.e., we let \(x_{U}=|w\cap C_U|\) for each \(U\subseteq V\), and let \(x_v=|v\cap w|\) for each \(v\in V\). Obviously, \(|w\cap C_U|\le |C_U|\). As \(C'\subseteq w\), it holds that \(|C'\cap C_U|\le |w\cap C_U|\). Therefore, constraints defined in (1) hold. As for every two distinct \(U, U'\subseteq V\)\(C_U\) and \(C_{U'}\) are disjoint and, moreover, \(\bigcup _{{U}\subseteq V}C_{{U}}=C\), it holds that

$$\begin{aligned} |v\cap w|=\sum _{v\in U\subseteq V} |v\cap w\cap C_U|=\sum _{v\in U\subseteq V} |w\cap C_U|=\sum _{v\in U\subseteq V} x_U \end{aligned}$$
(1)

for all \(v\in V\), and \(\sum _{U\subseteq V}|w\cap C_U|=|w|=k\). In other words, all constraints described in (2)–(3) are satisfied. Notice that by the definition of \(C_U\), if \(v\not \in U\) then \(C_U\cap v=\emptyset\), and if \(v\in U\) then \(C_U\subseteq v\). This ensures the correctness of Equality (1). Finally, the constraint given in (4) holds because \(\sum _{v\in V} f(x_v)\) is exactly the PAV score of \(w\) with respect to V which is equal to or greater than \(d\).

\((\Leftarrow )\) Assume that the above integer programming has a feasible solution. We show below that I is a Yes-instance by constructing a desired k-committee \(w\). Initially, let \(w=\emptyset\). For every \(x_U\), \(U\subseteq V\), in the feasible solution, we arbitrarily select \(x_U\) candidates from \(C_U\) so that all candidates in \(C'\cap C_U\) are selected, and add them into \(w\). By the constraints in (1), these \(x_U\) candidates exist for each \(U\subseteq V\). By the constraint in (3), we know that \(w\) consists of k candidates. By the constraints in (2), \(x_v\) is exactly the number of candidates from \(w\) approved in v. As a result, \(\sum _{v\in V} f(x_v)\) is exactly the PAV score of \(w\) with respect to V. Then, from the constraint in (4), it follows that \({\textsf {PAV}}(V,w)\ge d\). Finally, as \(\bigcup _{U\subseteq V}C_U=C\), by the definition of \(w\) we know that \(C'\) is contained in \(w\). Now we can conclude that I is a Yes-instance of Annotated PAV-Multiwinner.

Next, we show that the above integer programming is an instance of IPWSPLT. To this end, we reiterate that IPWSPLT contains ILP as a special case. Constraints described in (1)–(3) are standard constraints (or can be transformed into standard form trivially) of ILP. It is easy to see that the function f defined in (4) is a piecewise linear concave function. Therefore, the above integer programming is an instance of IPWSPLT.

It remains to analyze the running time of the algorithm. The above reduction clearly takes FPT-time in n, and the number of variables is \(n+2^n\). Then, by Lemma 1 and the correctness of the reduction, Annotated PAV-Multiwinner is FPT with respect to n. \(\square\)

We note that a similar FPT-algorithm for winners computation for a large class of ranking-based multiwinner voting rules has been also derived by Faliszewski et al. [32, Theorem 16].

Now we study the parameter \(\overline{k}=m-k\), i.e., the number of candidates not in a desired k-committee. In regard to MAV, the NP-hardness proof by LeGrand [50] actually already implied that MAV-Multiwinner is W[1]-hard with respect to \(\overline{k}\). Precisely, LeGrand’s NP-hardness reduction is from the Vertex Cover problem, where vertices correspond to candidates, and edges correspond to votes so that each vote approves exactly the candidates corresponding to its two endpoints. It is easy to see that there is a vertex cover of cardinality \(\kappa\) if and only if there is a \(\kappa\)-committee so that the Hamming distance between this committee and each vote is at most \(\kappa\). The -hardness of MAV-Multiwinner follows from that Vertex Cover is W[1]-hard with respect to \(m-\kappa\) where m is the number of vertices (candidates) [26].

Corollary 1

([26, 50]) MAV-Multiwinner is W[1]-hard with respect to \(\overline{k}\) even when every vote approves two candidates.

It should be noted that Vertex Cover with respect to \(m-\kappa\) is exactly a parameterized variant of the Independent Set problem. We provide the formal definition of the problem below because it will be used to establish our next intractability result.

An independent set of a graph is a subset of pairwise nonadjacent vertices.

\(\kappa \)-Independent Set (\(\kappa \)-IDS)

Input:

A graph \(G=(N, A)\) and an integer \(\kappa\).

Parameter:

\(\kappa\).

Question:

Does G admit an independent set of cardinality at least \(\kappa\)?

Theorem 2

CCAV-Multiwinner is W[1]-hard with respect to \(\overline{k}\), even when every vote approves two candidates.

Proof

To prove Theorem 2, we offer a reduction from \(\kappa\)-IDS to CCAV-Multiwinner, similar to the one for MAV-Multiwinner discussed above by LeGrand [50]. Let \((G=(N, A), \kappa )\) be an instance of \(\kappa\)-IDS. We construct an instance \(((C, V), k, d)\) of CCAV-Multiwinner as follows. For each vertex \(c\in N\), we create one candidate denoted by the same symbol for simplicity. Let \(C=N\) and let \(m=|C|\). For each edge \(\{c,c'\}\in A\), we create one vote v approving exactly c and \(c'\), i.e., \(v=\{c, c'\}\). Let V be the set of the created votes. Finally, we set \(\overline{k}=\kappa\) (hence \(k=m-\kappa\)) and set \(d=|A|\).

The correctness of the reduction is easy to check. If there is an independent set I of size \(\overline{k}\), the CCAV score of the \((m-\overline{k})\)-committee \(N\setminus I\) is \(d\). Conversely, if there is an \((m-\overline{k})\)-committee \(w\) of CCAV score \(d\), every vote has at least one of its approved candidates in the committee. Due to the construction, this implies that \(N{\setminus } w\) is an independent set. \(\square\)

For PAV, we also obtain a W[1]-hardness result via a reduction from the following problem.

Minimum (\(\kappa \))-Vertex Subgraph (\(\kappa \)-MVS)

Input:

A graph \(G=(N, A)\) and two integers \(\kappa\) and \(\ell\).

Parameter:

\(\kappa\).

Question:

Is there \(S\subseteq N\) such that \(|S|=\kappa\) and \(G[N\setminus S]\) has at most \(\ell\) edges?

Cai [17] proved that the \(\kappa\)-MVS problem is W[1]-hard even when the input graph is regular, i.e., all vertices have the same degree.

Theorem 3

PAV-Multiwinner is W[1]-hard with respect to \(\overline{k}\), even when every vote approves two candidates.

Proof

We provide a reduction from \(\kappa\)-MVS to PAV-Multiwinner. Let \((G, \kappa , \ell )\) be an instance of \(\kappa\)-MVS such that every vertex of G has degree r for some positive integer r. Let \(G=(N, A)\) and let \(m=|N|\). Without loss of generality, we assume that \(\kappa <m\). We construct an instance \(((C, V), k, d)\) of PAV-Multiwinner as follows. For each vertex \(c\in N\), we create one candidate in C denoted by the same symbol, and for each edge \(\{c,c'\}\in A\), we create one vote approving c and \(c'\) in V. We set \(k=m-\kappa\), and hence \(\overline{k}=\kappa\). Finally, we set \(d=(m-\kappa )\cdot r-\frac{\ell }{2}\). It remains to show the correctness.

\((\Rightarrow )\) Suppose that there exists \(S\subseteq N\) so that \(|S|=\kappa\) and \(G[N\setminus S]\) contains exactly \(\ell '\le \ell\) edges. Then the PAV score of the committee \(N{\setminus } S\) is

$$\begin{aligned} \frac{3}{2}\ell '+\left( (m-\kappa )\cdot r-2\ell '\right) = (m-\kappa )\cdot r-\frac{\ell '}{2}\ge d. \end{aligned}$$

Obviously, \(|N\setminus S|=m-|S|=k\). Therefore, the PAV-Multiwinner instance is a Yes-instance.

\((\Leftarrow )\) Conversely, suppose that \(w\subseteq C\) is an \((m-\kappa )\)-committee of PAV score at least \(d\). Let \(\ell '\) be the number of votes whose both approved candidates are in \(w\). Due to the reduction, every candidate is approved by exactly r votes. Therefore, there are exactly \((m-\kappa )\cdot r-2 \ell '\) votes which have exactly one of their approved candidates in \(w\). Hence, the PAV score of \(w\) is \(\frac{3}{2}\ell '+((m-\kappa )\cdot r-2\ell ')\), which is at least \(d\). It immediately follows that \(\ell '\le \ell\), implying that \(N{\setminus } w\) is a Yes-witness of the \(\kappa\)-MVS instance \((G, \kappa , \ell )\). \(\square\)

It should be pointed out that a reduction established by Skowron, Faliszewski, and Lang [67, Theorem 5] for showing NP-hardness of a related problem also implies that PAV-Multiwinner is W[1]-hard with respect to \(\overline{k}\).Footnote 7

Now we move on to the parameter \(d\). We observe that CCAV-Multiwinner is equivalent to the Partial Hitting Set problem which has been intensively studied in the literature.

Partial Hitting Set

Input:

A universe U, a collection \(\mathscr{S}\) of subsets of U, and two nonnegative integers a and b.

Parameter:

b.

Question:

Is there \(S\subseteq U\) such that \(|S|=a\) and S intersects at least b elements of \(\mathscr{S}\)?

Clearly, by taking \(C=U\), \(V=\mathscr{S}\), \(k=a\), and \(d=b\) in the above definition we obtain CCAV-Multiwinner. Bläser [5] derived an algorithm running in time \(O^*(2^{O(b)})\) for Partial Hitting Set. This leads to the following corollary.

Corollary 2

([5]) CCAV-Multiwinner can be solved in time \(O^*(2^{O(d)})\).

Observe that every CCAV-Multiwinner instance where \(d>n\) is a No-instance. As a result, Corollary 2 implies an algorithm for CCAV-Multiwinner running in time \(O^*(2^{O(n)})\), which appreciably improves the \(O^*(n^n)\) algorithm presented in [4].

Corollary 3

([5]) CCAV-Multiwinner can be solved in time \(O^*(2^{O(n)})\).

Bläser [5] also presented a randomized algorithm of running time \(O^*((2e)^b)\) which solves the Partial Hitting Set problem correctly with probability at least \(1-e^{-1}\), where e is the Napier’s constant. We arrive at the following corollary.

Corollary 4

([5]) There is a randomized algorithm running in time \(O^*((2e)^d)\) which solves CCAV-Multiwinner correctly with probability at least \(1-e^{-1}\).

4.2 Parameters \(\triangle _{\text {V}}\) and \(\triangle _{\text {C}}\)

In this section, we study the two parameters \(\triangle _{\text {V}}\) and \(\triangle _{\text {C}}\). First, as Vertex Cover remains NP-hard when restricted to cubic graphs (i.e., 3-regular graphs) [38], the reduction for the NP-hardness of MAV-Multiwinner established by LeGrand [50] implies that MAV-Multiwinner is NP-hard even when every candidate is approved by three votes and every vote approves two candidates. It remains to study the cases where \(\triangle _{\text {V}}\le 1\) or \(\triangle _{\text {C}}\le 2\).

Recall that for a committee selection rule \(\tau\) and an election \(E=(C, V)\), \({\tau }(E, k)\) is the set of all optimal k-committees under \(\tau\). More precisely, for \(\tau\) being MAV, \(\tau (E, k)\) consists of all k-committees of C with the minimum Hamming distance to the votes, and for \(\tau\) being CCAV and PAV, \({\tau }(E, k)\) consists of all k-committees of C with the maximum CCAV and PAV scores, respectively.

It is easy to see that every election \(E=(C, V)\) where every vote approves only one candidate admits a k-committee, \(k\le |C|\), which is optimal with respect to MAV, CCAV, and PAV at the same time. In fact, any optimal k-committee under the approval voting (AV) is such a k-committee. The AV score of a candidate c is \(|V(c)|\), the number of votes approving c, and AV selects k-committees with the maximum sum of AV scores of candidates in the committees.

Observation 1

Let \(E=(C, V)\) be an election where every vote in V approves at most one candidate, and let \(k\le |C|\) be an integer. It holds that \({\text{AV}}(E, k)\subseteq {\text {MAV} }(E,k)\) and \({\text {AV} }(E, k)={\text {CCAV} }(E,k)={\text {PAV} }(E,k)\).

To see that \({\text {AV}}(E, k)\subseteq {\text {MAV}}(E,k)\) instead of \(\text {AV}(E, k)= {\text {MAV}}(E,k)\) in Observation 1, consider an election with two candidates a and b, where a is approved by two votes, and b is approved by a third vote which does not approve a. For \(k=1\), both \(\{a\}\) and \(\{b\}\) are optimal with respect to MAV, but only \(\{a\}\) is optimal with respect to AV.

The following corollary follows from Observation 1 and the clear fact that an optimal k-committee with respect to AV can be computed in polynomial time.

Corollary 5

For each \(\tau \in \{{\text {MAV} }, {\text {CCAV} }, {\text {PAV} }\}\), \(\tau {\textsc {-Multiwinner}}\) is polynomial-time solvable when \(\triangle _{\text {V} }\le 1\).

Regarding \(\triangle _{\text {C}}\), we have several polynomial-time solvability results for \(\triangle _{\text {C}}\le 2\). One of our results is based on the following polynomial-time solvable problem [52].

Simple b-Edge Cover of Multigraphs (SECM)

Input:

A multigraph \(G=(N, A)\), a function \(f: N\rightarrow {\mathbb {Z}}^+\), and an integer \(\kappa\).

Question:

Is there \(A'\subseteq A\) such that \(|A'|\le \kappa\) and every \(v\in N\) is incident to at least f(v) edges in \(A'\)?

If we require \(|A'|=\kappa\) in the above definition, we obtain the exact version of SECM (E-SECM). Clearly, E-SECM can also be solved in polynomial time.

Every election \(E=(C, V)\) can be represented by a multihypergraph where every vote \(v\in V\) is considered as a vertex and every candidate \(c\in C\) is considered as an edge consisting of vertices in V(c). When a candidate is approved by only one vote, there is a loop on this vote. We use H(E) to denote this multihypergraph representing E. Clearly, given an election, its multihypergraph representation can be computed in polynomial time. When \(\triangle _{\text {C}}\le 2\)H(E) degenerates to a multigraph.

For a class \(\mathscr{H}\) of multihypergraphs, we say that an election E is an \(\mathscr{H}\)-election if \(H(E)\in \mathscr{H}\).

Theorem 4

MAV-Multiwinner and CCAV-Multiwinner are polynomial-time solvable if \(\triangle _{\text {C} }\le 2\).

Proof

We derive polynomial-time algorithms for the special cases of MAV-Multiwinner and CCAV-Multiwinner stated in the theorem as follows. Let \((E, k, d)\) be an instance of MAV-Multiwinner or CCAV-Multiwinner, where \(E=(C, V)\). We first compute the multigraph representation \(H(E)=(V, A)\) of E which can be done in polynomial time, where A is the set of edges corresponding to candidates in C.

MAV. We solve MAV-Multiwinner by reducing it to E-SECM. Recall that for each \(v\in V\)\(|v|\) is the number of candidates approved by v which is equal to the number of edges incident to v in H(E). Observe that the Hamming distance between every vote v and every k-committee can be at most \(|v|+k\). Therefore, if \(d\ge |v|+k\), we can safely remove v from V without changing the answer to the instance. In light of this fact, we assume now that \(|v|+k>d\) for every \(v\in V\). Let \(f: V \rightarrow {\mathbb {Z}}^+\) be a function such that \(f(v)=\lceil \frac{|v|+k-d}{2}\rceil\) for every \(v\in V\). By setting \(\kappa =k\) we complete the construction of an instance \((H(E), f, \kappa )\) of E-SECM. Assume that there exists \(A'\subseteq A\) of cardinality \(\kappa\) so that every \(v\in V\) is incident to at least f(v) edges in \(A'\). Let \(w\) be the k-committee corresponding to \(A'\). The Hamming distance between every vote \(v\in V\) and \(w\) is

$$\begin{aligned} |v\setminus w|+|w\setminus v|\le (|v|-f(v))+(k-f(v))\le d. \end{aligned}$$

The proof for the other direction is analogous.

CCAV. We derive a greedy algorithm for CCAV-Multiwinner. Let \(H'\) be the graph obtained from H(E) by

  1. (1)

    removing all loops, and

  2. (2)

    for every two vertices between which there are multiple edges, removing all but any arbitrary one of these multiple edges.

Let M be a maximum matching of \(H'\), and let V(M) be the set of vertices saturated by M. We distinguish between two cases.

  • If \(|M|\ge k\), we arbitrarily select k edges in M, and let \(w\) be the k-committee corresponding to these selected edges.

  • If \(|M|<k\), let \(w\) be the set of candidates corresponding to edges in M. Let \(E'\) be the election obtained from E by removing all votes in V(M) and all candidates approved only by votes in V(M). As M is a maximum matching of \(H'\), no two votes of \(E'\) approve a common candidate. Then, for every nonempty vote in \(E'\), we arbitrarily select one candidate approved by the vote. Let \(w'\) denote the set of all these selected candidates. If \(|w'|\ge k-|M|\), we include into \(w\) any arbitrary \(k-|M|\) candidates from \(w'\); otherwise, we include into \(w\) all candidates of \(w'\) together with any arbitrary \(k-|w\cup w'|\) remaining candidates.

In either case, we conclude that the given instance of CCAV-Multiwinner is a Yes-instance if and only if \({\textsf {CCAV}}(V, w)\ge d\). \(\square\)

Now we study special cases of PAV-Multiwinner where \(\triangle _{\text {C}}\) and \(\triangle _{\text {V}}\) are very small integers. We need the following notions. A path is a graph comprised of a sequence \(v_1\)\(v_2\)\(\dots\)\(v_t\) of t vertices and \(t-1\) edges so that there is an edge between two vertices if and only if they are consecutive in the sequence. A cycle is a graph obtained from a path by adding an edge between the first and the last vertices. A hairstick is a graph obtained from a path by adding one loop either on the first vertex or on the last vertex. A double headed hairstick (DH-hairstick) is a graph obtained from a path by adding one loop on both the first vertex and on the last vertex. We refer to Fig. 2 for an illustration of these graphs.

Fig. 2
figure 2

An illustration of cycles, paths, hairsticks, and DH-hairsticks

Lemma 2

Let \(\mathscr{H}\) be the set of all paths, cycles, hairsticks, and DH-hairsticks. Then, given an \(\mathscr{H}\)-election, we can compute a PAV optimal k-committee of the election in polynomial time.

We defer the proof of Lemma 2 to the appendix.

Theorem 5

PAV-Multiwinner is polynomial-time solvable if \(\triangle _{\text {C} }\le 1\) or \(\triangle _{\text {V} }=\triangle _{\text {C} }= 2\).

Proof

Let \(I=(E, k, d)\) be an instance of PAV-Multiwinner where \(E=(C, V)\).

We consider first the case where \(\triangle _{\text {C}}\le 1\). The following algorithm finds an optimal k-committee. First, let \(w=\emptyset\). Then, we arrange votes in V in a cyclic order (the relative orders of the votes do not matter), and starting from any arbitrary vote we consider the votes one-by-one in a clockwise order. In particular, if the currently considered vote v has at least one approved candidate not contained in \(w\), i.e., \(v\setminus w\ne \emptyset\), we add any arbitrary candidate from \(v\setminus w\) into \(w\); otherwise we proceed to the next vote. The procedure runs until \(|w|=k\) or \(w\) cannot be expanded in the way described above. Finally, we conclude that I is a Yes-instance if and only if \({\textsf {PAV}}(V, w)\ge d\).

Now we consider the case where \(\triangle _{\text {C}}=\triangle _{\text {V}}=2\). We first compute the multigraph representation H(E) of E. Given a subgraph of H(E), the subelection of E restricted to this subgraph refers to E restricted to candidates and votes corresponding respectively to the edges and the vertices of the subgraph. Observe that every connected component of H(E) is either a path, a cycle, a hairstick, or a DH-hairstick, and by Lemma 2, for every integer j, an optimal j-committee of each subelection restricted to a connected component can be computed in polynomial time. Based on this, we derive a dynamic programming algorithm. Precisely, let \((H_1, H_2, \dots , H_z)\) be an arbitrary order of the connected components of H(E). For each \(i\in [z]\), let \(m^{\le i}\) be the number of edges in the first i connected components in the order. We maintain a table T(ij) where \(i\in [z]\) and \(j \le \min \{k, m^{\le i}\}\) is a nonnegative integer. We define T(ij) as the PAV score of an optimal j-committee in the election restricted to the first i connected components. By Lemma 2, T(1, j) for all possible j can be computed in polynomial time. We use the following recursion to compute T(ij), assuming that all entries \(T(i', j')\) such that \(i'<i\) have been computed. For each \(j'\le j\), let \(d(j')\) be the PAV score of an optimal \(j'\)-committee of E restricted to \(H_i\), which can be computed in polynomial time by Lemma 2. Then, we have that

$$\begin{aligned} T(i, j)=\max _{\begin{array}{c} j'\in [j]\cup \{0\},\\ j-j'\le m^{\le i-1} \end{array}} \{d(j')+T(i-1, j-j')\}. \end{aligned}$$

After all entries are computed, we conclude that the given instance I is a Yes-instance if and only if \(T(z, k)\ge d\). \(\square\)

5 Combined parameters

The study in the previous section revealed that the parameters k\(\overline{k}\)\(\triangle _{\text {V}}\), and \(\triangle _{\text {C}}\) generally lead to fixed-parameter intractability results. In this section, we explore whether combining two or three of them offers us FPT-results.

5.1 Combining k with \(\triangle _{\text {V}}\) and \(\triangle _{\text {C}}\)

We first consider the combined parameter \(k+\triangle _{\text {C}}\), starting with an FPT-result for MAV-Multiwinner.

Theorem 6

MAV-Multiwinner is FPT with respect to \(k+\triangle _{\text {C} }\).

Proof

We prove the theorem by giving an FPT-algorithm for MAV-Multiwinner as follows. Let \(I=(E, k, d)\) be an instance of MAV-Multiwinner, where \(E=(C, V)\). Let \(n=|V|\). If \(n\le k\cdot \triangle _{\text {C}}+1\), then as MAV-Multiwinner is FPT with respect to n [57], we can solve I in FPT-time in \(k+\triangle _{\text {C}}\). Otherwise, let \((v_1,v_2,\dots ,v_n)\) be a linear order on V such that \(|v_i|\ge |v_{i+1}|\) for every \(i\in [n-1]\), i.e., \(v_i\) approves at least the same number of candidates as \(v_{i+1}\) does. Then, the algorithm deletes the last \(n-(k\cdot \triangle _{\text {C}}+1)\) votes in this order, and solves the remaining instance by an FPT-algorithm with respect to the number of votes (e.g., the one presented in [57]).

It remains to prove the correctness of the algorithm. Let \(E'\) be the election after the deletion of the votes as described above. Clearly, every k-committee of the original election with MAV score \(d\) is a k-committee of \(E'\) with MAV score at most \(d\). To show the correctness for the opposite direction, let \(w\) be a k-committee of \(E'\) with MAV score \(d\). As every candidate is approved by at most \(\triangle _{\text {C}}\) votes, at most \(k\cdot \triangle _{\text {C}}\) votes in \(E'\) intersect \(w\). As \(E'\) contains the first \(k\cdot \triangle _{\text {C}}+1\) votes in the order defined above, there exists \(i\in [k\cdot \triangle _{\text {C}}+1]\) so that the vote \(v_i\) does not approve any candidate from \(w\). The Hamming distance between \(w\) and \(v_i\) is \(k+|v_i|\le d\). As \(|v_j|\le |v_i|\) for every deleted vote \(v_j\), \(j\ge k\cdot \triangle _{\text {C}}+2\), the Hamming distance between \(w\) and \(v_j\) is at most \(k+|v_j|\le k+|v_i|\le d\). It follows that \(w\) has MAV score \(d\) in the election E. \(\square\)

Let us move on to CCAV. Obviously, if \(w\) is a k-committee, then at most \(k\cdot \triangle _{\text {C}}\) votes intersect \(w\). Hence, the CCAV score of every optimal k-committee is bounded from above by \(k\cdot \triangle _{\text {C}}\). This observation leads to a simple algorithm for CCAV-Multiwinner: if \(d>k\cdot \triangle _{\text {C}}\), return “No”; otherwise, solve it by Corollary 2 in time \(O^*(2^{O(d)})=O^*(2^{O(k\cdot \triangle _{\text {C}})})\).

Corollary 6

CCAV-Multiwinner is FPT with respect to \(k+\triangle _{\text {C} }\).

Now we study the combined parameter \(k+\triangle _{\text {V}}\). Observe that MAV-Multiwinner admits a straightforward FPT-algorithm with respect to \(k+\triangle _{\text {V}}\) based on an algorithm in [57]: if \(d>k+\triangle _{\text {V}}\), every k-committee has MAV score at most \(d\), and hence we directly conclude that the given instance is a Yes-instance; otherwise, we solve it by the FPT-algorithm with respect to \(d\) proposed in [57].

Corollary 7

MAV-Multiwinner is FPT with respect to \(k+\triangle _{\text {V} }\).

Aziz et al. [3] proved that PAV-Multiwinner is W[1]-hard when the parameter is k even if \(\triangle _{\text {V}}=2\). This implies that PAV-Multiwinner is W[1]-hard with respect to \(k+\triangle _{\text {V}}\). For CCAV, Betzler, Slinko, and Uhlmann [4] proved that CCAV-Multiwinner is W[2]-hard with respect to k by a reduction from the Hitting Set problem. However, in the reduction the maximum number of candidates approved by a vote is not bounded from above by a constant. Using a reduction from the Partial Vertex Cover problem, we show that CCAV-Multiwinner is W[1]-hard with respect to k in this special case.

Partial Vertex Cover (PVC)

Input:

A graph G and two integers \(\kappa\) and \(\ell\).

Parameter:

\(\kappa\).

Question:

Is there a subset \(S\subseteq V(G)\) such that \(|S|=\kappa\) and S covers at least \(\ell\) edges of G?

It is known that PVC is W[1]-hard with respect to \(\kappa\) [41].

Theorem 7

CCAV-Multiwinner is W[1]-hard with respect to k even if every vote approves two candidates.

Proof

Given an instance \((G, \kappa , \ell )\) of PVC, we create a CCAV-Multiwinner instance as follows. We regard each vertex as a candidate and regard each edge as a vote approving exactly the two candidates corresponding to its two endpoints. The reduction is completed by setting \(k=\kappa\) and \(d=\ell\). The correctness is easy to see. \(\square\)

5.2 Combining \(\overline{k}\) with \(\triangle _{\text {V}}\) and \(\triangle _{\text {C}}\)

We have shown that MAV-Multiwinner, CCAV-Multiwinner, and PAV-Multiwinner are W[1]-hard or W[2]-hard with respect to the single parameter \(\overline{k}\) even when \(\triangle _{\text {V}}=2\). It follows that these problems are W[1]-hard or W[2]-hard when parameterized by \(\overline{k}+\triangle _{\text {V}}\). Hence, we focus only on the combined parameter \(\overline{k}+\triangle _{\text {C}}\). We first prove that MAV-Multiwinner is FPT with respect to this parameter by reducing it to an FPT problem which is a generalization of the r-Set Packing problem.

Generalized r-Set Packing (GrSP)

Input:

A universe U, a multiset \(\mathscr{S}\) of r-subsets of U, a function \(f: U\rightarrow {\mathbb {N}}_{0}\), and an integer \(\kappa\).

Parameter:

\(\kappa +r\).

Question:

Is there an \(S\subseteq \mathscr{S}\) such that \(|S|=\kappa\) and every \(u\in U\) occurs in at most f(u) elements of S?

It is known that GrSP is FPT [79]Footnote 8. It is easy to verify that the variant of the GrSP problem where each \(s\in \mathscr{S}\) is of cardinality at most r, instead of exactly r, is reducible to GrSP in polynomial time: for each \(s\in \mathscr{S}\) such that \(|s|<r\), create \(r-|s|\) new elements in U, add them into s, and set \(f(u)=1\) for every newly introduced element in U. Therefore, the above variant is also FPT. In the following, we use G\(r_{\le }\)SP to denote this variant.

Theorem 8

MAV-Multiwinner is FPT with respect to \(\overline{k}+\triangle _{\text {C} }\).

Proof

Let \(I=(E, k, d)\) be an instance of MAV-Multiwinner, where \(E=(C, V)\). Note that the Hamming distance between each vote v and each k-committee is at least \(|k-|v||\). Therefore, if there exists a vote \(v\in V\) such that \(|v|<k\) and \(d<k-|v|\), we immediately conclude that the given instance I is a No-instance. In what follows, we assume that for all \(v\in V\) either it holds that \(|v|\ge k\), or it holds that \(|v|<k\) and \(d\ge k-|v|\). We reduce I to an instance of G\(r_{\le }\)SP in polynomial time as follows. Precisely, let \(U=V\), and let \(\mathscr{S}=\{V(c) :c\in C\}\). Clearly, each element of \(\mathscr{S}\) is of cardinality at most \(\triangle _{\text {C}}\). Hence, we set \(r=\triangle _{\text {C}}\). Regarding the function f, for each \(v\in V\) we define

$$\begin{aligned} f(v)=\left\lfloor \frac{d+|v|-k}{2}\right\rfloor . \end{aligned}$$

By the above assumption, the function f is nonnegative. Finally, we define \(\kappa =\overline{k}=|C|-k\).

It remains to prove that the two instances are equivalent.

\((\Rightarrow )\) Suppose that the G\(r_{\le }\)SP instance is a Yes-instance. In particular, let \(S\subseteq \mathscr{S}\) be such that \(|S|=\kappa\) and every \(u\in U\) occurs in at most f(u) elements of S. Let \(C'\) be the subset of candidates corresponding to S. Clearly, \(|C'|=|S|=\overline{k}\). We claim that \(w=C\setminus C'\) is a k-committee with MAV score at most \(d\). Due to the above construction, every vote v occurs in at most f(v) submultisets from S. This implies that v has at least \(|v|-f(v)\) of its approved candidates in \(w\). Hence, the Hamming distance between v and \(w\) is at most

$$\begin{aligned} f(v)+(k-(|v|-f(v)))=2\cdot f(v)+k-|v|\le d. \end{aligned}$$

In other words, the instance I is a Yes-instance.

\((\Leftarrow )\) Suppose that I is a Yes-instance, i.e., there is a k-committee \(w\subseteq C\) such that \({\textsf {MAV}}(V, w)\le d\). Let \(C'=C{\setminus } w\). Then, for each vote \(v\in V\) at least \(\lceil \frac{|v|+k-d}{2}\rceil\) of its approved candidates must be in \(w\). In other words, at most \(|v|-\lceil \frac{|v|+k-d}{2}\rceil =f(v)\) of v’s approved candidates can be in \(S=\{V(c) :c\in C'\}\). This implies that the G\(r_{\le }\)SP instance is a Yes-instance. \(\square\)

Next, we prove that CCAV-Multiwinner is FPT with respect to the combined parameter \(\overline{k}+\triangle _{\text {C}}\) as well.

Theorem 9

CCAV-Multiwinner is FPT with respect to \(\overline{k}+\triangle _{\text {C} }\).

Proof

We prove Theorem 9 by deriving a branch-and-bound FPT-algorithm with respect to \(\overline{k}+\triangle _{\text {C}}\). Let \(I=((C, V), k, d)\) be an instance of CCAV-Multiwinner. Throughout the algorithm, let \(\overline{k}=|C|-k\).

We first remove from C all candidates not approved by any votes, and remove from V all empty votes. Then, we assume that neither of C and V is empty, since otherwise the instance can be solved trivially. Observe now that if a vote \(v\in V\) approves more than \(\overline{k}\) candidates, any k-committee intersects v. In light of this observation, we need only to focus on votes approving at most \(\overline{k}\) candidates. Let \(U\subseteq V\) be the submultiset of votes from V approving at most \(\overline{k}\) candidates. Let \(B=\cup _{v\in U}v\) be the set of candidates approved by votes from U. If \(U=\emptyset\) or \(|B|\le k\), any k-committee containing B satisfies all votes. In this case, we conclude that I is a Yes-instance if and only if \(d\le |V|\). Assume now that \(|B|\ge k+1\). For each candidate \(c\in C\), let \(V^1(c)\) be the multiset of votes in U that approve c only, i.e., \(V^1(c)=\{v\in U :v=\{c\}\}\). Let \(c^{\star }\in B\) be such that \(|V^1(c^{\star })| \le |V^1(c)|\) for all \(c\in B\). Let \(A=\bigcup _{v\in V(c)\cap U} v\). A significant observation is that there is an optimal k-committee which does not contain A. (A proof for the observation is given later.) In line with this observation, we solve the instance by branching on which candidate from A is not contained in a certain optimal k-committee. In particular, we create \(|A|\) branching cases, one for each \(x\in A\). In the branching case for \(x\in A\), we reset \(C:=C\setminus \{x\}\) (note that this implies that \(\overline{k}\) is decreased by one), \(d:=d-|V{\setminus } U|\), and \(V :=U\), and solve the subinstance iteratively by the above procedure. We use two pruning criteria to determine when to terminate the branching: \(d\le 0\) or \(\overline{k}=0\). When we arrive at a branching node where \(d\le 0\) and \(\overline{k}\ge 0\), it holds clearly that the given instance I is a Yes-instance, and thus in this case we terminate the whole algorithm by returning “Yes”. When we arrive at a branching node where \(\overline{k}=0\) and \(d>0\), we determine that I is a Yes-instance if \(\textsf {CCAV}(V, C)\ge d\), and discard the corresponding branching case otherwise.

Now we prove the correctness of the above mentioned observation. To this end, assume that \(w\) is an optimal k-committee such that \(A\subseteq w\). Then, we can obtain another optimal k-committee from \(w\) by replacing \(c^{\star }\) (notice that \(c^{\star }\in A\)) with any arbitrary candidate from \(B\setminus w\) (as \(|B|\ge k+1\) such a candidate exists). The reason is that every vote from \((V(c^{\star })\cap U)\setminus V^1(c^{\star })\) approves at least one candidate from \(w\setminus \{c^{\star }\}\) (because \(A\subseteq w\)). Hence, removing \(c^{\star }\) from \(w\) may only affect the existence of representatives of votes from \(V^1(c^{\star })\) in the committee. However, as \(|V^1(c^{\star })| \le |V^1(c)|\) for all \(c\in B\), adding any arbitrary candidate from \(B\setminus w\) into \(w\) makes all votes from \(V^1(c^{\star })\) which do not have any representatives before have representatives now.

Regarding the running time of the algorithm, note that \(|A|\le \triangle _{\text {C}} \cdot \overline{k}\). Hence, each branching node has at most \(\triangle _{\text {C}} \cdot \overline{k}\) children, implying that the branching algorithm has running time \(O^*((\triangle _{\text {C}} \cdot \overline{k})^{\overline{k}})\). \(\square\)

5.3 Combining \(d\) and \(\triangle _{\text {V}}\)

It is known that MAV-Multiwinner and CCAV-Multiwinner are both FPT with respect to the parameter \(d\). However, it is unknown whether PAV-Multiwinner is FPT with respect to \(d\). Even though we are unable to resolve this open question, we provide an FPT-algorithm when combining \(d\) and \(\triangle _{\text {V}}\).

Theorem 10

PAV-Multiwinner is FPT with respect to \(d+\triangle _{\text {V} }\).

Proof

We prove the theorem by giving a branch-and-bound FPT-algorithm. Let \(I=(E, k, d)\) be an instance of PAV-Multiwinner, where \(E=(C, V)\).

First, if C contains a candidate approved by at least \(d\) votes in V, any k-committee including this candidate is a desired committee. So, in this case, we directly conclude that I is a Yes-instance.

Second, let \(C_0\subseteq C\) be the set of candidates not approved by any votes in V. We remove from the election (CV) all candidates in \(C_0\), and reset \(k :=\min \{k, |C|\}\). It is easy to verify that the two instances before and after this step are equivalent. Afterwards, if \(k=|C|\), we conclude that I is a Yes-instance if and only if \({\textsf {PAV}}(V, C)\ge d\).

From now on, let us assume that every candidate is approved by at least one and at most \(d-1\) votes in V, and \(k<|C|\). For each \(S\subsetneq C\) and each \(c\in C\setminus S\), we define

$$\begin{aligned} \varDelta (S, c)={\textsf {PAV}}(V,S\cup \{c\})-{\textsf {PAV}}(V,S), \end{aligned}$$

as the marginal contribution of c to the PAV score of the committee \(S\cup \{c\}\). In our branching tree, each branching node is associated with a subset \(S\subsetneq C\) of cardinality at most k, which is supposed to be contained in a desired committee. The root of the branching tree is associated with \(S=\emptyset\). Suppose we are now at a branching node associated with a subset \(S\subsetneq C\). Let \(c\in C{\setminus } S\) be a candidate such that \(\varDelta (S, c)\ge \varDelta (S, c')\) for all \(c'\in C\setminus S\). Let \(A=(\bigcup _{v\in V(c)}v) \setminus S\) be the set of candidates approved by votes approving c but not contained in S. Then, we create \(|A|\) branching cases, one for each \(x\in A\). The set associated with the branching case for \(x\in A\) is \(S\cup \{x\}\). The correctness of our branching is rooted in the fact (a proof supporting this fact is provided later) that, if the given instance I is a Yes-instance, there exists at least one desired k-committee which includes at least one candidate from A.

We terminate the branching when the branching depth of the current branching node reaches \(\min \{k, d\cdot \triangle _{\text {V}}\}\). Recall that the branching depth of a branching node is the number of edges on the path from the root to the node in the branching tree. By our branching strategy, the branching depth of a branching node associated with S is exactly \(|S|\). Therefore, when \(k\le d\cdot \triangle _{\text {V}}\) and the branching depth of the current node associated with S reaches k, we have that \(|S|=k\). In this case, if S has PAV score at least \(d\) with respect to V, we conclude that the given instance I is a Yes-instance. Consider now the case where \(k>d\cdot \triangle _{\text {V}}\). Note that \(\varDelta (S,x)\ge \frac{1}{\triangle _{\text {V}}}\) for each possible S and x. Therefore, if the current branching node associated with S has branching depth \(d\cdot \triangle _{\text {V}}\)S is a \(k'\)-committee, \(k'\le k\), of PAV score at least \(d\). As a result, when \(k>d\cdot \triangle _{\text {V}}\) and the current node has branching depth \(d\cdot \triangle _{\text {V}}\), we directly conclude that I is a Yes-instance. If none of the branching nodes leads to a “Yes”-answer, we conclude that I is a No-instance.

Note that \(|A|\le |V(c)|\cdot \triangle _{\text {V}}\le d\cdot \triangle _{\text {V}}\), where A is as defined above. Therefore, the running time of the whole algorithm is bounded by \(O^*((d\cdot \triangle _{\text {V}})^{d\cdot \triangle _{\text {V}}})\).

To show the correctness, it suffices to prove the following claim (corresponding to the aforementioned fact).

Claim

Let \(S\subsetneq C\) be a subset of at most \(k-1\) candidates, and let \(c\in C\setminus S\) be a candidate such that \(\varDelta (S, c)\ge \varDelta (S, c')\) for all \(c'\in C\setminus S\). Let \(A=(\bigcup _{v\in V(c)}v)\setminus S\). Then, if there is a k-committee \(w\subsetneq C\) such that \({\textsf {PAV} }(V, w)\ge d\), \(S\subsetneq w\), and \(A\cap w=\emptyset\), there exists a k-committee \(w'\subsetneq C\) such that \({\textsf {PAV} }(V, w')\ge d\), \(S\subseteq (w\cap w')\), and \(A\cap w'\ne \emptyset\).

Now we prove the claim. Let Sc, and A be as stipulated in the claim. Suppose that there is a k-committee \(w\subsetneq C\) such that \({\textsf {PAV}}(V, w)\ge d\), \(S\subsetneq w\), and \(A\cap w=\emptyset\). Let \(c'\) be any candidate in \(w{\setminus } S\), and let \(w'=w{\setminus } \{c'\}\cup \{c\}\). Obviously, \(S\subseteq (w\cap w')\) and \(A\cap w'\ne \emptyset\). To complete the proof, it suffices to show that \(\textsf {PAV}(V, w')\ge d\). First, as \(w\cap A=\emptyset\) and \(S\subseteq w\), it holds that \(\varDelta (w, c)=\varDelta (S, c)\). Second, as \(S\subseteq w\setminus \{c'\}\), it holds that \(\varDelta (w\setminus \{c'\}, c')\le \varDelta (S, c')\). Third, it is clear that \(\varDelta (w\setminus \{c'\}, c)\ge \varDelta (w, c)\). We also reiterate that \(\varDelta (S, c)\ge \varDelta (S, c')\). Putting this all together, we obtain \(\varDelta (w\setminus \{c'\}, c)\ge \varDelta (w\setminus \{c'\}, c')\) which implies \({\textsf {PAV}}(V, w')\ge {\textsf {PAV}}(V, w)\ge d\). The proof is completed. \(\square\)

We have derived many FPT-algorithms with respect to different single parameters and their combinations. Our results reveal that MAV-Multiwinner and CCAV-Multiwinner are FPT with respect to any combinations of three single parameters studied so far. Unfortunately, we have only a few FPT-results for PAV-Multiwinner even for combinations of two single parameters. We finish this section by remarking that PAV-Multiwinner is FPT with respect to \(k+\triangle _{\text {V}}+\triangle _{\text {C}}\): if \(d> {k\cdot \triangle _{\text {C}}}\), return “No”; otherwise, solve the instance in FPT-time via Theorem 10.

Corollary 8

PAV-Multiwinner is FPT with respect to \(k+\triangle _{\text {V} }+\triangle _{\text {C} }\).

6 Structural parameters

In this section, we study two structural parameters of incidence graphs of elections. For an election E, let \(G_E\) denote the incidence graph of E.

6.1 Treewidth

Treewidth is a widely-studied notion to measure the closeness of a graph to a tree [63]. It has been shown that a great deal of graphs stemming from innumerable combinatorial problems in a variety of areas have bounded treewidth (see, e.g., [47, 56, 71, 74]). Additionally, it has been scrutinized that even when treewidth is large, using tree decompositions could also be helpful for designing algorithms [55]. From a theoretical point of view, a myriad of NP-hard problems are known to be FPT with respect to the parameter treewidth [21]. With regard to the applicability of treewidth in multiwinner voting, we point out that the treewidth of the incidence graph of an election is no greater than the number m of candidates and the number n of voters in the election, implying that any FPT-algorithm for \(\tau {\textsc {-Multiwinner}}\) with respect to the treewidth also runs in FPT-time in m and n.

A tree decomposition of a graph \(G=(N, A)\) is a tuple \((T,\mathscr{B})\), where \(T=(L, F)\) is a rooted tree with vertex set L and edge set F, and \(\mathscr{B}=\{B_x \subseteq N :x\in L\}\) is a collection of subsets of vertices of G such that the following three conditions are fulfilled simultaneously:

  • For each vertex \(v\in N\) in G there exists at least one \(B_x\in \mathscr{B}\) such that \(v\in B_x\), i.e., every vertex of G is in at least one element of \(\mathscr{B}\).

  • For each edge \(\{v,u\}\in A\) in G there exists at least one \(B_x\in \mathscr{B}\) such that \(v,u\in B_x\), i.e., every edge of G is contained in at least one element of \(\mathscr{B}\).

  • If a vertex \(v\in N\) is in \(B_x, B_y\in \mathscr{B}\), then v is in every \(B_z\in \mathscr{B}\) where z is a vertex on the unique path between x and y in T.

The width of the tree decomposition is defined as \(\max _{B\in \mathscr{B}}{|B|-1}\). The treewidth of a graph G, denoted \(\omega (G)\), is the minimum possible width of tree decompositions of G. Elements of \(\mathscr{B}\) are called bags. To avoid confusion, in the following we call vertices of T nodes.

A more refined notion commonly used in designing FPT-algorithms is the so-called nice tree decomposition [9]. In particular, a nice tree decomposition \((T, \mathscr{B})\) of a graph G is a tree decomposition of G which further satisfies the following conditions simultaneously:

  • Every bag \(B\in \mathscr{B}\) associated with the root or a leaf of T is empty, i.e., \(B_x=\emptyset\) if x is the root or a leaf of T.

  • Nonleaf nodes of T are categorized into introduce nodes, forget nodes, and join nodes such that:

    • Each introduce node x has exactly one child y such that \(B_y\subsetneq B_x\) and \(|B_x{\setminus } B_y|=1\), i.e., \(B_x\) has exactly one more element than \(B_y\).

    • Each forget node x has exactly one child y such that \(B_x\subsetneq B_y\) and \(|B_y{\setminus } B_x|=1\), i.e., \(B_x\) is obtained from \(B_y\) by removing one element.

    • Each join node x has exactly two children y and z such that \(B_x=B_y=B_z\).

It is easy to see from the definition that in a nice tree decomposition, each vertex can be introduced multiple times but can be only forgotten once. Moreover, when a vertex is introduced in a bag \(B_x\), all of its neighbors contained in bags associated with nodes in the subtree rooted at x are contained in \(B_x\).

Lemma 3

([45]) Let G be a graph of p vertices. Then, given a tree decomposition of G of width \(\omega\), a nice tree decomposition of G of width \(\omega\) having \(O(p\cdot \omega )\) nodes can be computed in polynomial time.

It has long been known that calculating treewidth is NP-hard even for bipartite graphs [7]. However, determining whether the treewidth of a graph is at most \(\omega\) is FPT with respect to \(\omega\) and, moreover, powerful heuristic and approximation algorithms for calculating treewidth have been perpetually reported [6, 8, 72]. Besides, treewidth of chordal bipartite graphs can be computed in polynomial time,Footnote 9 and the gap between chordal bipartite graphs and general bipartite graphs is quite small [46]. It is also well-known that every graph of p vertices admits an optimal tree decomposition of \(O(p)\) nodes. Then, by Lemma 3, when we study FPT-algorithms with respect to treewidth or a combined parameter involving treewidth, it does not lose any generality to assume that a nice tree decomposition is given.

Theorem 11

CCAV-Multiwinner can be solved in time \(O^*(4^{\omega })\) if a nice tree decomposition of width \(\omega\) and of \(\textsf {poly} (p)\) nodes of the incidence graph of the input election is given, where p is the number of vertices of the incidence graph.

Proof

Let \((E, k,d)\) be an instance of CCAV-Multiwinner where \(E=(C, V)\) and \(k\le |C|\). In addition, let \((T, \mathscr{B})\) be a nice tree decomposition of \(G_E\) of width \(\omega\) that has \(\textsf {poly}(p)\) nodes, where \(p=|C|+|V|\). We design a dynamic programming algorithm running in time \(O^*(4^{\omega })\) as follows.

For each bag \(B_x\in \mathscr{B}\) associated with a node x in the tree T, let \(C(B_x)\) and \(V(B_x)\) be the set of candidate-vertices and the set of vote-vertices contained in \(B_x\), respectively. Moreover, let \(C(T_x)\) be the set of candidates in bags associated with nodes in the subtree rooted at x. Obviously, \(C(B_x)\subseteq C(T_x)\). We maintain for each node x in T a 3-dimensional table \(D_x(C', V', k')\), where \(C'\) is a subset of \(C(B_x)\) such that \(|C'|\le k\)\(V'\) is a submultiset of \(V(B_x)\), and \(k'\) is an integer such that \(|C'|\le k'\le \min \{k, |C(T_x)|\}\). We say that a \(k'\)-committee \(w\) is a valid committee for the entry \(D_x(C', V', k')\) if all the following conditions hold simultaneously:

  1. (1)

    \(w\subseteq C(T_{x})\);

  2. (2)

    \(C'=C(B_x)\cap w\); and

  3. (3)

    a vote in \(V(B_x)\) has a representative in \(w\) if and only if this vote is in \(V'\), i.e., for every \(v\in V(B_x)\) it holds that \(v\cap w\ne \emptyset\) if and only if \(v\in V'\).

The value of the entry \(D_x(C', V', k')\) is the CCAV score of an optimal valid \(k'\)-committee for the entry. If \(D_x(C', V', k')\) admits no valid \(k'\)-committees, we define \(D_x(C', V', k')=-\infty\). Clearly, each table associated with a bag has at most \(2^{\omega +1}\cdot (k+1)\) entries.

The algorithm updates the tables from those associated with the leaves up to the one associated with the root of T. The values of entries associated with leaves are all 0 (recall that each leaf bag is empty). We update the entry \(D_x(C', V', k')\) associated with a nonleaf node x in T as follows.

First, if some candidate from \(C'\) is approved by some vote from \(V(B_x)\setminus V'\), we set \(D_x(C', V', k')=-\infty\). Otherwise, we consider the following cases with respect to the types of x.

  • x is a join node

  • Let y and z be the two children of x. We have \(B_x=B_y=B_z\). In this case, we let

    $$\begin{aligned}D_x(C', V', k')= \max _{\begin{array}{c} k'_1+k'_2=k'-|C'|\\ 0\le k'_1\le |C(T_y)\setminus C'|\\ 0\le k'_2\le |C(T_z)\setminus C'|\\ V_1', V_2'\subseteq V', V_1'\cup V_2'=V' \end{array}} \left\{ D_y\left( C', V_1', k'_1+\left| C'\right| \right) +D_z\left( C', V_2', k'_2+\left| C'\right| \right) -\left| V_1'\cap V_2'\right| \right\} . \end{aligned}$$

    It is obvious that there are \(O^*(2^{\omega })\) different combinations of \(k_1'\)\(k_2'\)\(V_1'\), and \(V_2'\) to consider in the max function. As a result, \(D_x(C', V', k')\) can be computed in \(O^*(2^{\omega })\) time.

  • x is an introduce node

  • Let y be the child of x, and let \(\{h\}=B_x\setminus B_y\). We further distinguish between two subcases.

    • h is a vote

    • If \(h\not \in V'\), then due to the above discussion h does not approve any candidates from \(C'\). We set \(D_x(C', V', k')=D_y(C', V', k')\). If \(h\in V'\), then we set \(D_x(C', V', k')=D_y(C', V'\setminus \{h\}, k')+1\) if h approves at least one candidate from \(C'\); and set \(D_x(C', V', k')=-\infty\) otherwise.

    • h is a candidate

    • If \(h\not \in C'\), we set \(D_x(C', V', k')=-\infty\) when \(k'=|C(T_x)|\), and set \(D_x(C', V', k')=D_y(C', V', k')\) when \(k'<|C(T_x)|\). If \(h\in C'\), let \(V_x(h)\) be the submultiset of votes in \(V'\) approving h. Then, we set

      $$\begin{aligned} D_x\left( C', V', k'\right) =\max _{U\subseteq V_x(h)}\left\{ D_y\left( C'\setminus \left\{ h\right\} , V'\setminus U, k'-1\right) +|U|\right\} . \end{aligned}$$

      Note that in this case if \(V_x(h)=\emptyset\), we have that \(D_x(C', V', k')=D_y(C'{\setminus } \{h\}, V', k'-1)\). As \(|V_x(h)|\le \omega\), \(D_x(C', V', k')\) can be calculated in \(O^*(2^{\omega })\)-time.

  • x is a forget node

  • Let y be the child of x and let \(\{h\}= B_y\setminus B_x\). We consider the following two subcases.

    • h is a vote

    • We set \(D_x\left( C', V', k'\right) =\max \left\{ D_y\left( C', V', k'\right) , D_y\left( C', V'\cup \left\{ h\right\} , k'\right) \right\}\).

    • h is a candidate

    • Clearly, \(h\not \in C'\). If \(|C'|=k'\) or h is approved by some votes in \(V(B_x)\setminus V'\), we set \(D_x(C', V', k')=D_y(C', V', k')\); otherwise, let

      $$\begin{aligned} D_x\left( C', V', k'\right) =\max \left\{ D_y\left( C', V', k'\right) , D_y\left( C'\cup \left\{ h\right\} , V', k'\right) \right\} . \end{aligned}$$

By the definition of the table, \(D_r(\emptyset , \emptyset , k)\) is the CCAV score of an optimal valid k-committee for the entry, where r is the root of T (recall that the bag associated with the root is empty). By Conditions (1)–(3) given above, every k-committee of C is valid for \(D_r(\emptyset , \emptyset , k)\). Therefore, after all tables are computed, we conclude that the given instance of CCAV-Multiwinner is a Yes-instance if and only if \(D_r(\emptyset , \emptyset , k)\ge d\).

It remains to analyze the running time of the algorithm. Each node in the nice tree decomposition is associated with a table of \(O^*(2^{\omega })\) entries. Due to the above procedure, calculating an entry corresponding to a forget node takes polynomial time, and calculating an entry corresponding to a join or an introduce node takes \(O^*(2^{\omega })\) time. As the nice tree decomposition has polynomially many nodes, the running time of the algorithm is \(O^*(4^{\omega })\). \(\square\)

Similar algorithms for PAV-Multiwinner and MAV-Multiwinner can be derived but with the running time being bounded by \(O^*({(k+1)^{\omega }})\). The reason is that in these cases we need to maintain more information in order to solve the problem. For example, for PAV-Multiwinner, we need to maintain for each vote in \(V'\) the number of its approved candidates that are supposed to be in a desired committee, which requests a table of size at least \(O^*((k+1)^{\omega })\).

Theorem 12

PAV-Multiwinner and MAV-Multiwinner are FPT with respect to the combined parameter \(k+\omega\), if a nice tree decomposition of width \(\omega\) having \(\textsf {poly} (p)\) nodes of the incidence graph of the input election is given, where p is the number of vertices of the incidence graph.

Proof

To prove the theorem, we derive FPT-algorithms for PAV-Multiwinner and MAV-Multiwinner with respect to \(k+\omega\). Let \(I=(E,k,d)\) be an instance of \(\tau {\textsc {-Multiwinner}}\) where \(\tau \in \{\text {PAV}, \text {MAV}\}\), \(E=(C, V)\), and \(k\le |C|\). In addition, let \(p=|C|+|V|\), and let \((T, \mathscr{B})\) be a nice tree decomposition of \(G_E\) of width \(\omega\) that has \(\textsf {poly}(p)\) nodes. The algorithms have the same skeleton as the one in the proof of Theorem 11 for CCAV-Multiwinner. More concretely, we maintain a table for each node of T, compute the tables in a bottom-up manner, and determine if the given instance I is a Yes-instance according to the table associated with the root. To delineate the algorithms, we need the following notations. For an integer i, we define \(f(i)=\sum _{j=1}^i\frac{1}{j}\) if \(i\ge 1\) and define \(f(i)=0\) otherwise. For a function \(\varrho : A\rightarrow B\) and an element \(a\in A\), \(\varrho _{-a}\) denotes \(\varrho\) restricted to the domain \(A\setminus \{a\}\), i.e., \(\varrho _{-a}: A{\setminus } \{a\}\rightarrow B\) is a function such that for all \(a'\in A{\setminus } \{a\}\) it holds that \(\varrho _{-a}(a')=\varrho (a')\). Additionally, let \(C(B_x)\)\(V(B_x)\), and \(C(T_x)\) be defined as in the proof of Theorem 11. Furthermore, in parallel with \(C(T_x)\), we let \(V(T_x)\) denote the multiset of all votes contained in bags associated with nodes in the subtree rooted at x.

PAV. For each node x in T, we maintain a table \(D_x(C', k', \mu )\) where \(C'\subseteq C(B_x)\), \(|C'|\le k'\le \min \{k, |C(T_x)|\}\), and \(\mu : V(B_x)\rightarrow [k]\cup \{0\}\) is a function. We note that if \(V(B_x)=\emptyset\)\(\mu\) is an empty function. We say that a \(k'\)-committee \(w\subseteq C(T_x)\) is valid for \(D_x(C', k', \mu )\) if Conditions (1) and (2) listed in the proof of Theorem 11, and the following condition are satisfied simultaneously:

  1. (D)

    \(|v\cap w|=\mu (v)\) for all \(v\in V(B_x)\).

We define \(D_x(C', k', \mu )\) as the maximum possible PAV score of \(k'\)-committees valid for \(D_x(C', k', \mu )\) with respect to \(V(T_x)\). More precisely,

$$\begin{aligned} D_x(C', k', \mu ):=\max _{\begin{array}{c} w\subseteq C(T_x), |w|=k'\\ w~\text {is valid for}~D_x(C', k', \mu ) \end{array}}\textsf {PAV}(V(T_x), w), \end{aligned}$$

if \(D_x(C', k', \mu )\) admits at least one valid \(k'\)-committee, and \(D_x(C', k', \mu )=-\infty\) otherwise.

The tables for the leaves can be computed trivially according to the definition of the tables. We show how to update an entry \(D_x(C', k', \mu )\) by distinguishing the types of the node x.

  • x is a join node

  • Let y and z be the two children of x. We let

    $$\begin{aligned} D_{x} (C^{\prime } ,k^{\prime } ,\mu ) & = \max _{{k_{1}^{\prime } ,k_{2}^{\prime } ,\mu _{1} ,\mu _{2} }} \{ D_{y} \left( {C^{\prime } ,k_{1}^{\prime } + \left| {C^{\prime } } \right|,\mu_{1}^{\prime}} \right) - g(\mu _{1}^{\prime}) \\ & \quad + D_{z} \left( {C^{\prime } ,k_{2}^{\prime } + |C^{\prime } |,\mu _{2}^{\prime}} \right) - g(\mu _{2}^{\prime} ) \\ & \quad + g(\mu )\} , \\ \end{aligned}$$

    where

    1. (a)

      \(k'_1\) and \(k'_2\) run over all integers so that \(k_1'+k_2'=k'-|C'|\), \(0\le k'_1\le |C(T_y) {\setminus } C'|\), and \(0\le k'_2\le |C(T_z) \setminus C'|\);

    2. (b)

      \(\mu _1\) and \(\mu _2\) run over all functions from \(V(B_x)\) to nonnegative integers so that for all \(v\in V(B_x)\) it holds that \(\mu _1(v)+\mu _2(v)=\mu (v)-|v\cap C'|\);

    3. (c)

      for each \(i\in [2]\)\(\mu _i'\) is a function from \({V(B_x)}\) to \([k]\cup \{0\}\) so that \(\mu _i'(v)=\mu _i(v)+|v\cap C'|\) for all \(v\in V(B_x)\); and

    4. (d)

      for each \(\rho \in \{\mu , \mu _1', \mu _2'\}\), \(g(\rho )=\sum _{v\in V(B_x)} f(\rho (v))\).

    By Condition (4), the number of different combinations of \(\mu _1\) and \(\mu _2\) in (b) is bounded from above by \(O^*((k+1)^{\omega +1})\cdot O^*((k+1)^{\omega +1})=O^*(k^{2\omega })\). As a result, an entry in this case can be computed in time \(O^*(k^{2\omega })\). (Note that by the above recursion, when \(V(B_x)=\emptyset\) we have that \(D_x(C', k', \mu )= \max _{k'_1, k'_2} \{D_y(C', k'_1+|C'|, \mu )+D_z(C', k'_2+|C'|, \mu )\}\).)

  • x is an introduce node

  • Let y be the child of x, and let \(\{h\}=B_x\setminus B_y\). We further distinguish between the following two subcases.

    • h is a vote

    • In this case, we set

      $$\begin{aligned} D_x(C', k', \mu )= {\left\{ \begin{array}{ll} -\infty , &{} \text {if}~\mu (h)\ne |h\cap C'|\\ D_y(C', k', \mu _{-h})+f(\mu (h)), &{} \text {otherwise}\\ \end{array}\right. } \end{aligned}$$
    • h is a candidate

    • If \(h\not \in C'\), we set \(D_x(C', k', \mu )=-\infty\) when \(k'=|C(T_x)|\), and set \(D_x(C', k', \mu )=D_y(C', k', \mu )\) when \(k'<|C(T_x)|\). If \(h\in C'\), we set \(D_x(C', k', \mu )=-\infty\) if there exists \(v\in V(B_x)\) such that \(h\in v\) and \(\mu (v)=0\), and set \(D_x(C', k', \mu )=D_y(C'{\setminus } \{h\}, k'-1, \mu ')+\sum _{v\in V(B_x), h\in v}\frac{1}{\mu (v)}\) otherwise, where \(\mu ': V(B_y)\rightarrow [k]\cup \{0\}\) is the function so that for all \(v\in V(B_y)\) it holds that \(\mu '(v)=\mu (v)-|v\cap \{h\}|\). (If \(V(B_x)=\emptyset\)\(\mu '\) is an empty function.)

  • x is a forget node

  • Let y be the child of x and let \(\{h\}= B_y\setminus B_x\). Similar to the above case we distinguish between the following two subcases.

    • h is a vote We set \(D_x(C', k', \mu )=\max _{\begin{array}{c} \mu ': V(B_y)\rightarrow [k]\cup \{0\}\\ \mu '_{-h}=\mu \end{array}}D_y(C', k', \mu ')\). Obviously, we have at most \({(k+1)^{\omega +1}}\) different functions \(\mu '\) to check, and hence such an entry \(D_x(C', k', \mu )\) can be computed in \(O^*({k}^{\omega })\) time.

    • h is a candidate In this case, we set

      $$\begin{aligned} D_x(C', k', \mu )= {\left\{ \begin{array}{ll} D_y(C', k', \mu ), &{} \text {if}~|C'|=k'\\ \max \{D_y(C', k', \mu ), D_y(C'\cup \{h\}, k', \mu )\}, &{} \text {otherwise}\\ \end{array}\right. } \end{aligned}$$

After the table for the root r is computed, we conclude that the given instance I is a Yes-instance if and only if \(D_r(\emptyset , k, \mu )\ge d\), where \(\mu\) is an empty function (recall that the bag associated with the root is empty). This is because that by the definition of the table, \(D_r(\emptyset , k, \mu )\) is the maximum possible PAV score of a valid k-committee for \(D_x(\emptyset , k, \mu )\) with respect to \(V(T_r)=V\).

MAV. For each node x in T, we maintain a table \(D_x(C', k', \mu )\) whose components are defined as with the ones for PAV. The validity of a \(k'\)-committee for an entry is also defined the same. However, in this case each entry takes only binary values 1 and 0. The entry \(D_x(C', k', \mu )\) is 1 if and only if there is at least one \(k'\)-committee \(w\) which is valid for the entry (i.e., a \(k'\)-committee satisfying Conditions (1)–(2) in the proof of Theorem 11, and Condition (4) given above) and, moreover, for each \(v\in V(T_x)\setminus V(B_x)\), it holds that \(|w\cap v|\ge \frac{k+|v|-d}{2}\). Observe that, by the definition of nice tree decomposition, none of the votes in \(V(T_x)\setminus V(B_x)\) approves any candidates from \(C\setminus C(T_x)\). Therefore, for every k-committee \(w'\subseteq C\) such that \(w'\cap C(T_x)=w\) and every \(v\in V(T_x){\setminus } V(B_x)\), it holds that \(v\cap w=v\cap w'\), implying that the Hamming distance between v and \(w'\) is at most \(d\) if and only if \(|v\cap w|\ge \frac{k+|v|-d}{2}\). The requirement \(|w\cap v|\ge \frac{k+|v|-d}{2}\) ensures that as long as the given instance admits a Yes-witness \(w'\subseteq C\) such that \(w'\cap C(T_x)=w\), the existence of this Yes-witness is safely preserved in the table.

The tables for the leaves can be computed trivially according to their definitions. We show how to update an entry \(D_x(C', k', \mu )\) by distinguishing the types of the node x.

  • x is a join node

  • Let y and z be the two children of x. In this case, we set \(D_x(C', k', \mu )=1\) if and only if there are two functions \(\mu _1, \mu _2: V(B_x)\rightarrow [k]\cup \{0\}\) and two nonnegative integers \(k_1'\) and \(k_2'\) such that

    1. (a)

      \(k'_1+k'_2=k'-|C'|\);

    2. (b)

      for all \(v\in V(B_x)\) it holds that \(\mu _1(v)+\mu _2(v)=\mu (v)-|v\cap C'|\) and, moreover,

    3. (c)

      \(D_y\left( C', k'_1+|C'|, {\mu _1'}\right) =D_z\left( C', k'_2+|C'|, \mu _2'\right) =1\), where for each \(i\in [2]\)\(\mu _i'\) is the function from \({V(B_x)}\) to \([k]\cup \{0\}\) so that \(\mu _i'(v)=\mu _i(v)+|v\cap C'|\) for all \(v\in V(B_x)\).

    It is obvious that there are \(O^*(k^{O(\omega )})\) different combinations of \(k_1'\)\(k_2'\)\(\mu _1\), and \(\mu _2\) to enumerate. As a result, it takes \(O^*(k^{O(\omega )})\) time to compute \(D_x(C', k', \mu )\). (Note that when \(V(B_x)=\emptyset\), the above recursion indicates that \(D_x(C', k', \mu )=1\) if and only if there are \(k_1'\) and \(k_2'\) as above so that \(D_y(C', k_1'+|C'|,\mu )=D_z(C', k_2'+|C'|,\mu )=1\).)

  • x is an introduce node

  • Let y be the child of x, and let \(\{h\}=B_x\setminus B_y\). We further distinguish between the following two subcases.

    • h is a vote

    • We set \(D_x(C', k', \mu )=1\) if and only if \(\mu (h)=|C'\cap h|\) and \(D_y(C', k', \mu _{-h})=1\).

    • h is a candidate

    • If \(h\not \in C'\), we set \(D_x(C', k', \mu )=0\) when \(k'=|C(T_x)|\), and set \(D_x(C', k', \mu )=D_y(C', k', \mu )\) when \(k'<|C(T_x)|\). If \(h\in C'\), we set \(D_x(C', k', \mu )=1\) if and only if \(D_y(C'\setminus \{h\}, k'-1, \mu ')=1\) where \(\mu ': V(B_y)\rightarrow [k]\cup \{0\}\) is a function so that for all \(v\in V(B_y)\) it holds that \(\mu '(v)=\mu (v)-|v\cap \{h\}|\). (When \(V(B_x)=\emptyset\), \(D_x(C', k', \mu )=D_y(C'{\setminus } \{h\}, k'-1, \mu )\).)

  • x is a forget node

  • Let y be the child of x and let \(\{h\}= B_y\setminus B_x\).

    • h is a vote

    • We set \(D_x(C', k', \mu )=1\) if and only if

      1. (1)

        \(\mu (h)\ge \frac{k+|h|-d}{2}\), and

      2. (2)

        there is a function \(\mu ': V(B_y)\rightarrow [k]\cup \{0\}\) so that \(\mu =\mu '_{-h}\) and \(D_y(C', k', \mu ')=1\).

      The first condition is to ensure that there is a k-committee which contains a valid \(k'\)-committee \(w\) for the entry and the Hamming distance between \(w\) and h is at most \(d\).

    • h is a candidate

    • If \(|C'|=k'\), we set \(D_x(C', k', \mu )=D_y(C', k', \mu )\). Otherwise, we set \(D_x(C', k', \mu )=1\) if and only if \(D_y(C', k', \mu )+D_y(C'\cup \{h\}, k', \mu )\ge 1\).

After the table for the root r is computed, we conclude that the given instance I is a Yes-instance if and only if \(D_r(\emptyset , k, \mu )=1\), where \(\mu\) is an empty function. The reason for this is that by the definition of the table, it holds that \(D_r(\emptyset , k, \mu )=1\) if and only if there is at least one k-committee \(w\subseteq C(T_r)=C\) which is valid for \(D_r(\emptyset , k, \mu )\) and, moreover, for each \(v\in V(T_r)\setminus V(B_r)=V\), it holds that \(|w\cap v|\ge \frac{k+|v|-d}{2}\).

The running times of the algorithms for PAV-Multiwinner and MAV-Multiwinner are dominated by the total size of all tables which is bounded from above by \(O^*(k^{O(\omega )})\). \(\square\)

6.2 Maximum matching

It is well-known that for every bipartite graph G, the size of a maximum matching of G equals the size of a minimum vertex cover of G [54]. As the treewidth of a graph is bounded from above by the size of a minimum vertex cover of the graph [37], it follows from Theorem 11 that CCAV-Multiwinner is FPT with respect to \(\alpha (G_E)\), where E is a given election. We show that MAV-Multiwinner and PAV-Multiwinner are also FPT with respect to this parameter. Observe that the size of a maximum matching of \(G_E\) can be at most \(\min \{m, n\}\), where m is the number of candidates and n is the number of votes in the election E. Consequently, every FPT-algorithm with respect to \(\alpha (G_E)\) also runs in FPT-time in n and m.

Theorem 13

MAV-Multiwinner and PAV-Multiwinner are FPT with respect to \(\alpha (G_E)\), where E is the election in the input.

Proof

Let \(I=(E, k, d)\) be an instance of \(\tau {\textsc {-Multiwinner}}\), where \(E=(C, V)\) and \(\tau \in \{\text {MAV}, \text {PAV}\}\). Let \(G_E\) be the incidence graph of E, and let M be a maximum matching of \(G_E\). Hence, \(\alpha (G_E)=|M|\). For simplicity, we write \(\alpha\) for \(\alpha (G_E)\). Let C(M) and V(M) be the set of candidates and the multiset of votes saturated by M, respectively. Obviously, \(|C(M)|=|V(M)|=\alpha\). We derive algorithms for MAV and PAV as follows.

MAV. We split I into at most \(2^{\alpha }\) subinstances, each of which takes I and a subset \(C'\subseteq C(M)\) of size at most k as input, and asks whether there is a k-committee \(w\subseteq C\) of MAV score at most \(d\) such that \(C'= C(M)\cap w\). It is easy to see that I is a Yes-instance if and only if at least one of the subinstances is a Yes-instance. We show how to solve each subinstance in polynomial time.

Let \(I'=(I, C')\) be a subinstance. Let v be a vote in \(V{\setminus } V(M)\). Assume that \(w\) is a k-committee such that \(w\cap C(M)=C'\). As M is a maximum matching of \(G_E\), no vote from \(V\setminus V(M)\) approves any candidate from \(C\setminus C(M)\). Consequently, the Hamming distance between v and \(w\) is \(|v|+k-2|v\cap C'|\). Hence, if there is a vote in \(V\setminus V(M)\) such that \(|v|+k-2|v\cap C'|>d\), the subinstance is a No-instance. So, let us assume that this is not the case. Now the task is to identify a subset H of \(k-|C'|\) candidates from \(C\setminus C(M)\) such that the Hamming distance between every vote from V(M) and the committee \(H\cup C'\) is at most \(d\), or equivalently, for every \(v\in V(M)\)H contains at least \(\frac{|v|+k-d}{2}-|v\cap C'|\) of v’s approved candidates in \(C\setminus C(M)\). For each \(v\in V(M)\), let \(f(v)=\frac{|v|+k-d}{2}-|v\cap C'|\). We reduce the subinstance into an ILP with a bounded number of variables. Specifically, for each \(U\subseteq V(M)\), let \(C_U\) be the set of candidates in \(C\setminus C(M)\) approved by all votes in U but not approved by any vote in \(V(M){\setminus } U\), and let \(m_U=|C_U|\). For each \(U\subseteq V(M)\), we create one nonnegative integer variable \(x_{U}\) which indicates the number of candidates from \(C_U\) that are supposed to be in a desired committee. The constraints are as follows.

  • First, for each variable \(x_U\) we have that \(0\le x_U\le m_U\).

  • Second, as we seek \(k-|C'|\) candidates from \(C{\setminus } C(M)\), it holds that

    $$\begin{aligned} \sum _{U\subseteq V(M)}x_U=k-|C'|. \end{aligned}$$
  • Finally, for every \(v\in V(M)\), it holds that

    $$\begin{aligned} \sum _{U\subseteq V(M), v\in U}x_U\ge f(v). \end{aligned}$$

    This inequality ensures that the Hamming distance between every vote \(v\in V(M)\) and the desired committee is at most \(d\).

It is known that ILP is FPT with respect to the number of variables [51]. As the above ILP has at most \(2^{\alpha }\) variables, the subinstance can be solved in FPT-time in \(\alpha\). As there are at most \(2^{\alpha }\) subinstances, the whole algorithm runs in FPT-time in \(\alpha\).

PAV. The algorithm is analogous to the above one for MAV. First, we split the given instance I into subinstances by enumerating all possible intersections \(C'\) of C(M) and a desired k-committee. Then, every nonempty vote \(v\in V\setminus V(M)\) such that \(v\cap C'\ne \emptyset\) provides a PAV score \(\sum _{i=1}^{|v\cap C'|}\frac{1}{i}\) to every k-committee whose intersection with C(M) is exactly \(C'\). Let

$$\begin{aligned} d'=\sum _{v\in V\setminus V(M), v\cap C'\ne \emptyset } \sum _{i=1}^{|v\cap C'|}\frac{1}{i}. \end{aligned}$$

The question now is equivalent to solving an instance \(((C, V(M)), C', k, d-d')\) of Annotated PAV-Multiwinner, which can be done in FPT-time in \(|V(M)|=|M|=\alpha\) (Theorem 1). \(\square\)

7 Conclusion

We have investigated the parameterized complexity of \(\tau {\textsc {-Multiwinner}}\) for \(\tau\) being the three prevalent approval-based k-committee selection rules MAV, CCAV, and PAV, aiming at providing plentiful fixed-parameter tractability results with respect to meaningful parameters. We studied many natural single parameters, their combinations, and two structural parameters of incidence graphs of elections, and obtained an almost complete landscape of the parameterized complexity of \(\tau {\textsc {-Multiwinner}}\) for \(\tau \in \{\text {MAV}, \text {CCAV}, \text {PAV}\}\) with respect to these parameters. For a summary of our concrete results, we refer to Table 1. It should be noted that many of our tractability results were obtained by reducing \(\tau {\textsc {-Multiwinner}}\) to well-studied graph/set problems. As advocated by several researchers [30, 36], a remarkable advantage of using the reduction scheme is that we can automatically update our results with the state-of-the-art of these graph/set problems. In addition, though many reductions are trivial and direct, we deem that pointing out the connections between \(\tau {\textsc {-Multiwinner}}\) and the graph/set problems benefits researchers from different communities.

Our exploration leaves several intriguing problems for future research. We select two open questions that we believe to be the most challenging.

Open Question 1

Is PAV-Multiwinner FPT with respect to \(d\)?

Open Question 2

Is PAV-Multiwinner FPT with respect to the treewidth of the incidence graph of the given election?

Besides these open questions, improving the FPT-algorithms presented in the paper or investigating the kernelizations of FPT problems are also promising avenues for future research. We remark that Agrawal et al. [1] studied kernelizations of the Partial Hitting Set problem. Their results imply that CCAV-Multiwinner admits a polynomial kernel with respect to \(d+\triangle _{\text {V}}\). It is interesting to see if the same holds for PAV-Multiwinner. Regarding MAV, Misra, Nabeel, and Singh [57] have shown that MAV-Multiwinner is unlikely to admit any polynomial kernels with respect to \(d+m\) and \(n+k\), assuming standard complexity hypothesis. This means that MAV-Multiwinner is unlikely to admit any polynomial kernels with respect to smaller parameters (e.g., \(d\)n, \(d+\triangle _{\text {V}}\), etc.).

Finally, we would like to mention that recently a few researchers have examined a number of structural parameters based on real electoral data or data generated by promising models, largely motivated by previous theoretical works on the parameterized complexity of voting problems with respect to these parameters (see, e.g., [10, 69]). Although graphs of small treewidth arisen from various research areas have been continually reported in the literature, to the best of our knowledge, analogous works on the treewidth of incidence graphs of elections have not been conducted heretofore. We hope that our algorithms presented in Sect. 6 would inspire such an investigation.