Generalized minimum 0-extension problem and discrete convexity

Dvorak, Martin; Kolmogorov, Vladimir

doi:10.1007/s10107-024-02064-5

Generalized minimum 0-extension problem and discrete convexity

Full Length Paper
Series A
Open access
Published: 07 March 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Mathematical Programming Submit manuscript

Generalized minimum 0-extension problem and discrete convexity

Download PDF

356 Accesses
Explore all metrics

Abstract

Given a fixed finite metric space $(V,\mu )$, the minimum 0-extension problem, denoted as $\mathtt{0\hbox {-}Ext}[{\mu }]$, is equivalent to the following optimization problem: minimize function of the form $\min \nolimits _{x\in V^n} \sum _i f_i(x_i) + \sum _{ij} c_{ij}\hspace{0.5pt}\mu (x_i,x_j)$ where $f_i:V\rightarrow \mathbb {R}$ are functions given by $f_i(x_i)=\sum _{v\in V} c_{vi}\hspace{0.5pt}\mu (x_i,v)$ and $c_{ij},c_{vi}$ are given nonnegative costs. The computational complexity of $\mathtt{0\hbox {-}Ext}[{\mu }]$ has been recently established by Karzanov and by Hirai: if metric $\mu $ is orientable modular then $\mathtt{0\hbox {-}Ext}[{\mu }]$ can be solved in polynomial time, otherwise $\mathtt{0\hbox {-}Ext}[{\mu }]$ is NP-hard. To prove the tractability part, Hirai developed a theory of discrete convex functions on orientable modular graphs generalizing several known classes of functions in discrete convex analysis, such as $L^\natural $-convex functions. We consider a more general version of the problem in which unary functions $f_i(x_i)$ can additionally have terms of the form $c_{uv;i}\hspace{0.5pt}\mu (x_i,\{u,v\})$ for $\{u,\!\hspace{0.5pt}\hspace{0.5pt}v\}\in F$, where set $F\subseteq \left( {\begin{array}{c}V\\ 2\end{array}}\right) $ is fixed. We extend the complexity classification above by providing an explicit condition on $(\mu ,F)$ for the problem to be tractable. In order to prove the tractability part, we generalize Hirai’s theory and define a larger class of discrete convex functions. It covers, in particular, another well-known class of functions, namely submodular functions on an integer lattice. Finally, we improve the complexity of Hirai’s algorithm for solving $\mathtt{0\hbox {-}Ext}[{\mu }]$ on orientable modular graphs.

Discrete convexity and polynomial solvability in minimum 0-extension problems

Article 24 October 2014

Decreasing minimization on M-convex sets: background and structures

Article 27 October 2021

On the edge metric dimension of convex polytopes and its related graphs

Article 28 October 2019

1 Introduction

Consider a metric space $(V,\mu )$ where V is a finite set and $\mu $ is a nonnegative function $V\times V\rightarrow \mathbb R$ satisfying the axioms of a metric: $\mu (x,y)=0$ $\Leftrightarrow $ $x=y$, $\hspace{0.5pt}\mu (x,y)=\mu (y,x)$, $\hspace{0.5pt}\mu (x,y)+\mu (y,z)\ge \mu (x,z)$ for all $x,y,z\in V$. We study optimization problems of the following form:

$$\begin{aligned} \min _{x\in V^n} f(x),\qquad \quad f(x)=\sum _{i\in [n]} f_i(x_i) + \sum _{1\le i<j\le n}c_{ij} \mu (x_i,x_j) \end{aligned}$$

(1)

where weights $c_{ij}$ are nonnegative. If unary terms $f_i:V\rightarrow \mathbb R$ are allowed to be arbitrary nonnegative functions then this is a well-studied Metric Labeling Problem [22]. Another important special case is when the unary terms are given by

$$\begin{aligned} f_i(x_i)=\sum _{v\in V}c_{vi}\mu (x_i,v) \end{aligned}$$

(2)

with nonnegative weights $c_{vi}$. This is a classical facility location problem, known as multifacility location problem [33]. It can be interpreted as follows: we are going to locate n new facilities in V, where the facilities communicate with each other and communicate with existing facilities in V. The cost of the communication is propositional to the distance. The goal is to find a location of minimum total communication cost. The multifacility location problem is also equivalent to the minimum 0-extension problem formulated by Karzanov [20]. We denote $\mathtt{0\hbox {-}Ext}[{\mu }]$ to be class of problems of the form (1), (2).

Optimization problems of the form above have applications in computer vision and related clustering problems in machine learning [4, 11, 14, 22]. $\mathtt{0\hbox {-}Ext}[{\mu }]$ also includes a number of basic combinatorial optimization problems. For example, the multiway cut problem on k vertices can be obtained by setting $(V,\mu )$ to be the uniform metric on $|V|=k$ elements; it can be solved in polynomial time (via a maximum flow algorithm) if $k=2$, and is NP-hard for $k\ge 3$.

We explore a generalization of $\mathtt{0\hbox {-}Ext}[{\mu }]$ in which the unary terms are given by

$$\begin{aligned} f_i(x_i)=\sum _{v\in V}c_{vi}\mu (x_i,v) + \sum _{U\in F}c_{Ui}\hspace{0.5pt}\mu (x_i,U). \end{aligned}$$

(3)

Here F is a fixed set of subsets of V, $c_{vi},c_{Ui}$ are nonnegative weights, and $\mu (x_i,U)=\min _{v\in U}\mu (x_i,v)$. We refer to this generalization as $\mathtt{0\hbox {-}Ext}[{\mu ,F}]$. In the facility location interpretation, allowing terms of the form $c_{Ui} \mu (x_i,U)$ means that the i-th facility can be “served” by any of the facilities in U, and it can choose to communicate with the closest facility to minimize the communication cost.

Note that $\mathtt{0\hbox {-}Ext}[{\mu }]=\mathtt{0\hbox {-}Ext}[{\mu ,\varnothing }]$. Furthermore, $\mathtt{0\hbox {-}Ext}[{\mu ,2^V}]$, where $2^V=\{U\,|\,U\subseteq V\}$ is the set of all subsets of V, is the restricted Metric Labeling Problem [8], which is equivalent to the Metric Labeling Problem [6, Section 5.2].

1.1 Complexity classifications

The computational complexity of $\mathtt{0\hbox {-}Ext}[{\mu }]$ has been established in [15, 20]. The tractability criterion is based on the properties of graph $H_\mu =(V,E,w)$ defined as the minimal undirected weighted graph whose path metric equals $\mu $. Clearly, we have

$$\begin{aligned} E=\left\{ xy\in \left( {\begin{array}{c}V\\ 2\end{array}}\right) \;\bigg |\; \forall z\in V-\{x,y\}:\; \mu (x,y)<\mu (x,z)+\mu (z,y) \right\} \end{aligned}$$

and w is the restriction of $\mu $ to E. For brevity, we usually denote the elements of $\left( {\begin{array}{c}V\\ 2\end{array}}\right) $ as xy instead of $\{x,y\}$.

In order to state the classification of $\mathtt{0\hbox {-}Ext}[{\mu }]$, we need to introduce a few definitions.

Orientable modular graphs Let us fix metric $\mu $. For nodes $x,y\in V$ let $I(x,y)=I_\mu (x,y)$ be the metric interval of x, y, i.e. the set of points $z\in V$ satisfying $\mu (x,z)+\mu (z,y)=\mu (x,y)$. Metric $\mu $ is called modular if for every triplet $x,y,z\in V$ the intersection $I(x,y)\cap I(y,z) \cap I(x,z)$ is non-empty. (Points in this intersection are called medians of x, y, z.) We say that graph H is modular if $H=H_\mu $ for a modular metric $\mu $.

Let o be an edge-orientation of graph H with the relation $\rightarrow _o$ on $V\times V$. This orientation is called admissible for H if, for every 4-cycle $(x_1,x_2,x_3,x_4)$, condition $x_1\rightarrow _o x_2$ implies $x_4\rightarrow _o x_3$. H is called orientable if it has an admissible orientation.

Theorem 1

([20]) If $H_\mu $ is not orientable or not modular then $\mathtt{0\hbox {-}Ext}[{\mu }]$ is NP-hard.

Theorem 2

([15]) If $H_\mu $ is orientable modular then $\mathtt{0\hbox {-}Ext}[{\mu }]$ can be solved in polynomial time.^{Footnote 1}

Our results We extend the classification above to problems $\mathtt{0\hbox {-}Ext}[{\mu ,F}]$ in which all subsets $U\in F$ have cardinality 2, i.e. $F\subseteq \left( {\begin{array}{c}V\\ 2\end{array}}\right) $. To formulate the tractability criterion, we need to introduce some definitions. Let o be an orientation of (H, F), i.e. each edge of H is assigned an orientation, and each element of F is assigned an orientation. We say that o is admissible for (H, F) if it is admissible for H and, for every $\{x,y\}\in F$ with $x\rightarrow _o y$, the following holds: if P is a shortest x-y path in H then all edges of P are oriented according to o. We say that H is F-orientable modular if it is orientable modular and (H, F) admits an admissible orientation o. We can now formulate the main result of this paper.

Theorem 3

If $H_\mu $ is F-orientable modular then $\mathtt{0\hbox {-}Ext}[{\mu ,F}]$ can be solved in polynomial time. Otherwise $\mathtt{0\hbox {-}Ext}[{\mu ,F}]$ is NP-hard.

To prove the tractability part, we define L-convex functions on extended modular complexes, and show that they can be minimized in polynomial time. This generalizes L-convex functions on modular complexes introduced by Hirai in [15, 16].

1.2 Discrete convex analysis

As Hirai remarks, the approach in [15, 16] had been inspired by discrete convex analysis developed in particular in [13, 27, 28, 30, 32] and [12, Chapter VII]. This is a theory of convex functions on integer lattice $\mathbb Z^n$, with the goal of providing a unified framework for polynomially solvable combinatorial optimization problems including network flows, matroids, and submodular functions. Hirai’s work extends this theory to more general graph structures, in particular to orientable modular graphs, and provides a unified framework for polynomially solvable minimum 0-extension problems and related multiflow problems.

We develop a yet another generalization. To illustrate the relation to previous work, consider two fundamental classes of functions on the integer lattice $V=[k]=\{1,2,\ldots ,k\}$: submodular functions and $L^\natural $-convex functions. These are functions $f:[k]^n \rightarrow \overline{\mathbb R}$ satisfying conditions (4) and (5), respectively^{Footnote 2}

(4)

(5)

where all operations are applied componentwise.

If, for example, $f(x)=\sum _i f_i(x_i) + \sum _{ij} f_{ij}(x_j-x_i)$, then f is submodular if all functions $f_{ij}$ are convex, and f is $L^\natural $-convex if all functions $f_i$ and $f_{ij}$ are convex. The class of submodular functions on [k] is strictly larger than the class of $L^\natural $-functions. However, $L^\natural $-convex functions possess additional properties that allow more efficient minimization algorithms, such as the Steepest Descent Algorithm [24, 29, 31].

The theory developed in [15, 16] covers $L^\natural $-convex functions and several other function classes, such as bisubmodular functions, k-submodular functions [18], skew-bisubmodular functions [19], and strongly-tree submodular functions [23]. However, it excludes submodular functions on [k] for $k\ge 3$, which is a fundamental class of functions in discrete convex analysis. This paper fills this gap by introducing a unified framework that includes all classes of functions mentioned above.

Algorithms for solving $\mathtt{0\hbox {-}Ext}[{\mu }]$ and $\mathtt{0\hbox {-}Ext}[{\mu ,F}]$ The tractability of $\mathtt{0\hbox {-}Ext}[{\mu }]$ for orientable modular $\mu $ was proven in [16] as follows. Given an instance $f:V^n\rightarrow {{\overline{{\mathbb R}}}}$, Hirai defines a different instance $f^*_\times :(V^*)^n\rightarrow {{\overline{{\mathbb R}}}}$ with the same minimum, where $|V^*|=O(|V|^2)$. Function $f^*_\times $ is then minimized using the Steepest Descent Algorithm (SDA). This is an iterative technique that at each step computes a minimizer of $f^*_\times $ in a certain local neighborhood of the current iterate (by solving a linear programming relaxation). We refer to this technique as the SDA$^*$ approach.

We present an alternative algorithm (for tractable classes of $\mathtt{0\hbox {-}Ext}[{\mu ,F}]$) that minimizes function f directly via a version of SDA that we call -SDA (“diamond-SDA”). Both approaches terminate after O(|V|) steps. However, one step of SDA$^*$ is more expensive than one step of SDA: the LP problem involves up to $O(|V^*|)=O(|V|^2)$ labels per node in the former approach compared to O(|V|) labels in the latter approach. Thus, we improve the complexity of solving $\mathtt{0\hbox {-}Ext}[{\mu }]$.

Orthogonal generalizations of the minimum 0-extension problem In [16] Hirai considered the minimum 0-extension problem on swm-graphs (that generalize orientable modular graphs), and defined L-extendable functions on swm-graphs. Minimizing L-extendable functions on swm-graphs is an NP-hard problem (unless the graph is orientable modular); however, these functions admit a discrete relaxation on an orientable modular graph (which is an L-convex function). The relaxation can be minimized in polynomial time and yields a partial optimal solution for the original function. Based on this, Chalopin, Chepoi, Hirai and Osajda [5] obtained a 2-approximation algorithm for the minimum 0-extension problem on swm-graphs. They also developed the theory of swm-graphs.

In [17] Hirai and Mizutani considered minimum 0-extension problem for directed metrics, and provided some partial results (including a dichotomy for directed metrics of a star graph).

1.3 Valued Constraint Satisfaction Problems (VCSPs)

Results of this paper can be naturally stated in the framework of Valued Constraint Satisfaction Problems (VCSPs). This framework is defined below.

Let us fix a finite set D called a domain. A cost function over D of arity n is a function of the form $f:D^n\rightarrow \overline{\mathbb R}$. It is called finite-valued if $f(x)<\infty $ for all $x\in D^n$. We denote ${{\,\mathrm{{\texttt {dom}}}\,}}f=\{x\in D^n\,|\,f(x)<\infty \}$. A (VCSP) language over D is a (possibly infinite) set $\Phi $ of cost functions over D. Language $\Phi $ is called finite-valued if all functions $f\in \Phi $ are finite-valued.

A VCSP instance $\mathcal{I}$ is a function $D^n\rightarrow {{\overline{{\mathbb R}}}}$ given by

$$\begin{aligned} f_\mathcal{I}(x)\ =\ \sum _{t\in T} f_t(x_{v(t,1)},\ldots ,x_{v(t,n_t)}). \end{aligned}$$

(6)

It is specified by a finite set of variables [n], finite set of terms T, cost functions $f_t: D^{n_t}\rightarrow {{\overline{{\mathbb R}}}}$ of arity $n_t$ and indices $v(t, k)\in [n]$ for $t\in T, k =1,\ldots , n_t$. A solution to $\mathcal{I}$ is a labeling $x\in [n]^V$ that minimizes $f_\mathcal{I}(x)$. Instance $\mathcal{I}$ is called a $\Phi $-instance if all terms $f_t$ belong to $\Phi $. The set of all $\Phi $-instances is denoted as ${\text {VCSP}}(\Phi )$. Language $\Phi $ with finite $|\Phi |$ is called tractable if instances in ${\text {VCSP}}(\Phi )$ can be solved in polynomial time, and NP-hard if ${\text {VCSP}}(\Phi )$ is NP-hard. If $|\Phi |$ is infinite then $\Phi $ is called tractable if every finite $\Phi '\subseteq \Phi $ is tractable, and NP-hard if there exists finite $\Phi '\subseteq \Phi $ which is NP-hard.

A key algorithmic tool in the VCSP theory is the Basic Linear Programming (BLP) relaxation of instance $\mathcal{I}$. We refer to [25] for the description of this relaxation. We say that BLP solves instance $\mathcal{I}$ if this relaxation is tight, i.e. its optimal value equals $\min _{x\in D^n}f_\mathcal{I}(x)$.

The following results are known; we refer to Sect. 6 for the definition of a “binary symmetric fractional polymorphism”.

Theorem 4

([25]) Let $\Phi $ be a finite-valued language. Then BLP solves $\Phi $ if and only if $\Phi $ admits a binary symmetric fractional polymorphism. If the condition holds, then an optimal solution of an $\Phi $-instance can be computed in polynomial time.

Theorem 5

([34]) If a finite-valued language $\Phi $ does not satisfy the condition in Theorem 4 then $\Phi $ is NP-hard.

Application to the minimum 0-extension problem Consider again a metric space $(V,\mu )$ and subset $F\subseteq \left( {\begin{array}{c}V\\ 2\end{array}}\right) $. For a set $U\subseteq V$ let $\delta _U:V\rightarrow \{0,\infty \}$ be the indicator function of set U, with $\delta _U(v)=0$ iff $v\in U$. For brevity, we write $\delta _{\{u_1,\ldots , u_k\}}$ as $\delta _{u_1\ldots u_k}$. Clearly, the minimum 0-extension problems introduced earlier can be equivalently defined by the following languages over domain $D=V$:

$$\begin{aligned} \mathtt{0\hbox {-}Ext}[{\mu }]= & {} \{\mu \} \;\cup \; \{\delta _u\,{:}\,u\in V\} \\ \mathtt{0\hbox {-}Ext}[{\mu ,F}]= & {} \{\mu \} \;\cup \; \{\delta _u\,{:}\,u\in V\} \;\cup \; \{\delta _U\,{:}\,U\in F\} \end{aligned}$$

Note that the existence of the dichotomy given in Theorem 3 follows from Theorems 4 and 5 (but not the specific criterion for tractability).^{Footnote 3}

1.4 Summary of contributions

As described in earlier sections, our first contribution is complexity classification of $\mathtt{0\hbox {-}Ext}[{\mu ,F}]$ for subsets $F\subseteq \left( {\begin{array}{c}V\\ 2\end{array}}\right) $ with an explicit criterion for tractability, which is achieved by generalizing Hirai’s theory of modular complexes to extended modular complexes. While some of the proofs are relatively straightforward extensions of the corresponding proofs in [15, 16], there are also a number of proofs where we use novel techniques. We already mentioned a new -SDA algorithm that improves the complexity of solving the standard 0-minimum extension problem. In order to analyze this algorithm, we introduce new binary operations for (extended) modular complexes and establish their properties. Another key technical component that we use is the notion of f-extremality that we introduce. We believe that these concepts deepen our understanding of orientable modular graphs.

As our last contribution, we prove that the BLP relaxation directly solves L-convex functions on extended modular complexes. Previously, this was shown to hold (for standard modular complexes) only assuming that $\texttt {P}\ne \texttt {NP}$ (see Section 6 in [15]).

The rest of the paper is organized as follows. Section 2 reviews Hirai’s theory and defines L-convex functions on modular complexes and the SDA algorithm for minimizing them. Section 3 generalizes this to L-convex functions on extended modular complexes and presents -SDA algorithm. Both sections use the notion of submodular functions on valuated modular semilattices, which are formally defined in Sect. 4. All proofs missing in Sect. 3 are given in Sects. 5 and 6.1; this completes the proof of the tractability direction of Theorem 3. The NP-hardness direction of Theorem 3 is proven in Sect. 6.2. Section 7 concludes the paper with a list of open problems.

2 Background on orientable modular graphs

Notation for graphs If H is a simple undirected graph and o its edge orientation, then the pair (H, o) can be viewed as a simple directed graph. We will usually denote this graph as $\Gamma =(V_\Gamma ,E_\Gamma ,w_\Gamma )$. $\Gamma $ is called oriented modular, or a modular complex, if it is an admissible orientation of an orientable modular graph [15, 21].

We let $\rightarrow _\Gamma $ be the edge relation of $\Gamma $, i.e. condition $u\rightarrow _\Gamma v$ means that there is an edge from u to v in $\Gamma $. When $\Gamma $ is clear from the context, we may omit subscript $\Gamma $ and write $V,E,w,\rightarrow $, etc. A path $(u_0,u_1,\ldots ,u_k)$ in a directed graph $\Gamma $ is defined as a path in the undirected version of $\Gamma $, i.e. for each i we must have either $u_i\rightarrow u_{i+1}$ or $u_{i+1}\rightarrow u_{i}$. An x-y path in $\Gamma $ is a path from $x\in V$ to $y\in V$. With some abuse of notation we sometimes view $\Gamma $ as a set of its nodes, and write e.g. $v\in \Gamma $ to mean $v\in V$.

Orbits For an undirected graph $H=(V,E,w)$ edges $e,e'\in E$ are called projective if there is a sequence of edges $(e_0,e_1,\ldots ,e_m)$ with $(e_0,e_m)=(e,e')$ such that $e_i,e_{i+1}$ are vertex-disjoint and belong to a common 4-cycle of H. Clearly, projectivity is an equivalence relation on E. An equivalence class of this relation in called an orbit [21]. Edge weights $w:E\rightarrow \mathbb R_{>0}$ are called orbit-invariant if $w(e)=w(e')$ for any pair of edges $e,e'$ in the same orbit (equivalently, for vertex-disjoint edges $e,e'$ belonging to a common 4-cycle).

Theorem 6

([1, 21]) Consider undirected graph $H=H_\mu =(V,E,w)$.

(a):: If $\mu $ is modular then w is orbit-invariant, and path P is shortest in H if and only if it is shortest in (V, E, 1).
(b):: The following conditions are equivalent: (i) H is (orientable) modular; (ii) w is orbit-invariant and (V, E, 1) is (orientable) modular.

Metric spaces For a weighted directed or undirected graph $G=(V,E,w)$ let $\mu _G$ and $d_G$ (or simply $\mu $ and d, when G is clear) be its path metrics w.r.t. edge lengths w and 1, respectively. If graph G is directed then edge orientations are again ignored.

For a metric space $(V,\mu )$, subset $U\subseteq V$ is called convex if $I(x,y)\subseteq U$ for every $x,y\in U$. Note, if G is an orientable (or oriented) modular graph then the definitions of the metric interval I(x, y) and of convex sets coincide for metric spaces $(V_G,\mu _G)$ and $(V_G,d_G)$ (by Theorem 6).

Posets A modular complex $\Gamma $ known to be an acyclic graph [15, Lemma 2.3], and thus induces a partial order $\preceq $ on V. Partially ordered sets (posets) play a key role in the study of oriented modular graphs. Below we describe basic facts about posets and terminology that we use, mainly following [15, 16].

Consider poset $\mathcal{L}$ with relation $\preceq $. For elements p, q with $p\preceq q$, the interval [p, q] is the set $\{x\in \mathcal{L}\,|\,p\preceq x\preceq q\}$. A chain from p to q of length k is a sequence $u_0\prec u_1\prec \ldots \prec u_k$ with $(u_0,u_k)=(p,q)$, where notation $a\prec b$ means that $a\preceq b$ and $a\ne b$. The length r[p, q] of interval [p, q] is defined as the maximum length of a chain from p to q. If $\mathcal{L}$ has the lowest element (denoted as 0) then the rank r(a) of element $a\in \mathcal{L}$ is defined by $r(a)=r[0,a]$, and elements of rank 1 are called atoms.

Element q covers p if $p\prec q$ and there is no element u with $p\prec u\prec q$. The Hasse diagram of $\mathcal{L}$ is a directed graph on $\mathcal{L}$ with the set of edges $\{p\rightarrow q\,|\,q\hbox { covers }p\}$, and the covering graph of $\mathcal{L}$ is the corresponding undirected graph.

A pair $x,y\in \mathcal{L}$ is said to be upper-bounded (resp. lower-bounded) if x, y have a common upper bound (resp. common lower bound). The lowest common upper bound, if exists, is denoted by $x\vee y$ (the “join” of x, y), and the greatest common lower bound, if exists, is denoted by $x\wedge y$ (the “meet” of x, y). $\mathcal{L}$ is called a (meet-)semilattice if every pair $x,y\in \mathcal{L}$ has a meet, and it is called a lattice if every pair $x,y\in \mathcal{L}$ has both a join and a meet. If $\mathcal{L}$ is a semilattice and $x,y\in \mathcal{L}$ are upper-bounded then $x\vee y$ is known to exist.

A (positive) valuation of a semilattice $\mathcal{L}$ is a function $v:\mathcal{L}\rightarrow \mathbb R$ satisfying

$$\begin{aligned} v(q)-v(p) > 0&\qquad \qquad \forall p,q\in \mathcal{L}:\,p\prec q \end{aligned}$$

(7a)

$$\begin{aligned} v(p)+v(q) = v(p\wedge q)+v(p\vee q)&\qquad \qquad \forall p,q\in \mathcal{L}:\,p,q\hbox { upper-bounded} \end{aligned}$$

(7b)

In particular, if $\mathcal{L}$ is a lattice then (7b) should hold for all $p,q\in \mathcal{L}$. A semilattice with valuation v will be called a valuated semilattice. We will view the Hasse diagram of a valuated semilattice $\mathcal{L}$ (and the corresponding covering graph) as weighted graphs, where the weight of $p\rightarrow q$ is given by $v(q)-v(p)>0$.

A lattice $\mathcal{L}$ is called modular if for every $x,y,z \in \mathcal{L}$ with $x \preceq z$ there holds $x \vee (y \wedge z) = (x \vee y) \wedge z$. A semilattice $\mathcal{L}$ is called modular [2] if for every $p\in \mathcal{L}$ poset $(\{x\in \mathcal{L}\,|\,x\preceq p\},\preceq )$ is a modular lattice, and for every $x,y,z \in \mathcal{L}$ the join $x \vee y \vee z$ exists provided that $x \vee y$, $y \vee z$, $z \vee x$ exist. It is known that a lattice $\mathcal{L}$ is modular if and only if its rank function $r(\cdot )$ is a valuation [3, Chapter III, Corollary 1]. Furthermore, a (semi)lattice is modular if and only if its covering graph is modular, see [35, Proposition 6.2.1]; [2, Theorem 5.4]. The Hasse diagram of a (valuated) modular semilattice is known to be oriented modular [15, page 13].

Boolean pairs From now on we fix a modular complex $\Gamma =(V,E,w)$. Graph B is called a cube graph if it is isomorphic to the Hasse diagram of the Boolean lattice $\{0,1\}^k$ for some $k\ge 0$. A pair of vertices (p, q) of $\Gamma $ is called a Boolean pair if $\Gamma $ contains cube graph B as a subgraph so that p and q are respectively the source and the sink of B. Let $\sqsubseteq $ be the following relation on V: $p\sqsubseteq q$ iff (p, q) is a Boolean pair in $\Gamma $. We have $p\sqsubseteq p$ for any $p\in V$, and condition $p\sqsubseteq q$ implies that $p\preceq q$. Graph $\Gamma $ is called well-oriented if the opposite implication holds, i.e. if relations $\preceq $ and $\sqsubseteq $ are the same. Note that relation $\sqsubseteq $ is not necessarily transitive. We write $p\sqsubset q$ to mean $p\sqsubseteq q$ and $p\ne q$. Let be the graph with nodes V and edges $\{p\rightarrow q\,|\,p,q\in V,p\sqsubset q\}$. Clearly, $\Gamma $ is a subgraph of (ignoring edge weights).

For a vertex $p\in V$ define the following subsets of V:

$$\begin{aligned} \mathcal{L}^\uparrow _p(\Gamma )=\{q\in V\,|\,p\preceq q\}&\qquad \mathcal{L}^\downarrow _p(\Gamma )=\{q\in V\,|\,q\preceq p\}&\end{aligned}$$

(8a)

$$\begin{aligned} \mathcal{L}^+_p(\Gamma )=\{q\in V\,|\,p\sqsubseteq q\}&\qquad \mathcal{L}^-_p(\Gamma )=\{q\in V\,|\,q\sqsubseteq p\}&\qquad \mathcal{L}^\pm _p(\Gamma )=\mathcal{L}^+_p(\Gamma ) \cup \mathcal{L}^-_p(\Gamma ) \end{aligned}$$

(8b)

When $\Gamma $ is clear from the context, we will omit it for brevity (and if $\Gamma $ is not clear, we may write $\preceq _\Gamma $ and $\sqsubseteq _\Gamma $ instead of $\preceq $ and $\sqsubseteq $). We view $\mathcal{L}^\uparrow _p,\mathcal{L}^+_p$ as posets with relation $\preceq $, and $\mathcal{L}^\downarrow _p,\mathcal{L}^-_p$ as posets with the reverse of relation $\preceq $. Note, if $\Gamma $ is well-oriented then $\mathcal{L}_p^+=\mathcal{L}^\uparrow _p$ and $\mathcal{L}_p^-=\mathcal{L}^\downarrow _p$ for every node p of $\Gamma $.

Lemma 7

[[15, Proposition 4.1, Theorem 4.2, Lemma 4.14]] Let $\Gamma $ be a modular complex.

(a):: If elements a, b are upper-bounded then $a\vee b$ exists. Similarly, if a, b are lower-bounded then $a\wedge b$ exists.
(b):: Consider elements p, q with $p\preceq q$. Then $\mathcal{L}^\uparrow _p$, $\mathcal{L}^\downarrow _p$, $\mathcal{L}^+_p$, $\mathcal{L}^-_p$ are modular semilattices, and $[p,q]=\mathcal{L}^\uparrow _p\cap \mathcal{L}^\downarrow _p$ is a modular lattice. Furthermore, these (semi)lattices are convex in $\Gamma $, and function $v(\cdot )$ defined via $v(a)=\mu _\Gamma (p,a)$ is a valid valuation of these (semi)lattices.

We will always view $\mathcal{L}^\uparrow _p$, $\mathcal{L}^\downarrow _p$, $\mathcal{L}^+_p$, $\mathcal{L}^-_p$ as valuated semilattices, where the valuation is defined as in the lemma.

L-convex functions Next, we review the notion of an L-convex function on a modular complex $\Gamma $ introduced by Hirai in [15, 16]. The definition involves the following steps.

First, Hirai defines the notion of a submodular function on a valuated modular semilattice $\mathcal{L}$. These are functions $f:\mathcal{L}\rightarrow \overline{\mathbb R}$ satisfying certain linear inequalities. Formulating these inequalities is rather lengthy, and we defer it to Sect. 4.
Second, Hirai defines 2-subdivision of $\Gamma $ as the directed weighted graph $\Gamma ^*=(V^*,E^*,w^*)$ constructed as follows:
1. (i)
  set $V^*=\{[p,q]\,|\,p\sqsubseteq q\}$;
2. (ii)
  for each $[p,q],[p,q']\in V^*$ with $q\rightarrow _\Gamma q'$ add edge $[p,q]\rightarrow [p,q']$ to $\Gamma ^*$ with weight $ w(qq')$;
3. (iii)
  for each $[p',q],[p,q]\in V^*$ with $p'\rightarrow _\Gamma p$ add edge $[p,q]\rightarrow [p',q]$ to $\Gamma ^*$ with weight $ w(p'p)$.
It can be seen that graph $(V^*,E^*)$ is the Hasse diagram of the poset $(V^*,\subseteq )$ (which is the definition used in [16]).^{Footnote 4} Hirai proves that graph $\Gamma ^*$ is oriented modular ([15, Theorem 4.3]) and well-oriented ([15, Lemma 2.14]). Consequently, poset $\mathcal{L}_{[p,p]}^+(\Gamma ^*)=\mathcal{L}_{[p,p]}^\uparrow (\Gamma ^*)$ is a valuated modular semilattice for each $p\in V$. For brevity, this poset will be denoted as $\mathcal{L}_{p}^*(\Gamma )$, or simply as $\mathcal{L}_{p}^*$.
Each function $f:V\rightarrow \overline{\mathbb R}$ is extended to a function $f^*:V^*\rightarrow \overline{\mathbb R}$ via $f^*([p,q])=f(p)+f(q)$.
Function $f:V\rightarrow \overline{\mathbb R}$ is now called L-convex on $\Gamma $ if (i) subset ${{\,\mathrm{{\texttt {dom}}}\,}}f\subseteq V$ is connected in , and (ii) for every $p\in V$, the restriction of $f^*$ to $\mathcal{L}^*_{p}$ is submodular on valuated modular semilattice $\mathcal{L}^*_{p}$.

Minimum 0-extension problem and L-convex functions Next, we describe the relation between problem $\mathtt{0\hbox {-}Ext}[{\mu }]$ and L-convex functions on $\Gamma $ (where $\mu $ is an orientable modular metric and $\Gamma $ is an admissible orientation of $H_\mu $).

If $\Gamma ,\Gamma '$ are two modular complexes, then their Cartesian product $\Gamma \times \Gamma '$ is defined as the directed graph with the vertex set $V_\Gamma \times V_{\Gamma '}$ and the following edge set: there is an edge $(p,p')\rightarrow (q,q')$ iff either (i) $p=q$ and $p'\rightarrow _{\Gamma '} q'$, or (ii) $p'=q'$ and $p\rightarrow _\Gamma q$. The weight of the edge is $w_{\Gamma '}(p'q')$ in the first case and $w_{\Gamma }(pq)$ in the second case. The n-fold Cartesian product $\Gamma \times \ldots \times \Gamma $ is denoted as $\Gamma ^n$.

A Cartesian product $\mathcal{L}\times \mathcal{L}'$ of two posets $\mathcal{L},\mathcal{L}'$ is defined in a natural way (with $(p,p')\preceq (q,q')$ iff $p\preceq p$ and $q\preceq q'$). It is straightforward to verify from definitions that if $\mathcal{L},\mathcal{L}'$ are modular semilattices then so is $\mathcal{L}\times \mathcal{L}'$. If $\mathcal{L},\mathcal{L}'$ are valuated modular semilattices with valuations $v_{\mathcal{L}},v_{\mathcal{L}'}$ then $\mathcal{L}\times \mathcal{L}'$ is also assumed to be valuated with the valuation $v_{\mathcal{L}\times \mathcal{L}'}(p,p')=v_{\mathcal{L}}(p)+v_{\mathcal{L}'}(p')$.

Lemma 8

([15, Lemma 4.7]) Consider modular complexes $\Gamma ,\Gamma '$ and element $(p,p')\in \Gamma \times \Gamma '$.

(a):: Graph $\Gamma \times \Gamma '$ is a modular complex (i.e. oriented modular).
(b):: $\mathcal{L}^\sigma _{(p,p')}(\Gamma \times \Gamma ')=\mathcal{L}^\sigma _{p}(\Gamma )\times \mathcal{L}^\sigma _{p'}(\Gamma ')$ for $\sigma \in \{-,+\}$.

Theorem 9

([15, Theorem 4.8 and Lemma 4.9]; [16, Lemmas 4.1, 4.2 and Proposition 4.4]) Consider modular complexes $\Gamma ,\Gamma '$ and functions $f,f':\Gamma \rightarrow \overline{\mathbb R}$ and $\tilde{f}:\Gamma \times \Gamma '\rightarrow \overline{\mathbb R}$.

(a):: If $f,f'$ are L-convex on $\Gamma $ then $f+f'$ and $c\cdot f$ for $c\in \mathbb R_+$ are also L-convex on $\Gamma $.
(b):: If f is L-convex on $\Gamma $ and ${\tilde{f}}(p,p')=f(p)$ for $(p,p')\in \Gamma \times \Gamma '$ then ${\tilde{f}}$ is L-convex on $\Gamma \times \Gamma '$.
(c):: If ${\tilde{f}}$ is L-convex on $\Gamma \times \Gamma '$ and $f(p)={\tilde{f}}(p,p')$ for fixed $p'\in \Gamma '$ then f is L-convex on $\Gamma $.
(d):: The indicator function $\delta _U:V\rightarrow \{0,\infty \}$ of a $d_\Gamma $-convex set U is L-convex on $\Gamma $.
(e):: Function $\mu _\Gamma :\Gamma \times \Gamma \rightarrow \mathbb R_+$ is L-convex on $\Gamma \times \Gamma $.
(f):: If f is L-convex on $\Gamma $ then the restriction of f to $\mathcal{L}^\sigma _p(\Gamma )$ is submodular on $\mathcal{L}^\sigma _p(\Gamma )$ for every $p\in \Gamma $ and $\sigma \in \{-,+\}$.

It follows that an instance of $\mathtt{0\hbox {-}Ext}[{\mu }]$ can be reduced to the problem of minimizing function $f:\Gamma ^n\rightarrow \overline{\mathbb R}$, $n\ge 1$ such that (i) f is L-convex on $\Gamma ^n$, and (ii) $f(x_1,\ldots ,x_n)$ can be written as a sum of unary and pairwise terms that are L-convex on $\Gamma $ and $\Gamma \times \Gamma $, respectively. This minimization problem is considered next.

Minimizing L-convex functions A key property of L-convex functions is that local optimality implies global optimality.

Theorem 10

[[15, Lemma 2.3]] Let $f:V\rightarrow \overline{\mathbb R}$ be an L-convex function on a modular complex $\Gamma $. If p is a local minimizer of f on $\mathcal{L}^\pm _p(\Gamma )$, i.e. $f(p)=\min _{q\in \mathcal{L}^\pm _p(\Gamma )}f(q)$, then it is also a global minimizer of f, i.e. $f(p)=\min _{q\in V}f(q)$.

This theorem implies that the Steepest Descent Algorithm (SDA) given below is guaranteed to produce a global minimizer of f after a finite number of steps.

By Lemma 8, for $x=(x_1,\ldots ,x_n)$ we have $\mathcal{L}^\sigma _x(\Gamma ^n)=\mathcal{L}^\sigma _{x_1}(\Gamma )\times \ldots \times \mathcal{L}^\sigma _{x_n}(\Gamma )$ for $\sigma \in \{-,+\}$. By Theorem 9(f), function f is submodular on $\mathcal{L}^\sigma _x(\Gamma ^n)$. The result below thus implies that the two minimization problems in line 3 can be solved in polynomial time assuming that f is an instance of $\mathtt{0\hbox {-}Ext}[{\mu }]$.

Theorem 11

[15, Theorem 3.9] Consider VCSP instance $f:\mathcal{L}_1\times \ldots \times \mathcal{L}_n\rightarrow \overline{\mathbb R}$. Suppose that $\mathcal{L}_1,\ldots ,\mathcal{L}_n$ are valuated modular semilattices and function f is submodular on $\mathcal{L}=\mathcal{L}_1\times \ldots \times \mathcal{L}_n$. Then BLP relaxation solves f.

To show that $\mathtt{0\hbox {-}Ext}[{\mu }]$ can be solved in polynomial time, a few additional definitions and results are needed. Elements p, q of a modular complex $\Gamma $ are said to be -neighbors if $p,q\in [a,b]$ for some a, b with $a\sqsubseteq b$. Equivalently, p, q are -neighbors if $p\wedge q,p\vee q$ exist and $p\wedge q\sqsubseteq p\vee q$. Let be an undirected unweighted graph on nodes $V_\Gamma $ such that p, q are connected by an edge in if and only if p, q are -neighbors. This graph is called a thickening of $\Gamma $. The distance from p to q in will be denoted as , or simply as . These distances will give a bound on the number of steps of SDA.

Theorem 12

[16, Theorem 4.7, Lemma 2.18] Suppose that modular complex $\Gamma $ is well-oriented, and function $f:\Gamma ^n\rightarrow \overline{\mathbb R}$ is L-convex on $\Gamma ^n$. SDA terminates after at most iterations where x is the initial vertex and $\texttt{opt}_i(f)=\{x^*_i\,|\,x^*\in \mathop {\mathrm {arg\,min}}\limits f\}\subseteq V_\Gamma $ is the set of minimizers of f projected to the i-th component.

Theorem 13

[5, Proposition 6.10], [16, Lemma 2.14] If $\Gamma $ is a modular complex then graph $\Gamma ^*$ is well-oriented.

Theorem 14

[16, Proposition 4.9], [15, Lemma 4.7] If $\Gamma $ is a modular complex and function $f:\Gamma ^n\rightarrow \overline{\mathbb R}$ is L-convex on $\Gamma ^n$ then function $f^*_\times :(\Gamma ^*)^n\rightarrow \overline{\mathbb R}$ defined via $f^*_\times ([x_1,y_1],\ldots ,[x_n,y_n])=f(x)+f(y)$ is L-convex on $(\Gamma ^*)^n$.

From the results above one can now conclude that any instance $f:\Gamma ^n\rightarrow \overline{\mathbb R}$ of $\mathtt{0\hbox {-}Ext}[{\mu }]$ for an oriented modular metric $\mu $ can be solved in polynomial time. Indeed, apply SDA to minimize function $f^*_\times :(\Gamma ^*)^n\rightarrow \overline{\mathbb R}$. It will produce a minimizer $([x_1,y_1],\ldots ,[x_n,y_n])$ after at most iterations; both x and y are minimizers of f. Hirai gives the bound in [16, Theorem 5.7]. As pointed out by an anonymous reviewer, this bound can be improved to .

Note that in the earlier paper [15] Hirai proved that $\mathtt{0\hbox {-}Ext}[{\mu }]$ can be solved in weakly polynomial time via a different algorithm, namely the Steepest Descent Algorithm (applied to $f:\Gamma ^n\rightarrow \overline{\mathbb R}$) but combined with a cost-scaling approach.

Remark 1

Our terminology is slightly different from that of [16]. In particular, -neighbors were called $\Delta $-neighbors in [16], and a path in was called a $\Delta '$-path.

3 Extended modular complexes

To prove our main tractability result from Theorem 3, we will introduce the following definition.

Definition 15

Let $\Gamma $ be a modular complex. Binary relation $\sqsubseteq $ on $V=V_\Gamma $ is called admissible if it coarsens $\preceq $ (i.e. $p\sqsubseteq q$ implies $p\preceq q$), $p\sqsubseteq p$ for every $p\in V$, $p\sqsubseteq q$ for every edge $p\rightarrow q$, and the following conditions hold.

(15a):: Suppose that $p\sqsubseteq q$, $a\preceq b$ and $a,b\in [p,q]$. Then $a\sqsubseteq b$.
(15b):: Suppose that $p\sqsubseteq q_1$, $p\sqsubseteq q_2$ and $q_1\vee q_2$ exists. Then $p\sqsubseteq q_1\vee q_2$.
(15c):: Suppose that $p_1\sqsubseteq q$, $p_2\sqsubseteq q$ and $p_1\wedge p_2$ exists. Then $p_1\wedge p_2\sqsubseteq q$.

A pair $(\Gamma ,\sqsubseteq )$ where $\Gamma $ is a modular complex and $\sqsubseteq $ is admissible will be called an extended modular complex. With some abuse of notation, we will use letter $\Gamma $ for an extended modular complex, and treat it as an oriented modular graph when needed. Note that previously we used notation $\sqsubseteq $ for Boolean pairs in $\Gamma $. From now on we will write $p\,{{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}\, q$ if (p, q) is a Boolean pair in $\Gamma $, and reserve notation $\sqsubseteq $ (or $\sqsubseteq _\Gamma $) for the binary relation that comes with an extended modular complex $\Gamma $. As before, we write $p \sqsubset q$ to mean $p \sqsubseteq q$ and $p\ne q$.

Proposition 16

Let $\Gamma $ be a modular complex. (a) Relations ${{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}$ and $\preceq $ are admissible for $\Gamma $. (b) If relation $\sqsubseteq $ is admissible for $\Gamma $ then $p\, {{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}\, q$ implies $p \sqsubseteq q$.

This proposition shows that a modular complex is a special case of an extended modular complex (obtained by setting $\sqsubseteq $ to be ${{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}$). Also, ${{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}$ and $\preceq $ are respectively the coarsest and the finest admissible relations. Next, we will show that many of the results in Sect. 2 still hold if relation ${{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}$ is replaced with an arbitrary admissible relation $\sqsubseteq $.

From now on we fix an extended modular complex $\Gamma $. We define graphs and as in Sect. 2, and for $p\in V$ define posets $(\mathcal{L}^+_p,\mathcal{L}^-_p,\mathcal{L}^\pm _p)=(\mathcal{L}^+_p(\Gamma ),\mathcal{L}^-_p(\Gamma ),\mathcal{L}^\pm _p(\Gamma ))$ as in Eq. (8). Note, in all cases $\sqsubseteq $ now has the new meaning (it is the relation that comes with $\Gamma $).

Lemma 17

Let $\Gamma $ be an extended modular complex, and p be its element. Then $\mathcal{L}^+_p,\mathcal{L}^-_p$ are modular semilattices that are convex in $\Gamma $. Furthermore, function $v(\cdot )$ defined via $v(a)=\mu _\Gamma (p,a)$ is a valid valuation of these (semi)lattices.

We define a 2-subdivision of $\Gamma $ exactly as in Sect. 2; it is the graph $\Gamma ^*=(V^*,E^*,w^*)$ where $V^*=\{[p,q]\,{:}\,p,q\in \Gamma ,p\sqsubseteq _\Gamma q\}$. Using condition (15a), it is straightforward to verify that $(V^*,E^*)$ is the Hasse diagram of the poset $(V^*,\subseteq )$. We will show the following result.

Theorem 18

If $\Gamma $ is an extended modular complex then $(V^*,E^*,w^*)$ is a modular complex (i.e. an oriented modular graph).

Recall that if $\Gamma $ is a modular complex (equivalently, an extended modular complex with the relation $\sqsubseteq _\Gamma ={{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}_\Gamma $) then $\Gamma ^*$ is well-oriented, i.e. ${{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}_{\Gamma ^*}=\,\preceq _{\Gamma ^*}$. For extended modular complexes this is not necessarily the case, as e.g. for the example in Fig. 1c. We will treat $\Gamma ^*$ as an extended modular complex with the relation $\sqsubseteq _{\Gamma ^*}\,=\,\preceq _{\Gamma ^*}$, so that we have $\mathcal{L}_{[p,p]}^+(\Gamma ^*)=\mathcal{L}_{[p,p]}^\uparrow (\Gamma ^*)$ by construction. Let us set $\mathcal{L}^*_p=\mathcal{L}^*_p(\Gamma )=\mathcal{L}_{[p,p]}^+(\Gamma ^*)=\mathcal{L}_{[p,p]}^\uparrow (\Gamma ^*)$. We can now define L-convex functions exactly as in the previous section:

Function $f:V\rightarrow \overline{\mathbb R}$ is called L-convex on extended modular complex $\Gamma $ if (i) subset ${{\,\mathrm{{\texttt {dom}}}\,}}f\subseteq V$ is connected in , and (ii) for every $p\in V$, the restriction of $f^*$ to $\mathcal{L}^*_{p}=\mathcal{L}^\uparrow _{[p,p]}(\Gamma ^*)$ is submodular on valuated modular semilattice $\mathcal{L}^*_{p}$.

Cartesian products If $(\Gamma ,\sqsubseteq ),(\Gamma ',\sqsubseteq ')$ are two extended modular complexes, then their Cartesian product $(\Gamma ,\sqsubseteq )\times (\Gamma ',\sqsubseteq ')$ is defined as the pair $(\Gamma \times \Gamma ',\sqsubseteq _{\times })$ where $(p,p') \sqsubseteq _{\times } (q,q')$ iff $p \sqsubseteq q$ and $p' \sqsubseteq ' q'$.

Lemma 19

Consider extended modular complexes $\Gamma ,\Gamma '$ and element $(p,p')\in \Gamma \times \Gamma '$.

(a):: $\Gamma \times \Gamma '$ is an extended modular complex.
(b):: $\mathcal{L}^\sigma _{(p,p')}(\Gamma \times \Gamma ')=\mathcal{L}^\sigma _{p}(\Gamma )\times \mathcal{L}^\sigma _{p'}(\Gamma ')$ for $\sigma \in \{-,+\}$.

Theorem 20

Consider extended modular complexes $\Gamma ,\Gamma '$ and functions $f,f':\Gamma \rightarrow \overline{\mathbb R}$ and $\tilde{f}:\Gamma \times \Gamma '\rightarrow \overline{\mathbb R}$.

(a):: If $f,f'$ are L-convex on $\Gamma $ then $f+f'$ and $c\cdot f$ for $c\in \mathbb R_+$ are also L-convex on $\Gamma $.
(b):: If f is L-convex on $\Gamma $ and ${\tilde{f}}(p,p')=f(p)$ for $(p,p')\in \Gamma \times \Gamma '$ then ${\tilde{f}}$ is L-convex on $\Gamma \times \Gamma '$.
(c):: If ${\tilde{f}}$ is L-convex on $\Gamma \times \Gamma '$ and $f(p)={\tilde{f}}(p,p')$ for fixed $p'\in \Gamma '$ then f is L-convex on $\Gamma $.
(d):: The indicator function $\delta _U:V\rightarrow \{0,\infty \}$ is L-convex on $\Gamma $ in the following cases: (i) U is a $d_\Gamma $-convex set; (ii) $U=\{p,q\}$ for elements p, q with $p\sqsubseteq q$.
(e):: Function $\mu _\Gamma :\Gamma \times \Gamma \rightarrow \mathbb R_+$ is L-convex on $\Gamma \times \Gamma $.
(f):: If f is L-convex on $\Gamma $ then the restriction of f to $\mathcal{L}^\sigma _p(\Gamma )$ is submodular on $\mathcal{L}^\sigma _p(\Gamma )$ for every $p\in \Gamma $ and $\sigma \in \{-,+\}$.

Theorem 21

Let $f:V\rightarrow \overline{\mathbb R}$ be an L-convex function on an extended modular complex $\Gamma $. If p is a local minimizer of f on $\mathcal{L}^\pm _p(\Gamma )$, i.e. $f(p)=\min _{q\in \mathcal{L}^\pm _p(\Gamma )}f(q)$, then it is also a global minimizer of f, i.e. $f(p)=\min _{q\in V}f(q)$.

For extended modular complex $\Gamma $ let $\Phi _\Gamma $ be the language over domain $D=V_\Gamma $ that consists of all functions $f:D^n\rightarrow \overline{\mathbb R}$ such that f is L-convex on $\Gamma ^n$. The results above imply that any instance $\mathcal{I}$ of $\Phi _\Gamma $ can be solved by the Steepest Descent Algorithm (Algorithm 1), and each subproblem in line 2 can be solved in polynomial time. However, we do not know whether the number of steps would be polynomially bounded. To get a polynomial bound, we introduce an alternative algorithm, which we denote -SDA.

From Theorems 11 and 20(f) we can conclude that points in lines 3 and 4 can be computed in polynomial time via the BLP relaxation. To see this for , observe that function $f+\delta _U$ for convex set $U=\mathcal{L}^+_{x^-}\cap \mathcal{L}^-_{x^+}$ is L-convex on $\Gamma $ by Theorem 20(a,d), and thus its restriction to $\mathcal{L}^+_{x^-}$ is submodular on $\mathcal{L}^+_{x^-}$ by Theorem 20(f). (Alternatively, the claim can be deduced from Theorem 23 given later.)

Theorem 22

Let $\Gamma $ be an extended modular complex and $f:\Gamma ^n\rightarrow {{\overline{{\mathbb R}}}}$ be an L-convex function on $\Gamma ^n$. -SDA algorithm applied to function f terminates after generating exactly distinct points, where x is the initial vertex and $\texttt {opt}_i(f)$ is as defined in Theorem 12.

Remark 2

Suppose that $\sqsubset _\Gamma \,=\,\preceq _\Gamma $, and the initial vertex x in -SDA is computed as follows: pick some $x_0\in {{\,\mathrm{{\texttt {dom}}}\,}}f$ and then set either $x\in \mathop {\mathrm {arg\,min}}\limits \{f(y)\,|\,y\in \mathcal{L}^-_{x_0}\}$ or $x\in \mathop {\mathrm {arg\,min}}\limits \{f(y)\,|\,y\in \mathcal{L}^+_{x_0}\}$. It can be seen that in that case -SDA becomes equivalent to SDA: we will have at even steps and at odd steps, or vice versa. Thus, Theorem 22 generalizes Theorem 12 in two ways: from modular complexes to extended modular complexes, and by allowing relations $\sqsubseteq _\Gamma $ and $\preceq _\Gamma $ to be distinct.

Note that the algorithm for $\mathtt{0\hbox {-}Ext}[{\mu }]$ described in the previous section required applying SDA on 2-subdivision $\Gamma ^*$. This blows up the size of the graph and the size of LPs that need to be solved at each iteration by an up to a quadratic factor. -SDA provides an alternative that avoids such blow-up.

Remark 3

Algorithm 2 can also be specialized for minimizing $L^\natural $-convex functions (i.e. when $\Gamma $ a directed path on consecutive integers and $a\sqsubset b$ iff $b=a+1$). In this case $\mathcal{L}^-_x(\Gamma ^n)=[x-\textbf{1},x]$, $\mathcal{L}^+_x(\Gamma ^n)=[x,x+\textbf{1}]$ and $\mathcal{L}^+_{x^-}(\Gamma ^n)\cap \mathcal{L}^-_{x^+}(\Gamma ^n)=U_1\times \ldots \times U_n$ where

$$\begin{aligned} U_i={\left\{ \begin{array}{ll} \{x_i-1,x_i\} &{} \hbox {if }(x^-_i,x^+_i)=(x_i-1,x_i) \\ \{x_i,x_i+1\} &{} \hbox {if }(x^-_i,x^+_i)=(x_i,x_i+1) \\ \{x_i\} &{} \hbox {otherwise} \end{array}\right. }. \end{aligned}$$

Note that the resulting algorithm is different from previously proposed versions of SDA [24, 29, 31]. Thus, -SDA adds to the toolbox for $L^\natural $-convex minimization.

We can now show the tractability part of Theorem 3. Suppose that graph $H_\mu $ is F-orientable modular for a metric space $(V,\mu )$ and subset $F\subseteq \left( {\begin{array}{c}V\\ 2\end{array}}\right) $. Choose an admissible orientation of $(H_\mu ,F)$, and let $\Gamma $ be the corresponding extended modular complex with the relation $\sqsubseteq \;=\;\preceq $. Clearly, for any $\{x,y\}\in F$ we have either $x\preceq y$ or $y\preceq x$. From Theorem 20 we conclude that $\mathtt{0\hbox {-}Ext}[{\mu ,F}]\subseteq \Phi _\Gamma $, and so $\mathtt{0\hbox {-}Ext}[{\mu ,F}]$ can be solved in polynomial time by the -SDA algorithm.

More generally, this shows tractability of ${\text {VCSP}}(\Phi _\Gamma )$ for an extended modular complex $\Gamma $ assuming that a feasible solution of any $\Phi _\Gamma $-instance can be computed in polynomial time. Our last result shows that ${\text {VCSP}}(\Phi _\Gamma )$ is tractable even without this assumption.

Theorem 23

If $\Gamma $ is an extended modular complex then BLP relaxation solves $\Phi _\Gamma $, and an optimal solution of any $\Phi _\Gamma $-instance can be computed in polynomial time.

Note that previously Hirai remarked that BLP directly solves $\mathtt{0\hbox {-}Ext}[{\mu }]$ for orientable modular metrics only assuming that $\texttt {P}\ne \texttt {NP}$, as a consequence of VCSP dichotomy for finite-valued languages (see Section 6 in [15]). Theorem 23 now establishes this fact unconditionally.

All proofs are given in Sects. 5 and 6. For Lemmas 17, 19 and Theorems 18, 20, 21 we mostly follow the proofs of the corresponding claims in [5, 15, 16] (replacing properties of relation ${{\mathop {\sqsubseteq }\limits ^{{\tiny \mathtt ~Bp}}}}$ with the properties of an admissible relation $\sqsubseteq $). To analyze -SDA (Theorem 22), we introduce new concepts, such as binary operations and the notion of f-extremality (see Sects. 5.6 and 5.9). Theorem 23 does not have an analogue in [5, 15, 16].

4 Submodular functions on a valuated modular semilattice

Let $\mathcal{L}$ be a valuated modular semilattice with valuation v. This section gives the definition of a submodular function on $\mathcal{L}$, and thus completes the definition of L-convex functions.

Let $\Gamma =(V,E,w)$ be the Hasse diagram of $\mathcal{L}$ where edge $p\rightarrow q$ is assigned weight $v(q)-v(p)>0$. As discussed in Sect. 2, graph $\Gamma $ is oriented modular. Denote $\mu =\mu _\Gamma $ and $d=d_\Gamma $. Recall that $\mathcal{L}$ is viewed as a metric space with the metric $\mu $, and the definitions of the metric interval I(p, q), convex sets, etc would be the same for $(\mathcal{L},\mu )$ and $(\mathcal{L},d)$.

For $p\preceq q$ let us denote $v[p,q]=v(q)-v(p)$. The following facts are known.

Lemma 24

[15, Lemma 2.15] The following holds for each $p,q\in \mathcal{L}$ with $s=p\wedge q$.

(a):: $\mu (p,q)=\mu (s,p)+\mu (s,q)=v[s,p]+v[s,q]$.
(b):: The metric interval I(p, q) is equal to the set of elements u that is represented as $u = a \vee b$ for some $(a,b) \in [s,p] \times [s,q]$, where such a representation is unique, and (a, b) equals $(u \wedge p, u \wedge q)$.
(c):: For $u,u'\in I(p,q)$ there holds $u\wedge u'=(u\wedge u'\wedge p)\vee (u\wedge u'\wedge q)$; in particular $u\wedge u'\in I(p,q)$.

The construction in [15] can be described as follows (see Fig. 2a for a conceptual diagram).

(i):: For $u \in I(p,q)$, let $v_{pq}(u)$ be the vector in $\mathbb R^2_+$ defined by
$$\begin{aligned} v_{pq}(u) = (v[s,u \wedge p], v[s,u \wedge q])\qquad \hbox {where~~~~} s=p\wedge q \end{aligned}$$
(9)
(ii):: Let $\mathop \textrm{Conv} I(p,q)$ denote the convex hull of vectors $v_{pq}(u)$ for all $u \in I(p,q)$.
(iii):: Let $\mathcal{E}(p,q)$ be the set of elements u in I(p, q) such that $v_{pq}(u)$ is a maximal extreme point of $\mathop \textrm{Conv} I(p,q)$. (This set is called “(p, q)-envelope”). Note that $p,q\in \mathcal{E}(p,q)$. Hirai proves that elements of $\mathcal{E}(p,q)$ receive unique coordinates [15, Lemma 3.1]:
$$\begin{aligned} v_{pq}(u)\ne v_{pq}(u') \qquad \forall u\in \mathcal{E}(p,q),u'\in I(p,q)-\{u\} \end{aligned}$$
(10)
(iv):: For $\theta \in [0,1]$ define vector $c_\theta =(1-\theta ,\theta )$. For points $p,q\in \mathcal{L}$ let $p\vee _\theta q$ be the point $u\in I(p,q)$ that maximizes $\langle c_\theta ,v_{pq}(u)\rangle $, assuming that the maximizer is unique. If a maximizer is not unique then instead set $p\vee _\theta q=\perp $ (“undefined”).
(v):: By the property in Eq. (10), there are only a finite number of values $\theta \in [0,1]$ such that $p\vee _\theta q=\perp $ for some $p,q\in \mathcal{L}$. Let $\Theta $ be the set of values $\theta \in [0,1]$ such that $p\vee _\theta q\ne \perp $ for all $p,q\in \mathcal{L}$, then $\Theta \subseteq [0,1]$ has measure 1.

We are now ready to define submodular functions on $\mathcal{L}$. A function $f:\mathcal{L}\rightarrow \overline{\mathbb R}$ is called submodular if every $p,q\in \mathcal{L}$ satisfies

$$\begin{aligned} f(p) + f(q) \ge f(p \wedge q) + \int _{\theta \in \Theta } f(p\vee _\theta q)d\theta \end{aligned}$$

(11a)

This is equivalent to the inequality

$$\begin{aligned} f(p) + f(q) \ge f(p \wedge q) + \sum _{i=0}^k (\theta _i-\theta _{i-1}) f(u_i) \end{aligned}$$

(11b)

where $u_0,\ldots ,u_k$ are the sorted elements of $\mathcal{E}(p,q)$ with $(u_0,u_k)=(p,q)$ and $(\theta _{-1},\theta _k)=(0,1)$,

$$\begin{aligned} \theta _i=\frac{v[s_{i},u_i]}{v[s_i,u_i]+v[s_i,u_{i+1}]}\qquad \hbox {where~~~} s_i=u_i\wedge u_{i+1} \end{aligned}$$

for $i\in [0,k-1]$. (See Fig. 2b for a geometric interpretation of values $\theta _i$).

Hirai also gives a simplified characterization of submodularity. Pair (p, q) is called bounded if $p\vee q$ exists, implying $\mathcal{E}(p,q)=\{p,q,p\vee q\}$ (Fig. 2c). Pair (p, q) is called antipodal if $\mathcal{E}(p,q)=\{p,q\}$ (Fig. 2d). Equivalently, (p, q) is antipodal if and only if

$$\begin{aligned}{} & {} v[a,p] v[b,q] \ge v[p \wedge q, a] v[p \wedge q, b] \nonumber \\{} & {} \qquad \forall (a,b):p \succeq a \succeq p \wedge q \preceq b \preceq q,a\vee b\hbox { exists} \end{aligned}$$

(12)

We say that (p, q) is special if it is either bounded or antipodal. For special pairs inequality (11) reduces to

$$\begin{aligned}&f(p) + f(q) \ge f(p \wedge q) + f(p\vee q) \nonumber \\&\hbox {if}\, (p,q)\, \hbox {bounded} \end{aligned}$$

(13a)

$$\begin{aligned}&v[p \wedge q,q] f(p) + v[p \wedge q,p] f(q) \ge (v[p \wedge q,p] + v[p \wedge q,q])f(p \wedge q) \nonumber \\&\hbox {if}\, (p,q)\, \hbox {antipodal} \end{aligned}$$

(13b)

These inequalities are called respectively submodularity and $\wedge $-convexity inequalities for (p, q).

Theorem 25

[15, Theorem 3.5] Function $f:\mathcal{L}\rightarrow \overline{\mathbb R}$ is submodular if and only if it satisfies

(1):: $\mathcal{E}(p,q) \subseteq {{\,\mathrm{{\texttt {dom}}}\,}}f$ for $p,q \in {{\,\mathrm{{\texttt {dom}}}\,}}f$;
(2):: the submodularity inequality for every bounded pair (p, q);
(3):: the $\wedge $-convexity inequality for every antipodal pair (p, q).

We observe that condition (1) can be strengthened further (though we will not use this observation). For elements $p,q,u,u'\in \mathcal{L}$ we write $(p,q)\lhd (u,u')$ if $u,u'\in I(p,q)-\{p,q\}$ and point $\frac{1}{2}(v_{pq}(u)+v_{pq}(u'))$ lies strictly above the segment $[v_{pq}(p),v_{pq}(q)]$ in $\mathbb R^2$. Note that all elements $u\in \mathcal{E}(p,q)-\{p,q\}$ satisfy $(p,q)\lhd (u,u)$, and pair (p, q) is antipodal if and only if there is no $u\in \mathcal{L}$ with $(p,q)\lhd (u,u)$. The following theorem is proven in [10, Appendix A].

Theorem 26

Function $f:\mathcal{L}\rightarrow \overline{\mathbb R}$ satisfies condition (1) of Theorem 25 if and only if it satisfies the following condition:

(1$'$):: Suppose that $p,q\in {{\,\mathrm{{\texttt {dom}}}\,}}f$ and $\mathcal{E}(p,q)\ne \{p,q\}$. Let $u^-$ and $u^+$ be the elements in $\mathcal{E}(p,q)-\{p,q\}$ that are closest to p and to q, respectively. Then there exists $t\in {{\,\mathrm{{\texttt {dom}}}\,}}f$ such that either $(p,q)\lhd (u^-,t)$ or $(p,q)\lhd (u^+,t)$.

5 Proofs

Throughout the proofs we usually denote $d,\mu $ to be the distance functions in an extended modular complex $\Gamma $, and and to be the distance functions in and in respectively. We write to indicate that nodes x, y are neighbors in (equivalently, that either $x\sqsubset y$ or $y\sqsubset x$). Note that if and only if .

A sequence $(u_0,u_1,\ldots ,u_k)$ of nodes in $\Gamma $ is called a shortest subpath if $\mu (u_0,u_k)=\mu (u_0,u_1)+\mu (u_1,u_2)+\ldots +\mu (u_{k-1},u_k)$. Recall that for a modular graph $\Gamma $ this is equivalent to the condition $d(u_0,u_k)=d(u_0,u_1)+d(u_1,u_2)+\ldots +d(u_{k-1},u_k)$. We will often implicitly use the following fact.

Proposition 27

Consider elements p, q in a modular complex $\Gamma $. (a) If $p\wedge q$ exists then $(p,p\wedge q,q)$ is a shortest subpath. Conversely, if $p\succeq x\preceq q$ and (p, x, q) is a shortest subpath then $x=p\wedge q$. (b) If $p\vee q$ exists then $(p,p\vee q,q)$ is a shortest subpath. Conversely, if $p\preceq x\succeq q$ and (p, x, q) is a shortest subpath then $x=p\vee q$.

Proof

To see the claim, combine Lemma 7 and Lemma 24(a). $\square $

We say that sequence $(x_1,x_2,x_3,x_4)$ is an isometric rectangle if $(x_{i-1},x_i,x_{i+1})$ are shortest subpaths for all $i\in [4]$, where $x_j=x_{j\!\!\mod 4}$. Isometric rectangles have the following properties.

Proposition 28

If $(x_1,x_2,x_3,x_4)$ is an isometric rectangle in graph $\Gamma $ in then $d(x_1,x_2)=d(x_3,x_4)$ and $d(x_1,x_4)=d(x_2,x_3)$. Furthermore, if graph $\Gamma $ is modular then for any $y_1\in I(x_1,x_4)$ there exists $y_2=I(x_2,x_3)$ (namely, a median of $y_1,x_2,x_3$) such that sequences $(x_1,x_2,y_2,y_1)$ and $(y_1,y_2,x_3,x_4)$ are isometric rectangles.

Proof

The definition of an isometric rectangle implies that $d(x_1,x_2)+d(x_2,x_3)=d(x_4,x_1)+d(x_3,x_4)$ and $d(x_2,x_3)+d(x_3,x_4)=d(x_1,x_2)+d(x_4,x_1)$, which in turns implies that $d(x_1,x_2)=d(x_3,x_4)$ and $d(x_1,x_4)=d(x_2,x_3)$. Now suppose that $y_1\in I(x_1,x_4)$ and $y_2$ is a median of $y_1,x_2,x_3$. $(x_1,y_1,x_4,x_3)$ and $(y_1,y_2,x_3)$ are shortest subpaths, and thus so is $(x_1,y_1,y_2,x_3)$. By a symmetric argument, $(x_4,y_1,y_2,x_2)$ is also a shortest subpaths. This implies the claim. $\square $

5.1 Preliminaries

First, we state some known facts about orientable modular graphs that will be needed later in the proofs.

Lemma 29

([2, Proposition 1.7], see [35, Proposition 6.2.6, Chapter I]) A connected graph $\Gamma $ with distances $d=d_\Gamma $ is modular if and only if

(1):: $\Gamma $ is bipartite, and
(2):: for vertices p, q and neighbors $p_1,p_2$ of p with $d(p_1,q)=d(p_2,q)=d(p,q)-1$, there exists a common neighbor $p^*$ of $p_1,p_2$ with $d(p^*, q)=d(p,q)-2$.

Convex sets and gated sets Consider metric space $(V,\mu )$. Recall that subset $U\subseteq V$ is called convex if $I(x,y)\subseteq U$ for every $x,y\in U$. It is called gated if for every $p\in V$ there exists unique $p^*\in U$, called the gate of p at U, such that $\mu (p,q)=\mu (p,p^*)+\mu (p^*,q)$ holds for every $q\in U$. The gate $p^*$ will be denoted as $\Pr _U(p)$ (“projection of p onto U”). Thus, $\Pr _U$ is a map $V\rightarrow U$.

Theorem 30

[9] Let A and $A'$ be gated subsets of $(V,\mu )$ and let $B:= \Pr _{A}(A')$ and $B':= \Pr _{A'}(A)$.

(a):

$\Pr _{A}$ and $\Pr _{A'}$ induce isometries, inverse to each other, between $B'$ and B.

(b):

For $p \in A$ and $p' \in A'$, the following conditions are equivalent:

(i):: $\mu (p,p') = \mu (A,A')$.
(ii):: $p = \Pr _{A}(p')$ and $p' = \Pr _{A'}(p)$.

(c):

B and $B'$ are gated, and $\Pr _{B} = \Pr _{A} \circ \Pr _{A'}$ and $\Pr _{B'} = \Pr _{A'} \circ \Pr _{A}$.

Now let us consider a modular graph $\Gamma =(V,E,w)$. Such graph induces two natural metrics on V, namely $\mu =\mu _\Gamma $ and $d=d_\Gamma $ (shortest path metrics w.r.t. edge lengths w and 1, respectively). In the light of Theorem 6, the definitions of convex sets, gated sets, gates and maps $\Pr _U$ would be the same for both metric spaces ($(V,\mu )$ and (V, d)). In addition, convex and gated sets for such metrics coincide.

Lemma 31

([7], see [15, Lemma 2.9]) Let $\Gamma $ be a modular graph. For $U\subseteq V$, the following conditions are equivalent:

(1):: U is convex.
(2):: U is gated.
(3):: $\Gamma [U]$ is connected and $I(x,y)\subseteq U$ holds for every $p,q\in U$ with $d_\Gamma (p,q)=2$.

Next, we review properties of modular complexes. A path $(p_0,p_1,p_2,\ldots ,p_k)$ in a directed acyclic graph $\Gamma =(V,E,w)$ is said to be ascending if $p_0 \prec p_1\prec \ldots \prec p_k$.

Lemma 32

[15, Lemma 4.13] Let $\Gamma $ be a modular complex. For $p,q \in V$ with $p \preceq q$, a (p, q)-path P is shortest if and only if P is an ascending path from p to q. In particular, $I(p,q) = [p,q]$, any maximal chain in [p, q] has the same length, and the rank r of [p, q] is given by $r(a) = d(a,p)$.

Since set [p, q] is convex in $\Gamma $ by Lemma 7, one can define projection $\Pr _{[p,q]}: V \rightarrow [p,q]$. The lemma below describes some properties of this projection.

Lemma 33

[15, Lemma 4.15] Let $\Gamma $ be a modular complex. For elements $p,q,p',q'$ with $p \preceq q$ and $p' \preceq q'$ define

$$\begin{aligned} u = \Pr \nolimits _{[p, q]} (p') \quad v = \Pr \nolimits _{[p,q]} (q') \quad u' = \Pr \nolimits _{[p',q']} (p) \quad v' = \Pr \nolimits _{[p',q']} (q) \end{aligned}$$

(14)

Then we have:

(1):: $u \preceq v$, $u' \preceq v'$, $\Pr _{[p',q']}([p,q]) = [u', v']$, and $\Pr _{[p,q]}([p',q']) = [u, v]$.
(2):: [u, v] is isomorphic to $[u',v']$ by map $w \mapsto \Pr _{[p',q']}(w)$.

5.2 Properties of extended modular complexes

In this section $\Gamma $ is always assumed to be an extended modular complex on nodes V with relation $\sqsubseteq $.

Lemma 34

Suppose that $p\sqsubseteq q_1$, $p\sqsubseteq q_2$, $q_1\ne q_2$ and q is a common neighbor of $q_1,q_2$ (implying that $d(q_1,q_2)=2$). Then $p\sqsubseteq q$.

Proof

Modulo symmetry, three cases are possible.

$q_1\rightarrow q\rightarrow q_2$. Since $p\sqsubseteq q_2$, condition (15a) gives $p\sqsubseteq q$.
$q_1\rightarrow q\leftarrow q_2$. Then $q=q_1\vee q_2$ by Lemma 7(a), and condition (15b) gives $p\sqsubseteq q$.
$q_1\leftarrow q\rightarrow q_2$. Then $q=q_1\wedge q_2$ by Lemma 7(a), implying $p\preceq q$ (since p lower-bounds $q_1,q_2$). Condition (15a) gives $p\sqsubseteq q$.

$\square $

Lemma 35

Consider elements p, q, a, b such that (p, a, b) and (a, b, q) are shortest subpaths and $p\sqsubseteq q$. Then $a\sqsubseteq b$.

Proof

We use induction on $d(p,q)+d(p,a)+d(q,b)$. First, assume that (a, p, q) is not a shortest subpath. Let $p'$ be a median of a, p, q, then $p'\in [p,q]-\{p\}$ and so $p'\sqsubseteq q$. The induction hypothesis for $p',q,a,b$ gives the claim. We can thus assume that (a, p, q) is a shortest subpath. By a symmetric argument we can also assume that (b, q, p) is a shortest subpath. Sequence (p, q, b, a) is thus an isometric rectangle, and so $d(p,q)=d(a,b)$ and $d(p,a)=d(q,b)$. We now consider 5 possible cases.

$d(p,a)=d(q,b)=0$. The claim then holds trivially.
$d(p,a)=d(q,b)=1$, $q\rightarrow b$. Then $p\preceq b$ and (p, a, b) is a shortest subpath, and so $p\rightarrow a\preceq b$ by Lemma 32. We have $a\vee q\preceq b$ and (a, b, q) is a shortest subpath; this implies that $a\vee q=b$. Since $p\sqsubseteq a$ and $p\sqsubseteq q$, we obtain $p\sqsubseteq a\vee q=b$ by condition (15b), and thus $a\sqsubseteq b$ since $a\in [p,b]$.
$d(p,a)=d(q,b)=1$, $p\leftarrow a$. This case is symmetric to the previous one.
$d(p,a)=d(q,b)=1$, $p\rightarrow a$, $b\rightarrow q$. We will show that $a\preceq b$; this will imply that (p, a, b, q) is a shortest subpath by Lemma 32, contradicting condition $d(p,q)=d(a,b)$. If $d(p,q)=d(a,b)=0$ then the claim is trivial. If $d(p,q)=d(a,b)=1$ then (p, q, b, a) is a 4-cycle, and so $p\rightarrow q$ implies that $a\rightarrow b$. Suppose that $d(p,q)=d(a,b)\ge 2$. By Proposition 28, there exist $x\in I(p,q)-\{p,q\}$ and $y\in I(a,b)$ such that (p, x, y, a) and (q, x, y, b) are isometric rectangles. We have $p\sqsubseteq x\sqsubseteq q$, so by the induction hypothesis $a\sqsubseteq y\sqsubseteq b$, implying the claim.
$d(p,a)=d(q,b)\ge 2$. By Proposition 28, there exist $p'\in I(p,a)$ and $q'\in I(q,b)$ such that $(p,q,q',p')$ and $(p',q',b,a)$ are isometric rectangles. The induction hypothesis for elements $p,q,p',q'$ gives $p'\sqsubseteq q'$. The induction hypothesis for elements $p',q',a,b$ gives $a\sqsubseteq b$. $\square $

Corollary 36

Suppose that $p\sqsubseteq q$ and $A'\subseteq V$ is a convex set in $\Gamma $. Let $p'=\Pr _{A'}(p)$ and $q'=\Pr _{A'}(q)$. Then $p'\sqsubseteq q'$.

Lemma 17

(restated). Let $\Gamma $ be an extended modular complex, and p be its element. Then $\mathcal{L}^+_p,\mathcal{L}^-_p$ are modular semilattices that are convex in $\Gamma $. Furthermore, function $v(\cdot )$ defined via $v(a)=\mu _\Gamma (p,a)$ is a valid valuation of these (semi)lattices.

Proof

By symmetry, it suffices to consider only $\mathcal{L}^+_p$. Note that $\mathcal{L}^+_p$ is a subset of $\mathcal{L}^\uparrow _p$ such that for any $a,b\in \mathcal{L}_p^+$ we have (i) $a\wedge b\in \mathcal{L}_p^+$ and (ii) $a\vee b\in \mathcal{L}_p^+$ assuming that $a\vee b$ exists (by Definition 15). Thus, all claims except convexity follow from the corresponding properties of $\mathcal{L}^\uparrow _p$ (see Lemma 7). Let us show the convexity. In view of Lemma 31, it suffices to show that for any $a,b\in \mathcal{L}_p^+$ with $d(a,b)=2$ we have $I(a,b)\subseteq \mathcal{L}^+_p$. This claim follows directly from Lemma 34. $\square $

In the next two results we use the same notation d for $d_\Gamma $ and $d_{\Gamma ^*}$ (since they can be distinguished by the arguments). Similarly, we use $\mu $ for both $\mu _\Gamma $ and $\mu _{\Gamma ^*}$.

Lemma 37

Let $\Gamma ^*$ be the 2-subdivision of $\Gamma $ on nodes $V^*$. Then

$$\begin{aligned} d([p,q],[p',q'])=d(p,p')+d(q,q')\qquad \forall [p,q],[p',q']\in V^*\end{aligned}$$

Proof

If $P=([p_0,q_0],\ldots ,[p_k,q_k])$ is a path in $\Gamma ^*$ from $[p_0,q_0]=[p,q]$ to $[p_k,q_k]=[p',q']$ then the length of P in $\Gamma ^*$ equals $\sum _{i=0}^{k-1}(d(p_i,p_{i+1})+d(q_i,q_{i+1}))\ge d(p,p')+d(q,q')$; hence $(\ge )$ holds.

To show equality, we use induction on $D=d(p,p')+d(q,q')$. The base case $D=0$ is trivial; suppose that $D\ge 1$. It suffices to show that one of the following holds:

(i):: p has neighbor a in $\Gamma $ such that $a\sqsubseteq q$ and $d(a,p')=d(p,p')-1$.
(ii):: q has neighbor b in $\Gamma $ such that $p\sqsubseteq b$ and $d(b,q')=d(q,q')-1$.

Indeed, if (i) holds then $d([a,q],[p',q'])=d(a,p')+d(q,q')=D-1$ by induction, and hence $d([p,q],[p',q'])\le (D-1)+1=D$, as required. Case (ii) is similar. Alternatively, it would also suffice to show symmetrical cases when $p'$ or $q'$ have appropriate neighbors.

Define $u,v,u',v'$ by (14). Note that $p\preceq u\preceq v\preceq q$ and $p'\preceq u'\preceq v'\preceq q'$. Modulo symmetry, two cases are possible.

$p\ne u$, implying $p\prec u$. Let $a\in [p,u]\subseteq [p,q]$ be an out-neighbor of p in $\Gamma $ (i.e. $p\rightarrow a\preceq u$), then $a\sqsubseteq q$ by condition (15a), and $(p,a,p')$ is a shortest subpath since (p, a, u) and $(p,u,p')$ are shortest subpaths. Thus, case (i) holds.
$(p,q,p',q')=(u,v,u',v')$. Then $(p,q,q',p')$ is an isometric rectangle. By Proposition 28 there exist $a\in I(p,p')$ and $b\in I(q,q')$ such that (p, q, b, a) and $(p',q',b,a)$ are isometric rectangles and $d(q,b)=d(p,a)=1$. We have $a\sqsubseteq b$ by Lemma 35. If $a\rightarrow p$ then Lemma 32 for elements $a\preceq q$ gives $a\preceq b\rightarrow q$. By Lemma 7(a), $p\wedge b$ exists. Since (p, q, b) is a shortest subpath, we have $b\notin I(p,q)=[p,q]$, therefore $p\not \preceq b$ and $p\wedge b=a$. Condition (15c) gives $a\sqsubseteq q$, and so case (i) holds. If $q\rightarrow b$ then by a symmetric argument we conclude that case (ii) holds. The last remaining case $p\rightarrow a,b\rightarrow q$ is impossible by Lemma 32 for elements $p\preceq q$. $\square $

Theorem 18

(restated). If $\Gamma $ is an extended modular complex then $(V^*,E^*,w^*)$ is a modular complex (i.e. an oriented modular graph).

Proof

Any 4-cycle in $\Gamma ^*$ is represented as $([p,q], [p,q'], [p',q'], [p',q])$ or $([p',q], [p',q'], [p,q'], [p,q])$ for some edges $p\rightarrow p',q\rightarrow q'$ in $\Gamma $, or ([p, x], [p, y], [p, z], [p, w]) or ([x, p], [y, p], [z, p], [w, p]) for 4-cycle (x, y, z, w) and vertex p in $\Gamma $. This immediately implies that the orientation of $\Gamma ^*$ is admissible and orientation and $w^*$ is orbit-invariant.

To show that $\Gamma ^*$ is modular, we are going to verify that $\Gamma ^*$ satisfies the two conditions of Lemma 29. If [p, q] and $[p',q']$ are joined by an edge, then $d_{\Gamma }(p,q)$ and $d_{\Gamma }(p',q')$ have different parity. This implies that $\Gamma ^*$ is bipartite.

We next verify condition (2) of Lemma 29. Take intervals $[p,q],[p',q']\in V^*$, and denote $D_p=d(p,p')$, $D_q=d(q,q')$, $D=d([p,q],[p',q'])=D_p+D_q$. Suppose further that we are given two distinct neighbors $[p_1,q_1],[p_2,q_2]$ of [p, q] with $d([p_1,q_1],[p',q'])=d([p_2,q_2],[p',q'])=D-1$. Our goal is to show the existence of a common neighbor $[p^*,q^*]$ of $[p_1,q_1],[p_2,q_2]$ with $d([p^*,q^*],[p',q'])=D-2$.

Modulo symmetry, two cases are possible.

$p_1=p=p_2$. Condition $d([p_i,q_i],[p',q'])=D-1$ implies that $d(q_i,q')=D_q-1$ for $i=1,2$. By Lemma 29(2) for $\Gamma $, there is a common neighbor $q^*$ of $q_1,q_2$ with $d(q^*,q')=D_q-2$. By Lemma 34, $p\sqsubseteq q^*$. Thus, $[p,q^*]$ is a desired common neighbor of $[p,q_1],[p,q_2]$.
$p_1=p$, $q_2=q$. For better readability let us denote $s=p_2$, $t=q_1$. To summarize, we know that $(p,s,p')$ and $(q,t,q')$ are shortest subpaths, $d(p,s)=d(q,t)=1$, $s\sqsubseteq q$, $p\sqsubseteq t$, $p\sqsubseteq q$, $p'\sqsubseteq q'$. It suffices to show $s\sqsubseteq t$; this will imply that [s, t] is a desired common neighbor of $[p_1,q_1],[p_2,q_2]$. Modulo symmetry, three subcases are possible. Case 1: $s\rightarrow p$, $t\rightarrow q$. Then $s\sqsubseteq q$ and $s\preceq p\preceq t\preceq q$, so condition (15a) gives $s\sqsubseteq t$. Case 2: $p\rightarrow s$, $t\rightarrow q$. By Lemma 33, $\Pr _{[p,q]}([p',q'])$ is equal to interval [a, b] for $a=\Pr _{[p,q]}(p')$ and $b=\Pr _{[p,q]}(q')$. Note that $s,t,a,b\in [p,q]$. Since $(p,s,p')$ and $(s,a,p')$ are shortest subpaths, so is $(p,s,a,p')$ and thus $s\preceq a$. Similarly $b\preceq t$. Thus $p\preceq s \preceq a\preceq b\preceq t\preceq q$ and $p\sqsubseteq q$ imply $s\sqsubseteq t$ (by condition (15a)), as desired. Case 3: $s\rightarrow p$, $q\rightarrow t$. Observe that $x\vee y$, $x\wedge y$ are defined and belong to [s, t] for all $x,y\in [s,t]$ by Lemma 7. Consider set $\Pr _{[s,t]}([p',q'])$, which is equal to [u, v] for $u=\Pr _{[s,t]}(p')$ and $v=\Pr _{[s,t]}(q')$ (Lemma 33). We must have $p\not \preceq u$ (implying $p\wedge u=s$); otherwise (s, p, u), $(s,u,p')$ and thus $(s,p,u,p')$, $(s,p,p')$ would be shortest subpaths, contradicting the assumption that $(p,s,p')$ is a shortest subpath and $p\ne s$. Similarly, $v\vee q=t$. Note that $u\sqsubseteq v$ by Corollary 36. Define $a=u\wedge q$. We claim that $a\sqsubseteq t$. Indeed, we have $a\sqsubseteq q$ since $s\preceq a\preceq q$ and $s\sqsubseteq q$. It now suffices to show that there exists b such that $a \sqsubseteq b$ and $b\vee q=t$; the claim will then follow by condition (15b). If $a=u$ then we can take $b=v$. Otherwise, if $a\prec u$, take b to be an out-neighbor of a in [a, u] (i.e. $a\rightarrow b\preceq u$); we have $b\notin [a,q]$ since $a=u\wedge q$, and hence $b\vee q=t$. We have $p\sqsubseteq t$, $a\sqsubseteq t$ and $p\wedge a=p\wedge u \wedge q=s\wedge q=s$, so condition (15c) gives $s\sqsubseteq t$.

$\square $

By combining Lemma 37, Theorem 18 and Theorem 6(a) we obtain:

Corollary 38

Let $\Gamma ^*$ be the 2-subdivision of $\Gamma $ on nodes $V^*$. Then

$$\begin{aligned} \mu ([p,q],[p',q'])=\mu (p,p')+\mu (q,q')\qquad \forall [p,q],[p',q']\in V^*\end{aligned}$$

Lemma 39

Let $f:\Gamma \rightarrow \overline{\mathbb R}$ be an L-convex function on an extended modular complex $\Gamma $. Then for every $p\in \Gamma $ the restrictions of f to $\mathcal{L}^-_p$ and to $\mathcal{L}^+_p$ are submodular functions.

Proof

Let us define $\mathcal{L}_p^{*+}=\{[p,q]\,{:}\,p\sqsubseteq q\}\subseteq \mathcal{L}_p^*$. By Lemma 37, any vertex in any shortest path between [p, q] and $[p,q']$ is of the form [p, u]. Hence $\mathcal{L}_p^{*+}$ is convex in $\Gamma $ and in $\mathcal{L}_p^{*}$. Therefore, submodularity of $f^*$ on $\mathcal{L}_p^*$ implies $f^*$ is submodular on $\mathcal{L}_p^{*+}$ (by [15, Lemma 3.7(4)]). Obviously $\mathcal{L}_p^{*+}$ is isomorphic to $\mathcal{L}_p^+$ by $[p,q] \mapsto q$. By using relation $f(q) = f^*([p,q]) - f(p)$ $(q \in \mathcal{L}^+_p)$, we see the submodularity of f on $\mathcal{L}^+_p$. The proof of submodularity of f on $\mathcal{L}^-_p$ is symmetric. $\square $