Abstract
We consider infinite programming problems with constraint sets defined by systems of infinite number of inequalities and equations given by continuously differentiable functions defined on Banach spaces. In the approach proposed here we represent these systems with the help of coefficients in a given Schauder basis. We prove Abadie constraint qualification under the new infinite-dimensional Relaxed Constant Rank Constraint Qualification Plus and we discuss the existence of Lagrange multipliers via Hurwicz set. The main tools are: Rank Theorem and Lyusternik–Graves theorem.
Similar content being viewed by others
1 Introduction
Let E be a Banach space and \(f_0,\ g_i:\ E \rightarrow \mathbb {R}\), \(i\in I_0\cup I_1\subseteq \mathbb {N}\), \(I_0\cap I_1=\emptyset \) are functions of class \(C^1\). We consider the following infinite programming problem:
i.e., both index sets \(I_0,I_1\) can be infinite.
This problem can be cast into a general framework by using Schauder basis. To this aim, we recall the following facts. Let F be a Banach space. Let \((b_{i})_{i\in \mathbb {N}}\) be a Schauder basis of F, i.e. for all \(x\in F\) there is a unique sequence of scalars \((x_n)\), \(n\in \mathbb {N}\) s.t. \(x=\sum \limits _{n=1}^\infty x_n b_n\).
Proposition 1
([2, Theorem 1.1.3]) Let X be a Banach space and let \((v_n)_{n\in \mathbb {N}}\) be a Schauder basis of X. Then there is a sequence \((v_n^*)_{n\in \mathbb {N}} \in X^*\) such that
and \(x=\sum _{n\in \mathbb {N}} v_n^*(x)v_n\) for each \(x \in X\). Such a pair \((v_n,v_n^*)\) is called biorthogonal system.
When F is finite-dimensional (\({\text {dim}} F= n\)) we assume that \((b_{i})_{i=1,\dots ,n}\) is a basis of F (see [14, Example (i), Chapter 6]). In the sequel, we use the term "basis" when referring to Schauder basis. Let us note that any separable Hilbert space has a basis.
Let \(K\subset F\) be a cone defined as
where \(I_0\cup I_1\subseteq \mathbb {N} \), \(I_0\cap I_1=\emptyset \). Cone K is closed, convex and has no interior points (see [15]). In the sequel, we refer to the cone K as a basis cone (see [16]).
Let \(G:\ E\rightarrow F\) be a \(C^1\) mapping, i.e. \(G:\ E\rightarrow F\) is continuously Fréchet differentiable mapping. Consider the minimization problem,
where \({{\mathcal {F}}}:=\{x\in E\ \mid \ G(x)\in K\}\).
By taking \(F=\ell _2\), \(b_i=e_i\), \(i\in \mathbb {N}\), where \((e_i)_{i\in \mathbb {N}}\), is the canonical basis of \(\ell _2\), the problem (P) is of the form (\(P_0\)). Indeed, in this case, the set \( {{\mathcal {F}}}\) takes the form
In the case when F is finite-dimensional (\(F = \mathbb {R}^\kappa \)) and \(b_i=e_i\), \(i\in \{1,\dots ,\kappa \} \) is the canonical basis of the space F, cone K has a form
where \(I_0\cup I_1 = \{1,\dots ,\kappa \}\).
Sufficient optimality conditions for problem (P) with F—finite dimensional and cone K, given by (3), are considered in [7]. In this paper, we concentrate on the case when F is infinite-dimensional and K is given by (2).
Remark 1
In Banach spaces F with a basis \((b_i)_{i\in \mathbb {N}}\), the basis cone, defined as
was considered in [15, 23]. For \(I_0=\emptyset \) and \(I_1=\mathbb {N}\), the basis cone \(K_{\{b_i\}}\) coincides with K. In the space c of convergent sequences, the basis cone \(K_{\{b_i\}}\) defined as (4) never coincide with the natural nonnegative cone, since the natural nonnegative cone has a nonvoid interior while the basis cone \(K_{\{b_i\}}\) always has a void interior (see [16]).
Let us note that even in Hilbert spaces not all standard nonnegative cones can be represented in the form of (2) as can be seen from the following example.Footnote 1
Example 1
Let \(F=L_2([0,1])\). Suppose that \(f_k \ge 0\), \(k=1,\dots \) is a basis of the nonnegative cone of \(L_2([0,1])\). Observe that, due to the fact that \(L_2^+([0,1])\) is generating, \(f_k\), \(k\in \mathbb {N}\), is also a basis of F. For each \(k\in \mathbb {N}\) let \(F_k\) be the support of \(f_k\) and choose a subset \(M_k \subset F_k\) of measure \(<10^{-k}\), Then the set \(M=\bigcup _{k\in \mathbb {N}} M_{k}\) has measure \(<1/9\), hence the set \(M^\prime =[0,1]\setminus M\) has measure greater than 0.
Let h(t) be the characteristic function of \(M^\prime \), i.e. \(h(t)=1\) for \(t\in M^\prime \) and \(h(t)=0\) for \(t\notin M^\prime \). Obviously, \(h(t)=0\) on M. On the other hand, \(h(t)=\sum _{k\in \mathbb {N}} a_k f_k(t)\) with \(a_k\ge 0\), \(k\in \mathbb {N}\). Take any \(k\in \mathbb {N}\) such that \(a_k>0\). Then \(h(t)>0\) on \(F_k\) and in particular on \(M_k\). However, \(M_k\subset M\), where \(h(t)=0\), a contradiction, i.e., such a basis does not exists.
Now, let us express the mapping appeared in the definition of problem (P) with the use of Schauder basis \(b_i\), \(i\in \mathbb {N}\). We have
By (2), we can rewrite set \({{\mathcal {F}}}\) in an equivalent way as follows
The aim of the present paper is to provide conditions ensuring Abadie CQ,
where \({{\mathcal {T}}}_{{{\mathcal {F}}}}(x_{0})\) is the tangent cone to \({{\mathcal {F}}}\) at \(x_0\) (see (6)) and \(\varGamma _{{{\mathcal {F}}}}(x_{0})\) is the linearized cone to \({{\mathcal {F}}}\) at \(x_0\) (see (7)). In the sequel we concentrate on the case \(I_0\cup I_1=\mathbb {N}\).
Our main tool is Relaxed Constant Rank Constraint Qualification Plus introduced in Sect. 6. Other regularity conditions were recently proposed in [4, 6, 7, 11, 25].
The novelty of our approach relies on the use of the Schauder bases and basic sequences in definitions of the main concepts, namely in the definitions of the CRC+ (Definition 9) and RCRCQ+ (Definition 10). Consequently, the assumptions of the main theorem (Theorem 6), of Proposition 9 on functional dependence and two auxiliary results (Propositions 7, 8) are expressed with the help of Schauder bases of some spaces generated by the derivative of the constraint map.
The proof of Theorem 6 is based on Rank Theorem 3 and Lyusternik–Graves Theorem (Theorem 5). Moreover, in Propositions 10 and 11 we prove the existence of Lagrange multipliers. To the best of our knowledge, the only result concerning the existence of Lagrange multipliers for which the proof is based on Rank Theorem is [9, Theorem 4.1].
Theorem 1
([9, Theorem 4.1]) Let \(x_0\) be a local solution to problem (P) with \(K=\{0\}\) and F be a Banach space. Let \(x_0\) be a local solution to (P). Assume that the following conditions are fulfilled:
-
(A)
\(f_0\) is Fréchet differentiable at \(x_0\) and G is of class \(C^1\) in a neighbourhood of \(x_0\).
-
(B)
\(E_2:= \ker DG(x_0)\) is topologically complemented in E, i.e., \(E=E_1 \oplus E_2\), where \(E_1\) is a closed subspace of E, \(F_1:= DG(x_0)(E)\) is closed and topologically complemented in F, i.e., \(F=F_1 \oplus F_2\), where \(F_2\) is a closed subspace of F.
-
(C)
There exists a neighbourhood of \(x_0\) such that for all x in this neighbourhood \(DG(x)(E)\cap F_2= \{0\}\).
Then there exists \(\lambda \in F^*\) such that \(Df_0(x_0)=\lambda \circ DG(x_0)\).
Remark 2
By Proposition 1 of [8], condition (C) is equivalent to condition
In Proposition 7 of Sect. 4 we provide conditions under which we ensure that (B) holds. In Proposition 8 of Sect. 4 we provide conditions under which we ensure that (C) holds.
The organization of the paper is as follows. In Sect. 2 we recall basic concepts related to Schauder bases. In Sect. 3 we introduce Constant Rank Condition (CRC) and Constant Rank Condition Plus (CRC+). In Sect. 4, Conditions 4., 5., 6. of CRC+ allows us to prove Proposition 7 on the split of the space E. Condition CRC+ allows us to prove Proposition 8 on isomorphisms. Section 5 is devoted to the proof of Proposition 9 on functional dependence. In Sect. 6 we introduce Relaxed Constant Rank Constraint Qualification Plus, which is the main ingredient of main results of Sect. 7. In Sect. 7 we present main result Theorem 6, namely sufficient conditions under which Abadie condition holds for the investigated problem. Section 8 is devoted to the topic of existence of Lagrange multipliers to problem (P). The main result of this section is Proposition 10, in which an important assumption is weak*-closedness of the Hurwicz set \({{\mathcal {M}}}(x_0,0)\). In the closing Sect. 9 we discuss this assumption. We conclude the paper with graphical illustration of the main contribution.
2 Preliminaries
In this section, we recall concepts and facts used throughout the paper.
Definition 1
For a given, possibly nonconvex, set \(Q\subset E\) and \(x\in Q\) the tangent (Bouligand) cone to Q is defined as
The cone
is called the linearized cone to \({{\mathcal {F}}}\) at x.
Definition 2
For any \(x\in {{\mathcal {F}}}\), where \({{\mathcal {F}}}\) is given by (5), I(x) denotes the set of active (inequality) indices of \({{\mathcal {F}}}\) at x,
In the example below we recall standard properties of tangent cone.
Example 2
Let \(G:E\rightarrow \mathbb {R}^{n}\), \(G(x)=(g_{1}(x),\dots ,g_{n}(x))\) and \(K=\mathbb {R}_{-}^n=\{y=(y_{i})\in \mathbb {R}^{n}\ |\ y_{i}\le 0,\ i=1,\dots n\}\), i.e.
where \(g_{i}:E\rightarrow \mathbb {R}\), \(i=1,\dots , n\). Let us calculate \({{\mathcal {T}}}_{K}(G(x_0))\), \(x_0\in E\), \(G(x_0)\in \text {cl}\, K=K\).
-
1.
If \(G(x_0)\in \text {int}\, \mathbb {R}_{-}^n\), then \({{\mathcal {T}}}_{K}(G(x_0))=\mathbb {R}^n.\)
-
2.
If \(G(x_0)\in \text {bd}\, \mathbb {R}_{-}^n\), then \(I(x_0):= \{i\in \{1,\dots ,n\}\ |\ g_{i}(x_0)=0\}\) is nonempty and \({{\mathcal {T}}}_{K}(G(x_0))=\mathbb {R}_{-}^n=\{y=(y_{i})\in \mathbb {R}^{n}\ |\ y_{i}\le 0,\ \ i\in I(x_0)\}\).
In the following proposition, we provide a characterization of tangent cone to K given by (2).
Proposition 2
For any \(y_0\in K\), where K is given by (2), we have
where \(I(y_{0}):=\{i\in I_1\ \mid \ b_i^*(y_0)= 0\}\).
Proof
Take any \(\tilde{z}\in {{\mathcal {T}}}_{K}(y_{0})\). By the definition, there exist \(r(t)\in F\) and \(\varepsilon _0>0\) such that \(r(t)/t\rightarrow 0^+\) and
By this, for all \(t\in [0,\varepsilon _{0})\)
and consequently
Since \(r(t)/t\rightarrow 0\) as \(t\rightarrow 0^+\), we obtain \({{\mathcal {T}}}_{K}(y_{0})\subset \{z\in F\ \mid \ b_{i}^* ( z)\le 0,\ i\in I(y_{0}),\ b_{i}^* ( z)=0,\ i\in I_0\} \).
Now, to see the converse, take any \(\tilde{z}\in \{z\in F\ \mid \ b_{i}^* (\tilde{z})\le 0,\ i\in I(y_{0}),\ b_{i}^* (\tilde{z})=0,\ i\in I_0\}\). We have
Since \(z_k\in {{\mathcal {T}}}_{K}(y_{0})\), \(k\in \mathbb {N}\) and \({{\mathcal {T}}}_{K}(y_{0})\) is closed, \(\tilde{z}\in {{\mathcal {T}}}_{K}(y_{0})\). \(\square \)
Fact 1
Let \(\bar{x} \in E\). Since G is assumed differentiable on E we have
Hence, due to the uniqueness of the representation of elements of the space F in basis \((b_{i})_{i\in \mathbb {N}}\), the coefficients \(g_{i}(\cdot )=b_i^{*}(g(\cdot ))\), \(i\in \mathbb {N}\), are differentiable for all \(x\in E\) and \(b_i^*( DG(\bar{x})z)= Dg_i(x)z \) for any \(z\in E\), \(i\in \mathbb {N}\).
As a consequence we obtain the following proposition.
Proposition 3
Let \(x_0\in {{\mathcal {F}}}\), where \({{\mathcal {F}}}\) is given by (5). Then
where \(I(x_{0}):=\{i\in I_1 \mid g_{i}(x_{0})=0\}\).
Proof
Take any \(d \in \varGamma _{{\mathcal {F}}}(x_0)\). This means that
By Proposition 2,
Hence, by (12),
Now, (11) follows from Fact 1. \(\square \)
2.1 Boundedly-Complete, Shrinking, Besselian and Hilbertian Bases
In this subsection we recall basic definitions and facts related selected types of bases in Banach spaces. These concepts will be extensively used in the sequel.
Definition 3
The closed subspace \(E_1\) of the Banach space E is said to be split, or complemented, if there is a closed subspace \(E_2 \subset E\) such that \(E = E_1 \oplus E_2\).
Proposition 4
([1, Theorem 2.1.15]) If F is a Hilbert space and \(F_1\) a closed subspace, then \(F=F_1\oplus F_1^\perp \). Thus every closed subspace of a Hilbert space splits (see e.g. Definition 2.1.14 of [1]).
Definition 4
([2, Definition 3.2.8]) Let X be a Banach space. A sequence \((v_n)_{n\in \mathbb {N}}\) in X is boundedly-complete if whenever \((a_n)_{n\in \mathbb {N}}\) is a sequence of scalars such that
then the series \(\sum _{n\in \mathbb {N}}a_n v_n\) converges.
Remark 3
Let \((v_n)_{n\in \mathbb {N}}\) in X be boundedly-complete. Then every subsequence \((v_{n_k})_{k \in \mathbb {N}} \) is boundedly-complete. Indeed, suppose that \((v_n)_{n\in \mathbb {N}}\) is boundedly-complete and \((a_{k})\), \(k\in \mathbb {N}\), is such that \(\sup _{N\in \mathbb {N}} \Vert \sum _{k=1}^N a_{k} v_{n_k} \Vert < \infty \). Let
Then
Definition 5
([2, Definition 3.2.5]) A basis \((v_n)_{n\in \mathbb {N}}\) of a Banach space X is shirnking if the sequence of its bioorthogonal functionals \((v_n^*)_{n\in \mathbb {N}}\) is a basis of \(X^*\), i.e., \(\overline{{\text {span}}}(v_n^*,\ n\in \mathbb {N})=X^*\).
Proposition 5
([2, Theorem 3.2.10]) Let \((v_n, v_n^*)\) be a biorthogonal system in a Banach space X. The following are equivalent:
- 1.:
-
\((v_n)_{n\in \mathbb {N}} \) is boundedly-complete,
- 2.:
-
\((v_n^*)_{n\in \mathbb {N}} \) is shrinking basis for \(\overline{{\text {span}}}(v_n^{*},\ n\in \mathbb {N})\),
- 3.:
-
the canonical map \({\text {eval}}_X:\ X\rightarrow \overline{{\text {span}}}(v_n^*,\ n\in \mathbb {N})^*\) defined by \({\text {eval}}_X (x)(h)=h(x)\) for all \(x\in X\) and \(h\in \overline{{\text {span}}}(v_n^*,\ n\in \mathbb {N})\), is an isomorphism.
Remark 4
(Corollary 3.2.11 of [2]) Every Schauder basis of a Hilbert space is boundedly-complete. \(c_0\) has no boundedly-complete basis.
Let us recall James Theorem from 1951.
Theorem 2
(James Theorem, see [2, Theorem 3.2.13]) Let X be a Banach space. If X has a basis \((v_n)_{n\in \mathbb {N}}\) then X is reflexive if and only if \((v_n)_{n\in \mathbb {N}}\) is both boundedly-complete and shrinking.
Definition 6
([26, Definition 11.1]]) We say that basis \((c_i)_{i\in \mathbb {N}}\) in a real Banach space is
-
1.
Besselian if
$$\begin{aligned} \sum _{i\in \mathbb {N}} \alpha _i c_i \quad \text {converges} \implies \sum _{i\in \mathbb {N}} (\alpha _i)^2<+\infty . \end{aligned}$$ -
2.
Hilbertian if
$$\begin{aligned} \sum _{i\in \mathbb {N}} (\alpha _i)^2<+\infty \implies \sum _{i\in \mathbb {N}} \alpha _i c_i \quad \text {converges}, \end{aligned}$$i.e., for every \(\alpha _i \in \mathbb {R}\), \(i\in \mathbb {N}\), with \(\sum _{i\in \mathbb {N}} (\alpha _i)^2<+\infty \) there exists an (obviously unique) x such that
$$\begin{aligned} c_i^*(x)=\alpha _i, \quad i\in \mathbb {N}. \end{aligned}$$
Remark 5
The natural basis of \(\ell _2\) is Besselian. Not all bases in Hilbert spaces are Besselian or Hilbertian. For \(L_2[-\pi ,\pi ]\) see [26, Example 11.2].
Remark 6
([27, Corollary 4.6]) The space \(L_1[0,1]\) has a Besselian basis.
We say that sequence \((b_i)_{i\in \mathbb {N}}\in F\) is a basic sequence if \((b_i)_{i\in \mathbb {N}}\) is Schauder basis of \(\overline{{\text {span}}}( b_i, i\in \mathbb {N} )\), where \(\overline{{\text {span}}}\) denotes the closure of the span and the closure is taken in the strong topology of the space.
Lemma 1
([17, Lemma 3.1]) The basis \((c_i)_{i\in \mathbb {N}}\) of Banach space X is Hilbertian (Besselian) if and only if the basic sequence \((c_i^*)_{i\in \mathbb {N}}\) in \(X^*\) is Besselian (Hilbertian).
Proposition 6
([26, Proposition 3.1]) Let \(\{v_n\}\) be a sequence in Banach space X, and assume that \(v_n\ne 0\) for every n.
Define \(Y:=\{ (c_n)\mid \sum _{n} c_n v_n\quad \text {converges in } X\} \) and the set
Then the following hold:
-
1.
Y is a Banach space.
-
2.
If \(\{v_n\}\) is a basis for X, then Y is topologically isomorphic to X via the mapping \((c_n)\mapsto \sum _{n} c_n v_n\).
For any infinite subset \(J=\{ j_1,j_2,\dots \}\subset \mathbb {N}\) let us denote
with the associated norm
and \(\ell _2(J):=\{ (c_j)_{j\in J} \mid \sum _{i\in J} (c_j)^2 < +\infty \}\) with the associated norm
Remark 7
Observe that if X is a Hilbert space with the inner product \(\langle \cdot \ |\ \cdot \rangle \), the norm \(\Vert x\Vert =\sqrt{\langle x\ |\ x\rangle }\) and the orthonormal basis \((x_{i})_{i\in J}\), then
By James Theorem (Theorem 2), \((x_{i})_{i\in J}\) is boundedly-complete in X. In view of this, when \((x_{i})_{i\in J}\) is an orthonormal, then \(Y(J)=\ell _{2}(J):=\overline{{\text {span}}}(e_{i},\ i\in J)\), where \(e_{i}^{j}=0\) if \(j=i\) and 0 otherwise, for all \(i,j\in J\).
3 Constant Rank Condition
In this section we introduce the Constant Rank Condition (CRC)Footnote 2 for a possibly infinite family of functions defined on a Banach space via Schauder basis. For other forms of CRC which do not refer to Schauder basis, see e.g. [9].
Let E be a Banach space and F a Hilbert space with basis \((b_{i})_{i\in \mathbb {N}}\). Consider \(f:E\rightarrow \overline{{\text {span}}}(b_{i},\ i\in \mathbb {N})\), i.e., \(f(x)=\sum _{i\in \mathbb {N}}f_{i}(x)b_{i}\), where \(f_{i}=b_{i}^{*}(f(x)):E\rightarrow \mathbb {R}\), \(i\in \mathbb {N}\) are continuous functionals.
Definition 7
Let \((f_{i})_{i\in J_1}:\ E \rightarrow Y(J_1)\), \(J_1\subset \mathbb {N}\) be of class \(C^1\). We say that the Constant Rank Condition (CRC in short) holds for \((f_{i})_{i\in J_1}\) at \(x_0\in E\) if there exist a neighbourhood \(V(x_0)\) and a subset \(J_2\subset J_1\) such that
- 1.:
-
\((Df_i(x))_{i\in J_2}\) forms a Schauder basis for \(\overline{{\text {span}}}(D f_{i}(x), i \in J_1)\) for all \(x\in V(x_0)\),
- 2.:
-
for any \(x\in V(x_0)\), there exists a topological isomorphism (linear)
$$\begin{aligned} z_x: \overline{{\text {span}}}( D f_{i}(x), i \in J_2) \rightarrow \overline{{\text {span}}}(D f_{i}(x_0), i \in J_2), \end{aligned}$$(16)such that \(z_x(D f_{i}(x))= D f_{i}(x_0)\), \(i \in J_2\).
We interpret \(z_x\) as an isomorphism of functionals, i.e., \(z_x(Df_i(x)(\cdot ))=Df_i(x_0)(\cdot )\), \(x\in V(x_0)\), \(i\in J_2\), hence, by (16), \((Df_i(x_0))_{i\in J_2}(E)\) and \((Df_i(x))_{i\in J_2}(E)\) are isomorphic for all \(x\in V(x_{0})\).
Remark 8
Let us note that, for \((f_{i})_{i\in J_1}\), \(J_1\subset \mathbb {N}\), where \(J_1\) is finite, the condition
is equivalent to the existence of isomorphism \(z_x\) given in Definition 7 for \((f_{i})_{i\in J_1}\) at \(x_0\) (see [7, Definition 2.1]).
Definition 8
([2, Definition 1.3.1]) Two bases (or basic sequences) \((u_n)_{n\in \mathbb {N}}\) and \((v_n)_{n\in \mathbb {N}}\) in the respective real Banach spaces X and Y are said to be equivalent, if whenever we take a sequence of scalars \((a_n)_{n\in \mathbb {N}}\), then \(\sum _{n\in \mathbb {N}}a_n u_n\) converges if and only if \(\sum _{n\in \mathbb {N}}a_n v_n\) converges.
Let us recall the following fact.
Fact 2
([2, Theorem 1.3.2]) Let \((u_i)_{i\in J}\), \(J\subset \mathbb {N}\), be a basic sequence in a Banach space X and let \((v_i)_{i\in J}\) be a basic sequence in a Banach space Y. The following are equivalent:
-
\((u_i)_{i\in J}\) is a basic sequence equivalent to \((v_i)_{i\in J},\)
-
There is an isomorphism T of \(\overline{{\text {span}}}(u_i )_{i\in J}\) onto \(\overline{{\text {span}}}(v_i)_{i\in J}\) s.t. \(T(u_i)=v_i\) for every \(i\in J\).
In other words, condition 2. of CRC means that a basic sequence \((Df_i(x))_{i\in J_2}\) is equivalent to a basic sequence \((Df_i(x_0))_{i\in J_2}\) in \(E^{*}\) for all x from some neighbourhood \(V(x_0)\).
In the sequel, e.g. in Definition 10, we will use CRC with some additional conditions, which motivates the following definition.
Definition 9
Let \((f_{i})_{i\in J_1}:\ E \rightarrow Y(J_1)\), \(J_1\subset \mathbb {N}\) be of class \(C^1\). We say that the Constant Rank Condition Plus (CRC+ in short) holds for \((f_{i})_{i\in J_1}\) at \(x_0\in E\) if there exist a neighbourhood \(V(x_0)\) and a subset \(J_2\subset J_1\) such that
- 1.:
-
\((Df_i(x))_{i\in J_2}\) forms a Schauder basis for \(\overline{{\text {span}}}(D f_{i}(x), i \in J_1)\) for \(x\in V(x_0)\),
- 2.:
-
for all \(x\in V(x_0)\), there exists a topological isomorphism
$$\begin{aligned} z_x: \overline{{\text {span}}}( D f_{i}(x), i \in J_2) \rightarrow \overline{{\text {span}}}(D f_{i}(x_0), i \in J_2), \text{ for } \text{ all } x\in V(x_0) \end{aligned}$$such that \(z_x(D f_{i}(x))= D f_{i}(x_0)\), \(i \in J_2\)
and additionally
- 3.:
-
\((Df_i(x_0))_{i\in J_2}(E)\) is closed in \(Y(J_2)\) defined in (14) with \(v_i=b_i\), \(i\in J_2\),
- 4.:
-
\((Df_i(x_0))_{i\in J_2}\) forms a basis which is shrinking and boundedly-complete for \(\overline{{\text {span}}}(Df_i(x_0),\ i\in J_1)\), equivalently \(\overline{{\text {span}}}(Df_i(x_0),\ i\in J_1)\) is reflexive and \((Df_i(x_0))_{i\in J_2}\) forms a basis of this space,
- 5.:
-
\((Df_i(x_0))_{i\in J_2}\) is Besselian for \(\overline{{\text {span}}}(Df_i(x_0),\ i\in J_1)\),
- 6.:
-
\(Df_i(x_0)^*\in E\), \(i\in J_2\).
In view of Remark 7 if \((b_{i})_{i\in J_{2}}\) is orthonormal, then, in Definition 9, 3. we have \(Y(J_{2})=\ell _{2}(J_{2})\).
Remark 9
If E is a reflexive space, then CRC+ for \((f_{i})_{i\in J_1}:\ E \rightarrow Y(J_1)\), \(J_1\subset \mathbb {N}\) takes the following form: there exist a neighbourhood \(V(x_0)\) and a subset \(J_2\subset J_1\) such that
- 1.:
-
\((Df_i(x))_{i\in J_2}\) forms a Schauder basis for \(\overline{{\text {span}}}(D f_{i}(x), i \in J_1)\) for all \(x\in V(x_0)\),
- 2.:
-
for any \(x\in V(x_0)\), there exists a topological isomorphism
$$\begin{aligned} z_x: \overline{{\text {span}}}(D f_{i}(x), i \in J_2) \rightarrow \overline{{\text {span}}}(D f_{i}(x_0), i \in J_2), \end{aligned}$$such that \(z_x(D f_{i}(x_0))= D f_{i}(x)\), \(i \in J_2\),
and
- 3.:
-
\((Df_i(x_0))_{i\in J_2}(E)\) is closed in \(Y(J_2)\) defined in (14), with \(v_i=b_i\), \(i\in J_2\),
- 4.:
-
\((Df_i(x_0))_{i\in J_2}\) is Besselian for \(\overline{{\text {span}}}(Df_i(x_0),\ i\in J_1)\).
Remark 10
Let us remark that, if \(J_1\) is finite and E is reflexive, then 3., 4., 5., 6. of Definition 9 are automatically satisfied.
In the definition of CRC+ we assume the closedness of \(((Df_i(x_0))_{i\in J_2})(E)\). Let us recall that the following sets do not have to be closed, as shown in the example below.
Example 3
([10]) Let \(G:\ \ell _2 \rightarrow \ell _2\) be defined as \(G(x)=(\frac{1}{2i}x_i^2)_{i\in \mathbb {N}}\). Then \(DG(\cdot ):=\sum _{n=1}^{\infty } \frac{1}{n} e_n \langle e_n \mid \cdot \rangle \) and
-
1.
\(DG(x)\in L(\ell _2,\ell _2)\) (with \(\Vert DG(x)\Vert =1\)),
-
2.
\(v\in DG(x)(\ell _2)\) if and only if \(\sum _{n\in \mathbb {N}} n^2 |\langle e_n \mid v \rangle |^2<+\infty \),
-
3.
with \(v_0:=\sum _{n\in \mathbb {N}} \frac{1}{n^{\frac{3}{2}}}e_n\) and \(\{v_j\}:=\sum _{n\in \mathbb {N}} \frac{1}{n^{\frac{3}{2}+\frac{1}{j}}}e_n\) we have \(v_0\notin DG(x)(\ell _2)\) yet \(v_j\in DG(x)(\ell _2)\) and \(v_j\rightarrow v_0\) as \(j\rightarrow +\infty \).
4 Complemented Kernels and Isomorphisms
In this section we prove Propositions 7 and 8 which, together with CRC+ will be used in the next sections. We believe that the mentioned propositions are of independent interest.
We start with Proposition 7 providing conditions for a \(C^{1}\) mapping f ensuring that the kernel of its derivative \(Df(x_{0})\) is complemented. Next, it is used in the proof of Lemma 2 which is, in turn, used in the proof of Proposition 8.
Proposition 7
Let E be a Banach space, F be Banach space with a Besselian Schauder basis \((b_i)_{i\in \mathbb {N}}\). Let \((f_{i})_{i\in J}:\ E \rightarrow Y(J)\), \(J\subset \mathbb {N}\) be of class \(C^1\), \(f:\ E \rightarrow F\), \(f(x):=\sum _{i\in J} f_i(x)b_i \), \(x\in E\). Let \(x_0\in E\) and \(E_2=\ker Df(x_0)\), \(X_1:=\overline{{\text {span}}}(Df_i(x_0),\ {i\in J})\), and assume that:
-
(A)
\((Df_i(x_0))_{i\in J}\) forms a shrinking and boundedly-complete basis for \(X_1\),
-
(B)
\((Df_i(x_0))_{i\in J}\) is Besselian for \(X_1\),
-
(C)
\(Df_i(x_0)^*\in E\), \(i\in J\).
Then \(E=E_1 \oplus E_2\), where \(E_1= \overline{{\text {span}}}(Df_i(x_0)^*,\ {i\in J}).\) Moreover, \(X_1\) is a reflexive space.
Proof
By (A), the fact that \(X_1\) is a reflexive space follows immediately from James Theorem (Theorem 2). Let \(v_{i}:=D f_i(x_0)\in E^*\), \(i\in J\). Since \((Df_i(x_0))_{i\in J}\) forms a boundedly-complete basis for \(X_1:=\overline{{\text {span}}}(Df_i(x_0),\ {i\in J})\), by Proposition 5, there exists canonical isomorphism \({\text {eval}}_{X_1}:\ X_1\rightarrow \overline{{\text {span}}}( (Df_i(x_0)^*,\ {i\in J})^*\) defined as
We have \(v_{i}^{*}(v_{j})=v_{j}(v_{i}^{*})=Df_i(x_0)Df_{j}(x_{0})^{*}\) for all \(i\in J\). By (1), (C), Fact 1 and Proposition 5
Now we show that \(E=X_{1}^{*}\oplus \text {ker }Df(x_{0})\). For any \(x\in E\), \(Df(x_{0})(x)= \sum _{j\in J} Df_j(x_0)(x)b_j=\sum _{i\in J} \alpha _i(x) b_i\), \(\alpha _i(x)=Df_i(x_0)(x)\in \mathbb {R}\). Since \((b_i)_{i\in \mathbb {N}}\) is Besselian for F, \(\sum _{i\in J} (\alpha _i(x))^2<+\infty \). Let \(m:=\sum _{i\in J}\alpha _i(x) Df_{i}(x_{0})^{*}\). By (B) and Lemma 1, \((Df_i(x_0)^*)_{i\in J}\) is Hilbertian for \(X_1^*\), m is well defined, i.e., \(m\in X_{1}^{*}\).
By (17), we have
This shows that \(x-m\in \text {ker}\, Df(x_{0})\) which proves the assertion with \(E_{1}:=X_{1}^{*}\) and \(E_{2}:=\text {ker}\, Df(x_{0}).\) \(\square \)
Remark 11
Let us note that space \(\ell _1\) contains no infinite-dimensional reflexive subspaces.
Corollary 1
Let E be a reflexive Banach space, F be Hilbert space with a Besselian Schauder basis \((b_i)_{i\in \mathbb {N}}\). Let \((f_{i})_{i\in J}:\ E \rightarrow Y(J)\), \(J\subset \mathbb {N}\) be of class \(C^1\), \(f:\ E \rightarrow F\), \(f(x):=\sum _{i\in J} f_i(x)b_i \), \(x\in E\). Let \(x_0\in E\) and \(E_2=\ker Df(x_0)\), \(X_1:=\overline{{\text {span}}}(Df_i(x_0),\ {i\in J})\) and assume that:
-
(A)
\((Df_i(x_0))_{i\in J}\) is a basis for \(X_1\),
-
(B)
\((Df_i(x_0))_{i\in J}\) is Besselian for \(X_1\).
Then \(E=E_1 \oplus E_2\), where \(E_1=\overline{{\text {span}}}(Df_i(x_0)^*,\ {i\in J}).\)
Proof
The proof follows directly from James Theorem applied to \(X_1\) and Proposition 7.\(\square \)
Remark 12
Assume that CRC holds for \((f_i)_{i\in J}\) at \(x_0\) with \(V(x_{0})\) as in Definition 7. Then, by 2. of CRC
and
Hence
For any \(f:E\rightarrow F\), and \(J_1\subset \mathbb {N}\), we have \(f(x)=\sum _{i\in J_1} b_i^*(f(x))b_i\), \(x\in E\), where \(f_{i}(x)=b_i^*(f(x))=0\) for \(i\in \mathbb {N}{\setminus } J_1\). Consequently, for any \(e\in E\),
where, by Fact 1, \(Df_{j}(x)e=b_j^*(Df(x)e)\), \(j\in J_1\).
In the lemma below we investigate the coefficients \(Df_{j}(x)e=b_j^*(Df(x)e)\), \(j\in J_1\), \(e\in E\), in a neighbourhood of \( x_{0}\) at which CRC+ holds.
Lemma 2
Let E be a Banach space, F be a Hilbert space and \((b_i)_{\in \mathbb {N}}\) be a Besselian basis of F. Let \((f_{i})_{i\in J}:\ E \rightarrow Y(J)\), \(J\subset \mathbb {N}\) be of class \(C^1\), \(f:\ E \rightarrow F\), \(f(x):=\sum _{i\in J} f_i(x)b_i \), \(x\in E\). Assume that CRC+ holds for \((f_i)_{i\in J_1}\) at \(x_0\) with \(J_2\subset J_1\) and neighbourhood \(V(x_0)\subset U\).
Let \(x\in V(x_0)\), \(e\in E\). Then there exist scalars \(\beta _{j}=\beta _{j}(e)\), \(j \in J_{2}\) depending on e but not on x such that
where
and \(D f_{i}(x_0)^*\in E\), \(i\in J_2,\) (see Proposition 1) are such that
Proof
The existence of \(Df_i(x_0)^*\in E^{**}\), \(i\in J_2\) is ensured by Proposition 1. By 6. of CRC+, \(Df_i(x_0)^*\in E\), \(i\in J_2\). Since, by 5. of CRC+, \(D f_{i}(x_0)\), \(i\in J_2\) forms a basis of \(\overline{{\text {span}}}( Df_i(x_0),\ i \in J_1) \) by Proposition 1, \(D f_{i}(x_0)^*\), \(i\in J_2\) forms a basis of \((\overline{{\text {span}}}( Df_i(x_0),\ i \in J_1))^* \).
By Proposition 7,
By 2. of CRC+ and (18), \(e_{2}\in \text {ker } Df(x)\) for \(x\in V(x_{0})\) and for \(i\in J_1\) we have
\(\square \)
Remark 13
In other words, (20) means that for \(x\in V(x_{0})\) and \(i\in J_{2}\)
In particular, for \(x=x_{0}\) and \(i\in J_{2}\),
Remark 14
By Proposition 1, vectors \(w_j(x_0)\), \(j\in J_2\) are such that
Observe that by (23), for \(x=x_{0}\) and \(j\in J_{1}{\setminus } J_{2}\)
because \(Df(x_{0})Df_{k}^{*}(x_{0})=\sum _{i\in J_1} Df_i(x_0)(Df_k^*(x_0))b_i=b_{k}\) for \(k\in J_{2}\) and, by the definition of \(b^{*}_{j}\),
Hence,
In the following proposition we prove that CRC+ ensures that the mapping defined by (27) is an isomorphism. This proposition together with the Rank Theorem (Theorem 3) will allow us to prove Proposition 9 which is crucial in the proof of the main result in Sect. 7.
Proposition 8
Let E be a Banach space, F be a Hilbert space and assume that \((b_i)_{i\in \mathbb {N}}\) is a Besselian basis of F and \(J_1\subset \mathbb {N}\). Let \((f_i)_{i\in J_1}:\ U \rightarrow Y(J_1)\), \(U\subset E\) open, be a \(C^{1}\) mapping. Assume that CRC+ holds for \((f_i)_{i\in J_1}\) at \(x_0\) with a neighbourhood \(V(x_0)\) with index set \(J_2\subset J_1\).
Then
is an isomorphism, where \(E_1:=X_1^*=\overline{{\text {span}}}(Df_i(x_0)^*,\ i\in J_2)\).
Proof
By 1. of CRC+, for any \(x\in V(x_0)\) for any \(l\in J_1{\setminus } J_2\), \(Df_l(x)(e_1)\) can be expressed by \(Df_i(x)(e_1)\), \(i\in J_2\), i.e., there exists scalars \(\alpha _i^l (x)\), \(i\in J_2\), \(l\in J_1{\setminus } J_2\) such that
Since \(e_1\in E_1\), by Lemma 2, we have
Since \((f_i)_{i\in J_1}\) is a \(C^{1}\) mapping, \(t_{x}\) is a continuous (linear) mapping. Now, we show that \(t_{x}\), \(x\in V(x_{0})\), is a bijection.
1. Step: injectivity. Let \(x\in V(x_0)\). Let us take \(x_1, x_2 \in E_1\). Suppose that \((Df_i(x))_{i\in J_1}(x_1)=(Df_i(x))_{i\in J_1}(x_2)\). Then \((Df_i(x))_{i\in J_1}(x_1-x_2)=0\) and, by Remark 12, \(x_1-x_2\in \ker (Df_i(x)_{i\in J_2})=\ker (Df_i(x_0)_{i\in J_1})=\bigcap _{i\in J_1} Df_i(x_0)\). On the other hand \(x_1-x_2\in E_1\), hence \(x_1=x_2\).
2. Step: surjectivity. Let \(e\in E\) and \(x\in V(x_0)\). By 1. of CRC+,
For \(i\in J_2\)
and for \(l\in J_1\setminus J_2\)
where \(e_1=\sum _{j\in J_2} \beta _j(e)Df_j(x_0)^*\in E_1\) (hence \(Df_i(x)(e)=Df_i(x)(e_1)\), \(i\in J_2\)). Since \(Df_i(x)(e)=Df_i(x)(e_1)\), \(i\in J_2\) and \(Df_l(x)(e)=Df_l(x)(e_1)\), \(l\in J_1\setminus J_2\), we obtain \((Df_i(x))_{i\in J_1}(e)=(Df_i(x))_{i\in J_1}(e_1)\). \(\square \)
5 CRC+ and Functional Dependence
In this section we use CRC+ to prove Proposition 9 which provides conditions for the functional dependence in the form of formula (30) and is based on Propositions 7, 8, and Rank Theorem (Theorem 3). Proposition 9 will be used in the proof of the main result, Theorem 6 of Sect. 7.
For convenience of the reader, we start by recalling the rank and the local representation theorems.
Theorem 3
(Rank Theorem, see [1, Theorem 2.5.15]) Let \(E,\ Y\) be Banach spaces. Let \(x_0\in U\), where U is an open subset of E and \(f:\ U\rightarrow Y\) be of class \(C^1.\)
Assume that \(Df(x_0)\) has a closed split image \(Y_1\) with closed component \(Y_2\) and a split kernel \(E_2\) with closed component \(E_1\) and that for all x in some neighbourhood of \(x_0\), \(Df(x)|E_1:\ E_1 \rightarrow Df(x)(E)\) is an isomorphism.
Then there exist open sets \(U_1\subset Y_1\oplus E_2\), \(U_2\subset E\), \(V_1\subset Y\), \(V_2\subset Y_1\oplus E_2\) and diffeomorphisms of class \(C^1\), \(\varphi :\ V_1\rightarrow V_2\) and \(\psi :\ U_1\rightarrow U_2\), \(x_0=(x_{01},x_{02})\in U_2\subset U\subset E_1\oplus E_2\), i.e. \(x_{01}\in E_1\), \(x_{02}\in E_2\), \(f(x_0)\in V_1\) satisfying
for all \((w,e)\in U_1\).
Theorem 4
([1, Theorem 2.5.14] Local Representation Theorem) Let \(E,\ Y\) be Banach spaces. Let \(f:\ U \rightarrow Y\) be of class \(C^r\), \(r\ge 1\) in a neighbourhood of \(x_0\in U\), \(U\subset E\) open set. Let \(Y_1\) be a closed split image of \(Df(x_0)\) with closed complement \(Y_2\). Suppose that \(Df(x_0)\) has a split kernel \(E_2=\ker Df(x_0)\) with closed complement \(E_1\). Then there are open sets \(U_1\subset U \subset E_1\oplus E_2 \) and \(U_2 \subset Y_1 \oplus E_2\), \(x_0\in U_2\) and a \(C^r\) diffeomorphism \(\psi :\ U_2\rightarrow U_1 \) such that \((f\circ \psi )(u,v)=(u,\eta (u,v))\) for any \((u,v)\in U_1\), where \(u\in E_1\), \(v\in E_2\) and \(\eta :\ U_2\rightarrow E_2\) is a \(C^r\) map satisfying \(D\eta (\psi ^{-1}(x_0))=0\).
Theorems 3 and 4 allow us to prove the following functional dependence result.
Proposition 9
Let E be a Banach space, F be a Hilbert space with Besselian and Hilbertian basis \((b_i)_{i\in \mathbb {N}}\). Let \((f_i)_{i\in J_1}:\ U \rightarrow Y(J_1)\), \(U\subset E\) open, be a \(C^{1}\) mapping. Let \(E_2=\text {ker}\, (Df_i(x_0)_{i\in J_1})\). Assume that CRC+ holds for \((f_i)_{i\in J_1}\), at \(x_0\) with the index set \(J_2\subset J_1\) and a neighbourhood \(V(x_0)\).
Then there exist functions \(h_l:\ Y(J_2) \rightarrow \mathbb {R}\), \(l\in J_1\setminus J_2\) of class \(C^1\) such that
Proof
By 3. of CRC+, \(Y_1:=(Df_i(x_0)(E))_{i\in J_2}\) is a closed subset of \(Y(J_2)\). By Proposition 7, applied to \((f_{i})_{i\in J_2}:\ E \rightarrow Y(J_2)\), \((Df_i)_{i\in J_2}(x_0)\) has a split kernel \(E_2=\ker ((Df_i(x_0))_{i\in J_2})=\ker ((Df_i(x_0))_{i\in J_1})\) with closed complement \(E_1=\overline{{\text {span}}}(Df_i(x_0)^*,i\in J_2)=\overline{{\text {span}}}(Df_i(x_0)^*,i\in J_1)\), \(E=E_1 \oplus E_2\). By Proposition 8, \( (Df_i(x)_{i\in J_1})_{|E_1}:\ E_1\rightarrow (Df_i(x)_{i\in J_1})(E)\), \(x\in V(x_0)\) is an isomorphism. By 1. of CRC+,
where \(\beta _{j}^i\in \mathbb {R}\), \(i\in J_1 {\setminus } J_2\), \(j\in J_2\).
Since \(Y_1\) is closed, by Proposition 4, \(\overline{{\text {span}}}(b_i, i\in J_1)\) splits, i.e., there exists \(Y_2\) such that \(\overline{{\text {span}}}(b_i, i\in J_1)=Y_1\oplus Y_2\).
By Rank Theorem 3, there are open sets \(U_2\subset U \subset E_1\oplus E_2 \) and \(U_1 \subset Y_1 \oplus E_2\), \(x_0\in U_2\), \(V_1\subset \overline{{\text {span}}}( b_i,\ i\in J_1)\), \(V_2\subset Y_1\oplus E_2\) and \(C^1\) diffeomorphisms \(\psi :\ U_1\rightarrow U_2 \), \(\varphi :\ V_1\rightarrow V_2\) such that for all \(x\in U_2\) there exists \((q,e)\in U_1\) such that \(x=\psi (q,e)\) and
By Local Representation Theorem 4, there exists a function \(\eta :\ U_1\rightarrow Y_2\) of class \(C^1\) such that for all \((q,e)\in U_1\)
Let us put \((\bar{f}_i)_{i\in J_1}(q):=(q,\eta (q,e))=\varphi ^{-1}(q,0)\) for \((q,e)\in U_1\), where \(\varphi ^{-1}(q,0)=(\varphi _i^{-1}(q,0))_{i\in J_1}\).
Then, by (31) and (32) we have that
and for any \(l\in J_1\setminus J_2\),
Therefore,
where \(h_l:\ Y(J_2)\rightarrow \mathbb {R}\). \(\square \)
6 Relaxed Constant Rank Constraint Qualification Plus (RCRCQ+)
In this section we introduce the concept of Relaxed Constant Rank Constraint Qualification Plus, which is the crucial assumption of our main result, Theorem 6 of Sect. 7.
Definition 10
Let \((g_{i})_{i\in I_0\cup I_1}:\ E \rightarrow Y(I_0\cup I_1)\), \(I_0\cup I_1\subset \mathbb {N}\), \(I_0\cap I_1=\emptyset \) be of class \(C^1\). We say that Relaxed Constant Rank Constraint Qualification Plus (RCRCQ+ in short) holds for set \({{\mathcal {F}}}\), given by (5), at \(x_0\) if there exists a neighbourhood \(V(x_0)\) such that for all J, \(I_{0}\subset J\subset I_0\cup I(x_0)\), CRC+ holds for \((g_i)_{i\in J}\) at \(x_0\) with neighbourhood \(V(x_0)\), i.e., exists \(J_2\subset J\) such that
- 1.:
-
\((Dg_i(x))_{i\in J_2}\) forms a Schauder basis for \(\overline{{\text {span}}}(D g_{i}(x), i \in J)\) for all \(x\in V(x_0)\),
- 2.:
-
for any \(x\in V(x_0)\), there exists a topological isomorphism
$$\begin{aligned} z_x: \overline{{\text {span}}}(D g_{i}(x), i \in J) \rightarrow \overline{{\text {span}}}(D g_{i}(x_0), i \in J), \end{aligned}$$such that \(z_x(D g_{i}(x_0))= D g_{i}(x)\), \(i \in J_2\),
and additionally
- 3.:
-
\((Dg_i(x_0))_{i\in J_2}(E)\) is closed in \(Y(J_2)\) defined in (14), with \(v_i=b_i\), \(i\in J_2\),
- 4.:
-
\((Dg_i(x_0))_{i\in J_2}\) forms a shrinking and boundedly-complete basis for \(X_1:=\overline{{\text {span}}}(Dg_i(x_0),\ i\in J)\),
- 5.:
-
\((Dg_i(x_0))_{i\in J_2}\) is Besselian for \(\overline{{\text {span}}}(Dg_i(x_0),\ i\in J)\),
- 6.:
-
\(Dg_i(x_0)^*\in E\), \(i\in J_2\).
Remark 15
Let us note that if there is no inequality constraints, i.e. \(I_1=\emptyset \), then the Relaxed Constant Rank Constraint Qualification Plus for set \({{\mathcal {F}} }\) at \(x_0\) is equivalent to Constant Rank Condition Plus for \((g_i)_{i\in I_0}\) at \(x_0\).
Remark 16
If E is a reflexive space, then RCRCQ+ for the set \({{\mathcal {F}}}\) given by (5) takes the following form: there exists a neighbourhood \(V(x_0)\) such that for all J, \(I_{0}\subset J\subset I_0\cup I(x_0)\), exists \(J_2\subset J\) such that
- 1.:
-
\((Dg_i(x))_{i\in J_2}\) forms a Schauder basis for \(\overline{{\text {span}}}(D g_{i}(x), i \in J)\) for all \(x\in V(x_0)\),
- 2.:
-
for any \(x\in V(x_0)\), there exists a topological isomorphism
$$\begin{aligned} z_x: \overline{{\text {span}}}(D g_{i}(x), i \in J) \rightarrow \overline{{\text {span}}}(D g_{i}(x_0), i \in J), \end{aligned}$$such that \(z_x(D g_{i}(x_0))= D g_{i}(x)\), \(i \in J_2\),
and
- 3.:
-
\((Dg_i(x_0))_{i\in J_2}(E)\) is closed in \(Y(J_2)\) defined in (14), with \(v_i=b_i\), \(i\in J_2\),
- 4.:
-
\((Dg_i(x_0))_{i\in J_2}\) is Besselian for \(\overline{{\text {span}}}(Dg_i(x_0),\ i\in J)\).
Let us note that if \(J\subset \mathbb {N}\) is finite, \(|J|=n\) and \(g_i:\ E \rightarrow \mathbb {R}\), \(i\in J\) are of class \(C^1\) in some neighbourhood of \(x_0\) then \(((Dg_i(x_0))_{i\in J})(E)\) is closed in \(\mathbb {R}^n\).
7 Main Result
This section contains our main result of the paper, which is Theorem 6. The following condition will be used in our main theorem.
-
(H1)
For all \(d\in \varGamma _{{\mathcal {F}}}(x_0)\) and for any vector function \(r:\ (0,1)\rightarrow E\) such that \(\Vert r(t)\Vert t^{-1}\rightarrow 0\), as \(t\downarrow 0\), there exists a number \(\varepsilon _0>0\) such that
$$\begin{aligned} g_i(x_0+td+r(t))\le 0 \text { for all } i\in I_1 \setminus I(x_0,d) \text { and for all } t\in (0,\varepsilon _0), \end{aligned}$$where \(I(x_0,d):=\{ i\in I(x_0)\mid \langle D g_i(x_0) \,, \, d \rangle =0 \}\).
Note that if for some \(d\in \varGamma _{{\mathcal {F}}}(x_0)\), set \(I_1\setminus I(x_0,d)\) is finite, then the Condition (H1) is satisfied, cf. [7, Lemma 6.4].
The following examples illustrates condition (H1).
Example 4
Let \(h:\ \mathbb {R}\rightarrow {{\mathcal {X}}}\), \({{\mathcal {X}}}\) is sequence space \(\ell _p\), \(p\ge 1\), \(h_i(x)=a_i x^2 + b_i x + c_i\), \(i=1,\dots \) where \(a_i\le 0\), \(i=1,\dots \) and
Let \({{\mathcal {F}}}:= \{ x \in \mathbb {R} \mid h(x)\in {{\mathcal {X}}}_{-} \}\), where \(h(x)=(h_i(x))_{i\in I_0\cup I_1}\) and \(x_0=0\). Then \(I(x_0)=\{2k,\ k\in \mathbb {N}\}\). Assume that \(b_i> 0\), \(i=1,\ldots \). Then
Let us take \(d=-1\). We have
Since \(a_i,c_i\le 0\) and \(b_i\ge 0\), \(i=1,\dots \) we obtain that \(-1 \in {{\mathcal {T}}}_{{\mathcal {F}}}(0)\). Since \({{\mathcal {T}}}_{{\mathcal {F}}}(0)\subset \varGamma _{{\mathcal {F}}}(0)\), Abadie constraint qualification holds, i.e., \({{\mathcal {T}}}_{{\mathcal {F}}}(0)= \varGamma _{{\mathcal {F}}}(0)\). Let us also prove that \(h_i\) also satisfy condition (H1). Indeed, if \(i\in I(0) {\setminus } I(0;d)\), we have \(b_i \cdot d <0.\) So, inequality
is satisfied for any function r(t) s.t. \(r(t)/t\rightarrow 0^+\) for \(t\in (0, \varepsilon _0)\) for some \(\varepsilon _0\).
Note that condition PMFCQ from [25] is not satisfied for functions in Example 4.
In the following example RCRCQ+ and Abadie constraint qualification are not satisfied.
Example 5
Let \(g_j:\ \mathbb {R}^2 \rightarrow \mathbb {R}\), \(j\in \mathbb {N}\) be defined by
Functions \(g_j:\ \mathbb {R}^2 \rightarrow \mathbb {R}\), \(j\in \mathbb {N}\) are differentiable, and
Then
and
Hence, Abadie constraint qualification does not hold. Moreover, RCRCQ+ does not hold, since
for any \((x_1,x_2)\) s.t. \(x_2\ne 0\). Note that for any \(d\in \varGamma _{{\mathcal {F}}}(0,0)\), \(I_1 {\setminus } I((0,0),d)\) is a finite set, thus it is easy to check that condition (H1) is satisfied.
Our main results exploit the Theorem of Lyusternik-Graves.
Theorem 5
(Lyusternik–Graves) Let X and Y be Banach spaces, let U be a neighborhood of a point \(x_0\in X\), and let \(f:\ U \rightarrow Y\) be a Fréchet differentiable mapping. Assume that f is regular at \(x_0\), i.e., that \(\text {Im}\, Df(x_0)= Y\), and that its derivative is continuous at this point (in the uniform operator topology of the space \(\varGamma (X, Y)\)). Then the tangent space \({{\mathcal {T}}}_M(x_0)\) to the set
at the point \(x_0\) coincides with the kernel of the operator \(Df(x_0)\), i.e.,
Moreover, if the assumptions of the theorem are satisfied, then there exist a neighbourhood \(U'\subset U\) of the point \(x_0\), a number \(K>0\), and a mapping \(\xi \rightarrow x (\xi )\) of the set \(U'\) into X such that
for all \(\xi \in U'\).
Assertion (33) follows from (34), see [20]. For discussion, see e.g. [13, 19]. The assertion (34) is called the generalized Ljusternik Theorem in [12].
Now we are ready to establish our main theorem.
Theorem 6
Let E be a Banach space, F be a Hilbert space and assume that \((b_i)_{i\in \mathbb {N}}\) is a Besselian and Hilbertian basis of F. Let \({{\mathcal {F}}}\subset E\) be given as in (5).
Assume that
-
(i)
RCRCQ+ holds for \({{\mathcal {F}}}\) at \(x_0\in {{\mathcal {F}}}\),
-
(ii)
condition (H1) is satisfied at \(x_0\).
Then Abadie constraint qualification holds, i.e., \(\varGamma _{{\mathcal {F}}}(x_0)={{\mathcal {T}}}_{{\mathcal {F}}}(x_0)\).
Moreover, for each \(d\in {{\mathcal {T}}}_{{\mathcal {F}}}(x_{0})\) there is a vector function \(r:\ (0,1)\rightarrow E\), \(\Vert r(t)\Vert /t\rightarrow 0\) when \(t\downarrow 0\), such that for all t sufficiently small
Proof
The inclusion \({{\mathcal {T}}}_{{\mathcal {F}}}(x_0)\subset \varGamma _{{\mathcal {F}}}(x_0)\) is immediate. To see the converse, take any \(d\in \varGamma _{{\mathcal {F}}}(x_0)\), where by Proposition 3,
Recall that \(I(x):=\{ i\in I_1 \mid g_i(x)=0 \}\).
We start by considering the case \(J:=J(d)\ne \emptyset \), \(|J|=+\infty \).
By RCRCQ+ of \({{\mathcal {F}}}\) at \(x_0\), there exist \(V(x_0)\) and \(J_2\subset J\) such that \((Dg_i(x_0))_{i\in J_2}\) forms a boundedly-complete and shrinking basis for \(\overline{{\text {span}}}(Dg_i(x_0),i\in J)\) and there exists a topological isomorphism z
for (t, r) in some neighbourhood of \((0,0)\in \mathbb {R}\times E\) such that \(z_{t,r}(Dg_i(x_0+td+r))=Dg_i(x_0), i \in J_2\) for all \((t,r)\in \mathbb {R}\times E\) such that \(x_0+td+r\in V(x_0)\).
Let \(f:E\rightarrow Y(J_2)\) be defined as \(f(x):=(g_i(x))_{i\in J_2}\), where \(Y(J_2)\) is defined by (14). By (22), the derivative \(Df(x_0)\) is onto \(Y(J_2)\).
Let us define
By Lyusternik–Graves Theorem 5 applied to the set M,
By applying Lyusternik-Graves Theorem 5 with f at \(x_0\), we obtain that \(d\in {{\mathcal {T}}}_{M}(x_0)\).
-
Case 1
If \(J_2=J\), then \(g_i(x_0+td+r(t))=0\), \(i\in J\) for \(t\in [0,\varepsilon ]\), where \(\varepsilon >0\) and r(t) is given by Lyusternik-Graves Theorem.
-
Case 2
If \(J_2\subsetneq J\) then, by Proposition 9, applied to \(g_i\), \(i\in J\), there exist functions \(h_l\), \(l\in J{\setminus } J_2\) of class \(C^1\), such that
$$\begin{aligned} g_l(x_0+td+r)=h_l((g_i(x_0+td+r))_{i\in J_2}), \end{aligned}$$(37)for (t, r) in some neighbourhood of \((0,0)\subset \mathbb {R}\times E\).
Consider the system
$$\begin{aligned} g_i(x_0+td+r)=0,\quad i\in J \end{aligned}$$(38)with respect to variables t, r. Let us note that system (38) is satisfied for \((t,r)=(0,0).\)
Obviously, by Proposition 9, in some neighbourhood of (0, 0), system (38) is equivalent to
$$\begin{aligned} g_i(x_0+td+r)=0, \quad i\in J_2 \end{aligned}$$(39)with additional condition
$$\begin{aligned} g_l(x_0+td+r)=h_l((g_i(x_0+td+r))_{i\in J_2})=0,\ l\in J\setminus J_2. \end{aligned}$$(40)Note that \(h_l((g_i(x_0))_{i\in J_2})=0\), \(l\in J{\setminus } J_2\) since \(g_l(x_0)=0=h_l((g_i(x_0))_{i\in J_2})=h_l(0)\).
In both cases there exist \(\varepsilon >0\) and a function \(r:\ [0,\varepsilon )\rightarrow E\), \(\Vert r(t)\Vert t^{-1}\rightarrow 0\), \(t\downarrow 0,\) such that
i.e.,
By condition (H1), there exists \(\varepsilon _0>0\) such that
therefore
Thus, \(d\in {{\mathcal {T}}}_{{\mathcal {F}}}(x_0)\).
Now, let us consider the case \(J=\emptyset \) (i.e. the case when both \(I_0=\emptyset \) and \(I(x_0,d)=\emptyset \)). Then, by condition (H1), for any vector function \(r:\ (0,1)\rightarrow E\), \(\Vert r(t)\Vert /t\rightarrow 0\) when \(t\downarrow 0\) there exists \(\varepsilon >0\) such that
i.e., \(d\in {{\mathcal {T}}}_{{\mathcal {F}}}(x_0)\). \(\square \)
8 RCRCQ+ and Lagrange Multipliers
In this section, using [11] and RCRCQ+ condition we will prove non-emptiness of the Lagrange multipliers set.
The Lagrange function or Lagrangian corresponding to problem (P) is a function \(L:\ E \times F\rightarrow \mathbb {R}\),
Definition 11
Let \(Q\subset F\) be a closed, convex set. The normal cone to Q at \(\bar{y} \in F\) is the set
Definition 12
A feasible point \(\bar{x}\in {{\mathcal {F}}}\) of (P) is called a KKT point if there exists \(\bar{\lambda }\in {{\mathcal {N}}}_{K}(G(\bar{x}))\) such that
where K is defined by (2). In this case \(\bar{\lambda }\) is called Lagrange multiplier of (P) at \(\bar{x}\in {{\mathcal {F}}}\).
Following the notation introduced in [11] we recall the set corresponding to KKT points,
In a more general setting, this set has been defined by Hurwicz in [18] and a number of its properties has been shown in [22]. We refer to this set as a Hurwicz set.Footnote 3
By Proposition 5.6 of [11] we immediately get the following result.
Proposition 10
Let E be a Banach space, F be a Hilbert space and assume that \((b_i)_{i\in \mathbb {N}}\) is a Besselian and Hilbertian basis of F. Let \({{\mathcal {F}}}\subset E\) be given as in (5). Let \(x_0\in {{\mathcal {F}}}\) be a local minimizer of (P). Assume RCRCQ+ holds for \({{\mathcal {F}}}\) at \(x_0\in {{\mathcal {F}}}\) with a neigbourhood \(V(x_0)\). Assume that assumption (H1) is satisfied at \(x_0\). Assume that the Hurwicz set \({{\mathcal {M}}}(x_0,0)\) is weakly*-closed. Then the set of Lagrange multipliers at \(x_0\) is nonempty.
Proof
By Theorem 6, Abadie constraint qualification holds for \({{\mathcal {F}}}\) at \(x_0\). The rest of the proof follows the lines of the proof of Proposition 5.6 of [11]. \(\square \)
Remark 17
Let us underline the fact that in both papers [4, 11] in the definition of Abadie constraint qualification the Hurwicz set \({{\mathcal {M}}}(x_0,0)\) is weakly*-closed.
The next two lemmas allow us to provide descriptions of elements of the Hurwicz set \({{\mathcal {M}}}(x_0,0)\).
Lemma 3
Let E be a Banach space and F be a Hilbert space with basis \((b_i)\subset F\), \(i\in \mathbb {N}\). Let \(K=\{ y=\sum _{i\in \mathbb {N} } b_i^*(y)b_i\in F \mid b_i^*(y)\in \mathbb {R},\ i\in I_0, b_i^*(y)\le 0, i\in I_1 \}\), \(I_0\cup I_1=\mathbb {N}\), \(I_0\cap I_1=\emptyset \). Let \(G:\ E\rightarrow F\) be of class \(C^1\). Let \(x_0\in E\) and \(y_0:=G(x_0)\in K\). Then the following hold
where \(I(x_0)=\{ i\in I_1 \mid g_i(x_0)=0 \}\).
Proof
Let \(x_0\in E\) be such that \(y_0=G(x_0)\in K\), i.e. \(y_0=\sum _{i\in \mathbb {N}} b_i^*(y_0)b_i=\sum _{i\in I_0\cup I_1} b_i^*(G(x_0))b_i\), \(b_i^*(G(x_0))\le 0\), \(i\in I_1\). Let \(y\in {{\mathcal {N}}}_K(y_0)\) be arbitrary.
From the definition of normal cone we have
and in consequence
From this, by taking \(k=\sum _{j\in I_0\cup I_1} \alpha _j b_j \in K\), \(\alpha _j=0\), \(j\in I_0\), \(\alpha _j\le 0\), \(j\in I_1\) we get
and equivalently
Take any fixed \(l\in I_0\cup I_1\).
-
Case 1
\(b_l^{*}(y_0)<0\), i.e. \(l\in I_1 {\setminus } I(x_0)\). By taking \(\alpha _i=b_i^*(y_0)=0 \), \(i\in I_0\), \(\alpha _i=b_i^*(y_0)\le 0\), \(i\in I_1{\setminus } \{ l \}\) and \(\alpha _l=0\) we deduce
$$\begin{aligned} b_l^{**}(y) (-b_l^*(y_0)) \le 0. \end{aligned}$$Therefore, \(b_l^{*}(y)\le 0\). On the other hand, by taking \(\alpha _i=b_i^*(y_0)=0\), \(i\in I_0\), \(\alpha _i=b_i^*(y_0)\le 0\), \(i\in I_1{\setminus } \{ l \}\) and \(\alpha _l=2\cdot b_l^{*}(y_0)\le 0\) we obtain
$$\begin{aligned} b_l^{**}(y) b_l^*(y_0) \le 0, \end{aligned}$$i.e., \(b_l^{**}(y)\ge 0\). In conclusion, \(b_l^{**}(y)= 0\).
-
Case 2
\(b_l^{*}(y_0)=0\) and \(l\in I(x_0)\). By taking \(\alpha _i=b_i^*(y_0)=0 \), \(i\in I_0\), \(\alpha _i=b_i^*(y_0)\le 0\), \(i\in I_1{\setminus } \{ l \}\) and \(\alpha _l=-1\) we obtain
$$\begin{aligned} b_l^{**}(y) \cdot (-1) \le 0, \end{aligned}$$i.e. \(b_l^{**}(y)\ge 0\).
-
Case 3
\(b_l^{*}(y_0)=0\) and \(l\in I_0\). Then \(b_l^{**}(y_0)\in \mathbb {R}\).
Therefore,
\(\square \)
Lemma 4
Let E be a Banach space and F be a Hilbert space with basis \((b_i)\subset F\), \(i\in \mathbb {N}\). Let \(K=\{ y=\sum _{i\in \mathbb {N} } b_i^*(y)b_i\in F \mid b_i^*(y)\in \mathbb {R},\ i\in I_0, b_i^*(y)\le 0, i\in I_1 \}\), \(I_0\cup I_1=\mathbb {N}\), \(I_0\cap I_1=\emptyset \). Let \(G:\ E\rightarrow F\) be of class \(C^1\).
Let \(x_0\in E\) and \(y_0:=G(x_0)\in K\). Then
where \(a_i:=b_i^{**}(y)\), \(i\in I_0\cup I(x_0)\), \(a_i\ge 0\), \(i\in I(x_0)\), \(a_i\in \mathbb {R}\), \(i\in I_0\).
Proof
Let \(\langle \cdot , \cdot \rangle _E:\ E\times E^* \rightarrow \mathbb {R}\) be a duality mapping for the pair \(E,E^*\). Observe, that for any \(x\in E\) and for any \(y\in F\)
where \(DG^*(x_0):\ F \rightarrow E^*\) denotes the adjoint operator for \(DG(x_0):\ E\rightarrow F\). Let \(z\in {{\mathcal {M}}}(x_0,0)\), \(z=DG^*(x)(y),\ y\in {{\mathcal {N}}}_K(G(x_0))\), then
Since (44) hold for any \(x\in E\), we obtain that \(z=\sum _{i\in I_0 \cup I(x_0)} a_i Dg_i(x_0) \).\(\square \)
Let us consider now the case when no inequality are present (i.e. \(I_1=\emptyset \)), i.e.,
Such problems has been considered in e.g. in [9, Theorem 4.1]. In this case we are also getting the existence of Lagrange multipliers (see Proposition 11). By Proposition 7 we are getting split of E, which is included in assumption (B) of [9, Theorem 4.1] and by Proposition 8 we are obtaining isomorphism of \(((Dg_i(x))_{i\in I_0})|_{E_1}:\ E_1\rightarrow ((Dg_i(x))_{i\in I_0})(E)\), \(x\in U(x_0)\), which is included in assumption (C) of [9, Theorem 4.1] (see Remark 2).
Proposition 11
Let E be a Banach space, F be a Hilbert space and assume that \((b_i)_{i\in \mathbb {N}}\) is a Besselian and Hilbertian basis of F. Let \({{\mathcal {F}}}\subset E\) be given as in (5), where \(I_1=\emptyset \). Let \(x_0\in {{\mathcal {F}}}\) be a local minimizer of (P). Assume CRC+ holds for \((g_i)_{i\in I_0}\) at \(x_0\in {{\mathcal {F}}}\) with a neigbourhood \(V(x_0)\). Then the set of Lagrange multipliers at \(x_0\) is nonempty.
Proof
Let \(x_0\) be a local minimizer of problem (\(P_0\)) with \(I_1=\emptyset \). The first-order necessary optimality condition is \(Df_0(x_0)h=0\) for all \(h\in {{\mathcal {T}}}_{{\mathcal {F}}}(x_0)\). By the proof of Theorem 6 (see (41) with \(J=I_0\)),
Therefore \(Df_0(x)h=0\) for any \(h\in \ker DG(x_0)\), i.e., \(Df_0(x)\in (\ker DG(x_0))^\perp \). By Proposition 7, \((\ker DG(x_0))^\perp =\overline{{\text {span}}}( Dg_i(x_0)^*, i \in I_0 )\). Therefore \(Df_0(x_0)\in \overline{{\text {span}}}( Dg_i(x_0)^*, i \in I_0 )\), i.e., there exists \(\lambda \in F\) such that \(Df_0(x_0)=\langle DG(x_0)^* \mid \lambda \rangle = \lambda (DG(x_0))\). \(\square \)
9 The Case of \({{\mathcal {M}}}(x_0,0)\), when \(I_0=\emptyset \)
In this section, we consider the case when \(I_0=\emptyset \), i.e., no equalities appear in the description of the constraint set, and provide discussion with NFMCQ condition introduced in [25]. As a corollary of Lemma 3, we obtain the following crucial characterization.
Corollary 2
Let E be a Banach space and F be a Hilbert space with basis \((b_i)\subset F\), \(i\in \mathbb {N}\). Let \(K=\{ y=\sum _{i\in \mathbb {N} } b_i^*(y)b_i\in F \mid b_i^*(y)\le 0, i\in I_1 \}\), \(I_1=\mathbb {N}\). i.e. \(I_0=\emptyset \). Let \(G:\ E\rightarrow F\) be of class \(C^1\). Let \(x_0\in E\) and \(y_0:=G(x_0)\in K\). Then the following hold
where \(I(x_0)=\{ i\in I_1 \mid g_i(x_0)=0 \}\).
Proof
In view of Lemma 3 we need only to show that
Take any \(\bar{y}\in \{ y \in F \mid y=\sum _{i\in I_1} b_i^{**}(y)b_i^*,\ b_i^{**}(y)=0, i\in I_1{\setminus } I(x_0),\ b_i^{**}(y)\ge 0,\ i\in I(y_0)\}\). Take any \(k\in K\). We want to show that
We have
\(\square \)
Corollary 3
Let E be a Banach space and F be a Hilbert space with basis \((b_i)\subset F\), \(i\in \mathbb {N}\). Let \(K=\{ y=\sum _{i\in \mathbb {N} } b_i^*(y)b_i\in F \mid b_i^*(y)\le 0, i\in I_1 \}\), \(I_1=\mathbb {N}\). Let \(G:\ E\rightarrow F\) be of class \(C^1\). Let \(x_0\in E\) and \(y_0:=G(x_0)\in K\). Then
where \(a_i:=b_i^{**}(y)\), \(i\in I_0\cup I(x_0)\), \(a_i\ge 0\), \(i\in I(x_0)\), \(a_i\in \mathbb {R}\), \(i\in I_0\). Moreover, if \(I(x_0)\) is finite, then the Hurwicz set \({{\mathcal {M}}}(x_0,0)\) is closed.
Proof
The implication “\(\implies \)” follows from Lemma 4. Now we proceed to prove the implication “”. Let \(z= \sum _{i\in I(x_0)} a_i Dg_i(x_0)\). Then, by (44), \(z=DG^*(x_0)(\sum _{i\in I_0\cup I(x_0)} a_i b_i^*)\). By Corollary 2, \(y:=\sum _{i\in I_0\cup I(x_0)} a_i b_i^*\in {{\mathcal {N}}}_K(G(x_0))\), hence \(z=DG^*(x_0)(y)\), \(y\in {{\mathcal {N}}}_K(G(x_0))\).
The fact that the Hurwicz set \({{\mathcal {M}}}(x_0,0)\) is closed, in the case if \(I(x_0)\) is finite, follows from closedness of finitely generated cones (see e.g. [3, Corollary 5.25]). \(\square \)
Definition 13
Let X be a Banach space. We say that \((h_i)_{i\in J}\in X\) is relatively positive with respect to \(J_2\subset J\) if for any \(z\in X\) the following implication holds
Equivalently,
Remark 18
Suppose that \(h_i,i\in J_2\) is a basis for X with biorthogonal system \((h_i,h_i^*)\). By [26, Proposition 5.1], \(h_i^*\) is total. By [24], the cone
is pointed. Moreover, if \((h_i)_{i\in J}\in X\) is relatively positive with respect to \(J_2\subset J\), then the set
is a pointed cone.
Proposition 12
Let \(x_0\in {\mathcal {F}}\). Assume that there exists \(J_2\subset J:= I(x_0)\) such that
- 1.:
-
\(X_1:=\overline{{\text {span}}}(Dg_i(x_0),i\in J)=\overline{{\text {span}}}(Dg_i(x_0),i\in J_2)\),
- 2.:
-
\((Dg_i(x_0))_{i\in J_2}\) is a basis for \(X_1\),
- 3.:
-
\((Dg_i(x_0))_{i\in J}\in X_1\) is relatively positive with respect to \(J_2\subset J\),
- 4.:
-
\((Dg_i(x_0))_{i\in J_2}\) is boundedly complete and shrinking sequence (i.e. \(X_1 \) is reflexive),
- 5.:
-
\(Dg_i^*(x_0)\in E\), \(i\in J_2\).
Then the Hurwicz set \({{\mathcal {M}}}(x_0,0)\) is weakly*-closed.
Proof
Let \(\langle \cdot , \cdot \rangle _E:\ E\times E^* \rightarrow \mathbb {R}\) be a duality mapping for the pair \(E,E^*\).
Let \((z_n)\) be a sequence in \( {{\mathcal {M}}}(x_0,0)\subset E^*\) weakly*-converging to some \(z_0\in E^*\). We want to show that \(z_0\in {{\mathcal {M}}}(x_0,0)\), i.e., there exists \(y_0\in {{\mathcal {N}}}_K(G(x_0))\) such that \(z_0=DG^*(x_0)(y_0)\).
There exist \(y_n \in {{\mathcal {N}}}_K(G(x_0))\subset F\), \(y_n=\sum _{i\in \mathbb {N}} b_i^{**}(y_n)b_i^* \), \(n\in \mathbb {N}\) such that \(z_n=DG^*(x_0)(y_n)\), \(n\in \mathbb {N}\).
By Corollary 2, \(y_n=\sum _{i\in I(x_0)}b_i^{**}(y_n)b_i^*\), where \(b_i^{**}(y_n)=0\), \(i\in I_1{\setminus } I(x_0)\), \(b_i^{**}(y_n)\ge 0\), \(i\in I(x_0)\), for any \(n\in \mathbb {N}\). Moreover, \(z_n=DG^*(x_0)(y_n)=DG^*(x_0)(\sum _{i\in I(x_0)}b_i^{**}(y_n)b_i^*)\), \(n\in \mathbb {N}\).
Let \(a_i^n:=b_i^{**}(y_n)\), \(i\in I_1\), \(n\in \mathbb {N}\). Since \((z_n)\) is a sequence in \( {{\mathcal {M}}}(x_0,0)\), by Corollary 3, \(z_n=\sum _{i\in I(x_0)} a_i^n Dg_i(x_0)\in X_1=\overline{{\text {span}}}( Dg_i(x_0), i\in I(x_0))\) and since \((z_n)\) is weakly*-converging to some \(z_0\in E^*\),
Since \((z_n)\) converges weakly* in reflexive \(X_1\), it converges strongly to \(z_0\in X_1\). Since \(\overline{{\text {span}}}( Dg_i(x_0),\ i\in J_2 )=X_1\) is closed, \(z_0=\sum _{i\in J_2} \tilde{\beta }_i Dg_i(x_0)\) for some \(\tilde{\beta }_i\in \mathbb {R}\), \(i\in J_2\).
Moreover, since \(Dg_i(x_0)\), \(i\in J=I(x_0)\) is relatively positive with respect to \(J_2\) we have that
for some \(\tilde{a}_i^n\), such that \(\tilde{a}_i^n\ge 0\), \(i\in J_2\), \(n\in \mathbb {N}\).
By assumption, \((Dg_k(x_0))_{k\in J_2}\) is a basis of \(X_1=\overline{{\text {span}}}( Dg_i(x_0),\ i\in I(x_0) )=\overline{{\text {span}}}( Dg_i(x_0),\ i\in J_2 )\), hence, by Proposition 1, there exists \(Dg_k^*(x_0)\in X_1^*\), \(k\in J_2\) such that
By assumption \((Dg_k^*(x_0))_{k\in J_2}\) is a basis for \(\overline{{\text {span}}}( Dg_i^*(x_0),\ i\in J_2 )\).
Let \(\tilde{x}=Dg_j^*(x_0)\in X_1^*\subset E^{**}\), \(j\in J_2\). By assumption, \(\tilde{x}\in E\). By Corollary 3, (45) and (46) with \(x=\tilde{x}\) we have
which means that
Since \(\tilde{a}_j^n\ge 0\), \(j\in J_2\), \(n\in \mathbb {N}\) we obtain that \(\tilde{\beta }_j\ge 0\), \(j\in J_2\). Therefore, \(z_0=\sum _{i\in J_2} \tilde{\beta }_i Dg_i(x_0)\), \(\tilde{\beta }_j\ge 0\), \(j\in J_2\). By Corollary 3, \(z_0\in {{\mathcal {M}}}(x_0,0)\). \(\square \)
Remark 19
Let us note that if E is reflexive, assumptions 4. and 5. of Proposition 12 are automatically satisfied.
Definition 14
([25, Definition 3]) We say that the system \({{\mathcal {F}}}\) given in (\(P_0\)) with \(I_0=\emptyset \) satisfies the Nonlinear Farkas-Minkowski Constraint Qualification (NFMCQ) at \(x_0\) if the set
is weak*-closed in the product space \(E^* \times \mathbb {R}.\)
Corollary 4
Let E be reflexive and assume that NFMCQ holds. Then
Proof
Let \(z \in {{\mathcal {M}}}(x_0,0)\) and \(I(x_0)=\{i_1,i_2,\dots \}\), \(I_n(x_0)=\{i_1,i_2,\dots ,i_n \}\), \(n\in \mathbb {N}\). Then
Assuming that NFMCQ hold and \(E^*\) is reflexive, we obtain
\(\square \)
Remark 20
Let E be a Banach space and F be a Hilbert space with basis \((b_i)\subset F\), \(i\in \mathbb {N}\). Let \(K=\{ y=\sum _{i\in \mathbb {N} } b_i^*(y)b_i\in F \mid b_i^*(y)\le 0, i\in I_1 \}\), \(I_1=\mathbb {N}\). Let \(G:\ E\rightarrow F\) be of class \(C^1\). Let \(x_0\in E\) and \(y_0:=G(x_0)\in K\). By Corollary 2, for any \(z\in E^*\),
The following example shows that NFMCQ may not hold, when \({{\mathcal {M}}}(x_0,0)\) is weak*-closed.
Example 6
Let \(E=\ell _2\), \(F=\ell _2\) with basis \(b_i=e_i\), \(i\in \mathbb {N}\). Let \(x_0=0\in E\) and consider
where \(g_i(x)=\frac{1}{i}x_i\), \(i\in \mathbb {N}\).
In this case \(I(x_0)=\mathbb {N}\), \(Dg_i(x_0)=(0,\dots ,0,\frac{1}{i},0,\dots )\). By Corollary 2, \(y=(\frac{1}{2},\frac{1}{2^2},\dots )\in {{\mathcal {N}}}_K(G(x_0))\). Moreover, by Corollary 3, \(z:=DG^*(x_0)(y)=(\frac{1}{2},\frac{1}{2\cdot 2^i},\frac{1}{3\cdot 2^3},\dots ) \in {{\mathcal {M}}}(x_0,0)\). We have \(z\notin {\text {cone }} (Dg_i(x_0), i\in I_1)\), and since E is reflexive, we conclude by Corollary 4 that NFMCQ does not hold in this case. However, the assumptions of Proposition 12 hold for \(J_2=I(x_0)=\mathbb {N}\), therefore \({{\mathcal {M}}}(x_0,0)\) is weak*-closed.
We close the paper by illustrating our results with the following graph (Fig. 1).
Notes
Example provided by professor Sergei Konyagin, Academician of RAS, by courtesy of profesor Nikolai Osmolovskii.
Here \(DG(x_0)^*\) is an adjoint operator to \(DG(x_0)\)
References
Abraham, R., Marsden, J.E., Ratiu, T.: Manifolds, Tensor Analysis, and Applications, Volume 75 of Applied Mathematical Sciences, second edition Springer, New York (1988)
Albiac, F., Kalton, N.J.: Topics in Banach space theory, Volume 233 of Graduate Texts in Mathematics. Springer, New York (2006)
Aliprantis, C.D., Border, K.C.: Infinite Dimensional Analysis, third Springer, Berlin (2006)
Andreani, R., Silva, P.: Constant rank constraint qualifications: a geometric introduction. Pesquisa Operacional 34, 481–494, 09 (2014)
Andreani, R., Echagüe, C.E., Schuverdt, M.L.: Constant-rank condition and second-order constraint qualification. J. Optim. Theory Appl. 146(2), 255–266 (2010)
Andreani, R., Behling, R., Haeser, G., Silva, P.J.S.: On second-order optimality conditions in nonlinear optimization. Optim. Methods Softw. 32(1), 22–38 (2017)
Bednarczuk, E.M., Leśniewski, K.W., Rutkowski, K.E.: On tangent cone to systems of inequalities and equations in Banach spaces under relaxed constant rank condition. ESAIM Control Optim. Calc. Var. 27, 22 (2021)
Blot, J.: The rank theorem in infinite dimension. Nonlinear Anal. 10(10), 1009–1020 (1986)
Blot, J.: Rank theorem in infinite dimension and Lagrange multipliers. Appl. Math. Optim. 81(3), 815–822 (2020)
Booss, B., Bleecker, D.D.: Topology and analysis. Universitext. Springer, New York, (1985). The Atiyah–Singer index formula and gauge-theoretic physics, Translated from the German by Bleecker and A. Mader
Börgens, E., Kanzow, C., Mehlitz, P., Wachsmuth, G.: New constraint qualifications for optimization problems in Banach spaces based on asymptotic KKT conditions. SIAM J. Optim. 30(4), 2956–2982 (2020)
Dmitruk, A.V., Milyutin, A.A., Osmolovskiĭ, N.P.: Ljusternik’s theorem and the theory of the extremum. Uspekhi Mat. Nauk 35(6(216)), 11–46, 215 (1980)
Dontchev, A.L., Frankowska, H.: Lyusternik–Graves theorem and fixed points II. J. Convex Anal. 19(4), 955–973 (2012)
Fabian, M., Habala, P., Petr, S., Vicente, M., Pelant, J., Zizler, V.: Functional Analysis and Infinite-Dimensional Geometry, Volume 8 of CMS Books in Mathematics/Ouvrages de Mathématiques de la SMC. Springer, New York (2001)
Fullerton, R.E.: Geometric properties of a basis in a Banach space. In: Proceedings of the International Congress of Mathematicians, volume II, Amsterdam, North Holland Publ (1954)
Fullerton, R.E.: Geometric structure of absolute basis systems in a linear topological space. Pac. J. Math. 12, 137–147 (1962)
Holub, J.R.: Hilbertian, Besselian, and semi-shrinking bases. Studia Math. 37, 203–211 (1970/71)
Hurwicz, L.: Programming in linear spaces [reprint of studies in linear and nonlinear programming, 38–102, Stanford Univ. Press, Stanford, CA, 1958]. In Traces and emergence of nonlinear programming, pp. 131–195. Birkhäuser/Springer Basel AG, Basel (2014)
Ioffe, A.D.: Metric regularity and subdifferential calculus. Uspekhi Mat. Nauk 55(3(333)), 103–162 (2000)
Ioffe, A.D., Tihomirov, V.M.: Theory of extremal problems, volume 6 of Studies in Mathematics and its Applications. North-Holland Publishing Co., Amsterdam (1979). Translated from the Russian by Karol Makowski
Janin, R.: Directional derivative of the marginal function in nonlinear programming. Math. Programming Stud., pp. 110–126 (1984). Sensitivity, stability and parametric analysis
Kurcyusz, S.: On the existence and nonexistence of Lagrange multipliers in Banach spaces. J. Optim. Theory Appl. 20(1), 81–110 (1976)
McArthur, C.W.: Developments in Schauder basis theory. Bull. Am. Math. Soc. 78, 877–908 (1972)
McArthur, C.W., Singer, I., Levin, M.: On the cones associated with biorthogonal systems and bases in Banach spaces. Can. J. Math. 21, 1206–1217 (1969)
Mordukhovich, B., Nghia, T.T.A.: Constraint qualifications and optimality conditions for nonconvex semi-infinite and infinite programs. Math. Program. 139(1–2, Ser. B), 271–300 (2013)
Singer, I.: Bases in Banach spaces. I. Die Grundlehren der mathematischen Wissenschaften, Band 154. Springer, New York (1970)
Wojtaszczyk, P.: Existence of some special bases in Banach spaces. Stud. Math. 47, 83–93 (1973)
Funding
The authors have not disclosed any funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bednarczuk, E.M., Leśniewski, K.W. & Rutkowski, K.E. Constraint Qualification with Schauder Basis for Infinite Programming Problems. Appl Math Optim 88, 66 (2023). https://doi.org/10.1007/s00245-023-10034-0
Accepted:
Published:
DOI: https://doi.org/10.1007/s00245-023-10034-0