1 Introduction

Modeling domain uncertainty is pertinent in many engineering applications, where the shape of the object may not be perfectly known. For example, one might think of manufacturing imperfections in the shape of products fabricated by line production, or shapes which stem from inverse problems such as tomography.

If the domain perturbations are small, one can apply the perturbation technique as in, e.g., [2, 13, 17]. In that approach, which is based on Eulerian coordinates, the shape derivative is used to linearize the problem under consideration around a nominal reference geometry. By Hadamard’s theorem, the resulting linearized equations for the first order shape sensitivities are homogeneous equations posed on the nominal geometry, with inhomogeneous boundary data only. By using a first order shape Taylor expansion, one can derive tensor deterministic partial differential equations (PDEs) for the statistical moments of the problem.

The approach we consider in this work is the domain mapping approach. It transfers the shape uncertainty onto a fixed reference domain and hence reflects the Lagrangian setting. In particular, it allows us to deal with large deformations; see [27, 31] for example. Analytic dependence of the solution on the random domain mapping was shown for the Poisson equation in [3, 15] and for linear elasticity in [14]. Related shape holomorphy for acoustic and electromagnetic scattering problems has been shown in [19, 21] and for Stokes and Navier–Stokes equations in [4]. Mixed spatial and stochastic regularity of the solutions to the Poisson equation on random domains, required for multilevel cubature methods, has recently been proved in [16].

In order to introduce the domain mapping approach, let \((\varOmega ,{\mathcal {F}},{\mathbb {P}})\) be a probability space, where \(\varOmega \) is the set of possible outcomes, the \(\sigma \)-algebra \({\mathcal {F}}\subset 2^{\varOmega }\) is the set of all events, and \({\mathbb {P}}\) is a probability measure. We are interested in uncertainty quantification for the Poisson problem

$$\begin{aligned} {\left\{ \begin{array}{ll} -\varDelta u({{\varvec{x}}},\omega )=f({{\varvec{x}}}),&{}{{\varvec{x}}}\in D(\omega ),\\ u({{\varvec{x}}},\omega )=0,&{}{{\varvec{x}}}\in \partial D(\omega ), \end{array}\right. }\quad \text {for }{\mathbb {P}}\text {-almost every}~\omega \in \varOmega , \end{aligned}$$
(1.1)

where the domain \(D(\omega )\subset {\mathbb {R}}^d\), \(d\in \{2,3\}\), is assumed to be uncertain. In the domain mapping framework, one starts with a reference domain \({D_{\textrm{ref}}}\subset {\mathbb {R}}^d\). For example, in a manufacturing application, \({D_{\textrm{ref}}}\) might be the planned domain provided by computer-aided design with no imperfections. It is assumed that a random perturbation field

$$\begin{aligned} {{\varvec{V}}}(\omega )\!:{\overline{{D_{\textrm{ref}}}}}\rightarrow \overline{D(\omega )} \end{aligned}$$

is prescribed. The Poisson problem (1.1) on the perturbed domain is then transformed onto the fixed reference domain \({D_{\textrm{ref}}}\) by a change of variable. This results in a PDE on the reference domain, equipped with a random coefficient and a random source term, which are correlated through the random deformation and involve the Jacobian of the transformation. Note that the dependence on the random perturbation field is nonlinear. Nonetheless, in the domain mapping approach, the reference domain \({D_{\textrm{ref}}}\) need only be Lipschitz, while the random perturbations \({{\varvec{V}}}(\omega )\) have to be \({\mathcal {C}}^2\)-smooth. This is in contrast to the perturbation approach mentioned above, where both the reference domain and the random perturbations need to be \({\mathcal {C}}^2\)-smooth.

The approach adopted in this paper is first to truncate the initially infinite number of scalar random variables to a finite (but possibly large) number; then to approximate the transformed and truncated PDE on the reference domain by a finite element method; and finally to approximate the expected value of the solution (which is an integral, possibly high dimensional) by a carefully designed quasi-Monte Carlo (QMC) method—i.e., an equal weight cubature rule.
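A rank-1 lattice rule is an equal weight cubature rule with nodes \(\{k{{\varvec{z}}}/n\}\), \(k=0,\ldots ,n-1\), for a generating vector \({{\varvec{z}}}\in {\mathbb {Z}}^s\), where the braces denote the componentwise fractional part. A minimal Python sketch (the generating vector and the smooth periodic toy integrand below are ad hoc choices for illustration, not the carefully designed rules developed later in this paper):

```python
import math

def lattice_rule(f, z, n):
    """Rank-1 lattice rule: equal-weight (QMC) cubature over the points
    {k*z/n}, k = 0, ..., n-1, with {.} the componentwise fractional part."""
    total = 0.0
    for k in range(n):
        y = [((k * zj) % n) / n for zj in z]
        total += f(y)
    return total / n

# Smooth 1-periodic toy integrand in s = 3 dimensions; its exact integral
# over [0,1]^3 is 1, since each factor integrates to 1.
f = lambda y: math.prod(1 + 0.5 * math.sin(2 * math.pi * yj) for yj in y)
z = [1, 182, 82]                  # ad hoc generating vector, not an optimized one
print(lattice_rule(f, z, 389))    # approximately 1.0
```

For smooth periodic integrands such as this one, a well-chosen generating vector yields errors far smaller than Monte Carlo with the same number of points; this is the higher order convergence exploited in the periodic setting below.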

In [3, 15], the perturbation field was represented by an affine, vector-valued Karhunen–Loève expansion, under the assumption of independent and identically distributed (i.i.d.) uniform random variables. In this work, we instead expand the perturbation field \({{\varvec{V}}}(\omega )\) using periodic random variables, following the recent work [23]. The advantage of the periodic setting is that it permits higher order convergence of the QMC approximation to the expectation of the solution to (1.1). The periodic representation is equivalent in law to an affine representation of the input random field, where the random variables entering the series are i.i.d. with the Chebyshev probability density \(\rho (z)=\frac{1}{\pi \sqrt{1-z^2}}\), \(z\in (-1,1)\). The density \(\rho \) is associated with Chebyshev polynomials of the first kind, which form a popular family of basis functions in the method of generalized polynomial chaos (gPC). The choice of density is a modeling assumption, and it might be argued that in many applications choosing the Chebyshev density over the uniform density is a matter of taste rather than conviction; see [22, Sections 1–2] for more discussion.

The paper is organized as follows. The problem is formulated in Sect. 2, after which Sect. 3 establishes the regularity of the solution with respect to the stochastic variables; a more difficult task than in most such applications because of the nonlinear nature of the transformation, but a prerequisite for the design of good QMC rules. Section 4 considers error analysis, estimating separately the errors from dimension truncation, finite element approximation and QMC cubature. In the latter case, the error analysis leads to the design of appropriate weight parameters for the function space, ones that allow rigorous error bounds for the QMC contribution to the error. Numerical experiments that test the theoretical analysis are presented in Sect. 5, and some conclusions are drawn in Sect. 6. Appendix A contains several technical results required for the parametric regularity analysis.

2 Problem formulation

2.1 Notations

Let \(U:={[0,1]^{\mathbb {N}}}\) denote the set of parameters. We fix a bounded domain \({D_{\textrm{ref}}}\subset {\mathbb {R}}^d\) with Lipschitz boundary, \(d\in \{2,3\}\), as the reference domain.

We use multi-index notation throughout this paper. The set of all finitely supported multi-indices is denoted by \({\mathscr {F}}:=\{{\varvec{\nu }}\in {\mathbb {N}}_0^{{\mathbb {N}}}:|\textrm{supp}({\varvec{\nu }})|<\infty \}\), where \(\textrm{supp}({\varvec{\nu }}):=\{j\in {\mathbb {N}}:\nu _j\ne 0\}\) is the support of a multi-index \({\varvec{\nu }}:=(\nu _1,\nu _2,\ldots )\). For any sequence \({{\varvec{b}}}=(b_j)_{j=1}^\infty \) of real numbers and any \({\varvec{\nu }}\in {\mathscr {F}}\), we define

$$\begin{aligned} {{\varvec{b}}}^{{\varvec{\nu }}}:=\prod _{j\in \textrm{supp}({\varvec{\nu }})}b_j^{\nu _j}, \end{aligned}$$

where the product over the empty set is defined as 1 by convention. We also define the multi-index notation

$$\begin{aligned} \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}}:=\prod _{j\in \textrm{supp}({\varvec{\nu }})}\frac{\partial ^{\nu _j}}{\partial y_j^{\nu _j}} \end{aligned}$$

for higher order partial derivatives with respect to variable \({{\varvec{y}}}\).
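As a concrete illustration of these conventions, finitely supported multi-indices can be stored as sparse dictionaries; the representation and helper name below are our own, purely for illustration:

```python
def b_pow_nu(b, nu):
    """Evaluate b**nu = prod_{j in supp(nu)} b_j^{nu_j}.

    b  : list of reals, with b[j-1] standing in for b_j
    nu : dict {j: nu_j} with keys j >= 1, finitely supported
    """
    prod = 1.0                        # the empty product is 1 by convention
    for j, nuj in nu.items():
        if nuj != 0:
            prod *= b[j - 1] ** nuj
    return prod

b = [0.5, 0.25, 0.125]
print(b_pow_nu(b, {1: 2, 3: 1}))      # 0.5**2 * 0.125 = 0.03125
print(b_pow_nu(b, {}))                # nu = 0 gives the empty product, 1.0
```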

For any matrix M, let \(\sigma (M)\) denote the set of all singular values of M and let \(\Vert M\Vert _2:={\max \sigma (M)}\) denote the matrix spectral norm. For a function v on \({D_{\textrm{ref}}}\) we define

$$\begin{aligned} \Vert v\Vert _{L^\infty ({D_{\textrm{ref}}})} :=\, {\left\{ \begin{array}{ll} \mathop {\text {ess sup}}\nolimits _{{{\varvec{x}}}\in {D_{\textrm{ref}}}} |v({{\varvec{x}}})| &{} \text{ if } v: {D_{\textrm{ref}}}\rightarrow {\mathbb {R}}, \\ \mathop {\text {ess sup}}\nolimits _{{{\varvec{x}}}\in {D_{\textrm{ref}}}} \Vert v({{\varvec{x}}})\Vert _2 &{} \text{ if } v: {D_{\textrm{ref}}}\rightarrow {\mathbb {R}}^d, \\ \mathop {\text {ess sup}}\nolimits _{{{\varvec{x}}}\in {D_{\textrm{ref}}}} \Vert v({{\varvec{x}}})\Vert _2 &{} \text{ if } v: {D_{\textrm{ref}}}\rightarrow {\mathbb {R}}^{d\times d}, \end{array}\right. } \end{aligned}$$

where we apply the vector 2-norm (Euclidean norm) or matrix 2-norm (spectral norm) depending on whether \(v({{\varvec{x}}})\) is a vector or a matrix. Similarly, we define

$$\begin{aligned} \Vert v\Vert _{W^{1,\infty }({D_{\textrm{ref}}})}:= {\left\{ \begin{array}{ll} \max \!\Big (\displaystyle \mathop {\text {ess sup}}\limits _{{{\varvec{x}}}\in {D_{\textrm{ref}}}} |v({{\varvec{x}}})|,\; \mathop {\text {ess sup}}\limits _{{{\varvec{x}}}\in {D_{\textrm{ref}}}} \Vert \nabla v({{\varvec{x}}})\Vert _2\Big ) &{} \text{ if } v\!: {D_{\textrm{ref}}}\rightarrow {\mathbb {R}}, \\ \max \!\Big (\displaystyle \mathop {\text {ess sup}}\limits _{{{\varvec{x}}}\in {D_{\textrm{ref}}}} \Vert v({{\varvec{x}}})\Vert _2,\; \mathop {\text {ess sup}}\limits _{{{\varvec{x}}}\in {D_{\textrm{ref}}}} \Vert v'({{\varvec{x}}})\Vert _2\Big ) &{} \text{ if } v\!: {D_{\textrm{ref}}}\rightarrow {\mathbb {R}}^d, \end{array}\right. } \end{aligned}$$

where \(\nabla v({{\varvec{x}}})\) is the gradient vector if \(v({{\varvec{x}}})\) is a scalar, and \(v'({{\varvec{x}}})\) is the Jacobian matrix if \(v({{\varvec{x}}})\) is a vector. We will also need the standard Sobolev norm

$$\begin{aligned} \Vert v\Vert _{H_0^1({D_{\textrm{ref}}})} :=\, \Vert \nabla v\Vert _{L^2({D_{\textrm{ref}}})} \,=\, \left( \int _{{D_{\textrm{ref}}}} \Vert \nabla v({{\varvec{x}}})\Vert _2^2\,\textrm{d}{{\varvec{x}}}\right) ^{1/2} \end{aligned}$$

for a scalar function \(v:{D_{\textrm{ref}}}\rightarrow {\mathbb {R}}\) in \(H_0^1({D_{\textrm{ref}}})\). We also define the norm

$$\begin{aligned} \Vert v\Vert _{{\mathcal {C}}^k({\overline{D}})}:=\max _{|{\varvec{\nu }}|\le k}\sup _{{{\varvec{x}}}\in {\overline{D}}}|\partial _{{\varvec{x}}}^{{\varvec{\nu }}}v({{\varvec{x}}})|\quad \text {for }v\in {\mathcal {C}}^k({\overline{D}}), \end{aligned}$$

where \(D\subset {\mathbb {R}}^d\) is a nonempty domain.

In this paper we will make use of Stirling numbers of the second kind defined by

$$\begin{aligned} S(n,m):= \frac{1}{m!} \sum _{j=0}^m (-1)^{m-j} \left( {\begin{array}{c}m\\ j\end{array}}\right) j^n \end{aligned}$$

for integers \(n\ge m\ge 0\), except for \(S(0,0):=1\).
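The explicit sum can be evaluated directly; in the following sketch, Python's convention \(0^0=1\) covers the case \(S(0,0)=1\):

```python
from math import comb, factorial

def stirling2(n, m):
    """Stirling number of the second kind via the explicit sum above."""
    return sum((-1) ** (m - j) * comb(m, j) * j ** n for j in range(m + 1)) // factorial(m)

# Classical values: S(n, m) counts partitions of an n-set into m nonempty blocks.
print(stirling2(4, 2))   # 7
print(stirling2(5, 3))   # 25
print(stirling2(0, 0))   # 1
```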

2.2 Parameterization of domain uncertainty

Let \({{\varvec{V}}}\!:{\overline{{D_{\textrm{ref}}}}}\times U\rightarrow {\mathbb {R}}^d\) be a vector field such that

$$\begin{aligned} {{\varvec{V}}}({{\varvec{x}}},{{\varvec{y}}})\,:=\, {{\varvec{x}}}+ \frac{1}{\sqrt{6}}\sum _{i=1}^\infty \sin (2\pi y_i)\,{\varvec{\psi }}_i({{\varvec{x}}}), \quad {{\varvec{x}}}\in {{D_{\textrm{ref}}}},~{{\varvec{y}}}\in U, \end{aligned}$$
(2.1)

with stochastic fluctuations \({\varvec{\psi }}_i: {D_{\textrm{ref}}}\rightarrow {\mathbb {R}}^d\). Denoting the Jacobian matrix of \({\varvec{\psi }}_i\) by \({\varvec{\psi }}_i'\), the Jacobian matrix \(J: {D_{\textrm{ref}}}\rightarrow {\mathbb {R}}^{d\times d}\) of vector field \({{\varvec{V}}}\) is

$$\begin{aligned} J({{\varvec{x}}},{{\varvec{y}}}) \,:=\, I+\frac{1}{\sqrt{6}}\sum _{i=1}^\infty \sin (2\pi y_i)\,{\varvec{\psi }}_i'({{\varvec{x}}}),\quad {{\varvec{x}}}\in {D_{\textrm{ref}}},~{{\varvec{y}}}\in U. \end{aligned}$$

We assume that the family of admissible domains \(\{D({{\varvec{y}}})\}_{{{\varvec{y}}}\in U}\) is parameterized by

$$\begin{aligned} D({{\varvec{y}}}):={{\varvec{V}}}({D_{\textrm{ref}}},{{\varvec{y}}}),\quad {{\varvec{y}}}\in U, \end{aligned}$$

and define the hold-all domain by setting

$$\begin{aligned} {\mathcal {D}}:=\bigcup _{{{\varvec{y}}}\in U}D({{\varvec{y}}}). \end{aligned}$$

The convention of explicitly writing down the factor \(1/\sqrt{6}\) in (2.1) is not a crucial part of the analysis, but it ensures that the mean and covariance of the vector field \({{\varvec{V}}}\) match those of an affine and uniform parameterization for representing domain uncertainty using the same sequence of stochastic fluctuations \(({\varvec{\psi }}_i)_{i=1}^\infty \), with the uniform random variables supported on \([-1/2,1/2]\). See also the discussion in [23].
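This moment matching is easy to verify numerically: for \(y\) uniformly distributed on \([0,1]\), the coefficient \(\sin (2\pi y)/\sqrt{6}\) has mean \(0\) and variance \(\frac{1}{6}\int _0^1\sin ^2(2\pi y)\,\textrm{d}y=\frac{1}{12}\), precisely the moments of a uniform random variable on \([-1/2,1/2]\). A small midpoint-rule check:

```python
import math

# Midpoint rule on [0,1] for the first two moments of z(y) = sin(2*pi*y)/sqrt(6).
n = 100_000
zs = [math.sin(2 * math.pi * (k + 0.5) / n) / math.sqrt(6) for k in range(n)]
mean = sum(zs) / n
var = sum(z * z for z in zs) / n

print(mean)   # ~0, the mean of a uniform variable on [-1/2, 1/2]
print(var)    # ~1/12 = 0.0833..., the variance of a uniform variable on [-1/2, 1/2]
```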

In order to ensure that the aforementioned domain parameterization is well-posed and to enable our subsequent analysis of dimension truncation and finite element errors, we state the following assumptions for later use:

  1. (A1)

    For each \({{\varvec{y}}}\in U\), \({{\varvec{V}}}(\cdot ,{{\varvec{y}}})\!:{\overline{{D_{\textrm{ref}}}}}\rightarrow {\mathbb {R}}^d\) is an invertible, twice continuously differentiable vector field.

  2. (A2)

    For some \(C>0\), there holds

    $$\begin{aligned}\Vert {{\varvec{V}}}(\cdot ,{{\varvec{y}}})\Vert _{{\mathcal {C}}^2({\overline{{D_{\textrm{ref}}}}})}\le C\quad \text {and}\quad \Vert {{\varvec{V}}}^{-1}(\cdot ,{{\varvec{y}}})\Vert _{{\mathcal {C}}^2(\overline{D({{\varvec{y}}})})}\le C\end{aligned}$$

    for all \({{\varvec{y}}}\in U\).

  3. (A3)

    There exist constants \(0< \sigma _{\min } \le 1 \le \sigma _{\max }<\infty \) such that

    $$\begin{aligned} \sigma _{\min }\le \min \sigma (J({{\varvec{x}}},{{\varvec{y}}})) \le \max \sigma (J({{\varvec{x}}},{{\varvec{y}}})) \le \sigma _{\max }~\text {for all}~{{\varvec{x}}}\in {D_{\textrm{ref}}},~{{\varvec{y}}}\in U, \end{aligned}$$

    where \(\sigma (J({{\varvec{x}}},{{\varvec{y}}}))\) denotes the set of all singular values of the Jacobian matrix.

  4. (A4)

    There holds \(\Vert {\varvec{\psi }}_i\Vert _{W^{1,\infty }({D_{\textrm{ref}}})}<\infty \) for all \(i\in {\mathbb {N}}\), and

    $$\begin{aligned} \sum _{i=1}^\infty \Vert {\varvec{\psi }}_i\Vert _{W^{1,\infty }({D_{\textrm{ref}}})}<\infty . \end{aligned}$$
  5. (A5)

    For some \(p\in (0,1)\), there holds

    $$\begin{aligned} \sum _{i=1}^\infty \Vert {\varvec{\psi }}_i\Vert _{W^{1,\infty }(D_{\textrm{ref}})}^p<\infty . \end{aligned}$$
  6. (A6)

    \(\Vert {\varvec{\psi }}_1\Vert _{W^{1,\infty }({D_{\textrm{ref}}})}\ge \Vert {\varvec{\psi }}_2\Vert _{W^{1,\infty }({D_{\textrm{ref}}})}\ge \cdots \).

  7. (A7)

    The reference domain \(D_{\textrm{ref}}\subset {\mathbb {R}}^d\) is a convex, bounded polyhedron.

For later convenience we define the sequence \({{\varvec{b}}}=(b_i)_{i=1}^\infty \) and the constant \(\xi _{{{\varvec{b}}}} \ge 0\) by setting

$$\begin{aligned} b_i:=\frac{1}{\sqrt{6}}\Vert {\varvec{\psi }}_i\Vert _{W^{1,\infty }({D_{\textrm{ref}}})}\quad \text {and}\quad \xi _{{{\varvec{b}}}} :=\sum _{i=1}^\infty b_i<\infty . \end{aligned}$$
(2.2)

It follows that we can take \(\sigma _{\max }:= 1 + \xi _{{{\varvec{b}}}}\).
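The value \(\sigma _{\max }=1+\xi _{{{\varvec{b}}}}\) comes from the triangle inequality \(\Vert J({{\varvec{x}}},{{\varvec{y}}})\Vert _2\le 1+\sum _{i\ge 1}b_i\,|\sin (2\pi y_i)|\le 1+\xi _{{{\varvec{b}}}}\). A toy numerical check in \(d=2\) with constant (\({{\varvec{x}}}\)-independent) fluctuation Jacobians, in which case the Jacobian part of \(b_i\) reduces to \(\Vert {\varvec{\psi }}_i'\Vert _2/\sqrt{6}\); the matrices below are our own invented example:

```python
import math, random

def specnorm2x2(M):
    """Largest singular value of a 2x2 matrix via the closed form
    sigma_max^2 = (s + sqrt(s^2 - 4*det^2)) / 2, s = sum of squared entries."""
    (a, b), (c, d) = M
    s = a * a + b * b + c * c + d * d
    det = a * d - b * c
    return math.sqrt((s + math.sqrt(max(s * s - 4 * det * det, 0.0))) / 2)

# Two constant fluctuation Jacobians psi_i' (invented for illustration).
psi_jac = [[[0.3, 0.1], [0.0, 0.2]], [[0.1, 0.0], [0.05, 0.1]]]
xi_b = sum(specnorm2x2(P) for P in psi_jac) / math.sqrt(6)

random.seed(0)
worst = 0.0
for _ in range(2000):
    y = [random.random() for _ in psi_jac]
    J = [[1.0, 0.0], [0.0, 1.0]]        # J = I + sum_i sin(2*pi*y_i)/sqrt(6) * psi_i'
    for yi, P in zip(y, psi_jac):
        w = math.sin(2 * math.pi * yi) / math.sqrt(6)
        for r in range(2):
            for c in range(2):
                J[r][c] += w * P[r][c]
    worst = max(worst, specnorm2x2(J))

print(worst, "<=", 1 + xi_b)   # the largest observed singular value never exceeds 1 + xi_b
```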

Remark. Under assumption (A3), there holds

$$\begin{aligned} \det J({{\varvec{x}}},{{\varvec{y}}})>0\quad \text {for all}~{{\varvec{x}}}\in {\overline{{D_{\textrm{ref}}}}},~{{\varvec{y}}}\in U. \end{aligned}$$
(2.3)

This follows since, by (A3), the singular values of \(J\) are bounded away from zero, so \(\det J\) never vanishes on \({\overline{{D_{\textrm{ref}}}}}\times U\); by continuity of the determinant and \(\det J({{\varvec{x}}},{\textbf{0}})=1\), the determinant cannot change sign and must remain positive.

2.3 The variational formulation on the reference domain

The variational formulation of the model problem (1.1) can be stated as follows: for \({{\varvec{y}}}\in U\), find \(u(\cdot ,{{\varvec{y}}})\in H_0^1(D({{\varvec{y}}}))\) such that

$$\begin{aligned} \int _{D({{\varvec{y}}})}\nabla u({{\varvec{x}}},{{\varvec{y}}})\cdot \nabla v({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}}\,=\, \int _{D({{\varvec{y}}})}f({{\varvec{x}}})\,v({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}}\quad \text {for all }~v\in H_0^1(D({{\varvec{y}}})), \end{aligned}$$
(2.4)

where \(f\in {\mathcal {C}}^\infty ({\mathcal {D}})\) is assumed to be an analytic function.

We can transport the variational formulation (2.4) to the reference domain by a change of variable. Let us first define the matrix-valued function

$$\begin{aligned} B({{\varvec{x}}},{{\varvec{y}}}) \,:=\, J({{\varvec{x}}},{{\varvec{y}}})^\top J({{\varvec{x}}},{{\varvec{y}}}),\quad {{\varvec{x}}}\in D_{\textrm{ref}},~{{\varvec{y}}}\in U, \end{aligned}$$
(2.5)

the transport coefficient

$$\begin{aligned} A({{\varvec{x}}},{{\varvec{y}}}) \,:=\, B^{-1}({{\varvec{x}}},{{\varvec{y}}})\,\det J({{\varvec{x}}},{{\varvec{y}}}),\quad {{\varvec{x}}}\in {D_{\textrm{ref}}},~{{\varvec{y}}}\in U, \end{aligned}$$
(2.6)

and the transported source term

$$\begin{aligned} f_{\textrm{ref}}({{\varvec{x}}},{{\varvec{y}}}) := {\widehat{f}}({{\varvec{x}}},{{\varvec{y}}})\,\det J({{\varvec{x}}},{{\varvec{y}}}),\quad {\widehat{f}}({{\varvec{x}}},{{\varvec{y}}}) := f\big ({{\varvec{V}}}({{\varvec{x}}},{{\varvec{y}}})\big ), \quad {{\varvec{x}}}\in {D_{\textrm{ref}}},~{{\varvec{y}}}\in U. \end{aligned}$$

Then we can recast the problem (2.4) on the reference domain as follows: for \({{\varvec{y}}}\in U\), find \({\widehat{u}}(\cdot ,{{\varvec{y}}})\in H_0^1({D_{\textrm{ref}}})\) such that

$$\begin{aligned} \int _{{D_{\textrm{ref}}}} \big (A({{\varvec{x}}},{{\varvec{y}}})\nabla {\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\big ) \cdot \nabla {\widehat{v}}({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}}\,=\, \int _{{D_{\textrm{ref}}}}f_{\textrm{ref}}({{\varvec{x}}},{{\varvec{y}}})\,{\widehat{v}}({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}} \end{aligned}$$
(2.7)

for all \({\widehat{v}}\in H_0^1({D_{\textrm{ref}}})\).

The solutions to problems (2.4) and (2.7) are connected to one another by

$$\begin{aligned} u(\cdot ,{{\varvec{y}}}) \,=\, {\widehat{u}}\big ({{\varvec{V}}}^{-1}(\cdot ,{{\varvec{y}}}),{{\varvec{y}}}\big ) \quad \iff \quad {\widehat{u}}(\cdot ,{{\varvec{y}}}) \,=\, u\big ({{\varvec{V}}}(\cdot ,{{\varvec{y}}}),{{\varvec{y}}}\big ),\quad {{\varvec{y}}}\in U. \end{aligned}$$

In the sequel, we focus on analyzing the problem (2.7).
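In one spatial dimension the ingredients above simplify considerably: \(B=J^2\), so the transport coefficient (2.6) reduces to \(A=1/J\), and the equivalence of (2.4) and (2.7) is just the change of variables formula. A toy numerical check (our own example, with a fixed, \({{\varvec{y}}}\)-independent map \(V\)):

```python
import math

# D_ref = (0,1); V(xh) = xh + 0.2*xh*(1-xh) maps D_ref monotonically onto (0,1),
# with Jacobian J = V' in [0.8, 1.2]; in 1D the transport coefficient is A = 1/J.
V  = lambda xh: xh + 0.2 * xh * (1 - xh)
J  = lambda xh: 1 + 0.2 * (1 - 2 * xh)
du = lambda x: math.pi * math.cos(math.pi * x)   # derivative of u(x) = sin(pi*x)
dv = lambda x: 1 - 2 * x                         # derivative of v(x) = x*(1-x)

n = 100_000
mid = [(k + 0.5) / n for k in range(n)]

# Bilinear form of (2.4), integrated over the physical domain.
lhs = sum(du(x) * dv(x) for x in mid) / n
# Bilinear form of (2.7) on D_ref: by the chain rule u_hat' = u'(V) * J, so the
# integrand A * u_hat' * v_hat' equals (1/J) * (u'(V) * J) * (v'(V) * J).
rhs = sum((1 / J(xh)) * (du(V(xh)) * J(xh)) * (dv(V(xh)) * J(xh)) for xh in mid) / n

print(lhs, rhs)   # the two quadrature values agree up to discretization error
```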

3 Parametric regularity of the solution

In order to develop higher order rank-1 lattice cubature rules for the purpose of integrating the solution \({{\varvec{y}}}\mapsto {\widehat{u}}(\cdot ,{{\varvec{y}}})\) of (2.7) with respect to the parametric variable, we need to derive bounds on the partial derivatives \(\partial _{{\varvec{y}}}^{{\varvec{\nu }}}{\widehat{u}}(\cdot ,{{\varvec{y}}})\) in the Sobolev norm \(\Vert \cdot \Vert _{H_0^1({D_{\textrm{ref}}})}\).

The analysis presented in this section is based on [15] and proceeds as follows:

  • We derive bounds for \(\partial _{{\varvec{y}}}^{\varvec{\nu }}B^{-1}({{\varvec{x}}},{{\varvec{y}}})\) and \(\partial _{{\varvec{y}}}^{\varvec{\nu }}\det J({{\varvec{x}}},{{\varvec{y}}})\) for all \({\varvec{\nu }}\in {\mathscr {F}}\) in Lemmata 3.1 and 3.2. These results are then used to derive a bound on the partial derivatives of the coefficient \(\partial _{{\varvec{y}}}^{\varvec{\nu }}A({{\varvec{x}}},{{\varvec{y}}})\) in Lemma 3.3.

  • Since the right-hand side of (2.7) depends on the parametric variable \({{\varvec{y}}}\in U\), we develop the derivative bounds on \(\partial _{{\varvec{y}}}^{{\varvec{\nu }}}f_{\textrm{ref}}({{\varvec{x}}},{{\varvec{y}}})\) in Lemma 3.5, with the aid of the derivative bound developed for \(\partial _{{\varvec{y}}}^{\varvec{\nu }}{\widehat{f}}({{\varvec{x}}},{{\varvec{y}}})\) in Lemma 3.4.

  • An a priori bound is developed for the solution \({\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\) of (2.7) in Lemma 3.6.

  • Finally, the main result of this section is stated in Theorem 3.7, which contains the partial derivative bound for \(\partial _{{\varvec{y}}}^{\varvec{\nu }}{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\), \({\varvec{\nu }}\in {\mathscr {F}}\), making use of the aforementioned results.

3.1 Parametric regularity of the transport coefficient

Lemma 3.1

Let \({{\varvec{y}}}\in U\) and \({\varvec{\nu }}\in {\mathscr {F}}\). Under assumptions (A1)– (A4), the matrix-valued function B defined by (2.5) satisfies

$$\begin{aligned}&\big \Vert \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}} B^{-1}(\cdot ,{{\varvec{y}}}) \big \Vert _{L^\infty ({D_{\textrm{ref}}})}\\&\quad \le \, \frac{1}{\sigma _{\min }^2}\, \bigg (\frac{4\pi (1+\xi _{{{\varvec{b}}}})}{\sigma _{\min }^2}\bigg )^{|{\varvec{\nu }}|} \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} |{{\varvec{m}}}|!\,a_{|{{\varvec{m}}}|}\, {{\varvec{m}}}!\,{{\varvec{b}}}^{{{\varvec{m}}}}\prod _{i\ge 1}S(\nu _i,m_i), \end{aligned}$$

with the sequence

$$\begin{aligned} a_k \,:=\, \frac{(1+\sqrt{3})^{k+1}-(1-\sqrt{3})^{k+1}}{2^{k+1}\sqrt{3}}, \qquad k\ge 0. \end{aligned}$$
(3.1)

Proof

We will first derive a regularity bound for \(\Vert \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}}B(\cdot ,{{\varvec{y}}})\Vert _{L^\infty (D_{\textrm{ref}})}\), \({\varvec{\nu }}\in {\mathscr {F}}\). The bound for \(\Vert \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}}B^{-1}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty (D_{\textrm{ref}})}\) then follows by differentiating the identity \(B^{-1}B=I\).

From the definition of J it is easy to see that for any multi-index \({{\varvec{m}}}\in {\mathscr {F}}\) and any \({{\varvec{x}}}\in {D_{\textrm{ref}}}\) and \({{\varvec{y}}}\in U\) we have

$$\begin{aligned} \partial ^{{\varvec{m}}}_{{\varvec{y}}}J({{\varvec{x}}},{{\varvec{y}}}) \,=\, {\left\{ \begin{array}{ll} I + \frac{1}{\sqrt{6}}\sum _{i\ge 1}\sin (2\pi y_i)\,{\varvec{\psi }}_i'({{\varvec{x}}}) &{} \text{ if } {{\varvec{m}}}= {{\varvec{0}}}, \\ \frac{1}{\sqrt{6}}\,(2\pi )^k\,\sin (2\pi y_j + k\, \frac{\pi }{2})\,{\varvec{\psi }}_j'({{\varvec{x}}}) &{} \text{ if } {{\varvec{m}}}= k\,{{\varvec{e}}}_j,\; k\ge 1,\\ 0 &{} \text{ otherwise. } \end{array}\right. } \end{aligned}$$
(3.2)

Taking the matrix 2-norm, we obtain

$$\begin{aligned} \Vert \partial ^{{\varvec{m}}}_{{\varvec{y}}}J({{\varvec{x}}},{{\varvec{y}}})\Vert _2 \,\le \, {\left\{ \begin{array}{ll} 1 + \frac{1}{\sqrt{6}} \sum _{i\ge 1} \Vert {\varvec{\psi }}_i'({{\varvec{x}}})\Vert _2&{} \text{ if } {{\varvec{m}}}= {{\varvec{0}}}, \\ \frac{1}{\sqrt{6}}\,(2\pi )^k\,\Vert {\varvec{\psi }}_j'({{\varvec{x}}})\Vert _2 &{} \text{ if } {{\varvec{m}}}= k\,{{\varvec{e}}}_j,\; k\ge 1,\\ 0 &{} \text{ otherwise, } \end{array}\right. } \end{aligned}$$

and hence with (2.2) we obtain

$$\begin{aligned} \Vert \partial ^{{\varvec{m}}}_{{\varvec{y}}}J(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})} \,\le \, {\left\{ \begin{array}{ll} 1 + \xi _{{{\varvec{b}}}} &{} \text{ if } {{\varvec{m}}}= {{\varvec{0}}}, \\ (2\pi )^k\,b_j &{} \text{ if } {{\varvec{m}}}= k\,{{\varvec{e}}}_j,\; k\ge 1,\\ 0 &{} \text{ otherwise. } \end{array}\right. } \end{aligned}$$
(3.3)
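The bound (3.3) rests on the phase-shift identity \(\frac{\textrm{d}^k}{\textrm{d}y^k}\sin (2\pi y)=(2\pi )^k\sin (2\pi y+k\frac{\pi }{2})\), whose absolute value is at most \((2\pi )^k\). A quick finite-difference check of the identity:

```python
import math

# f(y, k) encodes the claimed k-th derivative of sin(2*pi*y).
f = lambda y, k: (2 * math.pi) ** k * math.sin(2 * math.pi * y + k * math.pi / 2)

h = 1e-6
max_err = 0.0
for k in range(4):
    for y in (0.0, 0.1, 0.37, 0.5, 0.81):
        fd = (f(y + h, k) - f(y - h, k)) / (2 * h)     # numerical derivative of f(., k)
        max_err = max(max_err, abs(fd - f(y, k + 1)) / (2 * math.pi) ** (k + 1))
print(max_err)   # tiny relative error: f(., k+1) is indeed the derivative of f(., k)
```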

Now the Leibniz product rule yields for \({\varvec{\nu }}\in {\mathscr {F}}\),

$$\begin{aligned} \partial ^{\varvec{\nu }}_{{\varvec{y}}}B(\cdot ,{{\varvec{y}}}) \,=\, \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \big (\partial ^{{\varvec{m}}}_{{\varvec{y}}}J({{\varvec{x}}},{{\varvec{y}}})^\top \big ) \big (\partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}J({{\varvec{x}}},{{\varvec{y}}})\big ), \end{aligned}$$

and after taking the matrix 2-norm on both sides and then the \(L^\infty \)-norm over \({{\varvec{x}}}\) we obtain

$$\begin{aligned} \big \Vert \partial ^{\varvec{\nu }}_{{\varvec{y}}}B(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})} \le \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \big \Vert \partial ^{{\varvec{m}}}_{{\varvec{y}}}J(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})} \big \Vert \partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}J(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})}. \end{aligned}$$

The bounds in (3.3) indicate that only the derivatives with \(|\textrm{supp}({\varvec{\nu }})|\le 2\) will survive. Indeed, for \({\varvec{\nu }}= k\,{{\varvec{e}}}_j\) with \(k\ge 1\), only \({{\varvec{m}}}= \ell \,{{\varvec{e}}}_j\) for \(\ell =0,\ldots , k\) remain, while for \({\varvec{\nu }}= k\,{{\varvec{e}}}_j + k'{{\varvec{e}}}_{j'}\) with \(k,k'\ge 1\) and \(j\ne j'\), only \({{\varvec{m}}}= k\,{{\varvec{e}}}_j\) and \({{\varvec{m}}}= k'{{\varvec{e}}}_{j'}\) remain. On the other hand, for \({\varvec{\nu }}\) with \(|\textrm{supp}({\varvec{\nu }})|\ge 3\), in each term at least one of the factors must vanish. We conclude that

$$\begin{aligned}&\big \Vert \partial ^{\varvec{\nu }}_{{\varvec{y}}}B(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})} \nonumber \\&\quad \,\le \, {\left\{ \begin{array}{ll} (1 + \xi _{{{\varvec{b}}}} )^2 &{} \text{ if } {\varvec{\nu }}= {{\varvec{0}}}, \\ 2\,(1 + \xi _{{{\varvec{b}}}} )\, (2\pi )^k\,b_j + (2^k-2)\, (2\pi )^k\,b_j^2 &{} \text{ if } {\varvec{\nu }}= k\,{{\varvec{e}}}_j,\; k\ge 1,\\ 2\,(2\pi )^{k+k'}\,b_j\,b_{j'} &{} \text{ if } {\varvec{\nu }}=k\,{{\varvec{e}}}_j\!+\!k'{{\varvec{e}}}_{j'},\,\! k,k'\!\ge \!1,\, j\!\ne \!j'\\ 0 &{} \text{ if } |\textrm{supp}({\varvec{\nu }})|\ge 3\text{, } \end{array}\right. } \nonumber \\&\quad \,\le \, {\left\{ \begin{array}{ll} (1+\xi _{{{\varvec{b}}}} )^2&{}\text {if}~{\varvec{\nu }}={{\varvec{0}}},\\ (4\pi )^{|{\varvec{\nu }}|}(1+\xi _{{{\varvec{b}}}} )\prod _{j\in \textrm{supp}({\varvec{\nu }})}b_j&{}\text {if}~|\textrm{supp}({\varvec{\nu }})|\in \{1,2\},\\ 0 &{}\text {if}~|\textrm{supp}({\varvec{\nu }})|\ge 3, \end{array}\right. } \end{aligned}$$
(3.4)

where we used \(b_j\le \xi _{{{\varvec{b}}}} \) for all \(j\in {\mathbb {N}}\).

Our assumption (A3) immediately yields for all \({{\varvec{y}}}\in U\) that

$$\begin{aligned} \Vert B^{-1}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})} \,=\,\mathop {\text {ess sup}}\limits _{{{\varvec{x}}}\in {D_{\textrm{ref}}}} \Big (\max \,\sigma \big (J^{-1}({{\varvec{x}}},{{\varvec{y}}})\big ) \Big )^2 \,\le \, \frac{1}{\sigma _{\min }^2}, \end{aligned}$$

which proves the lemma for the case \({\varvec{\nu }}= {{\varvec{0}}}\). Now we consider \({\varvec{\nu }}\in {\mathscr {F}}\setminus \{{{\varvec{0}}}\}\). To obtain the derivative bounds on \(B^{-1}\) we start with \(B^{-1}B = I\) and apply the Leibniz product rule to obtain

$$\begin{aligned} \partial ^{\varvec{\nu }}_{{\varvec{y}}}\big (B^{-1}({{\varvec{x}}},{{\varvec{y}}}) B({{\varvec{x}}},{{\varvec{y}}})\big ) \,=\, \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \big (\partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}B^{-1}({{\varvec{x}}},{{\varvec{y}}})\big ) \big (\partial ^{{\varvec{m}}}_{{\varvec{y}}}B({{\varvec{x}}},{{\varvec{y}}})\big ) \,=\, {0}. \end{aligned}$$

Separating out the \({{\varvec{m}}}= {{\varvec{0}}}\) term, post-multiplying both sides by \(B^{-1}({{\varvec{x}}},{{\varvec{y}}})\), taking the matrix 2-norm followed by the \(L^\infty \)-norm over \({{\varvec{x}}}\), and applying (3.4), we obtain

$$\begin{aligned}&\big \Vert \partial ^{\varvec{\nu }}_{{\varvec{y}}}B^{-1}(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})}\\&\quad \le \! \Vert B^{-1}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty \!(D_{\textrm{ref}})}\!\!\!\! \sum _{{{\varvec{0}}}\ne {{\varvec{m}}}\le {\varvec{\nu }}} \!\!\!\left( {\begin{array}{c}\!{\varvec{\nu }}\!\\ \!{{\varvec{m}}}\!\end{array}}\right) \! \big \Vert \partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}B^{-1}(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty \!({D_{\textrm{ref}}})} \big \Vert \partial ^{{\varvec{m}}}_{{\varvec{y}}}B(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty \!({D_{\textrm{ref}}})} \\&\quad \le \frac{1+\xi _{{{\varvec{b}}}} }{\sigma _{\min }^2} \sum _{\begin{array}{c} {{\varvec{0}}}\ne {{\varvec{m}}}\le {\varvec{\nu }}\\ |\textrm{supp}({{\varvec{m}}})|\le 2 \end{array}} (4\pi )^{|{{\varvec{m}}}|}\left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \, \big \Vert \partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}B^{-1}(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})} \prod _{j\in \textrm{supp}({{\varvec{m}}})} b_j. \end{aligned}$$

The assertion follows by applying Lemmata A.1 and A.2 together with \(\varUpsilon _{{\varvec{\nu }}}=\big \Vert \partial ^{{\varvec{\nu }}}_{{\varvec{y}}}B^{-1}(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})}\), \(c_0 = 1/\sigma ^2_{\min }\), \(c = 4\pi (1+\xi _{{{\varvec{b}}}})/\sigma _{\min }^2\), \(q = 2\), and \(\beta _j=b_j\). \(\square \)
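For computational purposes we note (our own observation, easily checked) that the closed form (3.1) solves the two-term recurrence \(a_k=a_{k-1}+\frac{1}{2}a_{k-2}\) with \(a_0=a_1=1\), since \((1\pm \sqrt{3})/2\) are the roots of \(x^2=x+\frac{1}{2}\):

```python
import math

def a_closed(k):
    """The sequence a_k from (3.1), in closed form."""
    r = math.sqrt(3)
    return ((1 + r) ** (k + 1) - (1 - r) ** (k + 1)) / (2 ** (k + 1) * r)

def a_recur(n):
    """Same sequence via a_k = a_{k-1} + a_{k-2}/2, a_0 = a_1 = 1."""
    a = [1.0, 1.0]
    for _ in range(2, n + 1):
        a.append(a[-1] + a[-2] / 2)
    return a[: n + 1]

print([a_closed(k) for k in range(6)])
print(a_recur(5))   # both give 1, 1, 1.5, 2.0, 2.75, 3.75
```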

Lemma 3.2

Under assumptions (A1)– (A4), there holds for all \({{\varvec{y}}}\in U\) and \({\varvec{\nu }}\in {\mathscr {F}}\) that

$$\begin{aligned} \Vert \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}}\det J(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})} \,\le \, d!\,(2+\xi _{{{\varvec{b}}}} )^d\,(2\pi )^{|{\varvec{\nu }}|}\, \sum _{{{\varvec{w}}}\le {\varvec{\nu }}}|{{\varvec{w}}}|!\,{{\varvec{b}}}^{{{\varvec{w}}}}\prod _{i\ge 1}S(\nu _i,w_i). \end{aligned}$$

Proof

Let \({{\varvec{y}}}\in U\). The proof is carried out by induction over all the minors (subdeterminants) of the matrix \({J}({{\varvec{x}}},{{\varvec{y}}})=:[J_{\ell ,\ell '}({{\varvec{x}}},{{\varvec{y}}})]_{\ell ,\ell '=1}^d\), \({{\varvec{x}}}\in {D_{\textrm{ref}}}\). For any \(q\le d\), let \(J^{{\varvec{\ell }},{\varvec{\ell }}'}({{\varvec{x}}},{{\varvec{y}}}):= [J_{\ell _t,\ell '_{t'}}({{\varvec{x}}},{{\varvec{y}}})]_{t,t'=1}^q\) denote the \(q\times q\) submatrix specified by the indices \({\varvec{\ell }}:= \{\ell _1,\ldots ,\ell _q\}\) and \({\varvec{\ell }}':=\{\ell '_1,\ldots ,\ell '_q\}\) with \(1\le \ell _1<\cdots <\ell _q\le d\) and \(1\le \ell '_1<\cdots <\ell '_q\le d\). We will prove by induction on q that

$$\begin{aligned} \Vert \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}} \det J^{{\varvec{\ell }},{\varvec{\ell }}'}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})} \,\le \, q!\,(2+\xi _{{{\varvec{b}}}} )^q\,(2\pi )^{|{\varvec{\nu }}|}\,\sum _{{{\varvec{w}}}\le {\varvec{\nu }}}|{{\varvec{w}}}|!\,{{\varvec{b}}}^{{{\varvec{w}}}}\prod _{i\ge 1}S(\nu _i,w_i). \end{aligned}$$
(3.5)

Similarly to (3.2), we have for the derivatives of matrix elements

$$\begin{aligned} \partial ^{{\varvec{m}}}_{{\varvec{y}}}J_{\ell _t,\ell '_{t'}}({{\varvec{x}}},{{\varvec{y}}}) \,=\, {\left\{ \begin{array}{ll} 1 + \frac{1}{\sqrt{6}}\sum _{i\ge 1} \sin (2\pi y_i)\, [{\varvec{\psi }}_i'({{\varvec{x}}})]_{\ell _t,\ell '_{t'}} &{} \text{ if } {{\varvec{m}}}= {{\varvec{0}}}, \\ \frac{1}{\sqrt{6}}(2\pi )^k\sin (2\pi y_j + k \frac{\pi }{2})[{\varvec{\psi }}_j'({{\varvec{x}}})]_{\ell _t,\ell '_{t'}} &{} \text{ if } {{\varvec{m}}}= k\,{{\varvec{e}}}_j,\, k\ge 1,\\ 0 &{} \text{ otherwise, } \end{array}\right. } \end{aligned}$$
(3.6)

for \(t,t'\in \{1,\ldots ,q\}\). Using \(\big | [{\varvec{\psi }}_i'({{\varvec{x}}})]_{\ell _t,\ell '_{t'}}\big | \le \Vert {\varvec{\psi }}_i'({{\varvec{x}}})\Vert _2\) and (2.2), we obtain for the \((1\times 1)\)-minors

$$\begin{aligned} \Vert \partial _{{{\varvec{y}}}}^{{{\varvec{m}}}}\det J^{\ell _t,\ell '_{t'}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}&= \Vert \partial _{{{\varvec{y}}}}^{{{\varvec{m}}}} J_{\ell _t,\ell '_{t'}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\\&\le {\left\{ \begin{array}{ll} 1+\xi _{{{\varvec{b}}}} &{}\text {if}~{{\varvec{m}}}={{\varvec{0}}},\\ (2\pi )^k\, b_j &{}\text {if}~{{\varvec{m}}}=k\,{{\varvec{e}}}_j,\; k\ge 1,\\ 0 &{} \text {otherwise.} \end{array}\right. } \end{aligned}$$

Suppose now that (3.5) holds for all submatrices of size up to \((q-1)\times (q-1)\), and consider a \(q\times q\) submatrix. For arbitrary \(t'\le q\), the Laplace cofactor expansion yields

$$\begin{aligned} \det J^{{\varvec{\ell }},{\varvec{\ell }}'}({{\varvec{x}}},{{\varvec{y}}}) \,=\, \sum _{t=1}^q (-1)^{t+t'}\, J_{\ell _t,\ell '_{t'}}({{\varvec{x}}},{{\varvec{y}}})\, \det J^{{\varvec{\ell }}\setminus \{\ell _t\},{\varvec{\ell }}'\setminus \{\ell '_{t'}\}}({{\varvec{x}}},{{\varvec{y}}}). \end{aligned}$$
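(As a sanity check outside the proof: the cofactor expansion along an arbitrary column can be confirmed numerically, here for a random \(3\times 3\) matrix with the Sarrus determinant as reference.)

```python
import random

def det2(M):
    (a, b), (c, d) = M
    return a * d - b * c

def det3(M):
    """Closed-form 3x3 determinant (rule of Sarrus), used as the reference value."""
    (a, b, c), (d, e, f), (g, h, i) = M
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def laplace_col(M, tp):
    """Cofactor expansion of a 3x3 determinant down column tp (0-based),
    mirroring the expansion of det J^{l,l'} above."""
    total = 0.0
    for t in range(3):
        minor = [[M[r][c] for c in range(3) if c != tp] for r in range(3) if r != t]
        total += (-1) ** (t + tp) * M[t][tp] * det2(minor)
    return total

random.seed(1)
M = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
print(det3(M))
print(laplace_col(M, 1))   # any column tp gives the same determinant
```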

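As an illustrative aside (not part of the proof), the Laplace cofactor expansion along an arbitrary column can be checked against the Leibniz permutation formula in a few lines of Python; the function names below are ours, chosen for this sketch:

```python
from itertools import permutations

def det_laplace(M, col=0):
    # Laplace (cofactor) expansion of det(M) along column `col`
    q = len(M)
    if q == 1:
        return M[0][0]
    total = 0
    for t in range(q):
        # delete row t and column `col` to form the minor
        minor = [row[:col] + row[col + 1:] for i, row in enumerate(M) if i != t]
        total += (-1) ** (t + col) * M[t][col] * det_laplace(minor)
    return total

def det_leibniz(M):
    # reference value via the Leibniz permutation formula
    q = len(M)
    total = 0
    for perm in permutations(range(q)):
        sign = 1
        for i in range(q):
            for j in range(i + 1, q):
                if perm[i] > perm[j]:
                    sign = -sign
        term = sign
        for i in range(q):
            term *= M[i][perm[i]]
        total += term
    return total

M = [[2, -1, 0], [3, 5, 1], [-2, 4, 6]]
# the expansion gives the same value along every column
assert all(det_laplace(M, col=c) == det_leibniz(M) for c in range(3))
```

The same value is obtained for every expansion column, which is what permits the arbitrary choice of \(t'\le q\) in the proof.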
The Leibniz product rule then gives

$$\begin{aligned}&\partial ^{\varvec{\nu }}_{{\varvec{y}}}\det J^{{\varvec{\ell }},{\varvec{\ell }}'}({{\varvec{x}}},{{\varvec{y}}})\\&\quad = \sum _{t=1}^q (-1)^{t+t'}\!\! \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \!\! \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \Big (\partial ^{{\varvec{m}}}_{{\varvec{y}}}J_{\ell _t,\ell '_{t'}}({{\varvec{x}}},{{\varvec{y}}})\Big ) \Big (\partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}\det J^{{\varvec{\ell }}\setminus \{\ell _t\},{\varvec{\ell }}'\setminus \{\ell '_{t'}\}}({{\varvec{x}}},{{\varvec{y}}})\Big ). \end{aligned}$$

Together with (3.6) and the induction hypothesis (3.5) we obtain

$$\begin{aligned}&\Vert \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}}\det J^{{\varvec{\ell }},{\varvec{\ell }}'}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})} \nonumber \\&\quad \,\le \, \sum _{t=1}^q \bigg (\Vert J_{\ell _t,\ell '_{t'}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})} \Vert \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}}\det J^{{\varvec{\ell }}\setminus \{\ell _t\},{\varvec{\ell }}'\setminus \{\ell '_{t'}\}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})} \nonumber \\&\qquad +\sum _{j\ge 1} \sum _{k=1}^{\nu _j}\left( {\begin{array}{c}\nu _j\\ k\end{array}}\right) \Vert \partial ^{k{{\varvec{e}}}_j}J_{\ell _t,\ell '_{t'}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\nonumber \\&\qquad \times \Vert \partial ^{{\varvec{\nu }}-k{{\varvec{e}}}_j}\det J^{{\varvec{\ell }}\setminus \{\ell _t\},{\varvec{\ell }}'\setminus \{\ell '_{t'}\}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\bigg ) \nonumber \\&\quad \,\le \, \sum _{t=1}^q \bigg ( (1+\xi _{{{\varvec{b}}}} )\, (q-1)!\,(2+\xi _{{{\varvec{b}}}} )^{q-1}\,(2\pi )^{|{\varvec{\nu }}|}\,\sum _{{{\varvec{w}}}\le {\varvec{\nu }}}|{{\varvec{w}}}|!\,{{\varvec{b}}}^{{{\varvec{w}}}}\prod _{i\ge 1}S(\nu _i,w_i) \nonumber \\&\qquad +\sum _{j\ge 1} \sum _{k=1}^{\nu _j}\left( {\begin{array}{c}\nu _j\\ k\end{array}}\right) (2\pi )^k\, b_j\, (q-1)!\,(2+\xi _{{{\varvec{b}}}} )^{q-1}(2\pi )^{|{\varvec{\nu }}|-k}\, \nonumber \\&\qquad \times \sum _{{{\varvec{w}}}\le {\varvec{\nu }}-k{{\varvec{e}}}_j}\bigg (|{{\varvec{w}}}|!\,{{\varvec{b}}}^{{{\varvec{w}}}}\, S(\nu _j-k,w_j)\prod _{{\mathop {\scriptstyle {i\ne j}}\limits ^{\scriptstyle {i\ge 1}}}}S(\nu _i,w_i) \bigg )\bigg ). \end{aligned}$$
(3.7)

To simplify the last term, we obtain, in complete analogy with the derivation presented in [23, Lemma 2.2], the identity

$$\begin{aligned}&\sum _{j\ge 1}\sum _{k=1}^{\nu _j}\left( {\begin{array}{c}\nu _j\\ k\end{array}}\right) b_j\sum _{{{\varvec{w}}}\le {\varvec{\nu }}-k{{\varvec{e}}}_j}|{{\varvec{w}}}|!\,{{\varvec{b}}}^{{{\varvec{w}}}}\, S(\nu _j-k,w_j) \prod _{{\mathop {\scriptstyle {i\ne j}}\limits ^{\scriptstyle {i\ge 1}}}} S(\nu _i,w_i)\\&\quad =\, \sum _{{{\varvec{w}}}\le {\varvec{\nu }}}|{{\varvec{w}}}|!\,{{\varvec{b}}}^{{{\varvec{w}}}}\prod _{i\ge 1}S(\nu _i,w_i). \end{aligned}$$

Plugging this expression into (3.7) and collecting terms yields (3.5), which in turn completes the proof of the lemma. \(\square \)
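As a numerical sanity check (not part of the proof), the identity above can be verified for small multi-indices in exact rational arithmetic; here \(S(n,k)\) is implemented via the standard recurrence for Stirling numbers of the second kind, and the helper name `weighted_sum` is ours:

```python
from fractions import Fraction
from functools import lru_cache
from itertools import product
from math import comb, factorial

@lru_cache(maxsize=None)
def stirling2(n, k):
    # Stirling numbers of the second kind: S(n,k) = k S(n-1,k) + S(n-1,k-1)
    if n == k == 0:
        return 1
    if n == 0 or k == 0:
        return 0
    return k * stirling2(n - 1, k) + stirling2(n - 1, k - 1)

def weighted_sum(nu, b):
    # sum over w <= nu of |w|! b^w prod_i S(nu_i, w_i)
    total = Fraction(0)
    for w in product(*(range(n + 1) for n in nu)):
        term = Fraction(factorial(sum(w)))
        for wi, bi, ni in zip(w, b, nu):
            term *= bi ** wi * stirling2(ni, wi)
        total += term
    return total

nu = (2, 1, 3)
b = (Fraction(1, 2), Fraction(1, 3), Fraction(1, 5))
# left-hand side: the inner sum over w <= nu - k e_j is exactly
# weighted_sum evaluated at the reduced multi-index nu - k e_j
lhs = Fraction(0)
for j, nj in enumerate(nu):
    for k in range(1, nj + 1):
        red = list(nu)
        red[j] -= k
        lhs += comb(nj, k) * b[j] * weighted_sum(tuple(red), b)
assert lhs == weighted_sum(nu, b)
```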

Lemma 3.3

Under assumptions (A1)– (A4), there holds for all \({{\varvec{y}}}\in U\) and \({\varvec{\nu }}\in {\mathscr {F}}\) that

$$\begin{aligned}&\Vert \partial _{{\varvec{y}}}^{{\varvec{\nu }}}A(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\\&\quad \le \frac{d!\,(2+\xi _{{{\varvec{b}}}} )^d}{\sigma _{\min }^2} \bigg (\frac{4\pi (1+\xi _{{{\varvec{b}}}})}{\sigma _{\min }^2}\bigg )^{|{\varvec{\nu }}|} \sum _{{{\varvec{m}}}\le {\varvec{\nu }}}(|{{\varvec{m}}}|+1)!\, a_{|{{\varvec{m}}}|}\,{{\varvec{m}}}!\, {{\varvec{b}}}^{{{\varvec{m}}}}\prod _{i\ge 1}S(\nu _i,m_i), \end{aligned}$$

where the sequence \((a_k)_{k=0}^\infty \) is defined in (3.1).

Proof

Let \({\varvec{\nu }}\in {\mathscr {F}}\). By the Leibniz product rule, we have from (2.6) that

$$\begin{aligned} \partial _{{\varvec{y}}}^{{\varvec{\nu }}}A({{\varvec{x}}},{{\varvec{y}}}) = \sum _{{{\varvec{m}}}\le {\varvec{\nu }}}\left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \Big (\partial _{{\varvec{y}}}^{{{\varvec{m}}}} B^{-1}({{\varvec{x}}},{{\varvec{y}}})\Big )\! \Big (\partial _{{\varvec{y}}}^{{\varvec{\nu }}-{{\varvec{m}}}}\det J({{\varvec{x}}},{{\varvec{y}}})\Big ),\,{{\varvec{x}}}\in {D_{\textrm{ref}}},\,{{\varvec{y}}}\in U. \end{aligned}$$

Taking the matrix 2-norm followed by the \(L^\infty \)-norm over \({{\varvec{x}}}\), and then applying Lemmata 3.1 and 3.2, we obtain

$$\begin{aligned}&\Vert \partial _{{\varvec{y}}}^{{\varvec{\nu }}}A(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\\&\quad \le \sum _{{{\varvec{m}}}\le {\varvec{\nu }}}\left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \big \Vert \partial _{{\varvec{y}}}^{{{\varvec{m}}}} B^{-1}(\cdot ,{{\varvec{y}}}) \big \Vert _{L^\infty ({D_{\textrm{ref}}})} \big \Vert \partial _{{\varvec{y}}}^{{\varvec{\nu }}-{{\varvec{m}}}}\det J(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})}\\&\quad \le c_{|{\varvec{\nu }}|}\sum _{{{\varvec{m}}}\le {\varvec{\nu }}}\sum _{{{\varvec{w}}}\le {{\varvec{m}}}}\sum _{{\varvec{\mu }}\le {\varvec{\nu }}-{{\varvec{m}}}}\left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) {\mathbb {A}}_{{\varvec{w}}}\,{{\mathbb {B}}}_{\varvec{\mu }}\,\prod _{i\ge 1} \Big ( S(m_i,w_i)\,S(\nu _i-m_i,\mu _i) \Big ) \\&\quad = c_{|{\varvec{\nu }}|} \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \bigg (\sum _{{{\varvec{w}}}\le {{\varvec{m}}}} \left( {\begin{array}{c}{{\varvec{m}}}\\ {{\varvec{w}}}\end{array}}\right) \,{\mathbb {A}}_{{\varvec{w}}}\, {{\mathbb {B}}}_{{{\varvec{m}}}-{{\varvec{w}}}}\bigg ) \prod _{i\ge 1}S(\nu _i,m_i), \end{aligned}$$

where \({\mathbb {A}}_{{\varvec{w}}}:= |{{\varvec{w}}}|!\,a_{|{{\varvec{w}}}|}\,{{\varvec{w}}}!\,{{\varvec{b}}}^{{{\varvec{w}}}}\) and \({{\mathbb {B}}}_{\varvec{\mu }}:= |{\varvec{\mu }}|!\,{{\varvec{b}}}^{{\varvec{\mu }}}\), and we overestimated some of the multiplying factors to arrive at

$$\begin{aligned} c_k :=\, \frac{d!\,(2+\xi _{{{\varvec{b}}}} )^d}{\sigma _{\min }^2}\, \bigg (\frac{4\pi (1+\xi _{{{\varvec{b}}}})}{\sigma _{\min }^2}\bigg )^{k}, \qquad k\ge 0. \end{aligned}$$

The last equality is due to Lemma A.3. Using \(a_{|{{\varvec{w}}}|}\le a_{|{{\varvec{m}}}|}\) and \({{\varvec{w}}}!\le {{\varvec{m}}}!\), we have

$$\begin{aligned} \sum _{{{\varvec{w}}}\le {{\varvec{m}}}} \left( {\begin{array}{c}{{\varvec{m}}}\\ {{\varvec{w}}}\end{array}}\right) \,{\mathbb {A}}_{{\varvec{w}}}\, {{\mathbb {B}}}_{{{\varvec{m}}}-{{\varvec{w}}}}&\le a_{|{{\varvec{m}}}|}\,{{\varvec{m}}}!\,{{\varvec{b}}}^{{\varvec{m}}}\sum _{{{\varvec{w}}}\le {{\varvec{m}}}} \left( {\begin{array}{c}{{\varvec{m}}}\\ {{\varvec{w}}}\end{array}}\right) |{{\varvec{w}}}|!\,|{{\varvec{m}}}-{{\varvec{w}}}|!, \end{aligned}$$

where the last sum over \({{\varvec{w}}}\) can be rewritten as

$$\begin{aligned} \sum _{j=0}^{|{{\varvec{m}}}|} j!\,(|{{\varvec{m}}}|-j)! \sum _{\begin{array}{c} {{\varvec{w}}}\le {{\varvec{m}}}\\ |{{\varvec{w}}}|=j \end{array}}\left( {\begin{array}{c}{{\varvec{m}}}\\ {{\varvec{w}}}\end{array}}\right) = \sum _{j=0}^{|{{\varvec{m}}}|} j!\,(|{{\varvec{m}}}|-j)! \left( {\begin{array}{c}|{{\varvec{m}}}|\\ j\end{array}}\right) = (|{{\varvec{m}}}|+1)!, \end{aligned}$$

thus completing the proof. \(\square \)
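As a quick numerical aside (not part of the proof), both steps above, the multivariate Vandermonde identity \(\sum _{{{\varvec{w}}}\le {{\varvec{m}}},\,|{{\varvec{w}}}|=j}\binom{{{\varvec{m}}}}{{{\varvec{w}}}}=\binom{|{{\varvec{m}}}|}{j}\) and the resulting value \((|{{\varvec{m}}}|+1)!\), can be confirmed by brute force for a small multi-index; the helper name is ours:

```python
from itertools import product
from math import comb, factorial

def multi_binom(m, w):
    # multi-index binomial coefficient: prod_i C(m_i, w_i)
    out = 1
    for mi, wi in zip(m, w):
        out *= comb(mi, wi)
    return out

m = (3, 1, 2)
M = sum(m)
boxes = list(product(*(range(mi + 1) for mi in m)))  # all w <= m

# Vandermonde: sum over w <= m with |w| = j of C(m, w) equals C(|m|, j)
for j in range(M + 1):
    assert sum(multi_binom(m, w) for w in boxes if sum(w) == j) == comb(M, j)

# hence sum_{w <= m} C(m, w) |w|! |m - w|! = sum_j j! (M - j)! C(M, j) = (M + 1)!
total = sum(multi_binom(m, w) * factorial(sum(w)) * factorial(M - sum(w))
            for w in boxes)
assert total == factorial(M + 1)
```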

3.2 Parametric regularity of the transported source term

Lemma 3.4

Let \(f\in {\mathcal {C}}^\infty ({\mathcal {D}})\) be analytic. Then there exist constants \(C_f>0\) and \(\rho \ge 1\) such that \(\Vert \partial _{{\varvec{x}}}^{{\varvec{\nu }}}f\Vert _{L^\infty ({\mathcal {D}})}\le C_f\, {\varvec{\nu }}!\,\rho ^{|{\varvec{\nu }}|}\) for all \({\varvec{\nu }}\in {\mathbb {N}}_0^d\). Furthermore, under the assumptions (A1)– (A4), there holds for all \({{\varvec{y}}}\in U\) and \({\varvec{\nu }}\in {\mathscr {F}}\) that the derivatives of \({\widehat{f}}({{\varvec{x}}},{{\varvec{y}}})=f({{\varvec{V}}}({{\varvec{x}}},{{\varvec{y}}}))\) are bounded by

$$\begin{aligned} \big \Vert \partial _{{{\varvec{y}}}}^{{\varvec{\nu }}}{\widehat{f}}(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty ({D_{\textrm{ref}}})} \le C_f(2\pi )^{|{\varvec{\nu }}|} \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \left( {\begin{array}{c}|{{\varvec{m}}}|+d-1\\ d-1\end{array}}\right) |{{\varvec{m}}}|!\rho ^{|{{\varvec{m}}}|} {{\varvec{b}}}^{{{\varvec{m}}}}\prod _{i\ge 1}S(\nu _i,m_i). \end{aligned}$$

Proof

The bound \(\Vert \partial _{{\varvec{x}}}^{{\varvec{\nu }}}f\Vert _{L^\infty ({\mathcal {D}})}\le C_f\, {\varvec{\nu }}!\,\rho ^{|{\varvec{\nu }}|}\) is a consequence of the Cauchy integral formula for analytic functions of several variables (cf., e.g., [20, Theorem 2.2.1]).

Let \({{\varvec{y}}}\in U\). Trivially, we see that \(\Vert \widehat{f}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty (D_{\textrm{ref}})}\le C_f\), so we may assume in the following that \({\varvec{\nu }}\in {\mathscr {F}}\setminus \{{{\varvec{0}}}\}\). We will make use of the multivariate Faà di Bruno formula [29, Theorem 3.1 and Remark 3.3]

$$\begin{aligned} \partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widehat{f}}}({{\varvec{x}}},{{\varvec{y}}}) \,=\, \sum _{\begin{array}{c} {\varvec{\lambda }}\in {\mathbb {N}}_0^d\\ 1\le |{\varvec{\lambda }}|\le |{\varvec{\nu }}| \end{array}} (\partial _{{\varvec{x}}}^{{\varvec{\lambda }}}f)({\varvec{V}}({{\varvec{x}}},{{\varvec{y}}}))\, \kappa _{{\varvec{\nu }},{\varvec{\lambda }}}({{\varvec{x}}},{{\varvec{y}}}), \end{aligned}$$
(3.8)

where we define \(\kappa _{{\varvec{\nu }},{\varvec{\lambda }}}({{\varvec{x}}},{{\varvec{y}}})\) for general \({\varvec{\nu }}\in {\mathscr {F}}\) and \({\varvec{\lambda }}\in {\mathbb {Z}}^d\) in a recursive manner as follows: \(\kappa _{{\varvec{\nu }},{{\varvec{0}}}}({{\varvec{x}}},{{\varvec{y}}}):= \delta _{{\varvec{\nu }},{{\varvec{0}}}}\), \(\kappa _{{\varvec{\nu }},{\varvec{\lambda }}}({{\varvec{x}}},{{\varvec{y}}}):= 0\) if \(|{\varvec{\nu }}|<|{\varvec{\lambda }}|\) or \({\varvec{\lambda }}\not \ge {{\varvec{0}}}\) (i.e., \({\varvec{\lambda }}\) contains negative entries), and otherwise

$$\begin{aligned} \kappa _{{\varvec{\nu }}+{{\varvec{e}}}_j,{\varvec{\lambda }}}({{\varvec{x}}},{{\varvec{y}}}) \,:=\, \sum _{\ell =1}^d\sum _{{{\varvec{0}}}\le {{\varvec{m}}}\le {\varvec{\nu }}} \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \partial _{{\varvec{y}}}^{{{\varvec{m}}}+{{\varvec{e}}}_j}[{{\varvec{V}}}({{\varvec{x}}},{{\varvec{y}}})]_\ell \; \kappa _{{\varvec{\nu }}-{{\varvec{m}}},{\varvec{\lambda }}-{{\varvec{e}}}_\ell }({{\varvec{x}}},{{\varvec{y}}}). \end{aligned}$$
(3.9)

From the definition of \({{\varvec{V}}}\) in (2.1) it is easy to see that

$$\begin{aligned} \partial ^{{\varvec{m}}}_{{\varvec{y}}}[{{\varvec{V}}}({{\varvec{x}}},{{\varvec{y}}})]_\ell \,=\, {\left\{ \begin{array}{ll} x_\ell + \frac{1}{\sqrt{6}}\sum _{i\ge 1}\sin (2\pi y_i)\,[{\varvec{\psi }}_i({{\varvec{x}}})]_\ell &{} \text{ if } {{\varvec{m}}}= {{\varvec{0}}}, \\ \frac{1}{\sqrt{6}}\,(2\pi )^k\,\sin (2\pi y_j + k\,\frac{\pi }{2})\,[{\varvec{\psi }}_j({{\varvec{x}}})]_\ell &{} \text{ if } {{\varvec{m}}}= k\,{{\varvec{e}}}_j,\; k\ge 1,\\ 0 &{} \text{ otherwise, } \end{array}\right. } \end{aligned}$$

which yields

$$\begin{aligned} \big \Vert \partial _{{\varvec{y}}}^{{{\varvec{m}}}+{{\varvec{e}}}_j}[{{\varvec{V}}}(\cdot ,{{\varvec{y}}})]_\ell \big \Vert _{L^\infty (D_{\textrm{ref}})} \le {\left\{ \begin{array}{ll}(2\pi )^{k+1}\, b_j &{} \text{ if } {{\varvec{m}}}= k\,{{\varvec{e}}}_j,\; k\ge 0,\\ 0&{}\text {otherwise}. \end{array}\right. } \end{aligned}$$

Thus we obtain from (3.9) the recursion

$$\begin{aligned} \Vert \kappa _{{\varvec{\nu }}+{\varvec{e}}_j,{\varvec{\lambda }}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty (D_{\textrm{ref}})} \le b_j\sum _{k=0}^{\nu _j}(2\pi )^{k+1}\left( {\begin{array}{c}\nu _j\\ k\end{array}}\right) \sum _{\begin{array}{c} \ell =1\\ \lambda _\ell >0 \end{array}}^d\Vert \kappa _{{\varvec{\nu }}-k {{\varvec{e}}}_j,{\varvec{\lambda }}-{{\varvec{e}}}_\ell }(\cdot ,{{\varvec{y}}})\Vert _{L^\infty (D_{\textrm{ref}})}. \end{aligned}$$

By Lemma A.4 with \(c=2\pi \), we have for all \({\varvec{\nu }}\in {\mathscr {F}}\) and \({\varvec{\lambda }}\in {\mathbb {N}}_0^d\) that

$$\begin{aligned} \Vert \kappa _{{\varvec{\nu }},{\varvec{\lambda }}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty (D_{\textrm{ref}})} \le (2\pi )^{|{\varvec{\nu }}|}\,\frac{|{\varvec{\lambda }}|!}{{\varvec{\lambda }}!}\sum _{\begin{array}{c} {{\varvec{m}}}\le {\varvec{\nu }}\\ |{{\varvec{m}}}|=|{\varvec{\lambda }}| \end{array}}{{\varvec{b}}}^{{{\varvec{m}}}} \prod _{i\ge 1}S(\nu _i,m_i), \end{aligned}$$

which together with (3.8) yields for \({\varvec{\nu }}\in {\mathscr {F}}\setminus \{{{\varvec{0}}}\}\) that

$$\begin{aligned} \big \Vert \partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widehat{f}}}(\cdot ,{{\varvec{y}}})\big \Vert _{L^\infty (D_{\textrm{ref}})}&\le C_f(2\pi )^{|{\varvec{\nu }}|}\sum _{\begin{array}{c} {\varvec{\lambda }}\in {\mathbb {N}}_0^d\\ 1\le |{\varvec{\lambda }}|\le |{\varvec{\nu }}| \end{array}} |{\varvec{\lambda }}|!\,\rho ^{|{\varvec{\lambda }}|}\sum _{\begin{array}{c} {{\varvec{m}}}\le {\varvec{\nu }}\\ |{{\varvec{m}}}|=|{\varvec{\lambda }}| \end{array}}{{\varvec{b}}}^{{{\varvec{m}}}}\prod _{i\ge 1}S(\nu _i,m_i)\\&= C_f(2\pi )^{|{\varvec{\nu }}|}\sum _{{{\varvec{0}}}\ne {{\varvec{m}}}\le {\varvec{\nu }}} |{{\varvec{m}}}|!\rho ^{|{{\varvec{m}}}|}\, {{\varvec{b}}}^{{{\varvec{m}}}} \bigg (\prod _{i\ge 1}S(\nu _i,m_i)\bigg )\sum _{\begin{array}{c} {\varvec{\lambda }}\in {\mathbb {N}}_0^d\\ |{\varvec{\lambda }}|=|{{\varvec{m}}}| \end{array}}1. \end{aligned}$$

This yields the desired result, since \(\sum _{{\varvec{\lambda }}\in \mathbb N_0^d,\,|{\varvec{\lambda }}|=|{{\varvec{m}}}|}1 = \left( {\begin{array}{c}|{{\varvec{m}}}|+d-1\\ d-1\end{array}}\right) \) by a standard stars-and-bars count. The condition \({{\varvec{m}}}\ne {{\varvec{0}}}\) in the sum can be dropped, since the \({{\varvec{m}}}={{\varvec{0}}}\) term vanishes whenever \({\varvec{\nu }}\ne {{\varvec{0}}}\) (the product then contains a factor \(S(\nu _i,0)=0\) for some \(\nu _i\ge 1\)). This also allows us to extend the formula to cover the case \({\varvec{\nu }}={{\varvec{0}}}\). \(\square \)
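As a brief numerical aside (not part of the proof), the stars-and-bars count \(\#\{{\varvec{\lambda }}\in {\mathbb {N}}_0^d:|{\varvec{\lambda }}|=n\}=\binom{n+d-1}{d-1}\) used in the final step can be confirmed by brute force:

```python
from itertools import product
from math import comb

# number of lambda in N_0^d with |lambda| = n equals C(n + d - 1, d - 1)
for d in range(1, 5):
    for n in range(0, 7):
        count = sum(1 for lam in product(range(n + 1), repeat=d)
                    if sum(lam) == n)
        assert count == comb(n + d - 1, d - 1)
```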

Lemma 3.5

Under the assumptions of Lemma 3.4, there holds for all \({{\varvec{y}}}\in U\) and \({\varvec{\nu }}\in {\mathscr {F}}\) that

$$\begin{aligned}&\Vert \partial _{{\varvec{y}}}^{\varvec{\nu }}f_{\textrm{ref}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\\&\quad \,\le \, C_f\,d!\,(2+\xi _{{{\varvec{b}}}})^d\,(2\pi )^{|{\varvec{\nu }}|}\sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \frac{(|{{\varvec{m}}}|+d)!}{d!}\,\rho ^{|{{\varvec{m}}}|}\,{{\varvec{b}}}^{{{\varvec{m}}}}\prod _{i\ge 1}S(\nu _i,m_i). \end{aligned}$$

Proof

The proof follows essentially the same steps as the proof of Lemma 3.3. By the Leibniz product rule, we have

$$\begin{aligned} \partial _{{\varvec{y}}}^{\varvec{\nu }}f_{\textrm{ref}}({{\varvec{x}}},{{\varvec{y}}}) \,=\, \sum _{{{\varvec{m}}}\le {\varvec{\nu }}}\left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \Big (\partial _{{\varvec{y}}}^{{{\varvec{m}}}}{\widehat{f}}({{\varvec{x}}},{{\varvec{y}}})\Big ) \Big (\partial _{{\varvec{y}}}^{{\varvec{\nu }}-{{\varvec{m}}}}\det J({{\varvec{x}}},{{\varvec{y}}})\Big ),\,{{\varvec{x}}}\in {D_{\textrm{ref}}},\,{{\varvec{y}}}\in U. \end{aligned}$$

Taking the \(L^\infty \)-norm over \({{\varvec{x}}}\) and using Lemmata 3.2 and 3.4 yields

$$\begin{aligned}&\Vert \partial _{{\varvec{y}}}^{\varvec{\nu }}f_{\textrm{ref}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\\&\quad \le \sum _{{{\varvec{m}}}\le {\varvec{\nu }}}\left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \Vert \partial _{{\varvec{y}}}^{{{\varvec{m}}}}{{\widehat{f}}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty (D_{\textrm{ref}})} \Vert \partial _{{\varvec{y}}}^{{\varvec{\nu }}-{{\varvec{m}}}}\det J(\cdot ,{{\varvec{y}}})\Vert _{L^\infty (D_{\textrm{ref}})}\\&\quad \le c_{|{\varvec{\nu }}|} \sum _{{{\varvec{m}}}\le {\varvec{\nu }}}\sum _{{{\varvec{w}}}\le {{\varvec{m}}}}\sum _{{\varvec{\mu }}\le {\varvec{\nu }}-{{\varvec{m}}}}\left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) {\mathbb {A}}_{{\varvec{w}}}\, {{\mathbb {B}}}_{\varvec{\mu }}\prod _{i\ge 1} \Big ( S(m_i,w_i)\,S(\nu _i-m_i,\mu _i) \Big ) \\&\quad = c_{|{\varvec{\nu }}|} \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \bigg (\sum _{{{\varvec{w}}}\le {{\varvec{m}}}}\left( {\begin{array}{c}{{\varvec{m}}}\\ {{\varvec{w}}}\end{array}}\right) \, {\mathbb {A}}_{{\varvec{w}}}\, {{\mathbb {B}}}_{{{\varvec{m}}}-{{\varvec{w}}}}\bigg ) \prod _{i\ge 1}S(\nu _i,m_i), \end{aligned}$$

where now \({\mathbb {A}}_{{\varvec{w}}}:= \left( {\begin{array}{c}|{{\varvec{w}}}|+d-1\\ d-1\end{array}}\right) \,|{{\varvec{w}}}|!\,\rho ^{|{{\varvec{w}}}|}\,{{\varvec{b}}}^{{{\varvec{w}}}}\), \({{\mathbb {B}}}_{{\varvec{\mu }}}:= |{\varvec{\mu }}|!\,{{\varvec{b}}}^{{\varvec{\mu }}}\), and \(c_k:= C_f\, d!\,(2+\xi _{{{\varvec{b}}}})^d\, (2\pi )^k\). Using \(\rho ^{|{{\varvec{w}}}|}\le \rho ^{|{{\varvec{m}}}|}\), we have

$$\begin{aligned} \sum _{{{\varvec{w}}}\le {{\varvec{m}}}}\left( {\begin{array}{c}{{\varvec{m}}}\\ {{\varvec{w}}}\end{array}}\right) \, {\mathbb {A}}_{{\varvec{w}}}\, {{\mathbb {B}}}_{{{\varvec{m}}}-{{\varvec{w}}}}&\le \rho ^{|{{\varvec{m}}}|}\,{{\varvec{b}}}^{{\varvec{m}}}\sum _{{{\varvec{w}}}\le {{\varvec{m}}}} \left( {\begin{array}{c}{{\varvec{m}}}\\ {{\varvec{w}}}\end{array}}\right) \left( {\begin{array}{c}|{{\varvec{w}}}|+d-1\\ d-1\end{array}}\right) \,|{{\varvec{w}}}|!\,|{{\varvec{m}}}-{{\varvec{w}}}|!, \end{aligned}$$

where the last sum over \({{\varvec{w}}}\) can be rewritten as

$$\begin{aligned}&\sum _{j=0}^{|{{\varvec{m}}}|} \frac{(j+d-1)!\,(|{{\varvec{m}}}|-j)!}{(d-1)!} \sum _{{\mathop {\scriptstyle {|{{\varvec{w}}}|=j}}\limits ^{\scriptstyle {{{\varvec{w}}}\le {{\varvec{m}}}}}}} \left( {\begin{array}{c}{{\varvec{m}}}\\ {{\varvec{w}}}\end{array}}\right) = \sum _{j=0}^{|{{\varvec{m}}}|} \frac{(j+d-1)!\,(|{{\varvec{m}}}|-j)!}{(d-1)!} \left( {\begin{array}{c}|{{\varvec{m}}}|\\ j\end{array}}\right) \\&\quad = |{{\varvec{m}}}|!\sum _{j=0}^{|{{\varvec{m}}}|} \left( {\begin{array}{c}j+d-1\\ j\end{array}}\right) = |{{\varvec{m}}}|!\,\left( {\begin{array}{c}|{{\varvec{m}}}|+d\\ |{{\varvec{m}}}|\end{array}}\right) = \frac{(|{{\varvec{m}}}|+d)!}{d!}. \end{aligned}$$

This completes the proof. \(\square \)
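As a quick numerical aside (not part of the proof), the hockey-stick identity \(\sum _{j=0}^{M}\binom{j+d-1}{j}=\binom{M+d}{M}\) and the final factorial simplification in the display above can be checked directly:

```python
from math import comb, factorial

for d in range(1, 6):
    for M in range(0, 9):
        # hockey-stick identity
        assert sum(comb(j + d - 1, j) for j in range(M + 1)) == comb(M + d, M)
        # and therefore M! * C(M + d, M) = (M + d)! / d!
        assert factorial(M) * comb(M + d, M) == factorial(M + d) // factorial(d)
```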

3.3 Parametric regularity of the transported PDE

We have the following a priori bound.

Lemma 3.6

Under the assumptions of Lemma 3.4, there holds for all \({{\varvec{y}}}\in U\) that

$$\begin{aligned} \Vert {\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1({D_{\textrm{ref}}})} \le \frac{(1+\xi _{{\varvec{b}}})^2}{\sigma _{\min }^d}\,C_P\,|{D_{\textrm{ref}}}|^{1/2}\, C_f\,d!\,(2+\xi _{{\varvec{b}}})^d \,=: C_{\textrm{init}}, \end{aligned}$$
(3.10)

where \(C_P>0\) is the Poincaré constant associated with the domain \({D_{\textrm{ref}}}\).

Proof

We take \({\widehat{v}}({{\varvec{x}}}) = {\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\) as a test function in (2.7) to obtain

$$\begin{aligned}&\int _{{D_{\textrm{ref}}}}(A({{\varvec{x}}},{{\varvec{y}}})\nabla {\widehat{u}}({{\varvec{x}}},{{\varvec{y}}}))\cdot \nabla {\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\,\textrm{d}{{\varvec{x}}}\,=\, \int _{{D_{\textrm{ref}}}}f_{\textrm{ref}}({{\varvec{x}}},{{\varvec{y}}})\,{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\,\textrm{d}{{\varvec{x}}}\nonumber \\&\quad \,\le \, C_P\, |{D_{\textrm{ref}}}|^{1/2}\,\Vert f_{\textrm{ref}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\Vert {\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H^1_0({D_{\textrm{ref}}})}, \end{aligned}$$
(3.11)

where we used the Cauchy–Schwarz inequality and the Poincaré inequality \(\Vert \cdot \Vert _{L^2({D_{\textrm{ref}}})}\le C_P\,\Vert \cdot \Vert _{H_0^1({D_{\textrm{ref}}})}\), with the constant \(C_P>0\) depending only on the domain \({D_{\textrm{ref}}}\).

From the definition of A in (2.6), the left-hand side of (3.11) can be written as

$$\begin{aligned}&\int _{{D_{\textrm{ref}}}} \big \Vert \big (J^{-1}({{\varvec{x}}},{{\varvec{y}}})\big )^{\top }\, \nabla {\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\big \Vert _2^2\,\det (J({{\varvec{x}}},{{\varvec{y}}}))\,\textrm{d}{{\varvec{x}}}\\&\quad \ge \int _{{D_{\textrm{ref}}}} \big (\min \sigma \big (J^{-1}({{\varvec{x}}},{{\varvec{y}}})^{{\top }}\big ) \big )^2\, \Vert \nabla {\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\Vert _2^2\, \big (\min \sigma (J({{\varvec{x}}},{{\varvec{y}}}))\big )^d \,\textrm{d}{{\varvec{x}}}\\&\quad \ge \frac{\sigma _{\min }^d}{\sigma _{\max }^2}\, \Vert {\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H^1_0({D_{\textrm{ref}}})}^2, \end{aligned}$$

where we used assumption (A3) and (2.3). We recall that \(\sigma _{\max }=1+\xi _{{{\varvec{b}}}}\), see the line below (2.2). The proof is completed by combining the upper and lower bounds and applying the bound on \(\Vert f_{\textrm{ref}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}\) obtained by taking \({\varvec{\nu }}= {{\varvec{0}}}\) in Lemma 3.5. \(\square \)
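As a small numerical aside (not part of the proof), the two linear-algebra facts underlying the lower bound, namely that \(\det J\) equals the product of the singular values of \(J\) whenever \(\det J>0\) and that \(\Vert (J^{-1})^{\top }{{\varvec{g}}}\Vert _2\ge \Vert {{\varvec{g}}}\Vert _2/\sigma _{\max }(J)\), can be checked for a \(2\times 2\) example in pure Python; the matrix and vector below are arbitrary choices for this sketch:

```python
import math

def singular_values_2x2(a, b, c, d):
    # singular values of J = [[a, b], [c, d]] via the eigenvalues of J^T J
    t = a * a + b * b + c * c + d * d      # trace(J^T J)
    det_sq = (a * d - b * c) ** 2          # det(J^T J) = det(J)^2
    disc = math.sqrt(max(t * t - 4 * det_sq, 0.0))
    return math.sqrt((t + disc) / 2), math.sqrt((t - disc) / 2)

a, b, c, d = 1.2, 0.3, -0.1, 0.9          # a sample J with det J > 0
s1, s2 = singular_values_2x2(a, b, c, d)  # s1 = sigma_max, s2 = sigma_min
det_j = a * d - b * c

# det J equals the product of the singular values (det J > 0 here),
# hence det J >= (min singular value)^d with d = 2
assert abs(det_j - s1 * s2) < 1e-12
assert det_j >= s2 ** 2 - 1e-12

# ||J^{-T} g||_2 >= ||g||_2 / sigma_max(J) for a sample vector g
g1, g2 = 1.0, 2.0
w1 = (d * g1 - c * g2) / det_j            # J^{-T} = (1/det J) [[d, -c], [-b, a]]
w2 = (-b * g1 + a * g2) / det_j
assert math.hypot(w1, w2) >= math.hypot(g1, g2) / s1 - 1e-12
```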

Theorem 3.7

Under the assumptions of Lemma 3.4, there holds for all \({{\varvec{y}}}\in U\) and \({\varvec{\nu }}\in {\mathscr {F}}\setminus \{{{\varvec{0}}}\}\) that

$$\begin{aligned}&\Vert \partial _{{\varvec{y}}}^{\varvec{\nu }}{\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1({D_{\textrm{ref}}})} \\&\quad \le 2C_{\textrm{init}}\bigg (\frac{4\pi d!(2\!+\!\xi _{{{\varvec{b}}}})^d(1\!+\!\xi _{{{\varvec{b}}}})^3}{\sigma _{\min }^{d+4}}\bigg )^{|{\varvec{\nu }}|} \!\!\sum _{{{\varvec{0}}}\ne {{\varvec{m}}}\le {\varvec{\nu }}}\! \frac{(|{{\varvec{m}}}|+d-1)!}{(d-1)!}{{\varvec{m}}}!{\varvec{\beta }}^{{{\varvec{m}}}}\prod _{i\ge 1}S(\nu _i,m_i), \end{aligned}$$

where \(\beta _j:= (2+\sqrt{2})\max (1+\sqrt{3},\rho )\,b_j\).

Proof

We prove the theorem by induction on the order of \({\varvec{\nu }}\). The case \({\varvec{\nu }}= {{\varvec{0}}}\) holds by Lemma 3.6. Let \({{\varvec{y}}}\in U\) and \({\varvec{\nu }}\in {\mathscr {F}}\setminus \{{\textbf{0}}\}\), and suppose that the result holds for all multi-indices of order less than \(|{\varvec{\nu }}|\).

We differentiate both sides of (2.7) and apply the Leibniz product rule to obtain for all \({\widehat{v}}\in H_0^1({D_{\textrm{ref}}})\) that

$$\begin{aligned}&\int _{{D_{\textrm{ref}}}} \!\!\!\bigg ( \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \big (\partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}A({{\varvec{x}}},{{\varvec{y}}})\big )\, \nabla \big (\partial ^{{{\varvec{m}}}}_{{\varvec{y}}}{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\big )\bigg ) \cdot \nabla {\widehat{v}}({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}}\\&\quad = \int _{{D_{\textrm{ref}}}} \!\!\big (\partial ^{{\varvec{\nu }}}_{{\varvec{y}}}f_{\textrm{ref}}({{\varvec{x}}},{{\varvec{y}}})\big )\,{\widehat{v}}({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}}. \end{aligned}$$

Taking \({\widehat{v}}({{\varvec{x}}}) = \partial ^{\varvec{\nu }}_{{\varvec{y}}}{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\) and separating out the \({{\varvec{m}}}={\varvec{\nu }}\) term, we obtain

$$\begin{aligned}&\int _{{D_{\textrm{ref}}}} \big (A({{\varvec{x}}},{{\varvec{y}}})\nabla \big (\partial ^{\varvec{\nu }}_{{\varvec{y}}}{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\big )\big ) \cdot \nabla \big (\partial ^{\varvec{\nu }}_{{\varvec{y}}}{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\big )\,\textrm{d}{{\varvec{x}}}\\&\quad \,=\, - \sum _{{\mathop {\scriptstyle {{{\varvec{m}}}\ne {\varvec{\nu }}}}\limits ^{\scriptstyle {{{\varvec{m}}}\le {\varvec{\nu }}}}}} \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \int _{{D_{\textrm{ref}}}} \big (\big (\partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}A({{\varvec{x}}},{{\varvec{y}}})\big )\, \nabla \big (\partial ^{{{\varvec{m}}}}_{{\varvec{y}}}{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\big )\big ) \cdot \nabla \big (\partial ^{\varvec{\nu }}_{{\varvec{y}}}{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\big )\,\textrm{d}{{\varvec{x}}}\\&\qquad + \int _{{D_{\textrm{ref}}}} \big (\partial ^{{\varvec{\nu }}}_{{\varvec{y}}}f_{\textrm{ref}}({{\varvec{x}}},{{\varvec{y}}})\big )\, \big (\partial ^{\varvec{\nu }}_{{\varvec{y}}}{\widehat{u}}({{\varvec{x}}},{{\varvec{y}}})\big )\,\textrm{d}{{\varvec{x}}}. \end{aligned}$$

Using similar steps as in the proof of Lemma 3.6, we then arrive at

$$\begin{aligned}&\frac{\sigma _{\min }^d}{(1+\xi _{{\varvec{b}}})^2}\Vert \partial _{{\varvec{y}}}^{\varvec{\nu }}{\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1({D_{\textrm{ref}}})}\\&\quad \le \, \sum _{{\mathop {\scriptstyle {{{\varvec{m}}}\ne {\varvec{\nu }}}}\limits ^{\scriptstyle {{{\varvec{m}}}\le {\varvec{\nu }}}}}} \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \, \Vert \partial ^{{\varvec{\nu }}-{{\varvec{m}}}}_{{\varvec{y}}}A(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})} \, \Vert \partial ^{{{\varvec{m}}}}_{{\varvec{y}}}{\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H^1_0({D_{\textrm{ref}}})} \\&\qquad + C_P\, |{D_{\textrm{ref}}}|^{1/2}\,\Vert \partial ^{{\varvec{\nu }}}_{{\varvec{y}}}f_{\textrm{ref}}(\cdot ,{{\varvec{y}}})\Vert _{L^\infty ({D_{\textrm{ref}}})}. \end{aligned}$$

Combining this with Lemmata 3.3 and 3.5, we obtain

$$\begin{aligned}&\Vert \partial _{{\varvec{y}}}^{\varvec{\nu }}{\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1({D_{\textrm{ref}}})} \\&\quad \le \frac{(1+\xi _{{\varvec{b}}})^2}{\sigma _{\min }^d} \sum _{{\mathop {\scriptstyle {{{\varvec{m}}}\ne {\varvec{\nu }}}}\limits ^{\scriptstyle {{{\varvec{m}}}\le {\varvec{\nu }}}}}} \left( {\begin{array}{c}{\varvec{\nu }}\\ {{\varvec{m}}}\end{array}}\right) \Vert \partial ^{{{\varvec{m}}}}_{{\varvec{y}}}{\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H^1_0({D_{\textrm{ref}}})}\frac{d!\,(2+\xi _{{{\varvec{b}}}} )^d}{\sigma _{\min }^2} \bigg (\frac{4\pi (1+\xi _{{{\varvec{b}}}})}{\sigma _{\min }^2}\bigg )^{|{\varvec{\nu }}-{{\varvec{m}}}|} \\&\qquad \times \!\!\! \sum _{{\varvec{\mu }}\le {\varvec{\nu }}-{{\varvec{m}}}}(|{\varvec{\mu }}|+1)!\, a_{|{\varvec{\mu }}|}\,{\varvec{\mu }}!\,{{\varvec{b}}}^{{\varvec{\mu }}}\prod _{i\ge 1}S(\nu _i-m_i,\mu _i) \\&\qquad + \frac{(1+\xi _{{\varvec{b}}})^2}{\sigma _{\min }^d}\, C_P\, |{D_{\textrm{ref}}}|^{1/2}\,C_f\,d!\,(2+\xi _{{{\varvec{b}}}})^d\,(2\pi )^{|{\varvec{\nu }}|}\\&\qquad \times \sum _{{{\varvec{m}}}\le {\varvec{\nu }}} \frac{(|{{\varvec{m}}}|+d)!}{d!}\,\rho ^{|{{\varvec{m}}}|}\,{{\varvec{b}}}^{{{\varvec{m}}}}\prod _{i\ge 1}S(\nu _i,m_i). \end{aligned}$$

After some further estimation, and using \(a_k\le (1+\sqrt{3})^k\) (which follows easily from (3.1)), this can be expressed in the form of the recursive bound in Lemma A.5 with

$$\begin{aligned} \beta _j&:= \max (1+\sqrt{3},\rho )\,b_j, \\ c_0&:= \frac{(1+\xi _{{\varvec{b}}})^2}{\sigma _{\min }^d}\,C_P\, |{D_{\textrm{ref}}}|^{1/2}\,C_f\,d!\,(2+\xi _{{{\varvec{b}}}})^d = C_{\textrm{init}}, \\ c&:= \frac{(1+\xi _{{\varvec{b}}})^2}{\sigma _{\min }^d}\frac{d!\,(2+\xi _{{{\varvec{b}}}} )^d}{\sigma _{\min }^2} \frac{4\pi (1+\xi _{{{\varvec{b}}}})}{\sigma _{\min }^2}. \end{aligned}$$

In particular, we used \(|{\varvec{\nu }}-{{\varvec{m}}}|\ge 1\) for the exponent in \((\frac{4\pi (1+\xi _{{{\varvec{b}}}})}{\sigma _{\min }^2})^{|{\varvec{\nu }}-{{\varvec{m}}}|}\), which allows this factor to be absorbed into the constant \(c\) defined above.

We then conclude from Lemmata A.5, A.7 and A.8 that for \({\varvec{\nu }}\ne {{\varvec{0}}}\)

$$\begin{aligned} \Vert \partial _{{\varvec{y}}}^{\varvec{\nu }}{\widehat{u}}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1({D_{\textrm{ref}}})}\! \le 2c_0c^{|{\varvec{\nu }}|}\!\!\sum _{{{\varvec{0}}}\ne {{\varvec{m}}}\le {\varvec{\nu }}}\! (2\!+\!\sqrt{2})^{|{{\varvec{m}}}|}\frac{(|{{\varvec{m}}}|\!+\!d\!-\!1)!}{(d\!-\!1)!}{{\varvec{m}}}!{\varvec{\beta }}^{{{\varvec{m}}}}\!\prod _{i\ge 1}\!S(\nu _i,m_i). \end{aligned}$$

Redefining \(\beta _j\) to incorporate the factor \(2+\sqrt{2}\) completes the proof. \(\square \)

4 Error analysis

In practice, solving (2.7) is only possible after truncating the input random field (2.1) to finitely many terms and discretizing the spatial domain using, e.g., finite elements. In this section, we analyze the errors incurred by dimension truncation, finite element discretization, and QMC cubature applied to the expected value of the dimensionally-truncated finite element solution of (2.7). We also discuss the construction of QMC cubatures with higher order convergence rates independent of the dimension, and present a combined error estimate for the problem.

4.1 Dimension truncation error

We are interested in deriving a convergence rate for

$$\begin{aligned} \bigg \Vert \int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{L^1({D_{\textrm{ref}}})}, \end{aligned}$$

where \({{\widehat{u}}}\) denotes the solution to (2.7) as before, and the dimensionally-truncated PDE solution is defined by

$$\begin{aligned} \widehat{u}_s(\cdot ,{{\varvec{y}}}):=\widehat{u}(\cdot ,(y_1,\ldots ,y_s,0,0,\ldots ))\quad \text {for}~{{\varvec{y}}}\in U,~s\in \mathbb N.\end{aligned}$$

In order to derive a dimension-truncation error bound for this problem, we employ the Taylor series approach used in [9, 12] together with a change of variable technique. (The Neumann series approach used in, e.g., [7] is not applicable here because the transported problem cannot be recast in affine parametric form.)

Theorem 4.1

Suppose that the assumptions (A1)– (A6) hold. Then the dimensionally-truncated solution to (2.7) satisfies the error bounds

$$\begin{aligned} \bigg \Vert \int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{H_0^1(D_{\textrm{ref}})}={\mathcal {O}}(s^{-2/p+1}) \end{aligned}$$

and

$$\begin{aligned} \bigg \Vert \int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{L^1(D_{\textrm{ref}})}={\mathcal {O}}(s^{-2/p+1}), \end{aligned}$$

where the implied constants are independent of the dimension \(s\).

Proof

We start by defining

$$\begin{aligned} {\widetilde{{\varvec{\psi }}}}_i({{\varvec{x}}}):=\,\frac{1}{\sqrt{6}}\varvec{\psi }_i({{\varvec{x}}}),\quad i\in {\mathbb {N}}, \end{aligned}$$

where the fluctuations satisfy \((\Vert \varvec{\psi }_i\Vert _{W^{1,\infty }})_{i\ge 1}\in \ell ^p\) for the same \(p\) as in (A5). Let us consider an auxiliary PDE problem

$$\begin{aligned} {\left\{ \begin{array}{ll} -\varDelta u_{\textrm{aff}}({{\varvec{x}}},{{\varvec{y}}})=f({{\varvec{x}}}),&{}{{\varvec{x}}}\in {{\widetilde{D}}}({{\varvec{y}}}),\\ u_{\textrm{aff}}(\cdot ,{{\varvec{y}}})|_{\partial {{\widetilde{D}}}({{\varvec{y}}})}=0, \end{array}\right. }\quad \text {for all}~{{\varvec{y}}}\in [-1,1]^{{\mathbb {N}}}, \end{aligned}$$
(4.1)

where the domain realizations \({{\widetilde{D}}}({{\varvec{y}}}):=\widetilde{{\varvec{V}}}(D_{\textrm{ref}},{{\varvec{y}}})\) are defined instead via an affine parametric random perturbation field

$$\begin{aligned} \widetilde{{\varvec{V}}}({{\varvec{x}}},{{\varvec{y}}}):=\,{{\varvec{x}}}+\sum _{i\ge 1}y_i \widetilde{\varvec{\psi }}_i({{\varvec{x}}}) \end{aligned}$$

with \((\Vert \widetilde{\varvec{\psi }}_i\Vert _{W^{1,\infty }})_{i\ge 1}\in \ell ^p\) for the same p as in (A5). The random field \(\widetilde{{\varvec{V}}}\) satisfies the hypotheses of [15, Theorem 5] (note that the random variables in [15] are supported on \([-1,1]\) instead of \([-1/2,1/2]\)) and the transported PDE solution corresponding to (4.1) satisfies

$$\begin{aligned} \Vert \partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widehat{u}}}_{\textrm{aff}}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1(D_{\textrm{ref}})}\le C\,|{\varvec{\nu }}|!\,{{\widetilde{{{\varvec{b}}}}}}^{{\varvec{\nu }}}, \end{aligned}$$

for some \({{\widetilde{{{\varvec{b}}}}}}\in \ell ^p\), where p is the same as above.

Let \(G\in H^{-1}(D_{\textrm{ref}})\) be arbitrary and define

$$\begin{aligned} {{\widetilde{F}}}({{\varvec{y}}}):=\,\langle G,{{\widehat{u}}}_{\textrm{aff}}(\cdot ,{{\varvec{y}}})\rangle _{H^{-1}(D_{\textrm{ref}}),H_0^1(D_{\textrm{ref}})}. \end{aligned}$$

Clearly,

$$\begin{aligned} \partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widetilde{F}}}({{\varvec{y}}})=\langle G,\partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widehat{u}}}_{\textrm{aff}}(\cdot ,{{\varvec{y}}})\rangle _{H^{-1}(D_{\textrm{ref}}),H_0^1(D_{\textrm{ref}})}. \end{aligned}$$

Fix \({{\varvec{y}}}\in [-1,1]^{{\mathbb {N}}}\). Then developing the Taylor series of \({{\widetilde{F}}}\) about the point \(({{\varvec{y}}}_{\le s},{\textbf{0}}):=(y_1,\ldots ,y_s,0,0,\ldots )\) yields

$$\begin{aligned} {{\widetilde{F}}}({{\varvec{y}}})-{{\widetilde{F}}}({{\varvec{y}}}_{\le s},{\textbf{0}})&=\sum _{\ell =1}^k \sum _{\begin{array}{c} |{\varvec{\nu }}|=\ell \\ \nu _i=0~\forall i\le s \end{array}}\frac{{{\varvec{y}}}^{{\varvec{\nu }}}}{{\varvec{\nu }}!}\partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widetilde{F}}}({{\varvec{y}}}_{\le s},{\textbf{0}})\\&\quad +\sum _{\begin{array}{c} |{\varvec{\nu }}|=k+1\\ \nu _i=0~\forall i\le s \end{array}}\frac{k+1}{{\varvec{\nu }}!}{{\varvec{y}}}^{{\varvec{\nu }}}\int _0^1(1-t)^k\partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widetilde{F}}}({{\varvec{y}}}_{\le s},t{{\varvec{y}}}_{>s})\,\textrm{d}t. \end{aligned}$$

Here \(k\ge 2\) is an integer whose value will be specified later. By defining \(\varvec{\theta }({{\varvec{y}}}):=(\sin (2\pi y_i))_{i\ge 1}\), we observe that

$$\begin{aligned} {{\widehat{u}}_{\textrm{per}}}(\cdot ,{{\varvec{y}}})={{\widehat{u}}}_{\textrm{aff}}(\cdot ,\varvec{\theta }({{\varvec{y}}})), \end{aligned}$$

where \({\widehat{u}}_{\textrm{per}}(\cdot ,{{\varvec{y}}})\) is the smooth periodic extension of the solution to (2.7) for all \({{\varvec{y}}}\in [-1,1]^{{\mathbb {N}}}\). Let us define \(F({{\varvec{y}}}):=\langle G,{{\widehat{u}}}_{\textrm{per}}(\cdot ,{{\varvec{y}}})\rangle _{H^{-1}(D_{\textrm{ref}}),H_0^1(D_{\textrm{ref}})}\). We obtain by a simple change of variable that

$$\begin{aligned} F({{\varvec{y}}})-F({{\varvec{y}}}_{\le s},{\textbf{0}})&=\sum _{\ell =1}^k \sum _{\begin{array}{c} |{\varvec{\nu }}|=\ell \\ \nu _i=0~\forall i\le s \end{array}}\frac{\varvec{\theta }({{\varvec{y}}})^{{\varvec{\nu }}}}{{\varvec{\nu }}!}\partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widetilde{F}}}(\varvec{\theta }({{\varvec{y}}}_{\le s}),{\textbf{0}})\\&\quad +\sum _{\begin{array}{c} |{\varvec{\nu }}|=k+1\\ \nu _i=0~\forall i\le s \end{array}}\frac{k+1}{{\varvec{\nu }}!}\varvec{\theta }({{\varvec{y}}})^{{\varvec{\nu }}}\int _0^1(1-t)^k\partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widetilde{F}}}(\varvec{\theta }({{\varvec{y}}}_{\le s},t{{\varvec{y}}}_{>s}))\,\textrm{d}t. \end{aligned}$$

Since the last equality holds pointwise for all \({{\varvec{y}}}\in [-1,1]^{{\mathbb {N}}}\), we can integrate both sides of the equation over \(U\subset [-1,1]^{{\mathbb {N}}}\). We note that the integral \(\int _{0}^1 \sin (2\pi y_i)^{\nu _i}\,\textrm{d}y_i\) takes values between 0 and 1, and equals 0 whenever \(\nu _i = 1\). Thus the summands corresponding to \({\varvec{\nu }}\in {\mathscr {F}}\) with \(\nu _i=1\) for at least one \(i>s\) vanish in the first term. Hence

$$\begin{aligned}&\int _U (F({{\varvec{y}}})-F({{\varvec{y}}}_{\le s},{\textbf{0}}))\,\textrm{d}{{\varvec{y}}}\\&\quad =\sum _{\ell =2}^k \sum _{\begin{array}{c} |{\varvec{\nu }}|=\ell \\ \nu _i=0~\forall i\le s\\ \nu _i\ne 1~\forall i>s \end{array}}\frac{1}{{\varvec{\nu }}!}\int _U \varvec{\theta }({{\varvec{y}}})^{{\varvec{\nu }}}\partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widetilde{F}}}(\varvec{\theta }({{\varvec{y}}}_{\le s}),{\textbf{0}})\,\textrm{d}{{\varvec{y}}}\\&\qquad +\sum _{\begin{array}{c} |{\varvec{\nu }}|=k+1\\ \nu _i=0~\forall i\le s \end{array}}\frac{k+1}{{\varvec{\nu }}!}\int _U\int _0^1(1-t)^k\varvec{\theta }({{\varvec{y}}})^{{\varvec{\nu }}}\partial _{{\varvec{y}}}^{{\varvec{\nu }}}{{\widetilde{F}}}(\varvec{\theta }({{\varvec{y}}}_{\le s},t{{\varvec{y}}}_{>s}))\,\textrm{d}t\,\textrm{d}{{\varvec{y}}}\\&\quad \le C'\,\Vert G\Vert _{H^{-1}(D_{\textrm{ref}})}\sum _{\ell =2}^k \sum _{\begin{array}{c} |{\varvec{\nu }}|=\ell \\ \nu _i=0~\forall i\le s\\ \nu _i\ne 1~\forall i>s \end{array}}{{\widetilde{{{\varvec{b}}}}}}^{{\varvec{\nu }}} + C''\,\Vert G\Vert _{H^{-1}(D_{\textrm{ref}})}\sum _{\begin{array}{c} |{\varvec{\nu }}|=k+1\\ \nu _i=0~\forall i\le s \end{array}}{{\widetilde{{{\varvec{b}}}}}}^{{\varvec{\nu }}}, \end{aligned}$$

where \(C'=C\,k!\) and \(C''=C\,(k+1)!\). Recalling the definition of F and taking the supremum over all \(G\in H^{-1}(D_\textrm{ref})\) with \(\Vert G\Vert _{H^{-1}(D_{\textrm{ref}})}\le 1\) yields for the left-hand side that

$$\begin{aligned}&\sup _{\begin{array}{c} G\in H^{-1}(D_{\textrm{ref}})\\ \Vert G\Vert _{H^{-1}(D_{\textrm{ref}})}\le 1 \end{array}}\int _U \langle G,{{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})\rangle _{H^{-1}(D_{\textrm{ref}}),H_0^1(D_{\textrm{ref}})}\,\textrm{d}{{\varvec{y}}}\\&\quad =\sup _{\begin{array}{c} G\in H^{-1}(D_{\textrm{ref}})\\ \Vert G\Vert _{H^{-1}(D_{\textrm{ref}})}\le 1 \end{array}}\bigg \langle G,\int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \rangle _{H^{-1}(D_{\textrm{ref}}),H_0^1(D_{\textrm{ref}})}\\&\quad =\bigg \Vert \int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{H_0^1(D_{\textrm{ref}})}, \end{aligned}$$

which yields the overall bound

$$\begin{aligned} \bigg \Vert \int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{H_0^1(D_{\textrm{ref}})} \lesssim \underbrace{\sum _{\ell =2}^k \sum _{\begin{array}{c} |{\varvec{\nu }}|=\ell \\ \nu _i=0~\forall i\le s\\ \nu _i\ne 1~\forall i>s \end{array}}{{\widetilde{{{\varvec{b}}}}}}^{{\varvec{\nu }}}}_{=:\, T_1} + \underbrace{\sum _{\begin{array}{c} |{\varvec{\nu }}|=k+1\\ \nu _i=0~\forall i\le s \\ \end{array} }{{\widetilde{{{\varvec{b}}}}}}^{{\varvec{\nu }}}}_{=:\, T_2}, \end{aligned}$$
(4.2)

where the generic constant is independent of s. The goal will be to balance \(T_1\) and \(T_2\) in (4.2) by choosing the value of k appropriately.

The term \(T_1\) in (4.2) can be bounded similarly to [7, Theorem 1]:

$$\begin{aligned} T_1&=\sum _{\begin{array}{c} 2\le |{\varvec{\nu }}|\le k\\ \nu _i=0~\forall i\le s\\ \nu _i\ne 1~\forall i>s \end{array}}{{\widetilde{{{\varvec{b}}}}}}^{{\varvec{\nu }}}\le \sum _{\begin{array}{c} 0\ne |{\varvec{\nu }}|_\infty \le k\\ \nu _i=0~\forall i\le s\\ \nu _i\ne 1~\forall i>s \end{array}}{{\widetilde{{{\varvec{b}}}}}}^{{\varvec{\nu }}} =-1+\prod _{i>s}\bigg (1+\sum _{\ell =2}^k{{\widetilde{b}}}_i^\ell \bigg )\\&=-1+\prod _{i>s}\bigg (1+\frac{1-{{\widetilde{b}}}_i^{k-1}}{1-{{\widetilde{b}}}_i}{{\widetilde{b}}}_i^2\bigg ) \le -1+\prod _{i>s}\bigg (1+ \tau _k\, {{\widetilde{b}}}_i^2\bigg ), \end{aligned}$$

where we abuse notation slightly by interpreting the value of the term \(\frac{1-{{\widetilde{b}}}_i^{k-1}}{1-{{\widetilde{b}}}_i}\) as \(k-1\) if \(\widetilde{b}_i=1\), and, since the sequence \({{\widetilde{{{\varvec{b}}}}}}\) is non-increasing, we define \(\tau _k:= \frac{1-{\widetilde{b}}_1^{k-1}}{1-{\widetilde{b}}_1}\) if \({{\widetilde{b}}}_1\ne 1\) and \(\tau _k:=k-1\) if \({{\widetilde{b}}}_1=1\). Moreover, it is an easy consequence of (A6) that

$$\begin{aligned} \sum _{i>s}{{\widetilde{b}}}_i^2\le \bigg (\sum _{i\ge 1}{{\widetilde{b}}}_i^p\bigg )^{2/p}s^{-2/p+1}, \end{aligned}$$

where we used Stechkin's lemma (cf., e.g., [25, Lemma 3.3]): for \(0<p\le q<\infty \) and any sequence \((a_i)_{i\ge 1}\) of real numbers ordered such that \(|a_1|\ge |a_2|\ge \cdots \), there holds

$$\begin{aligned} \bigg (\sum _{i>s} |a_i|^q\bigg )^{1/q}\le \bigg (\sum _{i\ge 1}|a_i|^p\bigg )^{1/p}s^{-1/p+1/q}. \end{aligned}$$
(4.3)
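As a quick numerical illustration of (4.3) (a sketch, not part of the proof), one can check the tail bound for a concrete non-increasing sequence, here \(a_i = i^{-2}\) with the hypothetical choice \(p = 1\), \(q = 2\):

```python
# Numerical sanity check of Stechkin's lemma (4.3):
# (sum_{i>s} a_i^q)^(1/q) <= (sum_{i>=1} a_i^p)^(1/p) * s^(-1/p+1/q)
# for the non-increasing sequence a_i = i^(-2) (an illustrative choice).

def stechkin_check(p, q, s, N=100000):
    a = [i ** (-2.0) for i in range(1, N + 1)]          # non-increasing sequence
    tail = sum(ai ** q for ai in a[s:]) ** (1.0 / q)    # left-hand side of (4.3)
    bound = sum(ai ** p for ai in a) ** (1.0 / p) * s ** (-1.0 / p + 1.0 / q)
    return tail, bound

tail, bound = stechkin_check(p=1, q=2, s=10)
assert tail <= bound
```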

Therefore

$$\begin{aligned} T_1&\le -1+\exp \bigg (\tau _k\sum _{i>s}{{\widetilde{b}}}_i^2\bigg ) = \sum _{j=1}^\infty \frac{\tau _k^j}{j!}\bigg (\sum _{i>s}{{\widetilde{b}}}_i^2\bigg )^j \\&\le \bigg (\exp \bigg (\tau _k\bigg (\sum _{i\ge 1}{{\widetilde{b}}}_i^p\bigg )^{2/p}\bigg )-1\bigg ) s^{-2/p+1}. \end{aligned}$$

The term \(T_2\) in (4.2) can be estimated similarly to the approach taken in [9, Theorem 4.1]. The trivial inequality \(\frac{|{\varvec{\nu }}|!}{{\varvec{\nu }}!}\ge 1\) and the multinomial theorem imply together that

$$\begin{aligned} T_2 \le \sum _{\begin{array}{c} |{\varvec{\nu }}|=k+1\\ \nu _i=0~\forall i\le s \end{array}}\frac{|{\varvec{\nu }}|!}{{\varvec{\nu }}!}{{\widetilde{{{\varvec{b}}}}}}^{{\varvec{\nu }}}=\bigg (\sum _{i>s}{{\widetilde{b}}}_i\bigg )^{k+1} \le \bigg (\sum _{i\ge 1}{{\widetilde{b}}}_i^p\bigg )^{(k+1)/p}s^{(k+1)(-1/p+1)}, \end{aligned}$$

where the final inequality is a consequence of (4.3). The exponent of s in the bound on \(T_2\) is dominated by that of \(T_1\) by choosing \(k=\lceil 1/(1-p)\rceil \).
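The stated choice of k can be verified mechanically: \(k=\lceil 1/(1-p)\rceil \) is chosen so that the exponent \((k+1)(-1/p+1)\) of s in the bound on \(T_2\) does not exceed the exponent \(-2/p+1\) from \(T_1\). A small sketch checking this inequality for a few illustrative values of p:

```python
import math

# For several p in (0,1), verify that k = ceil(1/(1-p)) makes the
# T2 exponent (k+1)(-1/p+1) at most the T1 exponent -2/p+1.
for p in [0.3, 0.5, 0.7, 0.9]:
    k = math.ceil(1.0 / (1.0 - p))
    exp_T2 = (k + 1) * (-1.0 / p + 1.0)
    exp_T1 = -2.0 / p + 1.0
    assert exp_T2 <= exp_T1, (p, k, exp_T1, exp_T2)
```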

Finally, the Poincaré inequality implies that

$$\begin{aligned} \bigg \Vert \int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{L^2(D_{\textrm{ref}})}={\mathcal {O}}(s^{-2/p+1}), \end{aligned}$$

and the \(L^1\) error can be obtained by using the Cauchy–Schwarz inequality

$$\begin{aligned}&\bigg \Vert \int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{L^1(D_{\textrm{ref}})}\\&\quad \le |D_{\textrm{ref}}|^{1/2}\bigg \Vert \int _U ({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{L^2(D_{\textrm{ref}})}. \end{aligned}$$

This completes the proof. \(\square \)

4.2 Finite element error

Let the assumptions (A1)–(A4) and (A7) hold. Let us denote by \(\{V_h\}_h\) a family of finite element subspaces of \(H_0^1(D_{\textrm{ref}})\) indexed by the mesh size h. We assume that the finite element spaces are spanned by continuous, piecewise linear finite element basis functions and the mesh corresponding to each \(V_h\) is obtained from an initial, regular triangulation of \(D_{\textrm{ref}}\) by recursive uniform bisection of simplices.

We define the dimensionally-truncated finite element solution for \({{\varvec{y}}}\in U\) as the solution \({{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\in V_h\) to

$$\begin{aligned} \int _{D_{\textrm{ref}}} (A_s({{\varvec{x}}},{{\varvec{y}}})\nabla {{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}}))\cdot \nabla {{\widehat{v}}}_h({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}}=\int _{D_{\textrm{ref}}}f_{\textrm{ref},s}({{\varvec{x}}},{{\varvec{y}}}){{\widehat{v}}}_h({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}}\end{aligned}$$

for all \({{\widehat{v}}}_h\in V_h\), where we set \(A_s({{\varvec{x}}},{{\varvec{y}}}):=A({{\varvec{x}}},(y_1,\ldots ,y_s,0,0,\ldots ))\) as well as \(f_{\textrm{ref},s}({{\varvec{x}}},{{\varvec{y}}}):=f_{\textrm{ref}}({{\varvec{x}}},(y_1,\ldots ,y_s,0,0,\ldots ))\) for all \({{\varvec{y}}}\in U\). Since the reference domain \(D_{\textrm{ref}}\) was assumed in (A7) to be a bounded, convex polyhedron, it follows that elements in \(W^{1,\infty }(D_{\textrm{ref}})\) have Lipschitz continuous representatives. Hence we may apply Lemma 3.5 and [10, Theorem 3.2.1.2] to obtain

$$\begin{aligned} \Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})\Vert _{H^2(D_{\textrm{ref}})\cap H_0^1(D_{\textrm{ref}})}\le C, \end{aligned}$$
(4.4)

where \(\Vert {{\widehat{v}}}\Vert _{H^2(D_{\textrm{ref}})\cap H_0^1(D_{\textrm{ref}})}:=(\Vert {{\widehat{v}}}\Vert _{L^2(D_{\textrm{ref}})}^2+\Vert \varDelta {{\widehat{v}}}\Vert _{L^2(D_{\textrm{ref}})}^2)^{1/2}\) and the constant \(C>0\) only depends on d and the diameter of \(D_{\textrm{ref}}\).

We have by Céa’s lemma that

$$\begin{aligned} \Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1(D_{\textrm{ref}})}\le C' \inf _{{{\widehat{v}}}_h\in V_h}\Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{v}}}_h\Vert _{H_0^1(D_{\textrm{ref}})}, \end{aligned}$$
(4.5)

for a constant \(C'>0\) independent of s, h, and \({{\varvec{y}}}\), which can be seen by letting \({{\widehat{v}}}_h\in V_h\) be arbitrary and deriving

$$\begin{aligned}&\frac{\sigma _{\min }^d}{\sigma _{\max }^2}\Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1(D_{\textrm{ref}})}^2\\&\quad \le \int _{D_{\textrm{ref}}}(A_s({{\varvec{x}}},{{\varvec{y}}})\nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}})))\cdot \nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}}))\,\textrm{d}{{\varvec{x}}}\\&\quad = \int _{D_{\textrm{ref}}}(A_s({{\varvec{x}}},{{\varvec{y}}})\nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}})))\cdot \nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{v}}}_h({{\varvec{x}}}))\,\textrm{d}{{\varvec{x}}}\\&\qquad +\int _{D_{\textrm{ref}}}(A_s({{\varvec{x}}},{{\varvec{y}}})\nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}})))\cdot \nabla ({{\widehat{v}}}_h({{\varvec{x}}})-{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}}))\,\textrm{d}{{\varvec{x}}}\\&\quad = \int _{D_{\textrm{ref}}}(A_s({{\varvec{x}}},{{\varvec{y}}})\nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}})))\cdot \nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{v}}}_h({{\varvec{x}}}))\,\textrm{d}{{\varvec{x}}}\\&\quad \le \frac{\sigma _{\max }^d}{\sigma _{\min }^2}\Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1(D_{\textrm{ref}})}\Vert {{\widehat{u}}}_{s}(\cdot ,{{\varvec{y}}})-{{\widehat{v}}}_h\Vert _{H_0^1(D_{\textrm{ref}})}, \end{aligned}$$

where the penultimate step follows from Galerkin orthogonality.

The well-known approximation property (cf., e.g., [8, 26])

$$\begin{aligned} \inf _{{{\widehat{v}}}_h\in V_h}\Vert {{\widehat{v}}}-{{\widehat{v}}}_h\Vert _{H_0^1(D_{\textrm{ref}})}\le C''h\Vert {{\widehat{v}}}\Vert _{H^2(D_{\textrm{ref}})\cap H_0^1(D_{\textrm{ref}})}\quad \text {as}~h\rightarrow 0 \end{aligned}$$
(4.6)

holds for all \({{\widehat{v}}}\in H^2(D_{\textrm{ref}})\cap H_0^1(D_{\textrm{ref}})\) with a constant \(C''>0\) independent of h. The inequalities (4.4), (4.5), and (4.6) yield altogether that

$$\begin{aligned} \Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1(D_{\textrm{ref}})}\le C'''h\quad \text {as}~h\rightarrow 0, \end{aligned}$$
(4.7)

where the constant \(C'''>0\) is independent of s, h, and \({{\varvec{y}}}\).
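The generic rates behind (4.7), and the doubled \(L^2\) rate obtained below by the Aubin–Nitsche argument, can be observed in a minimal one-dimensional model computation. The sketch below is purely illustrative and uses a deterministic model problem \(-u''=\pi ^2\sin (\pi x)\) on \((0,1)\) with exact solution \(u(x)=\sin (\pi x)\), not the random-domain problem of this paper: halving the mesh size should reduce the \(H^1\) error by a factor of about 2 and the \(L^2\) error by about 4.

```python
import numpy as np

# 5-point Gauss-Legendre rule mapped to the reference element [0, 1].
t_ref, w_ref = np.polynomial.legendre.leggauss(5)
t_ref, w_ref = 0.5 * (t_ref + 1.0), 0.5 * w_ref

def solve_p1(n):
    """P1 FEM for -u'' = pi^2 sin(pi x) on (0,1), u(0) = u(1) = 0,
    exact solution u(x) = sin(pi x). Returns (h, H1-seminorm error,
    L2 error) on a uniform mesh with n elements."""
    h = 1.0 / n
    x = np.linspace(0.0, 1.0, n + 1)
    f = lambda t: np.pi ** 2 * np.sin(np.pi * t)

    # Stiffness matrix (1/h) * tridiag(-1, 2, -1) for the interior nodes.
    K = (2.0 * np.eye(n - 1) - np.eye(n - 1, k=1) - np.eye(n - 1, k=-1)) / h

    # Load vector assembled element by element with Gauss quadrature.
    b = np.zeros(n - 1)
    for e in range(n):
        xq = x[e] + h * t_ref
        for q in range(len(t_ref)):
            w = h * w_ref[q]
            if e >= 1:
                b[e - 1] += w * f(xq[q]) * (1.0 - t_ref[q])  # left hat function
            if e <= n - 2:
                b[e] += w * f(xq[q]) * t_ref[q]              # right hat function

    U = np.zeros(n + 1)
    U[1:n] = np.linalg.solve(K, b)

    # Errors computed with the same quadrature, element by element.
    err_h1, err_l2 = 0.0, 0.0
    for e in range(n):
        du_h = (U[e + 1] - U[e]) / h                # piecewise constant slope
        xq = x[e] + h * t_ref
        for q in range(len(t_ref)):
            w = h * w_ref[q]
            u_h = U[e] * (1.0 - t_ref[q]) + U[e + 1] * t_ref[q]
            err_h1 += w * (np.pi * np.cos(np.pi * xq[q]) - du_h) ** 2
            err_l2 += w * (np.sin(np.pi * xq[q]) - u_h) ** 2
    return h, np.sqrt(err_h1), np.sqrt(err_l2)

_, e1_h1, e1_l2 = solve_p1(16)
_, e2_h1, e2_l2 = solve_p1(32)
assert 1.8 < e1_h1 / e2_h1 < 2.2   # rate O(h) in the H^1 seminorm
assert 3.5 < e1_l2 / e2_l2 < 4.5   # rate O(h^2) in L^2
```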

An \(L^1\) error estimate can be obtained from a standard Aubin–Nitsche duality argument.

Theorem 4.2

Under assumptions (A1)–(A4) and (A7), there holds for the dimensionally-truncated finite element solution that

$$\begin{aligned} \Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{L^1(D_{\textrm{ref}})}\lesssim h^2\quad \text {as}~h\rightarrow 0, \end{aligned}$$

where the generic constant is independent of s, h, and \({{\varvec{y}}}\).

Proof

Since

$$\begin{aligned} \Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{L^1(D_{\textrm{ref}})}\le |D_{\textrm{ref}}|^{1/2}\Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{L^2(D_{\textrm{ref}})}, \end{aligned}$$

it suffices to prove the \(L^2\) error estimate.

Let \({{\widehat{g}}}\in L^2(D_{\textrm{ref}})\). For \({{\varvec{y}}}\in U\), denote by \({{\widehat{u}}}_{{{\widehat{g}}},s}(\cdot ,{{\varvec{y}}})\in H_0^1(D_{\textrm{ref}})\) the solution to

$$\begin{aligned} \int _{D_{\textrm{ref}}} (A_s({{\varvec{x}}},{{\varvec{y}}})\nabla {{\widehat{u}}}_{{{\widehat{g}}},s}({{\varvec{x}}},{{\varvec{y}}}))\cdot \nabla {{\widehat{v}}}({{\varvec{x}}})\,\textrm{d}{{\varvec{x}}}=\langle {{\widehat{g}}},{{\widehat{v}}}\rangle _{L^2(D_{\textrm{ref}})}\quad \text {for all}~{{\widehat{v}}}\in H_0^1(D_{\textrm{ref}}). \end{aligned}$$

Testing against \({{\widehat{v}}}={{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\) and letting \({{\widehat{v}}}_h\in V_h\) be arbitrary, it follows from Galerkin orthogonality of the finite element solution that

$$\begin{aligned}&\langle {{\widehat{g}}},{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\rangle _{L^2(D_{\textrm{ref}})}\\&\quad =\int _{D_{\textrm{ref}}}(A_s({{\varvec{x}}},{{\varvec{y}}})\nabla {{\widehat{u}}}_{{{\widehat{g}}},s}({{\varvec{x}}},{{\varvec{y}}}))\cdot \nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}}))\,\textrm{d}{{\varvec{x}}}\\&\quad =\int _{D_{\textrm{ref}}}(A_s({{\varvec{x}}},{{\varvec{y}}})\nabla ({{\widehat{u}}}_{{{\widehat{g}}},s}({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{v}}}_h({{\varvec{x}}})))\cdot \nabla ({{\widehat{u}}}_s({{\varvec{x}}},{{\varvec{y}}})-{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}}))\,\textrm{d}{{\varvec{x}}}\\&\quad \le \frac{\sigma _{\max }^d}{\sigma _{\min }^2}\Vert {{\widehat{u}}}_{{{\widehat{g}}},s}(\cdot ,{{\varvec{y}}})-{{\widehat{v}}}_h\Vert _{H_0^1(D_{\textrm{ref}})}\Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1(D_{\textrm{ref}})} \end{aligned}$$

and, in consequence,

$$\begin{aligned}&\langle {{\widehat{g}}},{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\rangle _{L^2(D_{\textrm{ref}})}\\&\quad \le \frac{\sigma _{\max }^d}{\sigma _{\min }^2}\Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1(D_{\textrm{ref}})}\inf _{{{\widehat{v}}}_h\in V_h}\Vert {{\widehat{u}}}_{{{\widehat{g}}},s}(\cdot ,{{\varvec{y}}})-{{\widehat{v}}}_h\Vert _{H_0^1(D_{\textrm{ref}})}. \end{aligned}$$

Then there holds for all \({{\varvec{y}}}\in U\) that

$$\begin{aligned}&\Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{L^2(D_{\textrm{ref}})}\nonumber \\&\quad =\sup _{\begin{array}{c} {{\widehat{g}}}\in L^2(D_{\textrm{ref}})\\ \Vert {{\widehat{g}}}\Vert _{L^2(D_{\textrm{ref}})}\le 1 \end{array}}\langle {{\widehat{g}}},{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\rangle _{L^2(D_{\textrm{ref}})}\nonumber \\&\quad \le \frac{\sigma _{\max }^d}{\sigma _{\min }^2}\Vert {{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})\!-\!{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1(D_{\textrm{ref}})}\!\sup _{\begin{array}{c} {{\widehat{g}}}\in L^2(D_{\textrm{ref}})\\ \Vert {{\widehat{g}}}\Vert _{L^2(D_{\textrm{ref}})}\le 1 \end{array}}\!\inf _{{{\widehat{v}}}_h\in V_h}\Vert {{\widehat{u}}}_{{{\widehat{g}}},s}(\cdot ,{{\varvec{y}}})\!-\!{{\widehat{v}}}_h\Vert _{H_0^1(D_{\textrm{ref}})}. \end{aligned}$$
(4.8)

By (4.6) and (4.4), we obtain

$$\begin{aligned} \inf _{{{\widehat{v}}}_h\in V_h}\Vert {{\widehat{u}}}_{{{\widehat{g}}},s}(\cdot ,{{\varvec{y}}})-{{\widehat{v}}}_h\Vert _{H_0^1(D_{\textrm{ref}})}&\le C''h\Vert {{\widehat{u}}}_{{{\widehat{g}}},s}(\cdot ,{{\varvec{y}}})\Vert _{H^2(D_{\textrm{ref}})\cap H_0^1(D_{\textrm{ref}})}\\&\le CC''h\Vert {{\widehat{g}}}\Vert _{L^2(D_{\textrm{ref}})}\quad \text {as}~h\rightarrow 0. \end{aligned}$$

Plugging this inequality and (4.7) into (4.8) yields the desired result. \(\square \)

4.3 Quasi-Monte Carlo cubature error and a choice of weights

Let us consider QMC cubature over the s-dimensional unit cube \(U_s:=[0,1]^s\) for finite \(s\in {\mathbb {N}}\). An n-point QMC rule is an equal weight cubature rule of the form

$$\begin{aligned} \int _{U_s}F({{\varvec{y}}})\,\textrm{d}{{\varvec{y}}}\approx \frac{1}{n}\sum _{i=1}^nF({{\varvec{y}}}^{(i)}), \end{aligned}$$

where the cubature nodes \({{\varvec{y}}}^{(1)},\ldots ,{{\varvec{y}}}^{(n)}\in [0,1]^s\) are deterministically chosen. In the subsequent analysis, we consider the family of rank-1 lattice rules, where the QMC nodes in \([0,1]^s\) are defined by

$$\begin{aligned} {\varvec{y}}^{(i)}:= \frac{i{\varvec{z}}\bmod n}{n}\quad \text {for}~i\in \{1,\ldots ,n\}, \end{aligned}$$

where \({\varvec{z}}\in \{1,\ldots ,n-1\}^s\) is called the generating vector.
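Generating the node set of a rank-1 lattice rule is inexpensive. The following sketch uses a small hand-picked generating vector (purely illustrative, not a CBC-constructed one) together with a smooth 1-periodic integrand whose exact integral over \([0,1]^3\) equals 1:

```python
import math

def lattice_nodes(z, n):
    """Rank-1 lattice nodes y^(i) = (i*z mod n)/n for i = 1, ..., n."""
    return [[(i * zj % n) / n for zj in z] for i in range(1, n + 1)]

def qmc(F, z, n):
    """Equal-weight QMC rule (1/n) * sum_i F(y^(i))."""
    return math.fsum(F(y) for y in lattice_nodes(z, n)) / n

# Smooth 1-periodic test integrand with exact integral 1 over [0,1]^3.
F = lambda y: math.prod(1.0 + math.sin(2.0 * math.pi * t) for t in y)

# Hand-picked generating vector for illustration (not CBC-optimized); for
# this integrand no nonzero Fourier mode lies in the dual lattice, so the
# rule is exact up to rounding.
approx = qmc(F, z=[1, 101, 10007], n=65537)
assert abs(approx - 1.0) < 1e-9
```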

First we briefly recall the theory of lattice rules for 1-periodic integrands with absolutely convergent Fourier series: for

$$\begin{aligned} F({{\varvec{y}}}) = \sum _{{\varvec{\ell }}\in {\mathbb {Z}}^s} {\mathcal {F}}\{F\}_{\varvec{\ell }}\,{\textrm{e}}^{2\pi \textrm{i}{\varvec{\ell }}\cdot {{\varvec{y}}}}, \quad \text{ with }\quad {\mathcal {F}}\{F\}_{\varvec{\ell }}:= \int _{U_s}F({{\varvec{y}}})\,{\textrm{e}}^{-2\pi \textrm{i}{\varvec{\ell }}\cdot {{\varvec{y}}}}\,\textrm{d}{{\varvec{y}}}, \end{aligned}$$

the lattice rule error is given by [30]

$$\begin{aligned} \frac{1}{n}\sum _{i=1}^n F({{\varvec{y}}}^{(i)})-\int _{U_s}F({{\varvec{y}}})\,\textrm{d}{{\varvec{y}}}= \sum _{{\varvec{\ell }}\in \varLambda ^\perp \setminus \{{{\varvec{0}}}\}} {\mathcal {F}}\{F\}_{\varvec{\ell }}, \end{aligned}$$
(4.9)

where the dual lattice \(\varLambda ^\perp :=\{{\varvec{\ell }}\in {\mathbb {Z}}^s: {\varvec{\ell }}\cdot {{\varvec{z}}}\equiv 0~(\textrm{mod}~n)\}\) depends on the lattice generating vector \({{\varvec{z}}}\). For integrands belonging to the weighted Korobov space endowed with the norm

$$\begin{aligned} \Vert F\Vert _{\alpha ,{\varvec{\gamma }}} := \sup _{{\varvec{\ell }}\in {\mathbb {Z}}^s}|{\mathcal {F}}\{F\}_{\varvec{\ell }}|\,r_{\alpha ,{\varvec{\gamma }}}({\varvec{\ell }}), \quad \text{ with }\quad r_{\alpha ,{\varvec{\gamma }}}({\varvec{\ell }}) := \gamma _{{\textrm{supp}}({\varvec{\ell }})}^{-1}\prod _{j\in {\textrm{supp}}({\varvec{\ell }})}|\ell _j|^\alpha , \end{aligned}$$
(4.10)

where \(\alpha >1\) is the smoothness parameter, \({\varvec{\gamma }}:=(\gamma _{\mathrm {{\mathfrak {u}}}})_{\mathrm {{\mathfrak {u}}}\subseteq \{1:s\}}\) are positive weights, and \(\textrm{supp}({\varvec{\ell }}):= \{1\le j\le s: \ell _j\ne 0\}\), we then have the integration error bound

$$\begin{aligned} \bigg |\int _{U_s}F({{\varvec{y}}})\,\textrm{d}{{\varvec{y}}}- \frac{1}{n}\sum _{i=1}^n F({{\varvec{y}}}^{(i)})\bigg | \le \Vert F\Vert _{\alpha ,{\varvec{\gamma }}}\, \sum _{{\varvec{\ell }}\in \varLambda ^{\perp }\setminus \{{\textbf{0}}\}}\frac{1}{r_{\alpha ,{\varvec{\gamma }}}({\varvec{\ell }})}. \end{aligned}$$
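The error formula (4.9) underlying this bound rests on the character sum property of lattice points: applied to a single Fourier mode \({\textrm{e}}^{2\pi \textrm{i}{\varvec{\ell }}\cdot {{\varvec{y}}}}\), the rule returns 1 if \({\varvec{\ell }}\) lies in the dual lattice and 0 otherwise. A small numerical check with illustrative parameters:

```python
import cmath, math

def lattice_rule_on_mode(ell, z, n):
    """Apply the n-point rank-1 lattice rule with generating vector z to
    the Fourier mode exp(2*pi*i * ell.y). By the character sum property,
    the result is 1 if ell.z = 0 (mod n), i.e. ell lies in the dual
    lattice, and 0 otherwise -- the content of (4.9)."""
    total = 0.0 + 0.0j
    for i in range(1, n + 1):
        y = [(i * zj % n) / n for zj in z]
        total += cmath.exp(2j * math.pi * sum(l * t for l, t in zip(ell, y)))
    return total / n

n, z = 31, [1, 12]  # small illustrative parameters
in_dual = lattice_rule_on_mode([19, 1], z, n)     # 19*1 + 1*12 = 31 = 0 (mod 31)
not_in_dual = lattice_rule_on_mode([1, 1], z, n)  # 13 != 0 (mod 31)
assert abs(in_dual - 1.0) < 1e-9
assert abs(not_in_dual) < 1e-9
```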

It is known [6, Theorem 5] that the component-by-component (CBC) algorithm can be used to find a generating vector satisfying the bound

$$\begin{aligned} \sum _{{\varvec{\ell }}\in \varLambda ^{\perp }\setminus \{{\textbf{0}}\}}\frac{1}{r_{\alpha ,{\varvec{\gamma }}}({\varvec{\ell }})} \le \bigg (\frac{1}{n-1}\underset{=:\,C_{\alpha ,{\varvec{\gamma }}}(\lambda ,s)}{\underbrace{\sum _{\varnothing \ne \mathrm {{\mathfrak {u}}}\subseteq \{1:s\}}\gamma _{\mathrm {{\mathfrak {u}}}}^{\lambda }(2\zeta (\alpha \lambda ))^{|\mathrm {{\mathfrak {u}}}|}}}\bigg )^{1/\lambda } \quad \text{ for } \text{ all }\quad \lambda \in (\tfrac{1}{\alpha },1], \end{aligned}$$
(4.11)

where n was assumed to be prime in [6], but the result generalizes to non-prime n by replacing \(n-1\) with the Euler totient function \(\varphi (n)\).
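For orientation, a minimal CBC sketch follows. It constructs \({{\varvec{z}}}\) one component at a time by minimizing the quantity on the left-hand side of (4.11), here specialized to \(\alpha =2\) and simple product weights \(\gamma _{\mathrm {{\mathfrak {u}}}}=\prod _{j\in \mathrm {{\mathfrak {u}}}}\gamma _j\) (a simplification: the weights used below are of SPOD form), using the Bernoulli-polynomial representation of the Korobov kernel:

```python
import math

def omega(x):
    """sum_{h != 0} exp(2*pi*i*h*x)/|h|^2 = 2*pi^2*(x^2 - x + 1/6), x in [0,1)."""
    return 2.0 * math.pi ** 2 * (x * x - x + 1.0 / 6.0)

def cbc(s, n, gamma):
    """Component-by-component construction of a generating vector minimizing
    E(z) = sum over nonzero dual-lattice points of 1/r_{2,gamma}
         = -1 + (1/n) * sum_i prod_j (1 + gamma_j * omega({i*z_j/n})),
    for PRODUCT weights gamma_j. O(s*n^2) for clarity; FFT-based CBC
    implementations achieve O(s*n*log n)."""
    prod = [1.0] * n  # running products over the already chosen components
    z = []
    for j in range(s):
        best_zj, best_err = None, float("inf")
        for cand in range(1, n):
            err = -1.0 + sum(
                prod[i] * (1.0 + gamma[j] * omega((i * cand % n) / n))
                for i in range(n)) / n
            if err < best_err:
                best_zj, best_err = cand, err
        z.append(best_zj)
        prod = [prod[i] * (1.0 + gamma[j] * omega((i * best_zj % n) / n))
                for i in range(n)]
    return z, best_err

# Illustrative parameters: 4 dimensions, 101 points, decaying product weights.
z, err = cbc(s=4, n=101, gamma=[0.9 ** j for j in range(1, 5)])
assert len(z) == 4 and all(1 <= zj < 101 for zj in z)
assert err >= 0.0  # E(z) is a sum of nonnegative dual-lattice contributions
```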

In the following we will adapt this theory to the particular integrand \(F({{\varvec{y}}}) = {{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}})\) for \({{\varvec{x}}}\in {D_{\textrm{ref}}}\). We derive smoothness-driven product-and-order dependent (SPOD) weights and show that the error can be bounded independently of s; weights of this form have previously appeared in [5, 23].

Theorem 4.3

Suppose that assumptions (A1)–(A5) and (A7) hold. Let \(\alpha := \lfloor 1/p\rfloor +1\) with p from Assumption (A5), and choose the weights \(\gamma _{\varnothing }:=1\) and

$$\begin{aligned} \gamma _{\mathrm {{\mathfrak {u}}}}:=\sum _{{{\varvec{m}}}_{\mathrm {{\mathfrak {u}}}}\in \{1:\alpha \}^{|\mathrm {{\mathfrak {u}}}|}} \frac{(|{{\varvec{m}}}_{\mathrm {{\mathfrak {u}}}}|+d-1)!}{(d-1)!}\prod _{j\in \mathrm {{\mathfrak {u}}}}({{\widetilde{C}}}^\alpha \, m_j!\,\beta _j^{m_j}\,S(\alpha ,m_j)),~\varnothing \ne \mathrm {{\mathfrak {u}}}\subseteq \{1:s\}, \end{aligned}$$
(4.12)

where \(\widetilde{C}:=\frac{2\,d!\,(2+\xi _{{{\varvec{b}}}})^d(1+\xi _{{{\varvec{b}}}})^3}{\sigma _{\min }^{d+4}}\) and \(\beta _j:=(2+\sqrt{2})\max (1+\sqrt{3},\rho )\,b_j\). Then a lattice rule can be constructed by the CBC algorithm such that

$$\begin{aligned} \textrm{error} \,:=\, \int _{{D_{\textrm{ref}}}}\bigg |\int _{U_s} {{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}})\,\textrm{d}{{\varvec{y}}}-\frac{1}{n}\sum _{i=1}^n {{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}}^{(i)})\bigg |\,\textrm{d}{{\varvec{x}}}\,=\, {\mathcal {O}}(n^{-1/p}), \end{aligned}$$

where the implied constant is independent of s.

Proof

We will begin by treating \(\alpha \) and the weights \(\gamma _\mathrm {{\mathfrak {u}}}\) as arbitrary and will specify their values later in the proof. We will assume that \(\alpha \ge 2\) is an integer, and we will express the integration error in terms of the mixed derivatives of \(F({{\varvec{y}}}) = \widehat{u}_{s,h}({{\varvec{x}}},{{\varvec{y}}})\) of order up to \(\alpha \) in each coordinate. For \(\mathrm {{\mathfrak {u}}}\subseteq \{1:s\}\) we denote by \({\varvec{\nu }}= ({\varvec{\alpha }}_\mathrm {{\mathfrak {u}}};{{\varvec{0}}})\) the multi-index whose component \(\nu _j\) is equal to \(\alpha \) if \(j\in \mathrm {{\mathfrak {u}}}\) and is 0 otherwise. First we observe by differentiating the Fourier series of F that

$$\begin{aligned} \partial ^{({\varvec{\alpha }}_\mathrm {{\mathfrak {u}}};{{\varvec{0}}})}_{{\varvec{y}}}F({{\varvec{y}}}) = \sum _{{\varvec{\ell }}\in {\mathbb {Z}}^s} \Big (\prod _{j\in \mathrm {{\mathfrak {u}}}} (2\pi \textrm{i}\,\ell _j)^\alpha \Big )\, {\mathcal {F}}\{F\}_{\varvec{\ell }}\,{\textrm{e}}^{2\pi \textrm{i}{\varvec{\ell }}\cdot {{\varvec{y}}}}, \end{aligned}$$

from which we conclude that the Fourier coefficients of \(\partial ^{({\varvec{\alpha }}_\mathrm {{\mathfrak {u}}};{{\varvec{0}}})}_{{\varvec{y}}}F\) are given by

$$\begin{aligned} \int _{U_s} \partial ^{({\varvec{\alpha }}_\mathrm {{\mathfrak {u}}};{{\varvec{0}}})}_{{\varvec{y}}}F({{\varvec{y}}})\,{\textrm{e}}^{-2\pi \textrm{i}{\varvec{\ell }}\cdot {{\varvec{y}}}}\,\textrm{d}{{\varvec{y}}}= \Big (\prod _{j\in \mathrm {{\mathfrak {u}}}} (2\pi \textrm{i}\,\ell _j)^\alpha \Big )\, {\mathcal {F}}\{F\}_{\varvec{\ell }}. \end{aligned}$$
(4.13)

We are ready to derive the integration error for \({{\widehat{u}}}_{s,h}\). Using (4.9) we have

$$\begin{aligned} \textrm{error}&=\int _{{D_{\textrm{ref}}}}\bigg |\sum _{{\varvec{\ell }}\in \varLambda ^{\perp }\setminus \{{\textbf{0}}\}} {\mathcal {F}}\{{{\widehat{u}}}_{s,h}({{\varvec{x}}},\cdot )\}_{\varvec{\ell }}\bigg |\,\textrm{d}{{\varvec{x}}}\nonumber \\&\le \bigg (\sup _{{\varvec{\ell }}\in {\mathbb {Z}}^s}\int _{{D_{\textrm{ref}}}} r_{\alpha ,{\varvec{\gamma }}}({\varvec{\ell }})\, |{\mathcal {F}}\{{{\widehat{u}}}_{s,h}({{\varvec{x}}},\cdot )\}_{\varvec{\ell }}|\,\textrm{d}{{\varvec{x}}}\bigg ) \sum _{{\varvec{\ell }}\in \varLambda ^{\perp }\setminus \{{\textbf{0}}\}}\frac{1}{r_{\alpha ,{\varvec{\gamma }}}({\varvec{\ell }})}. \end{aligned}$$
(4.14)

Denoting \(\mathrm {{\mathfrak {u}}}= \textrm{supp}({\varvec{\ell }})\), we obtain using the definition of \(r_{\alpha ,{\varvec{\gamma }}}({\varvec{\ell }})\) in (4.10) with (4.13)

$$\begin{aligned}&\int _{{D_{\textrm{ref}}}} r_{\alpha ,{\varvec{\gamma }}}({\varvec{\ell }})\, |{\mathcal {F}}\{{{\widehat{u}}}_{s,h}({{\varvec{x}}},\cdot )\}_{\varvec{\ell }}|\,\textrm{d}{{\varvec{x}}}= \frac{1}{\gamma _{\mathrm {{\mathfrak {u}}}}} \int _{{D_{\textrm{ref}}}} \Big (\prod _{j\in \mathrm {{\mathfrak {u}}}}|\ell _j|^{\alpha }\Big ) |{\mathcal {F}}\{ {{\widehat{u}}}_{s,h}({{\varvec{x}}},\cdot )\}_{\varvec{\ell }}|\,\textrm{d}{{\varvec{x}}}\nonumber \\&\quad = \frac{1}{\gamma _{\mathrm {{\mathfrak {u}}}}(2\pi )^{\alpha |\mathrm {{\mathfrak {u}}}|}} \int _{{D_{\textrm{ref}}}} \Big (\prod _{j\in \mathrm {{\mathfrak {u}}}}|2\pi \textrm{i}\,\ell _j|^{\alpha }\Big ) |{\mathcal {F}}\{ {{\widehat{u}}}_{s,h}({{\varvec{x}}},\cdot )\}_{\varvec{\ell }}|\,\textrm{d}{{\varvec{x}}}\nonumber \\&\quad = \frac{1}{\gamma _{\mathrm {{\mathfrak {u}}}}(2\pi )^{\alpha |\mathrm {{\mathfrak {u}}}|}} \int _{{D_{\textrm{ref}}}}\bigg |\int _{U_s} \partial ^{({\varvec{\alpha }}_\mathrm {{\mathfrak {u}}};{{\varvec{0}}})}_{{\varvec{y}}}{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}})\,{\textrm{e}}^{-2\pi \textrm{i}{\varvec{\ell }}\cdot {{\varvec{y}}}}\,\textrm{d}{{\varvec{y}}}\bigg |\,\textrm{d}{{\varvec{x}}}\nonumber \\&\quad \le \frac{1}{\gamma _{\mathrm {{\mathfrak {u}}}}(2\pi )^{\alpha |\mathrm {{\mathfrak {u}}}|}} \int _{U_s}\int _{{D_{\textrm{ref}}}} |\partial ^{({\varvec{\alpha }}_\mathrm {{\mathfrak {u}}};{{\varvec{0}}})}_{{\varvec{y}}}{{\widehat{u}}}_{s,h}({{\varvec{x}}},{{\varvec{y}}})|\,\textrm{d}{{\varvec{x}}}\,\textrm{d}{{\varvec{y}}}\nonumber \\&\quad \le \frac{|{{D_{\textrm{ref}}}}|^{1/2}}{\gamma _{\mathrm {{\mathfrak {u}}}}(2\pi )^{\alpha |\mathrm {{\mathfrak {u}}}|}} \int _{U_s}\Vert \partial ^{({\varvec{\alpha }}_\mathrm {{\mathfrak {u}}};{{\varvec{0}}})}_{{\varvec{y}}}{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{L^2({D_{\textrm{ref}}})}\,\textrm{d}{{\varvec{y}}}\nonumber \\&\quad \le \frac{C_P|{{D_{\textrm{ref}}}}|^{1/2}}{\gamma _{\mathrm {{\mathfrak {u}}}}(2\pi )^{\alpha |\mathrm {{\mathfrak {u}}}|}} \int _{U_s}\Vert \partial ^{({\varvec{\alpha }}_\mathrm {{\mathfrak {u}}};{{\varvec{0}}})}_{{\varvec{y}}}{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\Vert _{H_0^1({D_{\textrm{ref}}})}\,\textrm{d}{{\varvec{y}}}, \end{aligned}$$
(4.15)

where for the inequalities we used the unimodularity of the exponential function, changed the order of integration, and applied both Cauchy–Schwarz and Poincaré inequalities.

Combining (4.11), (4.14) and (4.15) together with Theorem 3.7, it follows that the CBC algorithm can be used to find a generating vector satisfying the bound

$$\begin{aligned} \textrm{error}&\le 2\,C_{\textrm{init}}\,C_P|{D_{\textrm{ref}}}|^{1/2}\, C_{\alpha ,{\varvec{\gamma }}}(\lambda ,s)^{1/\lambda }\,\bigg (\frac{1}{n-1}\bigg )^{1/\lambda }\\&\quad \times \max _{\mathrm {{\mathfrak {u}}}\subseteq \{1:s\}}\frac{1}{\gamma _{\mathrm {{\mathfrak {u}}}}}\sum _{{{\varvec{m}}}_{\mathrm {{\mathfrak {u}}}}\in \{1:\alpha \}^{|\mathrm {{\mathfrak {u}}}|}}\frac{(|{{\varvec{m}}}_{\mathrm {{\mathfrak {u}}}}|+d-1)!}{(d-1)!}\prod _{j\in \mathrm {{\mathfrak {u}}}}\big ({{\widetilde{C}}}^{\alpha }m_j!\,\beta _j^{m_j}S(\alpha ,m_j)\big ), \end{aligned}$$

where \(\widetilde{C}:=\frac{2\,d!\,(2+\xi _{{{\varvec{b}}}})^d(1+\xi _{{{\varvec{b}}}})^3}{\sigma _{\min }^{d+4}}\), \(\beta _j=(2+\sqrt{2})\max (1+\sqrt{3},\rho )b_j\), and \(C_{\textrm{init}}\) is defined as in (3.10). We choose the weights by (4.12) to ensure that the factor on the second line is bounded by 1.

Finally, we show that with this choice of weights there is a constant \(C_{\alpha ,{\varvec{\gamma }}}(\lambda )\) independent of s such that \(C_{\alpha ,{\varvec{\gamma }}}(\lambda ,s)\le C_{\alpha ,{\varvec{\gamma }}}(\lambda )<\infty \). To this end, we plug the weights (4.12) into the expression for \(C_{\alpha ,{\varvec{\gamma }}}(\lambda ,s)\) in (4.11), define \(S_{\max }(\alpha ):=\max _{k\in \{1:\alpha \}}S(\alpha ,k)\), and use Jensen’s inequality (see, e.g., [18, Theorem 19])

$$\begin{aligned}\sum _k a_k\le \bigg (\sum _k a_k^\lambda \bigg )^{1/\lambda }\quad \text {for}~0<\lambda \le 1~\text {and}~a_k\ge 0,\end{aligned}$$

to find that

$$\begin{aligned}&C_{\alpha ,{\varvec{\gamma }}}(\lambda ,s)\nonumber \\&\quad =\!\sum _{\varnothing \ne \mathrm {{\mathfrak {u}}}\subseteq \{1:s\}} \!\!\! \bigg (\sum _{{{\varvec{m}}}_{\mathrm {{\mathfrak {u}}}}\in \{1:\alpha \}^{|\mathrm {{\mathfrak {u}}}|}} \!\!\!\!\frac{(|{{\varvec{m}}}_{\mathrm {{\mathfrak {u}}}}|\!+\!d\!-\!1)!}{(d\!-\!1)!} \prod _{j\in \mathrm {{\mathfrak {u}}}} \Big ({{\widetilde{C}}}^\alpha m_j!\beta _j^{m_j}S(\alpha ,m_j)\Big )\bigg )^{\lambda } (2\zeta (\alpha \lambda ))^{|\mathrm {{\mathfrak {u}}}|} \nonumber \\&\quad \le \! \sum _{\varnothing \ne \mathrm {{\mathfrak {u}}}\subseteq \{1:s\}}\sum _{{{\varvec{m}}}_{\mathrm {{\mathfrak {u}}}}\in \{1:\alpha \}^{|\mathrm {{\mathfrak {u}}}|}}\!\! \bigg (\frac{(|{{\varvec{m}}}_{\mathrm {{\mathfrak {u}}}}|\!+\!d\!-\!1)!}{(d\!-\!1)!} \prod _{j\in \mathrm {{\mathfrak {u}}}}\! \Big ({{\widetilde{C}}}^{\alpha }m_j!\beta _j^{m_j}S(\alpha ,m_j)(2\zeta (\alpha \lambda ))^{\frac{1}{\lambda }}\Big )\! \bigg )^{\lambda } \nonumber \\&\quad \le \sum _{{\textbf{0}}\ne {{\varvec{m}}}\in \{1:\alpha \}^s}\bigg (\frac{(|{{\varvec{m}}}|\!+\!d\!-\!1)!}{(d\!-\!1)!}\prod _{j=1}^s\mu _j^{m_j}\bigg )^{\lambda }, \end{aligned}$$
(4.16)

where we define \(\mu _j:=\max (1,{{\widetilde{C}}}^\alpha \, \alpha !\,S_{\max }(\alpha )\,(2\zeta (\alpha \lambda ))^{1/\lambda })\,\beta _j\). By defining the auxiliary sequence \(d_j:=\mu _{\lceil j/\alpha \rceil }\), \(j\ge 1\), that is

$$\begin{aligned} d_{j\alpha +1}=d_{j\alpha +2}=\cdots =d_{(j+1)\alpha }=\mu _{j+1},\quad j\in {\mathbb {N}}_0, \end{aligned}$$

the last sum in (4.16) is bounded from above by

$$\begin{aligned} \sum _{\begin{array}{c} \mathrm {{\mathfrak {v}}}\subseteq {\mathbb {N}}\\ |\mathrm {{\mathfrak {v}}}|<\infty \end{array}} \bigg (\frac{(|\mathrm {{\mathfrak {v}}}|+d-1)!}{(d-1)!}\prod _{j\in \mathrm {{\mathfrak {v}}}}d_j\bigg )^{\lambda }&=\sum _{\ell =0}^\infty \bigg (\frac{(\ell +d-1)!}{(d-1)!}\bigg )^\lambda \sum _{\begin{array}{c} \mathrm {{\mathfrak {v}}}\subseteq {\mathbb {N}}\\ |\mathrm {{\mathfrak {v}}}|=\ell \end{array}}\prod _{j\in \mathrm {{\mathfrak {v}}}}d_j^\lambda \\&\le \sum _{\ell =0}^\infty \frac{1}{\ell !}\bigg (\frac{(\ell +d-1)!}{(d-1)!}\bigg )^\lambda \bigg (\sum _{j=1}^\infty d_j^\lambda \bigg )^\ell , \end{aligned}$$

where the final inequality holds because \(\big (\sum _{j=1}^\infty d_j^\lambda \big )^\ell \) includes all products of the form \(d_{j_1}^\lambda d_{j_2}^\lambda \cdots d_{j_\ell }^\lambda \), not only those (as in the line above) with all \(j_1, \ldots , j_\ell \) distinct; since each term with all \(j_1, \ldots , j_\ell \) distinct is repeated \(\ell !\) times, we may divide by \(\ell !\) in the last step.
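This overcounting argument can be sanity-checked numerically on a truncated sequence. The following sketch, which uses an arbitrary decaying stand-in for \((d_j)\) (with \(\lambda \) absorbed into the terms), verifies that the sum over distinct index sets of cardinality \(\ell \) never exceeds \(\big (\sum _j d_j\big )^\ell /\ell !\):

```python
from itertools import combinations
from math import factorial, prod

# Hypothetical decaying sequence standing in for (d_j); any positive
# summable sequence works for this check.
d = [1.0 / (j ** 2) for j in range(1, 21)]

for ell in range(1, 5):
    # Left-hand side: sum over index sets v of size ell of prod_{j in v} d_j.
    lhs = sum(prod(d[j] for j in v) for v in combinations(range(len(d)), ell))
    # Right-hand side: (sum_j d_j)^ell / ell!  -- the multinomial expansion
    # contains every ordered selection, so each distinct index set appears
    # ell! times (plus extra nonnegative terms with repeated indices).
    rhs = sum(d) ** ell / factorial(ell)
    assert lhs <= rhs
```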

By choosing \(\lambda =p\) and \(\alpha =\lfloor 1/p\rfloor +1\), it follows from Assumption (A5) and the d'Alembert ratio test that the upper bound in the above expression is finite. This implies that the QMC error is of order \({\mathcal {O}}(n^{-1/p})\), with the implied constant independent of the dimension s when the weights are chosen according to (4.12). \(\square \)

4.4 Overall error estimate

Let \({{\widehat{u}}}\) be the solution to (2.7), \({{\widehat{u}}}_s\) the corresponding dimensionally-truncated solution, and \(\widehat{u}_{s,h}\) the dimensionally-truncated FE solution. It is easy to see that

$$\begin{aligned} \begin{aligned}&\bigg \Vert \int _U {{\widehat{u}}}(\cdot ,{{\varvec{y}}})\,\textrm{d}{{\varvec{y}}}-\frac{1}{n}\sum _{k=1}^n {{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}}^{(k)})\bigg \Vert _{L^1(D_{\textrm{ref}})}\\&\quad \le \underset{\text {dimension truncation error}}{\underbrace{\bigg \Vert \int _U({{\widehat{u}}}(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_s(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{L^1(D_{\textrm{ref}})}}}+\underset{\text {FE error}}{\underbrace{\bigg \Vert \int _{U_s}({{\widehat{u}}}_s(\cdot ,{{\varvec{y}}})-{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}}))\,\textrm{d}{{\varvec{y}}}\bigg \Vert _{L^1(D_{\textrm{ref}})}}}\\&\qquad +\underset{\text {QMC error}}{\underbrace{\bigg \Vert \int _{U_s}{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}})\,\textrm{d}{{\varvec{y}}}-\frac{1}{n}\sum _{k=1}^n{{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}}^{(k)})\bigg \Vert _{L^1(D_{\textrm{ref}})}}}.\end{aligned} \end{aligned}$$
(4.17)

We combine the error bounds derived in Sects. 4.1–4.3. The error decomposition (4.17) allows us to deduce the following overall error estimate.

Theorem 4.4

Let \(({{\varvec{y}}}^{(k)})_{k=1}^n\) be the lattice cubature nodes in \(U_s\) generated by a CBC algorithm using the construction detailed in Sect. 4.3. Suppose that for each lattice point, we solve (2.7) using one common finite element discretization in the domain \(D_{\textrm{ref}}\). Under the assumptions (A1)–(A7), we have the combined error estimate

$$\begin{aligned} \bigg \Vert \int _U {{\widehat{u}}}(\cdot ,{{\varvec{y}}})\,\textrm{d}{{\varvec{y}}}-\frac{1}{n}\sum _{k=1}^n {{\widehat{u}}}_{s,h}(\cdot ,{{\varvec{y}}}^{(k)})\bigg \Vert _{L^1(D_{\textrm{ref}})}\le C(s^{-2/p+1}+n^{-1/p}+h^2), \end{aligned}$$

where h denotes the mesh size of the piecewise linear finite element mesh and \(C>0\) is a constant independent of s, h, and n.
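To illustrate how Theorem 4.4 can guide the choice of discretization parameters, the following sketch equates each of the three error contributions with a target tolerance \(\varepsilon \); all constants are ignored, and the function and parameter names are purely illustrative, not part of the method analyzed above.

```python
from math import ceil

def balanced_parameters(eps, p):
    """Illustrative balancing of the three error terms in Theorem 4.4.

    Setting each contribution equal to eps (constants ignored) gives
      s^(-2/p+1) = eps  =>  s ~ eps^(-p/(2-p)),
      n^(-1/p)   = eps  =>  n ~ eps^(-p),
      h^2        = eps  =>  h ~ eps^(1/2).
    """
    s = ceil(eps ** (-p / (2.0 - p)))  # truncation dimension
    n = ceil(eps ** (-p))              # number of cubature nodes
    h = eps ** 0.5                     # finite element mesh size
    return s, n, h

# Example: summability exponent p = 0.5 and tolerance eps = 1e-2.
s, n, h = balanced_parameters(1e-2, 0.5)
```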

5 Numerical experiments

Let us consider the domain parameterization

$$\begin{aligned} D({{\varvec{y}}}):=\{(x_1,x_2)\in {\mathbb {R}}^2:0\le x_1\le 1,~0\le x_2\le a(x_1,{{\varvec{y}}})\},\quad {{\varvec{y}}}=(y_j)_{j=1}^s\in U_s, \end{aligned}$$

where

$$\begin{aligned} a(x,{{\varvec{y}}}):=1+\frac{c}{\sqrt{6}}\sum _{j=1}^s\sin (2\pi y_j)\psi _j(x),\quad x\in [0,1]~\textrm{and}~{{\varvec{y}}}\in U_s \end{aligned}$$
(5.1)

with \(c>0\) and \(\psi _j(x):=j^{-\theta }\cos (j\pi x)\), \(\theta >1\). Thus the reference domain in this case is the unit square, and the uncertain boundary is confined to the upper edge of the square. It is possible to write \(D({{\varvec{y}}})={\varvec{V}}(D({\textbf{0}}),{{\varvec{y}}})\) with the vector-valued expansion

$$\begin{aligned} {\varvec{V}}({{\varvec{x}}},{{\varvec{y}}}):={{\varvec{x}}}+\frac{1}{\sqrt{6}}\sum _{j=1}^s \sin (2\pi y_j){\varvec{\psi }}_j({{\varvec{x}}}), \end{aligned}$$

where

$$\begin{aligned} {\varvec{\psi }}_j({{\varvec{x}}}):=cj^{-\theta }\begin{bmatrix}0\\ x_2\cos (j\pi x_1)\end{bmatrix}. \end{aligned}$$
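A minimal sketch of this parameterization, using the experiment's values \(c=\sqrt{3/2}\) and \(\theta =2.1\) as hypothetical defaults, might read:

```python
import numpy as np

def a(x, y, c=np.sqrt(1.5), theta=2.1):
    """Upper boundary a(x, y) from (5.1)."""
    j = np.arange(1, len(y) + 1)
    return 1.0 + (c / np.sqrt(6.0)) * np.sum(
        np.sin(2.0 * np.pi * np.asarray(y)) * j ** (-theta) * np.cos(j * np.pi * x)
    )

def V(x1, x2, y, c=np.sqrt(1.5), theta=2.1):
    """Domain mapping V(x, y): shifts x2 in proportion to the boundary perturbation."""
    j = np.arange(1, len(y) + 1)
    shift = (c / np.sqrt(6.0)) * np.sum(
        np.sin(2.0 * np.pi * np.asarray(y)) * j ** (-theta) * x2 * np.cos(j * np.pi * x1)
    )
    return x1, x2 + shift

# At y = 0 the mapping is the identity and a(x, 0) = 1 (the unit square);
# the upper edge x2 = 1 is mapped onto the curve x2 = a(x1, y).
```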

We note that

$$\begin{aligned} \Vert {\varvec{\psi }}_j'({{\varvec{x}}})\Vert _2=\sqrt{\cos ^2(j\pi x_1)+j^2\pi ^2x_2^2\sin ^2(j\pi x_1)} \end{aligned}$$
(5.2)

and

$$\begin{aligned} \sup _{{{\varvec{x}}}\in D({\textbf{0}})}\sqrt{\cos ^2(j\pi x_1)+j^2\pi ^2x_2^2\sin ^2(j\pi x_1)}=j\pi . \end{aligned}$$
(5.3)

The identity (5.2) can be easily verified from the definition of the matrix spectral norm, and (5.3) follows by observing that \(\cos ^2(j \pi x_1) + j^2 \pi ^2 x_2^2\sin ^2(j \pi x_1) = 1 + \sin ^2(j \pi x_1) (j^2 \pi ^2 x_2^2 - 1)\) is maximized by \(x_1=\frac{1}{2j}\) and \(x_2=1\). The elements of the sequence \((b_j)_{j=1}^\infty \) needed for the construction of the QMC weights now simplify to

$$\begin{aligned} b_j&=\frac{1}{\sqrt{6}}\Vert {\varvec{\psi }}_j\Vert _{W^{1,\infty }({D_{\textrm{ref}}})}\\&=\frac{1}{\sqrt{6}}\!\max \!\bigg \{\underset{{{\varvec{x}}}\in D({\textbf{0}})}{\mathrm{ess\,sup}}\Vert {\varvec{\psi }}_j({{\varvec{x}}})\Vert _2,\underset{{{\varvec{x}}}\in D({\textbf{0}})}{\mathrm{ess\,sup}}\Vert {\varvec{\psi }}_j'({{\varvec{x}}})\Vert _2\bigg \}\\&=\frac{c}{\sqrt{6}}j^{-{\theta }}\!\max \!\bigg \{\underset{{{{\varvec{x}}}\in D({\textbf{0}})}}{\textrm{sup}}|x_2\cos (j\pi x_1)|,\!\underset{{{\varvec{x}}}\in D({\textbf{0}})}{\textrm{sup}}\!\sqrt{\cos ^2(j\pi x_1)\!+\!j^2\pi ^2x_2^2\sin ^2(j\pi x_1)}\bigg \}\\&=\frac{c}{\sqrt{6}}j^{-\theta }\max \{1,j\pi \}=\frac{c\pi }{\sqrt{ 6}}j^{1-\theta }. \end{aligned}$$

Moreover, we can take

$$\begin{aligned} \xi _{{{\varvec{b}}}}=\sum _{j=1}^\infty b_j=\frac{c\pi }{\sqrt{ 6}}\zeta (\theta -1). \end{aligned}$$

The expected rate of QMC convergence is \({\mathcal {O}}(n^{-\theta +1})\).
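The sequence \((b_j)_{j=1}^\infty \) and a truncated approximation of \(\xi _{{{\varvec{b}}}}\) are straightforward to evaluate. The sketch below uses the values from the numerical experiments as hypothetical defaults; the partial sum is only a rough stand-in for the zeta function value, since the series converges slowly for \(\theta \) close to 2.

```python
from math import pi, sqrt

def b(j, c=sqrt(1.5), theta=2.1):
    """b_j = (c * pi / sqrt(6)) * j^(1 - theta), as derived above."""
    return c * pi / sqrt(6.0) * j ** (1.0 - theta)

# Truncated approximation of xi_b = (c * pi / sqrt(6)) * zeta(theta - 1);
# a partial sum underestimates the true value.
xi_b_trunc = sum(b(j) for j in range(1, 100001))
```

For \(c=\sqrt{3/2}\) the leading element is \(b_1=\pi /2\), independent of \(\theta \).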

5.1 Source problem

We consider the Poisson problem (1.1) equipped with \(f({{\varvec{x}}}):=x_2\) and the random domain described above for \(\theta \in \{2.1,2.5,3.0\}\) and \(c=\sqrt{3/2}\). The input random field is truncated to \(s=100\) terms. We construct a uniform regular triangulation for the reference domain \(D_{\textrm{ref}}=D({\textbf{0}})\) with mesh size \(h=2^{-5}\), which is then mapped to each realization of the random domain using \({\varvec{V}}\). The PDE is solved using a first order finite element method. For each realization of \({{\varvec{y}}}\in U_s\), the finite element solution is computed in \(D({{\varvec{y}}})\) and then mapped back onto the reference domain \(D_{\textrm{ref}}\). This is equivalent to but simpler than computing the solution on the reference domain, see [15]. We solve the PDE problem over QMC point sets with increasing number of cubature nodes n and compute the relative errors

$$\begin{aligned} \frac{\Vert {\mathbb {E}}[{{\widehat{u}}}]-Q_{s,n}({{\widehat{u}}})\Vert _{L^1(D_{\textrm{ref}})}}{\Vert {\mathbb {E}}[{{\widehat{u}}}]\Vert _{L^2(D_{\textrm{ref}})}}, \end{aligned}$$
(5.4)

where \(Q_{s,n}\) denotes the s-dimensional QMC operator over n cubature nodes. As the reference solution, we use the approximate QMC solution obtained using \(n=1\,024\,207\) cubature nodes. The QMC point sets for this experiment were constructed using the fast CBC algorithm with SPOD weights of the form (4.12), where we set \(\sigma _{\min }=\rho =1\) and \(c=10^{-6}\) for numerical stability. The results are displayed in the left-hand side image of Fig. 1. In this case, the observed rates are \(\sigma _{2.1}=0.95\), \(\sigma _{2.5}=1.15\), and \(\sigma _{3}=1.72\). The theoretically predicted convergence rates are clearly observed in the numerical results: using a higher value for the decay rate \(\theta \) results in a faster rate of convergence.
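The structure of the QMC estimator \(Q_{s,n}\) itself is simple: a rank-1 lattice rule averages the integrand over the nodes \({{\varvec{y}}}^{(k)}=\{k{{\varvec{z}}}/n\}\). The following sketch evaluates such a rule for a toy periodic integrand with known mean; the generating vector here is an arbitrary placeholder, not one produced by the CBC construction of Sect. 4.3.

```python
import numpy as np

def lattice_points(z, n):
    """Rank-1 lattice nodes y^(k) = frac(k * z / n), k = 1, ..., n."""
    k = np.arange(1, n + 1).reshape(-1, 1)
    return np.mod(k * np.asarray(z) / n, 1.0)

def Q(f, z, n):
    """QMC estimate (1/n) * sum_k f(y^(k)) of the integral over [0,1]^s."""
    return np.mean([f(y) for y in lattice_points(z, n)])

# Toy periodic integrand with known integral 3 over [0,1]^2, mimicking the
# sin(2*pi*y_j) structure of the random field.
f = lambda y: np.sin(2 * np.pi * y[0]) * np.sin(2 * np.pi * y[1]) + 3.0
z = [1, 182667]  # placeholder generating vector
est = Q(f, z, 1009)  # agrees with the exact value 3 up to rounding
```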

Fig. 1
figure 1

Left: Error criterion (5.4). Right: Error criterion (5.5). In both cases, the errors were computed for increasing n, different decay rates \(\theta \in \{2.1,2.5, 3.0\}\), and fixed dimension \(s=100\)

In addition, we also consider a nonlinear quantity of interest (QoI)

$$\begin{aligned} G({{\widehat{u}}}(\cdot ,{{\varvec{y}}})):=\Vert \nabla {{\widehat{u}}}(\cdot ,{{\varvec{y}}})\Vert _{L^2(D_{\textrm{ref}})} \end{aligned}$$

and compute the relative errors

$$\begin{aligned} \frac{|{\mathbb {E}}[G({{\widehat{u}}})]-Q_{s,n}(G({{\widehat{u}}}))|}{|{\mathbb {E}}[G({{\widehat{u}}})]|}. \end{aligned}$$
(5.5)

Again, the QMC solution obtained using \(n=1\,024\,207\) cubature nodes is used as the reference. The results, computed using the same QMC rules as above, are displayed in the right-hand side image of Fig. 1. The observed rates are \(\sigma _{2.1}=0.89\), \(\sigma _{2.5}=1.09\), and \(\sigma _{3}=1.74\). Even though the use of a nonlinear QoI results in more irregular convergence behavior, the general trends are still visible, i.e., faster decay of the input random field yields a faster cubature convergence rate.

5.2 Capacity problem

In this section, we consider an example which is beyond the setting considered in the previous sections to demonstrate that our method also works well for a broader class of domain uncertainty quantification problems.

We will consider the Dirichlet–Neumann problem

$$\begin{aligned} {\left\{ \begin{array}{ll} -\varDelta u({{\varvec{x}}},{{\varvec{y}}})=0\quad \text {for}~{{\varvec{x}}}\in D({{\varvec{y}}}),~{{\varvec{y}}}\in U_s,\\ \partial _{\textrm{n}}u|_{x_1=0}=\partial _{\textrm{n}}u|_{x_1=1}=0,\\ u|_{x_2=0}=0,~u|_{x_2=a(x_1,{{\varvec{y}}})}=1, \end{array}\right. } \end{aligned}$$
(5.6)

and its conjugate problem

$$\begin{aligned} {\left\{ \begin{array}{ll} -\varDelta v({{\varvec{x}}},{{\varvec{y}}})=0\quad \text {for}~{{\varvec{x}}}\in D({{\varvec{y}}}),~{{\varvec{y}}}\in U_s,\\ \partial _{\textrm{n}}v|_{x_2=0}=\partial _{\textrm{n}}v|_{x_2=a(x_1,{{\varvec{y}}})}=0,\\ v|_{x_1=0}=0,~v|_{x_1=1}=1. \end{array}\right. } \end{aligned}$$
(5.7)

As the QoI, we consider the capacity

$$\begin{aligned} \textrm{cap}(D({{\varvec{y}}})):=\int _{D({{\varvec{y}}})}|\nabla u({{\varvec{x}}},{{\varvec{y}}})|^2\,\textrm{d}{{\varvec{x}}},\quad {{\varvec{y}}}\in U_s, \end{aligned}$$

where u is the solution to (5.6). We also define its conjugate as

$$\begin{aligned} \overline{\textrm{cap}}(D({{\varvec{y}}})):=\int _{D({{\varvec{y}}})}|\nabla v({{\varvec{x}}},{{\varvec{y}}})|^2\,\textrm{d}{{\varvec{x}}},\quad {{\varvec{y}}}\in U_s, \end{aligned}$$

where v is the solution to (5.7). It is well-known that (cf., e.g., [1, Theorem 4-5])

$$\begin{aligned} \textrm{cap}(D({{\varvec{y}}}))\,\overline{\textrm{cap}}(D({{\varvec{y}}}))=1\quad \text {for all}~{{\varvec{y}}}\in U_s, \end{aligned}$$
(5.8)

provided that \(x\mapsto a(x,{{\varvec{y}}})\) specifies a Jordan arc such that \(a(x,{{\varvec{y}}})>0\) for all \(x\in [0,1]\) and \({{\varvec{y}}}\in U_s\).
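On an unperturbed rectangle \([0,1]\times [0,a]\) the solutions of (5.6) and (5.7) are explicit, \(u=x_2/a\) and \(v=x_1\), so the identity (5.8) can be checked in closed form. The following sketch covers only this trivial flat-boundary case:

```python
# For the rectangle [0,1] x [0,a], u = x2/a solves (5.6) and v = x1
# solves (5.7), so both capacities are available in closed form.
def cap_rectangle(a):
    # |grad u|^2 = 1/a^2 over an area of a  =>  cap = 1/a.
    return 1.0 / a

def conj_cap_rectangle(a):
    # |grad v|^2 = 1 over an area of a  =>  conjugate capacity = a.
    return a

# The reciprocal relation (5.8) holds for any height a > 0.
for a in (0.5, 1.0, 2.0):
    assert abs(cap_rectangle(a) * conj_cap_rectangle(a) - 1.0) < 1e-12
```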

The solutions to (5.6) and (5.7) are calculated numerically using hp-FEM. As before, the finite element solutions to (5.6) and (5.7) are not computed on the reference domain \(D_{\textrm{ref}}\); for each realization of \({{\varvec{y}}}\in U_s\), the finite element solution is computed in \(D({{\varvec{y}}})\). Numerically, we can use the reciprocal relation (5.8) to define an a posteriori error estimate

$$\begin{aligned} \textrm{err}({{\varvec{y}}}):=|1-\textrm{cap}(D({{\varvec{y}}}))\,\overline{\textrm{cap}}(D({{\varvec{y}}}))|, \end{aligned}$$
(5.9)

where \(\textrm{cap}(D({{\varvec{y}}}))\) and \(\overline{\textrm{cap}}(D({{\varvec{y}}}))\) are computed using hp-FEM, and we tune the finite element approximation to ensure that (5.9) is approximately of the same order across all realizations of \(D({{\varvec{y}}})\). One realization of \(D({{\varvec{y}}})\) with respective solutions \(u({{\varvec{x}}},{{\varvec{y}}})\) and \(v({{\varvec{x}}},{{\varvec{y}}})\) is shown in Fig. 2.

Fig. 2
figure 2

Sample realisation at \(s=10\): contour plots of \(u({{\varvec{x}}},{{\varvec{y}}})\) and \(v({{\varvec{x}}},{{\varvec{y}}})\). Superimposed, the two contour plots form the image of the conformal map of the domain

Fig. 3
figure 3

Error criterion (5.10) with increasing n. a Varying decay rate \(\theta \in \{2.1,2.5, 3.0\}\), and fixed dimension \(s=10\). b Varying dimensions \(s\in \{10,40,100\}\), and fixed decay rate \(\theta =2.1\). c Detail of the case \(s=10\), \(\theta =2.5\)

The numerical experiments were carried out as follows:

  (i)

    We set \(s=10\), \(c=\sqrt{3/2}\), and let \(\theta \in \{2.1,2.5,3.0\}\) in (5.1). We solve the problems (5.6) and (5.7) over QMC point sets with increasing number of cubature nodes n and consider the error criterion

    $$\begin{aligned} \textrm{Capacity}=\frac{|{\mathbb {E}}[\textrm{cap}(D(\cdot ))]-Q_{s,n}(\textrm{cap}(D(\cdot )))|}{|{\mathbb {E}}[\textrm{cap}(D(\cdot ))]|}, \end{aligned}$$
    (5.10)

    where \(Q_{s,n}\) denotes the s-dimensional QMC operator over n cubature points. As the reference solution, we use the approximate QMC solution computed using \(n=1\,024\,207\). (The reference result is computed using a tailored lattice; in the two cases where a general lattice reference is also available, the effect on the convergence curves is negligible.) The results are displayed in Fig. 3a.

  (ii)

    We set \(c=\sqrt{3/2}\), \(\theta =2.1\), and let \(s\in \{10,40,100\}\). We compute the relative errors (5.10) for the problems (5.6) and (5.7) over QMC point sets with increasing number of cubature nodes n. Again, as the reference solution, we use the approximate QMC solution computed using \(n=1\,024\,207\). The results are displayed in Fig. 3b.

The QMC point sets for the above experiments were generated by the fast CBC algorithm with the same specifications as in Sect. 5.1. We remark that the boundary conditions of the numerical experiments are not the homogeneous Dirichlet zero boundary conditions analyzed in the earlier theory, which motivates the use of the a posteriori estimate introduced above.

The observed rates are \(\sigma _{2.1}=0.7\), \(\sigma _{2.5}=1.3\), and \(\sigma _{3}=2.4\) (see Fig. 3). The rate does not appear to depend on the dimension s. The cone of convergence for \(\theta =2.5\), shown in Fig. 3c, is bounded by the rates \(\sigma _{2.5\text {A}}=2.3\) (above) and \(\sigma _{2.5\text {B}}=0.6\) (below). The general convergence patterns are clearly observed. The irregularities in the convergence behavior may be explained by the nonlinear QoI (compare with the experiment in Sect. 5.1) as well as the more complicated boundary conditions: the gradient term in the expression for the capacity appears to be sensitive to obtuse angles at the Dirichlet–Neumann interface for certain realizations of the random domain.

6 Conclusions

We analyzed the Poisson problem subject to domain uncertainty, employing a domain mapping approach in which the shape uncertainty is transferred onto a fixed reference domain. As a result, the coefficient of the transported PDE problem on the reference domain becomes highly nonlinear with respect to the stochastic variables. Motivated by this, we developed a novel parametric regularity analysis for the transported problem subject to such diffusion coefficients. The parametric regularity analysis is an important ingredient in the design of tailored QMC rules, and we demonstrated that these methods exhibit higher order cubature convergence rates when the random perturbation field is parameterized using the model introduced in [23], in which an infinite sequence of independent random variables enters the random field through periodic functions. The numerical results show that faster decay of the stochastic fluctuations leads to faster QMC convergence in the computation of the PDE response, which is consistent with the theory.

Another novel feature of our work is the dimension truncation error analysis for the problem. Many studies in uncertainty quantification for PDEs with random coefficients exploit the parametric structure of the problem, using a Neumann series expansion to obtain dimension truncation error bounds. However, several recent studies suggest a Taylor series approach based on the parametric regularity of the problem [9, 11, 12]. The Taylor series approach can be used to obtain dimension truncation rates for non-affine parametric PDE problems, but a limitation of the aforementioned papers is that the parametric regularity bound needs to be of product-and-order-dependent (POD) form for the Taylor-based approach to yield useful results. This issue is circumvented in the present paper by a change-of-variables technique that transforms the problem into a form in which the Taylor series approach produces improved dimension truncation error rates.

The results developed in this paper constitute important groundwork for other applications involving domain uncertainty such as domain shape recovery problems.