Correction to: Numer. Math. (2017) 135:1207–1220 https://doi.org/10.1007/s00211-016-0829-7
1 Introduction
In [1], the influence of random equation ordering in a linear system \(By=b\) on deriving upper bounds for the convergence speed of the classical successive over-relaxation (SOR) method
was studied. For simplicity, it is assumed that the system is consistent with solution y and that B is a complex \(n\times n\) Hermitian positive semi-definite matrix with positive diagonal part D and strictly lower triangular part L. Two strategies of involving permutations of the system were considered. For the so-called shuffled SOR iteration, in each step the SOR update formula (1) is applied to a uniformly at random and independently chosen permutation of the linear system, while for the preshuffled SOR iteration the iteration (1) is performed for \(k=0,1,\ldots \) after a single permutation of the system at the beginning. To study the convergence properties of such iterations, 2-norm estimates involving the lower triangular part \(L_\sigma \) of the permuted matrix \(B_\sigma =P_\sigma B P_\sigma ^*\) played a crucial role, where \(P_\sigma \) denotes the permutation matrix associated with the permutation \(\sigma \) acting according to \((P_\sigma y)_i=y_{\sigma _i}\), \(i=1,\ldots ,n\).
The necessary properties were formulated as Theorem 2 and 3 in [1]. Unfortunately, the proof of Theorem 2 uses a wrong formula for \(P_\sigma ^*L_\sigma L_\sigma ^*P_\sigma \). This was pointed out by T. Yilmaz in [2]. In the next section, a proof of Theorem 2 based on the correct formula is provided, including a slight improvement of the involved constants.
Even though the proof of Theorem 3 in [1] is correct, we use the opportunity to give new estimates for the absolute constant in the inequality
For the class of all Hermitean matrices (not necessarily positive-definite) we show that taking \(C=245\) is feasible which considerably improves the value \(C=C_2=2905\) stated in [1]. For positive semi-definite B, one can even take \(C=122{,}3\). These bounds are consequences of recent quantitative improvements of the Anderson paving conjecture and will be shown in Sect. 3. The derivation in [1] that (2) holds for positive semi-definite B with unit diagonal with the smaller value \(C=C_1=32{,}42\) is based on an flawed application of earlier results on the size of one-sided pavings and thus is not correct. It remains an interesting open question to find more precise bounds for the constant C in (2). For the class of positive semi-definite B with unit diagonal I conjecture that \(C=2/\pi \) is the best possible choice.
2 Correct statement and proof of Theorem 2 from [1]
Theorem 2 in [1] concerns the 2-norm estimate of the matrix
which plays a cruical role in the estimates of the expected squared error of the shuffled SOR iteration, see Theorem 4 a) there. As was mentioned above, its proof uses a wrong formula for the entries of \(P_\sigma ^*L_\sigma L_\sigma ^*P_\sigma \). The correct formula [2] is
where \(H_{i,j}\) are the entries of the Hermitean matrix \(H=B-D\), the non-diagonal part of B, and \(\sigma ^{-1}\) is the permutation inverse to \(\sigma \).
To see (4), recall that
Consequently,
and, by setting \(i=\sigma ^{-1}_s\), \(j=\sigma ^{-1}_t\), we arrive at (4).
Based on (4), we next derive a formula for E in terms of the Hermitean positive-definite matrix \(H^2\), namely,
where \(D_{H^2}\) is the diagonal part of \(H^2\). Indeed, from (3) and (4) we have
where \(n_{m;s,t}\) stands for the cardinality of the set of all permutations \(\sigma \) such that, for some \(k<\min (\sigma ^{-1}_s,\sigma ^{-1}_t)\), we have \(\sigma _k=m\). Equivalently, this is the cardinality of the set of all \(\sigma \) such that \(\sigma ^{-1}_m < \min (\sigma ^{-1}_s,\sigma ^{-1}_t)\). It is not hard to see that these cardinalities equal
Indeed, for the case \(m\ne t=s\), any \(\sigma \) in the associated set is obtained by first choosing two indices k, i with \(k<i\) from \(\{1,\ldots ,n\}\) and setting \(\sigma ^{-1}_m=k\), \(\sigma ^{-1}_s=i\) (this is possible in \(n(n-1)/2\) different ways) and then independently assigning the remaining \(n-2\) indices arbitrarily (this is possible in \((n-2)!\) different ways). A similar reasoning applies to the case \(m\ne s\ne t\ne m\), where one starts with a subset of 3 different indices k, i, j with \(k< i <j\), sets
or alternatively
(altogether \(n(n-1)(n-2)/3\) different possibilities) and assigns the remaining \(n-3\) indices arbitrarily (\((n-3)!\) different possibilities). For index constellations, where \(m=s\) or \(m=t\), one obviously has \(n_{m;s,t}=0\). With this, we arrive for \(s=t\) at
since \(H_{m,m}=0\). Similarly, for \(s\ne t\) we have
This establishes the formula (5).
Note that, up to this point, the calculations hold for any Hermitean B. Since \(H^2\) and its diagonal part \(D_{H^2}\) are automatically positive semi-definite, we thus get
Since \(\Vert H\Vert \le \Vert B\Vert +\Vert D\Vert \le 2\Vert B\Vert \) for any B we also have
for all Hermitean B.
If the Hermitean B is positive semi-definite, then \(H=B-D\) has norm \(\Vert H\Vert \le \Vert B\Vert \) since
Thus, in this case \(\Vert E\Vert \le \frac{1}{2} \Vert B\Vert ^2\). If, in addition, B has unit diagonal (i.e., \(D=I\)) then slightly more precise bounds are possible. Indeed, then
In summary, we have proved the following replacement of Theorem 2 from [1].
Theorem
Let B be an arbitrary Hermitean matrix, and \(H=B-D\) its non-diagonal part. Then the matrix E defined in (3) satisfies
If, in addition, B is positive semi-definite then
Compared to the statement of Theorem 2 in [1], the constants in these estimates are reduced by a factor of two which also leads to better constants in Theorem 4 a) in [1].
3 New constants in Theorem 3 from [1]
The proof of Theorem 3 in [1], i.e., the proof of (2) with a constant C independent of the size of B, is essentially based on the existence of so-called \((k,\epsilon )\)-pavings for Hermitean matrices with zero (or small) diagonal part such as \(H=B-D\). We use a consequence of a recent refinement [3] of the original proof of the Anderson paving conjecture. If one carefully follows the proof of Theorem 1.1 in [3, Section 5.2] specialized to the pair \([H,-H]\) (in particular, if one uses the more precise bound at the end of the proof of Theorem 5.6 there) then one sees that for any \(\epsilon \in (0,1)\) there exists a \((k,\epsilon )\)-paving of H if \(4k^{-1/2}+2k^{-1}\le \epsilon \). Equivalently, for any \(k\ge 20\) there exists a \((k,\epsilon _k)\)-paving for H with
It was shown in the proof of Theorem 3 and in the remarks following it in [1] that this implies the estimate (2) with the constant
(the minimum is achieved for \(k=43\)). Since \(\Vert H\Vert \le 2\Vert B\Vert \) for general Hermitean B and \(\Vert H\Vert \le \Vert B\Vert \) for positive semi-definite B, this yields the respective statements about the constant C in (2) in Sect. 1.
References
Oswald, P., Zhou, W.: Random reordering in SOR-type methods. Numerische Mathematik 135, 1207–1220 (2017). https://doi.org/10.1007/s00211-016-0829-7
Yilmaz, T.: Triangular Truncation Under Permutation in SOR-Type Methods. BSc Thesis, Department of Mathematics/Computer Science, University of Cologne (2023)
Ravichandran, M., Srivastava, N.: Asymptotically optimal multi-paving. Int. Math. Res. Not. 14, 10908–10940 (2021). https://doi.org/10.1093/imrn/rnz111
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Oswald, P., Zhou, W. Correction: Random reordering in SOR-type methods. Numer. Math. 154, 521–525 (2023). https://doi.org/10.1007/s00211-023-01365-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00211-023-01365-9