1 Introduction

One significant subfield of additive group theory and combinatorial number theory is the zero-sum theory that studies the sums behavior of suitable sequences of elements in an abelian finite group G (see, for instance, the surveys [7, 20]). In this context, a typical kind of problem considers the existence of constants \(\ell \) such that any sequence of elements of G whose length is bigger than \(\ell \) satisfies an additive property \({\mathcal {P}}.\) Among these constants, an important role is taken by the classical Erdős–Ginzburg–Ziv constant of a group G that denotes the smallest positive integer \(\ell \) such that any sequence of length \(|S|\ge \ell \) contains a zero-sum subsequence of length \(\exp (G).\) This constant has been well studied in the literature, we refer to the survey paper [20]. Here we recall that in [16], Erdős, Ginzburg, and Ziv completely determined its value over cyclic groups and that in [18, 21, 23], respectively Fox, Sauermann, and Naslund derived nontrivial upper bounds on groups of type \({\mathbb {F}}_p^n\) (they assumed a slightly different definition of the Erdős–Ginzburg–Ziv constant).

In the recent paper [10], Caro and Schmitt generalized this concept, using the m-th degree symmetric polynomial \(e_m(g_1,\ldots ,g_{t})=\sum _{1\le i_1<\cdots <i_m\le t}\) \(\prod _{j=1}^m g_{i_j}\) instead of the sum of the elements of S and considering subsequences of a given length t (see also [1, 3,4,5] that considered some related problems). In particular, they defined the higher degree Erdős–Ginzburg–Ziv constants EGZ(tRm) as follows. For a finite commutative ring REGZ(tRm) is the smallest positive integer \(\ell \) such that every sequence S over R of length \(|S|\ge \ell \) contains a subsequence \(S'\) of length t for which \(e_{m}(S')\) evaluates to the zero element in R. If such \(\ell \) does not exists, EGZ(tRm) is set to \(\infty .\)

They also present several lower and upper bounds to these constants solving the case where R is \({\mathbb {Z}}_2\) and the case where R is \({\mathbb {Z}}_{p^{s}}\) if t and m are powers of the same prime. For a generic finite commutative ring R,  their best lower bound is expressed in term of the generalized Davenport constant D(Rm) of the ring R (see Caro et al. [11]), that is, the smallest integer \(\ell \) such that any sequence S over R of length \(|S|\ge \ell \) contains a subsequence \(S'\) of length \(|S'|\ge m\) for which \(e_m(S')\) equals the zero element of R. Indeed they prove that

$$\begin{aligned} EGZ(t,R,m)\ge t+D(R,m)-m. \end{aligned}$$
(1.1)

This paper aims to determine lower and upper bounds for EGZ(tRm) in case \(R={\mathbb {F}}_q^{n}\) (viewed as a commutative ring) for some prime power q (in the following we will always use the letter q for a prime power and p for a prime). The article is organized as follows. In Section 2, we will present two lower bounds obtained, respectively, using the Lovász local lemma and the expurgation method. Then, in Section 3, we will show that, for sufficiently large n,  our bounds improve the ones given by Caro and Schmitt in the same context. Finally, in Section 4, we prove closed form upper bounds to \(EGZ(p,R,p-1),\) derived from the Ellenberg–Gijswijt [15] and Sauermann [23] bounds for the cap-set problem, in case \(R={\mathbb {F}}_q^{n}\) and \(q=p^k.\) Moreover, we will apply Tao’s slice rank method to provide an upper bound to \(EGZ(3,{\mathbb {F}}_q^n,2)\) and we derive a convex optimization problem that we can solve numerically providing better bounds for \(q = 3^k\) and \(k=2, 3,4,5.\)

2 Lower bounds

In this section, we will present two kinds of probabilistic lower bounds on the Erdős–Ginzburg–Ziv constants of rings of type \({\mathbb {F}}_q^n.\) Both those bounds exploit the following upper bound on the probability that a given t-sequence S of elements in (vectors of) \({\mathbb {F}}_q^n\) is such that \(e_m(S)=0.\) To provide such an upper bound, we exploit the following famous lemma.

Lemma 2.1

(Schwartz–Zippel lemma [24, 27]). Let \(P \in {\mathbb {F}}[x_1, x_2, \ldots , x_t]\) be a non-zero polynomial with degree d. Consider a finite subset \(A \subseteq {\mathbb {F}}.\) If we pick uniformly at random \(r_1, r_2, \ldots , r_t\) from A,  then

$$\begin{aligned} {\mathbb {P}}[P(r_1, r_2, \ldots , r_t) = 0] \le \frac{d}{|A|}. \end{aligned}$$

Using Lemma 2.1, we easily obtain the follow proposition.

Proposition 2.2

Let us choose,  uniformly at random,  a sequence \(S = (g_1, g_2, \) \(\ldots , g_t)\) of \(t\ge m\) vectors of \({\mathbb {F}}_q^n.\) Then

$$\begin{aligned} {\mathbb {P}}[e_m(S)=0]\le \left( \frac{m}{q}\right) ^{n}. \end{aligned}$$

Proof

We prove this result first assuming \(n=1.\) Since \(e_m(S)\) is a polynomial of degree m,  by Lemma 2.1, taking \(A = {\mathbb {F}}_q,\) we obtain

$$\begin{aligned} {\mathbb {P}}[e_m(S)=0] \le \frac{m}{|A|} = \frac{m}{q}. \end{aligned}$$

Now we note that, if we consider a sequence S of \(t\ge m\) vectors of \({\mathbb {F}}_q^n,\) then \({e_m(S)=0}\) if and only if each of the n projections \(\pi _i(S)\) of S over the i-th coordinate satisfies \({e_m(\pi _i(S))=0}.\) Since those projections are independent, it follows that

$$\begin{aligned} {\mathbb {P}}[e_m(S)=0]=\prod _{i=1}^{n}{\mathbb {P}}[e_m(\pi _i(S))=0] \le \left( \frac{m}{q}\right) ^{n}. \end{aligned}$$

\(\square \)

We provide a first new lower bound on \(EGZ(t,{\mathbb {F}}_q^{n},m)\) by exploiting the so-called Lovász local lemma, see also the work [5] of Bitz, Griffith, and He for a similar application of this method. Here we state the lemma (in the symmetric case) for the reader’s convenience.

Lemma 2.3

([17] (see also [2])). Let \(E_1, E_2, \ldots , E_k\) be events in an arbitrary probability space. Suppose that each event \(E_i\) is mutually independent of the set of all other events \(E_j\) but at most d,  and that \({\mathbb {P}}[E_i] \le P\) for all \(1 \le i \le k.\) If

$$\begin{aligned} e d P \le 1, \end{aligned}$$

then \({\mathbb {P}}[\cap _{i=1}^k \overline{E_i}] > 0.\)

Now, we are ready to state the following theorem.

Theorem 2.4

Let \(\ell \) be such that

$$\begin{aligned} e \left[ \left( {\begin{array}{c}\ell \\ t\end{array}}\right) - \left( {\begin{array}{c}\ell -t\\ t\end{array}}\right) \right] \left( \frac{m}{q}\right) ^{n} \le 1 \end{aligned}$$

where \(\left( {\begin{array}{c}\ell -t\\ t\end{array}}\right) \) is set to zero if \(\ell < 2t.\) Then \(EGZ(t,{\mathbb {F}}_q^{n},m) > \ell .\)

Proof

Here we need to prove the existence of a sequence S of length \(\ell \) for which any subsequence \(S'\) of length t is such that \(e_{m}(S')\not =0.\)

Let us choose, uniformly at random, a sequence S of length \(\ell \) in \({\mathbb {F}}_q^{n}.\) For a given subsequence \(S'\) of length t contained in S,  let \(E_{S'}\) be the event such that \(e_{m}(S') =0.\) Clearly, there are \(\left( {\begin{array}{c}\ell \\ t\end{array}}\right) \) such events. Due to Proposition 2.2, we know that

$$\begin{aligned} {\mathbb {P}}[E_{S'}] \le \left( \frac{m}{q}\right) ^{n} \quad \text {for all } S' \subseteq S,\ |S'| = t. \end{aligned}$$

It is easy to see that each event \(E_{S'}\) is mutually independent from all the events \(E_{S''}\) where \(S'' \subseteq S \setminus S'\) and \(|S''| = t.\) Therefore each event \(E_{S'}\) is dependent on at most \(\left( {\begin{array}{c}\ell \\ t\end{array}}\right) - \left( {\begin{array}{c}\ell -t\\ t\end{array}}\right) \) other events. Hence, due to Lemma 2.3, we obtain the thesis. \(\square \)

Now we provide a second lower bound that, in some regime of the parameters, turns out to improve that of Theorem 2.4. The method we use here is sometimes called expurgation in the literature. We refer the reader to the book [2, Chapter 3 (Alterations)].

Theorem 2.5

Let \(\ell \) be such that

$$\begin{aligned} \left( {\begin{array}{c}\ell +s\\ t\end{array}}\right) \left( \frac{m}{q}\right) ^{n}<s+1 \end{aligned}$$

for some \(s\ge 0.\) Then \(EGZ(t,{\mathbb {F}}_q^{n},m) > \ell .\)

Proof

We first note that the thesis is equivalent to prove the existence of a sequence S of length \(\ell \) for which any subsequence \(S'\) of length t is such that \({e_{m}(S')\not =0}.\)

Here we choose, uniformly at random, a sequence T of length \(\ell +s\) and we evaluate the expected value of the random variable X given by the number of subsequences \(T'\) of T of length t and such that \(e_{m}(T')=0.\) Because of Proposition 2.2, we have that

$$\begin{aligned} {\mathbb {E}}(X)\le \sum _{T'\subseteq T: |T'|=t} \left( \frac{m}{q}\right) ^{n}=\left( {\begin{array}{c}\ell +s\\ t\end{array}}\right) \left( \frac{m}{q}\right) ^{n}. \end{aligned}$$

Moreover, due to the hypothesis, we have that \({\mathbb {E}}(X)<s+1.\) It follows that there exists a set T of length \(\ell +s\) with at most s subsequences \(T'\) such that \(e_m(T')=0\) that we call bad subsequences. If we remove from T one element from each bad subsequence, we have removed at most s elements and we are left with a sequence S of length at least \(\ell \) for which any subsequence \(S'\) of length t is such that \(e_{m}(S')\not =0.\) Clearly, we may assume, without loss of generality, that the length of S is exactly \(\ell \) obtaining the thesis. \(\square \)

Remark 2.6

We have been able to compute the optimal value of s in the expurgation bound (Theorem 2.5) only for small values of t,  i.e., \(t=2,3,4,5.\) In all these cases, the expurgation bound performs better than the bound given in Theorem 2.4 obtained using the Lovász local lemma. In view of these results, we are inclined to conjecture that the expurgation bound provides the best bound for every \(\ell \ge t \ge 2.\) However, since we did not succeed to prove this conjecture, we considered it useful to report also the bound given in Theorem 2.4.

3 Comparison to Caro and Schmitt’s bounds

In this section, we discuss the bounds we have obtained in comparison to that of Caro and Schmitt. In particular, in [11, Theorems 3.1 and 3.4], it was proved that for rings of type \({\mathbb {F}}_p^{n}\) (where p is a prime), the following bounds on \(D({\mathbb {F}}_p^{n},m)\) hold:

$$\begin{aligned} nmp-(n-1)m \ge D({\mathbb {F}}_p^{n},m)\ge n p-(n-1)m. \end{aligned}$$
(3.1)

It follows that the lower bound of Caro and Schmitt (1.1) becomes

$$\begin{aligned} EGZ(t,{\mathbb {F}}_p^{n},m)\ge t+n (p-m). \end{aligned}$$
(3.2)

Now we consider our lower bound of Theorem 2.5 with \(s=0\) in the case \(q=p.\) Note that this is not, in general, our best lower bound but it is the easiest to consider. We have that \(EGZ(t,{\mathbb {F}}_p^{n},m)\ge \ell \) if \(\ell \) is such that

$$\begin{aligned} \left( {\begin{array}{c}\ell \\ t\end{array}}\right) \left( \frac{m}{p}\right) ^{n}<1. \end{aligned}$$

We note that \(\frac{\ell ^t}{t!}>\left( {\begin{array}{c}\ell \\ t\end{array}}\right) \) and hence \(EGZ(t,{\mathbb {F}}_p^{n},m)\ge \ell \) for any \(\ell \) such that

$$\begin{aligned} \frac{\ell ^t}{t!}\left( \frac{m}{p}\right) ^{n}<1, \end{aligned}$$

that is,

$$\begin{aligned} \frac{\ell ^t}{t!}<\left( \frac{p}{m}\right) ^{n} \end{aligned}$$

and hence

$$\begin{aligned} EGZ(t,{\mathbb {F}}_p^{n},m)\ge (t!)^{\frac{1}{t}} \left( \frac{p}{m}\right) ^{\frac{n}{t}}. \end{aligned}$$
(3.3)

Now, since (3.3) is, when \(p>m,\) exponential in n,  it is clear that asymptotically in n,  it improves the lower bound of Equation (3.2).

Remark 3.1

From Equation (3.3), we also have that, if \(p>m,\) for sufficiently large n:

$$\begin{aligned} EGZ(t,{\mathbb {F}}_p^{n},m)\ge (t!)^{\frac{1}{t}} \left( \frac{p}{m}\right) ^{\frac{n}{t}}> t+nm(p-1) \ge t+D({\mathbb {F}}_p^n,m)-m \end{aligned}$$

where the last inequality follows from the upper bound of Equation (3.1). This means that, for these kinds of parameters, it does not yield a Caro-Gao-type relation (see [8, 9, 19]), i.e., it does not hold the equality in Equation (1.1).

We also note that the bound of Equation (3.3) can be trivially improved for several values of \(q=p^k.\) Indeed, if \(\left( {\begin{array}{c}t\\ m\end{array}}\right) \not \equiv 0 \pmod {p},\) we have that \(EGZ(t,{\mathbb {F}}_q^{n},m) =\infty .\) It suffices to consider the infinite constant sequence such that \(g_i=1\) for any \(i\in {\mathbb {N}}.\) In this case, we have that, for any subsequence \(S'\) of length t\(e_m(S')=\left( {\begin{array}{c}t\\ m\end{array}}\right) \not \equiv 0\pmod {p}.\) On the other hand, this is a subset of the parameters for which our bounds of Section 2 (and in particular Equation (3.3)) hold. Moreover, we will show in the upcoming section that, at least when \(q=p^k,\) \(t=p,\) and \(m=p-1,\) it is possible to provide nontrivial upper bounds.

Finally, we also note that the bounds here presented can be easily generalized to rings of type \({\mathbb {F}}_{q_1}\times {\mathbb {F}}_{q_2}\times \cdots \times {\mathbb {F}}_{q_n}\) (similarly to those of [10] that Caro and Schmitt stated for rings of type \({\mathbb {Z}}_{m_1}\times {\mathbb {Z}}_{m_2}\times \cdots \times {\mathbb {Z}}_{m_n}\)) but, since we believe this is not a substantial improvement, we prefer to keep the notation of this note as simple as possible and to explicitly consider only rings of type \({\mathbb {F}}_q^n.\)

4 Upper bounds

In this section, we provide an upper bound to \(EGZ(p,{\mathbb {F}}_{q}^{n},p-1)\) with \(q=p^k,\) where p is an odd prime. In the first part of this section, we provide, using the Ellenberg–Gijswijt [15] and Sauermann [23] bounds for the cap-set problem, a general upper bound to \(EGZ(p,{\mathbb {F}}_{q}^{n},p-1)\) for every prime \(p \ge 3.\) Then, we use the so-called slice rank method, introduced by Tao in [25] and revisited by Tao and Sawin in [26] (see also [12] and [22] for a discussion on the method) in order to generalize the polynomial approach introduced in [14] and in [15], to improve the bounds that can be deduced by the Ellenberg–Gijswijt bound for \(p=3\) and \(k = 2, 3, 4, 5.\) Our application of the method is somehow reminiscent of works on the classical Erdős–Ginzburg–Ziv constants of Fox and Sauermann [18] and Naslund [21].

Let us first state the following theorems that will be used in Theorem 4.3 to provide a general bound on \(EGZ(p,{\mathbb {F}}_{q}^{n},p-1)\) for \(q=p^k\) and \(k \ge 2.\)

Theorem 4.1

(Ellenberg–Gijswijt [15]). Let A be a subset of \({\mathbb {F}}_3^n\) which does not contain 3 distinct elements \(x_1,\) \(x_2,\) \(x_3\) such that \(x_1+x_2+x_3 = 0.\) Then,  for \(n \rightarrow \infty ,\) we have that

$$\begin{aligned} |A| \le \left( \frac{3}{8} \root 3 \of {207 + 33 \sqrt{33}} + o(1) \right) ^n \approx (2.756 + o(1))^n. \end{aligned}$$

Theorem 4.2

(Sauermann [23]). Let \(p \ge 5\) be a prime and let A be a subset of \({\mathbb {F}}_p^n\) which does not contain p distinct elements \(x_1,x_2,\ldots ,\) \(x_p\) such that \(x_1+x_2+\cdots +x_p = 0.\) Then,  for \(n \rightarrow \infty ,\) we have that

$$\begin{aligned} |A| \le \left( 2 \sqrt{p} + o(1)\right) ^n. \end{aligned}$$

Now we consider a sequence \(S=(g_1,g_2,\ldots ,g_\ell )\) of elements in \({\mathbb {F}}_{q}^{n}\) with \(q=p^k\) such that every p-tuple of elements \(g_1',g_2',\ldots , g_p'\) of S satisfies \(e_{p-1}(g_1',g_2',\ldots , g'_p)\) \(=\sum _{1\le i_1<\cdots <i_{p-1}\le p}\) \(\prod _{j=1}^{p-1} g'_{i_j} \not =0.\) We note that S cannot have elements repeated more than \(p-1\) times since \(e_{p-1}(g_1',g_2',\ldots , g'_p) = 0\) whenever \(g_1'=g_2'=\cdots = g_p'.\) It means that we can remove the repeated elements in S obtaining a set \(S_1\) with \(|S_1|\ge \frac{|S|}{p-1}.\) Clearly, to upper bound the length of the sequence S,  it suffices to bound the cardinality of \(S_1\) considered as a set (it has no repetitions). Since it does not admit repeated elements, we already have that

$$\begin{aligned} \frac{|S|}{p-1} \le |S_1|\le q^n. \end{aligned}$$
(4.1)

Now, we are ready to state our first result whose proof has been pointed out by an anonymous referee.

Theorem 4.3

Let p be an odd prime and k a positive integer. Then we have that,  when q is not a power of p

$$\begin{aligned} EGZ(p, {\mathbb {F}}_q^n, p-1) = \infty . \end{aligned}$$

While for \(q = p^k,\) we have that

$$\begin{aligned} EGZ(p, {\mathbb {F}}_q^n, p-1) \le {\left\{ \begin{array}{ll} q^{n + o(n)} &{}\text {for } p=3,5 \text { and } k=1, \\ (2.756^k +1)^{n + o(n)} &{}\text {for } p=3 \text { and } k \ge 2, \\ \left( 2^k \sqrt{q}+1 \right) ^{n+o(n)} &{}\text {for } p \ne 3 \text { and } q \ge 7. \end{array}\right. } \end{aligned}$$

Proof

We note that, if \({\mathbb {F}}_q\) has characteristic \(p' \ne p,\) \(\left( {\begin{array}{c}p\\ p-1\end{array}}\right) =p \not \equiv 0 \pmod {p'}.\) In this case, we consider the infinite constant sequence such that \(g_i=1\) for any \(i\in {\mathbb {N}}.\) Here we have that, for any subsequence \(S'\) of length p\(e_{p-1}(S')=\left( {\begin{array}{c}p\\ p-1\end{array}}\right) =p\not \equiv 0\pmod {p'}\) and hence \(EGZ(p,{\mathbb {F}}_{q}^{n},p-1)=\infty \) whenever q is not a power of p. Therefore we can assume that \(q = p^k.\) For \(k=1\) and \(p = 3, 5,\) the upper bound of this theorem follows directly from Equation (4.1). Hence we can suppose that \(k \ge 2.\)

Now, let \(S \subseteq {\mathbb {F}}_q^n\) be a subset not containing p distinct elements \(x_1,x_2,\ldots , x_{p} \in S\) such that \(e_{p-1} (x_1, x_2, \ldots , x_p) = 0.\) For every \(P \subseteq \{1, 2, \ldots , n\},\) let us denote by \(S_P = \{ v \in S\,|\,{\textrm{supp}}(v) = P \},\) where \({\textrm{supp}}(v)\) is defined as the set of coordinates in which v is nonzero, the set of vectors in S that have the same support P. Let also denote by \(S'_P\subseteq {\mathbb {F}}_q^{|P|}\) the set obtained from \(S_P\) restricting every vector \(v \in S_P\) only to coordinates in P. This guarantees us that all the entries of \(S_P'\) are nonzero elements of \({\mathbb {F}}_q.\) Then we construct a new set \(S_P'' \subseteq {\mathbb {F}}_q^{|P|}\) by replacing every vector \((a_1, a_2, \ldots , a_{|P|}) \in S_P'\) by \((a_1^{-1}, a_2^{-1}, \ldots , a_{|P|}^{-1}) \in S_P''.\) Clearly, \(|S_P| = |S_P'| = |S_P''|.\) We claim that \(S_P''\) does not contain p distinct elements summing to zero in \({\mathbb {F}}_q^{|P|}.\) Indeed, suppose by contradiction there exist p distinct vectors \(x_1, x_2, \ldots , x_p \in S_P'\) such that \(x_{1, i}^{-1} + x_{2,i}^{-1} + \cdots + x_{p, i}^{-1} = 0\) for every i,  then we can multiply both sides of the previous equation by \(x_{1, i} x_{2, i} \cdots x_{p, i}\) to obtain that \(e_{p-1} (x_1, x_2, \ldots , x_p) = 0.\) But this is absurd due to the initial hypothesis on S.

Since, as an abelian group under addition, \({\mathbb {F}}_q^{|P|}\) is isomorphic to \({\mathbb {F}}_p^{k |P|},\) by Theorems 4.1 and 4.2, we have that, for fixed p and k\(|S_P| \le \left( u_{p, k} +o(1) \right) ^{|P|}\) for every \(P \subseteq \{1, 2,\ldots , n\}\) such that \(|P| = \alpha n (1+o(1))\) for some \(0< \alpha < 1,\) where

$$\begin{aligned} u_{p, k} := {\left\{ \begin{array}{ll} 2.756^{k} &{}\text {for } p=3 \text { and } k \ge 2, \\ (2 \sqrt{p})^{k} &{}\text {for } p \ne 3 \text { and } p^k \ge 7. \end{array}\right. } \end{aligned}$$

Hence, for any real \(0 < \alpha \le 1/4,\) we get

$$\begin{aligned} |S|&\le \sum _{\begin{array}{c} P \subseteq \{1, 2,\ldots , n\} \\ |P| \le \alpha n \end{array}} |S_P| + \sum _{\begin{array}{c} P \subseteq \{1, 2,\ldots , n\} \\ |P| \ge \alpha n \end{array}} |S_P| {\mathop {\le }\limits ^{({\textrm{i}})}} 2^n q^{\alpha n} +\sum _{i \ge \alpha n}^{n} \sum _{\begin{array}{c} P \subseteq \{1, 2,\ldots , n\} \\ |P| = i \end{array}} |S_P| \\ \end{aligned}$$
$$\begin{aligned}&{\mathop {\le }\limits ^{({\textrm{ii}})}} o(u_{p,k}^n) + \sum _{i \ge \alpha n}^{n} \left( {\begin{array}{c}n\\ i\end{array}}\right) \left( u_{p,k} + o(1) \right) ^i {\mathop {\le }\limits ^{({\textrm{iii}})}} o(u_{p,k}^n) + \left( u_{p,k} + 1 + o(1) \right) ^n \\&= \left( u_{p,k} + 1 \right) ^{n+o(n)}, \end{aligned}$$

where (i) follows since \(|S_P| \le q^{\alpha n}\) for \(|P| \le \alpha n.\) Since \(2 q^{\alpha } = 2 p^{\alpha k} < u_{p,k}\) for every \(\alpha < 1/4\) and \(k \ge 1,\) inequality (ii) holds due to the fact that \(|S_P| \le \left( u_{p,k} + o(1) \right) ^{|P|}.\) Finally inequality (iii) is due to the binomial theorem. Therefore the theorem follows. \(\square \)

We will see that it is possible to improve the bounds given in Theorem 4.3 using the slice rank method for \(q=3^k\) and \(k=2,3,4,5.\)

As done before, let \(S = (g_1, g_2, \ldots , g_{\ell })\) be a sequence of elements in \({\mathbb {F}}_q^n\) with \(q = 3^k\) such that every three elements \(g'_1, g'_2, g'_3\) of S satisfy \(e_{2}(g'_1, g'_2, g'_3) \ne 0.\) Let \(S_1\) be the set obtained from removing the repeated elements in S. By Equation (4.1), we have that \(|S_1| \ge |S|/2.\) Now we split \(S_1\) in \(n+1\) sets \(S_1^0,S_1^1,\ldots , S_1^{n}\) where \(g_i\in S_1^{j}\) if \(g_i\) has exactly j coordinates equal to zero. We note that there exists j such that

$$\begin{aligned} |S_1^{j}|\ge \frac{|S_1|}{n+1} \ge \frac{|S|}{2(n+1)}. \end{aligned}$$

Now, let us recall some definitions and lemmas from [25, 26].

Definition 4.4

A function \(T:A^k\rightarrow {\mathbb {F}}\) is said to be a slice if it can be written in the form

$$\begin{aligned} T(x_1,\ldots ,x_k)=T_1(x_i)T_2(x_1,\ldots ,x_{i-1},x_{i+1},\ldots ,x_k) \end{aligned}$$

where \(T_1:A\rightarrow {\mathbb {F}}\) and \(T_2: A^{k-1}\rightarrow {\mathbb {F}}.\)

Definition 4.5

The slice rank srk(T) of a general function \(T:A^k\rightarrow {\mathbb {F}}\) is the smallest number m such that T is a linear combination of m slices.

Lemma 4.6

([25]). Let A be a finite set and \({\mathbb {F}}\) be a field. Let T(xyz) be a function \(A\times A\times A\rightarrow {\mathbb {F}}\) such that \(T(x,y,z)\not =0\) if and only if \(x=y=z.\) Then \(srk(T) = |A|.\)

In order to apply Lemma 4.6, we want to consider a function that is zero whenever we consider three different elements of \(S_1^j.\) In particular, given \(x,y,z\in {\mathbb {F}}_q^n,\) we consider

$$\begin{aligned} P(x,y,z)=\prod _{i=1}^{n} (1-(x_iy_i+y_iz_i+z_ix_i)^{q-1}). \end{aligned}$$
(4.2)

Lemma 4.7

Let us consider the function P(xyz) on the restricted domain \(S_1^j\times S_1^j\times S_1^j\rightarrow {\mathbb {F}}_q\) where \(q=3^k.\) Then \(P(x,y,z)\not =0\) if and only if \(x=y=z.\)

Proof

Here we have that, if xyz are in \(S_1^j,\) then \(P(x,y,z)\not =0\) if and only if \(x=y=z.\) Indeed, if xy,  and z are three different elements of \(S_1^j,\) they are such that \(xy+yz+zx\not =0\) and hence \(x_iy_i+y_iz_i+z_ix_i\not =0\) for at least one \(i\in [1,n].\) This means that \(1-(x_iy_i+y_iz_i+z_ix_i)^{q-1}=0\) and hence \(P(x,y,z)=0.\)

We note that also if we consider an element \(x\in S_1^j\) repeated twice and \(z\not =x,\) we have that \(P(x,x,z)=0.\) Indeed, since x and z have the same number of zero components, there exists i such that \(x_i\not =0\) and \(x_i\not =z_i.\) Here we have that

$$\begin{aligned} x_ix_i+x_iz_i+z_ix_i=x_i^2+2x_iz_i=x_i(x_i-z_i)\not =0 \end{aligned}$$

since both \(x_i-z_i\) and \(x_i\) are nonzero. It follows that \(P(x,x,z)=0.\) Similarly, we prove that also \(P(z,x,x)=0\) and \(P(x,z,x)=0.\)

Finally, we consider an element x repeated three times. In this case, we have that

$$\begin{aligned} P(x,x,x)=\prod _{i=1}^{n} (1-(x_ix_i+x_ix_i+x_ix_i)^{q-1}) =\prod _{i=1}^{n} (1-(3x_ix_i)^{q-1})=1\not =0. \end{aligned}$$

\(\square \)

As a corollary of Lemmas 4.6 and 4.7, we have that:

Corollary 4.8

$$\begin{aligned} |S|\le 2(n+1)|S_1^{j}|= 2(n+1)srk(P|_{S_1^{j} \times S_1^{j}\times S_1^{j}}). \end{aligned}$$

Now the goal is to upper bound the \(srk(P|_{S_1^{j}\times S_1^{j}\times S_1^{j}}).\) The following lemma will help us to make the first step in this direction.

Lemma 4.9

([25]). Let A be a finite set,  \(A_1\subseteq A,\) and \({\mathbb {F}}\) be a field. Let T(xyz) be a function \(A\times A\times A\rightarrow {\mathbb {F}}.\) Then

$$\begin{aligned} srk(T|_{A_1 \times A_1 \times A_1})\le srk(T). \end{aligned}$$

We immediately get the following corollary:

Corollary 4.10

Considering the function P on the domain \({\mathbb {F}}_{q}^n\times {\mathbb {F}}_{q}^n\times {\mathbb {F}}_{q}^n,\) we have that

$$\begin{aligned} |S|\le 2(n+1)|S_1^{j}|= 2(n+1)srk(P). \end{aligned}$$

Now we aim to prove that srk(P) improves the bound given in Theorem 4.3. For this purpose, we recall the asymptotic rank theory studied by Tao and Sawin in [26] in the special case of polynomial functions (we do not need to consider the very general case of tensor slice rank).

Given a polynomial p(xyz) whose degree in each variables is at most \(\delta ,\) we define \(\Gamma \) as the subset of \(\{0,1,\ldots ,\delta \}^3\) of the triples \((d_1,d_2,d_3)\) such that \(x^{d_1}y^{d_2}z^{d_3}\) has a nonzero coefficient in p. Hence we state the following proposition derived from [26].

Proposition 4.11

Let p(xyz) be a polynomial and let \(\Gamma \) be its support. Then : 

$$\begin{aligned} srk\left( \prod _{i=1}^n p(x_i,y_i,z_i)\right) \le \exp ((H(\Gamma )+o(1))n) \end{aligned}$$

where

$$\begin{aligned} H(\Gamma ):=\sup _{(X_1,\ldots ,X_k)} \min (h(X_1),\ldots ,h(X_k)), \end{aligned}$$

\((X_1,\ldots ,X_k)\) takes values in \(\Gamma ,\) and h(X) is the entropy of the random variable X defined as \(-\sum _{\gamma \in \Gamma '} {\mathbb {P}}[X = \gamma ] \log ({\mathbb {P}}[X = \gamma ])\) and \(\Gamma '\) is the support of X.

In our case, we will not find the exact value of \(H(\Gamma )\) but we will compute it numerically when \(k=2,3,4,5\) solving a convex optimization problem and providing then an upper bound of type \(\exp (H(\Gamma ))^{(n+o(n))}\) where \(\exp (H(\Gamma ))\) is strictly smaller than the bounds given in Theorem 4.3. For this purpose, we will recall the following theorem from [6].

Theorem 4.12

([6, Theorem 8]). Let \(\Gamma \) be a finite subset of \(S \times S \times S\) for some set S and let \(\sigma \in Sym(3)\) be a permutation such that,  for each \(a = (a_1,a_2,a_3) \in \Gamma ,\) also \(\sigma (a)=(a_{\sigma (1)},a_{\sigma (2)},a_{\sigma (3)})\in \Gamma .\) Then there is a random variable Y taking values in \(\Gamma \) such that,  for all \(y\in \Gamma ,\) we have that \({\mathbb {P}}[Y=y] = {\mathbb {P}}[Y=\sigma (y)]\) and

$$\begin{aligned} \min (h(Y_1),h(Y_2),h(Y_3))=H(\Gamma ). \end{aligned}$$

This theorem essentially ensures that the value \(H(\Gamma )\) is attained as a minimum of the entropy of the marginal variables of some random variable Y and that this variable is invariant under permutations that fix \(\Gamma .\) We are now ready to state the following theorem.

Theorem 4.13

Let \(q=3^k,\) then we have that

$$\begin{aligned} EGZ(3,{\mathbb {F}}_{q}^{n},2) \le {\left\{ \begin{array}{ll} 8.315^{n+o(n)} &{} \text {for } k=2, \\ 21.802^{n+o(n)} &{}\text {for } k=3, \\ 58.557^{n+o(n)} &{}\text {for } k=4, \\ 159.812^{n+o(n)} &{}\text {for } k=5. \end{array}\right. } \end{aligned}$$

Proof

We set \(p(x,y,z)=(1-(xy+yz+zx)^{q-1})\) and we consider the following polynomial defined in (4.2):

$$\begin{aligned} P(x,y,z)=\prod _{i=1}^n p(x_i,y_i,z_i). \end{aligned}$$

Hence we can use Proposition 4.11 to evaluate srk(P). For \(q = 9,27,81,243,\) we compute the support \(\Gamma \) of p and then, using Theorem 4.12, we have been able to compute \(H(\Gamma )\) numerically for these cases:

q

9

27

81

243

\(H(\Gamma )\)

2.118

3.082

4.07

5.074

Hence the theorem follows by Corollary 4.10 and Proposition 4.11. \(\square \)

Remark 4.14

One can prove that \(H(\Gamma ) < \log q\) for every \(q = 3^k \ge 9,\) where \(\Gamma \) is the support of the polynomial p(xyz) defined in Theorem 4.13. The reader can find a proof in a previous version of this paper [13].

Remark 4.15

We observe that, in Theorem 4.13, for \(q=3,\) we obtain a weaker bound than for the other cases of q. Indeed, in this case

$$\begin{aligned} \Gamma =\{(0,0,0),(2,2,0),(0,2,2),(2,0,2),(2,1,1),(1,2,1),(1,1,2)\} \end{aligned}$$

and one can easily check that defining Y that has, neatly, the distribution

$$\begin{aligned} (1/4,1/12,1/12,1/12,1/6,1/6,1/6) \end{aligned}$$

over \(\Gamma ,\) \(Y_1,\) \(Y_2,\) and \(Y_3\) all have uniform distributions. It follows that, in this case, \(H(\Gamma )=\log 3\) and hence our proof fails to provide a better upper bound for \(q=3.\) \(\square \)

For the other values of q (i.e., \(q > 243\)), we have not been able to explicitly evaluate \(H(\Gamma )\) since it seems that there are too many variables for this problem to be treated even with the help of a computer.