The Gabrielov–Khovanskii Problem for Polynomials

Aleksandr V. Pukhlikov, Department of Mathematical Sciences, The University of Liverpool,
Liverpool L69 7ZL, UK
pukh@liv.ac.uk

Abstract

We state and consider the Gabrielov–Khovanskii problem of estimating the multiplicity of a common zero for a tuple of polynomials in a subvariety of a given codimension in the space of tuples of polynomials. For a bounded codimension we obtain estimates of the multiplicity of the common zero, which are close to optimal ones. We consider certain generalizations and open questions.

Keywords

1 Introduction

2 Statement of the Problem

In this section we give a precise statement of the Gabrielov–Khovanskii problem for polynomials: we introduce the spaces of tuples of polynomials, bi-invariant subvarieties and multiplicities. Then we give an estimate of the codimension of the set of tuples that vanish on a subset of positive dimension. We define the parameter $\beta$, characterizing a subvariety of tuples of polynomials.

2.1 The Space of Tuples of Polynomials

Fix the complex coordinate space ${\mathbb{A}}={\mathbb{C}}^{N}_{(z_{1},\ldots,z_{N})}$ of dimension $N\geqslant 1$ with coordinates $(z_{*})=(z_{1},\ldots,z_{N})$. For $d\in{\mathbb{Z}}_{+}$ let ${\cal P}_{d,N}$ be the linear space of homogeneous polynomials of degree $d$ in $z_{*}$ (in particular, ${\cal P}_{0,N}={\mathbb{C}}$), and for $e\leqslant d$ set

$\displaystyle{\cal P}_{[e,d],N}=\bigoplus\limits^{d}_{i=e}{\cal P}_{i,N},$

for instance, ${\cal P}_{[1,d],N}$ is the space of (non-homogeneous) polynomials of degree $\leqslant d$ with no free term. On each of these spaces acts the matrix group $G_{1}=GL_{N}({\mathbb{C}})$ of linear changes of coordinates. Fix a tuple of integers

$\displaystyle\underline{d}=(d_{1},\ldots,d_{N}),$

where $2\leqslant d_{1}\leqslant\cdots\leqslant d_{N}$, and set

$\displaystyle{\cal P}(\underline{d})=\prod^{N}_{i=1}{\cal P}_{[1,d_{i}],N}$

to be the space of tuples $(f_{1},\ldots,f_{N})$ of polynomials of degree $\leqslant d_{1},\ldots,d_{N}$, respectively, with no free term. On the space ${\cal P}(\underline{d})$, apart from the above-mentioned group $G_{1}$, act two more groups of transformations, which we will now define. The group $G_{21}$ consists of transformations of the form

$\displaystyle(f_{1},\ldots,f_{N})\mapsto(f^{+}_{1},\ldots,f^{+}_{N}),\quad \quad f^{+}_{i}=f_{i}+\sum^{i-1}_{j=1}s_{i,j}(z)f_{j},$

where $s_{i,j}\in{\cal P}_{[0,d_{i}-d_{j}],N}$ are polynomials, fixed for the given transformation. Set ${\cal D}=\{d_{1}\}\cup\cdots\cup\{d_{N}\}\subset{\mathbb{Z}}_{+}$ and, for $d\in{\cal D}$, let

$\displaystyle n_{d}=\sharp\{i\,|\,d_{i}=d\},$

so that $\sum_{d\in{\cal D}}n_{d}=N$. Now the group $G_{22}$ is defined as the matrix group (realized by block-diagonal matrices whose blocks, of size $n_{d}\times n_{d}$, are ordered by increasing $d$)

$\displaystyle\prod_{d\in{\cal D}}GL_{n_{d}}({\mathbb{C}}),$

acting on the tuples $(f_{*})\in{\cal P}(\underline{d})$ by linear transformations of the form

$\displaystyle(f_{1},\ldots,f_{N})\mapsto(f_{1},\ldots,f_{N})A.$

Let $G_{2}=\langle G_{21},G_{22}\rangle$ be the group of linear transformations of the space ${\cal P}(\underline{d})$, generated by the subgroups $G_{21}$ and $G_{22}$. The group $G_{2}$ is clearly connected, hence irreducible as an algebraic variety. An irreducible subvariety $B\subset{\cal P}(\underline{d})$ (respectively, a map from ${\cal P}(\underline{d})$ to some set) is said to be bi-invariant, if it is invariant with respect to the action of both groups $G_{1}$ and $G_{2}$.

2.2 Multiplicities

For a tuple $\underline{f}=(f_{1},\ldots,f_{N})\in{\cal P}(\underline{d})$ we define the multiplicity $\mu(\underline{f})=\mu(f_{1},\ldots,f_{N})\in{\mathbb{Z}}_{+}\cup\{\infty\}$, setting:

• $\mu(\underline{f})=\infty$, if the closed algebraic set $\{f_{1}=\cdots=f_{N}=0\}$ has a component of positive dimension, containing the point $o=(0,\ldots,0)\in{\mathbb{A}}$,
• $\mu(\underline{f})={\rm dim}{\cal O}_{o,{\mathbb{A}}}/(f_{1},\ldots,f_{N})$, otherwise.
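As a minimal illustration of this definition, consider the simplest case $N=1$: a tuple is a single polynomial $f_{1}=c_{1}z+\cdots+c_{d}z^{d}$, the set $\{f_{1}=0\}$ has a component of positive dimension only if $f_{1}\equiv 0$, and otherwise ${\rm dim}\,{\cal O}_{o,{\mathbb{A}}}/(f_{1})$ is the order of vanishing of $f_{1}$ at $o$. The sketch below covers this one-variable case only; the name `multiplicity_1d` and the encoding of $f_{1}$ by its list of coefficients are illustrative assumptions, not notation from the text.

```python
import math


def multiplicity_1d(coeffs):
    """Multiplicity mu(f) at the origin for N = 1.

    coeffs[k] is the coefficient of z**(k + 1); there is no free term,
    in agreement with the definition of the space P_{[1,d],1}.
    """
    for k, c in enumerate(coeffs):
        if c != 0:
            # The first non-zero coefficient gives the order of vanishing.
            return k + 1
    # The zero polynomial vanishes on the whole line, so mu is infinite.
    return math.inf
```

For example, `multiplicity_1d([0, 3])` corresponds to $f_{1}=3z^{2}$ and returns 2, which is the value $\mu(1)=2$ of Proposition 2.4 in the case $N=1$, $d_{1}=2$.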

Obviously, the function $\mu\colon{\cal P}(\underline{d})\to{\mathbb{Z}}_{+}\cup\{\infty\}$ is bi-invariant. For an arbitrary irreducible subvariety $B\subset{\cal P}(\underline{d})$ set

$\displaystyle\mu(B)={\rm min}_{\underline{f}\in B}\{\mu(\underline{f})\}\in{\mathbb{Z}}_{+}\cup\{\infty\},$

so that $\mu(B)=\mu(\underline{f})$ for a general tuple $\underline{f}\in B$. Furthermore, set

$\displaystyle\mu(a)={\rm max}\{\mu(B)\,|\,{\rm codim}(B\subset{\cal P}( \underline{d}))\leqslant a\},$

that is, the maximum is taken over all irreducible subvarieties of codimension at most $a$. If $\langle B\rangle$ is the bi-invariant span of the subvariety $B$, that is, the smallest bi-invariant subvariety in ${\cal P}(\underline{d})$ containing $B$, then, obviously, $\mu(\langle B\rangle)=\mu(B)$, and moreover ${\rm codim}(\langle B\rangle\subset{\cal P}(\underline{d}))\leqslant{\rm codim}(B\subset{\cal P}(\underline{d}))$, so that the number $\mu(a)$ can be defined as the maximum of the numbers $\mu(B)$ over all bi-invariant irreducible subvarieties $B\subset{\cal P}(\underline{d})$ of codimension at most $a$. This obvious remark will be used in the sequel without special references.

Consider the closed subset

$\displaystyle X_{\infty}=\{\underline{f}\in{\cal P}(\underline{d})\,|\,\mu( \underline{f})=\infty\}$

and set $\chi_{\infty}(\underline{d})={\rm codim}(X_{\infty}\subset{\cal P}(\underline{d}))$. Obviously, $\mu(B)=\infty$ if and only if $B\subset X_{\infty}$, and $\mu(a)=\infty$ if and only if $a\geqslant\chi_{\infty}(\underline{d})$. Consider the irreducible subvariety $X_{\rm line}\subset{\cal P}(\underline{d})$ consisting of the tuples $\underline{f}$ such that

$\displaystyle f_{1}|_{L}\equiv\cdots\equiv f_{N}|_{L}\equiv 0\qquad(2)$

for some line $L\ni o$.

Proposition 2.1.

The following equality holds:

$\displaystyle{\rm codim}(X_{\rm line}\subset{\cal P}(\underline{d}))=1-N+\sum^ {N}_{i=1}d_{i}.$

Proof.

Set $X^{+}_{\infty}=\overline{X_{\infty}\backslash X_{\rm line}}$.

Proposition 2.2.

The following estimate holds:

$\displaystyle{\rm codim}(X^{+}_{\infty}\subset{\cal P}(\underline{d})) \geqslant d_{1}N+1.$

Proof.

We use the technique developed in Pukhlikov ([Pukhlikov2001], Section 3). The space ${\mathbb{A}}$ is considered as embedded in the projective space ${\mathbb{P}}={\mathbb{P}}^{N}_{(x_{0}:\cdots:x_{N})}$ as the affine chart $(x_{0}\neq 0)$, the polynomials $f_{1},\ldots,f_{N}$ are represented by polynomials $F_{1},\ldots,F_{N}$, where $F_{i}(o)=0$. We have to estimate the codimension of the subset of tuples $(F_{*})$, for which there exists an irreducible subvariety $Y\ni o$ of positive dimension, which is not a line and such that $F_{i}|_{Y}\equiv 0$ for all $i=1,\ldots,N$. For every such tuple there is a uniquely determined integer $k\in\{0,1,\ldots,N-1\}$, satisfying the two conditions:

• ${\rm codim}_{o}(\{F_{1}=\cdots=F_{k}=0\}\subset{\mathbb{P}})=k$ (where ${\rm codim}_{o}$ means the codimension in a neighborhood of the point $o$, and for $k=0$ the set $\{F_{1}=\cdots=F_{k}=0\}$ is the whole space ${\mathbb{P}}$),
• the polynomial $F_{k+1}$ vanishes identically on an irreducible component $B$ of the closed set $\{F_{1}=\cdots=F_{k}=0\}$, containing the point $o$, and if $k=N-1$, then $B$ is not a line.

Let

$\displaystyle\alpha_{k}=\sum^{k+1}_{i=1}{d_{i}+N-k\choose d_{i}}-k(N-k)$

be the codimension of the closed set of tuples $(F_{1},\ldots,F_{k+1})$ such that $F_{i}|_{\Lambda}\equiv 0$ for a certain linear subspace $\Lambda\subset{\mathbb{P}}$ of codimension $k$, where $k=0,1,\ldots,N-2$ (in order to see that the codimension of this closed set is indeed equal to $\alpha_{k}$, one argues as in the proof of Proposition 2.1: consider the algebraic set

$\displaystyle\{(\Lambda,\underline{F})\,|\,\underline{F}|_{\Lambda}\equiv 0\} \subset G(k,N)\times\{(\underline{F})\},$

where $G(k,N)$ is the projective Grassmannian of $k$-subspaces in ${\mathbb{P}}^{N}$, and two projections on the direct factors $G(k,N)$ and the space $\{(\underline{F})\}=\{(F_{1},\ldots,F_{N})\}$ of tuples of homogeneous polynomials, introduced above; the obvious details are left to the reader). It is easy to check that $\alpha_{k}\geqslant d_{1}N+1$. Therefore, estimating the codimension of the set of “irregular” tuples $(F_{*})$, we may assume that the irreducible component $B$ of the closed set $\{F_{1}=\cdots=F_{k}=0\}$, on which $F_{k+1}$ vanishes identically, is not a linear subspace. Set

$\displaystyle\beta_{k}={\rm min}_{l\in\{1,\ldots,k\}}[(d_{1}+\cdots+d_{k-l}+d_ {k+1}-(k-l))(N-k+l)+1],$

$k=1,\ldots,N-1$. Now the technique developed in Pukhlikov ([Pukhlikov2001], Section 3) gives the estimate

$\displaystyle{\rm codim}((\overline{X_{\infty}\backslash X_{\rm line}})\subset {\cal P}(\underline{d}))\geqslant{\rm min}_{k\in\{1,\ldots,N-1\}}\beta_{k},$

so that in order to complete the proof of Proposition 2.2 it is sufficient to show that the right-hand side of the last inequality is not smaller than $(d_{1}N+1)$. This is an easy task.

Now replacing in the expression for $\beta_{k}$ the numbers $d_{1},\ldots,d_{k+1}$ by $d=d_{1}$, we get

$\displaystyle\beta_{k}\geqslant{\rm min}_{l\in\{1,\ldots,k\}}[((k-l+1)d-(k-l)) (N-k+l)+1].$

The expression in the square brackets is a quadratic polynomial in $l$ with the leading coefficient $-(d-1)$. Since $d\geqslant 2$, this coefficient is negative, so the minimum is attained at one of the endpoints of the interval $[1,k]$. For $l=k$ we get the value $dN+1$, which is what we want. For $l=1$ we get

$\displaystyle(k(d-1)+1)(N-k+1)+1.$

Here $k\in\{1,\ldots,N-1\}$ and the last expression is again a quadratic polynomial in $k$ with the negative leading coefficient $-(d-1)$, so that the minimum in $k$ is attained at one of the endpoints of the interval $[1,N-1]$. For $k=1$ we get the required value $dN+1$. For $k=N-1$ we get $2d(N-1)-2N+5\geqslant dN+1$, since the difference of the two sides equals $(d-2)(N-2)\geqslant 0$ (recall that $d\geqslant 2$, and $N\geqslant 2$ because $k\geqslant 1$). This completes the proof of Proposition 2.2. $\square$
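The endpoint argument above can be cross-checked by brute force. The following sketch evaluates the bracket $((k-l+1)d-(k-l))(N-k+l)+1$ over all admissible pairs $1\leqslant l\leqslant k\leqslant N-1$ and compares its minimum with $dN+1$. It is only a numerical sanity check on a finite range, not a replacement for the proof; the function names and the tested ranges of $d$ and $N$ are arbitrary illustrative choices.

```python
def bracket(d, N, k, l):
    """Lower bound for beta_k obtained by replacing every d_i with d = d_1."""
    return ((k - l + 1) * d - (k - l)) * (N - k + l) + 1


def min_bracket(d, N):
    """Minimum of the bracket over 1 <= l <= k <= N - 1 (requires N >= 2)."""
    return min(
        bracket(d, N, k, l)
        for k in range(1, N)
        for l in range(1, k + 1)
    )


def check_range(d_max=8, N_max=10):
    """True if the minimum equals d*N + 1 for all 2 <= d <= d_max, 2 <= N <= N_max."""
    return all(
        min_bracket(d, N) == d * N + 1
        for d in range(2, d_max + 1)
        for N in range(2, N_max + 1)
    )
```

The minimum is attained at $l=k$, where the bracket equals $dN+1$, so `check_range()` is expected to return `True`.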

Corollary 2.1.

Assume that

$\displaystyle a\leqslant{\rm min}\left(d_{1}N,\sum^{N}_{i=1}d_{i}-N\right).$

Then the number $\mu(a)$ is finite.

Below we obtain estimates from above for $\mu(a)$, which are close to optimal ones, for the values $a\leqslant N$.
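The bound of Corollary 2.1 combines the two codimensions computed above: $a\leqslant\sum d_{i}-N$ is smaller than the codimension of $X_{\rm line}$ (Proposition 2.1), and $a\leqslant d_{1}N$ is smaller than the lower bound for the codimension of $X^{+}_{\infty}$ (Proposition 2.2). A small sketch of this arithmetic follows; the function names are illustrative, and the tuple `d` is assumed to be sorted with $d_{1}\geqslant 2$, as in Sect. 2.1.

```python
def codim_x_line(d):
    """Codimension of X_line from Proposition 2.1: 1 - N + d_1 + ... + d_N."""
    return 1 - len(d) + sum(d)


def codim_x_plus_lower_bound(d):
    """Lower bound d_1 * N + 1 for the codimension of X^+_infinity (Proposition 2.2)."""
    return d[0] * len(d) + 1


def finiteness_threshold(d):
    """Largest a covered by Corollary 2.1: min(d_1 * N, d_1 + ... + d_N - N)."""
    return min(d[0] * len(d), sum(d) - len(d))
```

For instance, for $\underline{d}=(2,2,3)$ the codimension of $X_{\rm line}$ is 5, the bound for $X^{+}_{\infty}$ is 7, and the corollary guarantees that $\mu(a)$ is finite for $a\leqslant 4$.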

2.3 The Rank of a System of Linear Forms

Let us consider the construction of Example 1.6 more formally.

Example 2.1.

(See [Pukhlikov2004], Section 3.5) For $b\in{\mathbb{Z}}_{+}$ set

$\displaystyle X(b)=\{\underline{f}\in{\cal P}(\underline{d})\,|\,{\rm rk}(df_{ 1}(o),\ldots,df_{N}(o))\leqslant N-b\}.$

For $b\leqslant N$ the set $X(b)$ is non-empty, closed and bi-invariant, and of codimension

$\displaystyle{\rm codim}(X(b)\subset{\cal P}(\underline{d}))=b^{2}.$

For a general tuple $\underline{f}$ in any irreducible component of the set $X(b)$ there is a subset $I\subset\{1,\ldots,N\}$, $\sharp I=N-b$, such that

$\displaystyle{\rm rk}(df_{i}(o)\,|\,i\in I)=N-b.$

Therefore, for any polynomials $g_{j}\in{\cal P}_{[1,d_{j}],N}$, $j\not\in I$, such that

$\displaystyle dg_{j}(o)\in\langle df_{i}(o)\,|\,i\in I\rangle,$

the tuple $(f^{*}_{1},\ldots,f^{*}_{N})$, given by the conditions

• $f^{*}_{i}=f_{i}$ for $i\in I$,
• $f^{*}_{j}=g_{j}$ for $j\not\in I$,

belongs to the same irreducible component of the set $X(b)$, as $\underline{f}$. In other words, the closed algebraic set $Z_{I}(\underline{f})=\{f_{i}=0\,|\,i\in I\}$ in a neighborhood of the point $o\in{\mathbb{A}}$ is a non-singular $b$-dimensional variety, and on the polynomials $f_{j},j\not\in I$, only one condition is imposed: $df_{j}|_{T_{o}Z_{I}(\underline{f})}\equiv 0$. Therefore, for every irreducible component $B\subset X(b)$ we have

$\displaystyle\mu(B)=2^{b}.$

Since ${\rm codim}(B\subset{\cal P}(\underline{d}))\geqslant b^{2}$, we obtain the following estimate for the function $\mu(a)$ from below:

$\displaystyle\mu(a)\geqslant 2^{[\sqrt{a}]}.$

This example motivates introducing a new parameter that characterizes an arbitrary irreducible subvariety $B\subset{\cal P}(\underline{d})$ of codimension $a\in{\mathbb{Z}}_{+}$: set

$\displaystyle\beta(B)=N-\max\limits_{\underline{f}\in B}{\rm rk}(df_{i}(o)\,| \,i=1,\ldots,N).$

Obviously, $\beta(B)={\rm max}\{b\in{\mathbb{Z}}_{+}\,|\,B\subset X(b)\}$. In particular, $\beta(B)\leqslant\sqrt{a}$.

Proposition 2.3.

If $\beta(B)=0$, then $\mu(B)=1$.

Proof.

Proposition 2.4.

The following equality holds: $\mu(1)=2$.

Proof.

3 Reduction to a Smaller Dimension

4 Explicit Estimates for Multiplicities

In this section we obtain estimates for $\mu(a)$ for $a\leqslant N$, which are close to optimal ones. First, we consider the case of a subvariety $B\subset{\cal P}(\underline{d})$ with $\beta(B)=1$ as an example in which it is easy to obtain a sharp estimate from above for $\mu(B)$. Then, using Theorem 3.1, we construct a recursive procedure for estimating the multiplicity, based on controlling two parameters, the codimension $a$ and $b=\beta(B)$. (Recall that $a\geqslant b^{2}$.) First, this procedure is applied to obtain the estimates for small values of the codimension $a\leqslant 49$. After that, we consider the general case: in Sects. 4.3–4.5 we prove estimates from above for $\mu(a)$, where the estimating function grows as $C^{\sqrt{a}}$ for some effectively estimated constant $C>0$.

4.1 Estimating the Multiplicity for $b=1$

Let $B\subset{\cal P}(\underline{d})$ be an irreducible bi-invariant subvariety of codimension $a\leqslant N$.

Proposition 4.1.

Assume that $\beta(B)=1$. Then the inequality $\mu(B)\leqslant a+1$ holds.

Proof.

Remark 4.1.

The estimate in Proposition 4.1 is sharp: for any $a\leqslant N$ there is an irreducible bi-invariant subvariety $B\subset{\cal P}(\underline{d})$ of codimension $a$ with $\beta(B)=1$ and $\mu(B)=a+1$. Indeed, let $B^{\circ}\subset{\cal P}(\underline{d})$ be defined by the conditions

• the equality ${\rm rk}(df_{1}(o),\ldots,df_{N-1}(o))=N-1$ holds, so that the set $C=\{f_{1}=\cdots=f_{N-1}=0\}$ in a neighborhood of the point $o$ is a curve, non-singular at that point,
• the inequality ${\rm ord}_{o}\left(f_{N}|_{C}\right)\geqslant a+1$ holds.

It is easy to see that ${\rm codim}(B^{\circ}\subset{\cal P}(\underline{d}))\leqslant a$, so that for the closure $B$ of the bi-invariant span $\langle B^{\circ}\rangle$ we have, all the more, ${\rm codim}(B\subset{\cal P}(\underline{d}))\leqslant a$, and $\mu(B)\geqslant a+1$. Therefore, in the last two inequalities we have equalities (strict inequalities are impossible by Proposition 4.1).

Let us show now that if the degrees $d_{i}$ are high enough, then for $\beta(B)=1$ the restriction $a\leqslant N$ for the codimension is not needed.

Proposition 4.2.

Assume that $d_{1}\geqslant a+1$ and $\beta(B)=1$. Then $\mu(B)\leqslant a+1$.

Proof.

Remark 4.2.

It seems that the assumption $d_{1}\geqslant a+1$ can be considerably relaxed.

4.2 Estimating Multiplicities for Small Codimensions

Theorem 3.1 shows that in order to estimate the multiplicity $\mu(B)$ one needs to take into account the value of the parameter $\beta(B)=b$. Let $U\subset{\mathbb{Z}}_{+}\times{\mathbb{Z}}_{+}$ be the set $\{(a,b)\,|\,a\geqslant b^{2}\}$. Let us define inductively the function

$\displaystyle\overline{\mu}\colon U\to{\mathbb{Z}}_{+},$

setting $\overline{\mu}(a,0)\equiv 1$, $\overline{\mu}(a,1)\equiv a+1$ and, for $b\geqslant 2$: for $a<b(b+1)$

$\displaystyle\overline{\mu}(a,b)=2\overline{\mu}(a-(2b-1),b-1),$

for $a\geqslant b(b+1)$

$\displaystyle\overline{\mu}(a,b)=\overline{\mu}(a-(2b-1),b-1)+{\rm max}\{ \overline{\mu}(a-(2b-1),b-1),\overline{\mu}(a-b,b)\}.$

Theorem 3.1 immediately implies

Theorem 4.1.

Let $B\subset{\cal P}(\underline{d})$ be an irreducible bi-invariant subvariety of codimension $a\leqslant N$ and $\beta(B)=b$. Then the inequality

$\displaystyle\mu(B)\leqslant\overline{\mu}(a,b)$

holds. In particular, $\mu(a)\leqslant\max_{0\leqslant b\leqslant\sqrt{a}}\overline{\mu}(a,b)$.

For small values of $a$ the function $\overline{\mu}$ is easy to compute by hand, and it is not hard to write a computer program computing it. Below we give the table of values $\overline{\mu}(a,b)$ for $a\leqslant 49$, $b\leqslant 7$. The symbol $*$ means that the pair $(a,b)\not\in U$, so that the value of the function $\overline{\mu}$ is not defined. Already for these small values of the codimension the growth of the values $\overline{\mu}(a,b)$ can be clearly seen. For each $a$ the maximal value $\overline{\mu}(a,b)$ is given in boldface.

$a$	1	2	3	4	5	6	7	8	9	10	11	12	13	14	15	16
$b=0$	1	1	1	1	1	1	1	1	1	1	1	1	1	1	1	1
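$b=1$	$\mathbf{2}$	$\mathbf{3}$	$\mathbf{4}$	$\mathbf{5}$	$\mathbf{6}$	7	8	9	10	11	12	13	14	15	16	17
$b=2$	*	*	*	4	$\mathbf{6}$	$\mathbf{8}$	$\mathbf{11}$	$\mathbf{14}$	$\mathbf{18}$	$\mathbf{22}$	$\mathbf{27}$	$\mathbf{32}$	$\mathbf{38}$	$\mathbf{44}$	$\mathbf{51}$	$\mathbf{58}$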
$b=3$	*	*	*	*	*	*	*	*	8	12	16	22	28	36	44	55
$b=4$	*	*	*	*	*	*	*	*	*	*	*	*	*	*	*	16
$b=5$	*	*	*	*	*	*	*	*	*	*	*	*	*	*	*	*

$a$	17	18	19	20	21	22	23	24	25	26	27	28
$b=0$	1	1	1	1	1	1	1	1	1	1	1	1
$b=1$	18	19	20	21	22	23	24	25	26	27	28	29
$b=2$	66	74	83	92	102	112	123	134	146	158	171	184
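$b=3$	$\mathbf{68}$	$\mathbf{82}$	$\mathbf{99}$	$\mathbf{119}$	$\mathbf{140}$	$\mathbf{165}$	$\mathbf{193}$	$\mathbf{223}$	$\mathbf{257}$	$\mathbf{295}$	$\mathbf{335}$	$\mathbf{380}$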
$b=4$	24	32	44	56	72	88	110	136	164	198	238	280
$b=5$	*	*	*	*	*	*	*	*	32	48	64	88
$b=6$	*	*	*	*	*	*	*	*	*	*	*	*

$a$	29	30	31	32	33	34	35	36	37	38	39
$b=0$	1	1	1	1	1	1	1	1	1	1	1
$b=1$	30	31	32	33	34	35	36	37	38	39	40
$b=2$	198	212	227	242	258	274	291	308	326	344	363
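$b=3$	$\mathbf{429}$	$\mathbf{481}$	$\mathbf{538}$	$\mathbf{600}$	$\mathbf{665}$	$\mathbf{736}$	812	892	978	1070	1166
$b=4$	330	391	461	537	625	726	$\mathbf{841}$	$\mathbf{966}$	$\mathbf{1106}$	$\mathbf{1264}$	$\mathbf{1441}$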
$b=5$	112	144	176	220	272	328	396	476	560	660	782
$b=6$	*	*	*	*	*	*	*	64	96	128	176
$b=7$	*	*	*	*	*	*	*	*	*	*	*

$a$	40	41	42	43	44	45	46	47	48	49
$b=0$	1	1	1	1	1	1	1	1	1	1
$b=1$	41	42	43	44	45	46	47	48	49	50
$b=2$	382	402	422	443	464	486	508	531	554	578
$b=3$	1269	1378	1492	1613	1741	1874	2015	2163	2317	2479
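$b=4$	$\mathbf{1631}$	$\mathbf{1842}$	$\mathbf{2076}$	$\mathbf{2333}$	$\mathbf{2609}$	$\mathbf{2912}$	$\mathbf{3242}$	$\mathbf{3602}$	$\mathbf{3987}$	$\mathbf{4404}$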
$b=5$	922	1074	1250	1452	1682	1932	2212	2528	2893	3313
$b=6$	224	288	352	440	544	656	792	952	1120	1320
$b=7$	*	*	*	*	*	*	*	*	*	128
$b=8$	*	*	*	*	*	*	*	*	*	*
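The inductive definition of $\overline{\mu}$ translates directly into a short program. The sketch below is one possible implementation (not necessarily the one used to produce the table): the names `mu_bar`, `mu_upper` and `table_row` are illustrative, and memoization via `lru_cache` is a convenience choice.

```python
from functools import lru_cache
from math import isqrt


@lru_cache(maxsize=None)
def mu_bar(a: int, b: int) -> int:
    """Value of the function mu-bar on U = {(a, b) : a >= b^2, b >= 0}."""
    if b < 0 or a < b * b:
        raise ValueError(f"({a}, {b}) is not in U")
    if b == 0:
        return 1
    if b == 1:
        return a + 1
    # The pair (a - (2b - 1), b - 1) lies in U because a >= b^2.
    reduced = mu_bar(a - (2 * b - 1), b - 1)
    if a < b * (b + 1):
        # Here a - b < b^2, so the pair (a - b, b) is outside U.
        return 2 * reduced
    return reduced + max(reduced, mu_bar(a - b, b))


def mu_upper(a: int) -> int:
    """Bound of Theorem 4.1: the maximum of mu_bar(a, b) over 0 <= b <= sqrt(a)."""
    return max(mu_bar(a, b) for b in range(isqrt(a) + 1))


def table_row(b: int, a_values) -> list:
    """One row of the table; '*' marks the pairs outside U."""
    return [mu_bar(a, b) if a >= b * b else "*" for a in a_values]
```

For instance, `mu_bar(16, 3)` reproduces the table entry 55, and `mu_upper(49)` returns 4404, the entry in the row $b=4$.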

Now let us consider the problem of obtaining a simple effective upper bound for the multiplicities $\mu(B)$. From the technical viewpoint, one needs to find a simple and visual formalization of the procedure of estimating these numbers in terms of the numbers $\mu(B^{\prime})$ for subvarieties $B^{\prime}\subset{\cal P}(\underline{d}^{\prime})$ in the spaces of truncated tuples $(d^{\prime}_{1},\ldots,d^{\prime}_{N^{\prime}})$ with $N^{\prime}<N$.

4.3 The General Method of Estimating the Multiplicity

The symbol $B$ stands for an irreducible bi-invariant subvariety of codimension $a$ with $\beta(B)=b$. Let us consider the three-letter alphabet $\{A,C_{0},C_{1}\}$. Let ${\cal W}$ be the set of all words in that alphabet, including the empty word $\emptyset$. The length of the word $w$ is denoted by the symbol $|w|\in{\mathbb{Z}}_{+}$. The length of the empty word is equal to zero.

Let us describe a procedure of constructing a sequence of subsets $W_{l}\subset{\cal W}$, $l=0,1,\cdots$. The length of every word

$\displaystyle w\in\mathop{\bigcup}\limits_{l\in{\mathbb{Z}}_{+}}W_{l}$

does not exceed $N$. Set $N(w)=N-|w|\in{\mathbb{Z}}_{+}$. This sequence stabilizes, that is, $W_{l}=W_{l+1}$, starting from some $l=L$. To every word $w\in\cup W_{l}$ we assign a multi-index $\underline{d}(w)\in{\mathbb{Z}}_{+}^{N(w)}$ and an irreducible bi-invariant subvariety

$\displaystyle B[w]\subset{\cal P}(\underline{d}(w))$

of codimension $a(w)$ with $\beta(B[w])=b(w)$.

We start the construction with $W_{0}=\{\emptyset\}$. Set $B[\emptyset]=B\subset{\cal P}(\underline{d})$, where $\underline{d}(\emptyset)=\underline{d}$, so that $a(\emptyset)=a$ and $b(\emptyset)=b$. If $b(\emptyset)=0$, then set $W_{1}=W_{2}=\cdots=W_{0}$: the procedure terminates. Assume that the subsets $W_{0}$,…, $W_{l}$ are already constructed. If for every $w\in W_{l}$ the equality $b(w)=0$ holds, then we set $W_{l+1}=W_{l+2}=\cdots=W_{l}$, terminating the procedure. Otherwise, take any word $w\in W_{l}$ with $b(w)\geqslant 1$. Now apply Theorem 3.1 to the subvariety $B[w]\subset{\cal P}(\underline{d}(w))$ (constructed at a previous step). Consider the words $w_{1}=wA$ and $w_{2}=wC_{i}$, $i\in\{0,1\}$, where $i=1$ in the case of stable reduction and $i=0$, otherwise. Furthermore,

$\displaystyle B[w_{j}]=(B[w])_{j}\subset{\cal P}(\underline{d}^{+}(w)),$

$j=1,2$, in the sense of notations of Theorem 3.1, so that $\underline{d}(w_{j})=(\underline{d}(w))^{+}$ and

$\displaystyle a(w_{j})=(a(w))_{j}={\rm codim}(B[w_{j}]\subset{\cal P}( \underline{d}(w_{j}))),$

$b(w_{1})=b(w)-1$ and $b(w_{2})=b(w)-i$. The inequality

$\displaystyle a(w_{1})\leqslant a(w)-(2b(w)-1)$

holds; in the case of stable reduction the same inequality is satisfied by the second codimension,

$\displaystyle a(w_{2})\leqslant a(w)-(2b(w)-1),$

whereas in the case of non-stable reduction the estimate

$\displaystyle a(w_{2})\leqslant a(w)-b(w)$

holds. In any case, however, $a(w_{j})<a(w)$.

The set of words $W_{l+1}$ is obtained from $W_{l}$ by removing the word $w$ and adding the words $w_{1}$, $w_{2}$:

$\displaystyle W_{l+1}=(W_{l}\setminus\{w\})\cup\{w_{1},w_{2}\}.$

In particular, $\sharp W_{l+1}=\sharp W_{l}+1$. Theorem 3.1 implies that

$\displaystyle\sum_{w\in W_{l}}\mu(B[w])\leqslant\sum_{w\in W_{l+1}}\mu(B[w]).$

Therefore, for every $l$ we have the estimate

$\displaystyle\mu(B)\leqslant\sum_{w\in W_{l}}\mu(B[w]).\qquad(8)$

Proposition 4.3.

The procedure of constructing the sets $W_{l}\subset{\cal W}$ terminates: for some $l=L$ we have $b(w)=0$ for all words $w\in W_{L}$.

Proof.

4.4 An Estimate for the Cardinality of the Set of Words

We will write the words in the following way:

$\displaystyle w=\tau_{1}\cdots\tau_{K},$

where $\tau_{i}\in\{A,C_{0},C_{1}\}$. Now let

$\displaystyle\nu\colon\{A,C_{0},C_{1}\}\to\{A,C\}$

be the map from the three-letter alphabet to the two-letter one, given by the equalities $\nu(A)=A$, $\nu(C_{i})=C$, and

$\displaystyle\nu\colon w=\tau_{1}\cdots\tau_{K}\mapsto\bar{w}=\nu(\tau_{1}) \cdots\nu(\tau_{K})$

the corresponding map of the set of words. Now we have

Lemma 4.1.

For every $i=0,1,\ldots$ the map $\nu|_{W_{i}}$ is injective. In particular, $\nu|_{W}$ is injective.

Proof.

A stronger claim is true: among all words $\bar{w}=\nu(w)$, $w\in W_{i}$, none is a left segment of another. (In particular, no two words are equal, which means the injectivity of the map $\nu|_{W_{i}}$.) The last claim is easy to show by induction. The set $W_{0}$ consists of one word, and for it the claim is trivial. Assume that we have shown it for $W_{i}$, where $i=0,\ldots,e$. If $W_{e+1}=W_{e}$, then there is nothing to prove. If $W_{e+1}\neq W_{e}$, then $W_{e+1}$ is obtained from $W_{e}$ by removing some word $w\in W_{e}$ and adding two words $w_{1}=wA$ and $w_{2}=wC_{\alpha}$, where $\alpha\in\{0,1\}$. For these words we have $\bar{w}_{1}=\bar{w}A$ and $\bar{w}_{2}=\bar{w}C$. Obviously, $\bar{w}_{1}$ and $\bar{w}_{2}$ are not left segments of each other, and no word $\bar{w}^{\prime}$ for $w^{\prime}\in W_{e}\backslash\{w\}$ is a left segment of $\bar{w}_{1}$ or $\bar{w}_{2}$, because otherwise $\bar{w}^{\prime}=\bar{w}_{1}$ or $\bar{w}_{2}$ (since $\bar{w}^{\prime}$ is not a left segment of the word $\bar{w}$ by the inductive assumption), but then $\bar{w}$ would be a left segment of the word $\bar{w}^{\prime}$, contrary to the inductive assumption. In a trivial way $\bar{w}_{1}$ and $\bar{w}_{2}$ are not left segments of the word $\bar{w}^{\prime}$, since otherwise this would have been true for $\bar{w}$ as well, contrary to the inductive assumption. Q.E.D. for the lemma. $\square$

Let $w\in W$ be a word, $w^{\prime}$ its left segment (by the construction of the set $W$ we have

$\displaystyle w^{\prime}\in\mathop{\bigcup}\limits_{l\in{\mathbb{Z}}_{+}}W_{l},$

since $w$ is obtained from the empty word $\emptyset$ by adding letters at the right-hand end when changing from $W_{k}$ to $W_{k+1}$ for certain values $k$), and moreover, $w^{\prime}\neq w$ and $w^{\prime}\tau$ is the left segment of the word $w$ of length $|w^{\prime}|+1$.

Lemma 4.2.

(i) If $\tau=A$ or $C_{1}$, then the inequality

$\displaystyle a(w^{\prime}\tau)\leqslant a(w^{\prime})-(2b(w^{\prime})-1)$

holds and $b(w^{\prime}\tau)=b(w^{\prime})-1$.
(ii) If $\tau=C_{0}$, then the inequality

$\displaystyle a(w^{\prime}\tau)\leqslant a(w^{\prime})-b(w^{\prime})$

holds and $b(w^{\prime}\tau)=b(w^{\prime})$.

Proof.

Besides, the inequality (4) implies that for every word $w\in\bigcup_{l\in{\mathbb{Z}}_{+}}W_{l}$ we have the estimate

$\displaystyle a(w)\geqslant b^{2}(w).\qquad(10)$

Example 4.1.

Let us prove Proposition 4.1 in terms of the formalism developed above. Let $b=b(\emptyset)=1$. Now for every word $w\in W_{i}$ we have the alternative: either $b(w)=0$ (and then $w\in W$), or $b(w)=1$ (and then $a(w\tau)\leqslant a(w)-1$ for any letter $\tau$), so that the set $W$ is of the form

$\displaystyle A,\,\,C_{0}A,\,\,C_{0}C_{0}A,\,\,\ldots,\,\,\underbrace{C_{0}C_{ 0}\cdots C_{0}}_{k}A,\,\,\underbrace{C_{0}\cdots C_{0}}_{k}C_{1},$

where $k+1\leqslant a$. Therefore, $\sharp W\leqslant a+1$, as we claimed above in Sect. 4.1.
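The shape of the set $W$ in this example can be reproduced by a small simulation of the word-building procedure. The sketch below is a model run under explicit assumptions that are not part of the text: every codimension bound of Lemma 4.2 is taken to be an equality, and the letter $C_{0}$ (non-stable reduction) is chosen whenever the constraint (10) still holds afterwards, the letter $C_{1}$ being used otherwise. The names `build_words`, `nu` and `is_prefix_free` are illustrative.

```python
def build_words(a, b):
    """Final words of a model run of the procedure, starting from (a, b).

    Model assumptions (not from the text): equalities in Lemma 4.2, and
    C0 is used as long as a(w) - b(w) >= b(w)**2, otherwise C1.
    """
    if b < 0 or a < b * b:
        raise ValueError("the pair (a, b) must satisfy a >= b^2 >= 0")
    final, stack = [], [((), a, b)]
    while stack:
        word, aw, bw = stack.pop()
        if bw == 0:
            # Words with b(w) = 0 are not extended any further.
            final.append(word)
            continue
        # Letter A: the codimension drops by 2b - 1 and b drops by 1.
        stack.append((word + ("A",), aw - (2 * bw - 1), bw - 1))
        if aw - bw >= bw * bw:
            # Letter C0: the codimension drops by b, and b is unchanged.
            stack.append((word + ("C0",), aw - bw, bw))
        else:
            # Letter C1: the same drop as for the letter A.
            stack.append((word + ("C1",), aw - (2 * bw - 1), bw - 1))
    return final


def nu(word):
    """The map nu: forget the index of the letters C0 and C1."""
    return tuple("A" if letter == "A" else "C" for letter in word)


def is_prefix_free(words):
    """True if the words are distinct and none is a left segment of another."""
    distinct = set(words)
    if len(distinct) != len(words):
        return False
    return not any(w[:i] in distinct for w in distinct for i in range(len(w)))
```

For $b=1$ the model produces exactly the words $A$, $C_{0}A$, ..., $C_{0}^{k}A$, $C_{0}^{k}C_{1}$ with $k=a-1$, that is, $a+1$ words. Applying `is_prefix_free` to the images under `nu` illustrates the claim of Lemma 4.1 in this model.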

Let us come back to the general case. Recall that $a\leqslant N$.

Theorem 4.2.

The following inequality holds:

$\displaystyle\sharp W\leqslant 2^{b}\frac{(a-\frac{b(b-1)}{2})^{b}}{(b!)^{2}}.\qquad(11)$

Proof.

Lemma 4.3.

The following inequality holds:

$\displaystyle\sharp(\Delta\cap{\mathbb{Z}}^{b})\leqslant{\rm vol}(\Delta^{+}).$

Proof.

With every point $x=(x_{1},\ldots,x_{b})\in{\mathbb{R}}^{b}$ we associate the unit cube

$\displaystyle\Gamma(x)=[x_{1},x_{1}+1]\times[x_{2},x_{2}+1]\times\cdots\times[ x_{b},x_{b}+1]\subset{\mathbb{R}}^{b},$

whose vertex with the minimal sum of coordinates $x_{1}+\cdots+x_{b}$ is the point $x$. If $x\in\Delta$, then $\Gamma(x)\subset\Delta^{+}$, since

$\displaystyle b+(b-1)+\cdots+1+a-b^{2}=a-\frac{b(b-1)}{2}.$

Therefore

$\displaystyle\sharp(\Delta\cap{\mathbb{Z}}^{b})=\sum_{x\in\Delta\cap{\mathbb{Z }}^{b}}{\rm vol}(\Gamma(x))={\rm vol}\left(\bigcup_{x\in\Delta\cap{\mathbb{Z}} ^{b}}\Gamma(x)\right)\leqslant{\rm vol}(\Delta^{+}),$

as we claimed. Q.E.D. for the lemma. $\square$

Computing the volume of the polytope $\Delta^{+}$, we complete the proof of Theorem 4.2.

4.5 Some Calculus

The inequality (11) immediately implies the estimate

$\displaystyle\mu(a)\leqslant\mathop{\max}\limits_{1\leqslant b\leqslant[\sqrt{ a}]}v_{b},$

where

$\displaystyle v_{b}=2^{b}\frac{(a-\frac{b(b-1)}{2})^{b}}{(b!)^{2}}.$

We have to estimate the maximum of the sequence $v_{b}$ on the set $\{1,\ldots,[\sqrt{a}]\}$ by a function that depends on the argument $a$ only. We do it in a few steps. Set

$\displaystyle u_{b}=\frac{1}{2\pi b}\left(\frac{2a-b(b-1)}{b^{2}}e^{2}\right)^ {b}.$

Lemma 4.4.

The inequality $v_{b}\leqslant u_{b}$ holds.

Proof.
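A numerical sanity check of Lemma 4.4 (a sketch, not a proof): since $2^{b}(a-\frac{b(b-1)}{2})^{b}=(2a-b(b-1))^{b}$, the inequality $v_{b}\leqslant u_{b}$ is of the form one obtains from Stirling's lower bound $b!\geqslant\sqrt{2\pi b}\,(b/e)^{b}$. The code below evaluates $v_{b}$ exactly and $u_{b}$ in floating point; the function names `v` and `u` and the tested range of $a$ are illustrative choices.

```python
from fractions import Fraction
from math import e, factorial, isqrt, pi


def v(a, b):
    """v_b = 2^b (a - b(b-1)/2)^b / (b!)^2, computed exactly."""
    return Fraction((2 * a - b * (b - 1)) ** b, factorial(b) ** 2)


def u(a, b):
    """u_b = (1 / (2 pi b)) * ((2a - b(b-1)) e^2 / b^2)^b, in floating point."""
    return ((2 * a - b * (b - 1)) * e**2 / b**2) ** b / (2 * pi * b)


def lemma_4_4_holds(a_max=200):
    """Check v_b <= u_b for 1 <= a <= a_max and 1 <= b <= [sqrt(a)]."""
    return all(
        v(a, b) <= u(a, b)
        for a in range(1, a_max + 1)
        for b in range(1, isqrt(a) + 1)
    )
```

For example, `v(49, 4)` equals $3418801/36\approx 94967$, which can be compared with the table value $\overline{\mu}(49,4)=4404$.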

Lemma 4.5.

The sequence $u_{b}$ is increasing if the following inequality holds:

$\displaystyle 2a-b(b-1)\geqslant\frac{5}{2}b^{2}.\qquad(12)$

Proof.

Corollary 4.1.

For $a\geqslant 17$ the value $b_{\rm max}\in\{1,\ldots,[\sqrt{a}]\}$, at which the maximum of the sequence $u_{b}$ is attained, satisfies the inequality

$\displaystyle 2a-b_{\rm max}(b_{\rm max}-1)\leqslant\frac{5}{3}a.$

Proof.

Corollary 4.2.

(i) For $a\geqslant 17$ the following estimate holds:

$\displaystyle\sharp W\leqslant q_{b}=\frac{1}{2\pi b}\left(\frac{5a}{3b^{2}}e^ {2}\right)^{b}.$
(ii) For any $a$ the following estimate holds:

$\displaystyle\sharp W\leqslant w_{b}=\frac{1}{2\pi b}\left(\frac{2a}{b^{2}}e^{ 2}\right)^{b}.$

Proof.

Theorem 4.3.

(i) For $a\geqslant 17$ the following estimate holds:

$\displaystyle\mu(a)\leqslant\frac{e^{2}}{2\pi[\sqrt{a}]}\left(\frac{5}{3}e^{2} \right)^{[\sqrt{a}]}.$
(ii) For any $a$ the following estimate holds:

$\displaystyle\mu(a)\leqslant\frac{e^{2}}{2\pi[\sqrt{a}]}\left(2e^{2}\right)^{[ \sqrt{a}]}.$

Proof.
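To get a feeling for the size of these bounds, the right-hand sides of Theorem 4.3 can be evaluated numerically and set against the lower bound $\mu(a)\geqslant 2^{[\sqrt{a}]}$ of Example 2.1. The sketch below only evaluates the stated formulas; the function names and the flag `sharp` (selecting part (i) instead of part (ii)) are illustrative assumptions.

```python
from math import e, isqrt, pi


def upper_bound(a, sharp=False):
    """Right-hand side of Theorem 4.3: part (i) if sharp is True, else part (ii)."""
    if a < 1:
        raise ValueError("the codimension a must be at least 1")
    if sharp and a < 17:
        raise ValueError("part (i) is stated for a >= 17 only")
    r = isqrt(a)  # the integer part [sqrt(a)]
    base = (5 / 3 if sharp else 2) * e**2
    return e**2 / (2 * pi * r) * base**r


def lower_bound(a):
    """Lower bound 2^[sqrt(a)] for mu(a) from Example 2.1."""
    return 2 ** isqrt(a)
```

Both bounds are exponential in $[\sqrt{a}]$, but with different bases ($2$ from below, $\frac{5}{3}e^{2}$ or $2e^{2}$ from above), which is the gap discussed in the remark that follows.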

Remark 4.3.

As we can see from the given proof, the estimate we obtained is not optimal and can be substantially improved. For $b\approx\sqrt{a}$ we have $2a-b(b-1)\approx a$, so that in the inequality of Theorem 4.3 the expression $(2e^{2})$ can be replaced by $e^{2}$. Furthermore, in the proof of Theorem 4.2 we took into account all possible tuples of positions $(m_{1},\ldots,m_{b})$ and all possible distributions of the letters $A$ and $C_{1}$ into $b$ positions. However, since in the set of words $\overline{W}=\nu(W)$ in the two-letter alphabet $\{A,C\}$ no word is a left segment of another word and the map $\nu\colon W\to\overline{W}$ is one-to-one, for a fixed distribution of the letters $A$ and $C_{1}$ into $b$ positions, such that at least two letters $C_{1}$ follow one another, not all tuples $(m_{1},\ldots,m_{b})\in\Delta\cap{\mathbb{Z}}^{b}$ are realized, since two distinct words $w_{1}\neq w_{2}$, $\{w_{1},w_{2}\}\subset W$ cannot differ only on a segment consisting of the letters $C_{0},C_{1}$. The question of finding a precise upper estimate for the numbers $\mu(a)$, even in the asymptotic sense, remains an open problem.

$\displaystyle\mu_{l,N}(B,\delta)\leqslant\mu_{l,N-1}(B_{1},\delta_{1})+\mu_{l,N-1}(B_{2},\delta_{1})+\mu_{l-1,N-1}(B_{1},\delta_{21})+\mu_{l-1,N-1}(B_{2},\delta_{22}),\qquad(14)$

$\displaystyle a-bN+\sum_{j\in I}\varepsilon(j)-\gamma+(b-1)(N-1)-\sum_{j\in I,j\neq e}\varepsilon(j)=a-N-b-\gamma+\varepsilon(e)+1.$

$\displaystyle a-bN+\sum_{j\in I}\varepsilon(j)-\gamma-(N-1)+\varepsilon(m)+b(N-1)-\sum_{j\in I,j\neq e,j<m}\varepsilon(j)-\sum_{j\in I,j\neq e,j>m}(\varepsilon(j)-1)-\varepsilon(m)=a-b-\gamma-(N-1)+\varepsilon(e)+\sharp\{j\in I\backslash\{e\}\,|\,j>m\}.$
