1 Introduction
A well-studied subject in number theory is the distribution of roots of polynomials. This tradition includes, for example, Descartes’ rule of signs, the Gauss–Lucas theorem and the Schinzel–Zassenhaus conjecture. Here, we study the relationship between the separation of a polynomial (the minimal distance between its roots) and its Mahler measure. To be precise, we now define these terms.
Definition 1.1. Given a polynomial
$f(x) \in \mathbb {C}[x]$
with roots
$\alpha _1,\ldots ,\alpha _n \in \mathbb {C}$
, the separation of
$f(x)$
is the quantity
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu1.png?pub-status=live)
Since, for any nonzero
$b \in \mathbb {C}$
,
$\operatorname {sep}(f) = \operatorname {sep}(bf)$
, we assume in the remainder of the paper that all polynomials are monic. Our results can be easily adapted otherwise.
Definition 1.2. Given a monic polynomial
$f(x) \in \mathbb {C}[x]$
with roots
$\alpha _1,\ldots ,\alpha _n \in \mathbb {C}$
, the Mahler measure of
$f(x)$
is the quantity
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu2.png?pub-status=live)
These two quantities are useful for rather different reasons. Knowledge about root separation is helpful for writing algorithms that compute the number of real roots of a polynomial (see Koiran [Reference Koiran13]), for bounding the number of solutions to Thue equations (see Grundman and Wisniewski [Reference Grundman and Wisniewski10]) and for deriving lower bounds on the absolute values of certain products of algebraic integers (see Albayrak et al. [Reference Albayrak, Ghosh, Knapp and Nguyen1]).
The Mahler measure is a versatile tool for measuring the complexity of a polynomial. On the one hand, the Mahler measure contains information about the roots of a polynomial via its definition. On the other hand, the Mahler measure contains information about the coefficients of the polynomial: for any polynomial
$f(x) \in \mathbb {C}[x]$
of degree n,
$M(f) \asymp _n H(f)$
, where
$H(f)$
is the maximum absolute value of its coefficients [Reference Bombieri and Gubler3, Lemma 1.6.7]. Additionally, the Mahler measure preserves algebraic information about polynomials since it is multiplicative. These facts together make the Mahler measure valuable for translating information about the roots of a polynomial into information about its coefficients, and vice versa. The Mahler measure is the main object in Lehmer’s conjecture (see [Reference Lehmer14]). Smyth’s excellent survey [Reference Smyth18] contains more information on the use and study of the Mahler measure.
In general, it is valuable to find effective lower bounds on polynomial root separation. After all, separation gives the minimum distance between distinct roots of a polynomial, so a lower bound on separation gives a lower bound on the distance between any two roots of that polynomial. Indeed, this is how separation is used in [Reference Albayrak, Ghosh, Knapp and Nguyen1, Reference Grundman and Wisniewski10, Reference Koiran13]. A foundational lower bound on separation is the following result of Mahler. It relies on the discriminant, defined for monic
$f(x) \in \mathbb {C}[x]$
with roots
$\alpha _1,\ldots ,\alpha _n$
to be
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu3.png?pub-status=live)
Theorem 1.3 (Mahler, [Reference Mahler15]).
Let
$f(x) \in \mathbb {C}[x]$
be separable of degree
$n \geqslant 2$
with discriminant
$D(f)$
. Then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu4.png?pub-status=live)
in particular, if
$f(x) \in \mathbb {Z}[x]$
, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu5.png?pub-status=live)
This particular result has spurred much investigation, as it leaves many questions open. Is the exponent of
$n-1$
on
$M(f)$
optimal? This remains unknown, although Bugeaud and Dujella [Reference Bugeaud and Dujella5] show that the optimal exponent is at least
$(2n-1)/3$
for
$f \in \mathbb {Z}[x]$
, while Evertse [Reference Evertse9] considers other ways to improve the exponent. Is the separability hypothesis necessary? Rump removes the separability hypothesis in [Reference Rump17], and Dujella and Pejković improve his result in [Reference Dujella and Pejković8]. Can better results be obtained with additional hypotheses (for example, if
$f(x)$
is assumed to be irreducible and/or monic)? Many papers have looked at these kinds of questions, including [Reference Bugeaud and Dujella4, Reference Dujella and Pejković8, Reference Koiran13]. What if we look at distances between the absolute values of the roots, rather than distances between the roots themselves? Bugeaud, Dujella, Fang, Pejković and Salvy look at these questions in [Reference Bugeaud, Dujella, Fang, Pejković and Salvy6, Reference Bugeaud, Dujella, Pejković and Salvy7].
Although lower bounds on separation have been well studied, the authors are not aware of any attempt to provide upper bounds on separation. On the face of it, an upper bound on separation provides less intuitive information about a polynomial’s roots than does a lower bound. However, upper bounds on separation can provide us with useful information for Lehmer’s conjecture, which we will describe after our main theorem.
One can easily obtain a trivial upper bound on separation using the triangle inequality:
$\operatorname {sep}(f) = \min _{\alpha _i \neq \alpha _j} |\alpha _i - \alpha _j| \leqslant 2M(f).$
Combining Mahler’s inequality
$|D(f)| \leqslant n^nM(f)^{2n-2}$
[Reference Mahler15, Theorem 1] with the fact that
$\operatorname {sep}(f)^{n(n-1)} \leqslant |D(f)|$
when
$f(x)$
is separable gives the improved upper bound
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu6.png?pub-status=live)
for separable polynomials
$f(x) \in \mathbb {C}[x]$
. However, numerical experiments conducted by the first author at the outset of this project in [Reference Knapp12] suggested that this upper bound is still suboptimal, and, indeed, we can prove the following result.
Theorem 1.4. Let
$f(x) \in \mathbb {C}[x]$
be monic and separable of degree
$n \geqslant 2$
. Then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu7.png?pub-status=live)
If, in addition,
$f(x) \in \mathbb {R}[x]$
and one of the roots of
$f(x)$
with minimum absolute value is not real (for example, if
$f(x) \in \mathbb {R}[x]$
has no real roots), then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu8.png?pub-status=live)
In both cases, the dependence of the bound on n is optimal, although the constant 34 could possibly be improved.
Our next result finds the optimal constant for certain values of the signature of
$f(x)$
. A polynomial
$f(x) \in \mathbb {R}[x]$
has signature
$(t,s)$
if it has t real roots and s pairs of complex conjugate roots.
Theorem 1.5. Suppose that
$f(x) \in \mathbb {R}[x]$
is monic and separable of degree
$n \geqslant 2$
and signature
$(t,s)$
, where
$s = 0$
,
$(t,s) = (1,1)$
or
$(t,s) = (0,2)$
. Then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu9.png?pub-status=live)
where
$\delta $
is
$1$
if
$t \neq 0$
and
$0$
otherwise, and
$C_{t,s}$
is defined in the following table.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu10.png?pub-status=live)
It is worth noting that each of our main theorems produces the same bounds on absolute separation, defined in [Reference Bugeaud, Dujella, Fang, Pejković and Salvy6] to be
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu11.png?pub-status=live)
since we have
$\operatorname {abs\, sep}(f) \leqslant \operatorname {sep}(f)$
for all f.
With our main theorems described, we can detail the connection to Lehmer’s conjecture.
Conjecture 1.6 (Lehmer’s conjecture).
There exists an absolute constant
$\mu> 1$
so that, for every irreducible, noncyclotomic
$f(x) \in \mathbb {Z}[x]$
of degree at least two,
$M(f) \geqslant \mu $
.
Observe that Theorem 1.3 provides a lower bound on
$M(f)$
that is useful when
$\operatorname {sep}(f)$
is small: that is,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu12.png?pub-status=live)
On the other hand, Theorem 1.4 provides a lower bound on
$M(f)$
that is useful when
$\operatorname {sep}(f)$
is large: that is,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu13.png?pub-status=live)
Hence, proving Lehmer’s conjecture reduces to the case where
$f(x)$
satisfies
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu14.png?pub-status=live)
2 The optimal upper bound up to constant factor
The main goal of this section is to prove Theorem 1.4.
Proof of Theorem 1.4.
Write
$f(x) = \prod _{i=1}^n (x - \alpha _i)$
, where
$|\alpha _1| \leqslant |\alpha _2| \leqslant \cdots \leqslant |\alpha _n|$
.
We first prove that
$\operatorname {sep}(f) \leqslant 2M(f)^{1/(n-1)}$
. Note that, for any
$j \geqslant 2$
, we have
$0 < |\alpha _j - \alpha _1| \leqslant 2|\alpha _j|$
. As a consequence,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu15.png?pub-status=live)
and the claim follows.
Next, we show that if
$f(x) \in \mathbb {R}[x]$
and if there exists
$\alpha _j \notin \mathbb {R}$
with
$|\alpha _j| = |\alpha _1|$
, then, actually,
$\operatorname {sep}(f) \leqslant 2M(f)^{1/n}$
. The argument is very similar, except we may now assume that
$\alpha _1$
and
$\alpha _2$
are complex conjugates. In this case,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu16.png?pub-status=live)
and the claim again follows.
Now we return to the situation where we only assume that
$f(x) \in \mathbb {C}[x]$
is separable of degree n, and we aim to show that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu17.png?pub-status=live)
Since
${34}/{\sqrt {n}} \geqslant 2$
for
$n \leqslant 289$
, we assume that
$n \geqslant 290$
in the following discussion.
Let
$r = \operatorname {sep}(f)/2$
and, for any
$R \geqslant 0$
, let
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu18.png?pub-status=live)
Additionally, for any
$z \in \mathbb {C}$
, let
$B_R(z)$
denote the open ball of radius R centred at z. We may safely assume that
$r \geqslant {1}/{\sqrt {n}}$
, as the theorem easily follows otherwise.
Next, observe that, for any
$i \neq j$
,
$B_r(\alpha _i)$
and
$B_r(\alpha _j)$
are disjoint. Additionally, for any
$R> 0$
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu19.png?pub-status=live)
As a consequence,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu20.png?pub-status=live)
which implies that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqn1.png?pub-status=live)
Now, for each positive integer
$j \leqslant \lceil \log _2(n)\rceil - 1 =: L$
(here, we use the assumption that
$n \geqslant 3$
to obtain
$L \geqslant 1$
), define
$R_j = r(\sqrt {n/2^j} - 1)$
. Note that
$R_j> 0$
for
$j \leqslant L$
. For each j, we aim to bound from below the number of roots
$\alpha _i$
that satisfy
$R_j < |\alpha _i|$
.
Observe that
$N(R_j) < n/2^j$
by (2.1), and hence, for each j with
$1 \leqslant j \leqslant L$
, there must be at least
$n(2^j - 1)/2^j$
roots
$\alpha _i$
that satisfy
$|\alpha _i| \geqslant R_j$
. Therefore,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqn2.png?pub-status=live)
The remainder of this proof will be dedicated to showing that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu21.png?pub-status=live)
First, note that, for any
$j \leqslant L$
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu22.png?pub-status=live)
Since
$j \leqslant L$
, we have
$n^{-1/2} \leqslant 2^{-j/2}$
, which implies that
$2^{-j/2} + n^{-1/2} \leqslant 2^{-j/2 + 1}$
. Hence,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu23.png?pub-status=live)
We now make slightly different estimates for
$j = L$
and for
$j < L$
. If
$j = L$
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu24.png?pub-status=live)
If
$j < L$
, we use the fact that
$2^L < n \leqslant 2^{L+1}$
to find that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu25.png?pub-status=live)
Consequently,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu26.png?pub-status=live)
The series in the above exponents converge, with
$\sum _{j \geqslant 1} 2^{-j} = 1$
and
$\sum _{j \geqslant 1} j \cdot 2^{-(j+1)} = 1$
. Hence, since
$n \leqslant 2^{L+1}$
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu27.png?pub-status=live)
Finally, we can return to inequality (2.2) and we acquire
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqn3.png?pub-status=live)
Since
$N(R_L) < n/2^L \leqslant 2$
and since
$N(R_L)$
is an integer, we actually have
$N(R_L) \leqslant 1$
. Since we have assumed that
$r \geqslant 1/\sqrt {n}$
, inequality (2.3) implies that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu28.png?pub-status=live)
and we get
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu29.png?pub-status=live)
where we use the fact that
$n \geqslant 290$
in the final inequality.
Note that the additional, stronger statement for
$f(x) \in \mathbb {R}[x]$
also follows. If
$f(x)$
has a nonreal root among its roots of minimum absolute value, then we cannot have
$N(R_L) = 1$
, so we must have
$N(R_L) = 0$
. Hence, inequality (2.3) implies that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu30.png?pub-status=live)
and the theorem statement again follows.
We now demonstrate that both bounds are optimal except possibly for the constant 34. We do this by constructing a family of polynomials whose separation nearly attains the upper bound which we have now proved.
Before we do this, we note the following fact about the Gaussian integers. For
$r \geqslant 0$
, let
$G(r)$
denote the number of Gaussian integers
$a + bi$
for
$a,b \in \mathbb {Z}$
that satisfy
$|a + bi| \leqslant r$
. A well-known result of Gauss (see, for example, [Reference Berndt, Kim and Zaharescu2, page 101]) states that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu31.png?pub-status=live)
Fix an integer
$n \geqslant 2$
and a real number
$t \geqslant 1$
. Set
$R = \sqrt {n/\pi } + \sqrt {2}$
, which implies that there are at least n Gaussian integers
$a + bi$
with
$|a + bi| \leqslant R$
. Select n distinct Gaussian integers
$\alpha _1,\ldots ,\alpha _n$
so that:
-
(1)
$\alpha _1 = 0$ ; and
-
(2)
$|\alpha _j| \leqslant R$ for
$1 \leqslant j \leqslant n$ .
Now define
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu32.png?pub-status=live)
For
$j \geqslant 2$
, we have
$|\alpha _j| \geqslant 1$
, which implies that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu33.png?pub-status=live)
and that
$\operatorname {sep}(f_t) \geqslant M(f_t)^{1/(n-1)}/{1.6\sqrt {n}}.$
Hence, the first upper bound on separation in Theorem 1.4 has the best possible dependence on n.
For the remaining examples, we take
$R = \sqrt {(n+1)/\pi } + \sqrt {2}$
and we choose n distinct Gaussian integers
$\beta _1,\ldots ,\beta _n$
so that:
-
(1)
$0 < |\beta _j| \leqslant R$ for
$1 \leqslant j \leqslant n$ ;
-
(2) the set
$\{\beta _1,\ldots ,\beta _n\}$ is closed under complex conjugation; and
-
(3) there exists
$\beta _j \notin \mathbb {R}$ so that
$|\beta _j| = \min _k|\beta _k|$ .
Observe that condition (1) together with the fact that the
$\beta _k$
are Gaussian integers implies that
$\min _k |\beta _k| \geqslant 1$
. For real
$t \geqslant 1$
, we define
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu34.png?pub-status=live)
and now we have
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu35.png?pub-status=live)
which implies that
$\operatorname {sep}(g_t) \geqslant M(g_t)^{1/n}/{1.7\sqrt {n}}$
. Hence, the second upper bound on separation in Theorem 1.4 has the best possible dependence on n.
We remark that the constant factors of 1.6 and 1.7 can be improved by replacing the square lattice of Gaussian integers with the hexagonal lattice, and with some extra work using partial summation to find a more efficient upper bound on
$M(f_t)$
and
$M(g_t)$
.
3 Improving the constant factor
Our main goal of this section is to prove Theorem 1.5, which we will do in stages.
Proposition 3.1. Let
$f(x) \in \mathbb {R}[x]$
be monic and separable with degree
$n \geqslant 4$
. Suppose, further, that all n of the roots of f are real. Then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu36.png?pub-status=live)
Moreover, as
$n \to \infty $
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqn4.png?pub-status=live)
and the constant
$2e$
is asymptotically sharp.
Proposition 3.2. Let
$f(x) \in \mathbb {R}[x]$
be monic and separable with degree three. If
$f(x)$
has exactly one real root, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu37.png?pub-status=live)
Moreover, the constant
$\sqrt {3}$
is optimal.
Proposition 3.3. Let
$f(x) \in \mathbb {R}[x]$
be monic and separable with degree four. If
$f(x)$
has no real roots, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu38.png?pub-status=live)
Moreover, the constant
$\sqrt {2}$
is optimal.
These propositions cover each of the cases stated in Theorem 1.5 except the case where the signature is
$(2,0)$
or
$(3,0)$
, when the result follows from Theorem 1.4.
We first prove a lemma on bounding binomial coefficients.
Lemma 3.4. For any positive integer
$n \geqslant 3$
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu39.png?pub-status=live)
Proof. It is known that, for any positive integer
$\ell $
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu40.png?pub-status=live)
(see, for example, [Reference Johnson11]). If n is even,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu41.png?pub-status=live)
If n is odd, say
$n = 2k + 1$
, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu42.png?pub-status=live)
and one can now check that
${n}/{((n+1)\sqrt {2n-1})} \leqslant {1}/{\sqrt {2n + 1}}$
to complete the proof.
Proof of Proposition 3.1.
Denote the closest root of
$f(x)$
to 0 by
$\alpha $
and let
$r = \operatorname {sep}(f)$
. Suppose that there are s roots of
$f(x)$
which are less than
$\alpha $
and t roots of
$f(x)$
which are greater than
$\alpha $
. Define
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu43.png?pub-status=live)
and observe that
$M(f) \geqslant M(g)$
because the roots of g are no further from the origin than the corresponding roots of f. Furthermore,
$\operatorname {sep}(f) = r = \operatorname {sep}(g)$
, so it suffices to prove the proposition for
$g(x)$
.
We first consider the case where all roots of g have the same sign. In this case, we can assume that
$\alpha \geqslant 0$
without loss of generality. Then the roots of g are simply given by
$\alpha , \alpha +r, \ldots , \alpha +(n-1)r$
. Thus,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu44.png?pub-status=live)
It follows that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu45.png?pub-status=live)
Next, we consider the case where not all roots of g have the same sign. Let
$\beta $
be the closest root of
$g(x)$
to the origin and write
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu46.png?pub-status=live)
Since
$\beta $
is the closest root of
$g(x)$
to 0 and since
$g(x)$
has a root with the opposite sign to
$\beta $
, we find that
$|\beta | \leqslant r/2$
. We may also assume, without loss of generality, that
$\beta \leqslant 0$
(otherwise, we may apply the same proof to
$g(-x)$
). We now have
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqn5.png?pub-status=live)
Getting an understanding of the lower bound given in inequality (3.2) will require use of the gamma function. We use the fact that
$\Gamma (\tfrac 12) = \sqrt {\pi }$
together with the usual fact that
$\Gamma (z) = (z - 1)\Gamma (z-1)$
to note that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu47.png?pub-status=live)
By Wendel’s inequality [Reference Wendel19], if m is a nonnegative integer, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu48.png?pub-status=live)
Thus,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu49.png?pub-status=live)
Now we have
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqn6.png?pub-status=live)
from inequality (3.2) and we aim to estimate the right-hand side of this inequality from below in terms of n. We have the restrictions
$0 \leqslant S,T \leqslant n$
and
$S + T + 1 = n$
, so we can replace
$S + 1$
in equation (3.3) by
$n - T$
to find
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqn7.png?pub-status=live)
Using Lemma 3.4 and Robbins’ bound
$n!>\sqrt {2\pi n} (n/e)^n e^{1/(12n+1)}$
[Reference Robbins16] gives
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu50.png?pub-status=live)
by the fact that
$n \geqslant 4$
.
Finally, we return to inequality (3.3) to find that that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu51.png?pub-status=live)
and we can conclude that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu52.png?pub-status=live)
Asymptotically, using Stirling’s approximation
$n! \sim \sqrt {2\pi n} (n/e)^n$
, we can combine inequality (3.3) and inequality (3.4) to deduce that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu53.png?pub-status=live)
as
$n \to \infty $
. It follows that
$\operatorname {sep}(g)\leqslant ({(2e+o(1))}/{n}) M(g)^{{1}/{(n-1)}}$
as
$n \to \infty $
.
Finally, we show that the constant
$2e$
is asymptotically sharp in inequality (3.1). Let
$r>1$
be fixed. For each integer
$n \geqslant 4$
, set
$f_n(x)=\prod _{j=-m}^{m} (x-jr)$
if
$n=2m+1$
is odd, and
$f_n(x)=\prod _{j=-m}^{m+1} (x-jr)$
if
$n=2m+2$
is even. Stirling’s approximation then yields
$\operatorname {sep}(f_n)=({(2e+o(1))}/{n}) M(f_n)^{{1}/{(n-1)}}$
as
$n \to \infty $
.
Next, we consider cubic polynomials that have only one real root.
Proof of Proposition 3.2.
Suppose that the real root of
$f(X)$
is
$\alpha $
; without loss of generality, we may assume that
$\alpha \geqslant 0$
. Let
$\beta $
denote the complex root of
$f(X)$
with positive imaginary part. Write
$\beta = x + iy$
for
$x,y \in \mathbb {R}$
.
We claim that we may assume that
$x \leqslant 0$
. If not, then set
$\beta ' = -x + iy$
and
$g(X) = (X - \alpha )(X - \beta ')(X - \overline {\beta '})$
. Since
$\operatorname {sep}(g) \geqslant \operatorname {sep}(f)$
and
$M(g) = M(f)$
, proving the proposition for
$g(X)$
will prove it for
$f(X)$
. Hence, we only need to prove the proposition under the assumption that
$x \leqslant 0$
.
Let
$R = |\beta |$
. Note that, if
$y \leqslant {\sqrt {3}R}/{2}$
, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu54.png?pub-status=live)
and the proof is complete. Hence, for the rest of the proof, assume that
$y> {\sqrt {3}R}/{2}$
. In particular, this implies that
$y> -x\sqrt {3}$
.
Our next goal is to reduce to the case where
$\alpha $
is ‘small’. Let t be the unique value in
$\mathbb {R}$
for which the points
$t,\beta ,\overline {\beta }$
form an equilateral triangle. One can check that
$t = \sqrt {3}y + x$
. Note that
$t> -x\sqrt {3} + x = -x(\sqrt {3}-1) \geqslant 0$
. Now, if
$\alpha \geqslant t$
, set
$h(X) = (X - t)(X - \beta )(X - \overline {\beta })$
; we have
$\operatorname {sep}(f) = 2y = \operatorname {sep}(h)$
and
$M(f) \geqslant M(h)$
, so it suffices to prove the proposition for h. Hence, we may assume that
$0 \leqslant \alpha < t$
.
Note that
$M(f)=\max \{1, \alpha \} \cdot \max \{1, x^2+y^2\}$
and
$\operatorname {sep}(f)=|\alpha -\beta |=\sqrt {(\alpha -x)^2+y^2}$
. Our goal is to show that
$M(f)/\operatorname {sep}(f)^2 \geqslant \tfrac 13$
. We divide our proof into two cases.
Case 1:
$x^2 + y^2 \geqslant 1$
. Within this case, we have two subcases. First, assume that
$\alpha < 1$
. Then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu55.png?pub-status=live)
We aim to show that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu56.png?pub-status=live)
or, equivalently, that
$2(x^2 + y^2) \geqslant 1 - 2x.$
If
$-1/2 \leqslant x \leqslant 0$
, then this follows immediately from the fact that
$x^2 + y^2 \geqslant 1$
. If
$x < -1/2$
, then we can use the fact that
$y> -x\sqrt {3}$
to deduce that
$2(x^2 + y^2) \geqslant 8x^2> 1 - 2x.$
Next, assume that
$1 \leqslant \alpha \leqslant t$
. Then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu57.png?pub-status=live)
Observe that the denominator, as a function of
$\alpha $
, will be maximized at either
$\alpha = 1$
or
$\alpha = t$
. In the case of
$\alpha = 1$
, we have already shown that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu58.png?pub-status=live)
So it remains to check that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu59.png?pub-status=live)
First, observe that, since
$t = x + y\sqrt {3}$
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu60.png?pub-status=live)
Next, observe that
$(t^3 - 2\sqrt {3}t^2y+4ty^2)/(4y^2)$
is a nondecreasing function of t since its partial derivative with respect to t is
$3(t - 2y/\sqrt {3})^2/(4y^2)$
. Note, additionally, that, since
$y \geqslant -x\sqrt {3}$
,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu61.png?pub-status=live)
Since
$(t^3 - 2\sqrt {3}t^2y+4ty^2)/(4y^2)$
is nondecreasing in t and
$t \geqslant ({2}/{\sqrt {3}})y$
, we must have
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu62.png?pub-status=live)
But the fact that
$x^2 + y^2 \geqslant 1$
together with the assumption that
$y \geqslant -x\sqrt {3}$
imply that
$y \geqslant \sqrt {3}/2$
, and hence we can finally conclude that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu63.png?pub-status=live)
Case 2:
$x^2 + y^2 < 1$
. Now note that, since
$x^2 + y^2 < 1$
and
$y \geqslant -x\sqrt {3}$
, we must have
$x \geqslant -1/2$
. Again, we have two subcases. First, if
$\alpha < 1$
, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu64.png?pub-status=live)
and the proof is complete.
Now assume that
$1 \leqslant \alpha \leqslant t$
. The fact that
$t \geqslant 1$
implies that
$y = (t-x)/\sqrt {3} \geqslant 1/\sqrt {3}$
. In this case,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu65.png?pub-status=live)
which is again minimised when
$\alpha = 1$
or
$\alpha = t$
. We have already shown that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu66.png?pub-status=live)
so it remains to show that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu67.png?pub-status=live)
If
$y \leqslant {\sqrt {3}}/{2}$
, then
${t}/{4y^2}\geqslant {t}/{3} \geqslant {1}/{3}$
. If
$y \geqslant \sqrt {3}/2$
, then we use the fact that
$x \geqslant -\sqrt {1-y^2}$
to get
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu68.png?pub-status=live)
We can see that the constant
$\sqrt {3}$
is optimal for the polynomial
$f(x) = x^3 - 1$
.
Finally, we turn our attention to the case where
$f(x)$
is a quartic with no real roots.
Proof of Proposition 3.3.
Suppose that the roots of
$f(x)$
are
$\alpha ,\bar {\alpha },\beta ,\bar {\beta }$
where
${\mathrm {Im}}[\alpha ]$
,
${\mathrm {Im}}[\beta ]> 0$
and
$|\alpha | = r \leqslant |\beta | = R$
. Note that
$\operatorname {sep}(f) = \min \{2{\mathrm { Im}}[\alpha ],2{\mathrm {Im}}[\beta ], |\alpha - \beta |\}$
. We have the following two cases.
Case 1:
$2 r \leqslant R$
. In this case, we note that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu69.png?pub-status=live)
Case 2:
$r \leqslant R < 2r$
. In this case, we first observe that if
${\mathrm {Im}}[\alpha ] < \tfrac 12{\sqrt {2}}r^{1/2}R^{1/2}$
or if
${\mathrm {Im}}[\beta ] < \tfrac 12\sqrt {2}r^{1/2}R^{1/2}$
, then the proof is complete because
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu70.png?pub-status=live)
Hence,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu71.png?pub-status=live)
and
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu72.png?pub-status=live)
As a result,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu73.png?pub-status=live)
We claim that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu74.png?pub-status=live)
To see this, divide both sides of the inequality by
$r^{1/2}R^{1/2}$
to obtain the equivalent inequality
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu75.png?pub-status=live)
Observe that this new inequality depends only on the ratio
$x = {R}/{r}$
, which we have bounded by
$1 \leqslant x < 2.$
It is now a simple calculus problem to show that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu76.png?pub-status=live)
for
$1 \leqslant x < 2$
, and the proof that
$\operatorname {sep}(f) \leqslant \sqrt {2}M(f)^{1/4}$
is complete.
To see that the bound is sharp, consider the family of polynomials
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250117055619557-0594:S0004972724001199:S0004972724001199_eqnu77.png?pub-status=live)
for
$t \in \mathbb {R}$
with
$t \geqslant 1/\sqrt {2}$
. Every polynomial in this family has
$\operatorname {sep}(f_t) = \sqrt {2}M(f_t)^{1/4}.$
Acknowledgements
The first author would like to thank Chris Sinclair and Joe Webster for their helpful suggestions regarding the development of this project. The second author thanks Gabriel Currier and Kenneth Moore for helpful discussions. Both authors would like to thank Andrej Dujella and Tomislav Pejković for their comments on an early draft of this paper. The authors are also grateful to anonymous referees for their valuable comments and corrections.