1. Introduction
In 1965, Erdős and Pósa [Reference Erdős and Pósa6] showed that every graph
$G$
either contains
$k$
vertex-disjoint cycles or contains a set
$X$
of
$\mathcal{O}(k \log k)$
vertices such that
$G-X$
has no cycles. The
$\mathcal{O}(k \log k)$
bound on the size of
$X$
is best possible up to a constant factor. Using their Grid Minor Theorem, Robertson and Seymour [Reference Robertson and Seymour9] proved the following generalisation: for every planar graph
$H$
, there exists a function
$f_H(k)$
such that every graph
$G$
contains either
$k$
vertex-disjoint subgraphs each having an
$H$
minor, or a set
$X$
of at most
$f_H(k)$
vertices such that
$G-X$
has no
$H$
minor. For
$H=K_3$
, this corresponds to the setting of the Erdős–Pósa theorem.
The theorem of Robertson and Seymour is best possible in the sense that no such result holds when
$H$
is not planar. The original upper bound of
$f_H(k)$
on the size of
$X$
depends on bounds from the Grid Minor Theorem and is large as a result (though it is polynomial in
$k$
if we use the polynomial version of the Grid Minor Theorem, see [Reference Chekuri and Chuzhoy4]). Chekuri and Chuzhoy [Reference Chekuri and Chuzhoy3] subsequently showed an improved upper bound of
$\mathcal{O}_H(k \log ^c k)$
for a fixed planar graph
$H$
, where
$c$
is some large but absolute constant. This was in turn improved to
$\mathcal{O}_H(k \log k)$
by Cames van Batenburg, Huynh, Joret, and Raymond [Reference Cames van Batenburg, Huynh, Joret and Raymond2], thus matching the original bound of Erdős and Pósa for cycles.
An
$\mathcal{O}_H(k \log k)$
bound is best possible when
$H$
contains a cycle. However, when
$H$
is a forest, it turns out that one can obtain a linear in
$k$
bound on the size of
$X$
, as proved by Fiorini, Joret, and Wood [Reference Fiorini, Joret and Wood7]. Their proof gives an
$\mathcal{O}_H(k)$
bound with a non-explicit constant factor that grows very fast as a function of
$|V(H)|$
. This is due to the use of MSO-based tools in the proof, among others. In this short note, we give a simple proof of their result with an optimal dependence on
$t$
and
$k$
when
$H$
is a tree.
Theorem 1.
Let
$T$
be a tree on
$t$
vertices. For every positive integer
$k$
and every graph
$G$
, either
$G$
contains
$k$
pairwise vertex-disjoint subgraphs each having a
$T$
minor, or there exists a set
$X$
of at most
$t(k-1)$
vertices of
$G$
such that
$G-X$
has no
$T$
minor.
Observe that the bound on the size of
$X$
in Theorem1 is tight: if
$G$
is a complete graph on
$tk-1$
vertices, then
$G$
does not contain
$k$
pairwise vertex-disjoint subgraphs each having a
$T$
minor, and every set
$X$
of vertices such that
$G-X$
has no
$T$
minor has size at least
$|V(G)| - (t-1) = t(k-1)$
.
Theorem1 follows immediately from the following more general result for forests.
Theorem 2.
Let
$F$
be a forest on
$t$
vertices and let
$t^{\prime}$
be the maximum number of vertices in a component of
$F$
. For every positive integer
$k$
and every graph
$G$
, either
$G$
contains
$k$
pairwise vertex-disjoint subgraphs each having an
$F$
minor, or there exists a set
$X$
of at most
$tk-t^{\prime}$
vertices of
$G$
such that
$G-X$
has no
$F$
minor.
Let us also point out the following corollary of Theorem1 (proved in the next section).
Corollary 3.
For all positive integers
$p$
and
$k$
, and for every graph
$G$
, either
$G$
contains
$k$
vertex-disjoint subgraphs each of pathwidth at least
$p$
, or
$G$
contains a set
$X$
of at most
$2\cdot 3^{p+1}k$
vertices such that
$G-X$
has pathwidth strictly less than
$p$
.
2. Proof
For a positive integer
$k$
, we use the notation
$[k]\,:\!=\, \{1,\ldots, k\}$
, and when
$k=0$
let
$[k] \,:\!=\, \varnothing$
.
Let
$G$
be a graph. We denote by
$V(G)$
and
$E(G)$
, the vertex set and edge set of
$G$
, respectively. Let
$X\subseteq V(G)$
. Then
$G[X]$
denotes the subgraph of
$G$
induced by the vertices in
$X$
and
$G-X=G[V(G)- X]$
. We define the boundary of
$X$
in
$G$
to be
$\partial _G X \,:\!=\, \left\{v \in X \mid vw\in E(G),\, w\in V(G-X)\right\}$
. We omit the subscript
$G$
when the graph
$G$
is clear from the context.
A path decomposition of
$G$
is a sequence
$(B_1, B_2, \dots, B_q)$
of vertex subsets of
$G$
called bags satisfying the following properties: (1) every vertex of
$G$
appears in a non-empty set of consecutive bags, and (2) for every edge
$uv$
of
$G$
, there is a bag containing both
$u$
and
$v$
. The width of the path decomposition is the maximum size of a bag minus
$1$
. The pathwidth
$\textit{pw}(G)$
of
$G$
is the minimum width of a path decomposition of
$G$
.
A graph
$H$
is a minor of a graph
$G$
if
$H$
can be obtained from a subgraph of
$G$
by contracting edges. Robertson and Seymour [Reference Robertson and Seymour8] proved that there exists a function
$f\,:\,\mathbb{N}\to \mathbb{N}$
such that for every graph
$G$
and every forest
$F$
on
$t$
vertices, if
$\textit{pw}(G)\geqslant f(t)$
then
$G$
contains
$F$
as a minor. Bienstock, Robertson, Seymour, and Thomas [Reference Bienstock, Robertson, Seymour and Thomas1] later showed that one can take
$f(t)=t-1$
, which is best possible. Diestel [Reference Diestel5] subsequently gave a short proof of this result. Our proof of Theorem2 builds on the following slightly stronger result, which appears implicitly in Diestel’s proof [Reference Diestel5].
Lemma 4 ([Reference Diestel5]). Let
$G$
be a graph, let
$t$
be a positive integer, and let
$F$
be a forest on
$t$
vertices. If
$\textit{pw}(G)\geqslant t-1$
, then there exists
$Y\subseteq V(G)$
such that
-
1.
$G[Y]$ has a path decomposition
$(B_1,\ldots, B_q)$ of width at most
$t-1$ such that
$\partial Y\subseteq B_q$ , and
-
2.
$G[Y]$ contains
$F$ as a minor.
We now turn to the proof of Theorem2.
Proof of
Theorem 2
. We prove the following strengthening of Theorem2: Let
$G$
be a graph, let
$c$
be a positive integer, let
$t_1\leqslant \cdots \leqslant t_c$
be non-negative integers, let
$T_1,\ldots, T_c$
be trees with
$|V(T_i)|=t_i$
for every
$i\in [c]$
, let
$x_1,\ldots, x_c$
be non-negative integers, at least one of which is non-zero, and let
$I\,:\!=\, \{i\in [c]\mid x_i \geqslant 1\}$
. Then either
-
1.
$G$ contains pairwise vertex-disjoint subgraphs
$\{M_{i, j}\mid i\in [c],\, j\in [x_i]\}$ such that, for each
$i\in [c]$ and
$j\in [x_i]$ ,
$M_{i,j}$ contains a
$T_i$ minor, or
-
2. there exists
$X\subseteq V(G)$ with
$|X|\leqslant \sum _{i\in I}x_it_i - t_{\max (I)}$ and
$G-X$ does not contain
$T_i$ as a minor for some
$i\in I$ .
We call the tuple
$(G,c,T_1,\ldots, T_c,x_1,\ldots, x_c)$
an instance. Theorem2 follows by letting
$T_1,\ldots, T_c$
be the components of the forest
$F$
and letting
$x_1=x_2=\cdots =x_c=k$
.
Roughly, the proof describes an inductive procedure that attempts to find a pairwise disjoint collection of models, where the number of models of each tree
$T_i$
is
$x_i$
. Induction is on the number
$\sum _{i\in [c]} x_i$
of models still missing from the collection. Failing to find one of the missing models at some step will establish (2).
Let
$(G,c,T_1,\ldots, T_c,x_1\ldots, x_c)$
be an instance, and let
$m\,:\!=\,\min (I)$
. Then
$T_m$
is a smallest tree among
$T_1,\ldots, T_c$
such that
$x_m \geqslant 1$
, that is, such that we are still missing a model of
$T_m$
. In the base case,
$\sum _{i\in [c]} x_i=x_{m}=1$
, and either
$G$
has a
$T_{m}$
minor and the first outcome of the statement holds, or
$G$
has no such minor and the second outcome holds with
$X\,:\!=\,\varnothing$
, since
$\sum _{i\in I} x_it_i -t_{\max (I)}= t_m - t_m = 0$
.
For the inductive case, assume that
$\sum _{i\in [c]} x_i \geqslant 2$
and that the statement holds for instances with smaller values of the sum. If, for every
$i\in I$
,
$G$
has no
$T_i$
minor, then the second outcome of the statement holds with
$X\,:\!=\,\varnothing$
again. Thus, we may assume that
$G$
has a
$T_i$
minor for some
$i\in I$
.
If
$G$
has pathwidth at least
$t_{m}-1$
, apply Lemma4 with
$t=t_{m}$
and
$F=T_{m}$
, and let
$Y$
be the resulting subset of vertices of
$G$
. If
$G$
has pathwidth less than
$t_{m}-1$
, simply let
$Y\,:\!=\,V(G)$
. In either case,
$G[Y]$
has pathwidth at most
$t_m-1$
and has a path decomposition
$(B_1, B_2, \dots, B_q)$
with
$|B_\ell |\leqslant t_{m}$
for all
$\ell \in [q]$
, and such that
$\partial _G Y \subseteq B_q$
. See Figure 1. Furthermore, observe that in both cases
$G[Y]$
has a
$T_i$
minor for some
$i\in I$
, by our assumption on
$G$
.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20241210124520772-0926:S0963548324000415:S0963548324000415_fig1.png?pub-status=live)
Figure 1. The set
$Y$
and the graph
$G_\ell$
whose boundary in
$G$
is contained in
$B_\ell$
.
Let
$\ell \in [q]$
be the smallest index such that
$G_\ell \,:\!=\,G[B_1 \cup \cdots \cup B_\ell ]$
contains a
$T_{i}$
minor for some
$i\in I$
, and let
$i^{\prime}$
be an index in
$I$
such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20241210124520772-0926:S0963548324000415:S0963548324000415_eqn1.png?pub-status=live)
Observe that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20241210124520772-0926:S0963548324000415:S0963548324000415_eqn2.png?pub-status=live)
We claim that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20241210124520772-0926:S0963548324000415:S0963548324000415_eqn3.png?pub-status=live)
To see this, suppose for a contradiction that
$uv$
is such an edge, with
$u\in V(G_\ell )-B_\ell$
and
$v\in V(G)-V(G_\ell )$
. First, note that
$u\in B_1 \cup \cdots \cup B_{\ell -1}$
. If
$v\in Y$
, then
$u$
and
$v$
appear together in some bag
$B_j$
of the path decomposition
$(B_1, B_2, \dots, B_q)$
of
$G[Y]$
, and
$j \gt \ell$
since
$v \notin B_1 \cup \cdots \cup B_\ell$
. However, since
$u\in B_1 \cup \cdots \cup B_{\ell -1}$
and
$u\in B_j$
, we conclude that
$u$
belongs also to
$B_\ell$
, a contradiction. If
$v\notin Y$
, then
$u\in \partial Y$
, and thus
$u\in B_q$
. Again, we deduce similarly that
$u \in B_\ell$
, a contradiction. This completes the proof of (⋆⋆⋆).
Let
$G^{\prime}\,:\!=\,G - V(G_\ell )$
. Let
$x^{\prime}_i \,:\!=\, x_i$
for each
$i\in [c]-\{i^{\prime}\}$
and let
$x^{\prime}_{i^{\prime}} \,:\!=\, x_{i^{\prime}}-1$
. Let
$I^{\prime}=\{i\in [c]\mid x^{\prime}_i\geqslant 1\}$
. Apply induction to the instance
$(G^{\prime},c,T_1,\ldots, T_c,x^{\prime}_1,\ldots, x^{\prime}_c)$
. If it results in a set of vertex-disjoint subgraphs
$\{M^{\prime}_{i,j}\mid i\in [c],\, j\in [x^{\prime}_i]\}$
, with
$M^{\prime}_{i,j}$
containing a
$T_i$
minor for each
$i\in [c]$
and
$j \in [x^{\prime}_i]$
, then we let
$M_{i,j}\,:\!=\,M^{\prime}_{i,j}$
for each
$i\in [c]$
and
$j \in [x^{\prime}_i]$
, and
$M_{i^{\prime},x_{i^{\prime}}} \,:\!=\, G_\ell$
, which using (⋆) results in the desired collection of vertex-disjoint subgraphs. Otherwise, we obtain a set
$X^{\prime}$
of at most
$\sum _{i\in I^{\prime}}x^{\prime}_it_i -t_{\max (I^{\prime})}$
vertices such that
$G^{\prime}-X^{\prime}$
does not contain
$T_a$
as a minor for some
$a\in I^{\prime}$
.
Let
$X\,:\!=\,X^{\prime}\cup B_\ell$
. Observe that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20241210124520772-0926:S0963548324000415:S0963548324000415_eqnU1.png?pub-status=live)
To see why the last inequality holds, there are two cases to consider: (i) if
$\max (I^{\prime})=\max (I)$
, then the inequality follows immediately since
$t_{i^{\prime}}\geqslant t_m$
. (ii) If
$\max (I^{\prime})\lt \max (I)$
, then
$i^{\prime}=\max (I)$
and
$\max (I^{\prime})\geqslant \min (I^{\prime})=m$
, so
$t_{\max (I^{\prime})}+t_{i^{\prime}}-t_m \geqslant t_{i^{\prime}}=t_{\max (I)}$
.
Now, let us show that
$G-X$
does not contain
$T_i$
as a minor, for some
$i\in I$
. Let
$a\in I^{\prime}$
be such that
$G^{\prime}-X^{\prime}$
does not contain
$T_a$
as minor. We will show that we can take
$i=a$
. To do so, it is enough to show that
$X$
meets every inclusion-wise minimal subgraph of
$G$
containing a
$T_a$
minor. Let
$M$
be such a subgraph of
$G$
. Note that
$M$
is connected, since
$T_a$
is connected. Now, observe that by (⋆⋆⋆), either
$M$
is contained in
$G^{\prime}$
, or
$M$
is contained in
$G_\ell -B_\ell$
, or
$M$
contains a vertex of
$B_\ell$
. In the first case,
$M$
contains a vertex of
$X^{\prime}\subseteq X$
, by the choice of
$a$
. The second case is ruled out by (⋆⋆). In the third case,
$M$
contains a vertex of
$B_\ell \subseteq X$
. Thus, we conclude that
$M$
contains a vertex of
$X$
. This concludes the proof.
We may now turn to the proof of Corollary3. We will use the following lemma, which is a special case of a more general result of Robertson and Seymour [Statement (8.7) in [Reference Robertson and Seymour9]].
Lemma 5.
For every graph
$G$
, for every path decomposition
$(B_1, B_2, \dots, B_q)$
of
$G$
, for every family
$\mathcal{F}$
of connected subgraphs of
$G$
, for every positive integer
$d$
, either:
-
1. there are
$d$ pairwise vertex-disjoint subgraphs in
$\mathcal{F}$ , or
-
2. there is a set
$X$ that is the union of at most
$d-1$ bags of
$(B_1, B_2, \dots, B_q)$ such that
$V(F) \cap X \neq \varnothing$ for every
$F \in \mathcal{F}$ .
Proof of
Corollary 3
. It is known (and an easy exercise to show) that, for every positive integer
$p$
, the complete ternary tree
$T_p$
of height
$p$
has pathwidth
$p$
. First, apply Theorem1 on
$G$
with the tree
$T_p$
. If
$G$
contains
$k$
vertex-disjoint subgraphs each containing a
$T_p$
minor, we are done. So we may assume that the theorem produces a set
$X_1$
of at most
$|V(T_p)|(k-1) \leqslant 3^{p+1}(k-1)$
vertices such that
$G-X_1$
has no
$T_p$
minor.
By Lemma4,
$G-X_1$
has a path decomposition
$(B_1, B_2, \dots, B_q)$
of width strictly less than
$3^{p+1}$
. It is easily checked that every inclusion-wise minimal subgraph of
$G-X_1$
with pathwidth at least
$p$
is connected. Apply Lemma5 on
$G-X_1$
with the path decomposition
$(B_1, B_2, \dots, B_q)$
, with
$d=k$
, and with the family
$\mathcal{F}$
of connected subgraphs of
$G-X_1$
with pathwidth at least
$p$
. If
$\mathcal{F}$
contains
$k$
pairwise vertex-disjoint members, we are done. So we may assume that the lemma produces a set
$X_2$
of at most
$3^{p+1}(k-1)$
vertices such that
$X_2$
hits every member of
$\mathcal{F}$
. It follows that
$G-X_1-X_2$
has pathwidth strictly less than
$p$
. Let
$X\,:\!=\, X_1 \cup X_2$
. Since
$|X| \leqslant 3^{p+1}(k-1) + 3^{p+1}(k-1) \leqslant 2\cdot 3^{p+1}k$
, the set
$X$
has the desired properties.
Acknowledgements
This work was done during a visit of Gwenaël Joret and Piotr Micek to the University of Ottawa and Carleton University. The research stay was partially funded by a grant from the University of Ottawa.
Funding statement
G. Joret is supported by a PDR grant from the Belgian National Fund for Scientific Research (FNRS). V. Dujmović is supported by NSERC and a University of Ottawa Research Chair. P. Micek is supported by the National Science Center of Poland under grant UMO-2023/05/Y/ST6/00079 within the WEAVE-UNISONO program. P. Morin is supported by NSERC.