An elementary approach to component sizes in critical random graphs

Umberto De Ambroggio

doi:10.1017/jpr.2022.13

Part of: Combinatorial probability Graph theory Stochastic processes

Published online by Cambridge University Press: 11 November 2022

Affiliation: University of Bath
Postal address: Department of Mathematical Sciences, University of Bath, Bath BA2 7AY, UK. Email address: umbidea@gmail.com

Abstract

In this article we introduce a simple tool to derive polynomial upper bounds for the probability of observing unusually large maximal components in some models of random graphs when considered at criticality. Specifically, we apply our method to a model of a random intersection graph, a random graph obtained through p-bond percolation on a general d-regular graph, and a model of an inhomogeneous random graph.

Keywords

Random walk; ballot theorem

MSC classification

Primary: 05C80 (Random graphs)

Secondary: 60G50 (Sums of independent random variables; random walks), 60C05 (Combinatorial probability)

Type: Original Article
Information: Journal of Applied Probability, Volume 59, Issue 4, December 2022, pp. 1228–1242

DOI: https://doi.org/10.1017/jpr.2022.13
Copyright: © The Author(s), 2022. Published by Cambridge University Press on behalf of Applied Probability Trust

1. Introduction

The purpose of this paper is to introduce an elementary tool to obtain simple upper bounds for the probability of observing unusually large maximal components in some important models of random graphs at criticality.

Given any (undirected) graph $\mathbb{G}=(V,E)$ and vertices $v,u\in V$, we write $v\sim u$ if the edge $\{v,u\}$ is present in $\mathbb{G}$ and say that vertices u and v are neighbours. We write $v\leftrightarrow u$ if there exists a path of occupied edges connecting vertices v and u and we adopt the convention that $v\leftrightarrow v$ for every $v\in V$. We let $\mathcal{C}(v)= \{u\in V\colon v\leftrightarrow u\}$ be the component (or cluster) of vertex $v\in V$ and denote its size by $|\mathcal{C}(v)|$. Moreover, we define a largest component $\mathcal{C}_{\max}$ to be any cluster $\mathcal{C}(v)$ for which $|\mathcal{C}(v)|$ is maximal, so that $|\mathcal{C}_{\max}|=\max_{v\in V}|\mathcal{C}(v)|$.

The Erdős–Rényi random graph on $[n]= \{1,\dots,n\}$, denoted by $\mathbb{G}(n,p)$, is the random graph obtained from the complete graph on n vertices by independently retaining each edge with probability $p\in [0,1]$ and deleting it with probability $1-p$.

One of the most surprising aspects of this model is that when p is of the form $p=p(n)=\gamma/n$, the $\mathbb{G}(n,p)$ random graph undergoes a phase transition as $\gamma$ passes 1. Specifically, if $\gamma < 1$ then $|\mathcal{C}_{\max}|$ is of order $\log\!(n)$, if $\gamma=1$ then $|\mathcal{C}_{\max}|$ is of order $n^{2/3}$, and if $\gamma > 1$ then $|\mathcal{C}_{\max}|$ is of order n. See for instance the books [Reference Bollobás3], [Reference van der Hofstad13], or [Reference Janson, Łuczak and Ruciński15] for proofs of these statements and other interesting properties of this model. See also Krivelevich and Sudakov [Reference Krivelevich and Sudakov19] for a simple proof of the phase transition in $\mathbb{G}(n,p)$.
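The three regimes can be observed numerically. The following is a minimal simulation sketch, not part of the original argument: the function name `largest_component`, the union-find implementation, and the values of n and $\gamma$ are illustrative assumptions.

```python
import random

def largest_component(n, p, rng):
    """Sample G(n, p) edge by edge and return the size of a largest component."""
    parent = list(range(n))

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u in range(n):
        for v in range(u + 1, n):
            if rng.random() < p:          # retain edge {u, v} with probability p
                ru, rv = find(u), find(v)
                if ru != rv:
                    parent[ru] = rv

    sizes = {}
    for v in range(n):
        root = find(v)
        sizes[root] = sizes.get(root, 0) + 1
    return max(sizes.values())

rng = random.Random(1)                    # fixed seed, for reproducibility only
n = 800
sub_size = largest_component(n, 0.5 / n, rng)   # gamma < 1
crit_size = largest_component(n, 1.0 / n, rng)  # gamma = 1
sup_size = largest_component(n, 2.0 / n, rng)   # gamma > 1
```

For a single moderate n the critical value fluctuates considerably, so only the sub-critical and super-critical sizes should be expected to separate cleanly.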

In [Reference De Ambroggio and Roberts9] De Ambroggio and Roberts introduced a ballot-type result (Lemma 1 below) to provide a new, purely probabilistic proof of the fact that in the $\mathbb{G}(n,p)$ model considered in the so-called critical window, i.e. when p is of the form $p=p(n)=n^{-1}+\lambda n^{-4/3}$, the probability of observing a maximal cluster of size larger than $An^{2/3}$ tends to zero (as $n\rightarrow \infty$) exponentially fast in A. More precisely, they proved that, for large enough n,

(1)

\begin{equation}\dfrac{c}{A^{3/2}}\exp\biggl({-\frac{A^3}{8}+\frac{\lambda A^2}{2}-\frac{\lambda^2A}{2}}\biggr)\leq \mathbb{P}\big(|\mathcal{C}_{\max}|>An^{2/3}\big)\leq \dfrac{c^{\prime}}{A^{3/2}}\exp\biggl({-\frac{A^3}{8}+\frac{\lambda A^2}{2}-\frac{\lambda^2A}{2}}\biggr),\end{equation}

where $0<c\leq c^{\prime}$ are two finite constants, thus showing that in the near-critical $\mathbb{G}(n,p)$ model the number of vertices contained in the maximal component is unlikely to be much larger than $n^{2/3}$.

We remark that the correct asymptotic for $\mathbb{P}\big(|\mathcal{C}_{\max}|>An^{2/3}\big)$ in this critical model was obtained first by Pittel [Reference Pittel26] (whose paper is partially based on an earlier article by Łuczak, Pittel, and Wierman [Reference Łuczak, Pittel and Wierman21]) and more recently by Roberts [Reference Roberts27]. We also mention that Nachmias and Peres [Reference Nachmias and Peres23] used a general martingale argument to establish an exponential upper bound for the probability in (1), but their bound is not optimal.

The purpose of this work is to show that part of the argument used in [Reference De Ambroggio and Roberts9] to prove the upper bound in (1) is quite general and can be used to obtain, in a surprisingly simple way, polynomial upper bounds for $\mathbb{P}(|\mathcal{C}_{\max}|>k)$ in different models of random graphs when considered at criticality. Specifically, we apply our method to three different models, namely a model of a random intersection graph, a random graph obtained through p-bond percolation on a general d-regular graph, and a model of an inhomogeneous random graph, and we show that $|\mathcal{C}_{\max}|$ is unlikely to be much larger than $n^{2/3}$ in these models. In this sense, these random graphs exhibit a similar critical behaviour.

2. Results

In order to better understand the statement of our main result (Theorem 1 below), we first need to recall the definition of an exploration process, which is an algorithmic procedure used to reveal the components of a given graph; see e.g. [Reference De Ambroggio and Roberts9], [Reference Nachmias and Peres22], [Reference Nachmias and Peres23], [Reference Roberts27], and references therein. As we will see in a moment, when the graph under investigation is random such an exploration process reduces the study of component sizes to the analysis of the trajectory of a random process, which looks like (but is not quite) a random walk.

Let $\mathbb{G}=([n],E)$ be any (undirected) graph, and let $V_n$ be a vertex selected uniformly at random from [n]. During the exploration of $\mathcal{C}(V_n)$, each vertex will be either active, explored, or unseen, and its status will change during the course of the exploration. At each step $t\in \{0\}\cup [n]$ of the algorithm, the number of explored vertices will be t, whereas the number of active vertices will be denoted by $Y_t$. At time $t=0$, if $V_n$ is an isolated vertex we stop the procedure; otherwise there exists some vertex $u\in [n]\setminus \{V_n\}$ with $\{V_n,u\}\in E$. In this case vertices $V_n$ and u are declared active, whereas all other vertices are declared unseen (so that $Y_0=2$). At each step $t\in [n]$ of the algorithm we proceed as follows.

(a) If $Y_{t-1}>0$, we let $u_t$ be the active vertex with the smallest label. We reveal all unseen neighbours of $u_t$ in $\mathbb{G}$ and change the status of these vertices to active. Then we set $u_t$ itself to be explored.
(b) If $Y_{t-1}=0$, we let $u_t$ be the unseen vertex with the smallest label, and:
(b.1) if $u_t$ is isolated, we halt the procedure;
(b.2) otherwise, there is at least one unseen vertex v such that $\{u_t,v\}\in E$, and we declare both $u_t$ and v active; then we continue with step (a).

Letting $\eta_t$ denote the number of unseen vertices in $\mathbb{G}$ which become active at step t of the exploration process, we see that:

(i) if $Y_{t-1}>0$ then $Y_t=Y_{t-1}+\eta_t-1$;
(ii) if $Y_{t-1}=0$ then $Y_t=\eta_t$.
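The procedure can be sketched in code. This is a simplified illustration under stated assumptions: it explores only the component of the starting vertex (it stops at the first time $Y_t=0$ instead of restarting as in step (b)), it picks the neighbour with the smallest label as the initial second active vertex (the text only requires some neighbour), and the name `explore_component` is hypothetical.

```python
def explore_component(n, edges, start):
    """Explore the component of `start` in the graph ([n], edges).

    Returns (Y, size), where Y[t] is the number of active vertices after
    step t and size is the number of vertices in the component.
    For an isolated start vertex the process is not run: ([], 1).
    """
    adj = {v: set() for v in range(1, n + 1)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)

    if not adj[start]:
        return [], 1

    first = min(adj[start])            # assumption: smallest-label neighbour
    active = {start, first}
    unseen = set(adj) - active
    Y = [2]                            # Y_0 = 2

    while Y[-1] > 0:
        u_t = min(active)              # active vertex with the smallest label
        newly = adj[u_t] & unseen      # unseen neighbours of u_t become active
        unseen -= newly
        active |= newly
        active.remove(u_t)             # u_t becomes explored
        Y.append(Y[-1] + len(newly) - 1)   # Y_t = Y_{t-1} + eta_t - 1

    return Y, len(Y) - 1               # one vertex is explored per step

# Path 1-2-3 plus the isolated vertex 4.
trajectory, component_size = explore_component(4, [(1, 2), (2, 3)], 1)
```

Since exactly one vertex is explored per step, the first time t at which $Y_t=0$ equals the size of the component.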

Remark 1. We remark that our description of an exploration process is slightly different from those provided in [Reference De Ambroggio and Roberts9], [Reference Nachmias and Peres23], and [Reference Roberts27], for example. Indeed, in our setting the algorithm is actually run only in the case where $V_n$ is not an isolated vertex, and the exploration starts from two active vertices and not from one active vertex, as usually happens. Moreover, whenever $Y_{t-1}=0$ and $u_t$ is not isolated, we first reveal one of the neighbours of $u_t$ before proceeding with the exploration. These small modifications will be particularly useful in one of our applications.

Now let $\mathbb{G}=([n],E)$ be any (undirected) random graph, and use the above algorithm to reveal the components of $\mathbb{G}$. It is clear that the $\eta_i$ are now random variables. Observe that, given any $k\in \mathbb{N}=\{1,2,\dots\}$, if $V_n$ is an isolated vertex then $|\mathcal{C}(V_n)|=1$ and hence, in particular, we cannot have $|\mathcal{C}(V_n)|>k$ (as $k\geq 1$). On the other hand, if $V_n$ is not isolated then $|\mathcal{C}(V_n)|>k$ implies that $Y_t=2+\sum_{i=1}^{t}\!(\eta_{i}-1)>0$ for all $t\in [k]$. Therefore we can write

(2)

\begin{align}\mathbb{P}(|\mathcal{C}(V_n)|>k)\leq \mathbb{P}\Biggl(2+\sum_{i=1}^{t}(\eta_{i}-1)>0 \ \text{for all}\ t\in [k]\Biggr).\end{align}

Note that the $\eta_i$ are not independent and, moreover, they have different distributions (one of the reasons is that the number of unseen vertices in the graph decreases during the course of the exploration). Therefore $Y_t$ does not define a random walk.

In order to bound the probability in (2) from above, the idea is to produce a sequence of independent and identically distributed (i.i.d.) random variables $X_i$ that dominate the $\eta_{i}$ and thus allow us to bound the probability on the right-hand side of (2) by the probability that a random walk (started at 2) stays positive up to time k.

In some random graphs this is an immediate consequence of the model construction, while in other instances one needs more care in order to produce these $X_i$.
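To get a feeling for the random-walk probability that replaces the right-hand side of (2), one can compute it exactly by dynamic programming for a concrete step distribution. The sketch below assumes, purely for illustration, that $X_1$ is Binomial(10, 1/10) (mean one); the function names are hypothetical.

```python
from math import comb, sqrt

def binomial_pmf(N, p):
    """Probability mass function of Bin(N, p) as a list indexed by the value."""
    return [comb(N, j) * p**j * (1 - p) ** (N - j) for j in range(N + 1)]

def stay_positive_prob(pmf, k, start=2):
    """P(start + sum_{i<=t} (X_i - 1) > 0 for all t in [k]) for i.i.d. X_i ~ pmf."""
    # dist[pos] = probability that the walk is at pos and has stayed positive so far
    dist = {start: 1.0}
    for _ in range(k):
        new = {}
        for pos, prob in dist.items():
            for x, px in enumerate(pmf):
                nxt = pos + x - 1
                if nxt > 0:                      # discard paths that hit 0 or below
                    new[nxt] = new.get(nxt, 0.0) + prob * px
        dist = new
    return sum(dist.values())

pmf = binomial_pmf(10, 0.1)
p50 = stay_positive_prob(pmf, 50)
p100 = stay_positive_prob(pmf, 100)
scaled = sqrt(100) * p100   # stays of order one if the probability decays like k^{-1/2}
```

The decay of order $k^{-1/2}$ suggested by this computation is what, after the first-moment argument in Section 3.1, produces the factor $n/k^{3/2}$ in Theorem 1.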

Here is our main result.

Theorem 1. Let $\mathbb{G}=([n],E)$ be any (undirected) random graph. Suppose that there exists a sequence of i.i.d. random variables $(X_i)_{i\geq 1}$ taking values in $\mathbb{N}_0$, such that the distribution of $X_1$ may depend on n, satisfying:

(i) $\mathbb{P}(X_1=3)>0$ for all sufficiently large n;
(ii) $\mathbb{P}(|\mathcal{C}(V_n)|>k)\leq \mathbb{P}\bigl(2+\sum_{i=1}^{t}\!(X_i-1)>0\ \text{for all}\ t\in [k]\bigr)$ for every $k=k(n)\in \mathbb{N}$;
(iii) there exist $\delta,\rho=\rho(n)>0$ and $\epsilon=\epsilon(n)\geq 0$ with $\epsilon(n)\rightarrow 0$ (as $n\rightarrow \infty$) such that $\mathbb{E}\big({\textrm{e}}^{rX_1}\big)\leq {\textrm{e}}^{r(1+\epsilon)+r^2\delta}$ for every $r\in (0,\rho)$ and all sufficiently large n.

Suppose that $k=k(n)\in \mathbb{N}$ satisfies $\epsilon \sqrt{k}\leq c$ and $\rho \sqrt{k}\geq 1$ for all large enough n, for some finite constant $c>0$. Then

\begin{equation*} \mathbb{P}(|\mathcal{C}_{\max}|>k)\leq \dfrac{C}{\mathbb{P}(X_1=3)}\dfrac{n}{k^{3/2}}\end{equation*}

for all sufficiently large n, where $C>0$ is a finite positive constant which depends solely on $\delta$ and c.

Remark 2. In all our applications the probability $\mathbb{P}(X_1=3)$ is bounded away from zero, so that if $k=k(n)= \lceil An^{2/3} \rceil$ satisfies the two assumptions in the statement of the theorem, then $\mathbb{P}(|\mathcal{C}_{\max}|> \lceil An^{2/3} \rceil)$ is indeed ${\textrm{O}}(A^{-3/2})$.

Remark 3. We note that condition (iii) in Theorem 1 might be stated in different (possibly more general) terms, but we decided to state it in this way because of its simplicity to verify, as shown in our applications.

Our claim that the approach introduced in [Reference De Ambroggio and Roberts9] is robust and that Theorem 1 leads to simple upper bounds for $\mathbb{P}(|\mathcal{C}_{\max}|>k)$ in several models of random graphs at criticality is justified in Sections 2.1, 2.2, and 2.3 below, where we use Theorem 1 to obtain polynomial upper bounds for the above probability in three particular models of random graphs.

We remark that our methodology does not lead to upper bounds for the probability of observing unusually small largest components; in this direction the martingale argument introduced by Nachmias and Peres in [Reference Nachmias and Peres23] seems to be more robust and adaptable to different models of random graphs.

2.1. Critical random intersection graph

Our first application of Theorem 1 involves a model of a random intersection graph; for an introduction to this class of models, we refer the reader to [Reference Frieze and Karoński11].

Here we are interested in the random graph described by Lageras and Lindholm [Reference Lageras and Lindholm20]. Such a random graph, denoted by $\mathbb{G}(n,m,p)$, with a set of vertices $V=\{v_i\colon i\in [n]\}$ and a set of edges E, is constructed from a bipartite graph $\mathbb{B}(n,m,p)$ with two sets of vertices: $A=\{a_j\colon j\in [m]\}$, which we call the set of auxiliary vertices, and V (i.e. the vertex set of $\mathbb{G}(n,m,p)$). Edges in $\mathbb{B}(n,m,p)$ between vertices and auxiliary vertices are present independently with probability $p\in [0,1]$. Two distinct vertices $v_i$ and $v_j$ are neighbours in $\mathbb{G}(n,m,p)$ (i.e. $\{v_i,v_j\}\in E$) if and only if there exists at least one $a_k\in A$ such that both edges $\{a_k,v_i\}$ and $\{a_k,v_j\}$ are present in the bipartite graph $\mathbb{B}(n,m,p)$.
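The construction can be sketched as follows. This is an illustrative sampler rather than code from the paper: the function name `random_intersection_graph` and the 0-based vertex labels are assumptions.

```python
import random
from itertools import combinations

def random_intersection_graph(n, m, p, rng):
    """Sample B(n, m, p) and return (members, edges).

    members[k] lists the vertices adjacent to auxiliary vertex a_k in the
    bipartite graph; edges is the edge set of the intersection graph G(n, m, p).
    """
    # Each vertex/auxiliary-vertex pair is joined independently with probability p.
    members = [[i for i in range(n) if rng.random() < p] for _ in range(m)]

    edges = set()
    for group in members:
        # Vertices sharing an auxiliary vertex are pairwise neighbours in G.
        edges.update(combinations(group, 2))
    return members, edges

rng = random.Random(0)                      # fixed seed, for reproducibility only
members, edges = random_intersection_graph(30, 20, 0.1, rng)
```

Storing the edges in a set collapses parallel connections through several auxiliary vertices into a single edge, in line with the "at least one $a_k$" condition above.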

We are interested in the case where $p=p(n)=\gamma/n^{(1+\alpha)/2}$ and $m=m(n)=\lfloor \beta n^{\alpha} \rfloor$, where $\alpha,\beta,\gamma> 0$ are fixed parameters of the model.

Stark [Reference Stark28] has shown that the vertex degree distribution (i.e. the distribution of the degree of a vertex selected uniformly at random) is highly dependent on the value of $\alpha$. However, as shown by Deijfen and Kets [Reference Deijfen and Kets10], the clustering is controllable only when $\alpha=1$.

The component structure of the graph was studied for $\alpha\neq 1$, $\gamma>0$, and $\beta=1$ by Behrisch [Reference Behrisch2], whereas it was studied for $\alpha=1$ and $\beta, \gamma>0$ in [Reference Lageras and Lindholm20]. Specifically, Lageras and Lindholm [Reference Lageras and Lindholm20] proved that the $\mathbb{G}(n,m,p)$ model undergoes a phase transition as $\beta \gamma^2$ passes 1. Indeed, setting $\mu= \beta \gamma^2$, they proved that if $\mu<1$ (sub-critical case), then with probability tending to one there is no component in $\mathbb{G}(n,m,p)$ with more than ${\textrm{O}}(\!\log\!(n))$ vertices, while if $\mu >1$ (super-critical case), then with probability tending to one there exists a unique giant component of size $n\delta$ where $\delta\in (0,1)$, and the size of the second largest component is at most of order $\log\!(n)$.

By means of Theorem 1 we show that, in the critical case $\mu=1$, it is unlikely for the largest component to contain more than $n^{2/3}$ vertices.

Proposition 1. Let $\mathbb{G}(n,m,p)$ be the random intersection graph described above. Let $m= \lfloor \beta n \rfloor$, $p= \gamma/n$, and $\mu= \beta \gamma^2$. If $\mu=1$ then, given any $A>1$, when n is sufficiently large we have

\begin{equation*} \mathbb{P}(|\mathcal{C}_{\max}|> \lceil A n^{2/3} \rceil)\leq \dfrac{c_1}{A^{3/2}},\end{equation*}

where $c_1$ is a finite constant which depends solely on $\gamma$ and $\beta$.

2.2. Critical p-bond percolation on d-regular graph

In this section we consider a second application of Theorem 1. Here we analyse a random graph $\mathbb{G}_p$ obtained through p-bond percolation on a general d-regular graph.

In [Reference Nachmias and Peres22] Nachmias and Peres adapted the martingale method they developed in [Reference Nachmias and Peres23] to prove that, for any $d\geq 3$, when $p\leq (d-1)^{-1}$ then

(3)

\begin{equation}\mathbb{P}(|\mathcal{C}_{\max}|> \lceil An^{2/3}\rceil) \leq \dfrac{8}{A^{3/2}};\end{equation}

see [Reference Nachmias and Peres22, Proposition 1.2]. For a random regular graph $\mathbb{G}(n,d,p)$ they were also able to sharpen the upper bound in (3) and to prove a corresponding lower bound. (The $\mathbb{G}(n,d,p)$ random graph is obtained by the following two-step procedure: first we draw uniformly at random a graph from the set of all simple d-regular graphs on [n], and then we retain each edge independently with probability p and delete it with probability $1-p$.) Specifically, in Theorem 2 of [Reference Nachmias and Peres22] it is shown that, when p is of the form

(4)

\begin{equation}p=p(n,d)=\big(1+\lambda n^{-1/3}\big)/(d-1) \quad (\lambda \in \mathbb{R})\end{equation}

and $d\geq 3$ is fixed, then there are constants $C_1,C_2\in (0,\infty)$ depending on $\lambda$ and d such that, for every $A>0$ and all n, $\mathbb{P}(|\mathcal{C}_{\max}|>An^{2/3})\leq A^{-1}C_1 \,{\textrm{e}}^{-C_2 A^3}$. In [Reference Nachmias and Peres22] it is also shown that there exists a constant $C_3\in (0,\infty)$ (also depending on $\lambda$ and d) such that, for large enough A and all n, $\mathbb{P}(|\mathcal{C}_{\max}|<\lceil A^{-1} n^{2/3}\rceil)\leq C_3A^{-1/2}$, thus proving that $|\mathcal{C}_{\max}|$ is indeed of order $n^{2/3}$ in this model when considered at criticality.

We remark that in [Reference Nachmias and Peres22] the parameter d is not allowed to depend on n. The problem of determining the size of $|\mathcal{C}_{\max}|$ in the critical $\mathbb{G}(n,d,p)$ model when $d=d(n)$ depends on n has been investigated by Joos and Perarnau [Reference Joos and Perarnau16], where the authors proved (among many other things) that for any $d\in \{3,\dots,n-1\}$ and when p is of the form (4), then for all sufficiently large n and $A=A(\lambda)$ we have that $\mathbb{P}(|\mathcal{C}_{\max}|\notin [A^{-1}n^{2/3},An^{2/3}])\leq 20/\sqrt{A}$.

Our goal here is to show that, by means of Theorem 1, we can recover (up to a multiplicative constant) the bound in (3), in a very simple way.

Proposition 2. Let $\mathbb{G}$ be a d-regular graph, $d> 3$, and let $\mathbb{G}_p$ denote the random graph obtained by bond percolation on $\mathbb{G}$ with probability p. If $p\leq 1/(d-1)$ then, given any $A>1$, when n is sufficiently large we have

\begin{equation*} \mathbb{P}(|\mathcal{C}_{\max}|> \lceil An^{2/3}\rceil)\leq \dfrac{c_2}{A^{3/2}}\end{equation*}

for some finite positive constant $c_2$ which depends solely on d.

Remark 4. The requirement $d>3$ is needed because the i.i.d. random variables $(X_i)_{i\geq 1}$ that dominate the $\eta_i$ in the exploration process satisfy

\begin{equation*}\mathbb{P}(X_1=3)=\dfrac{(d-2)(d-3)}{6(d-1)^2}\biggl(1-\dfrac{1}{d-1}\biggr)^{d-4};\end{equation*}

see Section 3.3. Hence, if we want $\mathbb{P}(X_1=3)>0$ (a condition required in Theorem 1), we do need $d>3$.
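As a sanity check on the displayed expression, the sketch below verifies in exact arithmetic that it coincides with the probability that a Binomial$(d-1, 1/(d-1))$ random variable equals 3, and that it vanishes for $d=3$. Identifying the dominating variables with this binomial law is an assumption made here for illustration (the construction is in Section 3.3, which is not reproduced above); the function names are hypothetical.

```python
from fractions import Fraction
from math import comb

def remark_formula(d):
    """The expression for P(X_1 = 3) displayed in Remark 4."""
    return Fraction((d - 2) * (d - 3), 6 * (d - 1) ** 2) * (1 - Fraction(1, d - 1)) ** (d - 4)

def binomial_at_three(d):
    """P(Bin(d - 1, 1/(d - 1)) = 3), computed directly from the binomial pmf."""
    p = Fraction(1, d - 1)
    return comb(d - 1, 3) * p**3 * (1 - p) ** (d - 4)

# Exact values for a few degrees; the d = 3 entry is zero.
values = {d: remark_formula(d) for d in range(3, 8)}
```

For $d=4$ both expressions reduce to $1/27$.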

2.3. Critical inhomogeneous random graph

In this section we discuss our final application of Theorem 1. In the random graph model that we investigate here, the n vertices are endowed with weights, and edges between a pair of vertices are placed independently with probabilities moderated by such weights.

Specifically, let $\textbf{w}=(w_i)_{i\in [n]}$ be a sequence of positive real numbers, which we call the sequence of vertex weights; we think of $w_i$ as the weight assigned to vertex $i\in [n]$. Define $l_n= \sum_{i\in[n]}w_i$, the sum of all weights.

We consider the so-called Norros–Reittu random graph [Reference Norros and Reittu24] as described by Van der Hofstad [Reference van der Hofstad12]. This is an inhomogeneous random graph, that we denote by $NR_n(\textbf{w})$, in which the probability that the edge $\{i,j\}$ is present in $NR_n(\textbf{w})$ (for $1\leq i<j\leq n$) is given by

\begin{equation*} p_{ij}^{NR}= \mathbb{P}(\{i,j\}\in E(NR_n(\textbf{w})))= 1-{\textrm{e}}^{-w_iw_j/l_n},\end{equation*}

and edges are present independently.

Inhomogeneous random graphs were studied extensively by Bollobás, Janson, and Riordan [Reference Bollobás, Janson and Riordan4]. As explained by Janson [Reference Janson14] and further noted by Van der Hofstad [Reference van der Hofstad12], the $NR_n(\textbf{w})$ random graph is closely related to the models studied by Chung and Lu [Reference Chung and Lu5, Reference Chung and Lu6, Reference Chung and Lu7] and Norros and Reittu [Reference Norros and Reittu24], so that the results proved for the $NR_n(\textbf{w})$ random graph apply as well to these other models.

Other models of inhomogeneous random graphs have been studied more recently by Penrose [Reference Penrose25] and by Kang, Pachon, and Rodriguez [Reference Kang, Pachon and Rodriguez18].

It is clear that the topology of the $NR_n(\textbf{w})$ model depends on the choice of the sequence $\textbf{w}$, which we now specify.

Let $F\colon (0,\infty)\to [0,1]$ be a distribution function, and define

\begin{equation*}[1-F]^{-1}(u)= \inf\{s\colon [1-F(s)]\leq u\}, \quad u\in(0,1).\end{equation*}

By convention, we set $[1-F]^{-1}(1)= 0$. We construct the weights as in [Reference van der Hofstad12], namely we set

(5)

\begin{equation}w_j= [1-F]^{-1}(\,j/n),\quad j\in [n].\end{equation}

In [Reference Bollobás, Janson and Riordan4, Theorem 3.13] it was shown that in the $NR_n(\textbf{w})$ random graph with vertex weights as in (5), the proportion of vertices having degree $k\geq 0$, denoted by $N_k$, converges in probability (as $n \rightarrow \infty$) to

\[p_k= \mathbb{E}\biggl({\textrm{e}}^{-W}\dfrac{W^k}{k!}\biggr),\]

where W is a random variable taking values in $(0,\infty)$ with distribution function F. The limiting sequence $(p_k)_{k\geq 0}$ is a so-called mixed Poisson distribution with mixing distribution F.

In order to describe the phase transition for the size of the largest component, define

\begin{equation*} \nu=\mathbb{E}(W^2)/\mathbb{E}(W).\end{equation*}

As explained by Van der Hofstad [Reference van der Hofstad12] (see also [Reference De Ambroggio and Pachon8]), this (positive) real number corresponds to the asymptotic mean of the offspring distribution in a branching process approximation of the exploration of $\mathcal{C}(V_n)$.
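The weight construction (5), the edge probabilities, and the parameter $\nu$ can be illustrated on a concrete example. The sketch below assumes a Pareto-type law with $1-F(x)=(x_{\min}/x)^{\tau-1}$ for $x\geq x_{\min}$, for which $[1-F]^{-1}(u)=x_{\min}u^{-1/(\tau-1)}$ on $(0,1)$; the choice of law, the value $\tau=5$, the function names, and the use of $\sum_j w_j^2/\sum_j w_j$ as a finite-n analogue of $\nu$ are all assumptions of this example.

```python
from math import exp

def pareto_weights(n, tau, x_min):
    """Weights w_j = [1-F]^{-1}(j/n) for the assumed tail 1-F(x) = (x_min/x)^(tau-1)."""
    a = tau - 1.0
    w = [x_min * (n / j) ** (1.0 / a) for j in range(1, n)]
    w.append(0.0)   # j = n: the convention [1-F]^{-1}(1) = 0
    return w

def edge_probability(w, l_n, i, j):
    """Norros-Reittu probability 1 - exp(-w_i w_j / l_n) for 1-indexed vertices i != j."""
    return 1.0 - exp(-w[i - 1] * w[j - 1] / l_n)

def empirical_nu(w):
    """Sum of squared weights over sum of weights, a finite-n analogue of E(W^2)/E(W)."""
    return sum(x * x for x in w) / sum(w)

tau = 5.0
x_min = (tau - 3.0) / (tau - 2.0)   # gives E(W^2)/E(W) = 1 for this Pareto-type law
weights = pareto_weights(100_000, tau, x_min)
l_n = sum(weights)                  # total weight
nu_n = empirical_nu(weights)
p_12 = edge_probability(weights, l_n, 1, 2)
```

With this scaling the example sits at the critical value $\nu=1$, and the tail condition (8) holds with $c_F=x_{\min}^{\tau-1}$.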

In [Reference Bollobás, Janson and Riordan4, Theorem 3.1] it is shown that the graph undergoes a phase transition as $\nu$ passes 1. In particular, if $\nu>1$, the largest component contains a positive proportion of the total number of vertices, whereas if $\nu \leq 1$ the largest component contains a vanishing proportion of vertices.

Van der Hofstad [Reference van der Hofstad12] provided a complete picture of the component structure in the critical $NR_n(\textbf{w})$ model. Specifically, he proved that in the case where

(6)

\begin{equation}\lim_{x\rightarrow \infty}x^{\tau-1}(1-F(x))=c_F\end{equation}

for some constant $c_F>0$ and some $3<\tau <4$, there is a constant $b>0$ such that for all $A>1$ and $n\geq 1$, the $NR_n(\textbf{w})$ random graph satisfies

(7)

\begin{equation}\mathbb{P}(|\mathcal{C}_{\max}|\notin [A^{-1}n^{(\tau-2)/(\tau-1)},An^{(\tau-2)/(\tau-1)}])\leq b/A.\end{equation}

On the other hand, when

(8)

\begin{equation}1-F(x)\leq c_Fx^{-(\tau-1)}\quad (x\geq 0)\end{equation}

for some $c_F>0$ and some $\tau>4$, then there is a constant $b>0$ such that, for all $n\geq 1$ and all $A>1$, the $NR_n(\textbf{w})$ random graph satisfies

(9)

\begin{equation}\mathbb{P}(|\mathcal{C}_{\max}|\notin [A^{-1}n^{2/3},An^{2/3}])\leq b/A.\end{equation}

(In fact Van der Hofstad [Reference van der Hofstad12] proved a more general result, namely that the bounds (7) and (9) also remain valid after a small perturbation of the vertex weights; see [Reference van der Hofstad12, Theorems 1.1 and 1.2].)

For a heuristic explanation of the critical behaviour described by (7) and (9), we refer to [Reference van der Hofstad12, Section 1.3].

We also mention that De Ambroggio and Pachon [Reference De Ambroggio and Pachon8] used the first part of the martingale argument introduced by Nachmias and Peres [Reference Nachmias and Peres23] to obtain simple upper bounds for the probability of observing unusually large maximal components in the (critical) $NR_n(\textbf{w})$ random graph for both regimes $\tau\in (3,4)$ and $\tau>4$, although in the former case (i.e. for $\tau\in (3,4)$) the distribution function F is required to satisfy a stronger condition than the one stated in (6).

Our goal here is to use Theorem 1 to provide a very simple proof of the fact that, in the critical $NR_n(\textbf{w})$ model with vertex weights as in (5) and distribution function F satisfying (8), the largest component is unlikely to contain more than $n^{2/3}$ vertices. More precisely, we prove the following.

Proposition 3. Consider the $NR_n(\textbf{w})$ random graph with weights defined as in (5) above. Suppose that there exist a constant $c_F>0$ and a $\tau >4$ such that $1-F(x)\leq c_Fx^{-(\tau-1)}$ for all $x\geq 0$. Then, given any $A>1$, when n is large enough we have that

\begin{align*}\mathbb{P}(|\mathcal{C}_{\max}|> \lceil An^{2/3} \rceil)\leq \dfrac{c_3}{A^{3/2}},\end{align*}

where $c_3$ is a finite positive constant which depends solely on $c_F$ and $\tau$.

3. Proofs

Here we are going to prove the results stated in Section 2. We start by proving Theorem 1 and subsequently we prove the remaining results, namely Propositions 1, 2, and 3.

The proof of Theorem 1 relies on the following ballot-type estimate, which is taken from [Reference De Ambroggio and Roberts9]. For a general introduction to classical ballot theorems and their generalisations, see for instance [Reference Addario-Berry and Reed1], [Reference Kager17], and references therein.

Lemma 1. Fix $n\in \mathbb{N}$ and let $(W_i)_{i\geq 1}$ be a sequence of i.i.d. random variables taking values in $\mathbb{Z}$. Let $r\in \mathbb{N}$, and suppose that $\mathbb{P}(W_1=r)>0$. Define $S_t = \sum_{i=1}^{t}W_i$ for $t\in \mathbb{N}_0$. Then, for any $j\geq 1$, we have

\begin{equation*}\mathbb{P}(r+S_t>0\ \text{for all}\ t\in [n],r+S_{n}=j)\leq \mathbb{P}(W_1=r)^{-1}\dfrac{j}{n+1}\mathbb{P}(S_{n+1}=j).\end{equation*}
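The inequality of Lemma 1 can be checked exactly on a small example. The sketch below assumes, for illustration only, steps $W_i=X_i-1$ with $X_i$ distributed as Binomial(3, 1/3) and $r=2$; both sides are computed with rational arithmetic, and the function names are hypothetical.

```python
from fractions import Fraction
from math import comb

def step_pmf():
    """pmf of W = X - 1 with X ~ Bin(3, 1/3), as a dict {value: probability}."""
    p = Fraction(1, 3)
    return {x - 1: comb(3, x) * p**x * (1 - p) ** (3 - x) for x in range(4)}

def walk_dist(pmf, n, start=0, keep_positive=False):
    """Distribution of start + S_n; with keep_positive=True, restricted to the
    event that start + S_t > 0 for all t in [n]."""
    dist = {start: Fraction(1)}
    for _ in range(n):
        new = {}
        for s, ps in dist.items():
            for w, pw in pmf.items():
                if keep_positive and s + w <= 0:
                    continue
                new[s + w] = new.get(s + w, 0) + ps * pw
        dist = new
    return dist

def ballot_gap(pmf, n, r):
    """Right-hand side minus left-hand side of Lemma 1, for every attainable j >= 1."""
    lhs = walk_dist(pmf, n, start=r, keep_positive=True)
    s_next = walk_dist(pmf, n + 1)
    return {j: Fraction(j, n + 1) * s_next.get(j, 0) / pmf[r] - lhs[j] for j in lhs}

gaps = {n: ballot_gap(step_pmf(), n, 2) for n in range(1, 9)}
```

Every entry of `gaps` should be non-negative if the lemma holds for this example.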

3.1. Proof of Theorem 1

Let $k=k(n)\in \mathbb{N}$. By hypothesis, there is a sequence of i.i.d. random variables $X_i$ taking values in $\mathbb{N}_0$ such that, setting $S_t= \sum_{i=1}^{t}\!(X_i-1)$,

\begin{equation*}\mathbb{P}(|\mathcal{C}(V_n)|>k)\leq \mathbb{P}(2+S_t>0 \ \text{for all}\ t\in [k]).\end{equation*}

Using Lemma 1 with $W_i=X_i-1$ and $r=2$, we obtain

(10)

\begin{align}\nonumber\mathbb{P}(2+S_t>0 \ \text{for all}\ t\in [k])&= \sum_{h=1}^{\infty}\mathbb{P}(2+S_t>0 \ \text{for all}\ t\in [k], 2+S_k=h)\\&\leq a\sum_{h=1}^{\infty}\dfrac{h}{k+1}\mathbb{P}(S_{k+1}=h),\end{align}

where we set $a= 1/\mathbb{P}(X_1=3)\in (0,\infty)$. Now let m be a non-negative integer to be specified later. By splitting the series in (10) at $h=m$ we can write

(11)

\begin{equation}a\sum_{h=1}^{\infty}\dfrac{h}{k+1}\mathbb{P}(S_{k+1}=h)\leq a\dfrac{m}{k+1}+\dfrac{a}{k+1}\sum_{h=m+1}^{\infty}h\mathbb{P}(S_{k+1}=h).\end{equation}

Now the series in (11) equals

(12)

\begin{align}\nonumber\dfrac{a}{k+1}&\sum_{h=m+1}^{\infty}h\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i=h+k+1\Biggr)\\&=\dfrac{a}{k+1}\sum_{z=m+k+2}^{\infty}z\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i=z\Biggr)-a\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq m+k+2\Biggr).\end{align}

To proceed, we observe the following: if X is a random variable taking values in $\mathbb{N}_0$, then for any $h\ge1$, we have

\begin{align*}\mathbb{E}(X\unicode{x1D7D9}_{\{X\geq h\}}) = \mathbb{E}\Biggl(\sum_{i=1}^{\infty} \unicode{x1D7D9}_{\{i\le X\}}\unicode{x1D7D9}_{\{X\geq h\}}\Biggr)&= h\mathbb{P}(X\geq h) + \sum_{i=h+1}^{\infty} \mathbb{P}(X\geq i).\end{align*}
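The identity can be confirmed on a finite example. The sketch below uses the Binomial(3, 1/2) distribution as an arbitrary test case (an assumption made for illustration) and exact rational arithmetic; the function names are hypothetical.

```python
from fractions import Fraction

def truncated_mean(pmf, h):
    """E(X 1{X >= h}) for a pmf given as a list indexed by the value of X."""
    return sum(x * p for x, p in enumerate(pmf) if x >= h)

def tail(pmf, i):
    """P(X >= i)."""
    return sum(pmf[i:])

def tail_sum_form(pmf, h):
    """h P(X >= h) + sum_{i >= h+1} P(X >= i), the right-hand side of the identity."""
    return h * tail(pmf, h) + sum(tail(pmf, i) for i in range(h + 1, len(pmf)))

# Bin(3, 1/2): P(X = 0..3) = 1/8, 3/8, 3/8, 1/8.
pmf = [Fraction(1, 8), Fraction(3, 8), Fraction(3, 8), Fraction(1, 8)]
checks = {h: (truncated_mean(pmf, h), tail_sum_form(pmf, h)) for h in range(1, 6)}
```

For $h=1$ both sides equal $\mathbb{E}(X)=3/2$, and for h beyond the support both sides are zero.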

Thus the series in (12) equals

\begin{align*}a\dfrac{m+1}{k+1}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq m+k+2\Biggr) +a\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq m+k+2\Biggr)+\dfrac{a}{k+1}\sum_{z=m+k+3}^{\infty}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq z\Biggr).\end{align*}

Substituting the series in (12) with these three terms, we obtain

(13)

\begin{align}\dfrac{a}{k+1}\sum_{h=m+1}^{\infty}h\mathbb{P}(S_{k+1}=h)= a\dfrac{m}{k+1}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq m+k+2\Biggr)+\dfrac{a}{k+1}\sum_{z=m+k+2}^{\infty}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq z\Biggr).\end{align}

Now observe that the series in (13) can be rewritten as follows:

\begin{equation*}\dfrac{a}{k+1}\sum_{z=m+k+2}^{\infty}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq z\Biggr)=\dfrac{a}{k+1}\sum_{h=m+1}^{\infty}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq h+k+1\Biggr).\end{equation*}

Summarizing, so far we have shown that

(14)

\begin{align}\mathbb{P}(|\mathcal{C}(V_n)|>k)&\leq a\dfrac{m}{k+1}+a\dfrac{m}{k+1}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq k+1+(m+1)\Biggr)\notag \\&\quad +\dfrac{a}{k+1}\sum_{h=m+1}^{\infty}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq k+1+h\Biggr).\end{align}

Using our assumption on $\mathbb{E}({\textrm{e}}^{rX_1})$ and Markov’s inequality, we have, for all $h\geq m+1$ and all $r\in (0,\rho)$,

\begin{align*}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq k+1+h\Biggr)&\leq {\textrm{e}}^{-r(k+1)-rh}\mathbb{E}({\textrm{e}}^{rX_1})^{k+1}\\&\leq \exp\bigl\{-r(k+1)-rh+r(1+\epsilon)(k+1)+\delta r^2 (k+1)\bigr\}\\&=\exp\bigl\{-rh+r\epsilon (k+1)+\delta r^2 (k+1)\bigr\}.\end{align*}

Now if k is such that $\rho \sqrt{k}\geq 1$ for all large enough n, then $r= 1/\sqrt{k+1}<\rho$, and hence using this specific value of r we obtain

\begin{equation*}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq k+1+h\Biggr)\leq {\textrm{e}}^{-{{h}/{\sqrt{k+1}}}}\,{\textrm{e}}^{\epsilon \sqrt{k+1}+\delta}.\end{equation*}

Also, if k satisfies $\epsilon \sqrt{k}\leq c$ for all sufficiently large n, we see that $\epsilon \sqrt{k+1}\leq 2c$ and hence we can bound the series in (14) from above as follows:

(15)

\begin{equation}\dfrac{a}{k+1}\sum_{h=m+1}^{\infty}\mathbb{P}\Biggl(\sum_{i=1}^{k+1}X_i\geq h+k+1\Biggr)\leq \dfrac{a\,{\textrm{e}}^{\delta+2c}}{k+1}\sum_{h=m+1}^{\infty}\,{\textrm{e}}^{-{{h}/{\sqrt{k+1}}}}.\end{equation}

Now observe that

\begin{equation*}\sum_{h=m+1}^{\infty}\,{\textrm{e}}^{-{{h}/{\sqrt{k+1}}}}={\textrm{e}}^{-{{(m+1)}/{\sqrt{k+1}}}}\dfrac{1}{1-{\textrm{e}}^{-{{1}/{\sqrt{k+1}}}}}.\end{equation*}

Using the inequality ${\textrm{e}}^{-x}\leq 1-x+x^2/2$ (which is valid for all $x\geq 0$), we see that

\begin{equation*}1-{\textrm{e}}^{-{{1}/{\sqrt{k+1}}}}\geq \dfrac{1}{\sqrt{k+1}}-\dfrac{1}{2(k+1)}\geq \dfrac{1}{2\sqrt{k+1}},\end{equation*}

and hence the expression on the right-hand side of (15) is at most

\begin{equation*}\dfrac{a\,{\textrm{e}}^{\delta+2c}}{k+1}\,{\textrm{e}}^{-{{(m+1)}/{\sqrt{k+1}}}}2\sqrt{k+1}=\dfrac{2a\,{\textrm{e}}^{\delta+2c}}{\sqrt{k+1}}\,{\textrm{e}}^{-{{(m+1)}/{\sqrt{k+1}}}}.\end{equation*}

Thus we obtain

\begin{equation*}\mathbb{P}(|\mathcal{C}(V_n)|>k)\leq a\dfrac{m}{k+1}+a\,{\textrm{e}}^{\delta+2c}\dfrac{m}{k+1} \,{\textrm{e}}^{-{{(m+1)}/{\sqrt{k+1}}}}+\dfrac{2a\,{\textrm{e}}^{\delta+2c}}{\sqrt{k+1}}\,{\textrm{e}}^{-{{(m+1)}/{\sqrt{k+1}}}}.\end{equation*}

Taking $m=\lfloor \sqrt{k+1} \rfloor$ we see that

\begin{equation*}\mathbb{P}(|\mathcal{C}(V_n)|>k)\leq \dfrac{a}{\sqrt{k+1}}+3a\,{\textrm{e}}^{\delta+2c}\dfrac{{\textrm{e}}^{-1}}{\sqrt{k+1}}=C\dfrac{a}{\sqrt{k+1}},\end{equation*}

where we set $C=C(\delta,c)= 1+3\,{\textrm{e}}^{\delta+2c-1}$. Finally, letting

\begin{equation*}N_k=\sum_{v\in [n]}^{}\unicode{x1D7D9}_{\{|\mathcal{C}(v)|> k \}}\end{equation*}

denote the number of vertices that are contained in components of size larger than k, we obtain

\begin{equation*}\mathbb{P}(|\mathcal{C}_{\max}|> k)= \mathbb{P}(N_{k}> k)\leq \dfrac{1}{k}\mathbb{E}(N_{k})=\dfrac{n}{k}\mathbb{P}(|\mathcal{C}(V_n)|> k)\leq \dfrac{C}{\mathbb{P}(X_1=3)}\dfrac{n}{k^{3/2}},\end{equation*}

completing the proof of the theorem.

3.2. Proof of Proposition 1

Let $\mathbb{H}(n,m,p)$ be a random (multi-)graph constructed from the bipartite graph $\mathbb{B}(n,m,p)$ by letting the number of edges between $v_i,v_j\in V$ equal the number of auxiliary vertices $a_k$ that are adjacent to both $v_i$ and $v_j$. (Recall that V is the vertex set of the random intersection graph $\mathbb{G}(n,m,p)$ under investigation.) Notice that $\mathbb{G}(n,m,p)$ can be obtained from $\mathbb{H}(n,m,p)$ by coalescing multiple edges between vertices into one single edge. Hence, thanks to this construction, we see that the degree distribution in $\mathbb{G}(n,m,p)$ is dominated by the degree distribution in $\mathbb{H}(n,m,p)$. Notice that the latter is a compound binomial distribution with moment generating function

(16)

\begin{equation}\mathbb{E}({\textrm{e}}^{rX_1})=\biggl\{1-\dfrac{\gamma}{n}+\dfrac{\gamma}{n}\biggl(1-\dfrac{\gamma}{n}+\dfrac{\gamma}{n}\,{\textrm{e}}^{r}\biggr)^{n-1}\biggr\}^{\lfloor \beta n \rfloor},\end{equation}

since (by construction) a vertex $v\in \mathbb{H}(n,m,p)$ is connected to a $\operatorname{Bin}\!(m,p)$ number of auxiliary vertices, and each one of them is connected to an independent $\operatorname{Bin}\!(n-1,p)$ number of vertices in $V\setminus \{v\}$.

Therefore, by revealing the components of $\mathbb{G}(n,m,p)$ using the exploration process described at the beginning of Section 2, we can write

\begin{align*}\mathbb{P}(|\mathcal{C}(V_n)|>k)\leq \mathbb{P}\Biggl(2+\sum_{i=1}^{t}(X_i-1)>0\ \text{for all}\ t\in [k]\Biggr),\end{align*}

where the $X_i$ are i.i.d. compound binomial random variables with moment generating function given in (16).

Using the probability generating function of $X_1$ (which coincides with (16) after substituting ${\textrm{e}}^r$ with r), it is not difficult to show that (for large enough n) the probability $\mathbb{P}(X_1=3)$ is bounded from below by a positive constant which depends solely on $\gamma$ and $\beta$.

Next, in order to apply Theorem 1, we simply need to prove an upper bound for $\mathbb{E}({\textrm{e}}^{rX_1})$. Recalling the expression of the moment generating function of $X_1$ given in (16), we obtain

(17)

\begin{equation}\mathbb{E}({\textrm{e}}^{rX_1})=\exp\biggl\{\lfloor \beta n \rfloor \log\biggl(1-\dfrac{\gamma}{n}+\dfrac{\gamma}{n}\biggl[1+\dfrac{\gamma}{n}({\textrm{e}}^{r}-1)\biggr]^{n-1}\biggr)\biggr\}.\end{equation}

Taking $r\in (0,1)$, we have that ${\textrm{e}}^r-1\leq r+r^2$. Then, since $1+x\leq {\textrm{e}}^x$ for all $x\in \mathbb{R}$, we see that (17) is at most

\begin{equation*}\exp\biggl\{\lfloor \beta n \rfloor \log\biggl(1+\dfrac{\gamma}{n}(\!\exp\{\gamma(r+r^2)\}-1)\biggr)\biggr\}.\end{equation*}

Thus, taking $r<\min\{1,1/2\gamma\}$ (so that $\gamma(r+r^2)<1$), we see that

\begin{equation*}\exp\{\gamma(r+r^2)\} - 1\leq \gamma(r+r^2)+\gamma^2(r+r^2)^2\end{equation*}

and hence, using the fact that $\log\!(1+x)\leq x$ for all $x>-1$, we can write

\begin{align*} \exp\biggl\{\lfloor \beta n \rfloor \log\biggl(1+\dfrac{\gamma}{n}(\!\exp\{\gamma(r+r^2)\}-1)\biggr)\biggr\} \leq \exp\biggl\{\lfloor \beta n \rfloor\biggl(\dfrac{\gamma^2}{n}(r+r^2) + \dfrac{\gamma^3}{n}(r+r^2)^2\biggr)\biggr\}.\end{align*}

Recalling that $\beta \gamma^2=1$, we obtain

\begin{equation*}\exp\biggl\{\lfloor \beta n \rfloor\biggl(\dfrac{\gamma^2}{n}(r+r^2) + \dfrac{\gamma^3}{n}(r+r^2)^2\biggr)\biggr\}\leq \exp \{r+r^2(1+4\gamma)\}.\end{equation*}

Therefore, for all $r\in (0,\min\{1,1/2\gamma\})$, we have

\begin{equation*}\mathbb{E}({\textrm{e}}^{rX_1})\leq {\textrm{e}}^{r+r^2(1+4\gamma)}\end{equation*}

and hence condition (iii) in Theorem 1 is satisfied for $\rho=\min\{1,1/2\gamma\}$, $\delta=1+4\gamma$, and $\epsilon=0$. Hence, taking $k=\lceil A n^{2/3} \rceil$ (which clearly satisfies the requirement $\epsilon \sqrt{k}\leq c$ for $c=0$ as well as the condition $\rho \sqrt{k}\geq 1$) and applying Theorem 1, we obtain

\begin{align*}\mathbb{P}(|\mathcal{C}_{\max}|>\lceil A n^{2/3} \rceil)\leq \dfrac{c_1}{A^{3/2}}\end{align*}

for some finite positive constant $c_1$ which depends solely on $\gamma$ and $\beta$.
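The bound $\mathbb{E}({\textrm{e}}^{rX_1})\leq {\textrm{e}}^{r+r^2(1+4\gamma)}$ can be checked numerically from the closed form (16). The following Python sketch is an illustration under the assumptions $p=\gamma/n$, $m=\lfloor \beta n \rfloor$, and $\beta\gamma^2=1$; the function names and the parameter grid are hypothetical.

```python
import math


def mgf_compound_binomial(r: float, n: int, beta: float, gamma: float) -> float:
    """Evaluate the moment generating function in (16) at r."""
    p = gamma / n
    m = math.floor(beta * n)
    inner = (1.0 - p + p * math.exp(r)) ** (n - 1)
    return (1.0 - p + p * inner) ** m


def mgf_upper_bound(r: float, gamma: float) -> float:
    """Upper bound exp{r + r^2 (1 + 4 gamma)} derived in the proof."""
    return math.exp(r + r * r * (1.0 + 4.0 * gamma))


if __name__ == "__main__":
    gamma = 2.0
    beta = 1.0 / gamma**2          # critical case: beta * gamma^2 = 1
    rho = min(1.0, 1.0 / (2.0 * gamma))
    for n in (50, 1000, 100_000):
        r = 0.5 * rho
        print(n, mgf_compound_binomial(r, n, beta, gamma), mgf_upper_bound(r, gamma))
```

Only values $r\in (0,\rho)$ with $\rho=\min\{1,1/2\gamma\}$ are covered by the argument, so the sketch restricts itself to that range.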

3.3. Proof of Proposition 2

Since $\mathbb{G}$ is d-regular, we can use the exploration process described at the beginning of Section 2 to conclude that

\begin{align*}\mathbb{P}(|\mathcal{C}(V_n)|>k)\leq \mathbb{P}\Biggl(2+\sum_{i=1}^{t}(X_i-1)>0\ \text{for all}\ t\in [k]\Biggr),\end{align*}

where the $X_i$ are i.i.d. random variables with $X_i\sim \operatorname{Bin}\!(d-1,p)$, so that

\[\sum_{i=1}^{k+1}X_i\sim \operatorname{Bin}\!((k+1)(d-1),p).\]

(Note that if we had started the exploration process with only one active vertex, now we would have $\eta_1\sim \operatorname{Bin}\!(d,p)$, and hence in particular it would be impossible to dominate $\eta_1$ with a $\operatorname{Bin}\!(d-1,p)$ random variable.) Using a monotonicity argument we can focus on the (critical) case $p=1/(d-1)$. Note that, since $d>3$,

\begin{align*}\mathbb{P}(X_1=3)=\dfrac{(d-2)(d-3)}{6(d-1)^2}\biggl(1-\dfrac{1}{d-1}\biggr)^{d-4}>0.\end{align*}

Next, for all $r\in (0,1)$, using the inequality $1+x\leq {\textrm{e}}^x$ (valid for all $x\in \mathbb{R}$) we see that

\begin{align*} \mathbb{E}({\textrm{e}}^{rX_1})=\biggl(1+\dfrac{1}{d-1}({\textrm{e}}^r-1)\biggr)^{d-1}\leq {\textrm{e}}^{{\textrm{e}}^r-1}\leq {\textrm{e}}^{r+r^2}.\end{align*}

Hence condition (iii) of Theorem 1 is satisfied for $\rho=\delta=1$ and $\epsilon=0$. Thus, taking $k=\lceil An^{2/3}\rceil$ (which satisfies $\epsilon \sqrt{k}\leq c$ for $c=0$, as well as $\rho \sqrt{k}\geq 1$), we arrive at

\begin{align*}\mathbb{P}(|\mathcal{C}_{\max}|>k)\leq c_2\dfrac{n}{k^{3/2}}\leq \dfrac{c_2}{A^{3/2}}\end{align*}

for some finite positive constant $c_2$ which depends solely on d.
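Both ingredients of this proof are easy to verify numerically. The Python sketch below (illustrative only, with hypothetical function names) compares the displayed formula for $\mathbb{P}(X_1=3)$ with the binomial probability mass function in the critical case $p=1/(d-1)$, and evaluates the moment generating function together with its two upper bounds.

```python
import math


def prob_three(d: int) -> float:
    """P(X_1 = 3) for X_1 ~ Bin(d-1, 1/(d-1)), directly from the binomial pmf."""
    n, p = d - 1, 1.0 / (d - 1)
    return math.comb(n, 3) * p**3 * (1.0 - p) ** (n - 3)


def prob_three_closed_form(d: int) -> float:
    """The expression (d-2)(d-3) / (6 (d-1)^2) * (1 - 1/(d-1))^(d-4)."""
    return (d - 2) * (d - 3) / (6.0 * (d - 1) ** 2) * (1.0 - 1.0 / (d - 1)) ** (d - 4)


def mgf_critical_binomial(r: float, d: int) -> float:
    """E(e^{r X_1}) for X_1 ~ Bin(d-1, 1/(d-1))."""
    return (1.0 + (math.exp(r) - 1.0) / (d - 1)) ** (d - 1)


if __name__ == "__main__":
    for d in (4, 5, 10, 50):
        r = 0.5
        print(d, prob_three(d), prob_three_closed_form(d),
              mgf_critical_binomial(r, d), math.exp(math.exp(r) - 1.0), math.exp(r + r * r))
```

For $d=3$ the binomial coefficient $\binom{2}{3}$ vanishes, which is why the argument requires $d>3$.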

3.4. Proof of Proposition 3

Before starting the actual proof, we need to recall the definition of size-biased distribution of a non-negative random variable and to introduce a few facts.

Definition 1. For a non-negative random variable X with $\mathbb{E}(X)>0$, the size-biased distribution of X, denoted by $X^*$, is the random variable defined by

\begin{equation*}\mathbb{P}(X^*\leq x)=\dfrac{\mathbb{E}(X\unicode{x1D7D9}_{\{X\leq x\}})}{\mathbb{E}(X)}.\end{equation*}

For proofs of the assertions that appear in the statement of the next result, see [8].

Lemma 2. Suppose that $1-F(x)\leq c_Fx^{-(\tau-1)}$ for all $x\geq 0$, for some $c_F>0$ and $\tau>4$. Let $w_i$ be as in (5). Then $\max\{w_i\colon i\in [n]\}\leq (c_F n)^{1/(\tau-1)}$. Moreover, defining

(18)

\begin{equation}F_n(x)= \dfrac{1}{n}\sum_{i=1}^{n}\unicode{x1D7D9}_{\{w_i\leq x\}}\end{equation}

and letting $W_n$ be a random variable with distribution function $F_n$ and size-biased distribution $W_n^*$, we have $\mathbb{E}((W^*_n)^2)\leq C_1$ and $|1-\mathbb{E}(W^*_n)|\leq C_2n^{-{{(\tau-3)}/{(\tau-1)}}}$ for all large enough n, where $C_1$ and $C_2$ are two positive constants which depend on $c_F$ and $\tau$.

As explained in Van der Hofstad [12] (see also [8]), the cluster exploration of $V_n$ in the $NR_n(\textbf{w})$ random graph can be dominated by the total progeny of a (marked mixed-Poisson) branching process. Specifically, following Van der Hofstad [12], we can write

\begin{equation*}\mathbb{P}(|\mathcal{C}(V_n)|>k)\leq \mathbb{P}\Biggl(2+\sum_{i=1}^{t}(X_i-1)>0\ \text{for all}\ t\in [k]\Biggr),\end{equation*}

where the $X_i$ are independent mixed Poisson random variables with $X_i\sim \operatorname{Poi}\!(w_{M_i})$ and the $M_i$ are i.i.d. random variables, all distributed as a random variable M with distribution given by

\[\mathbb{P}(M=m)=\dfrac{w_m}{l_n}, \quad m\in [n].\]

As remarked in [12], a $\operatorname{Poi}\!(w_M)$ random variable converges in distribution to a mixed Poisson random variable with random parameter $W^*$, where $W^*$ is the size-biased distribution of W, the latter being a positive random variable with distribution function F. Therefore $\mathbb{P}(X_1=3)$ converges to $\mathbb{P}(Z=3)$, where $Z\sim \operatorname{Poi}\!(W^*)$. It follows that $\mathbb{P}(X_1=3)\geq \mathbb{P}(Z=3)/2$ for all large enough n, and hence we obtain

\begin{equation*} \mathbb{P}(X_1=3)\geq\dfrac{1}{2}\mathbb{E}(\mathbb{P}(Z=3|W^*))=\mathbb{E}({\textrm{e}}^{-W^*}(W^*)^3)/12>0.\end{equation*}

Taking $r\in (0,1)$, so that ${\textrm{e}}^r-1\leq r+r^2$ and hence ${\textrm{e}}^{w_i({\textrm{e}}^r-1)}\leq {\textrm{e}}^{w_i(r+r^2)}$ for every $i\in [n]$, we obtain

\begin{align*}\mathbb{E}\bigl[{\textrm{e}}^{rX_1}\bigr]&=\sum_{h=0}^{\infty}\,{\textrm{e}}^{rh}\sum_{i\in [n]}^{}\dfrac{w_i}{l_n}\,{\textrm{e}}^{-w_i}\dfrac{w^{h}_i}{h!}\\ &=\sum_{i\in [n]}^{}\dfrac{w_i}{l_n}\,{\textrm{e}}^{w_i({\textrm{e}}^r-1)}\\ &=\exp\Biggl\{\log\Biggl(\sum_{i\in [n]}^{}\dfrac{w_i}{l_n}\,{\textrm{e}}^{w_i({\textrm{e}}^r-1)}\Biggr)\Biggr\}.\end{align*}

If moreover $r<(2\max\{w_i\colon i\in [n]\})^{-1}$, then $w_i(r+r^2)<2r w_i\leq 1$ for every $i\in [n]$ and hence, for $0<r<\min\{1,1/2\max\{w_i\colon i\in [n]\}\}$, we can bound

\begin{align*}\log\Biggl(\sum_{i\in [n]}^{}\dfrac{w_i}{l_n}\,{\textrm{e}}^{w_i({\textrm{e}}^r-1)}\Biggr)&\leq \log \Biggl(\sum_{i\in [n]}^{}\dfrac{w_i}{l_n}(1+w_i(r+r^2)+w^2_i4r^2)\Biggr)\\&=\log\Biggl(1+(r+r^2)\sum_{i\in [n]}^{}\dfrac{w^2_i}{l_n}+4r^2\sum_{i\in [n]}^{}\dfrac{w^3_i}{l_n}\Biggr).\end{align*}

Note that if $W_n$ is a random variable with distribution function $F_n$ given in (18) and $W_n^*$ is its size-biased distribution, then

\begin{equation*}\sum_{i\in [n]}^{}\dfrac{w^2_i}{l_n}=\mathbb{E}(W_n^*) \quad\text{and}\quad \sum_{i\in [n]}^{}\dfrac{w^3_i}{l_n}=\mathbb{E}((W^*_n)^2).\end{equation*}
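These two identities follow directly from Definition 1, since $W_n^*$ takes the value $w_i$ with probability proportional to $w_i/l_n$. The Python sketch below illustrates this with exact rational arithmetic; the weight sequence is a hypothetical example (it is not the sequence defined in (5)), and the function names are assumptions made for illustration.

```python
from fractions import Fraction


def size_biased_pmf(weights):
    """Return {value: probability} for W_n^*, where W_n is uniform on the weights."""
    l_n = sum(weights)
    pmf = {}
    for w in weights:
        # Each index i contributes mass w_i / l_n to the value w_i.
        pmf[w] = pmf.get(w, Fraction(0)) + Fraction(w) / l_n
    return pmf


def moment(pmf, power):
    """Compute E(X^power) for a discrete distribution given as {value: probability}."""
    return sum(prob * value**power for value, prob in pmf.items())


if __name__ == "__main__":
    weights = [Fraction(1, 2), 1, 1, 2, 3]  # hypothetical weights
    l_n = sum(weights)
    pmf = size_biased_pmf(weights)
    print(moment(pmf, 1), sum(w**2 for w in weights) / l_n)
    print(moment(pmf, 2), sum(w**3 for w in weights) / l_n)
```

The same probability vector $(w_i/l_n)_{i\in [n]}$ is the law of the mark M, so $w_M$ and $W_n^*$ have the same distribution.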

Therefore we arrive at

\begin{equation*}\log\Biggl(\sum_{i\in [n]}^{}\dfrac{w_i}{l_n}\,{\textrm{e}}^{w_i({\textrm{e}}^r-1)}\Biggr)\leq \log\bigl(1+(r+r^2)\mathbb{E}(W_n^*)+4r^2\mathbb{E}((W^*_n)^2)\bigr).\end{equation*}

Since, thanks to Lemma 2, we know that $\mathbb{E}((W^*_n)^2)\leq C_1$ and $\mathbb{E}(W^*_n)\leq 1+C_2n^{-{{(\tau-3)}/{(\tau-1)}}}$ for all sufficiently large n (for some finite constants $C_1,C_2>0$ which depend on $c_F$ and $\tau$), we see that

\begin{equation*}\log\bigl(1+(r+r^2)\mathbb{E}(W_n^*)+4r^2\mathbb{E}((W^*_n)^2)\bigr)\leq r\bigl(1+C_2n^{-{{(\tau-3)}/{(\tau-1)}}}\bigr)+r^2(1+5C_1)\end{equation*}

for all large enough n. Summarizing, for all positive $r<\min\{1,1/2\max\{w_i\colon i\in [n]\}\}$ we have shown that

\begin{align*}\mathbb{E}({\textrm{e}}^{rX_1})\leq \exp\bigl\{r\bigl(1+C_2n^{-{{(\tau-3)}/{(\tau-1)}}}\bigr)+r^2(1+5C_1)\bigr\},\end{align*}

provided n is sufficiently large. Hence condition (iii) in Theorem 1 is satisfied for $\rho=\rho(n)=\min\{1,1/2\max\{w_i\colon i\in [n]\}\}$, $\delta=1+5C_1$, and $\epsilon=\epsilon(n)=C_2n^{-{{(\tau-3)}/{(\tau-1)}}}$. Note that $k= \lceil An^{2/3} \rceil$ satisfies

\[\epsilon \sqrt{k}={\textrm{O}}\bigl(A^{1/2}C_2n^{-{{(\tau-4)}/{(3(\tau-1))}}}\bigr)\leq 1\]

for all large enough n since A is fixed and $\tau>4$, and $\rho \sqrt{k}\geq 1$ since

\[(2\max\{w_i\colon i\in [n]\})^{-1}\sqrt{k}\geq A^{1/2}n^{{{(\tau-4)}/{(3(\tau-1))}}}\bigl(2c^{1/(\tau-1)}_F\bigr)^{-1}\geq 1.\]

Hence we can apply Theorem 1 to conclude that

\begin{align*}\mathbb{P}(|\mathcal{C}_{\max}|>\lceil An^{2/3} \rceil)\leq \dfrac{c_3}{A^{3/2}},\end{align*}

for some finite positive constant $c_3$ that depends solely on $c_F$ and $\tau$.
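The exponents of n appearing in the two conditions $\epsilon \sqrt{k}\leq 1$ and $\rho \sqrt{k}\geq 1$ can be double-checked with exact arithmetic. The sketch below is illustrative only (the function names are hypothetical); it uses $\sqrt{k}\approx A^{1/2}n^{1/3}$, $\epsilon=C_2n^{-(\tau-3)/(\tau-1)}$, and $\max_i w_i\leq (c_Fn)^{1/(\tau-1)}$, and it tracks only the power of n, ignoring constants.

```python
from fractions import Fraction


def epsilon_sqrt_k_exponent(tau):
    """Power of n in epsilon * sqrt(k): n^{-(tau-3)/(tau-1)} times n^{1/3}."""
    tau = Fraction(tau)
    return Fraction(1, 3) - (tau - 3) / (tau - 1)


def rho_sqrt_k_exponent(tau):
    """Power of n in sqrt(k) / max_i w_i: n^{1/3} divided by n^{1/(tau-1)}."""
    tau = Fraction(tau)
    return Fraction(1, 3) - 1 / (tau - 1)


if __name__ == "__main__":
    for tau in (Fraction(9, 2), 5, 7, 10):
        print(tau, epsilon_sqrt_k_exponent(tau), rho_sqrt_k_exponent(tau))
```

Under these assumptions the first exponent equals $-2(\tau-4)/(3(\tau-1))$, which is no larger than the exponent $-(\tau-4)/(3(\tau-1))$ displayed in the text, so the stated ${\textrm{O}}(\cdot)$ bound holds; the second exponent equals $(\tau-4)/(3(\tau-1))$, which is positive exactly when $\tau>4$.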

Acknowledgement

The author would like to thank two anonymous referees and M. Roberts for useful suggestions that helped to improve the presentation of the paper. Moreover, the author thanks G. Perarnau and A. Pachon for interesting discussions about random intersection graphs on the occasion of the workshop Graphs and Randomness in Turin (January 2019).

Funding information

The author thanks the Royal Society for his PhD scholarship.

Competing interests

There were no competing interests to declare which arose during the preparation or publication process of this article.

References

[1] Addario-Berry, L. and Reed, B. A. (2008). Ballot theorems, old and new. In Horizons of Combinatorics (Bolyai Society Mathematical Studies 17), pp. 9–35. Springer, Berlin and Heidelberg.

[2] Behrisch, M. (2007). Component evolution in random intersection graphs. Electron. J. Combin. 14, R17.

[3] Bollobás, B. (2001). Random Graphs, 2nd edn. Cambridge University Press, Cambridge.

[4] Bollobás, B., Janson, S. and Riordan, O. (2007). The phase transition in inhomogeneous random graphs. Random Structures Algorithms 31, 3–122.

[5] Chung, F. and Lu, L. (2002). Connected components in random graphs with given expected degree sequences. Ann. Combinatorics 6, 125–145.

[6] Chung, F. and Lu, L. (2003). The average distance in a random graph with given expected degrees. Internet Math. 1, 91–113.

[7] Chung, F. and Lu, L. (2006). The volume of the giant component of a random graph with given expected degrees. SIAM J. Discrete Math. 20, 395–411.

[8] De Ambroggio, U. and Pachon, A. (2020). Simple upper bounds for the largest components in critical inhomogeneous random graphs. Available at arXiv:2012.09001.

[9] De Ambroggio, U. and Roberts, M. I. (2021). Unusually large components in near-critical Erdős–Rényi graphs via ballot theorems. Available at arXiv:2101.05358.

[10] Deijfen, M. and Kets, W. (2009). Random intersection graphs with tunable degree distribution and clustering. Prob. Eng. Inf. Sci. 23, 661–674.

[11] Frieze, A. and Karoński, M. (2015). Introduction to Random Graphs. Cambridge University Press, Cambridge.

[12] van der Hofstad, R. (2013). Critical behaviour in inhomogeneous random graphs. Random Structures Algorithms 42, 480–508.

[13] van der Hofstad, R. (2016). Random Graphs and Complex Networks, Vol. 1 (Cambridge Series in Statistical and Probabilistic Mathematics 43). Cambridge University Press, Cambridge.

[14] Janson, S. (2009). Asymptotic equivalence and contiguity of some random graphs. Random Structures Algorithms 36, 26–45.

[15] Janson, S., Łuczak, T. and Ruciński, A. (2011). Random Graphs. John Wiley, New York.

[16] Joos, F. and Perarnau, G. (2018). Critical percolation on random regular graphs. Proc. Amer. Math. Soc. 146, 3321–3332.

[17] Kager, W. (2011). The hitting time theorem revisited. Amer. Math. Monthly 118, 735–737.

[18] Kang, M., Pachon, A. and Rodriguez, P. M. (2018). Evolution of a modified binomial random graph by agglomeration. J. Statist. Phys. 170, 509–535.

[19] Krivelevich, M. and Sudakov, B. (2012). The phase transition in random graphs: a simple proof. Random Structures Algorithms 43, 131–138.

[20] Lageras, A. N. and Lindholm, M. (2008). A note on the component structure in random intersection graphs with tunable clustering. Electron. J. Combin. 15, N10.

[21] Łuczak, T., Pittel, B. and Wierman, J. C. (1994). The structure of a random graph at the point of the phase transition. Trans. Amer. Math. Soc. 341, 721–748.

[22] Nachmias, A. and Peres, Y. (2010). Critical percolation on random regular graphs. Random Structures Algorithms 36, 111–148.

[23] Nachmias, A. and Peres, Y. (2010). The critical random graph, with martingales. Israel J. Math. 176, 29–41.

[24] Norros, I. and Reittu, H. (2006). On a conditionally Poissonian graph process. Adv. Appl. Prob. 38, 59–75.

[25] Penrose, M. D. (2018). Inhomogeneous random graphs, isolated vertices, and Poisson approximation. J. Appl. Prob. 55, 112–136.

[26] Pittel, B. (2001). On the largest component of the random graph at a nearcritical stage. J. Combinatorial Theory B 82, 237–269.

[27] Roberts, M. I. (2017). The probability of unusually large components in the near-critical Erdős–Rényi graph. Adv. Appl. Prob. 50, 245–271.

[28] Stark, D. (2004). The vertex degree distribution of random intersection graphs. Random Structures Algorithms 24, 249–258.
