1. Introduction and notation
In recent years, higher computer power and better tools from statistics show that there are many natural phenomena which do not follow the standard normal distribution, but rather exhibit different types of memory, sometimes changing these properties over time. Therefore several different types of extensions of standard stochastic processes have been proposed to try to give a more realistic picture of nature corresponding to what we observe. There are several stochastic processes that are popular today for the modeling of various features of memory associated with a random phenomenon. The most elementary example in the continuous case is the so-called fractional Brownian motion, where a fractional kernel is introduced in an appropriate way that allows us to describe memory in the process. Another way of introducing memory in the discontinuous case is to allow the intensity of a compound Poisson process to be dependent on time; see e.g. [Reference Hawkes10]. These processes, generally known as Hawkes processes, are point processes which allow for self-exciting behavior by letting the conditional intensity be dependent on the past events of the processes. It is worth noting that there exists a close relationship between fractional stochastic differential equations and the limit of certain Hawkes processes; see [Reference El Euch and Rosenbaum8]. In the mid-1990s Véhel and Peltier introduced a new way to model memory in a continuous stochastic process. This was achieved by allowing the strength of the memory of the fractional kernel in a fractional Brownian motion to be a function of time; see e.g. [Reference Peltier and Véhel13]. In this note we will consider a continuous type of process which is inspired by the multifractional Brownian motion, by letting the strength of the memory of this fractional kernel depend not only on time but also on the history of the process itself. This allows for greater flexibility in the modeling of phenomena with a complex memory structure. A similar type of process was introduced by Barrière, Echelard and Véhel [Reference Barrière, Echelard and Véhel2], who allowed the memory at a certain time to be fully determined by the value of the process at that particular time. Our approach represents an alternative way of addressing this memory issue, by allowing this fractional kernel to be dependent on the whole path of the process. In particular, this is done by constructing the process as a solution to a singular, stochastic Volterra equation. Multifractional Brownian motion is interesting in its own right, in that it is a non-stationary Gaussian process that has regularity properties changing in time. A simple version of this process is known as the Riemann–Liouville multifractional Brownian motion and can be represented by the integral
where $ \{ B_{t}\} _{t\in[0,T]}$ is a Brownian motion and h is a deterministic function. Interestingly, if we restrict the process to a small interval, say $ [t-\epsilon,t+\epsilon] $, the local $\alpha$-Hölder regularity of this process on that interval is of order $\alpha\sim h(t)$ if $\epsilon$ is sufficiently small. Thus the regularity of the process is dependent on time. Applications of such processes have been found in fields ranging from Internet traffic and image synthesis to finance; see e.g. [Reference Benassi, Jaffard and Roux3], [Reference Bianchi and Pianese4], [Reference Bianchi, Pantanella and Pianese5], [Reference Corlay, Lebovits and Véhel7], [Reference Lebovits and Véhel12], [Reference Pesquet-Popescu and Véhel14], [Reference Pianese, Bianchi and Palazzo15], and [Reference Riedi and Véhel16]. In 2010 Sornette and Filimonov proposed a self-excited multifractal process to be considered in the modeling of earthquakes and financial market crashes; see [Reference Sornette and Filimonov17]. By a self-excited process, they mean a process where the future state depends directly on all the past states of the process. The model they proposed was defined in a discrete manner. They also suggested a possible continuous-time version of their model, but they did not study its existence rigorously. This article is therefore intended as an attempt to propose a continuous-time version of a model similar to that proposed by Sornette and Filimonov, and we will study its mathematical properties.
We will first consider an extension of a multifractional Brownian process, which is found as the solution to the stochastic differential equation
where $ \{ B_{t}\} _{t\in[0,T]}$ is a general d-dimensional Brownian motion, h is bounded and takes values in (0, 1), and f is a non-negative, sufficiently regular function. Already at this point we could think that the local regularity of the process $X^{h,f}$ would be dependent on the history of $X^{h,f}$ via h and f, in a similar manner to the multifractional Brownian motion in equation (1.1). As we can see, the formulation of the process is via a stochastic Volterra equation with a possibly singular kernel. We will therefore show the existence and uniqueness of this equation, and then say that its solution is a self-exciting multifractional gamma process (SEM-gamma) $X^{h,f}$, due to its resemblance to the gamma kernel; see e.g. [Reference Barndorff-Nielsen, Espen Benth and Veraart1]. We will study the probabilistic properties, and discuss examples of functions h and f, which give different dynamics for the process $X^{h,f}$. The process is neither stationary nor Gaussian in general, and is therefore mathematically challenging to apply in any standard model, for example in finance. Nevertheless, the process has some interesting properties in its own right. The study of such processes could also shed some light on natural phenomena whose behaviour is outside the scope of standard stochastic processes, such as the self-excited dynamics of earthquakes, as argued in [Reference Sornette and Filimonov17].
We will first show the existence and uniqueness of equation (1.2) and then study probabilistic and path properties such as variance and Hölder regularity of the process. We will introduce an Euler–Maruyama scheme to approximate the process, and show its strong convergence as well as estimate its rate of convergence. Finally, we will discuss a particular case of the process, where $f=0$, for which we find out that, under certain parametrizations of h, one can also preserve some desired properties observed in the time series generated by the process itself.
1.1. Notation and preliminaries
Let $T>0$ be a fixed constant. We will use the standard notation $L^{\infty} ( [0,T] ) $ for essentially bounded functions on the interval [0, T]. Furthermore, let $\triangle^{(m)} [a,b] $ denote the m-simplex. That is, define $\triangle^{(m)}[a,b]$ to be given by
We will consider functions $k\colon \triangle^{ (2) } ( [0,T] ) \rightarrow\mathbb{R}_{+}$, which will be used as a kernel in an integral operator, in the sense that we consider integrals of the form
whenever the integral is well-defined. We call these functions Volterra kernels.
Definition 1.1. Let $k\colon \triangle^{(2)} ( [0,T] ) \rightarrow\mathbb{R}_{+}$ be a Volterra kernel. If k satisfies
and
then we say that $k\in\mathcal{K}_{0}.$
We will frequently use the constant C to denote a general constant, which might vary throughout the text. When it is important, we will use a subscript to denote what this constant depends upon, i.e. $C=C_{T}$ to denote dependence on T.
2. Zhang’s existence and uniqueness of stochastic Volterra equations
In this section we will assume that $ \{ B_{t}\} _{t\in [0,T] }$ is a d-dimensional Brownian motion defined on a filtered probability space $(\Omega,\mathcal{F}, \{ \mathcal{F}_{t}\} _{t\in [0,T] },\mathbb{P})$. Consider the Volterra equation
where g is a progressively measurable stochastic process and $\sigma\colon \triangle^{(2)} ( [0,T] ) \times\mathbb{R}^{n}\rightarrow\mathcal{L} (\mathbb{R}^{d},\mathbb{R}^{n}) $ is a measurable function, where $\mathcal{L} (\mathbb{R}^{d},\mathbb{R}^{n}) $ is the linear space of $(n\times d)$-matrices.
Next we write a simplified version of the hypotheses for $\sigma$ and g, introduced previously by Zhang in [Reference Zhang20], which will be used to prove that there exists a unique solution to (2.1).
H1. There exists $k_{1}\in\mathcal{K}_{0}$ such that the function $\sigma$ satisfies the following linear growth inequality for all $(s,t)\in\triangle^{(2)} ([0,T]) ,$ and $x\in\mathbb{R}^{n}$:
\begin{equation*} |\sigma (t,s,x) | ^{2}\leq k_{1} (t,s) \big(1+ |x| ^{2}\big) .\end{equation*}-
H2. There exists $k_{2}\in\mathcal{K}_{0}$ such that the function $\sigma$ satisfies the following Lipschitz inequality for all $(s,t)\in\triangle^{(2)} ([0,T]) ,$ $x,y\in\mathbb{R}^{n}$:
\begin{equation*} |\sigma (t,s,x) -\sigma (t,s,y) | ^{2}\leq k_{2} (t,s) |x-y| ^{2}.\end{equation*} -
H3. For some $p\geq2$, we have
\begin{equation*}\sup_{t\in [0,T] }\int_{0}^{t} [k_{1} (t,s) +k_{2} (t,s) ] \cdot\mathbb{E} [ |g_{s}| ^{p}]\,{\textrm{d}} s<\infty,\end{equation*}where $k_{1}$ and $k_{2}$ satisfy H1 and H2.
Based on the above hypotheses, we can use the following tailor-made version of the theorem on existence and uniqueness found in [Reference Zhang20] to show that there exists a unique solution to (2.1).
Theorem 2.1. (X. Zhang.) Assume that $\sigma\colon \triangle^{(2)} ([0,T]) \times\mathbb{R}^{n}\rightarrow\mathcal{L} (\mathbb{R}^{d},\mathbb{R}^{n}) $ is measurable, and that g is an $\mathbb{R}^{n}$-valued, progressively measurable process satisfying H1–H3. Then there exists a unique measurable, $\mathbb{R}^{n}$-valued, progressively measurable process $X_{t}$ satisfying for all $t\in[0,T]$ the equation
Furthermore, for some $C_{T,p,k_{1}}>0$ we have
where p is from H3, and $k_{1}$ is as given in H1.
It will also be useful in future sections to consider the following additional hypothesis.
H4. The process g is continuous and satisfies, for some $\delta>0$ and for any $p\geq2$,
\begin{equation*}\mathbb{E}\biggl[\sup_{t\in [0,T] } |g_{t}| ^{p}\biggr]<\infty\end{equation*}and\begin{equation*}\mathbb{E}[ |g_{t}-g_{s}| ^{p}]\leq C_{T,p} |t-s| ^{\delta p}.\end{equation*}
Remark 2.1. Hypothesis H4 is equivalent to saying that $g\in C^{\delta}$ $\mathbb{P}$-a.s.
3. Self-exciting multifractional gamma processes
In this section we will build a continuous-time process in a way that allows us to capture the intermittency effect suggested in [Reference Sornette and Filimonov17], which is often observed in turbulent data. In order to do so we need to introduce some basic definitions that will serve as building blocks to introduce such processes properly. All proofs of the results stated in this section are given in the appendix.
Definition 3.1. Consider a non-negative function $f\colon [0,T] \times\mathbb{R}\rightarrow\mathbb{R}_{+}$, such that it satisfies the Lipschitz conditions
for all $x,y\in\mathbb{R}$ and $t,t^{\prime}\in [0,T] $. We say that f is a dampening function if it also satisfies the linear growth condition
for all $x\in\mathbb{R}$, $t\in [0,T] $ and some constant $C>0$.
Definition 3.2. Consider a function $h\colon [0,T] \times\mathbb{R}\rightarrow\mathbb{R}_{+}$ with parameters $ (h_{*},h^{*}) $, such that $h_{*}\leq h^{*}$. We say that h is a Hurst function if h (t, x) takes values in $ [h_{*},h^{*}] \subset (0,1) $ for all $x\in\mathbb{R}^{d}$ and $t\in [0,T] $, and h satisfies the Lipschitz conditions
for all $x,y\in\mathbb{R}$, $t,t^{\prime}\in [0,T] $ and some $C>0$.
Consider the stochastic process given formally by the Volterra equation
where g is a progressively measurable, one-dimensional process, $h\colon [0,T] \times\mathbb{R}\rightarrow\mathbb{R}_{+}$, and B is a one-dimensional Brownian motion. The following lemma shows the existence and uniqueness of the above equation by means of Theorem 2.1. We will discuss the continuity properties of the solution further in this section.
Lemma 3.1. Let $\sigma(t,s,x)=\exp (\!-f (t,x) (t-s) ) (t-s) ^{h(t,x)-{{1}/{2}}}$, such that f is a dampening function and h is a Hurst function with parameters $ (h_{*},h^{*}) $ according to Definitions 3.1 and 3.2 respectively. Then we obtain
where
and
It follows that $\sigma$ satisfies H1–H2.
Since the kernel $\sigma$ proposed also satisfies H1–H2, applying Zhang’s theorem (Theorem 2.1) yields the existence and uniqueness for the solution of what we call a self-exciting multifractional gamma process (SEM-gamma), given as the solution to the equation
as well as bounds on p-moments of the following type:
Next we will show the Hölder regularity for the solution of (3.3). In order to do so we need the following preliminary lemmas, Lemmas 3.2 and 3.3.
Lemma 3.2. Let $T>u>v>0.$ Then, for any $\alpha\leq0$ and $\beta\in [0,1] $, we have
and for $\alpha\in (0,1) $
Lemma 3.3. Let $\sigma(t,s,x)=\exp (\!-f (t,x) (t-s) ) (t-s) ^{h(t,x)-{{1}/{2}}}$, such that f is a dampening function and h is a Hurst function with parameters $ (h_{*},h^{*}) $ according to Definitions 3.1 and 3.2 respectively. Then, for any $0<\gamma<2h_{*}$, there exists $\lambda_{\gamma}\colon \triangle^{(3)} ( [0,T] ) \rightarrow\mathbb{R}$ such that
and
for some constant $C_{T,\gamma}>0.$
Now we are in a position such that we can introduce the Hölder continuity properties of the solution to the SEM-gamma process of (3.3) via the following proposition.
Proposition 3.1. Let $ \{ X_{t}^{h,f}\} _{t\in [0,T] }$ be a SEM-gamma process defined as in Lemma 3.1, and assume that g satisfies H4 for some $\delta>0$. Then $X^{h,f}$ admits a version with $\alpha$-Hölder trajectories for any $\alpha<h_{*}\wedge\delta$. In particular, we have
4. Simulation of self-exciting multifractional gamma processes
The aim of this section is to study a discretization scheme for the SEM-gamma processes proposed in the previous sections. In particular, we will consider an Euler-type discretization and prove that this converges strongly to the original process at a rate depending on $h_{*}$. We end the section providing some examples of numerical simulations using the Euler discretization.
4.1. Euler–Maruyama approximation scheme
Consider a time discretization of the interval [0, T], using a step size $\Delta t={{T}/{N}}>0$. The Euler–Maruyama scheme (EM) yields a discrete-time approximation $\bar{X}_{k}^{h,f}$ of the process $X_{t_{k}}^{h,f}$ for $t_{k}=k\Delta t$ with $k\in \{ 0,\ldots,N\} ,$ given by
for all $k\in \{ 1,\ldots,N\} $, where $\Delta B_{i}=B (t_{i+1}) -B (t_{i}) $.
In order to study the approximation error, it is convenient to consider the continuous-time interpolation of $ \{ \bar{X}_{k}^{h,f}\} _{k\in \{ 0,\ldots,N\} }$ given by
for all $t\in [0,T] $, where $\eta\coloneqq \sum_{i=0}^{k-1}t_{i}\cdot\boldsymbol{1}_{[t_{i},t_{i+1}) }$.
Lemma 4.1. Let f be a dampening function, and let h be a Hurst function with parameters $ (h_{*},h^{*}) $ according to Definitions 3.1 and 3.2 respectively. Let $\bar{X}^{h,f}= \{ \bar{X}_{t}^{h,f}\} _{t\in [0,T] }$ be given as in (4.3). Then
and
for any $\gamma<2h_{*}$, where $C_{T}$ and $C_{T,\gamma}$ are positive constants.
The following result is a combination of Theorem 1 and Corollary 2 in [Reference Ye, Gao and Ding18].
Theorem 4.1. Suppose $\beta>0$, a(t) is a non-negative function locally integrable on $0\leq t<T<+\infty$ and g(t) is a non-negative, non-decreasing continuous function defined on $0\leq t<T$, $g(t) \leq M$ (constant), and suppose u(t) is non-negative and locally integrable on $0\leq t<T$ with
on this interval. Then
If in addition a(t) is a non-decreasing function on [0,T), then
where $E_{\beta}$ is the Mittag–Leffler function defined by
Using Lemma 4.1 and Theorem 4.1, we can show the order of convergence for the approximating scheme.
Theorem 4.2. Let f be a dampening function, and let h be a Hurst function with parameters $ (h_{*},h^{*}) $ according to Definitions 3.1 and 3.2 respectively. Let $\bar{X}^{h,f}= \{ \bar{X}_{t}^{h,f}\} _{t\in [0,T] }$ be given as in (4.3). Then the Euler–Maruyama scheme (4.3) satisfies
where $\gamma\in (0,2h_{*}) $ and $C_{T,\gamma,h_{*}}$ is a positive constant which does not depend on N.
Remark 4.1. In [Reference Zhang19], Zhang introduced an Euler-type scheme for stochastic differential equations of Volterra type and showed that his scheme converges at a certain positive rate, without being very precise. A direct application of his result to our case provides a worse rate than the one we obtain in Theorem 4.2, the reason being that, due to our particular kernel, we are able to use a fractional Grönwall lemma, which provides more precise estimates in our proof.
4.2. Examples
The following examples provide some numerical simulations of the SEM-gamma process and also discuss the observed properties in the time series generated for different Hurst and dampening functions.
Example 4.1. Let $h (x) ={{1}/{{(1+x^{2})}}}\in (0,1) ,$ $f=0.5,$ and $ \{ B_{t}\} _{t\in[0,T]}$ be a one-dimensional Brownian motion. Assume $X_{t}^{h,f}$ starts at zero and define the process as given in (1.2). Figure 1(a) shows the plot of h and Figure 1(b) shows a sample path of the process resulting from the implementation of the EM-approximation given by (4.1). The step size used in all simulations is $\Delta t=1/100$.
In [Reference Sornette and Filimonov17], Sornette and Filimonov suggested a class of self-excited processes that may exhibit all stylized facts found in financial time series as heavy tails (asset return distribution displays heavy tails with positive excess kurtosis), absence of autocorrelations (autocorrelations in asset returns are negligible, except for very short time scales $\simeq20$ minutes), volatility clustering (absolute returns display a positive, significant and slowly decaying autocorrelation function) and the leverage effect (volatility measures of an asset are negatively correlated with its returns), among others stated in [Reference Cont and Tankov6]. The SEM-gamma process resembles these properties for any reasonable choice of $h\subset (0,1) $, given a good choice for the mean reversion, i.e. a controlled dampening function f. SEM-gamma processes may also be useful for modeling the log-volatility of financial assets of different nature, in order to capture the roughness in the underlying volatility, as suggested by Gatheral, Jaisson, and Rosenbaum in [Reference Gatheral, Jaisson and Rosenbaum9]. In fact, squaring or taking the exponential of the SEM-gamma process in Figures 2 and 3 seems to capture the varying Hurst index and clustering phenomena observed for the realized volatility time series, as Gatheral et al. [Reference Gatheral, Jaisson and Rosenbaum9] suggest.
Example 4.2. Figure 2 shows the change in the behavior of the Hurst exponent (a transition from rougher values to smoother values, i.e. $h\approx0$ to $h\approx1$) as we progressively change f from lower values to higher values. In particular we compare $f (x) \equiv c$, where $c\in \{ 0,0.5,1,5\} $.
Remark 4.2. Note from Figure 2 that one can control the clustering effect of the increments and the varying regularity of the process by controlling the parameter f, regardless of the Hurst function chosen as h. This type of effect is easy to achieve in the SEM-gamma process for any h given $f\neq0$, but it can also be achieved for $f=0$ and the right choice of the Hurst function h. The clustering effect is desirable in numerous fields, for example in modeling financial markets, when trying to capture shocks in asset prices. Figure 3 shows the rough nature of the process hidden at a lower scale. This is achieved by zooming in close enough at the case $f (x) =5$.
It also makes sense to let f(x) be a function of x rather than a constant, and in particular, if we take $f (x) =h (x) ={{1}/{{(1+x^{2})}}}$, we can see in Figure 4 how the regime switch in the Hurst exponent is less abrupt, favoring a sustained difference of roughness in time.
5. The SEM process: a particular case of the SEM-gamma process
Consider the stochastic process given by the particular case where $f=0$ in (1.2):
This process is the continuous-time version of the discretized version proposed by Sornette and Filimonov in [Reference Sornette and Filimonov17]. All the previous results for existence and uniqueness of the solution hold, as well as the discretization scheme. It is interesting to note that regime switches in the Hurst exponent are only observed depending on the choice made in the parametrization of the Hurst function. In order to illuminate this fact we propose a series of examples.
5.1. Examples
Let us discuss the SEM processes resulting from considering different functions $h\colon \mathbb{R}\rightarrow (0,1) $.
Example 5.1. Let
$f=0$ and $ \{ B_{t}\} _{t\in[0,T]}$ be a one-dimensional Brownian motion. Assume $X_{t}^{h}$ starts at zero and define the process as given in (1.2). Figure 5(a) shows the plot of h and Figure 5(b) shows a sample path of the process resulting from the implementation of the EM-approximation. Note that this process is smoother than a Brownian motion at the origin and rapidly converges to the classical Brownian motion as the process departs from zero. This implies that the value of h approaches $1/2$ quickly and goes back to 1 when the process $X_{t}^{h,f}$ crosses the x-axis. Note that the Hurst exponent as a function of time has very low frequency in regime changes, as is observed from the red line plot.
Example 5.2. Let
$f=0$ and $ \{ B_{t}\} _{t\in[0,T]}$ be a one-dimensional Brownian motion. Assume $X_{t}^{h}$ starts at zero and define the process as given in (1.2). Figure 6(a) shows the plot of h and Figure 6(b) shows a sample path of the process. It is interesting to note that in this case, contrary to the previous example, we have a rougher process than a Brownian motion at the origin, which temporarily resembles the classical Brownian motion as the sample path departs from zero and gets rougher again whenever the process crosses the x-axis. This makes the process go away from zero even faster due to the increased roughness, producing high frequency in regime changes of the Hurst exponent as a function of time.
Example 5.3. Let $h (x) ={{1}/{{(1+x^{2})}}}\in (0,1) $. This is exactly the same case we examined in the previous section when we considered $f=0$. Note that in the earlier case, the Hurst exponent as a function of time quickly collapsed to zero, making the process the roughest possible and preventing any regime switch in the values of the Hurst function. This illuminates the fact that regime switches in the Hurst functions of SEM processes can also be achieved provided that we make the right choice in the parametrization of the function h. SEM-gamma processes are the opposite case where, regardless of the choice made as a Hurst function, the frequency in the regime switch of h is totally controlled by the dampening function $f>0$ in the long run.
Remark 5.1. The plots in Figure 7 show the autocorrelation function of the absolute value in the time series of the increments in the SEM process (Figure 7(a)) from example (5.3) and in the SEM-gamma process (Figure 7(b)) with $f (x) =0.1$. Note that autocorrelation in the second case is clearly much higher.
Appendix A. Proofs
Proofs of all the results from previous sections are given in this appendix.
A.1 Proof of Lemma 3.1
Proof. We prove the three claims in the order they are stated in the lemma, and start by proving (3.1). Remember that f is a non-negative function and
for all $t\in [0,T] $ and $x\in\mathbb{R}$, so elementary computations show that
which yields (3.1) with $k (t,s) =C_{T} (t-s) ^{2h_{*}-1}$. Next we consider (3.2), and using $x=\exp (\log (x) ) $ we write
where $ (t,s) \in\triangle^{ (2) } ( [0,T] ) $, and consider the following inequality derived from the fundamental theorem of calculus:
Using the Lipschitz assumption on h and the above inequality, we can write the following upper bound:
Recalling that $ |{\textrm{e}}^{-f (t,x) (t-s) }| \leq1$ and that f and h are uniformly Lipschitz, then it is readily checked that (A.3) implies
We will now define the kernels $k_{1}$ and $k_{2}$ which will satisfy hypotheses H1–H3. To this end, let $k_{1}$ and $k_{2}$ be given by
It is now readily seen that $k_{1}$ and $k_{2}$ automatically lie in $\mathcal{K}_{0}$, due to the fact that both $h_{1}$ and $h_{2}$ are functions in $L_{loc}^{1} (\mathbb{R}_{+}) $; see e.g. Section 2.1 in [Reference Zhang20]. This concludes the proof.
A.2 Proof of Lemma 3.2
Proof. For $\alpha=0$ the proof is clear. For $\alpha<1$ and $\alpha\neq0,$ using the remainder of Taylor’s formula in integral form we get
where we have used $ |v+\theta (u-v) | ^{\alpha-1}\leq |v| ^{\alpha-1}$. On the other hand, using $ |v+\theta (u-v) | ^{\alpha-1}\leq\theta^{\alpha-1} |u-v| ^{\alpha-1}$ and assuming that $\alpha\in (0,1) $, we obtain
In what follows we will use the interpolation inequality $a\wedge b\leq a^{\beta}b^{1-\beta}$ for any $a,b>0$ and $\beta\in [0,1] $. Consider the case $\alpha<0.$ Using the interpolation inequality with the simple bound $ |u^{\alpha}-v^{\alpha}| \leq2 |v| ^{\alpha}$ and the bound (A.4), we get
Consider the case $\alpha\in (0,1) $. Using the interpolation inequality with the bounds (A.4) and (A.5), we can write
This concludes the proof.
A.3 Proof of Lemma 3.3
Proof. In order to prove (3.4), it is clear that
Furthermore, note that for all $t>t^{\prime}>s>0$ we can add and subtract the term
to get
where
We start by finding an upper bound for the term $J^{1}$. Using (A.2), the fact that ${\textrm{e}}^{-f (t,x) (t-s) }\leq1,$ and that h is Lipschitz in the time argument, by arguments similar to those in Lemma 3.1, we obtain for any $\delta\in (0,1) $, with $s<t<t^{\prime}$,
Next, in order to bound the term $J^{2}$, we add and subtract the quantity ${\textrm{e}}^{-f (t,x) (t-s) }\break (t^{\prime}-s) ^{h (t^{\prime},x) -{{1}/{2}}}$, to obtain
Using (A.2), we can rewrite the above expression as
Then, adding and subtracting $f (t,x) (t'-s) $, and using the linear growth and Lipschitz conditions on f, we obtain
and we have
Now we apply Lemma 3.2 with $u=t-s$, $v=t^{\prime}-s$ and $\alpha=h (t^{\prime},x) -\frac{1}{2}$ to upper-bound the above inequality. Note that since $h (t^{\prime},x) \in [h_{*},h^{*}] \subset (0,1) ,$
Hence, if $\alpha=h (t^{\prime},x) -\frac{1}{2}\leq0$ (this implies $h_{*}\leq1/2$ and $\alpha\in (\!-\frac{1}{2},0) $), we get
for any $\beta_{1}\in (0,1) $. If $\alpha=h(t^{\prime},x)-\frac{1}{2}>0$ (this implies $h^{*}>1/2$ and $\alpha\in (0,\frac{1}{2}) $), we get
where in the second inequality we have chosen
and in the third inequality we used $(1-\alpha)\in (\frac{1}{2},1) $. Therefore we can write the following bound:
where to get the last inequality we chose $\delta=\beta_{1}$ and $\varepsilon=h_{*}-\beta_{1}$. Therefore, defining
and choosing $\gamma$ such that $0<\gamma<2h_{*},$ we can compute
which concludes the proof.
A.4. Proof of Proposition 3.1
Proof. By Lemma 3.1 there exists a unique solution $X_{t}^{h,f}$ to (3.3) with bounded p-order moments. We will show that the $X_{t}^{h}$ also have Hölder-continuous paths. To this end, we will show that for any $p\in\mathbb{N}$ there exists a constant $C>0$ and a function $\alpha$, both depending on p, such that
where we use the increment notation $q_{s,t}=q_{t}-q_{s}.$ From this we apply Kolmogorov’s continuity theorem (e.g. Theorem 2.8 in [Reference Karatzas and Shreve11, page 53]) in order to obtain the claim. Note that the increment of $X_{s,t}^{h,f}$ minus the increment of g satisfies
and thus using that
for any $q\in\mathbb{N},$ we obtain
Clearly, as $h (t,x) \in [h_{*},h^{*}] \subset (0,1) $, by the Burkholder–Davis–Gundy (BDG) inequality we have
Now consider the term $J_{s,t}^{2}$. Again applying BDG inequality together with the bounds (3.4) and (3.5) from Lemma 3.3, we have that for any $\gamma<2h_{*}$
Combining (A.7) and (A.8), we can see that
Furthermore, again using (A.6) we see that
Thus, invoking the bounds from H4 on g, we obtain
and it follows from Kolmogorov’s continuity theorem that $X^{h,f}$ has $\mathbb{P}$-a.s. $\alpha$-Hölder-continuous trajectories with $\alpha\in (0,h_{*}\wedge\delta) $.
A.5. Proof of Lemma 4.1
Proof. Recall that $k (t,s) =C_{T} (t-s) ^{2h_{*}-1}$ and, since $\eta (s) \leq s$, we have the inequality
Using the Itô isometry, that ${\textrm{e}}^{-2f (t,x) }\leq1$, and (A.1) and (A.9), we obtain
To prove the bound (4.4), note that
Due to the Itô isometry, that ${\textrm{e}}^{-2f (t,x) }\leq1$, and (A.1) and (A.9), we obtain the bounds
Again using the Itô isometry and (3.4) and (3.5), for any $\gamma<2h_{*}$ we can write that
where in the second inequality we used $\lambda_{\gamma} (t,t^{\prime},\eta (s) ) \leq\lambda_{\gamma} (t,t^{\prime},s) $, because $\lambda_{\gamma}$ is essentially a negative fractional power of $(t-s)$ and $\eta (s) \leq s$ and also $\mathbb{E}[ |\bar{X}_{t}^{h,f}| ^{2}]\leq C_{T}$, $0\leq t\leq T$, which we have just proved above. Combining the bounds for $\mathbb{E}[ |J_{1}| ^{2}]$ and $\mathbb{E}[ |J_{2}| ^{2}]$, the result follows.
A.6. Proof of Theorem 4.2
Proof. Define
For any $t\in [0,T] $, we can write
First we bound the second moment of $I_{1} (t) $ in terms of a certain integral of $\varphi$. Using the Itô isometry, (3.2), and the Lipschitz property of h, we get
for $\delta>0,$ arbitrarily small. By adding and subtracting $\bar{X}_{s}^{h,f}$, elementary computations show that
Moreover, combining (4.4) and Lemma 4.1 yields
Therefore, choosing $\delta={{{h_{*}}}/{2}}$, we have
Next we find a bound for the second moment of $I_{2} (t) $. Again using the Itô isometry, (3.4) and (3.5), and Lemma 4.1, we can write
for any $\gamma<2h_{*}$, and where we have used that
Combining (A.10) and (A.11), we obtain
Using Theorem 4.1 with $a (t) =C_{T,\gamma,h_{*}} |\Delta t| ^{\gamma}$, $g (t) =C_{T,\gamma,h_{*}}$ and $\beta=h_{*}$, we can conclude that