1. Introduction
We consider a simple model to incorporate information costs in a continuous-time portfolio-consumption problem. In particular, we study a two-stage composite problem under complete and incomplete filtrations sequentially. The drift process of the stock price is assumed to be of Ornstein–Uhlenbeck type. In the first stage from the initial time, the investor needs to pay information costs to access the full market information generated by both drift and stock price processes, in order to update their dynamic distributions and decide the optimal time to enter the second stage. The information costs may include search costs, storage costs, communication costs, the cost of the investor’s attention, or other service costs. In the present paper we consider a simple linear information cost, which is modeled by a constant cost rate and will be subtracted directly from the amount of the investor’s wealth. That is, the longer the first stage is, the higher the information costs the investor needs to be able to afford. Some previous works have addressed the impact of information costs on optimal investment from different perspectives; see [Reference Kang and Stulz18], [Reference Portes and Rey29], [Reference Ahearne, Griever and Warnock1], and [Reference Keppo, Tan and Zhou20]. In our first stage, the mathematical problem becomes an optimal stopping problem under the complete market information filtration. The second stage starts from the chosen entry time, when the investor terminates the full observation of the drift process. From this point on, the investor instead dynamically chooses his investment and consumption levels based on the prior data inputs and free partial observation of the stock price, which can be formulated as an optimal control problem under an incomplete information filtration. As the value function of the interior control problem depends on the stopping time and data inputs of the drift process, the exterior problem can be interpreted as that of waiting in an optimal way so that the input values can achieve the maximum of the interior functional.
Portfolio optimization under partial observation has been extensively studied in past decades; a few examples with different financial motivations can be seen in [Reference Lakner22, Reference Xia34, Reference Brendle8, Reference Monoyios25, Reference Björk, Davis and Landén6, Reference Bo, Liao and Yu7]. As illustrated in these works, the value function under the incomplete information filtration is strictly lower than its counterpart under the full information filtration, and this gap is usually regarded as the value of information. The present paper attempts to study partial observation from a different perspective, where the full market information is available but costly because more data, services, and personal attention are involved. The information costs may change the investor’s attitude towards the usage of full observation, because it is no longer true that the more information he observes, the higher profit he can attain. Moreover, from previous work on partial observation, we know that the value function eventually depends on the given initial input of the random factor such as the drift process. As in [Reference Lakner22, Reference Brendle8], it is conventionally assumed that the initial data of the unobservable drift is a Gaussian random variable, so that the Kalman–Bucy filtering can be applied. We take this input into account and consider a model in which the investor can wait and dynamically update the distribution of inputs using the full market information, subject to information costs. We show that starting sharp from the initial time to invest and consume under incomplete information may not be optimal.
On the other hand, in recent years, habit formation has provided a new paradigm for modeling consumption rate preferences, which better matches some empirical observations; see [Reference Constantinides11, Reference Mehra and Prescott24]. The literature suggests that past consumption patterns may have a continuing impact on an individual’s current consumption decisions. In particular, the use of linear habit formation preferences, in which there exists an index term that stands for the accumulated consumption history, has been widely accepted. Habit formation preferences have been well studied by [Reference Detemple and Zapatero12, Reference Englezos and Karatzas14, Reference Munk26] in complete market models and by [Reference Yu35, Reference Yu36] in incomplete market models. It is noted that the utility function is decreasing in the habit level. In the present paper, we assume that there is no consumption during Stage 1, and the investor starts to gain consumption habits only in Stage 2. Therefore, an early entry time to Stage 2 may not be the optimal decision, because the investor has a longer time to develop a much higher habit level. This is our second motivation for investigating the exterior optimal entry time problem: to see whether waiting longer can benefit the investor more, as the resulting habit level may be much lower and may lead to a higher value function.
We show that the value function of the composite problem is the unique viscosity solution to some variational inequalities. To this end, we can choose to apply either the classical Perron method or the stochastic version of the Perron method introduced in [Reference Bayraktar and Sirbu2]. For the classical Perron method, to establish the equivalence between the value function and the viscosity solution, we have to either prove the dynamic programming principle or upgrade the global regularity of the solution and prove the verification theorem. The convexity (concavity) of the value function with respect to the state variable is usually crucial in standard arguments to conclude global regularity. However, this property is not clear in our composite problem; see Remark 4.1 for details. The global regularity of the value function is not guaranteed, and so the direct verification proof for our exterior problem becomes difficult. Therefore, we instead choose the stochastic Perron method, which allows us to show the equivalence between the value function and the viscosity solution without global regularity. For some related works on optimal stopping using viscosity solutions, we refer to [Reference Reikvam31] and [Reference Pham27]. Recent works on stochastic control problems using the stochastic Perron method include [Reference Bayraktar and Sirbu2, Reference Bayraktar and Sirbu3, Reference Bayraktar and Sirbu4, Reference Bayraktar and Zhang5, Reference Sirbu33, Reference Lee, Yu and Zhou23]. One important step in completing the argument for the stochastic Perron method is the comparison principle for the associated variational inequalities, which is established in the present paper.
The rest of the paper is organized as follows: Section 2 introduces the market model and the habit formation preferences and formulates the two-stage optimization problem. Section 3 gives the main result on the interior utility maximization problem with habit formation and partial observation. Section 4 studies the exterior optimal entry problem with linear information costs. Using the stochastic Perron method, we show that the value function of the composite problem is the unique viscosity solution of some variational inequalities. Some auxiliary results and proofs are reported in Appendices A and B.
2. Mathematical model and preliminaries
2.1. Market model
Given the probability space $(\Omega,\mathbb{F},\mathbb{P})$ with full information filtration $\mathbb{F}=(\mathcal{F}_{t})_{0\leq t\leq T}$ that satisfies the usual conditions, we consider the market with one risk-free bond and one risky asset over a finite time horizon [0, T]. It is assumed that the bond process satisfies $S^{0}_{t}\equiv1$ , for $t\in[0,T]$ , which amounts to the standard change of numéraire.
The stock price $S_{t}$ satisfies
with $S_{0}=s>0$ . Some empirical studies, such as [Reference Brennan and Xia9, Reference Campbell10, Reference Fama and French15, Reference Poterba and Summers30], have observed that the drift process of many risky assets follows the so-called mean-reverting diffusion. We also consider here that the drift process $\mu_t$ in (2.1) satisfies the Ornstein–Uhlenbeck stochastic differential equation (SDE) by
Here, $(W_{t})_{0\leq t\leq T}$ and $(B_{t})_{0\leq t\leq T}$ are $\mathcal{F}_{t}$ -adapted Brownian motions with correlation coefficient $\rho\in[-1,1]$ . For simplicity, the initial value $\mu_0$ of the drift is a given constant. We assume that the market coefficients $\sigma_{S}$ , $\lambda$ , $\bar{\mu}$ , and $\sigma_{\mu}$ are given nonnegative constants based on calibrations from historical data.
It is assumed that the investor starts with initial wealth $x(0)=x_0>0$ at time $t=0$ . Also, starting from the initial time $t=0$ , access to the full market information $\mathcal{F}_t$ generated by W and B incurs information costs $\kappa t$ , where $\kappa>0$ is the constant cost rate per unit time. As stated earlier, the information costs may include storage costs, search costs, communication costs, the cost of the investor’s attention, or other service costs involved in fully observing the market information $\mathcal{F}_t$ . Moreover, to simplify the mathematical problem, it is assumed that from $t=0$ up to a chosen stopping time $\tau$ , the investor purely waits and updates the dynamic distributions of the processes $\mu_t$ and $S_t$ ; he does not invest or consume at all. This assumption makes sense as long as the value of the optimal entry time $\tau$ is short in the model. The dynamic wealth process including the information costs at time t is simply given by a deterministic function $x(t)=x_0-\kappa t$ for any $t\leq \tau$ .
As the full market information filtration is costly, the investor needs to choose optimally an $\mathcal{F}_{t}$ -adapted stopping time $\tau$ to terminate full information acquisition and enter the second stage. From the chosen stopping time $\tau$ , he switches to the partial observation filtration $\mathcal{F}_t^S=\mathcal{F}_{\tau}\bigvee\sigma(S_u\,:\,\tau\leq u\leq t)$ for $\tau\leq t\leq T$ , which is the union of the sigma-algebra $\mathcal{F}_{\tau}$ and the natural filtration generated by the stock price S up to time t. Moreover, for any time $\tau\leq t\leq T$ , the investor chooses a dynamic consumption rate $c_{t}\geq 0$ and decides the amounts $\pi_{t}$ of his wealth to invest in the risky asset (the rest is invested in the bond). Without paying information costs, he can no longer observe the drift process $\mu_{t}$ and Brownian motions $W_{t}$ and $B_{t}$ for $t\geq \tau$ . Therefore, the investment–consumption pair $(\pi_t, c_t)$ is only assumed to be adapted to the partial observation filtration $\mathcal{F}_t^S$ for $\tau\leq t\leq T$ . Recall that at the entry time $\tau$ , the investor only has wealth $x(\tau)=x_0-\kappa \tau$ left. Under the incomplete filtration $\mathcal{F}_t^S$ , the investor’s total wealth process $\hat{X}_{t}$ can be written as
with the initial value $\hat{X}_{\tau}=x(\tau)=x_{0}-\kappa \tau>0$ . Note that $W_t$ is no longer a Brownian motion under the partial observation filtration $\mathcal{F}_t^S$ ; we have to apply the Kalman–Bucy filtering and consider the innovation process defined by
which is a Brownian motion under $\mathcal{F}^{S}_{t}$ . The best estimate of the unobservable drift process $\mu_t$ under $\mathcal{F}_t^S$ is the conditional expectation process $\hat{\mu}_{t}=\mathbb{E}\big[\mu_{t}\big{|}\mathcal{F}^{S}_{t}\big]$ , for $\tau\leq t\leq T$ with the initial input $\hat{\mu}_{\tau}=\mu_{\tau}$ , $\mathbb{P}$ -almost surely (a.s.), at the stopping time $\tau$ where the distribution of $\mu_{\tau}$ is determined via (2.2) by paying information costs up to $\tau$ . By standard Kalman–Bucy filtering (see Equation (18) of [Reference Brendle8] or Equation (21) of [Reference Monoyios25]), $\hat{\mu}_t$ satisfies the SDE
with $\hat{\mu}_{\tau}=\mu_{\tau}$ , $\mathbb{P}$ -a.s. Moreover, the conditional variance $\hat{\Sigma}(t)=\mathbb{E}\big[\big(\mu_{t}-\hat{\mu}_{t}\big)^2\big{|}\mathcal{F}^{S}_{t}\big]$ satisfies the deterministic Riccati ordinary differential equation (ODE) (see Equation (19) of [Reference Brendle8] or Equation (23) of [Reference Monoyios25])
with the initial value $\hat{\Sigma}(\tau)= \mathbb{E}\big[\big(\mu_{\tau}-\hat{\mu}_{\tau}\big)^2\big{|}\mathcal{F}^{S}_{t}\big] = 0$ , given that $\hat{\mu}_{\tau} = \mu_{\tau}$ , $\mathbb{P}$ -a.s. This can be solved explicitly by
where $k=\lambda^2\sigma_{S}^2+2\sigma_{S}\sigma_{\mu}\lambda\rho+\sigma_{\mu}^2$ , $k_{1}=\sqrt{k}\sigma_{S}+\Big(\lambda\sigma_{S}^2+\sigma_{S}\sigma_{\mu}\rho\Big)$ , and $k_{2}=-\sqrt{k}\sigma_{S}+\Big(\lambda\sigma_{S}^2+\sigma_{S}\sigma_{\mu}\rho\Big)$ .
For the second-stage dynamic control problem, we employ habit formation preferences. In particular, we define $Z_{t}\,:\!=\, Z(c_{t})$ as the habit formation process or the standard of living process, which describes the level of the investor’s consumption habits. It is conventionally assumed that the cumulative preference $Z_{t}$ satisfies the recursive equation $dZ_{t}=(\delta(t)c_{t}-\alpha(t)Z_{t})dt$ , $\tau\leq t\leq T$ (see [Reference Detemple and Zapatero12]), where $Z_{\tau}=z_{0}\geq 0$ is called the initial consumption habit of the investor. Equivalently, we have
which is the exponentially weighted average of the initial habit and the past consumption. Here, the deterministic discount factors $\alpha(t)\geq 0$ and $\delta(t)\geq 0$ measure, respectively, the persistence of the past level and the intensity of the consumption history. In the present paper we are interested in addictive habits; namely, we require that the investor’s current consumption strategies never fall below the level of standard of living, i.e. $c_{t}\geq Z_{t}$ a.s., for $\tau\leq t\leq T$ .
Under the partial observation filtration $(\mathcal{F}^{S}_{t})_{\tau\leq t\leq T}$ , the stock price dynamics (2.1) can be rewritten as $dS_{t}=\hat{\mu}_{t}S_{t}dt+\sigma_{S}S_{t}d\hat{W}_{t}$ , and the wealth dynamics (2.3) can be rewritten as $d\hat{X}_{t}=(\pi_{t}\hat{\mu}_{t}-c_{t})dt+\sigma_{S}\pi_{t}d\hat{W}_{t}$ , $\tau\leq t\leq T$ . To facilitate the formulation of the stochastic control problem and the derivation of the dynamic programming equation, for any $t\in [0,T]$ , we denote by $\mathcal{A}_{t}(x)$ the time-modulated admissible set of the pair of investment and consumption processes $(\pi_{s},c_{s})_{t\leq s\leq T}$ with the initial wealth $\hat{X}_t=x$ , which is $\mathcal{F}^{S}_{s}$ -progressively measurable and satisfies the integrability conditions $\int^{T}_{t}\pi_{s}^2ds<+\infty$ , $\mathbb{P}$ -a.s., and $\int^{T}_{t}c_{s}ds<+\infty$ , $\mathbb{P}$ -a.s., with the addictive habit formation constraint that $c_{s}\geq Z_{s}$ , $\mathbb{P}$ -a.s., $t\leq s\leq T$ . Moreover, no bankruptcy is allowed, i.e., the investor’s wealth remains nonnegative: $\hat{X}_{s}\geq 0$ , $\mathbb{P}$ -a.s., $t\leq s\leq T$ .
2.2. Problem formulation
The two-stage optimal decision-making problem is formulated as a composite problem involving the optimal stopping and the stochastic control afterwards, which is defined by
In particular, starting from the chosen stopping time $\tau$ , we are interested in utility maximization on consumption with habit formation, in which the power utility function $U(x)=x^p/p$ is defined on the difference $c_t-Z_t$ . To simplify the presentation, in the present paper we only consider the case of a risk aversion coefficient $p<0$ . The indirect utility process of the interior control problem is denoted by
To determine the exterior optimal stopping time, we need to maximize over the inputs of the values $\tau$ , $\hat{X}_{\tau}$ , $Z_{\tau}$ , and $\hat{\mu}_{\tau}$ . Recall that the investor does not manage his investment and consumption before $\tau$ ; it follows that $\hat{X}_{\tau}=x_0-\kappa\tau$ , $Z_{\tau}=z_0$ , and $\hat{\Sigma}(\tau)=0$ can all be taken as parameters instead of variables. That is, $\mu_{\tau}=\hat{\mu}_{\tau}$ is the only random input, and we can regard $\mu_t$ as the only underlying state process. Therefore, the dynamic counterpart of (2.6) is defined by
Remark 2.1. We focus on the case $p<0$ in the present paper because then the functions A(t, s), B(t, s), and C(t, s) introduced later, which are solutions to the ODEs (3.4), (3.5), and (3.6), are all bounded, and the utility U(x) is also bounded from above, which significantly simplifies the proof of the verification result in Theorem 3.1 and the comparison results in Proposition 4.1. The other case, $0<p<1$ , can essentially be handled in a similar way. However, in that case, as the process $\hat{\mu}_t$ in (2.4) is unbounded and the functions A(t, s), B(t, s), and C(t, s) may explode at some $t\in [0,T]$ , one needs some additional parameter assumptions to guarantee integrability conditions and martingale properties in the proofs of some of the main results.
Assumption 2.1. In accordance with Remark 3.1 for the interior control problem, it is assumed from this point onwards that $x_0-\kappa t>z_0 m(t)$ for any $0\leq t\leq T$ ; i.e. the initial wealth is sufficiently large, after paying information costs, so that the interior control problem is well defined for any $0\leq t\leq T$ , where m(t) is defined by
We note that m(t) in (2.8) represents the cost of subsistence consumption per unit of standard of living at time t, because the interior control problem is solvable if and only if $\hat{X}_t^{*}\geq m(t)Z_t$ , $0\leq t\leq T$ ; see Lemma B.1.
The function $\widehat{V}$ can be solved in the explicit form given in (3.7) later. The process $\widetilde{V}(t,\mu_t;\, x_0-\kappa t, z_0)$ with the function $\widetilde{V}$ defined in (2.7) is the Snell envelope of the process $\widehat{V}(t, x_0-\kappa t, z_0, \mu_t)$ above. The function $\widetilde{V}$ in (2.7) can therefore be written as
The continuation region, interpreted as the region where the investor continues to use full information observations to update the input value, is denoted by $\mathcal{C}=\{(t,\eta)\in[0,T)\times\mathbb{R}\,:\,\widetilde{V}(t,\eta;\, x_0-\kappa t, z_0)>\widehat{V}(t, x_0-\kappa t,z_0, \eta)\},$ and the free boundary is $\partial\mathcal{C}=\{(t,\eta)\in[0,T)\times\mathbb{R}\,:\,\widetilde{V}(t,\eta;\, x_0-\kappa t, z_0)=\widehat{V}(t, x_0-\kappa t,z_0, \eta)\}$ . Let us denote $\widetilde{V}(t,\eta;\, x_0-\kappa t, z_0)$ by $\widetilde{V}(t,\eta)$ for short when there is no possibility of confusion. By heuristic arguments, we can write the Hamilton–Jacobi–Bellman (HJB) variational inequalities with the terminal condition $\widetilde{V}(T,\eta)=0$ , $\eta\in\mathbb{R}$ , as
where
To simplify notation in the following sections, we shall rewrite (2.9) as
with the operator $F(t,\eta,v,v_t,v_\eta,v_{\eta\eta})\,:\!=\,\mbox{min}\left\{ v-\widehat{V}, \ \ -\frac{\partial v}{\partial t}-\mathcal{L}v\right\}$ .
Remark 2.2. The part $-\frac{\partial \widetilde{V}}{\partial t}-\mathcal{L}\widetilde{V}=0$ in (2.9) is a linear parabolic partial differential equation (PDE) and does not depend on the interior control $(\pi,c)$ . The comparison part $\widetilde{V} - \widehat{V}$ in (2.9) depends on the optimal control $(\pi,c)$ because the $\widehat{V}$ is the value function of the interior control problem provided the input $\hat{X}_t=x_0-\kappa t$ , $Z_t=z_0$ , and $\hat{\mu}_t=\mu_t=\eta$ .
The next theorem is the main result of this paper.
Theorem 2.1. The quantity $\widetilde{V}(t,\eta)$ defined in (2.7) is the unique bounded and continuous viscosity solution to the variational inequalities (2.9). In addition, the optimal entry time for the composite problem (2.7) is given by the $\mathcal{F}_t$ -adapted stopping time
We also have that the process $\widetilde{V}(t,\mu_t;\, x_0-\kappa t,z_0)$ is a martingale with respect to the full information filtration $\mathcal{F}_t$ for $0\leq t\leq \tau^*$ .
The proof will be provided in Section 4.
3. Interior utility maximization under partial observation
We first solve the interior stochastic control problem under partial observation of stock prices.
3.1. Optimal consumption with Kalman–Bucy filtering
For some fixed time $0\leq k\leq T$ , the dynamic interior stochastic control problem under habit formation is defined by
where $\mathcal{A}_k(x)$ denotes the admissible control space starting from time k. Here, as the conditional variance $\hat{\Sigma}(t)$ is a deterministic function of time, we set $\theta$ as a parameter instead of a state variable.
By using the optimality principle and Itô’s formula, we can heuristically obtain the HJB equation as
with the terminal condition $V(T,x,z,\eta)=0$ .
3.2. The decoupled solution and main results
If $V(t,x,z,\eta)$ is smooth enough, the first-order condition gives
Thanks to the homogeneity property of the power utility, we conjecture the value function to have the form
for some functions $m(t,\eta)$ and $N(t,\eta)$ to be determined. It also follows that the terminal condition that $N(T,\eta)=0$ is required. In particular, we find that the simple ansatz of $m(t,\eta)\,:\!=\,m(t)$ satisfies the equation (2.8). After substitution, the HJB equation reduces to the linear parabolic PDE for $N(t,\eta)$ as
with $N(T,\eta)=0$ . We can further solve the linear PDE explicitly by
for $k\leq t\leq s\leq T$ . A(t, s), B(t, s) and C(t, s) satisfy the following ODEs:
with terminal conditions $A(s,s)=B(s,s)=C(s,s)=0$ . The explicit solutions of the ODEs (3.4), (3.5), and (3.6) are reported in Appendix A. For fixed $t\in[k,T]$ , we can define the effective domain of the pair (x, z) by $\mathbb{D}_t\,:\!=\,\{(x^{\prime},z^{\prime})\in(0,+\infty)\times[0,+\infty);\ x^{\prime}\geq m(t)z^{\prime}\}$ , where $k\leq t\leq T$ . The HJB equation (3.2) admits a classical solution on $[k,T]\times\mathbb{D}_t\times\mathbb{R}$ , given by
Remark 3.1. The effective domain of $V(t,x,z,\eta)$ requires some constraints on the optimal wealth process $\hat{X}^*_t$ and habit formation process $Z^*_t$ such that $\hat{X}^*_t\geq m(t)Z^*_t$ for $t\in[k,T]$ . In particular, we have to enforce the initial wealth-habit budget constraint that $\hat{X}_k\geq m(k)Z_k$ at the initial time k.
Theorem 3.1. (Verification theorem) If the initial budget constraint $\hat{X}_k\geq m(k)Z_k$ holds at time k, the unique solution (3.7) of the HJB equation equals the value function defined in (3.1), i.e., $V(k,x,z,\eta)=\widehat{V}(k,x,z,\eta)$ . Moreover, the optimal investment policy $\pi^*_t$ and optimal consumption policy $c^*_t$ are given in feedback form by $\pi^*_t=\pi^*(t,\hat{X}^*_t,Z^*_t,\hat{\mu}_t)$ and $\ c^*_t=c^*(t,\hat{X}^*_t,Z^*_t,\hat{\mu}_t)$ , $k\leq t\leq T$ . The function $\pi^*(t,x,z,\eta)\,:\,[k,T]\times\mathbb{D}_t\times\mathbb{R}\rightarrow\mathbb{R}$ is given by
and the function $c^*(t,x,z,\eta)\,:\,[k,T]\times\mathbb{D}_t\times\mathbb{R}\rightarrow\mathbb{R}^+$ is given by
The optimal wealth process $\hat{X}^*_t$ , $k\leq t\leq T$ , is given by
4. Exterior optimal stopping problem
4.1. Stochastic Perron method
We next study the exterior optimal entry problem. Recall that $\hat{X}_{\tau}=x_0-\kappa\tau$ , $Z_{\tau}=z_0$ , and $\hat{\Sigma}(\tau)=0$ are all taken as parameters. Our aim is to solve an optimal stopping problem in which $\mu_t$ is the only underlying state process.
Remark 4.1. Recall that the interior value function $\widehat{V}$ is of the form in (3.7). Moreover, by Remark A.1, the functions A(t, s) and B(t, s) in (3.7) satisfy $A(t,s)\leq 0$ and $B(t,s)\leq 0$ since $p<0$ . That is, if we take $\widehat{V}(\tau, \hat{\mu}_{\tau})$ as a functional of the input $\hat{\mu}_{\tau}$ , it is not globally convex or concave in $\hat{\mu}_{\tau}\in\mathbb{R}$ , because the function $\exp\left({A(t,s)\eta^2+B(t,s)\eta+C(t,s)}\right)$ is not globally convex or concave in the variable $\eta\in\mathbb{R}$ , which depends on values of A(t, s) and B(t, s). Therefore, the composite value function $\widetilde{V}(t,\eta)$ in (2.7) is not globally convex or concave in $\eta\in\mathbb{R}$ , which actually depends on all model parameters.
We choose to apply the stochastic Perron method in the present paper to verify that the value function of the composite problem is the unique viscosity solution of some variational inequalities. We first introduce sets of stochastic semi-solutions $\mathcal{V}^+$ and $\mathcal{V}^-$ and prove that $v^-\leq \widetilde{V}\leq v^+$ , where $v^-$ and $v^+$ are defined later in (4.2) and (4.3). Using the stochastic Perron method, we can show that $v^+$ is a bounded and upper semi-continuous (u.s.c.) viscosity subsolution and $v^-$ is a bounded and lower semi-continuous (l.s.c.) viscosity supersolution. Finally, we prove the comparison principle: that is, if we have any bounded and u.s.c. viscosity subsolution u and any bounded and l.s.c. viscosity supersolution v of (2.10), we must have the order $u\leq v$ . It follows that $v^+\leq v^-$ , which leads to the desired conclusion that $v^-= \widetilde{V}= v^+$ and the value function is the unique viscosity solution.
We next present the definitions of stochastic semi-solutions, which are mainly motivated by [Reference Bayraktar and Sirbu4].
Definition 4.1. The set of stochastic supersolutions for the PDE (2.10), denoted by $\mathcal{V}^+$ , is the set of functions $v\,:\,[0,T]\times\mathbb{R}\longrightarrow\mathbb{R}$ which have the following properties:
-
(i) The function v is u.s.c. and bounded on $[0,T]\times\mathbb{R}$ , and $v(t,\eta)\geq \widehat{V}(t,x_0-\kappa t, z_0,\eta)$ for any $(t,\eta)\in[0,T]\times\mathbb{R}$ .
-
(ii) For each $(t,\eta)\in[0,T]\times\mathbb{R}$ and any stopping time $t\leq \tau_1\in \mathcal {T}$ , we have $v(\tau_1,\mu_{\tau_1})\geq\mathbb{E}[v(\tau_2,\mu_{\tau_2})|\mathcal{F}_{\tau_1}]$ , $\mathbb{P}$ -a.s., for any $\tau_2\in\mathcal{T}$ and $\tau_2\geq \tau_1$ . That is, the function v along the solution of the SDE (2.2) is a supermartingale under the full information filtration $(\mathcal {F}_t)_{t\in[0,T]}$ between $\tau_1$ and T.
Definition 4.2. The set of stochastic subsolutions for the PDE (2.10), denoted by $\mathcal{V}^-$ , is the set of functions $v\,:\,[0,T]\times \mathbb{R}\longrightarrow\mathbb{R}$ which have the following properties:
-
(i) The function v is l.s.c. and bounded on $[0,T]\times \mathbb{R}$ , and $v(T,\eta)\leq 0$ for any $\eta\in\mathbb{R}$ .
-
(ii) For each $(t,\eta)\in[0,T]\times \mathbb{R}$ and any stopping time $t\leq \tau_1\in \mathcal {T}$ , we have $v(\tau_1,\mu_{\tau_1})\leq\mathbb{E}[v(\tau_2\wedge \zeta,\mu_{\tau_2\wedge \zeta})|\mathcal{F}_{\tau_1}]$ , $\mathbb{P}$ -a.s., for any $\tau_2\in\mathcal{T}$ and $\tau_2\geq\tau_1$ . That is, the function v along the solution to (2.2) is a submartingale under the full information filtration $(\mathcal {F}_t)_{t\in[0,T]}$ between $\tau_1$ and $\zeta$ , where
(4.1) \begin{align} {} {}\zeta\,:\!=\,\inf\big\{t\in [\tau_1, T]\,:\, v(t,\mu_t;\, x_0-\kappa t, z_0)\geq \widehat{V}(t, x_0-\kappa t,z_0, \mu_t)\big\}. {}\end{align}
Remark 4.1. We note that the definitions of stochastic supersolutions and stochastic subsolutions for the optimal stopping problem are not symmetric, which is consistent with the similar definitions in [Reference Bayraktar and Sirbu4]. The main reason for these differences comes from the natural supermartingale property of the Snell envelop process and its martingale property between the initial time and the first hitting time $\zeta$ in (4.1). That is, we naturally need $v(t,\eta)\geq \widehat{V}(t,x_0-\kappa t, z_0,\eta)$ for all $(t,\eta)\in[0,T]\times\mathbb{R}$ , including the terminal time T, in item (i) of Definition 4.1 (the definition of stochastic supersolutions), but we only require $v(T,\eta)\leq \widehat{V}(T,x_0-\kappa t, z_0,\eta)=0$ at the terminal time T in item (i) of Definition 4.2 (the definition of stochastic subsolutions). These comparison results and the supermartingale and submartingale properties will play important roles in the establishment of the desired sandwich result $v^-\leq \widetilde{V}\leq v^+$ in Lemma 4.4.
Lemma 4.1. $\widehat{V}(t,x_0-\kappa t, z_0, \eta;\, 0)$ is bounded and continuous for $(t,\eta)\in [0,T]\times \mathbb{R}$ .
Proof. For fixed $x_0$ and $z_0$ , it is clear that $\widehat{V}(t,x_0-\kappa t,z_0,\eta)$ in (3.7) is continuous and $\widehat{V}(t,x_0-\kappa t,z_0,\eta)\leq 0$ . Therefore we only need to show that $\widehat{V}$ is bounded below. By Appendix A, we know that $A(u)\leq 0$ , $B(u)\leq 0$ , and $C(u)\leq K$ for some $K\geq 0$ , thanks to $p<0$ . We hence obtain that $\big(A(u)\eta^2+B(u)\eta+C(u)\big)\leq K_1$ for some $K_1>0$ , and it follows that $\widehat{V}(t,x_0-\kappa t,z_0,\eta)$ is bounded below by some constant for $(t,\eta)\in [0,T]\times \mathbb{R}$ , again by $p<0$ .
As it is trivial to see that $0\in\mathcal{V}^-$ and $0\in\mathcal{V}^+$ , we have the following result.
Lemma 4.2. $\mathcal{V}^+$ and $\mathcal{V}^-$ are nonempty.
Definition 4.3. We define
The next result is similar to Lemma 2.2 of [Reference Bayraktar and Sirbu2].
Lemma 4.3. We have $v^-\in\mathcal{V}^-$ and $v^+\in\mathcal{V}^+$ .
We now have the first important sandwich result.
Lemma 4.4. We have $v^-\leq \widetilde{V}\leq v^+$ .
Proof. For each $v\in\mathcal{V}^+$ , let us consider $\tau_1=t\geq 0$ in Definition 4.1. For any $\tau\geq t$ , we have $v(t,\eta)\geq\mathbb{E}[v(\tau,\mu_{\tau})|\mathcal{F}_t]\geq\mathbb{E}\big[\widehat{V}\big(\tau,x_0-\kappa\tau,z_0,\mu_\tau\big)|\mathcal{F}_t\big]$ thanks to the supermartingale property in Definition 4.1. It follows that $v(t,\eta)\geq{\textrm{esssup}_{t\leq \tau}}\ \mathbb{E}\big[\widehat{V}(\tau,x_0-\kappa\tau,z_0, \mu_\tau)|\mathcal{F}_t\big]$ . This implies that $v(t,\eta)\geq\widetilde{V}(t,\eta)$ in view of the definition of $\widetilde{V}(t,\eta)$ , and hence $\widetilde{V}\leq v^+$ by the definition in (4.3). On the other hand, for each $v\in\mathcal{V}^-$ , by taking $\tau_1=t\geq 0$ in Definition 4.2, we have $v(t,\eta)\leq\mathbb{E}[v(\tau\wedge\zeta,\mu_{\tau\wedge\zeta})|\mathcal{F}_t]$ for any $\tau\geq t$ because of the submartingale property in Definition 4.2. In particular, using the definition of $\zeta$ , we further have
It then follows that $\widetilde{V}\geq v^-$ because of (4.2). In conclusion, we have the inequality $v^-\leq \widetilde{V}\leq v^+$ .
Theorem 4.1. The function $v^-$ in Definition 4.3 is a bounded and l.s.c. viscosity supersolution of
and the function $v^+$ in Definition 4.3 is a bounded and u.s.c. viscosity subsolution of
Proof. We follow some arguments from [Reference Bayraktar and Sirbu2, Reference Bayraktar and Sirbu4], modifying them to fit our setting.
(i) The subsolution property of $v^+$ . First, the definition in (4.3) and Lemma 4.3 imply that $v^+$ is bounded and u.s.c. Suppose $v^+$ is not a viscosity subsolution; then there exists some interior point $(\bar{t},\bar{\eta})\in(0,T)\times\mathbb{R}$ and a $C^{1,2}$ test function $\varphi\,:\,[0,T]\times\mathbb{R}\rightarrow\mathbb{R}$ such that $v^+ - \varphi$ attains a strict local maximum that is equal to zero and $F(\bar{t},\bar{\eta},v,v_{\bar{t}},v_{\bar{\eta}},v_{\bar{\eta}\bar{\eta}})>0$ . It follows that
There exists a ball $B(\bar{t},\bar{\eta},\varepsilon)$ small enough that
In addition, as $\varphi(\bar{t},\bar{\eta}) = v^+(\bar{t},\bar{\eta})> \widehat{V}(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta})$ , $\varphi$ is continuous, and $\widehat{V}$ is continuous, we can derive that for some $\varepsilon$ small enough, we have $\varphi-\varepsilon\geq\widehat{V}$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}$ . Because $v^+-\varphi$ is u.s.c. and $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})$ is compact, it then follows that there exists a $\delta>0$ such that $\varphi-\delta\geq v^+$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})$ .
If we choose $0<\xi<\delta\wedge\varepsilon$ , the function $\varphi^\xi=\varphi-\xi$ satisfies
and $\varphi^\xi(\bar{t},\bar{\eta})=v^+(\bar{t},\bar{\eta})-\xi$ .
Let us define an auxiliary function by
It is easy to check that $v^\xi$ is u.s.c. and $v^\xi(\bar{t},\bar{\eta})=\varphi^\xi(\bar{t},\bar{\eta})<v^+(\bar{t},\bar{\eta})$ . We claim that $v^\xi$ satisfies the terminal condition. To this end, we pick some $\varepsilon>0$ that satisfies $T>\bar{t}+\varepsilon$ and recall that $v^+$ satisfies the terminal condition. We then continue to show that $v^\xi\in \mathcal{V}^+$ to obtain a contradiction.
Let us fix $(t,\eta)$ and recall that $((\mu_s)_{t\leq s\leq T},(W_s,B_s)_{t\leq s\leq T},\Omega,\mathcal{F},\mathbb{P},(\mathcal{F}_s)_{t\leq s\leq T})\in\chi$ , where $\chi$ is the nonempty set of all weak solutions. We need to show that the process $(v^\xi(s,\mu_s))_{t\leq s\leq T}$ is a supermartingale on $(\Omega,\mathbb{P})$ with respect to $(\mathcal{F}_s)_{t\leq s\leq T}$ . We first assume that $(v^+(s,\mu_s))_{t\leq s\leq T}$ has right-continuous paths. In this case, $v^\xi$ is a supermartingale locally in the region $[t,T]\times\mathbb{R}\backslash B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)$ because it equals the right-continuous supermartingale $(v^+(s,\mu_s))_{t\leq s\leq T}$ . As the process $(v^\xi(s,\mu_s))_{t\leq s\leq T}$ is the minimum between two local supermartingales in the region $B(\bar{t},\bar{\eta},\varepsilon)$ , it is a local supermartingale. As the two regions $[t,T]\times\mathbb{R}\backslash B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)$ and $B(\bar{t},\bar{\eta},\varepsilon)$ overlap over an open region, $(v^\xi(s,\mu_s))_{t\leq s\leq T}$ is actually a supermartingale.
If the process $(v^+(s,\mu_s))_{t\leq s\leq T}$ is not right-continuous, we can consider its right-continuous limit over rational times to transform it to the special case discussed above. In particular, for a given rational number r and fixed $0 \leq t\leq r \leq s \leq T$ and $\eta\in\mathbb{R}$ , it remains to show the process $(Y_u)_{t\leq u\leq T}\,:\!=\,(v^\xi(u,\mu_u))_{t\leq u\leq T}$ between r and s is a supermartingale, which is equivalent to showing that $Y_r \geq \mathbb{E}[Y_s|\mathcal{F}_r]$ .
Let us define $G_u\,:\!=\,v^+(u,\mu_u), \ r \leq u \leq s,$ and freeze the process G after time s, i.e. $G_u\,:\!=\,v^+(s,\mu_s), \ s \leq u \leq T$ . As $(G_u)_{r\leq u\leq T}$ may not be right-continuous, by Proposition 1.3.14 in [Reference Karatzas and Shreve19], we can consider its right-continuous modification
Note that $G^+$ is a right-continuous supermartingale with respect to $\mathcal{F}$ that satisfies the usual conditions. Because $v^+$ is u.s.c. and the process remains the same after s, we conclude that $G_r\geq G_r^+, \ G_s=G_s^+$ . Recall that $v^+<\varphi-\delta$ in the open region $B(\bar{t},\bar{\eta},\varepsilon)\backslash\overline{B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)}$ ; if we take right limits inside this region and use the continuous function $\varphi$ , we have
Thus, if we consider the process
we also have $Y_r\geq Y_r^+, \ Y_s=Y_s^+$ .
Because $G^+$ has right-continuous paths, we can conclude that Y is a supermartingale such that
(ii) The terminal condition for $v^+$ .
For some $\eta_0\in\mathbb{R}$ , we assume that $v^+(T,\eta_0)>0$ and will show a contradiction. As $\widehat{V}$ is continuous on $\mathbb{R}$ , we can choose an $\varepsilon>0$ such that $0 \leq v^+(T,\eta_0)-\varepsilon$ and $|\eta-\eta_0|\leq\varepsilon$ . On the compact set $(\overline{B(T,\eta_0,\varepsilon)}\backslash B(T,\eta_0,\frac{\varepsilon}{2}))\cap([0,T]\times\mathbb{R})$ , $v^+$ is bounded above by the definition of $\mathcal{V}^+$ and the fact that $v^+\in\mathcal{V}^+$ . Moreover, as $v^+$ is u.s.c. on this compact set, we can find $\delta>0$ small enough so that
Next, for $k>0$ , we define the function
For k large enough, we derive that $-\varphi^{\delta,\varepsilon,k}_t - \mathcal{L}\varphi^{\delta,\varepsilon,k} > 0 \ \mbox{on} \ \overline{B(T,\eta_0,\varepsilon)}$ . Moreover, in view of (4.6), we have
and $\varphi^{\delta,\varepsilon,k}(T,\eta) \geq v^+(T,\eta_0) \geq 0+ \varepsilon$ for $|\eta-\eta_0|\leq\varepsilon$ .
Now, we can find $\xi<\varepsilon$ and define the following function:
By following a similar argument to that used in Step (i), one can obtain that $v^{\delta,\varepsilon,k,\xi}\in\mathcal{V}^+$ , but $v^{\delta,\varepsilon,k,\xi}(T,\eta_0) = v^+(T,\eta_0)-\xi$ , which leads to a contradiction.
(iii) The supersolution property of $v^-$ .
We provide only a sketch of the proof, as it is essentially similar to that of Step (i). Suppose that $v^-$ is not a viscosity supersolution; then there exist some interior point $(\bar{t},\bar{\eta})\in(0,T)\times\mathbb{R}$ and a $C^{1,2}$ test function $\psi\,:\,[0,T]\times\mathbb{R}\rightarrow\mathbb{R}$ such that $v^- - \psi$ attains a strict local minimum that is equal to zero. As $F(\bar{t},\bar{\eta},v,v_{\bar{t}},v_{\bar{\eta}},v_{\bar{\eta}\bar{\eta}})<0$ , there are two separate cases to check.
Case (i): $v^-(\bar{t},\bar{\eta})-\widehat{V}(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta})<0$ . This already leads to a contradiction with $v^-(\bar{t},\bar{\eta})\geq\widehat{V}(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta})$ by the definition of $v^-$ .
Case (ii): $-\frac{\partial \psi}{\partial t}(\bar{t},\bar{\eta})-\mathcal{L}\psi(\bar{t},\bar{\eta}) <0$ . In this case we can find a ball $B(\bar{t},\bar{\eta},\varepsilon)$ small enough so that $-\frac{\partial \psi}{\partial t}-\mathcal{L}\psi<0$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}$ . Moreover, as $v^--\psi$ is l.s.c. and $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})$ is compact, there exists a $\delta>0$ such that $\psi+\delta\leq v^-$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})$ . We can then choose $\xi\in(0,\frac{\delta}{2})$ small such that $\psi^\xi=\psi+\xi$ satisfies three properties:
-
(i) $-\frac{\partial \psi^\xi}{\partial t}-\mathcal{L}\psi^\xi<0$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}$ ;
-
(ii) we have $v^-\geq\psi+\delta > \psi+\xi = \psi^\xi$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)$ ;
-
(iii) $\psi^\xi(\bar{t},\bar{\eta}) = \psi(\bar{t},\bar{\eta}) + \xi = v^-(\bar{t},\bar{\eta}) +\xi > v^-(\bar{t},\bar{\eta})$ .
Thus, we can define an auxiliary function by
By repeating an argument similar to that of Step (i), we obtain that $v^\xi\in \mathcal{V}^-$ by showing that $(v^\xi(s,\mu_s))_{t\leq s\leq T}$ is a submartingale. If $v^-$ has right-continuous paths, the proof is trivial. In general, by Proposition 1.3.14 in [Reference Karatzas and Shreve19], we can consider the right-continuous submartingale $G^+_u(\omega)\,:\!=\,\lim_{u^{\prime}\rightarrow u, \ u^{\prime}>u, \ u^{\prime}\in\mathbb{Q}}G_{u^{\prime}}(\omega), \ \omega\in\Omega^\ast, \ r\leq u\leq T$ , where $G_u\,:\!=\,v^-(u, \mu_u), \ r \leq u \leq s$ and we stop it at time t. Similarly to Step (i), we note that $G^+$ is a right-continuous submartingale and therefore $G_r\leq G_r^+, \ G_s=G_s^+$ . As $ G^+_u>\psi^\xi(u, \mu_u), \ \mbox{if} \ (u,\mu_u)\in B(\bar{t},\bar{\eta},\varepsilon)\backslash \overline{B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})}$ , we can define the process
We can conclude that $Y_r\leq Y_r^+, \ Y_s=Y_s^+$ , and Y is a submartingale such that $Y_r\leq Y_r^+\leq\mathbb{E}[Y_s^+|\mathcal{F}_r]=\mathbb{E}[Y_s|\mathcal{F}_r]$ , which completes the proof.
(iv) The terminal condition for $v^-$ .
For some $\eta_0\in\mathbb{R}$ , suppose that $v^-(T,\eta_0)<0$ ; we will derive a contradiction. As $\widehat{V}$ is continuous on $\mathbb{R}$ , we can choose an $\varepsilon>0$ such that $0 \geq v^-(T,\eta_0)+\varepsilon$ and $|\eta-\eta_0|\leq\varepsilon$ . Similarly to Step (ii), we can find $\delta>0$ small enough so that
Then, for $k>0$ , we consider
For k large enough, we have that $-\psi^{\delta,\varepsilon,k}_t - \mathcal{L}\psi^{\delta,\varepsilon,k} < 0 \ \mbox{on} \ \overline{B(T,\eta_0,\varepsilon)}$ . Furthermore, in view of (4.7), we have
and $\psi^{\delta,\varepsilon,k}(T,\eta) \leq v^-(T,\eta_0) \leq - \varepsilon$ for $|\eta-\eta_0|\leq\varepsilon$ .
Next, we can find $\xi<\varepsilon$ and define the function
Similarly to Step (iii), we obtain that $v^{\delta,\varepsilon,k,\xi}\in\mathcal{V}^-$ , but $v^{\delta,\varepsilon,k,\xi}(T,\eta_0) = v^-(T,\eta_0)+\xi$ , which gives a contradiction.
We now reverse the time and consider $s\,:\!=\,T-t$ . However, for simplicity of presentation, we continue to use t in place of s whenever there is no possibility of confusion. The variational inequalities can be written as
where
with the condition $\widetilde{V}(0,\eta)=0$ .
Let us write this equivalently as
where
We also have the continuation region as $\mathcal{C}=\{(t,\eta)\in(0,T]\times\mathbb{R}\,:\,\widetilde{V}(t,\eta;\,x_0-f(T-t), z_0)>\widehat{V}(t, x_0-f(T-t), z_0, \eta)\}$ .
Proposition 4.1. (Comparison principle) Let u,v respectively be a u.s.c. viscosity subsolution and an l.s.c. viscosity supersolution of (4.9). If $u(0,\eta)\leq v(0,\eta)$ on $\mathbb{R}$ , then we have $u\leq v$ on $(0,T]\times\mathbb{R}$ .
Proof. We will follow some arguments from [Reference Bayraktar and Zhang5, Reference Pham28], modifying them to fit our setting. Suppose that $u(0,\eta)\leq v(0,\eta)$ on $\mathbb{R}$ ; we will prove that $u\leq v$ on $[0,T]\times\mathbb{R}$ . We first construct the strict supersolution to the system (4.9) with suitable perturbations of v. Let us recall that $A\leq 0$ , $B\leq 0$ , and C is bounded above by some constant as in Remark A.1. Moreover, we know that $\widehat{V}(t,x_0-\kappa t,z_0,\eta)\leq 0$ . Let us fix a constant $C_2>0$ small enough so that $\lambda>C_2\sigma^2_{\mu}$ and set $\psi(t,\eta) = C_0e^{t}+e^{C_2\eta^2}$ with some $C_0>1$ . We have that
We can then choose $C_0>1$ large enough so that
which guarantees that
Let us define $v^\Lambda \,:\!=\, (1-\Lambda)v + \Lambda\psi$ on $[0,T]\times\mathbb{R}$ for any $\Lambda\in(0,1)$ . It follows that
where we used $v - \widehat{V} \geq 0$ in the last inequality. From (4.10) and (4.11), we can deduce that for $\Lambda\in(0,1)$ , $v^\Lambda$ is a supersolution to
To prove the comparison principle, it suffices to prove the claim that $\sup(u-v^\Lambda)\leq 0$ for all $\Lambda\in(0,1)$ , as the required result is then obtained by letting $\Lambda$ go to 0. To this end, we will suppose that there exists some $\Lambda\in(0,1)$ such that $M\,:\!=\,\sup(u-v^\Lambda)>0$ , and derive a contradiction.
It is clear that u, v, and $\widehat{V}$ have the same growth conditions: in view of the explicit forms of A, B, C, and $\widehat{V}$ , it follows that $\widehat{V}$ has growth condition in t as $e^{e^{K_1t}}$ for some $K_1<0$ and has growth condition in $\eta$ as $e^{K_2\eta^2}$ for some $K_2<0$ ; on the other hand, $\psi$ has growth condition in t as $e^{t}$ and has growth condition in $\eta$ as $e^{C_2\eta^2}$ . Thus, we have that $u(t,\eta)-v^\Lambda(t,\eta) = ( u - (1-\Lambda)v - \Lambda\psi)(t,\eta)$ goes to $-\infty$ as $t\rightarrow T,\eta\rightarrow\infty$ . Consequently, the u.s.c. function $(u-v^\Lambda)$ attains its maximum M.
Let us consider the u.s.c. function $\Phi_\varepsilon(t,t^{\prime},\eta,\eta^{\prime})=u(t,\eta)-v^\Lambda(t^{\prime},\eta^{\prime})-\phi_\varepsilon(t,t^{\prime},\eta,\eta^{\prime})$ , where $\phi_\varepsilon(t,t^{\prime},\eta,\eta^{\prime}) = \frac{1}{2\varepsilon}((t-t^{\prime})^2+(\eta-\eta^{\prime})^2)$ , $\varepsilon >0$ , and $(t_\varepsilon,t^{\prime}_\varepsilon,\eta_\varepsilon,\eta^{\prime}_\varepsilon)$ attains the maximum of $\Phi_\varepsilon$ . We have
Let us recall the equivalent definition of viscosity solutions in terms of superjets and subjets. In particular, we define $\bar{\mathcal{P}}^{2,+}u(\bar{t},\bar{\eta})$ as the set of elements $(\bar{q},\bar{k},\bar{M})\in\mathbb{R}\times\mathbb{R}\times\mathbb{R}$ satisfying $u(t,\eta)\leq u(\bar{t},\bar{\eta}) + \bar{q}(t-\bar{t})+ \bar{k}(\eta-\bar{\eta})+ \frac{1}{2}\bar{M}(\eta-\bar{\eta})^2 + o((t-\bar{t})+(\eta-\bar{\eta})^2)$ . We define $\bar{\mathcal{P}}^{2,-}v^\Lambda(\bar{t},\bar{\eta})$ similarly. Thanks to the Crandall–Ishii lemma, we can find $A_\varepsilon, B_\varepsilon\in\mathbb{R}$ such that
By combining the viscosity subsolution property (4.5) of u and the viscosity strict supersolution property (4.12) of $v^\Lambda$ , we have that
where $b(t_\varepsilon,\eta_\varepsilon)=-\lambda(\eta_\varepsilon-\bar{\mu})$ , $\sigma^2(\eta_\varepsilon)=\sigma_\mu^2$ , $b(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon)=-\lambda(\eta^{\prime}_\varepsilon-\bar{\mu})$ , and $\sigma^2(\eta^{\prime}_\varepsilon)=\sigma_\mu^2$ .
If $u-\widehat{V}\leq 0$ in (4.14), then because $v^\Lambda-\widehat{V}\geq \Lambda$ in (4.15), we obtain that $u-v^\Lambda\leq- \Lambda <0$ by contradiction with $\sup(u-v^\Lambda)=M>0$ . On the other hand, if $u-\widehat{V}>0$ in (4.14), then we have
Furthermore, combining two inequalities above, we derive that
The first inequality holds by the Crandall–Ishii lemma. In addition, by letting $\varepsilon\rightarrow 0$ , we get
thanks to (4.13). It follows that we have $0\geq \Lambda>0$ , which leads to a contradiction; therefore our claim holds.
Lemma 4.5. For all $(t,\eta)\in\mathcal{C}$ in the continuation region, $\widetilde{V}$ in (2.7) has Hölder continuous derivatives.
Proof. The proof closely follows the argument in Section 6.3 of [Reference Friedman16]. First, let us recall that
The fact that $\widetilde{V}$ is a viscosity solution to (4.8) gives that $\widetilde{V}$ is a supersolution to (4.16). On the other hand, for any $(\bar{t},\bar{\eta})\in\mathcal{C}$ , let $\varphi$ be a $C^2$ test function such that $(\bar{t},\bar{\eta})$ is a maximum of $\widetilde{V}-\varphi$ with $\widetilde{V}(\bar{t},\bar{\eta})=\varphi(\bar{t},\bar{\eta})$ . By definition of $\mathcal{C}$ , we have $\widetilde{V}(\bar{t},\bar{\eta}) > \widehat{V}(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta})$ , so that
owing to the property that $\widetilde{V}$ is a viscosity subsolution to (4.8). It follows that $\widetilde{V}$ is a viscosity subsolution and therefore a viscosity solution to (4.16).
Let us consider an initial boundary value problem:
Here, Q is an arbitrary bounded open region in $\mathcal{C}$ , Q lies in the strip $0<t<T$ , $\tilde{B}=\bar{Q}\cap\{t=0\}$ , $\tilde{B}_T=\bar{Q}\cap\{t=T\}$ , $B_T$ denotes the interior of $\tilde{B}_T$ , B denotes the interior of $\tilde{B}$ , $S_0$ denotes the boundary of Q lying in the strip $0\leq t\leq T$ , and $S=S_0\backslash B_T$ . Theorem 3.6 in [Reference Friedman16] gives the existence and uniqueness of a solution w on $Q\cup B_T$ to (4.17), and the solution w has Hölder continuous derivatives $w_t$ , $w_\eta$ , and $w_{\eta\eta}$ . Because the solution w is a viscosity solution to (4.16) on $Q\cup B_T$ , from standard uniqueness results on viscosity solutions, we know that $\widetilde{V}=w$ on $Q\cup B_T$ . As $Q\subset\mathcal{C}$ is arbitrary, it follows that $\widetilde{V}$ has the same property in the continuation region $\mathcal{C}$ . Therefore, $\widetilde{V}$ has Hölder continuous derivatives $\widetilde{V}_t$ , $\widetilde{V}_\eta$ , and $\widetilde{V}_{\eta\eta}$ .
Finally, we can prove Theorem 2.1.
Proof. We have proved the inequality $v^-=\sup_{p\in\mathcal{V}^-}p\leq \widetilde{V} \leq v^+=\inf_{q\in\mathcal{V}^+}q$ in Lemma 4.4. Using the comparison result in Proposition 4.1, we also have $v^+\leq v^-$ . Putting all the pieces together, we conclude that $v^+=\widetilde{V}(t,\eta)=v^-$ , and therefore the value function $\widetilde{V}(t,\eta)$ is the unique viscosity solution of the HJB variational inequality (2.9). Following an argument similar to that given for Theorem 1 in [Reference Duckworth and Zervos13], we fix the $\mathcal{F}_t$ -adapted stopping time $\tau^*$ defined in (2.11); the Itô–Tanaka formula (see Theorem IV.1.5 and Corollary IV.1.6 of [Reference Revuz and Yor32]) can be applied to $\widetilde{V}(t,\mu_t)$ in view of the Hölder continuous derivatives of $\widetilde{V}(t,\eta)$ , and we get that
where $\tau_n\uparrow T$ is the localizing sequence. As $\widetilde{V}(t,\eta)$ satisfies the HJB variational inequality (2.9), by taking conditional expectations and using the definition of $\tau^*$ in (2.11), we obtain that
By taking the limit of $\tau_n$ and using the dominated convergence theorem, we verify that
and therefore $\tau^*$ is the optimal entry time.
Finally, the martingale property between $t=0$ and $\tau^*$ follows from the definition of stochastic subsolutions and stochastic supersolutions.
Moreover, we can also easily verify the following sensitivity results for the composite value function.
Lemma 4.6. The value function $\widetilde{V}(t,\eta)$ has the following sensitivity properties:
-
(i) Suppose that $\alpha>0$ and $\delta>0$ are both constants in the definition of a habit formation process such that $\delta>\alpha$ . We have that $\widetilde{V}(t,\eta;\, \alpha, \delta)$ is decreasing in $\delta$ and increasing in $\alpha$ .
-
(ii) If the initial habit $z_0$ increases, the value function $\widetilde{V}(t,\eta)$ decreases.
-
(iii) If the information cost rate $\kappa$ increases, the value function $\widetilde{V}(t,\eta)$ decreases for any $t<T$ .
Proof. By the definition of $\widetilde{V}(t,\eta)$ and the explicit form of $\widehat{V}(t, x_0-\kappa t, z_0, \eta)$ in (3.7) and m(t) in (2.8), for given $\delta>\alpha$ , it is clear that $\widehat{V}(t, x_0-\kappa t, z_0, \eta)$ is decreasing in $\delta$ and increasing in $\alpha$ , which implies that $\widetilde{V}(t,\eta)$ has the same sensitivity property. Similarly, it is clear that $\widehat{V}(t, x_0-\kappa t, z_0, \eta)$ decreases as $z_0$ increases, and hence $\widetilde{V}(t,\eta)$ is decreasing in $z_0$ . Finally, $\widehat{V}(t, x_0-\kappa t, z_0, \eta)$ decreases if $x_0-\kappa t$ decreases; it readily follows that $\widetilde{V}(t,\eta)$ is decreasing in $\kappa$ .
Appendix A. Explicit solution to the auxiliary ODEs
Our ODE problems (3.4), (3.5), (3.6) are similar to ODEs for the terminal wealth optimization problem in [Reference Brendle8], in which the insightful observation is made that we can solve these ODEs with coefficients depending on time t by solving five auxiliary ODEs with constant coefficients; see Section 4 of [Reference Brendle8] for detailed discussions.
Lemma A.1. For $k\leq t\leq s\leq T$ , let us consider the following auxiliary ODEs for a $(t, s)$ , $b(t, s)$ , $l(t, s)$ , $w(t, s)$ , and $g(t, s)$ :
with the terminal conditions $a(s,s)=b(s,s)=l(s,s)=w(s,s)=g(s,s)=0$ . Direct substitutions and computations show that the solutions of the ODEs (3.4), (3.5), (3.6) are given respectively by
Following the same arguments as in [Reference Kim and Omberg21], we can actually solve the auxiliary ODEs (A.1), (A.2), (A.3), (A.4) and (A.5) explicitly, in the following order: we first solve the simple ODEs (A.1) and (A.4) to get a(t, s) and w(t, s), and then obtain b(t, s) and g(t, s) by solving the ODEs (A.2) and (A.5). Finally, we solve the ODE (A.3) to get l(t, s). We thus obtain
where
and
Moreover, it is straightforward to see that a, b, l, w, and g are globally bounded if we have that $\gamma_3>0$ , or $\gamma_1>0$ , or $\gamma_2<0$ .
Remark A.1 Under the assumption that $p<0$ , (A.7) clearly holds and we have $\gamma_2<0$ . We can see that $a(t,s)\leq 0$ and $b(t,s)\leq 0$ are bounded and that $1-2a(t,s)\hat{\Sigma}(t)>1$ and $1-w(t,s)\hat{\Sigma}(t)>1$ . From the expressions in (A.6), we can conclude that A(t, s), B(t, s), and C(t, s) are all bounded on $k\leq t\leq s\leq T$ , and that
and
for $k\leq t\leq s\leq T$ .
Appendix B. Proof of the verification theorem
We first show that the consumption constraint $c_t\geq Z_t$ implies the constraint on the controlled wealth process in the next lemma.
Lemma B.1. The admissible space $\mathcal{A}$ is non-empty if and only if the initial budget constraint $x\geq m(k)z$ is fulfilled. Moreover, for each pair $(\pi,c)\in\mathcal{A}$ , the controlled wealth process $\hat{X}^{\pi,c}_t$ satisfies the constraint
where the deterministic function $m(t)$ is defined in (2.8) and refers to the cost of subsistence consumption per unit of standard of living at time t.
Proof. Let us first assume that $x\geq m(k)z$ ; we can always take $\pi_t\equiv 0$ , and
for $t\in[k,T]$ . It is easy to verify that $\hat{X}^{\pi,c}_t\geq 0$ and $c_t\equiv Z_t$ , so that $(\pi,c)\in\mathcal{A}$ , and hence $\mathcal{A}$ is non-empty.
On the other hand, starting from $t=k$ with wealth x and standard of living z, the addictive habits constraint $c_t\geq Z_t$ , $k\leq t\leq T$ , implies that the consumption must always exceed the subsistence consumption $\bar{c}_t=Z(t;\,\bar{c}_t)$ which satisfies
Indeed, since $Z_t$ satisfies $dZ_t=(\delta_tc_t-\alpha_tZ_t)dt$ with $Z_k=z\geq 0$ , the constraint $c_t\geq Z_t$ implies that
By (B.2) and (B.3), one can get $d(Z_t-\bar{c}_t)\geq(\delta_t-\alpha_t)(Z_t-\bar{c}_t)dt$ and $Z_k-\bar{c}_k=0$ , from which we can derive that
It follows that $c_t\geq\bar{c}_t$ , which is equivalent to
Define the exponential local martingale
As $\hat{\mu}_t$ follows the dynamics (2.4), we derive that
Similarly to the proof of Corollary 3.5.14 and Corollary 3.5.16 in [Reference Karatzas and Shreve19], the Beneš condition implies that $\widetilde{H}$ is a true martingale with respect to $(\Omega,\mathcal{F}^S,\mathbb{P})$ .
Now, define the probability measure $\widetilde{\mathbb{P}}$ by $ \frac{d\widetilde{\mathbb{P}}}{d\mathbb{P}} = \widetilde{H}_T$ . Girsanov’s theorem states that
is a Brownian motion under $(\widetilde{\mathbb{P}},(\mathcal{F}^S_t)_{k\leq t\leq T})$ . We can rewrite the wealth process as
As we have $\hat{X}_T\geq 0$ , it is easy to see that $\int^t_k\pi_v\sigma_Sd\widetilde{W}_v$ is a supermartingale under $(\Omega,\mathbb{F}^S,\widetilde{\mathbb{P}})$ . By taking the expectation under $\widetilde{\mathbb{P}}$ , we have $x\geq\widetilde{\mathbb{E}}\left[\int^T_kc_vdv\right]$ . Thanks to the inequality (B.4), we further have $ x\geq z\widetilde{\mathbb{E}}\left[\int^T_k\exp\left(\int^v_k(\delta(u)-\alpha(u))du\right)dv\right]$ . Because $\delta(t)$ and $\alpha(t)$ are deterministic functions, we obtain that $x\geq m(k)z$ . In general, for any $t\in[k,T]$ , following the same procedure, we can take the conditional expectation under the filtration $\mathcal{F}^S_t$ and get
Again, as $\delta(t)$ , $\alpha(t)$ are deterministic, we get $\hat{X}_t\geq m(t)Z_t$ , $k\leq t\leq T$ .
We can finally prove Theorem 3.1 for the interior control problem.
Proof. For any pair of admissible controls $(\pi_t,c_t)\in\mathcal{A}$ , Itô’s lemma gives
where we define the process $\mathcal{G}^{\pi_t,c_t}V(t,\hat{X}_t,Z_t,\hat{\mu}_t)$ by
For any localizing sequence $\tau_n$ , by integrating (B.5) on $[k,\tau_n\wedge T]$ and taking the expectation, we have
Similarly to the argument in [Reference Janeček and Sîrbu17], let us consider a fixed pair of controls $(\pi_t,c_t)\in\mathcal{A}=\mathcal{A}_{x}$ , where we denote by $\mathcal{A}_{x}$ the admissible space with initial endowment x. For any $\epsilon>0$ , it is clear that $\mathcal{A}_{x}\subseteq\mathcal{A}_{x+\epsilon}$ and $(\pi_t,c_t)\in\mathcal{A}_{x+\epsilon}$ . Also, it is easy to see that $\hat{X}^{x+\epsilon}_t=\hat{X}^{x}_t+\epsilon=\hat{X}_t+\epsilon$ , $k\leq t\leq T$ . As the process $Z_t$ is defined using this consumption policy $c_t$ , under the probability measure $\mathbb{P}_{x,z,\eta}$ , we obtain
The monotone convergence theorem first leads to
For simplicity, let us write $Y_t=\Big(\hat{X}_t-m(t)Z_t\Big)$ . The definition (3.7) implies that
Lemma B.1 gives $\hat{X}_t\geq m(t)Z_t$ for $k\leq t\leq T$ under any admissible control $(\pi_t,c_t)$ , so we get that $Y_{\tau_n\wedge T}+\epsilon\geq\epsilon>0$ for all $k\leq t\leq T$ . As $p<0$ , it follows that
Remark A.1 gives that $A(t,s)\leq 0$ , B(t, s), and C(t, s) are all bounded on $k\leq t\leq s\leq T$ . Also, m(s) and $\delta(s)$ are continuous functions and hence bounded on [k, T]. Hence $N(k,\eta)\leq k_1\exp(k_2\eta)$ , for some constants $k_2, k_1>1$ . It follows that there exist some constants $\bar{k}_2,\bar{k}_1>1$ such that
The process $\hat{\mu}_t$ satisfies (2.4), which leads to
Hence, there exist positive constants l and $l_1>1$ large enough so that
Using the distribution of the running maximum of the Brownian motion, there exist some positive constants $\bar{l}>1$ and $\bar{l}_1$ such that
Finally, by (B.8) and (B.9), we can conclude that
The dominated convergence theorem and $N(T,\hat{\mu_T}) = 0$ imply that
Combining this with (B.7) and $(\pi_t,c_t)\in\mathcal{A}$ , we have that
Note that $V(t,x,z,\eta;\,\theta)$ is continuous in the variable x. By letting $\epsilon\to 0$ , we deduce that
On the other hand, for $\pi^*_t$ and $c^*_t$ given in (3.8) and (3.9), we first need to show that the SDE
with initial condition $x>m(k)z$ admits a unique strong solution that satisfies the constraint $\hat{X}^*_t>m(t)Z^*_t$ for all $k\leq t\leq T$ . Let $Y^*_t=\hat{X}^*_t-m(t)Z^*_t$ . By Itô’s lemma and substitution of $c^*_t$ using (3.9), we obtain that
In order to solve for $X^*_t$ explicitly, we define the auxiliary process $\Gamma_t \,:\!=\, \frac{N(t,\hat{\mu}_t)}{Y^*_t}$ , for $k\leq t\leq T$ . Itô’s lemma gives that
As $N(t,\eta)$ satisfies the linear PDE (3.3), (B.11) is reduced to
the existence of a unique strong solution is thus verified, and $\Gamma_k=\frac{N(k,\eta)}{x-m(k)z}>0$ implies that $\Gamma_t>0$ , $\forall k\leq t\leq T$ . Therefore, it holds that the SDE (B.10) admits a unique strong solution as defined in (3.10), and the solution $\hat{X}^*_t$ satisfies the constraint (B.1).
Next, we verify that the pair $(\pi^*_t,c^*_t)$ is indeed in the admissible space $\mathcal{A}$ . First, by the definitions in (3.8) and (3.9), it is clear that $\pi^*_t$ and $c^*_t$ are $\mathcal{F}^S_t$ -progressively measurable, and by the path continuity of $Y^*_t=\hat{X}^*_t-m(t)Z^*_t$ and of $\pi^*_t$ and $c^*_t$ , it is easy to show that $\int^T_k(\pi^*_t)^2dt<+\infty$ and $\int^T_kc^*_tdt<+\infty$ , a.s. Also, because $\hat{X}^*_t>m(t)Z^*_t$ for all $t\in[k,T]$ , by the definition of $c^*_t$ , the consumption constraint $c^*_t>Z^*_t$ for all $t\in[k,T]$ is satisfied. It follows that $(\pi^*_t,c^*_t)\in\mathcal{A}$ .
Given $(\pi^*_t,c^*_t)$ as above, instead of (B.6), the following equality is proved:
The monotone convergence theorem gives
Moreover, as we have $V(t,x,z,\eta)<0$ by $p<0$ , Fatou’s lemma implies that
It follows that
which completes the proof.
Funding information
Y. Yang and X. Yu are supported by the Hong Kong Polytechnic University under research grant no. P0031417.
Competing interests
There were no competing interests to declare which arose during the preparation or publication process for this article.