Optimal entry and consumption under habit formation

Yue Yang; Xiang Yu

doi:10.1017/apr.2021.37

Published online by Cambridge University Press: 10 March 2022

Yue Yang*, The Hong Kong Polytechnic University
Xiang Yu*, The Hong Kong Polytechnic University
*Postal address: Department of Applied Mathematics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong.


Abstract

This paper studies a composite problem involving decision-making about the optimal entry time and dynamic consumption afterwards. In Stage 1, the investor has access to full market information subject to some information costs and needs to choose an optimal stopping time to initiate Stage 2; in Stage 2, the investor terminates the costly full information acquisition and starts dynamic investment and consumption under partial observation of free public stock prices. Habit formation preferences are employed, in which past consumption affects the investor’s current decisions. Using the stochastic Perron method, the value function of the composite problem is proved to be the unique viscosity solution of some variational inequalities.

Keywords

Optimal entry problem; consumption habit formation; stochastic Perron method; viscosity solution

MSC classification

Primary: 91G10: Portfolio theory; 49L25: Viscosity solutions

Secondary: 93E11: Filtering; 93E20: Optimal stochastic control; 49L20: Dynamic programming method

Type: Original Article
Information: Advances in Applied Probability, Volume 54, Issue 2, June 2022, pp. 433-459

DOI: https://doi.org/10.1017/apr.2021.37
Copyright: © The Author(s), 2022. Published by Cambridge University Press on behalf of Applied Probability Trust

1. Introduction

We consider a simple model to incorporate information costs in a continuous-time portfolio-consumption problem. In particular, we study a two-stage composite problem under complete and incomplete filtrations sequentially. The drift process of the stock price is assumed to be of Ornstein–Uhlenbeck type. In the first stage from the initial time, the investor needs to pay information costs to access the full market information generated by both drift and stock price processes, in order to update their dynamic distributions and decide the optimal time to enter the second stage. The information costs may include search costs, storage costs, communication costs, the cost of the investor’s attention, or other service costs. In the present paper we consider a simple linear information cost, which is modeled by a constant cost rate and will be subtracted directly from the amount of the investor’s wealth. That is, the longer the first stage is, the higher the information costs the investor needs to be able to afford. Some previous works have addressed the impact of information costs on optimal investment from different perspectives; see [Reference Kang and Stulz18], [Reference Portes and Rey29], [Reference Ahearne, Griever and Warnock1], and [Reference Keppo, Tan and Zhou20]. In our first stage, the mathematical problem becomes an optimal stopping problem under the complete market information filtration. The second stage starts from the chosen entry time, when the investor terminates the full observation of the drift process. From this point on, the investor instead dynamically chooses his investment and consumption levels based on the prior data inputs and free partial observation of the stock price, which can be formulated as an optimal control problem under an incomplete information filtration. 
As the value function of the interior control problem depends on the stopping time and data inputs of the drift process, the exterior problem can be interpreted as that of waiting in an optimal way so that the input values can achieve the maximum of the interior functional.

Portfolio optimization under partial observation has been extensively studied in past decades; a few examples with different financial motivations can be seen in [Reference Lakner22, Reference Xia34, Reference Brendle8, Reference Monoyios25, Reference Björk, Davis and Landén6, Reference Bo, Liao and Yu7]. As illustrated in these works, the value function under the incomplete information filtration is strictly lower than its counterpart under the full information filtration, and this gap is usually regarded as the value of information. The present paper attempts to study partial observation from a different perspective, where the full market information is available but costly because more data, services, and personal attention are involved. The information costs may change the investor’s attitude towards the usage of full observation, because it is no longer true that the more information he observes, the higher profit he can attain. Moreover, from previous work on partial observation, we know that the value function eventually depends on the given initial input of the random factor such as the drift process. As in [Reference Lakner22, Reference Brendle8], it is conventionally assumed that the initial data of the unobservable drift is a Gaussian random variable, so that Kalman–Bucy filtering can be applied. We take this input into account and consider a model in which the investor can wait and dynamically update the distribution of inputs using the full market information, subject to information costs. We show that starting to invest and consume under incomplete information immediately at the initial time may not be optimal.

On the other hand, in recent years, habit formation has provided a new paradigm for modeling consumption rate preferences, which better matches some empirical observations; see [Reference Constantinides11, Reference Mehra and Prescott24]. The literature suggests that past consumption patterns may have a continuing impact on an individual’s current consumption decisions. In particular, the use of linear habit formation preferences, in which there exists an index term that stands for the accumulated consumption history, has been widely accepted. Habit formation preferences have been well studied by [Reference Detemple and Zapatero12, Reference Englezos and Karatzas14, Reference Munk26] in complete market models and by [Reference Yu35, Reference Yu36] in incomplete market models. It is noted that the utility function is decreasing in the habit level. In the present paper, we assume that there is no consumption during Stage 1, and the investor starts to gain consumption habits only in Stage 2. Therefore, an early entry time to Stage 2 may not be the optimal decision, because the investor has a longer time to develop a much higher habit level. This is our second motivation for investigating the exterior optimal entry time problem: to see whether waiting longer can benefit the investor more, as the resulting habit level may be much lower and may lead to a higher value function.

We show that the value function of the composite problem is the unique viscosity solution to some variational inequalities. To this end, we can choose to apply either the classical Perron method or the stochastic version of the Perron method introduced in [Reference Bayraktar and Sirbu2]. For the classical Perron method, to establish the equivalence between the value function and the viscosity solution, we have to either prove the dynamic programming principle or upgrade the global regularity of the solution and prove the verification theorem. The convexity (concavity) of the value function with respect to the state variable is usually crucial in standard arguments to conclude global regularity. However, this property is not clear in our composite problem; see Remark 4.1 for details. The global regularity of the value function is not guaranteed, and so the direct verification proof for our exterior problem becomes difficult. Therefore, we instead choose the stochastic Perron method, which allows us to show the equivalence between the value function and the viscosity solution without global regularity. For some related works on optimal stopping using viscosity solutions, we refer to [Reference Reikvam31] and [Reference Pham27]. Recent works on stochastic control problems using the stochastic Perron method include [Reference Bayraktar and Sirbu2, Reference Bayraktar and Sirbu3, Reference Bayraktar and Sirbu4, Reference Bayraktar and Zhang5, Reference Sirbu33, Reference Lee, Yu and Zhou23]. One important step in completing the argument for the stochastic Perron method is the comparison principle for the associated variational inequalities, which is established in the present paper.

The rest of the paper is organized as follows: Section 2 introduces the market model and the habit formation preferences and formulates the two-stage optimization problem. Section 3 gives the main result on the interior utility maximization problem with habit formation and partial observation. Section 4 studies the exterior optimal entry problem with linear information costs. Using the stochastic Perron method, we show that the value function of the composite problem is the unique viscosity solution of some variational inequalities. Some auxiliary results and proofs are reported in Appendices A and B.

2. Mathematical model and preliminaries

2.1. Market model

Given the probability space $(\Omega,\mathbb{F},\mathbb{P})$ with full information filtration $\mathbb{F}=(\mathcal{F}_{t})_{0\leq t\leq T}$ that satisfies the usual conditions, we consider the market with one risk-free bond and one risky asset over a finite time horizon [0, T]. It is assumed that the bond process satisfies $S^{0}_{t}\equiv1$ , for $t\in[0,T]$ , which amounts to the standard change of numéraire.

The stock price $S_{t}$ satisfies

(2.1)

\begin{align} dS_{t}=\mu_{t}S_{t}dt+\sigma_{S}S_{t}dW_{t}, \ \ \ 0\leq t\leq T,\end{align}

with $S_{0}=s>0$. Some empirical studies, such as [Reference Brennan and Xia9, Reference Campbell10, Reference Fama and French15, Reference Poterba and Summers30], have observed that the drift process of many risky assets follows a mean-reverting diffusion. Accordingly, we assume that the drift process $\mu_t$ in (2.1) satisfies the Ornstein–Uhlenbeck stochastic differential equation (SDE)

(2.2)

\begin{align} d\mu_{t}=-\lambda(\mu_{t}-\bar{\mu})dt+\sigma_{\mu}dB_{t}, \ \ \ 0\leq t\leq T.\end{align}

Here, $(W_{t})_{0\leq t\leq T}$ and $(B_{t})_{0\leq t\leq T}$ are $\mathcal{F}_{t}$ -adapted Brownian motions with correlation coefficient $\rho\in[-1,1]$ . For simplicity, the initial value $\mu_0$ of the drift is a given constant. We assume that the market coefficients $\sigma_{S}$ , $\lambda$ , $\bar{\mu}$ , and $\sigma_{\mu}$ are given nonnegative constants based on calibrations from historical data.
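To make the dynamics (2.1) and (2.2) concrete, the following is a minimal simulation sketch, not part of the original analysis. It assumes a uniform time grid, an Euler step for the drift $\mu_t$, and a log-Euler step for $S_t$ with the drift frozen over each step; the function name `simulate_market` and its arguments are hypothetical.

```python
import math
import random

def simulate_market(mu0, s0, lam, mu_bar, sigma_mu, sigma_s, rho, T, n_steps, seed=0):
    """Simulate one path of the drift (2.2) and the stock price (2.1)."""
    rng = random.Random(seed)
    dt = T / n_steps
    mu, s = [mu0], [s0]
    for _ in range(n_steps):
        # Correlated Brownian increments: dB = rho*dW + sqrt(1 - rho^2)*dW_perp.
        dw = rng.gauss(0.0, math.sqrt(dt))
        dw_perp = rng.gauss(0.0, math.sqrt(dt))
        db = rho * dw + math.sqrt(1.0 - rho * rho) * dw_perp
        m = mu[-1]
        # Log-Euler step for S (drift frozen over the step) keeps the price positive.
        s.append(s[-1] * math.exp((m - 0.5 * sigma_s ** 2) * dt + sigma_s * dw))
        # Euler step for the Ornstein-Uhlenbeck drift.
        mu.append(m - lam * (m - mu_bar) * dt + sigma_mu * db)
    return mu, s
```

With $\sigma_\mu=0$ the simulated drift reduces to the deterministic recursion $\mu_{n+1}=\bar{\mu}+(\mu_n-\bar{\mu})(1-\lambda\,\Delta t)$, which gives a quick sanity check of the discretization.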

It is assumed that the investor starts with initial wealth $x(0)=x_0>0$ at time $t=0$ . Also, starting from the initial time $t=0$ , access to the full market information $\mathcal{F}_t$ generated by W and B incurs information costs $\kappa t$ , where $\kappa>0$ is the constant cost rate per unit time. As stated earlier, the information costs may include storage costs, search costs, communication costs, the cost of the investor’s attention, or other service costs involved in fully observing the market information $\mathcal{F}_t$ . Moreover, to simplify the mathematical problem, it is assumed that from $t=0$ up to a chosen stopping time $\tau$ , the investor purely waits and updates the dynamic distributions of the processes $\mu_t$ and $S_t$ ; he does not invest or consume at all. This assumption makes sense as long as the value of the optimal entry time $\tau$ is short in the model. The dynamic wealth process including the information costs at time t is simply given by a deterministic function $x(t)=x_0-\kappa t$ for any $t\leq \tau$ .

As the full market information filtration is costly, the investor needs to choose optimally an $\mathcal{F}_{t}$-adapted stopping time $\tau$ to terminate full information acquisition and enter the second stage. From the chosen stopping time $\tau$, he switches to the partial observation filtration $\mathcal{F}_t^S=\mathcal{F}_{\tau}\bigvee\sigma(S_u\,:\,\tau\leq u\leq t)$ for $\tau\leq t\leq T$, which is the smallest sigma-algebra containing both $\mathcal{F}_{\tau}$ and the information generated by the stock price $S$ over $[\tau,t]$. Moreover, for any time $\tau\leq t\leq T$, the investor chooses a dynamic consumption rate $c_{t}\geq 0$ and decides the amount $\pi_{t}$ of his wealth to invest in the risky asset (the rest is invested in the bond). Without paying information costs, he can no longer observe the drift process $\mu_{t}$ and Brownian motions $W_{t}$ and $B_{t}$ for $t\geq \tau$. Therefore, the investment–consumption pair $(\pi_t, c_t)$ is only assumed to be adapted to the partial observation filtration $\mathcal{F}_t^S$ for $\tau\leq t\leq T$. Recall that at the entry time $\tau$, the investor only has wealth $x(\tau)=x_0-\kappa \tau$ left. Under the incomplete filtration $\mathcal{F}_t^S$, the investor’s total wealth process $\hat{X}_{t}$ can be written as

(2.3)

\begin{align} d\hat{X}_{t}=(\pi_{t}\mu_{t}-c_{t})dt+\sigma_{S}\pi_{t}dW_{t}, \ \ \ \tau\leq t\leq T, \end{align}

with the initial value $\hat{X}_{\tau}=x(\tau)=x_{0}-\kappa \tau>0$ . Note that $W_t$ is no longer a Brownian motion under the partial observation filtration $\mathcal{F}_t^S$ ; we have to apply the Kalman–Bucy filtering and consider the innovation process defined by

\begin{align*} d\hat{W}_{t}\,:\!=\,\frac{1}{\sigma_{S}}\big[\big(\mu_{t}-\hat{\mu}_{t}\big)dt+\sigma_{S}dW_{t}\big]=\frac{1}{\sigma_{S}}\bigg(\frac{dS_{t}}{S_{t}}-\hat{\mu}_{t}dt\bigg), \ \ \ \tau\leq t\leq T, \end{align*}

which is a Brownian motion under $\mathcal{F}^{S}_{t}$ . The best estimate of the unobservable drift process $\mu_t$ under $\mathcal{F}_t^S$ is the conditional expectation process $\hat{\mu}_{t}=\mathbb{E}\big[\mu_{t}\big{|}\mathcal{F}^{S}_{t}\big]$ , for $\tau\leq t\leq T$ with the initial input $\hat{\mu}_{\tau}=\mu_{\tau}$ , $\mathbb{P}$ -almost surely (a.s.), at the stopping time $\tau$ where the distribution of $\mu_{\tau}$ is determined via (2.2) by paying information costs up to $\tau$ . By standard Kalman–Bucy filtering (see Equation (18) of [Reference Brendle8] or Equation (21) of [Reference Monoyios25]), $\hat{\mu}_t$ satisfies the SDE

(2.4)

\begin{align} d\hat{\mu}_{t}=-\lambda\big(\hat{\mu}_{t}-\bar{\mu}\big)dt+\left(\frac{\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho}{\sigma_{S}}\right)d\hat{W}_{t}, \ \ \ \tau\leq t\leq T, \end{align}

with $\hat{\mu}_{\tau}=\mu_{\tau}$ , $\mathbb{P}$ -a.s. Moreover, the conditional variance $\hat{\Sigma}(t)=\mathbb{E}\big[\big(\mu_{t}-\hat{\mu}_{t}\big)^2\big{|}\mathcal{F}^{S}_{t}\big]$ satisfies the deterministic Riccati ordinary differential equation (ODE) (see Equation (19) of [Reference Brendle8] or Equation (23) of [Reference Monoyios25])

(2.5)

\begin{align} \frac{d\hat{\Sigma}(t)}{dt}=-\frac{1}{\sigma_{S}^2}\hat{\Sigma}^2(t)+\bigg(-\frac{2\sigma_{\mu}\rho}{\sigma_{S}}-2\lambda\bigg)\hat{\Sigma}(t)+\big(1-\rho^2\big)\sigma_{\mu}^2, \ \ \ \tau\leq t\leq T, \end{align}

with the initial value $\hat{\Sigma}(\tau)= \mathbb{E}\big[\big(\mu_{\tau}-\hat{\mu}_{\tau}\big)^2\big{|}\mathcal{F}^{S}_{\tau}\big] = 0$, given that $\hat{\mu}_{\tau} = \mu_{\tau}$, $\mathbb{P}$-a.s. The solution is given explicitly by

\begin{align*} \hat{\Sigma}(t)=\sqrt{k}\sigma_{S}\frac{k_{1}\exp\Big(2\Big(\frac{\sqrt{k}}{\sigma_{S}}\Big)t\Big)+k_{2}}{k_{1}\exp\Big(2\Big(\frac{\sqrt{k}}{\sigma_{S}}\Big)t\Big)-k_{2}}-\left(\lambda+\frac{\sigma_{\mu}\rho}{\sigma_{S}}\right)\sigma_{S}^2, \ \ \ \tau\leq t\leq T, \end{align*}

where $k=\lambda^2\sigma_{S}^2+2\sigma_{S}\sigma_{\mu}\lambda\rho+\sigma_{\mu}^2$ , $k_{1}=\sqrt{k}\sigma_{S}+\Big(\lambda\sigma_{S}^2+\sigma_{S}\sigma_{\mu}\rho\Big)$ , and $k_{2}=-\sqrt{k}\sigma_{S}+\Big(\lambda\sigma_{S}^2+\sigma_{S}\sigma_{\mu}\rho\Big)$ .
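As a sanity check on the closed-form expression for $\hat{\Sigma}(t)$, the sketch below compares it with a fourth-order Runge-Kutta integration of the Riccati ODE (2.5) started from zero. It is illustrative only and rests on one assumption: the time argument of the closed form is read as the time elapsed since entry, $u=t-\tau$ (equivalently, $\tau=0$), so that the formula vanishes at $u=0$. All function names are hypothetical.

```python
import math

def sigma_hat_closed_form(u, lam, sigma_mu, sigma_s, rho):
    """Closed-form conditional variance; u is the time elapsed since entry (assumption)."""
    k = lam ** 2 * sigma_s ** 2 + 2.0 * sigma_s * sigma_mu * lam * rho + sigma_mu ** 2
    a = lam * sigma_s ** 2 + sigma_s * sigma_mu * rho   # equals (lam + sigma_mu*rho/sigma_s)*sigma_s^2
    root = math.sqrt(k) * sigma_s
    k1, k2 = root + a, -root + a
    e = math.exp(2.0 * math.sqrt(k) / sigma_s * u)
    return root * (k1 * e + k2) / (k1 * e - k2) - a

def riccati_rhs(sig, lam, sigma_mu, sigma_s, rho):
    """Right-hand side of the Riccati ODE (2.5)."""
    return (-sig ** 2 / sigma_s ** 2
            - 2.0 * (sigma_mu * rho / sigma_s + lam) * sig
            + (1.0 - rho ** 2) * sigma_mu ** 2)

def sigma_hat_rk4(u_end, n_steps, lam, sigma_mu, sigma_s, rho):
    """Integrate (2.5) from Sigma_hat = 0 over an elapsed time u_end with classical RK4."""
    f = lambda x: riccati_rhs(x, lam, sigma_mu, sigma_s, rho)
    h, sig = u_end / n_steps, 0.0
    for _ in range(n_steps):
        k1 = f(sig)
        k2 = f(sig + 0.5 * h * k1)
        k3 = f(sig + 0.5 * h * k2)
        k4 = f(sig + h * k3)
        sig += h / 6.0 * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return sig
```

Under this reading, the two functions should agree up to the integration error for any admissible parameter set, and the closed form should return zero at `u = 0`.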

For the second-stage dynamic control problem, we employ habit formation preferences. In particular, we define $Z_{t}\,:\!=\, Z(c_{t})$ as the habit formation process or the standard of living process, which describes the level of the investor’s consumption habits. It is conventionally assumed that the cumulative preference $Z_{t}$ satisfies the recursive equation $dZ_{t}=(\delta(t)c_{t}-\alpha(t)Z_{t})dt$ , $\tau\leq t\leq T$ (see [Reference Detemple and Zapatero12]), where $Z_{\tau}=z_{0}\geq 0$ is called the initial consumption habit of the investor. Equivalently, we have

\begin{align*} Z_{t}=z_{0}e^{-\int^{t}_{\tau}\alpha(u)du}+\int^{t}_{\tau}\delta(u)e^{-\int^{t}_{u}\alpha(s)ds}c_{u}du, \ \ \ \tau\leq t\leq T, \end{align*}

which is the exponentially weighted average of the initial habit and the past consumption. Here, the deterministic discount factors $\alpha(t)\geq 0$ and $\delta(t)\geq 0$ measure, respectively, the persistence of the past level and the intensity of the consumption history. In the present paper we are interested in addictive habits; namely, we require that the investor’s current consumption strategies never fall below the level of standard of living, i.e. $c_{t}\geq Z_{t}$ a.s., for $\tau\leq t\leq T$ .
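The equivalence between the recursive equation for $Z_t$ and its integrated form can be checked numerically. The sketch below is a minimal illustration assuming constant coefficients $\alpha(t)\equiv\alpha>0$ and $\delta(t)\equiv\delta$ and a constant consumption rate $c$, in which case the integral formula reduces to $Z_t=z_0e^{-\alpha u}+\delta c\,(1-e^{-\alpha u})/\alpha$ with $u=t-\tau$. The function names are hypothetical.

```python
import math

def habit_euler(z0, consumption, alpha, delta, dt):
    """Euler discretization of dZ = (delta*c - alpha*Z) dt along a consumption path."""
    z = [z0]
    for c in consumption:
        z.append(z[-1] + (delta * c - alpha * z[-1]) * dt)
    return z

def habit_constant_consumption(z0, c, alpha, delta, u):
    """Integrated form of Z after elapsed time u, for constant alpha > 0, delta and c."""
    decay = math.exp(-alpha * u)
    return z0 * decay + delta * c * (1.0 - decay) / alpha
```

For example, with $z_0=1$, $c=2$, $\alpha=0.3$, and $\delta=0.2$, the habit level rises towards $\delta c/\alpha\approx 1.33$ and stays below $c$, so the addictive constraint $c_t\geq Z_t$ holds along the whole path.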

Under the partial observation filtration $(\mathcal{F}^{S}_{t})_{\tau\leq t\leq T}$ , the stock price dynamics (2.1) can be rewritten as $dS_{t}=\hat{\mu}_{t}S_{t}dt+\sigma_{S}S_{t}d\hat{W}_{t}$ , and the wealth dynamics (2.3) can be rewritten as $d\hat{X}_{t}=(\pi_{t}\hat{\mu}_{t}-c_{t})dt+\sigma_{S}\pi_{t}d\hat{W}_{t}$ , $\tau\leq t\leq T$ . To facilitate the formulation of the stochastic control problem and the derivation of the dynamic programming equation, for any $t\in [0,T]$ , we denote by $\mathcal{A}_{t}(x)$ the time-modulated admissible set of the pair of investment and consumption processes $(\pi_{s},c_{s})_{t\leq s\leq T}$ with the initial wealth $\hat{X}_t=x$ , which is $\mathcal{F}^{S}_{s}$ -progressively measurable and satisfies the integrability conditions $\int^{T}_{t}\pi_{s}^2ds<+\infty$ , $\mathbb{P}$ -a.s., and $\int^{T}_{t}c_{s}ds<+\infty$ , $\mathbb{P}$ -a.s., with the addictive habit formation constraint that $c_{s}\geq Z_{s}$ , $\mathbb{P}$ -a.s., $t\leq s\leq T$ . Moreover, no bankruptcy is allowed, i.e., the investor’s wealth remains nonnegative: $\hat{X}_{s}\geq 0$ , $\mathbb{P}$ -a.s., $t\leq s\leq T$ .
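The rewritten dynamics suggest how an investor who only sees stock returns could update the filtered drift and the wealth in discrete time. The following sketch is a plain Euler step under that reading; it is an illustration rather than the scheme used in the paper, and the function name `filtered_step` and its arguments are hypothetical. The input `ret` stands for the observed return $dS_t/S_t$ over a step of length `dt`, and `sigma_hat` for the current value of $\hat{\Sigma}(t)$.

```python
def filtered_step(mu_hat, wealth, ret, pi, c, sigma_hat, dt,
                  lam, mu_bar, sigma_mu, sigma_s, rho):
    """One Euler step of the filter (2.4) and of the wealth under partial observation."""
    # Innovation increment: d W_hat = (dS/S - mu_hat dt) / sigma_S.
    d_w_hat = (ret - mu_hat * dt) / sigma_s
    # Diffusion coefficient of the filtered drift in (2.4).
    gain = (sigma_hat + sigma_s * sigma_mu * rho) / sigma_s
    mu_next = mu_hat - lam * (mu_hat - mu_bar) * dt + gain * d_w_hat
    # Wealth dynamics driven by the innovation process.
    wealth_next = wealth + (pi * mu_hat - c) * dt + sigma_s * pi * d_w_hat
    return mu_next, wealth_next
```

Substituting the innovation increment shows that the wealth update equals `wealth + pi * ret - c * dt`, so the step uses only observable quantities.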

2.2. Problem formulation

The two-stage optimal decision-making problem is formulated as a composite problem involving the optimal stopping and the stochastic control afterwards, which is defined by

(2.6)

\begin{align} \begin{aligned} \widetilde{V}(0,\mu_0;\,x_0, z_0) &\,:\!=\,\sup_{\tau\geq 0}\mathbb{E}\left[ \underset{(\pi,c)\in\mathcal{A}_{\tau}(x_0-\kappa\tau)}{\textrm{esssup}}\mathbb{E}\left[\int^T_\tau \frac{(c_{s}-Z_{s})^p}{p}ds\bigg{|}\mathcal{F}_\tau^{S}\right]\right]. \end{aligned} \end{align}

In particular, starting from the chosen stopping time $\tau$ , we are interested in utility maximization on consumption with habit formation, in which the power utility function $U(x)=x^p/p$ is defined on the difference $c_t-Z_t$ . To simplify the presentation, in the present paper we only consider the case of a risk aversion coefficient $p<0$ . The indirect utility process of the interior control problem is denoted by

\begin{align*} &\widehat{V}\big(t,x_0-\kappa t, z_0, \mu_t;\, 0\big) \,:\!=\, \underset{(\pi,c)\in\mathcal {A}_{t}(x_0-\kappa t)}{\textrm{esssup}}\mathbb{E}\left[\int^T_t \frac{(c_{s}-Z_{s})^p}{p}ds\bigg{|}\mathcal{F}_t^{S}\right]\\ =& \underset{(\pi,c)\in\mathcal {A}_{t}(x_0-\kappa t)}{\textrm{esssup}}\mathbb{E}\left[\int^T_t \frac{(c_{s}-Z_{s})^p}{p}ds\bigg{|}\hat{X}_{t}=x_0-\kappa t,\hat{\mu}_{t}=\mu_{t},Z_{t}=z_0;\,\hat{\Sigma}(t)=0\right]. \end{align*}

To determine the exterior optimal stopping time, we need to maximize over the inputs of the values $\tau$ , $\hat{X}_{\tau}$ , $Z_{\tau}$ , and $\hat{\mu}_{\tau}$ . Recall that the investor does not manage his investment and consumption before $\tau$ ; it follows that $\hat{X}_{\tau}=x_0-\kappa\tau$ , $Z_{\tau}=z_0$ , and $\hat{\Sigma}(\tau)=0$ can all be taken as parameters instead of variables. That is, $\mu_{\tau}=\hat{\mu}_{\tau}$ is the only random input, and we can regard $\mu_t$ as the only underlying state process. Therefore, the dynamic counterpart of (2.6) is defined by

(2.7)

\begin{align} \widetilde{V}(t,\eta;\,x_0-\kappa t,z_0)\,:\!=\, \underset{\tau\geq t}{\textrm{esssup}} \ \mathbb{E}\left[\underset{(\pi,c)\in\mathcal{A}_{\tau}(x_0-\kappa\tau)}{\textrm{esssup}}\ \mathbb{E}\left[\int^T_\tau \frac{(c_{s}-Z_{s})^p}{p}ds\bigg{|}\mathcal{F}_\tau^{S}\right]\Bigg{|}\mu_t=\eta\right]. \end{align}

Remark 2.1. We focus on the case $p<0$ in the present paper because then the functions A(t, s), B(t, s), and C(t, s) introduced later, which are solutions to the ODEs (3.4), (3.5), and (3.6), are all bounded, and the utility U(x) is also bounded from above, which significantly simplifies the proof of the verification result in Theorem 3.1 and the comparison results in Proposition 4.1. The other case, $0<p<1$ , can essentially be handled in a similar way. However, in that case, as the process $\hat{\mu}_t$ in (2.4) is unbounded and the functions A(t, s), B(t, s), and C(t, s) may explode at some $t\in [0,T]$ , one needs some additional parameter assumptions to guarantee integrability conditions and martingale properties in the proofs of some of the main results.

Assumption 2.1. In accordance with Remark 3.1 for the interior control problem, it is assumed from this point onwards that $x_0-\kappa t>z_0 m(t)$ for any $0\leq t\leq T$ ; i.e. the initial wealth is sufficiently large, after paying information costs, so that the interior control problem is well defined for any $0\leq t\leq T$ , where m(t) is defined by

(2.8)

\begin{align} m(t)=\int^T_t\exp\left(\int^s_t(\delta(v)-\alpha(v))dv\right)ds, \ \ 0\leq t\leq T. \end{align}

We note that m(t) in (2.8) represents the cost of subsistence consumption per unit of standard of living at time t, because the interior control problem is solvable if and only if $\hat{X}_t^{*}\geq m(t)Z_t$ , $0\leq t\leq T$ ; see Lemma B.1.
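For constant coefficients, (2.8) has a closed form, which makes Assumption 2.1 easy to check on a time grid. The sketch below assumes $\alpha(t)\equiv\alpha$ and $\delta(t)\equiv\delta$, so that $m(t)=\big(e^{(\delta-\alpha)(T-t)}-1\big)/(\delta-\alpha)$ for $\delta\neq\alpha$ and $m(t)=T-t$ otherwise. The function names and the grid-based check are illustrative assumptions.

```python
import math

def m_constant(t, T, alpha, delta):
    """m(t) from (2.8) when alpha and delta are constants."""
    r = delta - alpha
    if abs(r) < 1e-12:
        return T - t
    return (math.exp(r * (T - t)) - 1.0) / r

def assumption_2_1_holds(x0, z0, kappa, T, alpha, delta, n_grid=1000):
    """Check x0 - kappa*t > z0*m(t) at n_grid + 1 equally spaced times in [0, T]."""
    times = (T * i / n_grid for i in range(n_grid + 1))
    return all(x0 - kappa * t > z0 * m_constant(t, T, alpha, delta) for t in times)
```

A grid check can only indicate, not prove, that the inequality holds for all $t\in[0,T]$; in this constant-coefficient case both sides are smooth in $t$, so a fine grid is usually informative.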

The function $\widehat{V}$ can be solved in the explicit form given in (3.7) later. The process $\widetilde{V}(t,\mu_t;\, x_0-\kappa t, z_0)$ with the function $\widetilde{V}$ defined in (2.7) is the Snell envelope of the process $\widehat{V}(t, x_0-\kappa t, z_0, \mu_t)$ above. The function $\widetilde{V}$ in (2.7) can therefore be written as

\begin{align*} \widetilde{V}(t,\eta;\, x_0-\kappa t, z_0)=\underset{\tau\geq t}{\textrm{esssup}}\ \mathbb{E}\Big[ \widehat{V}(\tau, x_0-\kappa\tau, z_0, \mu_\tau) \Big{|}\mu_t=\eta \Big]. \end{align*}

The continuation region, interpreted as the region where the investor continues to use full information observations to update the input value, is denoted by $\mathcal{C}=\{(t,\eta)\in[0,T)\times\mathbb{R}\,:\,\widetilde{V}(t,\eta;\, x_0-\kappa t, z_0)>\widehat{V}(t, x_0-\kappa t,z_0, \eta)\},$ and the free boundary is $\partial\mathcal{C}=\{(t,\eta)\in[0,T)\times\mathbb{R}\,:\,\widetilde{V}(t,\eta;\, x_0-\kappa t, z_0)=\widehat{V}(t, x_0-\kappa t,z_0, \eta)\}$ . Let us denote $\widetilde{V}(t,\eta;\, x_0-\kappa t, z_0)$ by $\widetilde{V}(t,\eta)$ for short when there is no possibility of confusion. By heuristic arguments, we can write the Hamilton–Jacobi–Bellman (HJB) variational inequalities with the terminal condition $\widetilde{V}(T,\eta)=0$ , $\eta\in\mathbb{R}$ , as

(2.9)

\begin{align} \mbox{min}\left\{ \widetilde{V}(t,\eta) - \widehat{V}(t, x_0-\kappa t, z_0, \eta), \ \ -\frac{\partial \widetilde{V}(t,\eta)}{\partial t}-\mathcal{L}\widetilde{V}(t,\eta) \right\}=0, \end{align}

where

\begin{equation*}\mathcal{L}\widetilde{V}(t,\eta)=-\lambda(\eta-\bar{\mu})\frac{ \partial \widetilde{V} }{ \partial \eta }(t,\eta) +\frac{1}{2} \sigma_\mu^2 \frac{ \partial^2 \widetilde{V} }{ \partial \eta^2 }(t,\eta).\end{equation*}

To simplify notation in the following sections, we shall rewrite (2.9) as

(2.10)

\begin{align} \left\{\begin{array}{lr} F\Big(t,\eta,v, \frac{\partial v}{\partial t},\frac{ \partial v }{ \partial \eta }, \frac{ \partial^2 v }{ \partial \eta^2 } \Big)=0, \ \ \ \mbox{on} \ [0,T)\times\mathbb{R},\\[8pt] v(T,\eta)=0, \ \ \ \mbox{for} \ \eta\in\mathbb{R}, \end{array}\right. \end{align}

with the operator $F(t,\eta,v,v_t,v_\eta,v_{\eta\eta})\,:\!=\,\mbox{min}\left\{ v-\widehat{V}, \ \ -\frac{\partial v}{\partial t}-\mathcal{L}v\right\}$ .

Remark 2.2. The part $-\frac{\partial \widetilde{V}}{\partial t}-\mathcal{L}\widetilde{V}=0$ in (2.9) is a linear parabolic partial differential equation (PDE) and does not depend on the interior control $(\pi,c)$. The comparison part $\widetilde{V} - \widehat{V}$ in (2.9) depends on the optimal control $(\pi,c)$ because $\widehat{V}$ is the value function of the interior control problem given the inputs $\hat{X}_t=x_0-\kappa t$, $Z_t=z_0$, and $\hat{\mu}_t=\mu_t=\eta$.

The next theorem is the main result of this paper.

Theorem 2.1. The quantity $\widetilde{V}(t,\eta)$ defined in (2.7) is the unique bounded and continuous viscosity solution to the variational inequalities (2.9). In addition, the optimal entry time for the composite problem (2.7) is given by the $\mathcal{F}_t$ -adapted stopping time

(2.11)

\begin{align}\tau^*\,:\!=\,T\wedge\inf\big\{t\geq 0\,:\, \widetilde{V}(t,\mu_t;\, x_0-\kappa t, z_0)=\widehat{V}(t, x_0-\kappa t,z_0, \mu_t)\big\}.\end{align}

We also have that the process $\widetilde{V}(t,\mu_t;\, x_0-\kappa t,z_0)$ is a martingale with respect to the full information filtration $\mathcal{F}_t$ for $0\leq t\leq \tau^*$ .

The proof will be provided in Section 4.

3. Interior utility maximization under partial observation

We first solve the interior stochastic control problem under partial observation of stock prices.

3.1. Optimal consumption with Kalman–Bucy filtering

For some fixed time $0\leq k\leq T$ , the dynamic interior stochastic control problem under habit formation is defined by

(3.1)

\begin{align} \begin{aligned} &\widehat{V}(k,x,z,\eta;\,\theta)\,:\!=\,\sup_{(\pi,c)\in\mathcal{A}_k(x)}\mathbb{E}\left[\int^{T}_{k}\frac{(c_{s}-Z_{s})^p}{p}ds\bigg{|}\mathcal{F}_k^S\right]\\[5pt] =&\sup_{(\pi,c)\in\mathcal{A}_k(x)}\mathbb{E}\left[\int^{T}_{k}\frac{(c_{s}-Z_{s})^p}{p}ds\bigg{|}\hat{X}_{k}=x,Z_{k}=z,\hat{\mu}_{k}=\eta;\,\hat{\Sigma}(k)=\theta\right], \end{aligned} \end{align}

where $\mathcal{A}_k(x)$ denotes the admissible control space starting from time k. Here, as the conditional variance $\hat{\Sigma}(t)$ is a deterministic function of time, we set $\theta$ as a parameter instead of a state variable.

By using the optimality principle and Itô’s formula, we can heuristically obtain the HJB equation as

(3.2)

\begin{align} \begin{split} &V_{t}-\alpha(t)zV_{z}-\lambda(\eta-\bar{\mu})V_{\eta}+\frac{\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)^2}{2\sigma_{S}^2}V_{\eta\eta}+\max_{(\pi,c)\in\mathcal{A}}\left[-cV_{x}+c\delta(t)V_{z}+\frac{(c-z)^p}{p}\right]\\ &+\max_{(\pi,c)\in\mathcal{A}}\left[\pi\eta V_{x}+\frac{1}{2}\sigma_{S}^2\pi^2V_{xx}+V_{x\eta}\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)\pi\right]=0,\ \ \ k\leq t\leq T, \end{split} \end{align}

with the terminal condition $V(T,x,z,\eta)=0$ .

3.2. The decoupled solution and main results

If $V(t,x,z,\eta)$ is smooth enough, the first-order condition gives

\begin{align*} \begin{split} \pi^*(t,x,z,\eta) & = \frac{-\eta V_{x}-\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)V_{x\eta}}{\sigma_{S}^2 V_{xx}},\\ c^*(t,x,z,\eta) & = z+\big(V_{x}-\delta(t)V_{z}\big)^{\frac{1}{p-1}}. \end{split} \end{align*}

Thanks to the homogeneity property of the power utility, we conjecture the value function to have the form

\begin{equation*}V(t,x,z,\eta)=\frac{[(x-m(t,\eta)z)]^p}{p}N^{1-p}(t,\eta)\end{equation*}

for some functions $m(t,\eta)$ and $N(t,\eta)$ to be determined. The terminal condition $V(T,x,z,\eta)=0$ then requires $N(T,\eta)=0$. In particular, we find that the simple choice $m(t,\eta)\,:\!=\,m(t)$, with $m(t)$ defined in (2.8), suffices. After substitution, the HJB equation reduces to the following linear parabolic PDE for $N(t,\eta)$:

\begin{align*} \begin{aligned} N_t&+\frac{p\eta^2}{2(1-p)^2\sigma_S^2}N(t,\eta) +\frac{\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)^2}{2\sigma_S^2}N_{\eta\eta} +\big(1+\delta(t)m(t)\big)^{\frac{p}{p-1}}\\[5pt] &+\left[-\lambda(\eta-\bar{\mu})+\frac{\eta\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)p}{(1-p)\sigma_S^2}\right]N_{\eta}(t,\eta)=0, \end{aligned} \end{align*}

with $N(T,\eta)=0$. This linear PDE can be solved explicitly as

(3.3)

\begin{align} N(t,\eta)=\int^T_t\big(1+\delta(s)m(s)\big)^{\frac{p}{p-1}}\exp\big(A(t,s)\eta^2+B(t,s)\eta+C(t,s)\big)ds, \end{align}

for $k\leq t\leq T$, where, for $k\leq t\leq s\leq T$, the functions A(t, s), B(t, s), and C(t, s) satisfy the following ODEs:

(3.4)

\begin{align} A_t(t,s)+\frac{p}{2(1-p)^2\sigma_S^2} +2\left[-\lambda+\frac{p\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)}{\sigma_S^2(1-p)}\right]A(t,s) & \nonumber \\[4pt] +\frac{2\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)^2}{\sigma_{S}^2}A^2(t,s) & =0, \end{align}

(3.5)

\begin{align} B_t(t,s)+\left[-\lambda+\frac{p\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)}{\sigma_S^2(1-p)}\right]B(t,s)+2\lambda\bar{\mu}A(t,s) & \notag\\ + \frac{2\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)^2}{\sigma_{S}^2}A(t,s)B(t,s) & =0, \end{align}

(3.6)

\begin{align} C_t(t,s)+\lambda\bar{\mu}B(t,s)+\frac{\big(\hat{\Sigma}(t)+\sigma_{S}\sigma_{\mu}\rho\big)^2}{2\sigma_{S}^2}\big(B^2(t,s)+2A(t,s)\big)=0, \end{align}

with terminal conditions $A(s,s)=B(s,s)=C(s,s)=0$ . The explicit solutions of the ODEs (3.4), (3.5), and (3.6) are reported in Appendix A. For fixed $t\in[k,T]$ , we can define the effective domain of the pair (x, z) by $\mathbb{D}_t\,:\!=\,\{(x^{\prime},z^{\prime})\in(0,+\infty)\times[0,+\infty);\ x^{\prime}\geq m(t)z^{\prime}\}$ , where $k\leq t\leq T$ . The HJB equation (3.2) admits a classical solution on $[k,T]\times\mathbb{D}_t\times\mathbb{R}$ , given by
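Although the ODEs (3.4), (3.5), and (3.6) admit explicit solutions, a direct numerical integration gives a useful cross-check of $N(t,\eta)$ in (3.3). The sketch below integrates the three ODEs backward from the terminal condition at $s$ with a classical Runge-Kutta step and then applies the trapezoid rule in $s$. It assumes constant $\alpha$ and $\delta$, entry at time $0$ (so that $\hat{\Sigma}(0)=0$), and $p<0$; the container `Params` and all function names are hypothetical.

```python
import math
from typing import NamedTuple

class Params(NamedTuple):
    p: float         # power-utility exponent, assumed p < 0
    lam: float       # mean-reversion speed lambda
    mu_bar: float    # long-run drift level
    sigma_mu: float
    sigma_s: float
    rho: float
    alpha: float     # constant habit persistence (assumption)
    delta: float     # constant habit intensity (assumption)
    T: float

def beta(t, q):
    """Sigma_hat(t) + sigma_S*sigma_mu*rho, with Sigma_hat(0) = 0 (entry at time 0)."""
    a = q.lam * q.sigma_s ** 2 + q.sigma_s * q.sigma_mu * q.rho
    k = q.lam ** 2 * q.sigma_s ** 2 + 2.0 * q.sigma_s * q.sigma_mu * q.lam * q.rho + q.sigma_mu ** 2
    root = math.sqrt(k) * q.sigma_s
    e = math.exp(2.0 * math.sqrt(k) / q.sigma_s * t)
    sig = root * ((root + a) * e + (a - root)) / ((root + a) * e - (a - root)) - a
    return sig + q.sigma_s * q.sigma_mu * q.rho

def m(t, q):
    """m(t) from (2.8) for constant alpha and delta."""
    r = q.delta - q.alpha
    return q.T - t if abs(r) < 1e-12 else (math.exp(r * (q.T - t)) - 1.0) / r

def rhs(t, y, q):
    """Time derivatives (A_t, B_t, C_t) read off from (3.4)-(3.6)."""
    A, B, C = y
    b = beta(t, q)
    g = -q.lam + q.p * b / (q.sigma_s ** 2 * (1.0 - q.p))
    w = b * b / q.sigma_s ** 2
    dA = -(q.p / (2.0 * (1.0 - q.p) ** 2 * q.sigma_s ** 2) + 2.0 * g * A + 2.0 * w * A * A)
    dB = -(g * B + 2.0 * q.lam * q.mu_bar * A + 2.0 * w * A * B)
    dC = -(q.lam * q.mu_bar * B + 0.5 * w * (B * B + 2.0 * A))
    return (dA, dB, dC)

def abc(t, s, q, n=100):
    """RK4 backward from A = B = C = 0 at time s down to time t."""
    y, r, h = (0.0, 0.0, 0.0), s, -(s - t) / n
    for _ in range(n):
        k1 = rhs(r, y, q)
        k2 = rhs(r + h / 2, tuple(v + h / 2 * d for v, d in zip(y, k1)), q)
        k3 = rhs(r + h / 2, tuple(v + h / 2 * d for v, d in zip(y, k2)), q)
        k4 = rhs(r + h, tuple(v + h * d for v, d in zip(y, k3)), q)
        y = tuple(v + h / 6 * (d1 + 2 * d2 + 2 * d3 + d4)
                  for v, d1, d2, d3, d4 in zip(y, k1, k2, k3, k4))
        r += h
    return y

def N(t, eta, q, n_s=50):
    """Trapezoid rule for (3.3) on a uniform grid of s in [t, T]."""
    if t >= q.T:
        return 0.0
    ds = (q.T - t) / n_s
    total = 0.0
    for j in range(n_s + 1):
        s = t + j * ds
        A, B, C = abc(t, s, q) if j > 0 else (0.0, 0.0, 0.0)
        f = (1.0 + q.delta * m(s, q)) ** (q.p / (q.p - 1.0)) * math.exp(A * eta ** 2 + B * eta + C)
        total += f * (0.5 if j in (0, n_s) else 1.0)
    return total * ds
```

With $p<0$ and $\lambda,\bar{\mu}>0$, the computed `A` and `B` should come out nonpositive, in line with the sign properties quoted in Remark 4.1, and `N` should be positive for $t<T$.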

(3.7)

\begin{align} \begin{aligned} V(t,x,z,\eta) = & \, \bigg[\int^T_t\big(1+\delta(s)m(s)\big)^{\frac{p}{p-1}}\exp\big(A(t,s)\eta^2+B(t,s)\eta+C(t,s)\big)ds\bigg]^{1-p} \\[4pt] &\times\frac{[(x-m(t)z)]^p}{p}. \end{aligned} \end{align}

Remark 3.1. The effective domain of $V(t,x,z,\eta)$ requires some constraints on the optimal wealth process $\hat{X}^*_t$ and habit formation process $Z^*_t$ such that $\hat{X}^*_t\geq m(t)Z^*_t$ for $t\in[k,T]$ . In particular, we have to enforce the initial wealth-habit budget constraint that $\hat{X}_k\geq m(k)Z_k$ at the initial time k.

Theorem 3.1. (Verification theorem) If the initial budget constraint $\hat{X}_k\geq m(k)Z_k$ holds at time k, the unique solution (3.7) of the HJB equation equals the value function defined in (3.1), i.e., $V(k,x,z,\eta)=\widehat{V}(k,x,z,\eta)$ . Moreover, the optimal investment policy $\pi^*_t$ and optimal consumption policy $c^*_t$ are given in feedback form by $\pi^*_t=\pi^*(t,\hat{X}^*_t,Z^*_t,\hat{\mu}_t)$ and $\ c^*_t=c^*(t,\hat{X}^*_t,Z^*_t,\hat{\mu}_t)$ , $k\leq t\leq T$ . The function $\pi^*(t,x,z,\eta)\,:\,[k,T]\times\mathbb{D}_t\times\mathbb{R}\rightarrow\mathbb{R}$ is given by

(3.8)

\begin{align} \pi^*(t,x,z,\eta) =\left[\frac{\eta}{(1-p)\sigma^2_S}+\frac{\big(\hat{\Sigma}(t)+\sigma_S\sigma_\mu\rho\big)}{\sigma^2_S}\frac{N_{\eta}(t,\eta)}{N(t,\eta)}\right](x-m(t)z), \end{align}

and the function $c^*(t,x,z,\eta)\,:\,[k,T]\times\mathbb{D}_t\times\mathbb{R}\rightarrow\mathbb{R}^+$ is given by

(3.9)

\begin{align} c^*(t,x,z,\eta)=z+\frac{(x-m(t)z)}{\big(1+\delta(t)m(t)\big)^{\frac{1}{1-p}}N(t,\eta)}. \end{align}

The optimal wealth process $\hat{X}^*_t$ , $k\leq t\leq T$ , is given by

(3.10)

\begin{align} \hat{X}^*_t=&(x-m(k)z)\frac{N(t,\hat{\mu}_t)}{N(k,\eta)}\exp\left(\int^t_k\frac{(\hat{\mu}_u)^2}{2(1-p)\sigma^2_S}du+\int^t_k\frac{\hat{\mu}_u}{(1-p)\sigma_S}d\hat{W}_u\right) +m(t)Z^*_t. \end{align}
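As a consistency check on (3.9), the following sketch recovers the optimal consumption from (3.7). It assumes that $N(t,\eta)$ denotes the bracketed integral in (3.7), so that $V=N^{1-p}(x-m(t)z)^p/p$ , and that the HJB equation (3.2), which is not reproduced in this section, yields the usual first-order condition $(c-z)^{p-1}=V_x-\delta(t)V_z$ for consumption. Both are assumptions made only for this illustration.

\begin{align*} V_x&=N(t,\eta)^{1-p}(x-m(t)z)^{p-1}, \qquad V_z=-m(t)V_x,\\[4pt] (c^*-z)^{p-1}&=V_x-\delta(t)V_z=\big(1+\delta(t)m(t)\big)N(t,\eta)^{1-p}(x-m(t)z)^{p-1},\\[4pt] c^*-z&=\big(1+\delta(t)m(t)\big)^{\frac{1}{p-1}}N(t,\eta)^{-1}(x-m(t)z)=\frac{x-m(t)z}{\big(1+\delta(t)m(t)\big)^{\frac{1}{1-p}}N(t,\eta)}. \end{align*}

Under these assumptions the result agrees with (3.9). Provided $N(t,\eta)>0$ and $1+\delta(t)m(t)>0$ , it also shows that $c^*>z$ whenever $x>m(t)z$ , so consumption stays above the habit level in the interior of $\mathbb{D}_t$ .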

4. Exterior optimal stopping problem

4.1. Stochastic Perron method

We next study the exterior optimal entry problem. Recall that $\hat{X}_{\tau}=x_0-\kappa\tau$ , $Z_{\tau}=z_0$ , and $\hat{\Sigma}(\tau)=0$ are all taken as parameters. Our aim is to solve an optimal stopping problem in which $\mu_t$ is the only underlying state process.

Remark 4.1. Recall that the interior value function $\widehat{V}$ is of the form in (3.7). Moreover, by Remark A.1, the functions A(t, s) and B(t, s) in (3.7) satisfy $A(t,s)\leq 0$ and $B(t,s)\leq 0$ since $p<0$ . Consequently, if we regard $\widehat{V}(\tau, \hat{\mu}_{\tau})$ as a function of the input $\hat{\mu}_{\tau}$ , it is in general neither globally convex nor globally concave in $\hat{\mu}_{\tau}\in\mathbb{R}$ , because the convexity of the function $\exp\left({A(t,s)\eta^2+B(t,s)\eta+C(t,s)}\right)$ in the variable $\eta\in\mathbb{R}$ depends on the values of A(t, s) and B(t, s). Therefore, the composite value function $\widetilde{V}(t,\eta)$ in (2.7) is in general neither globally convex nor globally concave in $\eta\in\mathbb{R}$ , and its shape depends on all model parameters.
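To see concretely how convexity can fail, fix (t, s) and write $a=A(t,s)$ , $b=B(t,s)$ , $c=C(t,s)$ ; this shorthand and the function g below are introduced only for this illustration. For $g(\eta)\,:\!=\,\exp(a\eta^2+b\eta+c)$ we have

\begin{align*} g^{\prime}(\eta)&=(2a\eta+b)\,g(\eta),\\[4pt] g^{\prime\prime}(\eta)&=\big[(2a\eta+b)^2+2a\big]\,g(\eta). \end{align*}

If $a<0$ , then $g^{\prime\prime}(\eta)<0$ exactly when $\big|\eta+\frac{b}{2a}\big|<\frac{1}{\sqrt{-2a}}$ and $g^{\prime\prime}(\eta)>0$ when $\big|\eta+\frac{b}{2a}\big|>\frac{1}{\sqrt{-2a}}$ , so g is concave near $\eta=-\frac{b}{2a}$ and convex in the tails. If $a=0$ , then g is convex. This computation concerns the integrand only; the shape of $\widehat{V}$ itself also involves the integral over s, the power $1-p$ , and the negative factor $\frac{1}{p}$ .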

We choose to apply the stochastic Perron method in the present paper to verify that the value function of the composite problem is the unique viscosity solution of some variational inequalities. We first introduce sets of stochastic semi-solutions $\mathcal{V}^+$ and $\mathcal{V}^-$ and prove that $v^-\leq \widetilde{V}\leq v^+$ , where $v^-$ and $v^+$ are defined later in (4.2) and (4.3). Using the stochastic Perron method, we can show that $v^+$ is a bounded and upper semi-continuous (u.s.c.) viscosity subsolution and $v^-$ is a bounded and lower semi-continuous (l.s.c.) viscosity supersolution. Finally, we prove the comparison principle: that is, if we have any bounded and u.s.c. viscosity subsolution u and any bounded and l.s.c. viscosity supersolution v of (2.10), we must have the order $u\leq v$ . It follows that $v^+\leq v^-$ , which leads to the desired conclusion that $v^-= \widetilde{V}= v^+$ and the value function is the unique viscosity solution.

We next present the definitions of stochastic semi-solutions, which are mainly motivated by [4].

Definition 4.1. The set of stochastic supersolutions for the PDE (2.10), denoted by $\mathcal{V}^+$ , is the set of functions $v\,:\,[0,T]\times\mathbb{R}\longrightarrow\mathbb{R}$ which have the following properties:

(i) The function v is u.s.c. and bounded on $[0,T]\times\mathbb{R}$ , and $v(t,\eta)\geq \widehat{V}(t,x_0-\kappa t, z_0,\eta)$ for any $(t,\eta)\in[0,T]\times\mathbb{R}$ .
(ii) For each $(t,\eta)\in[0,T]\times\mathbb{R}$ and any stopping time $t\leq \tau_1\in \mathcal {T}$ , we have $v(\tau_1,\mu_{\tau_1})\geq\mathbb{E}[v(\tau_2,\mu_{\tau_2})|\mathcal{F}_{\tau_1}]$ , $\mathbb{P}$ -a.s., for any $\tau_2\in\mathcal{T}$ and $\tau_2\geq \tau_1$ . That is, the function v along the solution of the SDE (2.2) is a supermartingale under the full information filtration $(\mathcal {F}_t)_{t\in[0,T]}$ between $\tau_1$ and T.

Definition 4.2. The set of stochastic subsolutions for the PDE (2.10), denoted by $\mathcal{V}^-$ , is the set of functions $v\,:\,[0,T]\times \mathbb{R}\longrightarrow\mathbb{R}$ which have the following properties:

(i) The function v is l.s.c. and bounded on $[0,T]\times \mathbb{R}$ , and $v(T,\eta)\leq 0$ for any $\eta\in\mathbb{R}$ .
(ii) For each $(t,\eta)\in[0,T]\times \mathbb{R}$ and any stopping time $t\leq \tau_1\in \mathcal {T}$ , we have $v(\tau_1,\mu_{\tau_1})\leq\mathbb{E}[v(\tau_2\wedge \zeta,\mu_{\tau_2\wedge \zeta})|\mathcal{F}_{\tau_1}]$ , $\mathbb{P}$ -a.s., for any $\tau_2\in\mathcal{T}$ and $\tau_2\geq\tau_1$ . That is, the function v along the solution to (2.2) is a submartingale under the full information filtration $(\mathcal {F}_t)_{t\in[0,T]}$ between $\tau_1$ and $\zeta$ , where
(4.1)

\begin{align} \zeta\,:\!=\,\inf\big\{t\in [\tau_1, T]\,:\, v(t,\mu_t;\, x_0-\kappa t, z_0)\geq \widehat{V}(t, x_0-\kappa t,z_0, \mu_t)\big\}. \end{align}

Remark 4.2. We note that the definitions of stochastic supersolutions and stochastic subsolutions for the optimal stopping problem are not symmetric, which is consistent with the similar definitions in [4]. The asymmetry reflects the natural supermartingale property of the Snell envelope process and its martingale property between the initial time and the first hitting time $\zeta$ in (4.1). That is, we naturally need $v(t,\eta)\geq \widehat{V}(t,x_0-\kappa t, z_0,\eta)$ for all $(t,\eta)\in[0,T]\times\mathbb{R}$ , including the terminal time T, in item (i) of Definition 4.1 (the definition of stochastic supersolutions), but we only require $v(T,\eta)\leq \widehat{V}(T,x_0-\kappa T, z_0,\eta)=0$ at the terminal time T in item (i) of Definition 4.2 (the definition of stochastic subsolutions). These comparison results and the supermartingale and submartingale properties play important roles in establishing the desired sandwich result $v^-\leq \widetilde{V}\leq v^+$ in Lemma 4.4.

Lemma 4.1. $\widehat{V}(t,x_0-\kappa t, z_0, \eta;\, 0)$ is bounded and continuous for $(t,\eta)\in [0,T]\times \mathbb{R}$ .

Proof. For fixed $x_0$ and $z_0$ , it is clear that $\widehat{V}(t,x_0-\kappa t,z_0,\eta)$ in (3.7) is continuous and $\widehat{V}(t,x_0-\kappa t,z_0,\eta)\leq 0$ . Therefore we only need to show that $\widehat{V}$ is bounded below. By Appendix A, we know that $A(u)\leq 0$ , $B(u)\leq 0$ , and $C(u)\leq K$ for some $K\geq 0$ , thanks to $p<0$ . We hence obtain that $\big(A(u)\eta^2+B(u)\eta+C(u)\big)\leq K_1$ for some $K_1>0$ , and it follows that $\widehat{V}(t,x_0-\kappa t,z_0,\eta)$ is bounded below by some constant for $(t,\eta)\in [0,T]\times \mathbb{R}$ , again by $p<0$ .
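The lower bound can be made explicit under one additional assumption, namely that $x_0-\kappa t-m(t)z_0\geq\varepsilon_0$ on $[0,T]$ for some constant $\varepsilon_0>0$ . The constant $\varepsilon_0$ is our notation, and whether this condition is imposed elsewhere in the paper is not visible in this section. Writing the exponent bound in the notation of (3.7) as $A(t,s)\eta^2+B(t,s)\eta+C(t,s)\leq K_1$ , and using $1-p>0$ and $\frac{1}{p}<0$ , a sketch of the estimate reads

\begin{align*} 0\geq\widehat{V}(t,x_0-\kappa t,z_0,\eta) &\geq \frac{(x_0-\kappa t-m(t)z_0)^p}{p}\bigg[e^{K_1}\int_t^T\big(1+\delta(s)m(s)\big)^{\frac{p}{p-1}}ds\bigg]^{1-p}\\[4pt] &\geq \frac{\varepsilon_0^{\,p}}{p}\,e^{(1-p)K_1}\bigg[\int_0^T\big(1+\delta(s)m(s)\big)^{\frac{p}{p-1}}ds\bigg]^{1-p}. \end{align*}

The right-hand side does not depend on $(t,\eta)$ and is finite whenever the last integral is finite.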

As it is trivial to see that $0\in\mathcal{V}^-$ and $0\in\mathcal{V}^+$ , we have the following result.

Lemma 4.2. $\mathcal{V}^+$ and $\mathcal{V}^-$ are nonempty.

Definition 4.3. We define

(4.2)

\begin{align} v^-\,:\!=\,\sup_{p\in\mathcal{V}^-}p, \end{align}

(4.3)

\begin{align} v^+\,:\!=\,\inf_{q\in\mathcal{V}^+}q. \end{align}

The next result is similar to Lemma 2.2 of [2].

Lemma 4.3. We have $v^-\in\mathcal{V}^-$ and $v^+\in\mathcal{V}^+$ .

We now have the first important sandwich result.

Lemma 4.4. We have $v^-\leq \widetilde{V}\leq v^+$ .

Proof. For each $v\in\mathcal{V}^+$ , let us consider $\tau_1=t\geq 0$ in Definition 4.1. For any $\tau\geq t$ , we have $v(t,\eta)\geq\mathbb{E}[v(\tau,\mu_{\tau})|\mathcal{F}_t]\geq\mathbb{E}\big[\widehat{V}\big(\tau,x_0-\kappa\tau,z_0,\mu_\tau\big)|\mathcal{F}_t\big]$ thanks to the supermartingale property in Definition 4.1. It follows that $v(t,\eta)\geq{\textrm{esssup}_{t\leq \tau}}\ \mathbb{E}\big[\widehat{V}(\tau,x_0-\kappa\tau,z_0, \mu_\tau)|\mathcal{F}_t\big]$ . This implies that $v(t,\eta)\geq\widetilde{V}(t,\eta)$ in view of the definition of $\widetilde{V}(t,\eta)$ , and hence $\widetilde{V}\leq v^+$ by the definition in (4.3). On the other hand, for each $v\in\mathcal{V}^-$ , by taking $\tau_1=t\geq 0$ in Definition 4.2, we have $v(t,\eta)\leq\mathbb{E}[v(\tau\wedge\zeta,\mu_{\tau\wedge\zeta})|\mathcal{F}_t]$ for any $\tau\geq t$ because of the submartingale property in Definition 4.2. In particular, using the definition of $\zeta$ , we further have

\begin{align*} v(t,\eta) &\leq \mathbb{E}\big[v(\tau\wedge\zeta,\mu_{\tau\wedge\zeta})|\mathcal{F}_t\big] \leq \mathbb{E}\big[\widehat{V}(\tau\wedge\zeta,x_0-\kappa(\tau\wedge\zeta),z_0,\mu_{\tau\wedge\zeta})|\mathcal{F}_t\big] \\ &\leq{\textrm{esssup}_{\tau\geq t}}\ \mathbb{E}\big[\widehat{V}(\tau,x_0-\kappa\tau,z_0, \mu_\tau)|\mathcal{F}_t\big]=\widetilde{V}(t,\eta). \end{align*}

It then follows that $\widetilde{V}\geq v^-$ because of (4.2). In conclusion, we have the inequality $v^-\leq \widetilde{V}\leq v^+$ .

Theorem 4.1. The function $v^-$ in Definition 4.3 is a bounded and l.s.c. viscosity supersolution of

(4.4)

\begin{align} \left\{\begin{array}{lr} F(t,\eta,v,v_t,v_\eta,v_{\eta\eta})\geq 0, \quad \mbox{on} \ [0,T)\times\mathbb{R},\\[5pt] v(T,\eta)\geq 0, \quad \mbox{for any} \ \eta\in\mathbb{R}, \end{array}\right. \end{align}

and the function $v^+$ in Definition 4.3 is a bounded and u.s.c. viscosity subsolution of

(4.5)

\begin{align} \left\{\begin{array}{lr} F(t,\eta,v,v_t,v_\eta,v_{\eta\eta})\leq 0, \quad \mbox{on} \ [0,T)\times\mathbb{R},\\[5pt] v(T,\eta)\leq 0, \quad \mbox{for any} \ \eta\in\mathbb{R}. \end{array}\right. \end{align}

Proof. We follow some arguments from [2, 4], modifying them to fit our setting.

(i) The subsolution property of $v^+$ . First, the definition in (4.3) and Lemma 4.3 imply that $v^+$ is bounded and u.s.c. Suppose $v^+$ is not a viscosity subsolution; then there exist some interior point $(\bar{t},\bar{\eta})\in(0,T)\times\mathbb{R}$ and a $C^{1,2}$ test function $\varphi\,:\,[0,T]\times\mathbb{R}\rightarrow\mathbb{R}$ such that $v^+ - \varphi$ attains a strict local maximum equal to zero at $(\bar{t},\bar{\eta})$ and $F(\bar{t},\bar{\eta},\varphi,\varphi_t,\varphi_\eta,\varphi_{\eta\eta})>0$ at this point. It follows that

\begin{align*} \left\{ \begin{array}{lr} v^+\big(\bar{t},\bar{\eta}\big) - \widehat{V}\big(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta}\big)> 0,\\[8pt] -\dfrac{\partial \varphi}{\partial t}\big(\bar{t},\bar{\eta}\big)-\mathcal{L}\varphi\big(\bar{t},\bar{\eta}\big) >0. \end{array} \right. \end{align*}

There exists a ball $B(\bar{t},\bar{\eta},\varepsilon)$ small enough that

\begin{align*} \left\{ \begin{array}{lr} -\dfrac{\partial \varphi}{\partial t}-\mathcal{L}\varphi>0 \ \ \mbox{on} \ \ \overline{B(\bar{t},\bar{\eta},\varepsilon)},\\[8pt] \varphi>v^+ \ \ \mbox{on} \ \ \overline{B\big(\bar{t},\bar{\eta},\varepsilon\big)}\backslash\big(\bar{t},\bar{\eta}\big). \end{array} \right. \end{align*}

In addition, as $\varphi(\bar{t},\bar{\eta}) = v^+(\bar{t},\bar{\eta})> \widehat{V}(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta})$ , $\varphi$ is continuous, and $\widehat{V}$ is continuous, we can derive that for some $\varepsilon$ small enough, we have $\varphi-\varepsilon\geq\widehat{V}$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}$ . Because $v^+-\varphi$ is u.s.c. and $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})$ is compact, it then follows that there exists a $\delta>0$ such that $\varphi-\delta\geq v^+$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})$ .

If we choose $0<\xi<\delta\wedge\varepsilon$ , the function $\varphi^\xi=\varphi-\xi$ satisfies

\begin{align*} \left\{ \begin{array}{lr} -\dfrac{\partial \varphi^\xi}{\partial t}-\mathcal{L}\varphi^\xi>0 \ \ \mbox{on} \ \ \overline{B(\bar{t},\bar{\eta},\varepsilon)},\\[8pt] \varphi^\xi>v^+ \ \ \mbox{on} \ \ \overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big),\\[8pt] \varphi^\xi\geq\widehat{V} \ \ \mbox{on} \ \ \overline{B(\bar{t},\bar{\eta},\varepsilon)}, \end{array} \right. \end{align*}

and $\varphi^\xi(\bar{t},\bar{\eta})=v^+(\bar{t},\bar{\eta})-\xi$ .

Let us define an auxiliary function by

\begin{align*} v^\xi\,:\!=\,\left\{ \begin{array}{lr} v^+\wedge\varphi^\xi \ \ \mbox{on} \ \ \overline{B(\bar{t},\bar{\eta},\varepsilon)},\\[8pt] v^+ \ \ \mbox{outside} \ \ \overline{B(\bar{t},\bar{\eta},\varepsilon)}. \end{array} \right. \end{align*}

It is easy to check that $v^\xi$ is u.s.c. and $v^\xi(\bar{t},\bar{\eta})=\varphi^\xi(\bar{t},\bar{\eta})<v^+(\bar{t},\bar{\eta})$ . We claim that $v^\xi$ satisfies the terminal condition. Indeed, shrinking $\varepsilon>0$ if necessary so that $T>\bar{t}+\varepsilon$ , the ball $\overline{B(\bar{t},\bar{\eta},\varepsilon)}$ does not meet $\{T\}\times\mathbb{R}$ , so $v^\xi(T,\cdot)=v^+(T,\cdot)$ , and we recall that $v^+$ satisfies the terminal condition. We then show that $v^\xi\in \mathcal{V}^+$ , which contradicts the definition of $v^+$ in (4.3).

Let us fix $(t,\eta)$ and recall that $((\mu_s)_{t\leq s\leq T},(W_s,B_s)_{t\leq s\leq T},\Omega,\mathcal{F},\mathbb{P},(\mathcal{F}_s)_{t\leq s\leq T})\in\chi$ , where $\chi$ is the nonempty set of all weak solutions. We need to show that the process $(v^\xi(s,\mu_s))_{t\leq s\leq T}$ is a supermartingale on $(\Omega,\mathbb{P})$ with respect to $(\mathcal{F}_s)_{t\leq s\leq T}$ . We first assume that $(v^+(s,\mu_s))_{t\leq s\leq T}$ has right-continuous paths. In this case, $v^\xi$ is a supermartingale locally in the region $[t,T]\times\mathbb{R}\backslash B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)$ because it equals the right-continuous supermartingale $(v^+(s,\mu_s))_{t\leq s\leq T}$ . As the process $(v^\xi(s,\mu_s))_{t\leq s\leq T}$ is the minimum between two local supermartingales in the region $B(\bar{t},\bar{\eta},\varepsilon)$ , it is a local supermartingale. As the two regions $[t,T]\times\mathbb{R}\backslash B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)$ and $B(\bar{t},\bar{\eta},\varepsilon)$ overlap over an open region, $(v^\xi(s,\mu_s))_{t\leq s\leq T}$ is actually a supermartingale.

If the process $(v^+(s,\mu_s))_{t\leq s\leq T}$ is not right-continuous, we can consider its right-continuous limit over rational times to transform it to the special case discussed above. In particular, for a given rational number r and fixed $0 \leq t\leq r \leq s \leq T$ and $\eta\in\mathbb{R}$ , it remains to show the process $(Y_u)_{t\leq u\leq T}\,:\!=\,(v^\xi(u,\mu_u))_{t\leq u\leq T}$ between r and s is a supermartingale, which is equivalent to showing that $Y_r \geq \mathbb{E}[Y_s|\mathcal{F}_r]$ .

Let us define $G_u\,:\!=\,v^+(u,\mu_u), \ r \leq u \leq s,$ and freeze the process G after time s, i.e. $G_u\,:\!=\,v^+(s,\mu_s), \ s \leq u \leq T$ . As $(G_u)_{r\leq u\leq T}$ may not be right-continuous, by Proposition 1.3.14 in [19], we can consider its right-continuous modification

\begin{align*} G^+_u(\omega)\,:\!=\,\lim_{u^{\prime}\rightarrow u, \ u^{\prime}>u, \ u^{\prime}\in\mathbb{Q}}G_{u^{\prime}}(\omega), \ \ \ r\leq u\leq T. \end{align*}

Note that $G^+$ is a right-continuous supermartingale with respect to $\mathcal{F}$ that satisfies the usual conditions. Because $v^+$ is u.s.c. and the process remains the same after s, we conclude that $G_r\geq G_r^+, \ G_s=G_s^+$ . Recall that $v^+<\varphi-\delta$ in the open region $B(\bar{t},\bar{\eta},\varepsilon)\backslash\overline{B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)}$ ; if we take right limits inside this region and use the continuous function $\varphi$ , we have

\begin{align*} G^+_u<\varphi^\xi(u,\mu_u), \ \mbox{if} \ (u,\mu_u)\in B(\bar{t},\bar{\eta},\varepsilon)\backslash\overline{B\bigg(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\bigg)}. \end{align*}

Thus, if we consider the process

\begin{align*} Y_u^+\,:\!=\,\left\{ \begin{array}{lr} G_u^+, \ (u,\mu_u)\not\in\overline{B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)},\\[8pt] G_u^+\wedge\varphi^\xi(u,\mu_u), \ (u,\mu_u)\in B(\bar{t},\bar{\eta},\varepsilon), \end{array} \right. \end{align*}

we also have $Y_r\geq Y_r^+, \ Y_s=Y_s^+$ .

Because $G^+$ has right-continuous paths, we can conclude that Y is a supermartingale such that

\begin{align*} Y_r\geq Y_r^+\geq\mathbb{E}\big[Y_s^+|\mathcal{F}_r\big] =\mathbb{E}\big[Y_s|\mathcal{F}_r\big]. \end{align*}

(ii) The terminal condition for $v^+$ .

For some $\eta_0\in\mathbb{R}$ , we assume that $v^+(T,\eta_0)>0$ and will derive a contradiction. As $\widehat{V}$ is continuous on $\mathbb{R}$ , we can choose an $\varepsilon>0$ such that $0 \leq v^+(T,\eta_0)-\varepsilon$ , so that $v^+(T,\eta_0)\geq\widehat{V}(T,x_0-\kappa T,z_0,\eta)+\varepsilon$ for all $\eta$ with $|\eta-\eta_0|\leq\varepsilon$ . On the compact set $(\overline{B(T,\eta_0,\varepsilon)}\backslash B(T,\eta_0,\frac{\varepsilon}{2}))\cap([0,T]\times\mathbb{R})$ , $v^+$ is bounded above by the definition of $\mathcal{V}^+$ and the fact that $v^+\in\mathcal{V}^+$ . Moreover, as $v^+$ is u.s.c. on this compact set, we can find $\delta>0$ small enough so that

(4.6)

\begin{align} v^+(T,\eta_0) + \frac{\varepsilon^2}{4\delta} \geq \varepsilon+\sup_{(t,\eta)\in(\overline{B(T,\eta_0,\varepsilon)}\backslash B(T,\eta_0,\frac{\varepsilon}{2}))\cap([0,T]\times\mathbb{R})}v^+(t,\eta). \end{align}

Next, for $k>0$ , we define the function

\begin{equation*}\varphi^{\delta,\varepsilon,k}(t,\eta) \,:\!=\, v^+(T,\eta_0) + \frac{|\eta-\eta_0|^2}{\delta} + k(T-t).\end{equation*}

For k large enough, we derive that $-\varphi^{\delta,\varepsilon,k}_t - \mathcal{L}\varphi^{\delta,\varepsilon,k} > 0 \ \mbox{on} \ \overline{B(T,\eta_0,\varepsilon)}$ . Moreover, in view of (4.6), we have

\begin{align*} \varphi^{\delta,\varepsilon,k} \geq \varepsilon + v^+ \ \mbox{on} \ \Big(\overline{B(T,\eta_0,\varepsilon)}\backslash B\Big(T,\eta_0,\frac{\varepsilon}{2}\Big)\Big)\cap([0,T]\times\mathbb{R}), \end{align*}

and $\varphi^{\delta,\varepsilon,k}(T,\eta) \geq v^+(T,\eta_0) \geq \varepsilon$ for $|\eta-\eta_0|\leq\varepsilon$ .

Now, we can find $\xi<\varepsilon$ and define the following function:

\begin{align*} v^{\delta,\varepsilon,k,\xi}\,:\!=\,\left\{\begin{array}{lr} v^+\wedge\big(\varphi^{\delta,\varepsilon,k}-\xi\big) \ \ \mbox{on} \ \ \overline{B(T,\eta_0,\varepsilon)},\\[7pt] v^+ \ \ \mbox{outside} \ \ \overline{B(T,\eta_0,\varepsilon)}. \end{array}\right. \end{align*}

By following an argument similar to that used in Step (i), one can obtain that $v^{\delta,\varepsilon,k,\xi}\in\mathcal{V}^+$ , but $v^{\delta,\varepsilon,k,\xi}(T,\eta_0) = v^+(T,\eta_0)-\xi<v^+(T,\eta_0)$ , which contradicts the definition of $v^+$ in (4.3) as the infimum over $\mathcal{V}^+$ .

(iii) The supersolution property of $v^-$ .

We provide only a sketch of the proof, as it is essentially similar to that of Step (i). Suppose that $v^-$ is not a viscosity supersolution; then there exist some interior point $(\bar{t},\bar{\eta})\in(0,T)\times\mathbb{R}$ and a $C^{1,2}$ test function $\psi\,:\,[0,T]\times\mathbb{R}\rightarrow\mathbb{R}$ such that $v^- - \psi$ attains a strict local minimum equal to zero at $(\bar{t},\bar{\eta})$ and $F(\bar{t},\bar{\eta},\psi,\psi_t,\psi_\eta,\psi_{\eta\eta})<0$ at this point. Since F is the minimum of two terms, there are two separate cases to check.

Case (i): $v^-(\bar{t},\bar{\eta})-\widehat{V}(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta})<0$ . This already leads to a contradiction with $v^-(\bar{t},\bar{\eta})\geq\widehat{V}(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta})$ by the definition of $v^-$ .

Case (ii): $-\frac{\partial \psi}{\partial t}(\bar{t},\bar{\eta})-\mathcal{L}\psi(\bar{t},\bar{\eta}) <0$ . In this case we can find a ball $B(\bar{t},\bar{\eta},\varepsilon)$ small enough so that $-\frac{\partial \psi}{\partial t}-\mathcal{L}\psi<0$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}$ . Moreover, as $v^--\psi$ is l.s.c. and $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})$ is compact, there exists a $\delta>0$ such that $\psi+\delta\leq v^-$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})$ . We can then choose $\xi\in(0,\frac{\delta}{2})$ small such that $\psi^\xi=\psi+\xi$ satisfies three properties:

(i) $-\frac{\partial \psi^\xi}{\partial t}-\mathcal{L}\psi^\xi<0$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}$ ;
(ii) we have $v^-\geq\psi+\delta > \psi+\xi = \psi^\xi$ on $\overline{B(\bar{t},\bar{\eta},\varepsilon)}\backslash B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)$ ;
(iii) $\psi^\xi(\bar{t},\bar{\eta}) = \psi(\bar{t},\bar{\eta}) + \xi = v^-(\bar{t},\bar{\eta}) +\xi > v^-(\bar{t},\bar{\eta})$ .

Thus, we can define an auxiliary function by

\begin{align*} v^\xi\,:\!=\,\left\{ \begin{array}{lr} v^-\vee\psi^\xi \ \ \mbox{on} \ \ \overline{B(\bar{t},\bar{\eta},\varepsilon)},\\[8pt] v^- \ \ \mbox{outside} \ \ \overline{B(\bar{t},\bar{\eta},\varepsilon)}. \end{array} \right. \end{align*}

By repeating an argument similar to that of Step (i), we obtain that $v^\xi\in \mathcal{V}^-$ by showing that $(v^\xi(s,\mu_s))_{t\leq s\leq T}$ is a submartingale. If $(v^-(s,\mu_s))_{t\leq s\leq T}$ has right-continuous paths, the argument is immediate, as in the first case of Step (i). In general, by Proposition 1.3.14 in [19], we can consider the right-continuous submartingale $G^+_u(\omega)\,:\!=\,\lim_{u^{\prime}\rightarrow u, \ u^{\prime}>u, \ u^{\prime}\in\mathbb{Q}}G_{u^{\prime}}(\omega), \ \omega\in\Omega^\ast, \ r\leq u\leq T$ , where $G_u\,:\!=\,v^-(u, \mu_u), \ r \leq u \leq s$ , and we freeze the process after time s. Similarly to Step (i), $G^+$ is a right-continuous submartingale, and since $v^-$ is l.s.c. and the process is constant after s, we have $G_r\leq G_r^+, \ G_s=G_s^+$ . As $ G^+_u>\psi^\xi(u, \mu_u), \ \mbox{if} \ (u,\mu_u)\in B(\bar{t},\bar{\eta},\varepsilon)\backslash \overline{B(\bar{t},\bar{\eta},\frac{\varepsilon}{2})}$ , we can define the process

\begin{align*} Y_u^+\,:\!=\,\left\{ \begin{array}{lr} G_u^+, \ (u, \mu_u)\not\in\overline{B\big(\bar{t}, \bar{\eta},\frac{\varepsilon}{2}\big)},\\[8pt] G_u^+\vee\psi^\xi(u, \mu_u), \ (u, \mu_u)\in \overline{B\big(\bar{t},\bar{\eta},\frac{\varepsilon}{2}\big)}. \end{array} \right. \end{align*}

We can conclude that $Y_r\leq Y_r^+, \ Y_s=Y_s^+$ , and Y is a submartingale such that $Y_r\leq Y_r^+\leq\mathbb{E}[Y_s^+|\mathcal{F}_r]=\mathbb{E}[Y_s|\mathcal{F}_r]$ , which completes the proof.

(iv) The terminal condition for $v^-$ .

For some $\eta_0\in\mathbb{R}$ , suppose that $v^-(T,\eta_0)<0$ ; we will derive a contradiction. As $\widehat{V}$ is continuous on $\mathbb{R}$ , we can choose an $\varepsilon>0$ such that $0 \geq v^-(T,\eta_0)+\varepsilon$ , so that $v^-(T,\eta_0)\leq\widehat{V}(T,x_0-\kappa T,z_0,\eta)-\varepsilon$ for all $\eta$ with $|\eta-\eta_0|\leq\varepsilon$ . Similarly to Step (ii), we can find $\delta>0$ small enough so that

(4.7)

\begin{align} v^-(T,\eta_0) - \frac{\varepsilon^2}{4\delta} \leq \inf_{(t,\eta)\in\big(\overline{B\big(T,\eta_0,\varepsilon\big)}\backslash B\big(T,\eta_0,\frac{\varepsilon}{2}\big)\big)\cap([0,T]\times\mathbb{R})}v^-(t,\eta)-\varepsilon. \end{align}

Then, for $k>0$ , we consider

\begin{equation*} \psi^{\delta,\varepsilon,k}(t,\eta) \,:\!=\, v^-(T,\eta_0) - \frac{|\eta-\eta_0|^2}{\delta} - k(T-t).\end{equation*}

For k large enough, we have that $-\psi^{\delta,\varepsilon,k}_t - \mathcal{L}\psi^{\delta,\varepsilon,k} < 0 \ \mbox{on} \ \overline{B(T,\eta_0,\varepsilon)}$ . Furthermore, in view of (4.7), we have

\begin{align*} \psi^{\delta,\varepsilon,k} \leq v^- - \varepsilon \ \mbox{on} \ \Big(\overline{B(T,\eta_0,\varepsilon)}\backslash B\Big(T,\eta_0,\frac{\varepsilon}{2}\Big)\Big)\cap([0,T]\times\mathbb{R}), \end{align*}

and $\psi^{\delta,\varepsilon,k}(T,\eta) \leq v^-(T,\eta_0) \leq - \varepsilon$ for $|\eta-\eta_0|\leq\varepsilon$ .

Next, we can find $\xi<\varepsilon$ and define the function

\begin{align*} v^{\delta,\varepsilon,k,\xi}\,:\!=\,\left\{\begin{array}{lr} v^-\vee(\psi^{\delta,\varepsilon,k}+\xi) \ \ \mbox{on} \ \ \overline{B(T,\eta_0,\varepsilon)},\\[8pt] v^- \ \ \mbox{outside} \ \ \overline{B(T,\eta_0,\varepsilon)}. \end{array}\right. \end{align*}

Similarly to Step (iii), we obtain that $v^{\delta,\varepsilon,k,\xi}\in\mathcal{V}^-$ , but $v^{\delta,\varepsilon,k,\xi}(T,\eta_0) = v^-(T,\eta_0)+\xi>v^-(T,\eta_0)$ , which contradicts the definition of $v^-$ in (4.2) as the supremum over $\mathcal{V}^-$ .

We now reverse the time and consider $s\,:\!=\,T-t$ . However, for simplicity of presentation, we continue to use t in place of s whenever there is no possibility of confusion. The variational inequalities can be written as

(4.8)

\begin{align} \mbox{min}\left\{ \widetilde{V}(t,\eta;\,x_0-f(T-t), z_0) - \widehat{V}\big(t, x_0-f(T-t), z_0, \eta\big), \ \ \frac{\partial \widetilde{V}(t,\eta)}{\partial t}-\mathcal{L}\widetilde{V}(t,\eta) \right\}=0,\end{align}

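where

\begin{equation*}\mathcal{L}\widetilde{V}(t,\eta)\,:\!=\,-\lambda(\eta-\bar{\mu})\frac{\partial \widetilde{V}}{\partial \eta}(t,\eta)+\frac{1}{2}\sigma_\mu^2\frac{\partial^2 \widetilde{V}}{\partial \eta^2}(t,\eta),\end{equation*}

with the condition $\widetilde{V}(0,\eta)=0$ .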

Let us write this equivalently as

(4.9)

\begin{align} \left\{\begin{array}{lr} F(t,\eta,v,v_t,v_\eta,v_{\eta\eta})=0, \quad \mbox{on} \ (0,T]\times\mathbb{R},\\[8pt] v(0,\eta)=\widehat{V}(0,x_0-f(0),z_0,\eta), \quad \mbox{for any} \ \eta\in\mathbb{R}, \end{array}\right. \end{align}

where

\begin{equation*}F(t,\eta,v,v_t,v_\eta,v_{\eta\eta})\,:\!=\,\mbox{min}\left\{ v-\widehat{V}, \ \ \frac{\partial v}{\partial t}-\mathcal{L}v\right\}.\end{equation*}

We also have the continuation region as $\mathcal{C}=\{(t,\eta)\in(0,T]\times\mathbb{R}\,:\,\widetilde{V}(t,\eta;\,x_0-f(T-t), z_0)>\widehat{V}(t, x_0-f(T-t), z_0, \eta)\}$ .

Proposition 4.1. (Comparison principle) Let u,v respectively be a u.s.c. viscosity subsolution and an l.s.c. viscosity supersolution of (4.9). If $u(0,\eta)\leq v(0,\eta)$ on $\mathbb{R}$ , then we have $u\leq v$ on $(0,T]\times\mathbb{R}$ .

Proof. We will follow some arguments from [5, 28], modifying them to fit our setting. Suppose that $u(0,\eta)\leq v(0,\eta)$ on $\mathbb{R}$ ; we will prove that $u\leq v$ on $[0,T]\times\mathbb{R}$ . We first construct a strict supersolution to the system (4.9) with suitable perturbations of v. Let us recall that $A\leq 0$ , $B\leq 0$ , and C is bounded above by some constant as in Remark A.1. Moreover, we know that $\widehat{V}(t,x_0-\kappa t,z_0,\eta)\leq 0$ . Let us fix a constant $C_2>0$ small enough so that $\lambda>C_2\sigma^2_{\mu}$ and set $\psi(t,\eta) = C_0e^{t}+e^{C_2\eta^2}$ with some $C_0>1$ . We have that

\begin{align*} \begin{aligned} \frac{\partial \psi}{\partial t} - \mathcal{L}\psi = &C_0e^{t} +C_2\Big[ 2\big(\lambda-C_2\sigma_\mu^2\big)\eta^2-2\lambda\bar{\mu}\eta-\sigma_\mu^2 \Big]e^{C_2\eta^2}\\[4pt]\geq& C_0e^{t} + C_2\frac{-2\big(\lambda-C_2\sigma_\mu^2\big)\sigma_\mu^2-\lambda^2\bar{\mu}^2}{2\big(\lambda-C_2\sigma_\mu^2\big)}\\[4pt]>& C_0 + C_2\frac{-2\big(\lambda-C_2\sigma_\mu^2\big)\sigma_\mu^2-\lambda^2\bar{\mu}^2}{2\big(\lambda-C_2\sigma_\mu^2\big)}. \end{aligned} \end{align*}

We can then choose $C_0>1$ large enough so that

\begin{equation*}C_0 + C_2\frac{-2\big(\lambda-C_2\sigma_\mu^2\big)\sigma_\mu^2-\lambda^2\bar{\mu}^2}{2\big(\lambda-C_2\sigma_\mu^2\big)}>1,\end{equation*}

which guarantees that

(4.10)

\begin{align}\frac{\partial \psi}{\partial t} - \mathcal{L}\psi >1.\end{align}
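The first equality in the computation of $\frac{\partial \psi}{\partial t} - \mathcal{L}\psi$ above can be verified term by term. The sketch below takes the generator to be $\mathcal{L}\varphi=-\lambda(\eta-\bar{\mu})\varphi_\eta+\frac{1}{2}\sigma_\mu^2\varphi_{\eta\eta}$ , which is the form implied by (4.16); this identification is an assumption of the illustration.

\begin{align*} \psi_t&=C_0e^{t}, \qquad \psi_\eta=2C_2\eta e^{C_2\eta^2}, \qquad \psi_{\eta\eta}=\big(2C_2+4C_2^2\eta^2\big)e^{C_2\eta^2},\\[4pt] -\mathcal{L}\psi&=\lambda(\eta-\bar{\mu})\psi_\eta-\frac{1}{2}\sigma_\mu^2\psi_{\eta\eta} =C_2\Big[2\lambda\eta^2-2\lambda\bar{\mu}\eta-\sigma_\mu^2-2C_2\sigma_\mu^2\eta^2\Big]e^{C_2\eta^2}\\[4pt] &=C_2\Big[2\big(\lambda-C_2\sigma_\mu^2\big)\eta^2-2\lambda\bar{\mu}\eta-\sigma_\mu^2\Big]e^{C_2\eta^2}. \end{align*}

Since $\lambda>C_2\sigma_\mu^2$ , the bracketed quadratic is minimized at $\eta=\frac{\lambda\bar{\mu}}{2(\lambda-C_2\sigma_\mu^2)}$ , where it equals $-\sigma_\mu^2-\frac{\lambda^2\bar{\mu}^2}{2(\lambda-C_2\sigma_\mu^2)}$ ; this is the fraction that appears in the lower bound.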

Let us define $v^\Lambda \,:\!=\, (1-\Lambda)v + \Lambda\psi$ on $[0,T]\times\mathbb{R}$ for any $\Lambda\in(0,1)$ . It follows that

(4.11)

\begin{align} \begin{split} v^\Lambda - \widehat{V} &= (1-\Lambda)v + \Lambda\psi - \widehat{V}= (1-\Lambda)v + \Lambda \Big(C_0e^{t}+e^{C_2\eta^2}\Big) -\widehat{V}\\ & \geq (1-\Lambda)v + \Lambda \Big(C_0e^{t}+e^{C_2\eta^2}\Big) + \Lambda\widehat{V} -\widehat{V}\\ &> (1-\Lambda)\big(v - \widehat{V}\big) + \Lambda C_0 > \Lambda, \end{split} \end{align}

where we used $v - \widehat{V} \geq 0$ in the last inequality. From (4.10) and (4.11), we can deduce that for $\Lambda\in(0,1)$ , $v^\Lambda$ is a supersolution to

(4.12)

\begin{align} \mbox{min}\left\{ v^\Lambda-\widehat{V}, \ \ \frac{\partial v^\Lambda}{\partial t}-\mathcal{L}v^\Lambda\right\} \geq \Lambda. \end{align}
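The second component of (4.12) follows from the linearity of $\frac{\partial}{\partial t}-\mathcal{L}$ . As a formal sketch for smooth v (in the viscosity setting the same computation is applied to test functions), the supersolution property of v and (4.10) give

\begin{align*} \frac{\partial v^\Lambda}{\partial t}-\mathcal{L}v^\Lambda =(1-\Lambda)\Big(\frac{\partial v}{\partial t}-\mathcal{L}v\Big)+\Lambda\Big(\frac{\partial \psi}{\partial t}-\mathcal{L}\psi\Big) \geq 0+\Lambda\cdot 1=\Lambda, \end{align*}

while the first component, $v^\Lambda-\widehat{V}>\Lambda$ , is exactly (4.11).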

To prove the comparison principle, it suffices to prove the claim that $\sup(u-v^\Lambda)\leq 0$ for all $\Lambda\in(0,1)$ , as the required result is then obtained by letting $\Lambda$ go to 0. To this end, we will suppose that there exists some $\Lambda\in(0,1)$ such that $M\,:\!=\,\sup(u-v^\Lambda)>0$ , and derive a contradiction.

It is clear that u, v, and $\widehat{V}$ have the same growth conditions: in view of the explicit forms of A, B, C, and $\widehat{V}$ , it follows that $\widehat{V}$ has growth condition in t as $e^{e^{K_1t}}$ for some $K_1<0$ and has growth condition in $\eta$ as $e^{K_2\eta^2}$ for some $K_2<0$ ; on the other hand, $\psi$ has growth condition in t as $e^{t}$ and has growth condition in $\eta$ as $e^{C_2\eta^2}$ . Thus, we have that $u(t,\eta)-v^\Lambda(t,\eta) = ( u - (1-\Lambda)v - \Lambda\psi)(t,\eta)$ goes to $-\infty$ as $t\rightarrow T,\eta\rightarrow\infty$ . Consequently, the u.s.c. function $(u-v^\Lambda)$ attains its maximum M.

Let us consider the u.s.c. function $\Phi_\varepsilon(t,t^{\prime},\eta,\eta^{\prime})=u(t,\eta)-v^\Lambda(t^{\prime},\eta^{\prime})-\phi_\varepsilon(t,t^{\prime},\eta,\eta^{\prime})$ , where $\phi_\varepsilon(t,t^{\prime},\eta,\eta^{\prime}) = \frac{1}{2\varepsilon}((t-t^{\prime})^2+(\eta-\eta^{\prime})^2)$ , $\varepsilon >0$ , and $(t_\varepsilon,t^{\prime}_\varepsilon,\eta_\varepsilon,\eta^{\prime}_\varepsilon)$ attains the maximum of $\Phi_\varepsilon$ . We have

(4.13)

\begin{align} M_\varepsilon = \max\Phi_\varepsilon = \Phi_\varepsilon\big(t_\varepsilon,t^{\prime}_\varepsilon,\eta_\varepsilon,\eta^{\prime}_\varepsilon\big)\rightarrow M \ \text{and} \ \phi_\varepsilon\big(t_\varepsilon,t^{\prime}_\varepsilon,\eta_\varepsilon,\eta^{\prime}_\varepsilon\big)\rightarrow 0 \ \text{when} \ \varepsilon\rightarrow 0. \end{align}

Let us recall the equivalent definition of viscosity solutions in terms of superjets and subjets. In particular, we define $\bar{\mathcal{P}}^{2,+}u(\bar{t},\bar{\eta})$ as the set of elements $(\bar{q},\bar{k},\bar{M})\in\mathbb{R}\times\mathbb{R}\times\mathbb{R}$ satisfying $u(t,\eta)\leq u(\bar{t},\bar{\eta}) + \bar{q}(t-\bar{t})+ \bar{k}(\eta-\bar{\eta})+ \frac{1}{2}\bar{M}(\eta-\bar{\eta})^2 + o(|t-\bar{t}|+(\eta-\bar{\eta})^2)$ . We define $\bar{\mathcal{P}}^{2,-}v^\Lambda(\bar{t},\bar{\eta})$ similarly. Thanks to the Crandall–Ishii lemma, we can find $A_\varepsilon, B_\varepsilon\in\mathbb{R}$ such that

\begin{align*} \begin{split} \bigg(\frac{t_\varepsilon-t^{\prime}_\varepsilon}{\varepsilon},\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon},A_\varepsilon\bigg) &\in \bar{\mathcal{P}}^{2,+}u(t_\varepsilon,\eta_\varepsilon), \\ \bigg(\frac{t_\varepsilon-t^{\prime}_\varepsilon}{\varepsilon},\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon},B_\varepsilon\bigg) &\in \bar{\mathcal{P}}^{2,-}v^\Lambda(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon), \\ \sigma^2(\eta_\varepsilon)A_{\varepsilon} - \sigma^2(\eta^{\prime}_\varepsilon)B_{\varepsilon} &\leq \frac{3}{\varepsilon}\big(\sigma(\eta_\varepsilon)-\sigma(\eta^{\prime}_\varepsilon)\big)^2. \end{split} \end{align*}

By combining the viscosity subsolution property (4.5) of u and the viscosity strict supersolution property (4.12) of $v^\Lambda$ , we have that

(4.14)

\begin{align} \begin{split} \mbox{min}\bigg\{ u(t_\varepsilon,\eta_\varepsilon)-\widehat{V}\big(t_\varepsilon,x_0-f(t_\varepsilon),z_0,\eta_\varepsilon\big),\frac{t_\varepsilon-t^{\prime}_\varepsilon}{\varepsilon} -\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon}b(t_\varepsilon,\eta_\varepsilon)-\frac{1}{2}\sigma^2(\eta_\varepsilon)A_{\varepsilon}\bigg\} \leq 0,\\ \end{split} \end{align}

(4.15)

\begin{align} \begin{split} \mbox{min}\bigg\{ v^\Lambda\big(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon\big)-\widehat{V}\big(t^{\prime}_\varepsilon,x_0-f\big(t^{\prime}_\varepsilon\big),z_0,\eta^{\prime}_\varepsilon\big),\frac{t_\varepsilon-t^{\prime}_\varepsilon}{\varepsilon} -\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon}b\big(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon\big)-\frac{1}{2}\sigma^2\big(\eta^{\prime}_\varepsilon\big)B_{\varepsilon}\bigg\} \geq \Lambda, \end{split} \end{align}

where $b(t_\varepsilon,\eta_\varepsilon)=-\lambda(\eta_\varepsilon-\bar{\mu})$ , $\sigma^2(\eta_\varepsilon)=\sigma_\mu^2$ , $b(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon)=-\lambda(\eta^{\prime}_\varepsilon-\bar{\mu})$ , and $\sigma^2(\eta^{\prime}_\varepsilon)=\sigma_\mu^2$ .

If $u-\widehat{V}\leq 0$ in (4.14), then because $v^\Lambda-\widehat{V}\geq \Lambda$ in (4.15), letting $\varepsilon\rightarrow 0$ and using (4.13) together with the continuity of $\widehat{V}$ gives $u-v^\Lambda\leq- \Lambda <0$ at the limiting maximum point, which contradicts $\sup(u-v^\Lambda)=M>0$ . On the other hand, if $u-\widehat{V}>0$ in (4.14), then we have

\begin{align*} \left\{ \begin{array}{lr} \frac{t_\varepsilon-t^{\prime}_\varepsilon}{\varepsilon}-\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon}b(t_\varepsilon,\eta_\varepsilon)-\frac{1}{2}\sigma^2(\eta_\varepsilon)A_{\varepsilon}\leq 0,\\[5pt] \frac{t_\varepsilon-t^{\prime}_\varepsilon}{\varepsilon}-\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon}b(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon)-\frac{1}{2}\sigma^2(\eta^{\prime}_\varepsilon)B_{\varepsilon} \geq \Lambda. \end{array} \right. \end{align*}

Furthermore, combining the two inequalities above, we derive that

\begin{align*}\begin{split} &\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon}\big(b(t_\varepsilon,\eta_\varepsilon)-b\big(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon\big)\big) +\frac{3}{2\varepsilon}\big(\sigma(\eta_\varepsilon)-\sigma\big(\eta^{\prime}_\varepsilon\big)\big)^2\\[5pt] \geq &\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon}\big(b(t_\varepsilon,\eta_\varepsilon)-b\big(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon\big)\big) +\frac{1}{2}\big(\sigma^2(\eta_\varepsilon)A_{\varepsilon}-\sigma^2(\eta^{\prime}_\varepsilon)B_{\varepsilon}\big) \geq \Lambda. \end{split}\end{align*}

The first inequality holds by the Crandall–Ishii lemma. In addition, by letting $\varepsilon\rightarrow 0$ , we get

\begin{equation*}\lim_{\varepsilon\rightarrow 0}\bigg[\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon}\big(b(t_\varepsilon,\eta_\varepsilon)-b\big(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon\big)\big) +\frac{3}{2\varepsilon}\big(\sigma(\eta_\varepsilon)-\sigma\big(\eta^{\prime}_\varepsilon\big)\big)^2\bigg]=0\end{equation*}

thanks to (4.13). It follows that we have $0\geq \Lambda>0$ , which leads to a contradiction; therefore our claim holds.
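To see why the limit vanishes, it may help to insert the explicit coefficients. The following short computation is a sketch under the assumption (standard for the doubling-of-variables penalization, and our reading of (4.13)) that $|\eta_\varepsilon-\eta^{\prime}_\varepsilon|^2/\varepsilon\rightarrow 0$ ; it uses only $b(t,\eta)=-\lambda(\eta-\bar{\mu})$ and the constant diffusion coefficient $\sigma_\mu$ :

\begin{equation*}\frac{\eta_\varepsilon-\eta^{\prime}_\varepsilon}{\varepsilon}\big(b(t_\varepsilon,\eta_\varepsilon)-b\big(t^{\prime}_\varepsilon,\eta^{\prime}_\varepsilon\big)\big)=-\lambda\frac{\big(\eta_\varepsilon-\eta^{\prime}_\varepsilon\big)^2}{\varepsilon}\rightarrow 0, \qquad \sigma(\eta_\varepsilon)-\sigma\big(\eta^{\prime}_\varepsilon\big)=\sigma_\mu-\sigma_\mu=0.\end{equation*}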

Lemma 4.5. For all $(t,\eta)\in\mathcal{C}$ in the continuation region, $\widetilde{V}$ in (2.7) has Hölder continuous derivatives.

Proof. The proof closely follows the argument in Section 6.3 of [Reference Friedman16]. First, let us recall that

(4.16)

\begin{equation} \frac{\partial \widetilde{V}}{\partial t}(t,\eta) + \lambda(\eta-\bar{\mu})\frac{ \partial \widetilde{V} }{ \partial \eta }(t,\eta) -\frac{1}{2} \sigma_\mu^2 \frac{ \partial^2 \widetilde{V} }{ \partial \eta^2 }(t,\eta)=0 \ \mbox{on} \ \mathcal{C}. \end{equation}

The fact that $\widetilde{V}$ is a viscosity solution to (4.8) gives that $\widetilde{V}$ is a supersolution to (4.16). On the other hand, for any $(\bar{t},\bar{\eta})\in\mathcal{C}$ , let $\varphi$ be a $C^2$ test function such that $(\bar{t},\bar{\eta})$ is a maximum of $\widetilde{V}-\varphi$ with $\widetilde{V}(\bar{t},\bar{\eta})=\varphi(\bar{t},\bar{\eta})$ . By definition of $\mathcal{C}$ , we have $\widetilde{V}(\bar{t},\bar{\eta}) > \widehat{V}(\bar{t},x_0-f(\bar{t}),z_0,\bar{\eta})$ , so that

\begin{equation*} \frac{\partial \varphi}{\partial t}(\bar{t},\bar{\eta}) + \lambda(\bar{\eta}-\bar{\mu})\frac{ \partial \varphi }{ \partial \eta }(\bar{t},\bar{\eta}) -\frac{1}{2} \sigma_\mu^2 \frac{ \partial^2 \varphi }{ \partial \eta^2 }(\bar{t},\bar{\eta})\leq0, \end{equation*}

owing to the property that $\widetilde{V}$ is a viscosity subsolution to (4.8). It follows that $\widetilde{V}$ is a viscosity subsolution and therefore a viscosity solution to (4.16).

Let us consider an initial boundary value problem:

(4.17)

\begin{equation} \begin{split} -\frac{\partial w}{\partial t}(t,\eta) - \lambda(\eta-\bar{\mu})\frac{ \partial w }{ \partial \eta }(t,\eta) &+\frac{1}{2} \sigma_\mu^2 \frac{ \partial^2 w }{ \partial \eta^2 }(t,\eta)=0 \ \mbox{on} \ Q\cup B_T, \\ w(0,\eta)&=0 \ \mbox{on} \ B,\\ w(t,\eta)&=\widehat{V}(t,x_0-\kappa t,z_0,\eta) \ \mbox{on} \ S. \end{split} \end{equation}

Here, Q is an arbitrary bounded open region in $\mathcal{C}$ , Q lies in the strip $0<t<T$ , $\tilde{B}=\bar{Q}\cap\{t=0\}$ , $\tilde{B}_T=\bar{Q}\cap\{t=T\}$ , $B_T$ denotes the interior of $\tilde{B}_T$ , B denotes the interior of $\tilde{B}$ , $S_0$ denotes the boundary of Q lying in the strip $0\leq t\leq T$ , and $S=S_0\backslash B_T$ . Theorem 3.6 in [Reference Friedman16] gives the existence and uniqueness of a solution w on $Q\cup B_T$ to (4.17), and the solution w has Hölder continuous derivatives $w_t$ , $w_\eta$ , and $w_{\eta\eta}$ . Because the solution w is a viscosity solution to (4.16) on $Q\cup B_T$ , from standard uniqueness results on viscosity solutions, we know that $\widetilde{V}=w$ on $Q\cup B_T$ . As $Q\subset\mathcal{C}$ is arbitrary, it follows that $\widetilde{V}$ has the same property in the continuation region $\mathcal{C}$ . Therefore, $\widetilde{V}$ has Hölder continuous derivatives $\widetilde{V}_t$ , $\widetilde{V}_\eta$ , and $\widetilde{V}_{\eta\eta}$ .

Finally, we can prove Theorem 2.1.

Proof. We have proved the inequality $v^-=\sup_{p\in\mathcal{V}^-}p\leq \widetilde{V} \leq v^+=\inf_{q\in\mathcal{V}^+}q$ in Lemma 4.4. Using the comparison result in Proposition 4.1, we also have $v^+\leq v^-$ . Putting all the pieces together, we conclude that $v^+=\widetilde{V}(t,\eta)=v^-$ , and therefore the value function $\widetilde{V}(t,\eta)$ is the unique viscosity solution of the HJB variational inequality (2.9). Following an argument similar to that given for Theorem 1 in [Reference Duckworth and Zervos13], we fix the $\mathcal{F}_t$ -adapted stopping time $\tau^*$ defined in (2.11); the Itô–Tanaka formula (see Theorem IV.1.5 and Corollary IV.1.6 of [Reference Revuz and Yor32]) can be applied to $\widetilde{V}(t,\mu_t)$ in view of the Hölder continuous derivatives of $\widetilde{V}(t,\eta)$ , and we get that

\begin{align*}&\widehat{V}\big(\tau^*\wedge\tau_n, x_0-\kappa \tau^*\wedge\tau_n, z_0, \mu_{\tau^*\wedge\tau_n}\big)\\=&\widetilde{V}(t,\mu_t)+\Big[ \widehat{V}\big(\tau^*\wedge\tau_n, x_0-\kappa \tau^*\wedge\tau_n, z_0, \mu_{\tau^*\wedge\tau_n}\big)-\widetilde{V}\big(\tau^*\wedge\tau_n, \mu_{\tau^*\wedge\tau_n}\big)\Big]\\+&\int_t^{\tau^*\wedge\tau_n}\sigma_{\mu}\frac{\partial \widetilde{V}}{\partial \eta}(s,\mu_s)dB_s+\int_t^{\tau^*\wedge\tau_n}\Big[\frac{\partial \widetilde{V}(s,\mu_s)}{\partial t}+\mathcal{L}\widetilde{V}(s,\mu_s)\Big]ds,\end{align*}

where $\tau_n\uparrow T$ is the localizing sequence. As $\widetilde{V}(t,\eta)$ satisfies the HJB variational inequality (2.9), by taking conditional expectations and using the definition of $\tau^*$ in (2.11), we obtain that

\begin{align*}\mathbb{E}_t\left[\widehat{V}(\tau^*\wedge\tau_n, x_0-\kappa \tau^*\wedge\tau_n, z_0, \mu_{\tau^*\wedge\tau_n})\mathbf{1}_{\{\tau^*\leq \tau_n\}}\right]+\mathbb{E}_t\left[\widetilde{V}(\tau_n, \mu_{\tau_n})\mathbf{1}_{\{\tau^*>\tau_n\}} \right] =\widetilde{V}(t,\mu_t).\end{align*}

By taking the limit of $\tau_n$ and using the dominated convergence theorem, we verify that

\begin{align*}\mathbb{E}_t\left[\widehat{V}(\tau^*, x_0-\kappa \tau^*, z_0, \mu_{\tau^*})\right]=\widetilde{V}(t,\mu_t),\end{align*}

and therefore $\tau^*$ is the optimal entry time.
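For intuition on how the optimal entry rule can be approximated numerically, the following is a minimal sketch of backward induction for the stopping problem, assuming a two-point Euler discretization of the Ornstein–Uhlenbeck drift process with coefficients $-\lambda(\eta-\bar{\mu})$ and $\sigma_\mu$ . The function `toy_payoff` is a hypothetical stand-in for $\widehat{V}(t,x_0-\kappa t,z_0,\eta)$ and is not the expression (3.7); the grid bounds and step counts are likewise illustrative choices, not part of the paper.

```python
import math

def toy_payoff(t, eta, x0=1.0, kappa=0.2, p=-1.0):
    """Hypothetical stand-in for the stopping payoff (NOT formula (3.7))."""
    return (x0 - kappa * t) ** p / p * math.exp(-0.5 * eta)

def snell_envelope(payoff, lam, mu_bar, sigma_mu, T,
                   n_steps=200, n_grid=81, eta_min=-1.0, eta_max=1.0):
    """Approximate sup over stopping times of E[payoff(tau, mu_tau)] on a grid."""
    dt = T / n_steps
    d_eta = (eta_max - eta_min) / (n_grid - 1)
    grid = [eta_min + i * d_eta for i in range(n_grid)]

    def interp(values, eta):
        # Linear interpolation on the grid, flat extrapolation outside it.
        pos = (min(max(eta, eta_min), eta_max) - eta_min) / d_eta
        i = min(int(pos), n_grid - 2)
        w = pos - i
        return (1.0 - w) * values[i] + w * values[i + 1]

    value = [payoff(T, eta) for eta in grid]   # at T the investor must enter
    stop = [[True] * n_grid]
    shock = sigma_mu * math.sqrt(dt)
    for n in range(n_steps - 1, -1, -1):
        t = n * dt
        new_value, stop_now = [], []
        for eta in grid:
            drift = -lam * (eta - mu_bar) * dt
            # Two-point approximation of the conditional expectation.
            cont = 0.5 * (interp(value, eta + drift + shock)
                          + interp(value, eta + drift - shock))
            g = payoff(t, eta)
            new_value.append(max(g, cont))     # discrete variational inequality
            stop_now.append(g >= cont)         # True on the (approximate) stopping region
        value = new_value
        stop.append(stop_now)
    stop.reverse()                             # stop[n][i] refers to time n*dt, state grid[i]
    return grid, value, stop
```

The array `stop` gives a discrete approximation of the complement of the continuation region $\mathcal{C}$ ; the first time the simulated state enters it plays the role of $\tau^*$ in this toy setting.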

Finally, the martingale property between $t=0$ and $\tau^*$ follows from the definition of stochastic subsolutions and stochastic supersolutions.

Moreover, we can also easily verify the following sensitivity results for the composite value function.

Lemma 4.6. The value function $\widetilde{V}(t,\eta)$ has the following sensitivity properties:

(i) Suppose that $\alpha>0$ and $\delta>0$ are both constants in the definition of a habit formation process such that $\delta>\alpha$ . We have that $\widetilde{V}(t,\eta;\, \alpha, \delta)$ is decreasing in $\delta$ and increasing in $\alpha$ .
(ii) If the initial habit $z_0$ increases, the value function $\widetilde{V}(t,\eta)$ decreases.
(iii) If the information cost rate $\kappa$ increases, the value function $\widetilde{V}(t,\eta)$ decreases for any $t<T$ .

Proof. By the definition of $\widetilde{V}(t,\eta)$ and the explicit form of $\widehat{V}(t, x_0-\kappa t, z_0, \eta)$ in (3.7) and $m(t)$ in (2.8), for given $\delta>\alpha$ , it is clear that $\widehat{V}(t, x_0-\kappa t, z_0, \eta)$ is decreasing in $\delta$ and increasing in $\alpha$ , which implies that $\widetilde{V}(t,\eta)$ has the same sensitivity property. Similarly, it is clear that $\widehat{V}(t, x_0-\kappa t, z_0, \eta)$ decreases as $z_0$ increases, and hence $\widetilde{V}(t,\eta)$ is decreasing in $z_0$ . Finally, $\widehat{V}(t, x_0-\kappa t, z_0, \eta)$ decreases if $x_0-\kappa t$ decreases; it readily follows that $\widetilde{V}(t,\eta)$ is decreasing in $\kappa$ .

Appendix A. Explicit solution to the auxiliary ODEs

Our ODE problems (3.4), (3.5), (3.6) are similar to ODEs for the terminal wealth optimization problem in [Reference Brendle8], in which the insightful observation is made that we can solve these ODEs with coefficients depending on time t by solving five auxiliary ODEs with constant coefficients; see Section 4 of [Reference Brendle8] for detailed discussions.

Lemma A.1. For $k\leq t\leq s\leq T$ , let us consider the following auxiliary ODEs for $a(t, s)$ , $b(t, s)$ , $l(t, s)$ , $w(t, s)$ , and $g(t, s)$ :

(A.1)

\begin{align} a_t=&-\frac{2\big(1-p+p\rho^2\big)}{1-p}\sigma^2_\mu a^2 +\left(2\lambda-\frac{2p\rho\sigma_\mu}{(1-p)\sigma_S}\right)a -\frac{p}{2(1-p)\sigma^2_S}, \end{align}

(A.2)

\begin{align} b_t=&-\frac{2\big(1-p+p\rho^2\big)}{1-p}\sigma^2_\mu ab -2\lambda\bar{\mu}a+\left(\lambda-\frac{p\rho\sigma_\mu}{(1-p)\sigma_S}\right)b,\qquad \ \ \ \ \end{align}

(A.3)

\begin{align} l_t=&-\sigma^2_\mu a-\frac{\big(1-p+p\rho^2\big)\sigma^2_\mu}{2(1-p)}b^2-\lambda\bar{\mu}b,\qquad\qquad\qquad\qquad\quad\ \ \ \ \end{align}

(A.4)

\begin{align} w_t=&-2\big(1-\rho^2\big)\sigma^2_\mu w^2+2\frac{\lambda\sigma_S+\rho\sigma_\mu}{\sigma_S}w+\frac{1}{2\sigma^2_S}, \end{align}

(A.5)

\begin{align} \ g_t\,=\,&\sigma_\mu^2\big(1-\rho^2\big)(w-a),\qquad\qquad\qquad\qquad\qquad \end{align}

with the terminal conditions $a(s,s)=b(s,s)=l(s,s)=w(s,s)=g(s,s)=0$ . Direct substitutions and computations show that the solutions of the ODEs (3.4), (3.5), (3.6) are given respectively by

(A.6)

\begin{align} A(t,s)&\,:\!=\,\frac{a(t,s)}{(1-p)\big(1-2a(t,s)\hat{\Sigma}(t)\big)},\ \ \ \ B(t,s)\,:\!=\,\frac{b(t,s)}{(1-p)\big(1-2a(t,s)\hat{\Sigma}(t)\big)},\notag\\ C(t,s)&\,:\!=\,\frac{1}{1-p}\Big[l(t,s)+\frac{\hat{\Sigma}(t)}{\big(1-2a(t,s)\hat{\Sigma}(t)\big)}b^2(t,s) -\frac{1-p}{2}\log\big(1-2a(t,s)\hat{\Sigma}(t)\big)\\ & -\frac{p}{2}\log\big(1-2w(t,s)\hat{\Sigma}(t)\big)-pg(t,s)\Big]. \end{align}

Following the same arguments as in [Reference Kim and Omberg21], we can actually solve the auxiliary ODEs (A.1), (A.2), (A.3), (A.4) and (A.5) explicitly, in the following order: we first solve the simple ODEs (A.1) and (A.4) to get a(t, s) and w(t, s), and then obtain b(t, s) and g(t, s) by solving the ODEs (A.2) and (A.5). Finally, we solve the ODE (A.3) to get l(t, s). We thus obtain

\begin{align*} a(t,s)\,=\, &\frac{p\big(1-e^{2\xi(t-s)}\big)} {2(1-p)\sigma_S^2\Big[2\xi-(\xi+\gamma_2)\big(1-e^{2\xi(t-s)}\big)\Big]},\\[4pt] b(t,s)\,=\,&\frac{p\lambda\bar{\mu}\big(1-e^{\xi(t-s)}\big)^2} {(1-p)\sigma_S^2\xi\Big[2\xi-(\xi+\gamma_2)\big(1-e^{2\xi(t-s)}\big)\Big]},\\[4pt] l(t,s)\,=\,&\frac{p}{2(1-p)\sigma_S^2}\left(\frac{\lambda^2\bar{\mu}^2}{\xi^2}-\frac{\sigma_\mu^2\gamma_2}{\gamma_2^2-\xi^2}\right)(s-t)\\[4pt] &+\frac{p\lambda^2\bar{\mu}^2\Big[\big(\xi+2\gamma_2\big)e^{2\xi(t-s)}-4\gamma_2e^{\xi(t-s)}+2\gamma_2-\xi\Big]} {2(1-p)\sigma_S^2\xi^3\left[2\xi-(\xi+\gamma_2)\big(1-e^{2\xi(t-s)}\big)\right]}\\[4pt] &+\frac{p\sigma_\mu^2}{2(1-p)\sigma_S^2\big(\xi^2-\gamma_2^2\big)}\log\left|\frac{2\xi-(\xi+\gamma_2)\big(1-e^{2\xi(t-s)}\big)}{{2\xi}e^{\xi(t-s)}}\right|, \\[4pt] w(t,s)\,=\,&-\frac{1}{2\sigma_S}\frac{1-e^{2\xi_1(t-s)}} {\big(\sigma_S\xi_1+\lambda\sigma_S+\rho\sigma_\mu\big)+\big(\sigma_S\xi_1-\lambda\sigma_S-\rho\sigma_\mu\big)e^{2\xi_1(t-s)}},\\[4pt] g(t,s)\,=\,&\frac{1}{2}\log\left(\frac{\big(\sigma_S\xi_1+\lambda\sigma_S+\rho\sigma_\mu\big)+\big(\sigma_S\xi_1-\lambda\sigma_S-\rho\sigma_\mu\big)e^{2\xi_1(t-s)}}{2\sigma_S\xi_1e^{\xi_1(t-s)}}\right)\\[4pt] &-\frac{(1-p)(1-\rho^2)}{2(1-p+p\rho^2)}\log\left(\frac{\big(\sigma_S\xi+\lambda\sigma_S-\frac{\rho\sigma_\mu p}{1-p}\big)+\big(\sigma_S\xi-\lambda\sigma_S+\frac{\rho\sigma_\mu p}{1-p}\big)e^{2\xi(t-s)}}{2\sigma_S\xi e^{\xi(t-s)}}\right)\\[4pt] &-\frac{\rho^2\lambda(s-t)}{2(1-p+p\rho^2)}-\frac{\rho\sigma_\mu(s-t)}{2(1-p+p\rho^2)\sigma_S}, \end{align*}

where

(A.7)

\begin{align} \Delta\,:\!=\,\lambda^2-\frac{2\lambda p\rho\sigma_\mu}{(1-p)\sigma_S}-\frac{p\sigma_\mu^2}{(1-p)\sigma_S^2}>0, \end{align}

and

\begin{align*} \xi & \,:\!=\,\sqrt\Delta=\sqrt{\gamma^2_2-\gamma_1\gamma_3},\ \ \xi_1\,:\!=\,\frac{\sqrt{\big(1-\rho^2\big)\sigma_\mu^2+\big(\lambda\sigma_S+\rho\sigma_\mu\big)^2}}{\sigma_S},\\ \gamma_1 & \,:\!=\,\frac{\big(1-p+p\rho^2\big)}{1-p}\sigma_\mu^2, \ \ \gamma_2\,:\!=\,-\lambda+\frac{p\rho\sigma_\mu}{(1-p)\sigma_S}, \ \ \gamma_3\,:\!=\,\frac{p}{(1-p)\sigma_S^2}. \end{align*}

Moreover, it is straightforward to see that a, b, l, w, and g are globally bounded if we have that $\gamma_3>0$ , or $\gamma_1>0$ , or $\gamma_2<0$ .
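As a sanity check on the closed forms, the following sketch compares a central finite difference of $a(t,s)$ , $b(t,s)$ , and $w(t,s)$ in $t$ with the right-hand sides of (A.1), (A.2), and (A.4). The parameter values in the usage note are purely illustrative assumptions, and the formulas are rewritten in terms of $\gamma_1,\gamma_2,\gamma_3$ and $\beta=(\lambda\sigma_S+\rho\sigma_\mu)/\sigma_S$ (our shorthand, not the paper's notation).

```python
import math

def riccati_residuals(p, rho, sigma_mu, sigma_S, lam, mu_bar, s=1.0, n=50, h=1e-5):
    """Max |d/dt(closed form) - ODE right-hand side| for a, b, w on (0, s)."""
    g1 = (1 - p + p * rho**2) / (1 - p) * sigma_mu**2
    g2 = -lam + p * rho * sigma_mu / ((1 - p) * sigma_S)
    g3 = p / ((1 - p) * sigma_S**2)
    xi = math.sqrt(g2**2 - g1 * g3)            # needs Delta > 0, cf. (A.7)
    beta = (lam * sigma_S + rho * sigma_mu) / sigma_S
    xi1 = math.sqrt((1 - rho**2) * sigma_mu**2 / sigma_S**2 + beta**2)

    def denom(t):
        return 2 * xi - (xi + g2) * (1 - math.exp(2 * xi * (t - s)))

    def a(t):
        return g3 * (1 - math.exp(2 * xi * (t - s))) / (2 * denom(t))

    def b(t):
        return g3 * lam * mu_bar * (1 - math.exp(xi * (t - s)))**2 / (xi * denom(t))

    def w(t):
        e = math.exp(2 * xi1 * (t - s))
        return -(1 - e) / (2 * sigma_S**2 * ((xi1 + beta) + (xi1 - beta) * e))

    def ddt(f, t):
        return (f(t + h) - f(t - h)) / (2 * h)  # central difference in t

    res = {"a": 0.0, "b": 0.0, "w": 0.0}
    for i in range(1, n):
        t = s * i / n
        rhs_a = -2 * g1 * a(t)**2 - 2 * g2 * a(t) - g3 / 2            # (A.1)
        rhs_b = -2 * g1 * a(t) * b(t) - 2 * lam * mu_bar * a(t) - g2 * b(t)  # (A.2)
        rhs_w = (-2 * (1 - rho**2) * sigma_mu**2 * w(t)**2
                 + 2 * beta * w(t) + 1 / (2 * sigma_S**2))             # (A.4)
        res["a"] = max(res["a"], abs(ddt(a, t) - rhs_a))
        res["b"] = max(res["b"], abs(ddt(b, t) - rhs_b))
        res["w"] = max(res["w"], abs(ddt(w, t) - rhs_w))
    res["terminal"] = (a(s), b(s), w(s))        # terminal conditions at t = s
    return res
```

For instance, `riccati_residuals(p=-1.0, rho=0.3, sigma_mu=0.2, sigma_S=0.25, lam=0.5, mu_bar=0.08)` should return residuals at the level of the finite-difference error, together with vanishing terminal values.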

Remark A.1. Under the assumption that $p<0$ , (A.7) clearly holds and we have $\gamma_2<0$ . We can see that $a(t,s)\leq 0$ and $b(t,s)\leq 0$ are bounded and that $1-2a(t,s)\hat{\Sigma}(t)>1$ and $1-2w(t,s)\hat{\Sigma}(t)>1$ . From the expressions in (A.6), we can conclude that $A(t, s)$ , $B(t, s)$ , and $C(t, s)$ are all bounded on $k\leq t\leq s\leq T$ , and that

\begin{equation*}A(t,s)=\frac{a(t,s)}{(1-p)\big(1-2a(t,s)\hat{\Sigma}(t)\big)}\leq 0\end{equation*}

and

\begin{equation*}B(t,s)=\frac{b(t,s)}{(1-p)\big(1-2a(t,s)\hat{\Sigma}(t)\big)}\leq 0\end{equation*}

for $k\leq t\leq s\leq T$ .
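The sign of $a(t,s)$ can be read off directly from the closed form. As a brief sketch, assuming $p<0$ , $|\rho|\leq 1$ , and $\sigma_\mu>0$ : we have $\gamma_1=\big(1-p(1-\rho^2)\big)\sigma_\mu^2/(1-p)>0$ and $\gamma_3<0$ , so that $\xi^2=\gamma_2^2-\gamma_1\gamma_3>\gamma_2^2$ and hence $\xi>|\gamma_2|$ . For $t\leq s$ we have $1-e^{2\xi(t-s)}\in[0,1)$ , and therefore

\begin{equation*}2\xi-(\xi+\gamma_2)\big(1-e^{2\xi(t-s)}\big)>2\xi-(\xi+\gamma_2)=\xi-\gamma_2>0, \qquad a(t,s)=\frac{\gamma_3\big(1-e^{2\xi(t-s)}\big)}{2\Big[2\xi-(\xi+\gamma_2)\big(1-e^{2\xi(t-s)}\big)\Big]}\leq 0.\end{equation*}

The same positive denominator appears in $b(t,s)$ , so the sign of $b(t,s)$ is that of $p\lambda\bar{\mu}$ ; the claim $b(t,s)\leq 0$ thus corresponds to $\lambda\bar{\mu}\geq 0$ .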

Appendix B. Proof of the verification theorem

We first show that the consumption constraint $c_t\geq Z_t$ implies the constraint on the controlled wealth process in the next lemma.

Lemma B.1. The admissible space $\mathcal{A}$ is non-empty if and only if the initial budget constraint $x\geq m(k)z$ is fulfilled. Moreover, for each pair $(\pi,c)\in\mathcal{A}$ , the controlled wealth process $\hat{X}^{\pi,c}_t$ satisfies the constraint

(B.1)

\begin{align} \hat{X}^{\pi,c}_t\geq m(t)Z_t, \ \ k\leq t\leq T,\end{align}

where the deterministic function $m(t)$ is defined in (2.8) and refers to the cost of subsistence consumption per unit of standard of living at time t.

Proof. Let us first assume that $x\geq m(k)z$ ; we can always take $\pi_t\equiv 0$ , and

\begin{equation*}c_t=ze^{\int^t_k(\delta(v)-\alpha(v))dv}\end{equation*}

for $t\in[k,T]$ . It is easy to verify that $\hat{X}^{\pi,c}_t\geq 0$ and $c_t\equiv Z_t$ , so that $(\pi,c)\in\mathcal{A}$ , and hence $\mathcal{A}$ is non-empty.

On the other hand, starting from $t=k$ with wealth x and standard of living z, the addictive habits constraint $c_t\geq Z_t$ , $k\leq t\leq T$ , implies that the consumption must always exceed the subsistence consumption $\bar{c}_t=Z(t;\,\bar{c}_t)$ which satisfies

(B.2)

\begin{align} d\bar{c}_t=(\delta(t)-\alpha(t))\bar{c}_tdt, \ \ \bar{c}_k=z, \ \ k\leq t\leq T. \end{align}

Indeed, since $Z_t$ satisfies $dZ_t=(\delta_tc_t-\alpha_tZ_t)dt$ with $Z_k=z\geq 0$ , the constraint $c_t\geq Z_t$ implies that

(B.3)

\begin{align} dZ_t\geq\big(\delta_tZ_t-\alpha_tZ_t\big)dt, \ \ Z_k=z. \end{align}

By (B.2) and (B.3), one can get $d(Z_t-\bar{c}_t)\geq(\delta_t-\alpha_t)(Z_t-\bar{c}_t)dt$ and $Z_k-\bar{c}_k=0$ , from which we can derive that

\begin{equation*}e^{-\int^t_k(\delta_s-\alpha_s)ds}\big(Z_t-\bar{c}_t\big)\geq 0, \ \ k\leq t\leq T.\end{equation*}

It follows that $c_t\geq\bar{c}_t$ , which is equivalent to

(B.4)

\begin{align} c_t\geq ze^{\int^t_k(\delta(v)-\alpha(v))dv}, \ \ k\leq t\leq T. \end{align}
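The comparison argument leading to (B.4) can be illustrated numerically. The sketch below assumes constant rates $\delta$ and $\alpha$ (the paper allows deterministic functions of time) and an arbitrary nonnegative excess consumption `excess(t)`, which is a hypothetical input chosen only for illustration; both the habit $Z_t$ and the subsistence path $\bar{c}_t$ are advanced with the same Euler scheme so that they can be compared step by step.

```python
import math

def simulate_habit(z, delta, alpha, excess, k=0.0, T=1.0, n_steps=1000):
    """Euler paths of the habit Z_t and of the subsistence consumption cbar_t."""
    dt = (T - k) / n_steps
    Z, cbar = [z], [z]
    for n in range(n_steps):
        t = k + n * dt
        c = Z[-1] + excess(t)                  # admissible plan: c_t >= Z_t when excess >= 0
        Z.append(Z[-1] + (delta * c - alpha * Z[-1]) * dt)        # dZ = (delta*c - alpha*Z) dt
        cbar.append(cbar[-1] + (delta - alpha) * cbar[-1] * dt)   # (B.2)
    return Z, cbar
```

With any nonnegative `excess`, the simulated habit stays above the subsistence path, and the latter approximates $ze^{(\delta-\alpha)(t-k)}$ as the step size shrinks.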

Define the exponential local martingale

\begin{equation*}\widetilde{H}_t=\exp\left(-\int^t_k\frac{\hat{\mu}_v}{\sigma_S}d\hat{W}_v-\frac{1}{2}\int^t_k\frac{\hat{\mu}_v^2}{\sigma_S^2}dv\right), \ \ k\leq t\leq T.\end{equation*}

As $\hat{\mu}_t$ follows the dynamics (2.4), we derive that

\begin{align*} \hat{\mu}_t=e^{-t\lambda}\eta+\bar{\mu}\big(1-e^{-t\lambda}\big) +\int^t_ke^{\lambda(u-t)}\frac{\Big(\hat{\Sigma}(u)+\sigma_S\sigma_\mu\rho\Big)}{\sigma_S}d\hat{W}_u. \end{align*}

Similarly to the proof of Corollary 3.5.14 and Corollary 3.5.16 in [Reference Karatzas and Shreve19], the Beneš condition implies that $\widetilde{H}$ is a true martingale with respect to $(\Omega,\mathcal{F}^S,\mathbb{P})$ .

Now, define the probability measure $\widetilde{\mathbb{P}}$ by $ \frac{d\widetilde{\mathbb{P}}}{d\mathbb{P}} = \widetilde{H}_T$ . Girsanov’s theorem states that

\begin{equation*}\widetilde{W}_t\,:\!=\,\hat{W}_t+\int^t_k\frac{\hat{\mu}_v}{\sigma_S}dv, \ \ k\leq t\leq T,\end{equation*}

is a Brownian motion under $(\widetilde{\mathbb{P}},(\mathcal{F}^S_t)_{k\leq t\leq T})$ . We can rewrite the wealth process as

\begin{equation*}\hat{X}_T+\int^T_kc_vdv=x+\int^T_k\pi_v\sigma_Sd\widetilde{W}_v.\end{equation*}

As we have $\hat{X}_T\geq 0$ , it is easy to see that $\int^t_k\pi_v\sigma_Sd\widetilde{W}_v$ is a supermartingale under $(\Omega,\mathbb{F}^S,\widetilde{\mathbb{P}})$ . By taking the expectation under $\widetilde{\mathbb{P}}$ , we have $x\geq\widetilde{\mathbb{E}}\left[\int^T_kc_vdv\right]$ . Thanks to the inequality (B.4), we further have $ x\geq z\widetilde{\mathbb{E}}\left[\int^T_k\exp\left(\int^v_k(\delta(u)-\alpha(u))du\right)dv\right]$ . Because $\delta(t)$ and $\alpha(t)$ are deterministic functions, we obtain that $x\geq m(k)z$ . In general, for any $t\in[k,T]$ , following the same procedure, we can take the conditional expectation under the filtration $\mathcal{F}^S_t$ and get

\begin{equation*}\hat{X}_t\geq Z_t\widetilde{\mathbb{E}}\bigg[\int^T_t\exp\Big(\int^v_t(\delta(u)-\alpha(u))du\Big)dv\bigg{|}\mathcal{F}^S_t\bigg].\end{equation*}

Again, as $\delta(t)$ , $\alpha(t)$ are deterministic, we get $\hat{X}_t\geq m(t)Z_t$ , $k\leq t\leq T$ .
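The budget constraint can be checked numerically. The sketch below assumes, as the conditional-expectation bound in the proof suggests, that $m(t)=\int_t^T\exp\big(\int_t^v(\delta(u)-\alpha(u))du\big)dv$ ; the definition (2.8) itself is not reproduced in this appendix, so this form should be treated as our reading. The function names are hypothetical, and the closed form applies only to constant $\delta$ and $\alpha$ .

```python
import math

def m_const(t, T, delta, alpha):
    """Closed form of m(t) for constant rates delta and alpha."""
    r = delta - alpha
    return (T - t) if r == 0 else (math.exp(r * (T - t)) - 1.0) / r

def m_numeric(t, T, delta, alpha, n=2000):
    """Trapezoid approximation of m(t) for callable rates delta(u), alpha(u)."""
    h = (T - t) / n
    inner = 0.0                        # running value of the inner integral of (delta - alpha)
    prev_rate = delta(t) - alpha(t)
    prev_f = 1.0                       # integrand exp(inner) at v = t
    total = 0.0
    for i in range(1, n + 1):
        v = t + i * h
        rate = delta(v) - alpha(v)
        inner += 0.5 * (prev_rate + rate) * h
        f = math.exp(inner)
        total += 0.5 * (prev_f + f) * h
        prev_rate, prev_f = rate, f
    return total

def subsistence_terminal_wealth(x, z, k, T, delta, alpha):
    """Terminal wealth under pi = 0 and c_t = z*exp((delta-alpha)(t-k)), constant rates."""
    return x - z * m_const(k, T, delta, alpha)
```

Under the subsistence plan used at the start of the proof, terminal wealth equals $x-m(k)z$ , so the plan is feasible exactly when $x\geq m(k)z$ .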

We can finally prove Theorem 3.1 for the interior control problem.

Proof. For any pair of admissible controls $(\pi_t,c_t)\in\mathcal{A}$ , Itô’s lemma gives

(B.5)

\begin{align} d\big[V\big(t,\hat{X}_t,Z_t,\hat{\mu}_t\big)\big]=\big[\mathcal{G}^{\pi_t,c_t}V\big(t,\hat{X}_t,Z_t,\hat{\mu}_t\big)\big]dt +\left[V_{x}\sigma_S\pi_t+V_{\eta}\frac{\big(\hat{\Sigma}(t)+\sigma_S\sigma_\mu\rho\big)}{\sigma_S}\right]d\hat{W}_t,\end{align}

where we define the process $\mathcal{G}^{\pi_t,c_t}V(t,\hat{X}_t,Z_t,\hat{\mu}_t)$ by

\begin{align}\begin{split} & \mathcal{G}^{\pi_t,c_t}V\big(t,\hat{X}_t,Z_t,\hat{\mu}_t\big) = V_t-\alpha(t)Z_tV_{z}-\lambda(\hat{\mu}_t-\bar{\mu})V_{\eta} +\frac{\big(\hat{\Sigma}(t)+\sigma_S\sigma_\mu\rho\big)^2}{2\sigma_S^2}V_{\eta\eta}-c_tV_{x}\\ &+c_t\delta(t)V_{z} +\frac{(c_t-Z_t)^p}{p}+\pi_t\hat{\mu}_tV_{x}+\frac{1}{2}\sigma_S^2\pi_t^2V_{xx} +V_{x\eta}\big(\hat{\Sigma}(t)+\sigma_S\sigma_\mu\rho\big)\pi_t.\nonumber\end{split}\end{align}

For any localizing sequence $\tau_n$ , by integrating (B.5) on $[k,\tau_n\wedge T]$ and taking the expectation, we have

(B.6)

\begin{align} V(k,x,z,\eta)\geq\mathbb{E}\left[\int^{\tau_n\wedge T}_k\frac{(c_s-Z_s)^p}{p}ds\right] +\mathbb{E}\big[V\big(\tau_n\wedge T,\hat{X}_{\tau_n\wedge T},Z_{\tau_n\wedge T},\hat{\mu}_{\tau_n\wedge T}\big)\big]. \end{align}

Similarly to the argument in [Reference Janeček and Sîrbu17], let us consider a fixed pair of controls $(\pi_t,c_t)\in\mathcal{A}=\mathcal{A}_{x}$ , where we denote by $\mathcal{A}_{x}$ the admissible space with initial endowment x. For any $\epsilon>0$ , it is clear that $\mathcal{A}_{x}\subseteq\mathcal{A}_{x+\epsilon}$ and $(\pi_t,c_t)\in\mathcal{A}_{x+\epsilon}$ . Also, it is easy to see that $\hat{X}^{x+\epsilon}_t=\hat{X}^{x}_t+\epsilon=\hat{X}_t+\epsilon$ , $k\leq t\leq T$ . As the process $Z_t$ is defined using this consumption policy $c_t$ , under the probability measure $\mathbb{P}_{x,z,\eta}$ , we obtain

(B.7)

\begin{align} V(k,x+\epsilon,z,\eta)\geq\mathbb{E}\left[\int^{\tau_n\wedge T}_k\frac{(c_s-Z_s)^p}{p}ds\right] +\mathbb{E}\big[V\big(\tau_n\wedge T,\hat{X}_{\tau_n\wedge T}+\epsilon,Z_{\tau_n\wedge T},\hat{\mu}_{\tau_n\wedge T}\big)\big]. \end{align}

The monotone convergence theorem first leads to

\begin{align*} \lim_{n\to+\infty}\mathbb{E}\left[\int^{\tau_n\wedge T}_k\frac{(c_s-Z_s)^p}{p}ds\right] =\mathbb{E}\left[\int^{T}_k\frac{(c_s-Z_s)^p}{p}ds\right]. \end{align*}

For simplicity, let us write $Y_t=\Big(\hat{X}_t-m(t)Z_t\Big)$ . The definition (3.7) implies that

\begin{equation*} V\big(\tau_n\wedge T,\hat{X}_{\tau_n\wedge T}+\epsilon,Z_{\tau_n\wedge T},\hat{\mu}_{\tau_n\wedge T}\big)=\frac{1}{p}(Y_{\tau_n\wedge T}+\epsilon)^pN^{1-p}_{\tau_n\wedge T}.\end{equation*}

Lemma B.1 gives $\hat{X}_t\geq m(t)Z_t$ for $k\leq t\leq T$ under any admissible control $(\pi_t,c_t)$ , so we get that $Y_{\tau_n\wedge T}+\epsilon\geq\epsilon>0$ for all $k\leq t\leq T$ . As $p<0$ , it follows that

(B.8)

\begin{align} \sup_{n}(Y_{\tau_n\wedge T}+\epsilon)^p\leq\epsilon^p<+\infty. \end{align}

Remark A.1 gives that $A(t,s)\leq 0$ and that $A(t,s)$ , $B(t, s)$ , and $C(t, s)$ are all bounded on $k\leq t\leq s\leq T$ . Also, $m(s)$ and $\delta(s)$ are continuous functions and hence bounded on $[k, T]$ . Hence $N(t,\eta)\leq k_1\exp(k_2\eta)$ for all $t\in[k,T]$ and some constants $k_1, k_2>1$ . It follows that there exist some constants $\bar{k}_1,\bar{k}_2>1$ such that

\begin{align*} \sup_{n}N^{1-p}_{\tau_n\wedge T}\leq\sup_{t\in[k,T]}\big(k_1\exp(k_2\hat{\mu}_t)\big)^{1-p} \leq\bar{k}_1\exp\Big(\bar{k}_2\sup_{t\in[k,T]}\hat{\mu}_t\Big). \end{align*}

The process $\hat{\mu}_t$ satisfies (2.4), which leads to

\begin{align*} \hat{\mu}_t=e^{-t\lambda}\eta+\bar{\mu}\big(1-e^{-t\lambda}\big) +\int^t_ke^{\lambda(u-t)}\frac{\big(\hat{\Sigma}(u)+\sigma_S\sigma_\mu\rho\big)}{\sigma_S}d\hat{W}_u. \end{align*}

Hence, there exist positive constants l and $l_1>1$ large enough so that

\begin{equation*}\sup_{t\in[k,T]}\hat{\mu}_t\leq l+\sup_{t\in[k,T]}l_1\hat{W}_t, \ \ t\in[k,T].\end{equation*}

Using the distribution of the running maximum of the Brownian motion, there exist some positive constants $\bar{l}>1$ and $\bar{l}_1$ such that

(B.9)

\begin{align} \mathbb{E}\left[\sup_{n}N^{1-p}_{\tau_n\wedge T}\right] \leq\bar{l}_1\mathbb{E}\bigg[\exp\bigg(\sup_{t\in[k,T]}\bar{l}\hat{W}_t\bigg)\bigg]<+\infty. \end{align}
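The finiteness in (B.9) can be made explicit. As a sketch, write $W$ for the Brownian motion appearing in (B.9) and assume it is started from zero at time k. By the reflection principle, $\sup_{t\in[k,T]}W_t$ has the same law as $|W_T|$ , which is the absolute value of a centered Gaussian variable with variance $T-k$ . Hence, with $\Phi$ denoting the standard normal distribution function,

\begin{equation*}\mathbb{E}\bigg[\exp\bigg(\bar{l}\sup_{t\in[k,T]}W_t\bigg)\bigg]=\mathbb{E}\Big[e^{\bar{l}|W_T|}\Big]=2e^{\bar{l}^2(T-k)/2}\,\Phi\big(\bar{l}\sqrt{T-k}\big)\leq 2e^{\bar{l}^2(T-k)/2}<+\infty.\end{equation*}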

Finally, by (B.8) and (B.9), we can conclude that

\begin{align*} \mathbb{E}\Big[\sup_{n}\big|V\big(\tau_n\wedge T,\hat{X}_{\tau_n\wedge T}+\epsilon,Z_{\tau_n\wedge T},\hat{\mu}_{\tau_n\wedge T}\big)\big|\Big]<+\infty. \end{align*}

The dominated convergence theorem and $N(T,\hat{\mu}_T) = 0$ imply that

\begin{align*} \lim_{n\to\infty}\mathbb{E}\Big[V\big(\tau_n\wedge T,\hat{X}_{\tau_n\wedge T}+\epsilon,Z_{\tau_n\wedge T},\hat{\mu}_{\tau_n\wedge T}\big)\Big]=\mathbb{E}\left[\frac{1}{p}(Y_T+\epsilon)^pN^{1-p}\big(T,\hat{\mu}_T\big)\right] = 0. \end{align*}

Combining this with (B.7) and $(\pi_t,c_t)\in\mathcal{A}$ , we have that

\begin{align*} V(k,x+\epsilon,z,\eta;\,\theta)\geq\sup_{\pi,c\in\mathcal{A}}\mathbb{E} \left[\int^T_k\frac{(c_s-Z_s)^p}{p}ds\right] =\widehat{V}(k,x,z,\eta,\theta). \end{align*}

Note that $V(t,x,z,\eta;\,\theta)$ is continuous in the variable x. By letting $\epsilon\to 0$ , we deduce that

\begin{align*} V(k,x,z,\eta;\,\theta)=\lim_{\epsilon\to 0}V(k,x+\epsilon,z,\eta) \geq \widehat{V}(k,x,z,\eta,\theta). \end{align*}

On the other hand, for $\pi^*_t$ and $c^*_t$ given in (3.8) and (3.9), we first need to show that the SDE

(B.10)

\begin{align} d\hat{X}^*_t=\big(\pi^*_t\hat{\mu}_t-c^*_t\big)dt+\sigma_S\pi^*_td\hat{W}_t, \ \ k\leq t\leq T, \end{align}

with initial condition $x>m(k)z$ admits a unique strong solution that satisfies the constraint $\hat{X}^*_t>m(t)Z^*_t$ for all $k\leq t\leq T$ . Let $Y^*_t=\hat{X}^*_t-m(t)Z^*_t$ . By Itô’s lemma and substitution of $c^*_t$ using (3.9), we obtain that

\begin{align*} \begin{split} dY^*_t=&\left[-\frac{\big(1+\delta(t)m(t)\big)^{\frac{-p}{1-p}}}{N} +\frac{\hat{\mu}_t^2}{(1-p)\sigma_S^2}+\frac{\big(\hat{\Sigma}(t)+\sigma_S\sigma_\mu\rho\big)}{\sigma_S^2}\frac{N_{\eta}}{N}\hat{\mu}_t\right]Y^*_t dt\\ &+\left[\frac{\hat{\mu}_t}{(1-p)\sigma_S}+\frac{\big(\hat{\Sigma}(t)+\sigma_S\sigma_\mu\rho\big)}{\sigma_S}\frac{N_{\eta}}{N}\right]Y^*_td\hat{W}_t. \end{split} \end{align*}

In order to solve for $\hat{X}^*_t$ explicitly, we define the auxiliary process $\Gamma_t \,:\!=\, \frac{N(t,\hat{\mu}_t)}{Y^*_t}$ , for $k\leq t\leq T$ . Itô’s lemma gives that

(B.11)

\begin{align} \begin{split} d\Gamma_t =&\frac{\Gamma_t}{N_t}\Bigg[N_t-\lambda(\hat{\mu}_t-\bar{\mu})N_{\eta} +\frac{\big(\hat{\Sigma}(t)+\sigma_S\sigma_\mu\rho\big)^2}{2\sigma_S^2}N_{\eta\eta} +\frac{\hat{\mu}_t\big(\hat{\Sigma}(t)+\sigma_S\sigma_\mu\rho\big)p}{(1-p)\sigma_S^2}N_{\eta}\\ &+\big(1+\delta(t)m(t)\big)^{\frac{-p}{1-p}} +\frac{p\hat{\mu}_t^2}{(1-p)^2\sigma_S^2}N\Bigg]dt +\Gamma_t\left[\frac{-\hat{\mu}_t}{(1-p)\sigma_S}\right]d\hat{W}_t. \end{split} \end{align}

As $N(t,\eta)$ satisfies the linear PDE (3.3), (B.11) is reduced to

\begin{align*} d\Gamma_t=\Gamma_t\left[\frac{p\hat{\mu}_t^2}{2(1-p)^2\sigma_S^2}\right]dt +\Gamma_t\left[\frac{-\hat{\mu}_t}{(1-p)\sigma_S}\right]d\hat{W}_t; \end{align*}

the existence of a unique strong solution is thus verified, and $\Gamma_k=\frac{N(k,\eta)}{x-m(k)z}>0$ implies that $\Gamma_t>0$ , $\forall k\leq t\leq T$ . Therefore, it holds that the SDE (B.10) admits a unique strong solution as defined in (3.10), and the solution $\hat{X}^*_t$ satisfies the constraint (B.1).
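The positivity of $\Gamma_t$ can also be seen from the stochastic exponential form of the reduced SDE. As a sketch, applying Itô’s lemma to $\log\Gamma_t$ and using that the drift coefficient minus half the squared volatility equals $\frac{p\hat{\mu}_t^2}{2(1-p)^2\sigma_S^2}-\frac{\hat{\mu}_t^2}{2(1-p)^2\sigma_S^2}=-\frac{\hat{\mu}_t^2}{2(1-p)\sigma_S^2}$ , we get

\begin{equation*}\Gamma_t=\frac{N(k,\eta)}{x-m(k)z}\exp\left(-\int^t_k\frac{\hat{\mu}_v}{(1-p)\sigma_S}d\hat{W}_v-\int^t_k\frac{\hat{\mu}_v^2}{2(1-p)\sigma_S^2}dv\right)>0, \ \ k\leq t\leq T.\end{equation*}

This expression follows from the reduced SDE alone; we have not compared it term by term with (3.10), which is stated outside this appendix.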

Next, we verify that the pair $(\pi^*_t,c^*_t)$ is indeed in the admissible space $\mathcal{A}$ . First, by the definitions in (3.8) and (3.9), it is clear that $\pi^*_t$ and $c^*_t$ are $\mathcal{F}^S_t$ -progressively measurable, and by the path continuity of $Y^*_t=\hat{X}^*_t-m(t)Z^*_t$ and of $\pi^*_t$ and $c^*_t$ , it is easy to show that $\int^T_k(\pi^*_t)^2dt<+\infty$ and $\int^T_kc^*_tdt<+\infty$ , a.s. Also, because $\hat{X}^*_t>m(t)Z^*_t$ for all $t\in[k,T]$ , by the definition of $c^*_t$ , the consumption constraint $c^*_t>Z^*_t$ for all $t\in[k,T]$ is satisfied. It follows that $(\pi^*_t,c^*_t)\in\mathcal{A}$ .

Given $(\pi^*_t,c^*_t)$ as above, instead of the inequality (B.6), the following equality holds:

\begin{align*} V\big(k,x,z,\eta;\,\theta\big)=\mathbb{E}\left[\int^{\tau_n\wedge T}_k\frac{\big(c^*_t-Z^*_t\big)^p}{p}dt\right] +\mathbb{E}\Big[V\big(\tau_n\wedge T,\hat{X}^*_{\tau_n\wedge T},Z^*_{\tau_n\wedge T},\hat{\mu}_{\tau_n\wedge T}\big)\Big]. \end{align*}

The monotone convergence theorem gives

\begin{equation*}\lim_{n\to+\infty}\mathbb{E}\left[\int^{\tau_n\wedge T}_k\frac{\big(c^*_t-Z^*_t\big)^p}{p}dt\right] =\mathbb{E}\left[ \int^{T}_k\frac{\big(c^*_t-Z^*_t\big)^p}{p}dt \right].\end{equation*}

Moreover, as we have $V(t,x,z,\eta)<0$ because $p<0$ , Fatou’s lemma implies that

\begin{align*} \limsup_{n\to+\infty}\mathbb{E}\Big[V\big(\tau_n\wedge T,\hat{X}^*_{\tau_n\wedge T},Z^*_{\tau_n\wedge T},\hat{\mu}_{\tau_n\wedge T}\big)\Big] \leq\mathbb{E}\Big[V\big(T,\hat{X}^*_{T},Z^*_{T},\hat{\mu}_{T}\big)\Big] =0. \end{align*}

It follows that

\begin{align*} V(k,x,z,\eta;\,\theta) \leq\mathbb{E}\left[\int^{T}_k\frac{\big(c^*_t-Z^*_t\big)^p}{p}dt\right] \leq \widehat{V}(k,x,z,\eta,\theta), \end{align*}

which completes the proof.

Funding information

Y. Yang and X. Yu are supported by the Hong Kong Polytechnic University under research grant no. P0031417.

Competing interests

There were no competing interests to declare which arose during the preparation or publication process for this article.

References

Ahearne, A. G., Griever, W. L. and Warnock, F. E. (2004). Information costs and home bias: an analysis of US holdings of foreign equities. J. Internat. Econom. 62, 313–336.

Bayraktar, E. and Sirbu, M. (2012). Stochastic Perron’s method and verification without smoothness using viscosity comparison: the linear case. Proc. Amer. Math. Soc. 140, 3645–3654.

Bayraktar, E. and Sirbu, M. (2013). Stochastic Perron’s method for Hamilton–Jacobi–Bellman equations. SIAM J. Control Optimization 51, 4274–4294.

Bayraktar, E. and Sirbu, M. (2014). Stochastic Perron’s method and verification without smoothness using viscosity comparison: obstacle problems and Dynkin games. Proc. Amer. Math. Soc. 142, 1399–1412.

Bayraktar, E. and Zhang, Y. (2015). Stochastic Perron’s method for the probability of lifetime ruin problem under transaction costs. SIAM J. Control Optimization 53, 91–113.

Björk, T., Davis, M. and Landén, C. (2010). Optimal investment under partial information. Math. Meth. Operat. Res. 71, 371–399.

Bo, L., Liao, H. and Yu, X. (2019). Risk-sensitive credit portfolio optimization under partial information and contagion risk. Preprint. Available at https://arxiv.org/abs/1905.08004.

Brendle, S. (2006). Portfolio selection under incomplete information. Stoch. Process. Appl. 116, 701–723.

Brennan, M. J. and Xia, Y. (2010). Persistence, predictability, and portfolio planning. In Handbook of Quantitative Finance and Risk Management, Springer, Boston, pp. 289–318.

Campbell, J. Y. et al. (1997). The Econometrics of Financial Markets. Princeton University Press.

Constantinides, G. M. (1990). Habit formation: a resolution of the equity premium puzzle. J. Political Econom. 98, 519–543.

Detemple, J. and Zapatero, F. (1992). Optimal consumption-portfolio policies with habit formation. Math. Finance 2, 251–274.

Duckworth, J. K. and Zervos, M. (2000). An investment model with entry and exit decisions. J. Appl. Prob. 37, 547–559.

Englezos, N. and Karatzas, I. (2009). Utility maximization with habit formation: dynamic programming and stochastic PDEs. SIAM J. Control Optimization 48, 481–520.

Fama, E. F. and French, K. R. (1989). Business conditions and expected returns on stocks and bonds. J. Financial Econom. 25, 23–49.

Friedman, A. (2012). Stochastic Differential Equations and Applications. Dover, Mineola, NY.

Janeček, K. and Sîrbu, M. (2012). Optimal investment with high-watermark performance fee. SIAM J. Control Optimization 50, 790–819.

Kang, J. and Stulz, R. M. (1997). Why is there a home bias? An analysis of foreign portfolio equity ownership in Japan. J. Financial Econom. 46, 3–28.

Karatzas, I. and Shreve, S. (1991). Brownian Motion and Stochastic Calculus, 2nd edn. Springer, New York.

Keppo, J., Tan, H. M. and Zhou, C. (2019). Smart city investments. Preprint. Available at https://doi.org/10.2139/ssrn.3141043.

Kim, T. S. and Omberg, E. (1996). Dynamic nonmyopic portfolio behavior. Rev. Financial Studies 9, 141–161.

Lakner, P. (1998). Optimal trading strategy for an investor: the case of partial information. Stoch. Process. Appl. 76, 77–97.

Lee, J., Yu, X. and Zhou, C. (2021). Lifetime ruin under high-water mark fees and drift uncertainty. Appl. Math. Optimization 84, 2743–2773.

Mehra, R. and Prescott, E. C. (1985). The equity premium: a puzzle. J. Monetary Econom. 15, 145–161.

Monoyios, M. (2009). Optimal investment and hedging under partial and inside information. Adv. Financial Model. 8, 371–410.

Munk, C. (2008). Portfolio and consumption choice with stochastic investment opportunities and habit formation in preferences. J. Econom. Dynam. Control 32, 3560–3589.

Pham, H. (1997). Optimal stopping, free boundary, and American option in a jump-diffusion model. Appl. Math. Optimization 35, 145–164.

Pham, H. (2009). Continuous-Time Stochastic Control and Optimization with Financial Applications. Springer, Berlin, Heidelberg.

Portes, R. and Rey, H. (2005). The determinants of cross-border equity flows. J. Internat. Econom. 65, 269–296.

Poterba, J. M. and Summers, L. H. (1988). Mean reversion in stock prices: evidence and implications. J. Financial Econom. 22, 27–59.

Reikvam, K. (1998). Viscosity solutions of optimal stopping problems. Stoch. Stoch. Reports 62, 285–301.

Revuz, D. and Yor, M. (1991). Continuous Martingales and Brownian Motion. Springer, Berlin, Heidelberg.

Sirbu, M. (2014). Stochastic Perron’s method and elementary strategies for zero-sum differential games. SIAM J. Control Optimization 52, 1693–1711.

Xia, Y. (2001). Learning about predictability: the effects of parameter uncertainty on dynamic asset allocation. J. Finance 56, 205–246.

Yu, X. (2015). Utility maximization with addictive consumption habit formation in incomplete semimartingale markets. Ann. Appl. Prob. 25, 1383–1419.

Yu, X. (2017). Optimal consumption under habit formation in markets with transaction costs and random endowments. Ann. Appl. Prob. 27, 960–1002.
