Hostname: page-component-745bb68f8f-b6zl4 Total loading time: 0 Render date: 2025-02-11T07:05:33.893Z Has data issue: false hasContentIssue false

FIXED-b ASYMPTOTICS IN SINGLE-EQUATION COINTEGRATION MODELS WITH ENDOGENOUS REGRESSORS

Published online by Cambridge University Press:  23 May 2006

Helle Bunzel
Affiliation:
Iowa State University
Rights & Permissions [Opens in a new window]

Abstract

This note uses fixed bandwidth (fixed-b) asymptotic theory to suggest a new approach to testing cointegration parameters in a single-equation cointegration environment. It is shown that the standard tests still have asymptotic distributions that are free of serial correlation nuisance parameters regardless of the bandwidth or kernel used, even if the regressors in the cointegration relationship are endogenous.I thank Tim Vogelsang for his guidance, Pentti Saikkonen for the promptness with which he provided me with an old working paper of his, Wayne Fuller for several discussions, and an anonymous referee for many helpful comments. The usual disclaimer applies.

Type
NOTES AND PROBLEMS
Copyright
© 2006 Cambridge University Press

1. MOTIVATION

In regression models with heteroskedasticity and serial correlation of unknown form, the standard approach to testing hypotheses on the regression parameters involves estimation of the correlation structure using nonparametric heteroskedasticity and autocorrelation consistent (HAC) estimators. These estimators have been thoroughly examined in the literature by, among many others, Andrews (1991), Andrews and Monahan (1992), Hansen (1992), de Jong and Davidson (2000), Newey and West (1987), Robinson (1991), and White (1984). They furnish consistent estimates of the correlation structure in single-equation models of cointegration also, allowing inference on the cointegrating vector to be carried out using conventional tests. Inference conducted in this manner leads to pivotal tests and is robust to heteroskedasticity and serial correlation of unknown form. Even though tests that use HAC estimators are valid asymptotically, they typically display substantial size distortions in finite samples (see, e.g., Andrews, 1991; Andrews and Monahan, 1992; den Haan and Levin, 1997).

Recent efforts have been made to improve upon the HAC approach in standard (stationary) regression models using inconsistent covariance matrix estimates. The first paper in this literature was Kiefer, Vogelsang, and Bunzel (2000), where a new test based on the Bartlett kernel with bandwidth equal to sample size was developed. Continuing this line of research, Bunzel, Kiefer, and Vogelsang (2001) extended the theory to nonlinear stationary regression models, and Kiefer and Vogelsang (2002) developed the new fixed bandwidth (fixed-b) asymptotic theory. In the case of a cointegration relationship with exogenous regressors, applying fixed-b asymptotics would have been a straightforward extension of the theory in the standard regression model. However, when the regressors are allowed to be endogenous, the task is nontrivial. Here we provide the asymptotic distribution of Wald and t-type statistics in single-equation cointegration models with endogenous regressors.

The principle behind the fixed-b theory is to let b = M/T where T is the sample size and M is the truncation lag or bandwidth used to compute the HAC estimator. The standard assumptions would require that b → 0, but fixed-b asymptotics instead assumes that the truncation lag is a fixed proportion of the sample, i.e., b is fixed. This approach has several advantages. First, it improves the asymptotic approximation, resulting in reduced size distortions. Second, it provides an asymptotic distribution that depends on the bandwidth and kernel, thus providing us with better tools for choosing these parameters.

2. RESULTS

Consider the following model containing a single cointegrating relationship in addition to some deterministic variables:

where f (t) denotes a (k1 × 1) vector of trend functions, Xt is a

vector of regressors, and α and β are (k1 × 1) and

vectors of parameters, respectively. Let ′ denote the transpose, except when it is used in conjunction with the kernel function, where it will denote the derivative. It is assumed that the sequence {ut} = {(u1,t,u2,t′)′} does not contain unit roots but may exhibit serial correlation or heteroskedasticity.

At times, it will be useful to stack the first equation in (1) and rewrite it as

Here f(T) is the (T × k1) stacked vector of trend functions, and X is the

matrix of regressors. The following notation is required before we state the main assumptions of the note. Denote

, Γ(j) = E(utut+j′), and Γ22(j) = E(u2,tu2,t+j′), let wj(r) be a j-vector of independent Wiener processes, and let [rT] denote the integer part of rT, where r ∈ [0,1]. In the discussion that follows ⇒ is used to denote weak convergence.

The first assumption, which is similar to that of Vogelsang (1998), is made to rule out ill-behaved trend functions and to provide some useful notation for deriving and stating the asymptotic distributions.

Assumption 1. There exist a (k1 × k1) diagonal matrix τT and a vector of functions F, such that

exists and is nonsingular. In addition, f (t) includes a constant term.

Assumption 1 can be relaxed but, as it stands, is sufficiently general to cover most commonly used models. For later use, let F(T) be the matrix of the stacked F(t/T) functions.

The next assumption provides us with the necessary invariance principles and ensures that we can estimate (1) consistently, even when the regressors are endogenous.

Assumption 2. {ut}t=1 is weakly stationary and satisfies the following conditions:

Assumptions 2(a)–(d) have been used extensively in the literature on nonparametric covariance matrix estimation to ensure that the relevant multivariate invariance principles hold. These conditions are sufficient to provide the asymptotic distribution of the ordinary least squares (OLS) estimates of (1) if the regressors are exogenous. Assumptions 2(e)–(g) are made to allow us to deal with endogenous regressors in the manner suggested by Saikkonen (1991), Phillips and Loretan (1991), Stock and Watson (1993), and Wooldridge (1991). A direct implication (see Saikkonen, 1991) is that we can write u1,t as

where

is a stationary process such that E(u2,t vt+l′) = E((XtXt−1)vt+l′) = 0, l = 0,±1,±2,…. Following standard procedure, we can thus estimate the model using dynamic ordinary least squares (DOLS); i.e., we estimate

where

. We are now ready to make the third and final assumption.

Assumption 3. Let p → ∞ such that p3/T → 0 and T1/2 [sum ]| j|>p∥γj∥ → 0.

Saikkonen (1991) shows that if Assumption 3 holds, then (4) is asymptotically equivalent to

and under Assumptions 1–3, the asymptotic distributions of the least squares estimates of α and β are well known (for the case where trends are included in the model, see Saikkonen, 1991; Phillips and Loretan, 1991; Stock and Watson, 1993; Wooldridge, 1991; Phillips and Hansen, 1990).

As usual, inference on β is conducted using the DOLS estimates and a HAC estimate of the asymptotic covariance matrix to form Wald or t-type test statistics. Denote the Cholesky composition of Ω by

. Then HAC estimators of σ take the general form

Here N = T − (2p + 1),

are the residuals from (4), M is called the bandwidth or the truncation lag, and k(x) is a kernel function satisfying k(x) = k(−x), k(0) = 1, |k(x)| ≤ 1, k(x) continuous at x = 0, and

. Following Kiefer and Vogelsang (2002), we assume that M is directly proportional to T, such that M = [bT] (in place of the usual assumption that M/T → 0 as T → ∞) and develop this asymptotic theory for the cointegration model. The limiting distribution of

will depend on the specific bandwidth (now fully determined by the parameter b) and kernel used to construct the estimator. To proceed we provide the following definition, which describes two different types of kernels.

Definition. A kernel is labeled type 1 if k(x) is twice continuously differentiable everywhere and is labeled type 2 if k(x) is continuous, k(x) = 0 for |x| ≥ 1, and k(x) is twice continuously differentiable everywhere except at |x| = 1.

In addition, we will consider the Bartlett kernel separately. The following lemma provides the asymptotic distribution of

under fixed-b asymptotics and for various choices of kernels. To state the asymptotic distributions, we define

where w1(s) is independent of

and where

is defined as the residual from the projection of

on the subspace generated by F(s) in the Hilbert space of square integrable functions on [0,1] with the inner product

, such that

. Correspondingly, F(s)X is the residual from the projection of F(s) onto the space generated by

.

LEMMA 1. If k is type 1,

If k is type 2

where k*(x) = k(x/b) and k*′(b) is the derivative of k*(x) from below at b. If k is the Bartlett kernel,

The proof of Lemma 1 follows that of Kiefer and Vogelsang (2002) but with the added complication that endogenous regressors are present, and therefore some additional work is required to determine the asymptotic distribution of the partial sums of the residuals. The asymptotic distribution of

is proportional to σ2 and depends on the bandwidth and kernel as expected.

Using Lemma 1, hypotheses of the form H0 : Rβ = β0 can be tested using the standard Wald test. In what follows R is a nonstochastic restriction matrix of dimension

and rank q. The Wald test for H0 is defined as

The corresponding one-dimensional t-test can be obtained in the usual manner. Theorem 1, which follows, states the asymptotic distribution of W under fixed-b asymptotics.

THEOREM 1. Suppose Assumptions 1–3 hold. Then, under H0,

where

and

is the residual from the projection of the first q coordinates of

onto the last

coordinates of

.

This theorem demonstrates that it is possible to obtain pivotal test statistics with the fixed-b assumption. The asymptotic distribution of the Wald test depends on the kernel and bandwidth, and through

it also depends on the number of restrictions being tested, the number of regressors in the model, and the trends included, where standard b → 0 asymptotics would have resulted in an asymptotic χ2 distribution. Note that although the asymptotic distribution is nonstandard, it is simple to obtain critical values through simulations because the distribution is simply a function of independent Wiener processes. (Simulations guiding the choice of b and also some critical values are available from the author upon request.) With the critical values in hand, it is possible to use the fixed-b approach for testing in single-equation models of cointegration. Although the choice of b and the simulations demonstrating the efficacy of the test are treated in a companion paper (Bunzel, 2006), the fixed-b approach is likely to improve the asymptotic approximation, resulting in reduced size distortions. It also provides an asymptotic distribution that depends on the bandwidth and kernel, thus providing us with better tools for choosing these parameters.

APPENDIX

A.1. Proof of Lemma 1.

Following Kiefer and Vogelsang (2002), we define

and use this expression to rewrite

as

where

are the residuals from (4). Note that for (A.1) to be valid it must be the case that the residuals sum to zero. Therefore, for the results that follow to be valid f (t) must include a constant term as assumed in Assumption 1. To establish the asymptotic distribution of

, it is necessary first to determine the asymptotic distribution of

.

LEMMA 2. Under Assumptions 1–3,

.

Proof. First define γ = [γp′,…, γp′]′, which is a

vector of parameters and let ΔZt+p = [ΔXtp′,ΔXtp+1,…,′ΔXt+p′]′ be the corresponding vector of regressors. Simple matrix manipulations yield

In what follows, we will show that the last term in the expression for

, (A.2), is

and therefore does not affect the asymptotic distribution of

. First note that

by Assumption 2(f). Thus

. We then obtain

by equation (23) in Saikkonen (1991), which states that

.

We can now determine the asymptotic distribution of

from the first two terms of (A.2). By Assumptions 1–3 we know from Saikkonen (1991) and Phillips and Hansen (1990) that

It also follows directly from Assumption 2 that

. Because N/T → 1, it will also be the case that

. So it is now established that

The rest of the proof is split into three cases, corresponding to type 1, type 2, and the Bartlett kernels. It follows directly from Kiefer and Vogelsang (2002) and Lemma 2.

Case 1. k(x) is a type 1 kernel. By definition of the second derivative, T2Δ2κil − (−k*′′((il)/N)) → 0, and using Lemma 2 it follows easily that

Case 2. k(x) is a type 2 kernel. Following Kiefer and Vogelsang (2002), we use simple algebra and the definition of Δ2κij to establish that when |ij| > [bN], Δ2κij = 0, and when |ij| = [bN], Δ2κij = −k(([bN] − 1)/[bN]). Also recall that when |ij| < [bN] k(x) is twice continuously differentiable. We split up the expression of

as follows:

where the asymptotic distribution follows directly from Lemma 2 and Kiefer and Vogelsang (2002).

Case 3. k(x) is the Bartlett kernel. Here again following Kiefer and Vogelsang (2002), it can be verified that when |ij| = 0, Δ2κij = 2/[bN], and when |ij| = [bN], Δ2κij = −(1/[bN]). Using these expressions and Lemma 2 in (A.1), we obtain the following limiting distribution:

A.2. Proof of Theorem 1.

The initial step of the proof will be to rewrite the model, projecting out all regressors that are not related to the hypothesis in question. Then we will prove that the statistic is numerically unchanged if it is calculated from the rewritten model. Finally the expression of W obtained from the rewritten model will be used to derive the asymptotic distribution of the statistic.

To rewrite the model, let

, where D is chosen such that L has full rank

, and define

. Using these definitions, model (4) can be rewritten in the following manner:

Because

are linear combinations of X, they too contain unit root processes if the original assumption of just one cointegration relationship is maintained. Furthermore

contains the leads and lags of the differenced

variables. We will now show that testing H0 : Rβ = β0 is equivalent to testing the hypothesis

: β1* = β0 in the model

where for any matrix G,

.

LEMMA 3. The statistic for testing

from (A.3) is numerically identical to the statistic for testing H0 : Rβ = β0 in the model given by (4).

The proof follows from extensive but simple matrix algebra. To complete the proof of the theorem we thus need to determine the asymptotic distribution of

Because

, where

is defined as

, but for the model in (A.3), we know

and therefore,

By the definition of X1*,

The distribution of W* can now be obtained.

If k is type 1,

If k is type 2,

If k is Bartlett,

References

REFERENCES

Andrews, D.W.K. (1991) Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59, 817854.Google Scholar
Andrews, D.W.K. & J.C. Monahan (1992) An improved heteroskedasticity and autocorrelation consistent covariance matrix estimator. Econometrica 60, 953966.Google Scholar
Bunzel, H. (2006) Practical Use of Fixed-b Asymptotics in Cointegration Models. Mimeo, Iowa State University.
Bunzel, H., N.M. Kiefer, & T.J. Vogelsang (2001) Simple robust testing of hypotheses in non-linear models. Journal of the American Statistical Association 96, 10881098.Google Scholar
de Jong, R.M. & J. Davidson (2000) Consistency of kernel estimators of heteroskedastic and autocorrelated covariance matrices. Econometrica 68, 407424.Google Scholar
den Haan, W.J. & A. Levin (1997) A practictioner's guide to robust covariance matrix estimation. In G. Maddala & C. Rao (eds.), Handbook of Statistics: Robust Inference, vol. 15, pp. 291341. New York: Elsevier.
Hansen, B.E. (1992) Consistent covariance matrix estimation for dependent heterogenous processes. Econometrica 60, 967972.Google Scholar
Kiefer, N.M. & T.J. Vogelsang (2002) Heteroskedasticity-autocorrelation robust testing using bandwidth equal to sample size. Econometric Theory 18, 13501366.Google Scholar
Kiefer, N.M., T.J. Vogelsang, & H. Bunzel (2000) Simple robust testing of regression hypotheses. Econometrica 68, 695714.Google Scholar
Newey, W.K. & K.D. West (1987) A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix. Econometrica 55, 703708.Google Scholar
Phillips, P.C.B. & B.E. Hansen (1990) Statistical inference in instrumental variables regression with I(1) processes. Review of Economic Studies 57, 99125.Google Scholar
Phillips, P.C.B. & M. Loretan (1991) Estimating long run economic equilibria. Review of Economic Studies 58, 407436.Google Scholar
Robinson, P. (1991) Automatic frequency domain inference on semiparametric and nonparametric models. Econometrica 59, 13291363.Google Scholar
Saikkonen, P. (1991) Asymptotically efficient estimation of cointegrating regressions. Econometric Theory 7, 121.Google Scholar
Stock, J.H. & M.W. Watson (1993) A simple estimator of cointegrating vectors in higher order integrated systems. Econometrica 61, 783820.Google Scholar
Vogelsang, T.J. (1998) Trend function hypothesis testing in the presence of serial correlation parameters. Econometrica 65, 123148.Google Scholar
White, H. (1984) Asymptotic Theory for Econometricians. Academic Press.
Wooldridge, J.M. (1991) Notes on Regression with Difference-Stationary Data. Mimeo, Michigan State University.