1. INTRODUCTION
Cointegration has traditionally focused on the case of unit root (or integrated of order one) observable processes with weak dependent (or integrated of order zero) cointegrating errors. Formally, we consider that a zero-mean scalar covariance stationary process ζt, t ∈ Z, Z = {t : t = 0,±1,…}, with spectral density fζ(λ), is integrated of order zero, denoted ζt ∼ I(0), if
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm001.gif?pub-status=live)
whereas a zero-mean scalar process is I(d) if it could be represented as an I(0) process after differencing it d times. More precisely, let ζt, t ∈ Z, be an I(0) process and then define the Type II fractionally integrated of order d process ξt, denoted ξt ∼ I(d), as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm001.gif?pub-status=live)
where Δ = 1 − L, L is the lag operator,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm002.gif?pub-status=live)
where Γ(·) represents the gamma function, taking Γ(α) = ∞ for α = 0,−1,−2,…, and Γ(0)/Γ(0) = 1. The # superscript attached to a scalar or vector sequence ht has the meaning
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm003.gif?pub-status=live)
where 1(·) is the indicator function. Note that because of the truncation on the right-hand side of (1), ξt = 0, t ≤ 0, whereas integer values of d provide the typical definition of integrated process with a particular initial condition. For example, if d = 1, as aj(1) = 1, j ≥ 0,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm002.gif?pub-status=live)
For an alternative definition of fractionally integrated process (the Type I class) see Marinucci and Robinson (1999). Note that for d < ½ the truncation on the right-hand side of (1) is not strictly necessary and implies that ξt is only asymptotically stationary, in the precise sense defined by Robinson and Marinucci (2001). If d ≥ ½, this truncation implies that the variance of ξt is finite (albeit evolving at rate O(t2d−1)), so that ξt is well defined in mean-squared sense.
The traditional situation mentioned previously was denoted by Engle and Granger (1987) as CI(1,1), first and second arguments referring to the integration orders of the observables and the reduction of these orders by certain linear combinations, respectively. Recently, increasing interest about a wider framework where the orders of integration of both observables and cointegrating errors could be real, but perhaps not integer, numbers has emerged. This, in view of the work of Granger and Joyeux (1980) and Hosking (1981) on fractionally integrated processes, represents a natural generalization of the CI(1,1) framework, which was already anticipated by Engle and Granger (1987) because their CI(d,b) definition did not necessarily require d, b to be integers. This general setting considers CI(1,1) as a particular situation, noting that I(1) and I(0) are very specific cases of nonstationarity and stationarity, respectively, whereas it also allows for consideration of “stationary cointegration,” where the observables are (perhaps only asymptotically) covariance stationary long-memory processes, with cointegrating errors also being long memory with strictly less memory than the observables or weak dependent I(0) processes. Theoretical works on estimation of the relation of cointegration in fractional frameworks include Robinson (1994), Jeganathan (1999), Kim and Phillips (2000), Robinson and Marinucci (2001, 2003), Hualde and Robinson (2006), Robinson and Hualde (2003), and Hualde and Robinson (2004a).
This new setting introduces additional challenges with respect to the traditional CI(1,1) situation. In particular, assuming that the integration orders are real numbers, the precise knowledge of their values seems difficult to justify, even after pretesting, which contradicts the usual practice in the standard CI(1,1) framework, where the assumed knowledge of these orders is used to derive estimates with optimal asymptotic properties (see, e.g., Phillips, 1991). In the fractional setting, some of the works mentioned earlier deal explicitly with the important issue of unknown integration orders, although, as in the traditional prescription, standard estimates that do not rely on this knowledge could enjoy good asymptotic properties. Among those, the most common ones are the ordinary least squares (OLS) and the narrow band least squares (NBLS), whose asymptotic properties in possibly fractional circumstances were studied by Robinson and Marinucci (2001, 2003).
Furthermore, the real-valued condition of the integration orders also poses additional problems, which can be presented by the next simple example. Suppose we observe the processes yt, xt, t = 1,…, n, which are I(δ1), I(δ2), being generated by the I(0) processes ζ1t, ζ2t, respectively. Then, provided δ1 = δ2, a necessary and sufficient condition for the two series yt, xt to be cointegrated is that the long-run variance-covariance matrix of the bivariate process (ζ1t,ζ2t)′, defined as 2π times its spectral density evaluated at frequency 0, is singular. Needless to say, the existence of cointegration or no cointegration between yt and xt has an enormous impact on the behavior of different statistics and estimates. For example, if δ1 = δ2 = 1 and there is no cointegration, Phillips (1986) showed that the OLS estimate
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm003.gif?pub-status=live)
is not consistent for the fundamental coefficient (see Park, Ouliaris, and Choi, 1988)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm004.gif?pub-status=live)
where ωij is the (i,j)th element of the long-run variance-covariance matrix of (ζ1t,ζ2t)′. Alternatively, if there is cointegration and (ζ1t,ζ2t)′ has a structure such that the linear combination yt − νxt is an I(0) process, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm004.gif?pub-status=live)
is an n-consistent estimate of ν, with nonstandard limiting distribution (Phillips and Durlauf, 1986).
The present paper does not focus on departures from the standard notion of cointegration affecting the singularity of this long-run variance-covariance matrix (see, e.g., Jansson and Haldrup, 2002), but on consequences of relaxing the condition of equality of the integration orders of the observables, which is necessary for the existence of cointegration. For example, let δ2 = 1 and δ1 be a fixed value such that 0 < δ1 ≠ 1 in the previous example. Clearly, any linear combination of yt and xt will be I(1) or I(δ1) depending on whether δ1 < 1 or δ1 > 1, respectively. Thus, irrespective of the possible singularity of the long-run variance-covariance matrix of ζ1t, ζ2t, following Robinson and Marinucci (2001),
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm005.gif?pub-status=live)
so that the OLS estimate either converges to zero or diverges as n tends to infinity, in the case δ1 < 1 or δ1 > 1, respectively. We find two very relevant issues related to this discussion. First, even in the situation where the singularity condition on the input error process holds and the difference between the orders δ1, δ2 is very small, the asymptotic theory predicts that the OLS estimate does not converge at all to the fundamental coefficient ν. This fact is somehow counterintuitive, as one could suspect that if δ1 and δ2 are very close to each other and the long-run variance-covariance matrix is singular, we are in fact in a “near cointegration” situation, so that estimates like the OLS should have a closer finite-sample behavior to the proper case of cointegration than to the one predicted by the theory with distinct orders of integration. The Monte Carlo experiment reported in Section 5 supports this guess, which we find very appealing when dealing with fractional orders. Here, as mentioned before, there could be situations where it is certainly not realistic to assume that the orders of integration present in the model are known. If this is the case, any testing procedure for cointegration should include a pretest for equality of the orders of integration. This was considered theoretically by Robinson and Yajima (2002) and Hualde (2002) and empirically by Marinucci and Robinson (2001). But even if we conclude that the orders of integration of two processes are statistically equal, their real-valued essence could, in certain circumstances, make us suspect that the orders are perhaps not strictly equal but only very close to each other. Strictly speaking, this would not be a situation of cointegration, but in practice, properties of the estimates might not be very much affected by minor differences in orders of integration. In this case, the typical cotrending that cointegration implies would be only approximate but could be sufficient to infer sensible statistical results.
This idea is very close in spirit to some well-established evidence in the literature. This mainly refers to the nearly nonstationary first-order autoregressive (AR(1)) model studied by, among others, Ahtola and Tiao (1984), Chan and Wei (1987), Phillips (1987, 1988), Cox and Llatas (1991), and Elliott (1998). These studies were motivated by the well-known fact that for the AR(1) process
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm006.gif?pub-status=live)
where ζt, t = 0,±1,…, is an independent and identically distributed sequence, when |φ| < 1,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm007.gif?pub-status=live)
is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm008.gif?pub-status=live)
-consistent and asymptotically normal, but this limiting distribution provides a poor approximation to the actual finite-sample distribution of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm009.gif?pub-status=live)
for moderate n when φ is close to (but below) 1. Evans and Savin (1981) presented numerical evidence that the nonstandard limiting distribution of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm010.gif?pub-status=live)
when φ = 1, which is n-consistent, provides a better approximation when φ is close to but below 1. This issue becomes very relevant if we are uncertain about whether a process has a root of unity or in the vicinity of unity. These authors studied the limiting distribution of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm011.gif?pub-status=live)
in the case
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm012.gif?pub-status=live)
which for a certain positive fixed real number α is smaller but approximating 1 as n tends to infinity, wt in this case being a nearly nonstationary AR(1) process. Phillips (1988) denoted near cointegration the situation where a linear combination of I(1) processes was nearly nonstationary. Jansson and Haldrup (2002) provided an alternative definition of near cointegration that complements to a certain extent the work of Phillips (1988). Taking our previous example, assuming δ1 = δ2 = 1, they analyzed the case where the long-run variance-covariance matrix of (ζ1t,ζ2t)′ tends suitably to a singular matrix, and they examined asymptotic properties of different estimates of ν in this case. Thus, one of our aims seeks to complement these previous analyses in a particular sense that we find very relevant in the case of dealing with fractional orders, that is, the study of a near fractional cointegration situation, where the only departure from strict cointegration is the existence of very small differences in the orders of integration of the observables that tend to disappear as the sample size tends to infinity. We will refer to this situation as weakly unbalanced cointegration.
The relevance of the second issue we analyze in the paper can be also better motivated in the fractional setting. As we will see in Section 3, in a weakly unbalanced cointegrating situation, the main message of our work is that one should not worry about suspected small differences of the integration orders in the data, because even if this happens, standard estimates could retain the properties of the strictly balanced situation. However, there could be cases where the integration orders of the observables are substantially different, so that it is unrealistic to model the relation between the series as one covered by weakly unbalanced cointegration. Traditionally, different orders of integration implied that the relation between the series could not be captured by a cointegrating structure, because the necessary condition of equality of the orders of integration was missing. On the contrary, we find that there could be situations where the integration orders of the observables are substantially different but there still exists a strong intrinsic linkage between the two processes, in particular that the long-run variance-covariance matrix of the input error process generating the integrated processes is singular. Referring to the previous example, suppose δ1 ≠ δ2, so that yt and xt would not be cointegrated but yt and xt(δ2 − δ1) (which share the same integration order δ1) are cointegrated in the usual sense, where for a scalar or vector process ht and real number c,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm005.gif?pub-status=live)
noting (2) and (3); that is, the linear combination yt − νxt(δ2 − δ1) is integrated of an order strictly smaller than δ1. Thus, it is readily seen that if δ1 = δ2 the relevant parameter to explain the long-run relationship between yt and xt is ν. If, on the contrary, the orders are different, but there is still cointegration between a raw and a filtered series, we find that two parameters are relevant to explain the long-run connection between yt and xt. These parameters are ν and δ2 − δ1. We will refer to this situation as strongly unbalanced cointegration. We believe that this concept could help practitioners in their task of unmasking anomalies appearing in some estimated models that contradict predictions from economic theory. An excellent motivating puzzle for our work is the forward premium anomaly. In short, this refers to surprising negative estimates from the regression of the change in the logarithm of the spot exchange rate on the forward premium, whereas the theory predicts a theoretical value of one for that slope (see Backus, Foressi, and Telmer, 1996; Bekaert, 1996; Bekaert, Hodrick, and Marshall, 1997). Baillie and Bollerslev (2000) describe this issue as mainly a statistical phenomenon, characterized by the fact that the integration orders of dependent and explanatory variables perhaps could be not the same. In particular, they suggest that the spot exchange rate could be I(1), whereas there seems to be evidence of long-memory behavior of the forward premium. Maynard and Phillips (2001) gave formal theoretical justification to this phenomenon. With these ideas in mind, we hope to offer a sensible statistical solution to what might well be a statistical problem.
The paper is organized as follows. In the next section we present the model, assumptions, and particular estimates whose asymptotic properties will be analyzed in the different situations pinpointed before. Section 3 collects the main results, which are fully characterized in Appendix A and rigorously justified in Appendix B. Section 4 presents alternative estimates for the strongly unbalanced cointegration situation. Finally, Section 5 reports a Monte Carlo study of finite-sample behavior of the different estimates presented in the paper and some artificially generated figures that give further motivation to the strongly unbalanced cointegration situation.
2. MODEL, ASSUMPTIONS, AND PROPOSED ESTIMATES
Throughout the paper, we consider a bivariate triangular array {(yt,n,xt,n)′ : 1 ≤ t ≤ n}n=1∞ generated by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm006.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm007.gif?pub-status=live)
where θn is a sequence of real numbers and vt = (v1t,v2t)′, t ∈ Z, is a zero-mean bivariate process at least asymptotically stationary with bounded, possibly time-dependent, spectral density f(t)(λ). Model (6), (7) is very general and allows us to consider simultaneously different situations depending on θn and the structure of vt, which will be the key element to assess whether or not there is cointegration between yt,n and xt,n. We set subsequently specific conditions on θn and vt that will determine different relations between yt,n and xt,n, that is, whether they are cointegrated in the wide sense (balanced, weakly, strongly unbalanced cointegrated) or not cointegrated at all. We will use the simplifying notation yt = yt,n, because noting (6), yt,n = yt,n′ for n ≠ n′, and similarly xt = xt,n in the case θn = θ for all n. As cointegration has been mainly considered among I(1) or I(2) processes, we will concentrate on the case where in (6), (7)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm008.gif?pub-status=live)
so that for θn = 0 both observables are purely nonstationary and the common cases δ = 1 or 2 are covered by our theory. Note that we treat all purely nonstationary situations except δ = ½, a borderline case that would require a different approach.
As will be seen in the next section, for the noncointegrated cases we will concentrate on the situation where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm009.gif?pub-status=live)
Here, the particular case θ = 0 was analyzed theoretically by Phillips (1986) for the unit root case δ = 1 and by Marmol (1998) for general real δ in the nonstationary region. This situation was denoted spurious cointegration, and here the typical dimensionality reduction in the stochastic trends explaining jointly the evolution of the observables, which characterizes cointegration, is not present. The noncointegrated cases will be also characterized by the following set of regularity conditions on the process vt. Throughout, we denote by Ip the p × p identity matrix.
ASSUMPTION NC (No cointegration). The process vt, t ∈ Z, has representation
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm013.gif?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm014.gif?pub-status=live)
and
(i) A(eiλ) is differentiable in λ with derivative in Lip(ϱamp;), ϱamp; > ½; in addition, with ∥·∥ denoting the euclidean norm,
(ii) the εt are independent and identically distributed vectors with mean zero, positive definite covariance matrix Σ, and E∥εt∥q < ∞, q ≥ 4, q > max{2/(2δ − 1),[2/(2(δ + θ) − 1)]1(δ + θ > ½)}; finally, defining the long-run variance-covariance matrix of vt as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm015.gif?pub-status=live)
with (i,j)th element ωij and the squared correlation coefficient computed from Ω
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm016.gif?pub-status=live)
(iii) ω11 > 0, ω22 > 0, ρ2 < 1.
Thus, absence of cointegration will be characterized by vt being a bivariate covariance stationary process with spectral density matrix f(t)(λ) = f (λ) such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm017.gif?pub-status=live)
with (i,j)th element fij(λ) and
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm018.gif?pub-status=live)
as is implied by (iii), which, as mentioned in the Introduction, rules out the possibility of cointegration between yt and xt,n even if θn = 0. Notice that (i) implies
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm019.gif?pub-status=live)
, because the derivative of A(eiλ) has Fourier coefficients jAj, whence Zygmund (1977, p. 240) can be applied. Further, this also implies
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm020.gif?pub-status=live)
, which, along with the condition in (ii), enables us to apply the functional limit theorem of Marinucci and Robinson (2000) (developing earlier work of Akonom and Gourieroux, 1987; Silveira, 1991), as is required to characterize the limit distribution of our estimates. Also, the moment assumption on εt is satisfied, for any δ,δ + θ > ½, by Gaussianity. Finally, note that (i) and (iii) imply that both individual processes v1t, v2t, are I(0).
Next, we will focus on the situation where there exists cointegration in the wide sense. The key here is to characterize properly the structure of vt in (6), (7), as we do in the next assumption.
ASSUMPTION C (Cointegration). There exist real numbers ν ≠ 0 and γ such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm010.gif?pub-status=live)
and a certain process u1t, t ∈ Z, where
(i)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm011.gif?pub-status=live)
(ii) the bivariate process ut = (u1t,v2t)′, t ∈ Z can be represented as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm012.gif?pub-status=live)
where the conditions on A(L) and εt set in Assumptions NC(i) and (ii) apply to B(L), εt, respectively, with E(εtεt′) = Ψ and E∥εt∥q < ∞, q ≥ 4, q > max{2/(2δ − 1),[2/(2γ − 1)]1(γ > ½)}; finally, denoting φij the (i,j)th element of the long-run variance-covariance matrix of ut, namely,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm021.gif?pub-status=live)
(iii) φ11 > 0, φ22 > 0.
This assumption has very important implications. First, (ii) and (iii) imply that both u1t and v2t are I(0) with differentiable spectral density matrix. Further, by (i), v1t is decomposed as the sum of an I(0) process and an overdifferenced asymptotically stationary process u1t(δ − γ), noting (5) and (10). This is the most relevant and distinctive condition of Assumption C. An alternative representation could have been to consider
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm013.gif?pub-status=live)
instead of v1t, where the second term on the right-hand side of (13) is now covariance stationary, so that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm140.gif?pub-status=live)
also shares this condition. However, we find it more justifiable to use the particular vt characterized by Assumption C, because under this condition (6) and (7) imply that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm022.gif?pub-status=live)
which if θn = 0 for all n is the bivariate cointegrated system involving Type II fractionally integrated processes considered by Hualde and Robinson (2006) and Robinson and Hualde (2003). Note also that even if yt and xt,n are not cointegrated in the usual sense, standard cointegration occurs between yt and xt,n(θn). Thus, the time dependence of vt seems natural when dealing with Type II processes. In fact, denoting
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm014.gif?pub-status=live)
with (i,j)th element hij(λ), it is straightforward to show that under Assumption C the time-dependent spectral density of vt, f(t)(λ) is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm023.gif?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm024.gif?pub-status=live)
which is the cointegrating gap. Clearly, f(t)(λ) can be decomposed as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm015.gif?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-03875-mediumThumb-S0266466606060361ffm025.jpg?pub-status=live)
Defining
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm016.gif?pub-status=live)
f (λ) is the spectral density function of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm026.gif?pub-status=live)
, which could be referred to as the covariance stationary process version of vt. Our cointegrating model implies
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm017.gif?pub-status=live)
because the rank of g(λ) is reduced for all λ, whereas noting that a(c;0) = 0, for any c < 0, R(0) = 0. Also
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm018.gif?pub-status=live)
where the O(·) notation in (18) does not have the usual meaning of upper bound but exact rate. As mentioned before, the type of cointegration between yt and xt,n depends crucially on the structure of the process vt and more specifically on the behavior of the component R(λ) in the vicinity of frequency 0. The speed at which this element vanishes as λ tends to 0 is the key to characterize the strength of the cointegrating relationship, and, in view of (18), this is completely determined by the cointegrating gap β. Further, by Stirling's approximation it can be seen that the rate at which R(t)(λ) vanishes depends also on this cointegrating gap, specifically
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm019.gif?pub-status=live)
uniformly in λ. We believe that (15)–(19), which are implied by model (6), (7) under Assumption C, are the key intrinsic features of any cointegrating model involving Type II fractionally integrated processes. Finally, note that the real number ν has the meaning anticipated in (4), where ωij now denotes the (i,j)th element of the long-run variance-covariance matrix of the covariance stationary version of vt (note that under Assumption NC is vt itself).
For balanced cointegration we assume that θn = 0 in (7), so that the stochastic trends driving yt and xt are strictly of the same order, but, as opposed to the case of absence of cointegration, there is a common trend of order δ driving the behavior of both processes. Weakly unbalanced cointegration will denote situations where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm027.gif?pub-status=live)
and the previously referred cotrending, which is fully characterized by the order of integration of the observables and the structure of vt, is only approximate. Strictly speaking, the triangular array xt,n is I(δ + θn), so that even if the behavior of vt implies CI(δ,β) cointegration in the case θn = 0, for fixed n, any linear combination of yt and xt,n is I(δ) or I(δ + θn) depending on whether θn < 0 or θn > 0, respectively. Note that in practice this situation could be indistinguishable from that of balanced cointegration, so that from the viewpoint of modeling purposes the applicability of the idea of weakly unbalanced cointegration is limited. However, this concept was not introduced with this objective but mainly to stress the fact that the effect of minor suspected differences in the integration orders of the observables could be negligible asymptotically. In fact, one of the points of the paper is to show that even if in finite samples yt and xt,n are not cointegrated in the strict sense, the presence of a small perturbation (converging to 0 as n tends to infinity) in one of the integration orders in a bivariate system (which could be thought of as natural in a framework where the orders of integration are real numbers) may not affect first-order asymptotic properties of different estimates and statistics, whose finite-sample performance could display behavior closer to a CI(δ,β) situation than to the one predicted by the theory if the orders of integration of the observables are different. As will be seen later, our asymptotic results and Monte Carlo experiment support this idea and also could help to explain the unexpected values (over what economic theory predicts) of some estimated parameters that appear in some empirical studies.
Strongly unbalanced cointegration will be characterized by (9), where θ ≠ 0 in (7). As mentioned in the Introduction, here yt, xt are not cointegrated, but yt, xt(θ) are, so that estimates taking into account the imbalance between the orders of integration of the observables could enjoy good asymptotic properties.
In short, we present the relevant cases in Table 1.
Cointegration between yt and xt,n
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-72768-mediumThumb-S0266466606060361tbl001.jpg?pub-status=live)
The study of the different situations will mainly focus on analyzing the asymptotic properties of a particular class of estimates of the fundamental parameter ν. To specify this class, we need some preliminary definitions. First, for a scalar or vector process ζt, let
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm028.gif?pub-status=live)
be the discrete Fourier transform and given another sequence (possibly the same one) ξt, let
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm029.gif?pub-status=live)
be the cross-periodogram and periodogram, respectively. Thus, we will consider members of the class of estimates of ν given by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm020.gif?pub-status=live)
where λj = 2πj/n, j = 0,…, m, are the Fourier frequencies, m is an integer such that 1 ≤ m ≤ n/2, cj = 1, j = 0,n/2, cj = 2 otherwise, and a is a possibly random scalar. Note that because of the orthogonal properties of the complex exponential
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm030.gif?pub-status=live)
and by the symmetry of the real part of the (cross-) periodogram about λ = 0,π, for the particular choice m = [n/2], where [·] denotes integer part,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm031.gif?pub-status=live)
which is the OLS estimate. In this case, the numerator of (20) is real, so our notation in (20) may be redundant. When m < [n/2], we will only consider the case where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm021.gif?pub-status=live)
so that under (21), (20) is the NBLS estimate. This is motivated by the fact that cointegration is a long-run phenomenon and hence neglecting components of the observable series associated with high (short-run) frequencies could improve the estimation. For cases where θn = θ for all n, we will also consider estimates νθ(m) (note that for θ = 0 this estimate is, depending on m, the OLS or NBLS) and
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm032.gif?pub-status=live)
, where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm033.gif?pub-status=live)
is a consistent estimate of θ such that the following regularity condition holds. Throughout K denotes a generic positive constant.
ASSUMPTION PE (Preliminary estimate). Provided θn = θ for all n, there exists an estimate
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm034.gif?pub-status=live)
of θ such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm022.gif?pub-status=live)
and for a real sequence gn, such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm023.gif?pub-status=live)
and random variable Λ,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm024.gif?pub-status=live)
The parameter θ reflects the difference between the integration orders of processes xt and yt, so that an estimate of θ will be naturally based on the difference of the estimates of their respective orders. As both processes yt and xt are observable, (24) with (23) is very mild, holding even if
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm035.gif?pub-status=live)
is based on semiparametric estimates of the orders of xt, yt, such as the log-periodogram or Gaussian semiparametric, analyzed by Robinson (1995a, 1995b), respectively, and extended by Velasco (1999a, 1999b) to cover arbitrarily large but finite orders of integration. The only difficulty here is that these results do not apply directly to Type II processes, although in this case the bounds measuring the distance between Type I and Type II processes established by Robinson (2005) suffice. For parametric estimates of θ, gn = nα with α = ½ is achievable. The term Λ could be Gaussian or non-Gaussian, depending on the method of estimation of θ, and also on various things, such as, for example, the degree of tapering applied in the estimation (see Velasco, 1999a, 1999b), a user-chosen number (see Lobato and Robinson, 1996), or even δ, θ (see Velasco and Robinson, 2000). Condition (22) is innocuous if
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm036.gif?pub-status=live)
is based on estimates that optimize over bounded sets, this being standard for implicitly defined estimates.
The estimates
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm037.gif?pub-status=live)
take explicitly into account the possible imbalance between the orders of integration of the series, but, in general, νθ(m) will be infeasible, because knowledge of the particular integration orders of the observables in empirical situations is difficult to justify. On the contrary,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm038.gif?pub-status=live)
does not require this knowledge, and in view of Hualde and Robinson (2006) and Robinson and Hualde (2003), one could suspect that provided
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm039.gif?pub-status=live)
converges fast enough to
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm040.gif?pub-status=live)
could enjoy the same rate of convergence and limiting distribution of νθ(m). The fact that generally θ is unknown could be a strong point in favor of the feasible estimate, even if we suspect that θ = 0, although, as shown in the next section, the insertion of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm041.gif?pub-status=live)
could importantly affect first-order asymptotic properties of the estimates.
Finally, note that we could have equally considered the class of estimates
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm042.gif?pub-status=live)
with
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm043.gif?pub-status=live)
, which in certain cases could be superior to νa(m). More precisely, under Assumption C, yt and xt(θ) are CI(δ,γ) cointegrated, whereas, noting that under this condition
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm044.gif?pub-status=live)
yt(−θ) and xt are CI(δ + θ,γ + θ) cointegrated. This implies that when θ > 0,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm045.gif?pub-status=live)
could enjoy a faster convergence rate than νa(m), the opposite happening when θ < 0 (see, e.g., Robinson and Marinucci, 2001). For simplicity, we only consider νa(m), although the treatment of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm046.gif?pub-status=live)
could be addressed in a similar way.
3. MAIN ASYMPTOTIC RESULTS
First, we concentrate on the situation of balanced orders, with
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm025.gif?pub-status=live)
We collect results corresponding to this case in two theorems that cover the noncointegrating and cointegrating situations, respectively. Most of these results are well known in the literature and are presented just for completeness.
Denote by W(r) the 2 × 1 vector Brownian motion with covariance matrix I2 and define the (Type II; see Marinucci and Robinson, 1999) fractional Brownian motion
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm047.gif?pub-status=live)
and the 2 × 1 vectors
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm048.gif?pub-status=live)
By ⇒ we will mean convergence in the Skorohod space D[0,1] endowed with the J1 topology and related to the different analyzed estimates denote
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm049.gif?pub-status=live)
under (21), noting that when θ = 0, ν1 = ν3, ν2 = ν4. The expressions νi, i = 1,2 and i = 3,…,6 will be referred to as undifferenced-regressor (U) and differenced-regressor (D) estimates, respectively, noting the previous caution about the equality of the different types of estimates when θ = 0. The main results will be given in a sequence of theorems linked to different cointegrating situations and estimates. For most of the theorems the full characterization of the limiting distributions and rates of convergence is given in Appendix A.
THEOREM BNCUD (Balanced orders, no cointegration, U and D estimates). Let (6)–(8), (21), (25), and Assumptions NC and PE hold. Then, as n → ∞,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm026.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm027.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm028.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm029.gif?pub-status=live)
Theorem BNCUD refers to the situation of spurious cointegration. We omit the proof of (26) and (27) because the first result is a straightforward application of Theorem 1 of Marinucci and Robinson (2000) and the continuous mapping theorem, whereas (27) is implied by Lemma 1 of Marmol and Velasco (2004). The proof of (28) and (29) is given in Appendix B. Note that as A(1)ΣA′(1) = LL′, where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm050.gif?pub-status=live)
from the Cholesky decomposition. We could equally represent the limiting distribution of ν1 by (see Marmol and Velasco, 2004)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm030.gif?pub-status=live)
Note also that (27)–(29) under Assumption PE imply that all estimates enjoy the same limiting distribution, so these estimates are not consistent for the fundamental parameter ν. As is well known, this inconsistency turns into consistency with reduced rank of Ω, as is readily seen from (30) when ρ2 = 1. However, this information is not sufficient to derive the limiting distribution of the estimates, this being precisely the reason why the characterization in Assumption C(i) comes into play. Defining
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm051.gif?pub-status=live)
the next theorem, given just for completeness, collects results given in Robinson and Marinucci (2001) for OLS and NBLS estimates.
THEOREM BCU (Balanced orders, cointegration, U estimates). Let (6)–(8), (21), (25), and Assumption C hold. Then, as n → ∞,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm052.gif?pub-status=live)
See Appendix A for full characterization of pi(n), Ξi(γ,δ), i = 1,2. As in Robinson and Marinucci (2001), we were unable to characterize the precise limiting distribution of our estimates in the case δ + γ > 1, 0 < γ ≤ ½. It is also important to note that Assumption C is sufficient for the set of conditions needed for the different results given in Robinson and Marinucci (2001) to hold. In particular, Assumption C(ii) is sufficient for the cumulant spectral density related conditions and also implies square integrability of the individual spectra of u1t, v2t, and fourth-order stationarity of ut. We do not consider in this theorem the behavior of ν5, ν6, because this is covered by the more general Theorem UCD, given subsequently, which assumes θn = θ, where θ is not necessarily 0.
Now, we turn to the weakly unbalanced cointegration case, where θn → 0 as n → ∞. Although it is not needed for our proofs, it is convenient to visualize θn as a monotonic sequence, and to have a neater interpretation of the asymptotically negligible departures from the equality of the integration orders, we specify further the θn sequence, setting
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm031.gif?pub-status=live)
where η is a nonzero real finite number and hn is a positive sequence such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm032.gif?pub-status=live)
Note that η reflects the direction of the asymptotically negligible perturbation of the order of integration of xt,n, as depending on whether η > 0 or η < 0, θn tends to 0 from above or below, respectively. We only consider the infeasible estimates νi, i = 1,2, hence avoiding the difficulty associated with the fact that now
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm053.gif?pub-status=live)
would estimate not a fixed parameter θ but a sequence of parameters, whereas it does not seem realistic to assume knowledge of θn. In any case, the main point of the theorem that follows is that in a situation of standard fractional cointegration, a small perturbation of the order of integration of one of the observables could be asymptotically negligible.
THEOREM ABCU (Asymptotically balanced orders, cointegration, U estimates). Let (6)–(8), (21), (31), (32), and Assumption C hold. Then, as n → ∞,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm054.gif?pub-status=live)
See Appendix A for full characterization of qi(n), Fi(γ,δ,ν,η), i = 1,2, and Appendix B for the formal derivations of those results. The interpretation of this theorem is straightforward. The small perturbation in the order of integration of xt,n produces some additional terms in the expansion of νi − ν. Only one of these terms is competitive with the difference νi − ν when θn = 0. Thus, the speed at which θn tends to 0 determines the relative importance of these two terms. Heuristically, on the one hand, if θn converges relatively fast to 0, the presence of the small perturbation θn in the order of xt,n does not affect the limiting distribution of the estimates with respect to the situation where θn = 0. On the other hand, if θn converges slowly, the term previously referred to arising by the presence of the perturbation dominates, and the estimates have a degenerate limiting distribution. Apart from these situations, there is an exact rate for θn at which those two terms are balanced, so that contributions from both appear in the limiting distributions of the estimates. In this case, the effect on the situation where θn = 0 is simply to shift the asymptotic distribution by −νη, whereas the rates of convergence remain unchanged. Finally, note that (32) is a consistency condition, and as long as it holds, the OLS and NBLS will be consistent estimates with possibly very slow rates of convergence.
Finally, we discuss the situation where (9) holds, so in the case θ ≠ 0, the integration orders of the observables are truly different. The first theorem devoted to this situation collects results, mainly based on Robinson and Marinucci (2001), for OLS and NBLS estimates. As we might guess, because of the imbalance between the orders, these estimates are not consistent for ν. We derive all results under Assumption NC, although we did not actually use condition ρ2 < 1, and the results given subsequently are also valid in the case ρ2 = 1, which would denote a situation of cointegration. It can be seen easily that if ρ2 = 1, some terms in the asymptotic distributions given later could disappear, although this is not always the case. For the cases where it is possible to fully characterize the limiting distributions, we also give in Appendix A an alternative representation of them highlighting this fact and also stressing the dependence of these distributions on the fundamental parameter ν.
THEOREM UU (Unbalanced orders, U estimates). Let (6)–(9) and (21) with θ ≠ 0 and Assumption NC (without the requirement ρ2 < 1) hold. Then, as n → ∞,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm055.gif?pub-status=live)
See Appendix A for full characterization of ri(n), Θi(θ,δ), Πi(δ + θ), i = 1,2. These results are mainly taken from Robinson and Marinucci (2001), but we indicate and justify in Appendix B the exact steps that do not follow directly from this reference. As expected, U estimates are not consistent for ν because of the imbalance between the orders of integration, so, in most cases, νi, i = 1,2, converge to zero or infinity depending on the different region of the (θ,δ) space on which we focus.
Finally, we propose a sensible solution to the problem of unbalanced series based on the previously defined D estimates, for which the following theorem holds.
THEOREM UCD (Unbalanced orders, cointegration, D estimates). Let (6)–(9), (21), and Assumptions C and PE hold. Then, as n → ∞,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm056.gif?pub-status=live)
See Appendix A for full characterization of si(n), Gi(γ,δ,ν,Λ), i = 3,…,6. We omit the proof of this theorem because it is almost identical to that of Theorem ABCU. Trivially, results for νi, i = 3,4, are identical to those of Theorem BCU, because taking θ differences on the regressor simply balances the series to turn the situation into one of traditional cointegration, νi, i = 3,4, behaving accordingly. Results for νi, i = 5,6, are similar to those of Theorem ABCU, and they also have a straightforward interpretation. The feasible filtering of the regressor
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm057.gif?pub-status=live)
instead of xt(θ)) makes some additional terms appear in the expansion of νi − ν, i = 5,6. One of these terms is in all cases Op(gn−1 log n), so that unless this term converges faster to 0 than the usual term in νi − ν, i = 3,4, it dominates, obtaining for the feasible estimates in all cases the limiting distribution
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm033.gif?pub-status=live)
which is simply the asymptotic distribution of the estimate of θ, with a slightly different convergence rate (because of the log−1 n factor), and premultiplied by the unknown parameter ν. Note that if Λ is distributed as a zero-mean Gaussian random variable, the value of ν affects the limiting variance of the feasible estimates. In any case, it is worth stressing that as long as (23) holds, the feasible estimates are always consistent. Note finally that if θ is estimated semiparametrically, we expect gn to be relatively slow, this being transmitted to the rate of convergence of the feasible estimates of ν. In the next section we will propose alternative estimates of ν that do not suffer from the serious drawback of having slow rates of convergence. On the contrary, they achieve optimal rates.
4. ALTERNATIVE FEASIBLE ESTIMATES UNDER COINTEGRATION
In this section, we consider the model (6), (7) with (9) and work under Assumption C. Noting that the results presented in Theorem UCD may be unsatisfactory in some cases because of the distortive effect of the insertion of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm058.gif?pub-status=live)
instead of the true imbalance parameter θ, in the spirit of Hualde and Robinson (2006) and Robinson and Hualde (2003), we propose similar estimates to theirs, considering the possible imbalance of the integration orders of the observables. As will become clear subsequently, these estimates enjoy the same optimal asymptotic properties of the ones previously referred to, namely, optimal rates of convergence and standard limiting distribution, implying, among other consequences, that straightforward inference on the value of ν is readily available.
First, we discuss the case where β > ½ (termed strong fractional cointegration by Hualde and Robinson, 2004a). For the sake of a clear exposition we consider initially the situation where B(L) = I2 in (12), so that ut is a white noise. Denoting by ψij the (i,j)th element of Ψ and by εit, i = 1,2, the ith element of εt, assuming γ, δ, θ are known real numbers, the pseudo maximum likelihood estimate of ν is identical to the OLS estimate of ν in the equation
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm034.gif?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm059.gif?pub-status=live)
(see, e.g., Phillips, 1991, in which this result is derived for the case γ = θ = 0, δ = 1). Robinson and Hualde (2003) showed that under mild regularity conditions this estimate is nβ-consistent with mixed-normal limiting distribution. However, in practice it is unrealistic to assume knowledge of γ, δ, and/or θ, and therefore our proposed estimate is in general infeasible. Fortunately, the results of Robinson and Hualde (2003) imply that as long as the estimates of the orders, say,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm060.gif?pub-status=live)
are nκ-consistent with
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm035.gif?pub-status=live)
estimating ν from the OLS regression of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm061.gif?pub-status=live)
on
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm062.gif?pub-status=live)
is asymptotically equivalent (to first-order properties) to estimating ν from (34). As in Robinson and Hualde (2003) the main difficulty here is to find estimates of the orders for which (35) holds, because almost
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm063.gif?pub-status=live)
-consistency could be required. Our present framework adds more difficulties because of the unknown nature of θ, and whereas δ and δ + θ can be estimated easily from yt and xt, respectively, to estimate γ from residuals, preliminary estimates of the parameters explaining the long-run linkage between the series, namely, ν, θ, are needed, as opposed to the Robinson and Hualde (2003) situation, where only preliminary estimation of ν was required (although θ = 0 was assumed to be known). Thus, although showing rigorously the properties of the estimates of the orders goes beyond the scope of the present paper, we propose several sensible estimation procedures that are relatively simple to implement. The simplest one is to base a semiparametric procedure on the residuals
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm064.gif?pub-status=live)
, where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm065.gif?pub-status=live)
is the difference between semiparametric estimates of the integration orders of xt and yt. Provided β ≥ 1, this is a valid strategy, but for β < 1 this procedure does not ensure obtaining estimates of γ with the required rates of convergence. However, we could easily improve upon this method by exploiting parametric assumptions on ut. Keeping the discussion within the white noise framework, δ + θ is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm066.gif?pub-status=live)
-consistently estimable from xt by various methods including
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm036.gif?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm067.gif?pub-status=live)
is a compact set such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm068.gif?pub-status=live)
(see Hualde and Robinson, 2004b). Similarly, we could estimate γ and θ simultaneously by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm037.gif?pub-status=live)
for an appropriate choice of m, where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm069.gif?pub-status=live)
are compact sets such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm070.gif?pub-status=live)
. The exact properties of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm071.gif?pub-status=live)
could be difficult to justify, but in view of Hualde and Robinson (2004b) and (33), our guess is that the rate
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm072.gif?pub-status=live)
/log n is achievable for both estimates, and hence (35) is satisfied for any β > ½.
In the general I(0) case, we can adapt the estimation procedure proposed in Robinson and Hualde (2003) to our present framework. In particular, if ut is I(0) with spectral density h(λ) (see (14)) depending on a vector of short-memory parameters φ, such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm073.gif?pub-status=live)
we could define
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm074.gif?pub-status=live)
where setting zt(c,d) = (yt(c),xt(d))′,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-12693-mediumThumb-S0266466606060361ffm075.jpg?pub-status=live)
Thus, for an estimate of the short-memory parameters
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm076.gif?pub-status=live)
and estimates of the orders
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm077.gif?pub-status=live)
, we propose
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm038.gif?pub-status=live)
where ν* denotes
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm078.gif?pub-status=live)
. Each estimate in (38) reflects situations of different knowledge about the structure of the model. Note that the first four estimates with θ = 0 are identical to those presented in Robinson and Hualde (2003). Again, the problem is obtaining
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm079.gif?pub-status=live)
with the required properties, but procedures incorporating the ideas developed in the white noise situation and those in Velasco and Robinson (2000), hence employing as loss functions possibly tapered parametric Whittle likelihoods, should provide estimates with rate of convergence
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm080.gif?pub-status=live)
/log n. In any case, in view of Hualde and Robinson (2004a), even if only nκ-consistent estimates of the nuisance parameters with arbitrarily small but positive κ are available, narrow band versions of the frequency domain estimates in (38) could also enjoy optimal asymptotic properties for an adequate choice of the bandwidth, semiparametric extensions also being possible.
The situation where β < ½ was referred to by Hualde and Robinson (2006) as weak fractional cointegration. In view of the results of this paper, if ut is a white noise and γ, δ, and θ are known, the OLS in (34) would produce
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm081.gif?pub-status=live)
-consistent and asymptotically normal estimates of ν, this result being easily extendable to the situation where ut is an autoregressive process of finite order p, so that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm039.gif?pub-status=live)
in (12). In the case where the orders are unknown, additional difficulties arise because of the presence of the unknown parameter θ, but as in the case of strong cointegration, and following the lines of Hualde and Robinson (2006), it seems possible to deal with this issue, although
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm082.gif?pub-status=live)
-consistent estimates of the orders are needed to estimate ν
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm083.gif?pub-status=live)
-consistently.
If ut is a white noise, we could improve upon the method proposed in (37) by first estimating δ + θ by (36) and then γ and γ + θ by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm040.gif?pub-status=live)
where ν(c,b,d), τ(c,b,d) are the estimated slopes corresponding to regressors xt(b), xt(d), respectively, in the OLS regression of yt(c) on xt(b) and xt(d). In this case the
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm084.gif?pub-status=live)
-consistent estimate of
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm085.gif?pub-status=live)
. As in Hualde and Robinson (2006), this procedure can be easily extended to the case where in (39) Bj is upper-triangular for all j = 1,…, p, noting that this is also a valid strategy when β > ½, in which case our proposed estimate of ν is nβ-consistent with mixed-normal limiting distribution and the estimates of the orders are also
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm086.gif?pub-status=live)
-consistent.
5. MONTE CARLO EVIDENCE
With the aim of analyzing the finite-sample equivalent of the asymptotic phenomena under wide-sense cointegration described in the paper, we carried out a small Monte Carlo experiment. We generated ut in (12) as a white noise process of dimensions n = 64, 128, 256, with ψ11 = ψ22 = 1, varying the correlation ψ12 (taking values 0, 0.5, −0.5). Then, we considered (6), (7), and (11) with ν = 1 and employed six combinations of (γ,δ) given by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm087.gif?pub-status=live)
noting that the fourth and sixth cases correspond to weak cointegration, whereas the rest represent different situations of strong cointegration. In the first part of the study, we analyze the case of weakly unbalanced cointegration, where θn in (7) takes four different values θn = θn(i), i = 1,…,4, where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm088.gif?pub-status=live)
for which we study in terms of Monte Carlo bias (defined as the estimate minus ν) and standard deviation over 1,000 replications the behavior of ν1, ν2, noting that by Theorems ABCU and BCU, these estimates are consistent for all choices of θn except θn(1). Results are given in Tables 2, 3 and 4 (for the bias in Tables 2 and 4; for standard deviations in Tables 3 and 5). Clearly, in almost all cases bias decreases as θn decreases in absolute value, the sign of θn being inversely related to the sign of the bias because of the relative dominance of the denominator (numerator) of the estimates when θn is positive (negative). Although the inconsistency of ν1, ν2 when θn = θn(1) is evident by looking at the evolution of the bias, the decrease of the bias as n increases is very slow for θn = θn(2). Biases for θn = θn(3) are larger than when the orders of integration of the observables are strictly balanced, that is, when θn = 0, but not far from them and reacting in a similar way when n increases. As expected, for θn = θn(i), i = 3,4, bias corresponding to both estimates tends to decrease as β increases, this not being the case when θn = θn(2) (most evident for ρ = −0.5). In fact, the very small bias for (γ,δ) = (0, 0.6), (0.4, 0.8), (0.7, 1) for ρ = −0.5 is the most surprising result of this part of the Monte Carlo experiment, some cancellations probably taking place, although in this case, at least when β < ½, no clear evidence of decreases in bias as n increases is observed. For θn = θn(i), i = 3,4, bias tends to increase as |ρ| increases, this also being true for θn = θn(2) when ρ = 0.5 (but not when ρ = −0.5). Finally, whereas for ρ ≠ 0 and θn = θn(i), i = 3,4, ν2 clearly beats ν1 (most noticeably for (γ,δ) = (0, 0.6)), this is not the case for θn = θn(2), especially when ρ = −0.5, when ν1 dominates ν2 (especially for small β).
Monte Carlo bias of ν1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-45056-mediumThumb-S0266466606060361tbl002.jpg?pub-status=live)
Monte Carlo standard deviation of ν1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-41652-mediumThumb-S0266466606060361tbl003.jpg?pub-status=live)
Monte Carlo bias of ν2
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-19275-mediumThumb-S0266466606060361tbl004.jpg?pub-status=live)
Monte Carlo standard deviation of ν2
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-73633-mediumThumb-S0266466606060361tbl005.jpg?pub-status=live)
Standard deviations are reported in Tables 3 and 5. In almost all cases they decrease as n increases, including the situation θn = θn(1), which suggests that the inconsistency of the estimates due to the (slowly converging to zero) imbalance between the orders is mainly due to a bias problem. In fact, in many cases, the smallest variances correspond to this inconsistent case. In general, standard deviations tend to decrease as β increases, the effect of changes in ρ not being very clear. Values of standard deviations for the two estimates are very similar, although some superiority of ν1 when β < ½ is noted, whereas for (γ,δ) = (0, 0.6), in general ν2 beats ν1.
The second part of the Monte Carlo experiment focuses on the strongly unbalanced cointegration situation. Here, the only difference with respect to the previous analysis is that we consider
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm089.gif?pub-status=live)
We reported results corresponding to six different estimates, which are as follows: ν1, ν2, their feasible D estimate counterparts ν5, ν6, respectively, ν7, which is the OLS estimate of ν in (34), and its feasible version, denoted by ν8. Noting that OLS and NBLS do not rely on any parametric assumption about the structure of ut, to compute ν5 and ν6, the estimate of θ was calculated as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm090.gif?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm091.gif?pub-status=live)
is the version of the log periodogram estimate of Geweke and Porter-Hudak (1983) proposed by Robinson (1995a) without pooling or trimming, applied to the series
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm092.gif?pub-status=live)
adding back to the estimate 1 or 2 in the case xt* = xt(1) or = xt(2), respectively. Similarly,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm093.gif?pub-status=live)
was computed in the same way by applying the log periodogram to the series
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm094.gif?pub-status=live)
instead. In all cases the bandwidths of the estimates of the orders (and in fact also of ν2 and ν6) were m = 10, 20, 40, corresponding to n = 64, 128, 256, respectively. Alternatively, the feasible estimate ν8 was derived from the OLS regression (34) with γ, γ + θ, δ + θ replaced by corresponding parametric estimates derived as in (36) and (40). We fixed the sets where the respective functions were optimized in the following way:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm095.gif?pub-status=live)
, which in all cases contains the asymptotic 95% confidence interval
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm096.gif?pub-status=live)
, where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm097.gif?pub-status=live)
. Note that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm098.gif?pub-status=live)
are infeasible sets, but we found them reasonably large. In particular, the upper bound
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm099.gif?pub-status=live)
seems sensible because we found it unrealistic that a very small β (less than 0.05) could be detected.
Monte Carlo bias and standard deviations are reported in Tables 6, 7, 8, 9, 10, and 11. In Tables 6, 8, and 10 the inconsistency of ν1 and ν2 is clearly reflected, with very large negative (positive) biases related to positive (negative) values of θ. Their corresponding feasible D estimates ν5 and ν6 behave in a rather unsatisfactory way. In general, smallest biases correspond to the cases (γ,δ) = (0, 0.6), (0.4, 0.8), (0.7, 1), but they do not react in the appropriate direction as n increases (except for some cases with ρ = −0.5). On the contrary, for the cases with δ = 1.2, 2, larger positive biases are reported, but in general, they decrease as n increases, usually very slowly however, indicating that a faster estimate of θ or larger sample sizes may be needed to obtain acceptable results for this class of estimates. Also, although their values are relatively similar, the NBLS appears to be inferior to the OLS. As expected, ν7 performs extremely well, with very small biases sharing the sign of ρ and decreasing in absolute value as β increases. Larger biases are reported for its feasible version ν8, although this estimate still works well when β ≥ 0.8, whereas for the other cases, biases are large but react appropriately when n increases. Finally, it is worth mentioning that ρ has a rather different effect on ν8 from the one reported in Robinson and Hualde (2003). Now, it seems that in comparison to the ρ = 0 situation, positive and negative correlation benefits and worsens the estimate, respectively.
Monte Carlo bias, ρ = 0
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-22311-mediumThumb-S0266466606060361tbl006.jpg?pub-status=live)
Monte Carlo standard deviation, ρ = 0
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-53513-mediumThumb-S0266466606060361tbl007.jpg?pub-status=live)
Monte Carlo bias, ρ = 0.5
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-65884-mediumThumb-S0266466606060361tbl008.jpg?pub-status=live)
Monte Carlo standard deviation, ρ = 0.5
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-85656-mediumThumb-S0266466606060361tbl009.jpg?pub-status=live)
Monte Carlo bias, ρ = −0.5
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-30333-mediumThumb-S0266466606060361tbl010.jpg?pub-status=live)
Monte Carlo standard deviation, ρ = −0.5
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-20689-mediumThumb-S0266466606060361tbl011.jpg?pub-status=live)
Monte Carlo standard deviations are reported in Tables 7, 9, and 11. For the cases of strong cointegration the smallest values correspond to ν7, closely followed by ν8 when (γ,δ) = (0,2) and the inconsistent estimates ν1, ν2 otherwise. For the weak cointegration situation, ν1 and ν2 are best, followed by ν7. In general, for β ≥ 0.8, ν8 beats ν5 and ν6, the opposite happening when β < 0.8, although in this case the values corresponding to ν8 enjoy faster decreases as n increases, noting that these standard deviations are severely harmed by replications where the parametric estimates of γ + θ and δ + θ are very close to each other. Standard deviations decrease as n increases for all estimates.
Finally, the last part of the Monte Carlo is devoted to motivating the correct use of graphical tools to detect the possible presence of strongly unbalanced cointegration. Thus, we generated pairs of time series of dimension n = 1,000 as in the previous two parts of the experiment, fixing ψ12 = 0 and concentrating on different values of γ, δ, θ. In Figures 1 and 2 we present the situation where δ = 1.4, θ = 0.4, and under the heading of cointegrated and no-cointegrated we denote situations where γ = 0 and γ = δ in (11), respectively. Similarly, Figures 3 and 4, 5 and 6, and 7 and 8 represent pairs of series generated as in the first case (but with different seed originating the white noise process), for the cases (δ,θ) = (1, 0.8), (0.8, 0.4), and (0.6, 0.8), respectively, where in all situations γ = 0 or γ = δ corresponds to the cases of cointegration and noncointegration, respectively. Note that if γ = 0, processes yt and xt(θ) are CI(δ,0) cointegrated, whereas if γ = δ, they are not cointegrated. Graphs on the left of each figure simply represent both time series as a function of time. Clearly, the main consequence of the gap between the integration orders is a different dimension in the series, most evident when θ = 0.8. Hence, these figures simply suggest that the orders of integration of the observable series are different, but it is usually not possible to assess therefrom whether there is an intrinsic linkage between the series or not, it being hard to make a guess about the possible existence of unbalanced cointegration. This picture changes dramatically when focusing on the graphs on the right of each figure. These represent exactly the same series as on the corresponding left graphs but with the important difference that the smallest integration order series is drawn with respect to a different scale given on the right vertical axis. Here, it is evident that strong comovements exist (especially when θ = 0.4) between the two unbalanced series in the case that cointegration exists, whereas, as expected, the cotrending does not appear when the series are not cointegrated. We admit that this is not a significative study, and undoubtedly further empirical and Monte Carlo investigation is needed, but our intention in showing these figures was simply to give the flavor of the possible existence of intrinsic linkages between series with different orders of integration and thus motivate empirical researchers to interpret real data following these lines.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-03791-mediumThumb-S0266466606060361fig001g.jpg?pub-status=live)
Strongly unbalanced cointegrated I(1.8), I(1.4) series.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-53032-mediumThumb-S0266466606060361fig002g.jpg?pub-status=live)
Noncointegrated I(1.8), I(1.4) series.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-09660-mediumThumb-S0266466606060361fig003g.jpg?pub-status=live)
Strongly unbalanced cointegrated I(1.8), I(1) series.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-51990-mediumThumb-S0266466606060361fig004g.jpg?pub-status=live)
Noncointegrated I(1.8), I(1) series.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-97283-mediumThumb-S0266466606060361fig005g.jpg?pub-status=live)
Strongly unbalanced cointegrated I(1.2), I(0.8) series.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-85020-mediumThumb-S0266466606060361fig006g.jpg?pub-status=live)
Noncointegrated I(1.2), I(0.8) series.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-05946-mediumThumb-S0266466606060361fig007g.jpg?pub-status=live)
Strongly unbalanced cointegrated I(1.4), I(0.6) series.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-38623-mediumThumb-S0266466606060361fig008g.jpg?pub-status=live)
Noncointegrated I(1.4), I(0.6) series.
APPENDIX A: Characterization of Limiting Distributions
THEOREM BCU.
(i) If γ + δ > 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-65027-mediumThumb-S0266466606060361frm041.jpg?pub-status=live)
(ii) if γ + δ = 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-71197-mediumThumb-S0266466606060361ffm100.jpg?pub-status=live)
(iii) if γ + δ < 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm101.gif?pub-status=live)
THEOREM ABCU.
(i) If γ + δ > 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-63537-mediumThumb-S0266466606060361ffm102.jpg?pub-status=live)
where for any two sequences an, bn, ∼ denotes that an /bn → 1 as n → ∞;
(ii) if γ + δ = 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-92783-mediumThumb-S0266466606060361ffm103.jpg?pub-status=live)
where 1a = 1(γ = 0) + a1(γ > 0);
(iii) if γ + δ < 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-87564-mediumThumb-S0266466606060361ffm104.jpg?pub-status=live)
THEOREM UU.
(i) If 2δ + θ > 1, δ + θ > ½
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-86518-mediumThumb-S0266466606060361ffm105.jpg?pub-status=live)
(ii) if 2δ + θ > 1, 0 ≤ δ + θ < ½
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm042.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm043.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm044.gif?pub-status=live)
(iii) if 2δ + θ = 1, 0 < δ + θ < ½
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-15456-mediumThumb-S0266466606060361ffm106.jpg?pub-status=live)
(iv) if 2δ + θ = 1, δ + θ = 0 (or equivalently δ = 1, θ = −δ)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-29463-mediumThumb-S0266466606060361ffm107.jpg?pub-status=live)
(v) if 2δ + θ < 1, δ + θ ≥ 0
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-22475-mediumThumb-S0266466606060361ffm108.jpg?pub-status=live)
Alternative Representation of the Limiting Distributions of Theorem UU. Under the same conditions of Theorem UU, assuming ω12 ≠ 0, as n → ∞,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm109.gif?pub-status=live)
where ri(n) are the normalizing sequences given in Theorem UU and denoting gij(λ) = fij(λ)/fij(0), i,j = 1,2,
(i) if 2δ + θ > 1, δ + θ > ½
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm110.gif?pub-status=live)
(ii) if 2δ + θ > 1, δ + θ = 0 (or equivalently δ > 1, θ = −δ)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-23151-mediumThumb-S0266466606060361ffm111.jpg?pub-status=live)
(iii) if 2δ + θ = 1, 0 < δ + θ < ½
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm112.gif?pub-status=live)
(iv) if 2δ + θ = 1, δ + θ = 0 (or equivalently δ = 1, θ = −δ)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-53499-mediumThumb-S0266466606060361ffm113.jpg?pub-status=live)
(v) if 2δ + θ < 1, δ + θ ≥ 0
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-66376-mediumThumb-S0266466606060361ffm114.jpg?pub-status=live)
THEOREM UCD.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm139.gif?pub-status=live)
(i) If γ + δ > 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-94902-mediumThumb-S0266466606060361ffm115.jpg?pub-status=live)
(ii) if γ + δ = 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-92567-mediumThumb-S0266466606060361ffm116.jpg?pub-status=live)
where 1a = 1(γ = 0) + a1(γ > 0);
(iii) if γ + δ < 1
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-90033-mediumThumb-S0266466606060361ffm117.jpg?pub-status=live)
APPENDIX B: Proofs of Theorems
Proof of Theorem BNCUD. First, we show (28). Now
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm118.gif?pub-status=live)
so that, in view of Theorems 4.4 and 5.1 of Robinson and Marinucci (2001), (28) holds on showing
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm045.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm046.gif?pub-status=live)
First, as in Robinson and Hualde (2003), by Taylor's theorem, for some R > 1, the left side of (B.1) is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm047.gif?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm119.gif?pub-status=live)
and δ is an intermediate point between
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm120.gif?pub-status=live)
. Now, in view of Lemmas D.1 and D.5 of Robinson and Hualde (2003) and Assumption PE, choosing R large enough, the dominant term in (B.3) is the first one, so that (B.1) follows from minor modifications of Theorems 4.4 and 5.1 of Robinson and Marinucci (2001), where the only difference is that the weights aj(r)(δ) are not covered by those of Robinson and Marinucci (2001) because of the presence of log factors but they just contribute the log factor in (B.1) (see, e.g., Bingham, Goldie, and Teugels, 1989, Cor. 1.7.3). The proof of (B.2) is identical to that of (B.1), to conclude for (28). Finally, (29) follows from minor modifications of Propositions 4.1 and 4.2 of Robinson and Marinucci (2003). █
Proof of Theorem ABCU. We give only the proof for ν1, the proof for ν2 being almost identical in view of Propositions 4.1 and 4.2 of Robinson and Marinucci (2003). Now
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm048.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm049.gif?pub-status=live)
The first term on the right-hand side of (B.4) is the usual one appearing with strictly balanced orders, whose asymptotic behavior was discussed in Theorem BCU. Next, as in the proof of Theorem BNCUD, by Taylor's theorem and (32), it is readily seen that the dominant term on the second term on the right-hand side of (B.4) is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm121.gif?pub-status=live)
Taking derivatives in (2),
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm050.gif?pub-status=live)
where ψ(x) is the digamma function
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm122.gif?pub-status=live)
By Taylor's theorem, the first term on the right-hand side of (B.6) is equal to
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm051.gif?pub-status=live)
where ψ(r)(x) represents the rth derivative of the digamma function
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm123.gif?pub-status=live)
with the convention ψ(0)(x) = ψ(x) and j < j + δ < j + δ. Noting that as in the proof of Lemma D.1 of Robinson and Hualde (2003), for l ≥ 1,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm124.gif?pub-status=live)
the first term in the expansion (B.7) is the dominant one. By (6.3.21) in Abramowitz and Stegun (1970, p. 259), for j > 0
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm052.gif?pub-status=live)
The absolute value of the third term is bounded by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm125.gif?pub-status=live)
implying from (B.8) that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm126.gif?pub-status=live)
Thus, the first term in (B.6) is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm053.gif?pub-status=live)
By Marinucci and Robinson (2000), the first term in (B.9) is Op(tδ−1/2 log t), whereas, by Stirling's approximation, the second one is Op(1 + log t1(δ = 1) + tδ−11(δ > 1)). In addition, by Marinucci and Robinson (2000) the second term in (B.6) is Op(tδ−1/2), so that by Marinucci and Robinson (2000) and the continuous mapping theorem
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm127.gif?pub-status=live)
Finally, noting that due to (32), (B.5) is in all cases of smaller order than the first term on the right-hand side of (B.4), we conclude the proof of the theorem. █
Proof of Theorem UU. First, (i) follows from direct application of Theorem 1 of Marinucci and Robinson (2000), the continuous mapping theorem, and Theorems 4.4 and 5.1 of Robinson and Marinucci (2001). Next, under (ii), xt is asymptotically stationary, and defining
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm128.gif?pub-status=live)
which is that covariance stationary version of xt, it is straightforward to show that under our conditions
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm129.gif?pub-status=live)
(see, e.g., Hualde and Robinson, 2006). By the Cauchy–Schwarz inequality, the proof of (A.2) would be complete on showing
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm054.gif?pub-status=live)
The expectation on the left-hand side of (B.10) is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20170407182308-82206-mediumThumb-S0266466606060361ffm130.jpg?pub-status=live)
as δ + θ < ½, so (B.10) holds, to complete the proof of (A.2). Next, (A.3) follows on showing
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm131.gif?pub-status=live)
Now,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm055.gif?pub-status=live)
Related to the first term, by Theorem 1 of Robinson (1994),
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm132.gif?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm133.gif?pub-status=live)
where the symbol ∼ indicates here that the ratio of left- and right-hand sides tends to 1 as λ tends to 0. Thus,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm056.gif?pub-status=live)
noting that when δ + θ = 0, (B.12) reflects the standard result for I(0) processes (see, e.g., Brockwell and Davis, 1991, Thm. 10.4.1),
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm134.gif?pub-status=live)
Next, the second term in (B.11) is equal to
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361frm057.gif?pub-status=live)
By the Cauchy–Schwarz inequality, the second term is bounded by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm135.gif?pub-status=live)
By previous arguments
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm136.gif?pub-status=live)
and by Robinson (2005),
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm137.gif?pub-status=live)
so the second term of (B.13) is Op(m1/2 log1/2 m) if δ + θ = 0, or Op(n2(δ+θ)m1/2−(δ+θ)) if δ + θ > 0, so that it is of smaller order than the first on the right-hand side of (B.11). Similarly, the first term of (B.13) is of smaller order. Finally, the third term on the right-hand side of (B.11) is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20170215071410291-0625:S0266466606060361:S0266466606060361ffm138.gif?pub-status=live)
by Marinucci and Robinson (2000), also of smaller order, to conclude the proof of (A.3).
Finally, (A.4) holds as (A.1), and the results corresponding to (iii), (iv), and (v) are straightforward applications of previous arguments and results in Robinson and Marinucci (2001). █