
A SIMPLE TEST OF NORMALITY FOR TIME SERIES

Published online by Cambridge University Press:  01 August 2004

Ignacio N. Lobato
Affiliation:
Instituto Tecnológico Autónomo de México (ITAM)
Carlos Velasco
Affiliation:
Institució Catalana de Recerca i Estudis Avançats and Universitat Autònoma de Barcelona

Abstract

This paper considers testing for normality for correlated data. The proposed test procedure employs the skewness-kurtosis test statistic, but studentized by standard error estimators that are consistent under serial dependence of the observations. The standard error estimators are sample versions of the asymptotic quantities that do not incorporate any downweighting, and, hence, no smoothing parameter is needed. Therefore, the main feature of our proposed test is its simplicity, because it does not require the selection of any user-chosen parameter such as a smoothing number or the order of an approximating model.

We are very grateful to Don Andrews and two referees for useful comments and suggestions. We are especially thankful to a referee who provided a FORTRAN code. Lobato acknowledges financial support from Asociación Mexicana de Cultura and from Consejo Nacional de Ciencia y Tecnología (CONACYT) under project grant 41893-S. Velasco acknowledges financial support from the Spanish Dirección General de Enseñanza Superior, BEC 2001-1270.

Research Article
© 2004 Cambridge University Press

1. INTRODUCTION

There has been recent interest in testing for normality for economic and financial data. For instance, Bai and Ng (2001) test for normality in a set of macroeconomic series, whereas Bontemps and Meddahi (2002) emphasize financial applications. Kilian and Demiroglu (2000) present a variety of cases where testing for normality is of interest for econometricians. These applications include financial and economic ones where, for instance, assessing whether abnormal financial profits or economic growth rates are normal is important for the specification of financial and economic models. They also present methodological applications where testing for normality is a previous step for the design of some tests, such as tests for structural stability or tests of forecast encompassing.

In econometrics, testing for normality is customarily performed by means of the skewness-kurtosis test. The main reasons for its widespread use are its straightforward implementation and interpretation. The skewness-kurtosis test statistic is the sum of the squares of the sample skewness and sample excess kurtosis coefficients, properly standardized by their asymptotic variances in the white noise case, 6 and 24, respectively. Implementing the skewness-kurtosis test is very simple because it compares the test statistic against upper critical values of a chi-squared distribution with two degrees of freedom (χ₂²). This test is typically applied to the residual series of dynamic econometric models (see, e.g., Lütkepohl, 1991, Sect. 4.5).
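As a concrete reference point, the white-noise version of the statistic can be sketched in a few lines. Python is our choice here, and the function and variable names are ours; the paper itself contains no code:

```python
import numpy as np

def sk_statistic(x):
    """Skewness-kurtosis (Jarque-Bera) statistic: squared sample skewness
    and squared sample excess kurtosis, scaled by their white-noise
    asymptotic variances 6 and 24."""
    x = np.asarray(x, dtype=float)
    n = x.size
    c = x - x.mean()
    m2, m3, m4 = (np.mean(c ** p) for p in (2, 3, 4))
    skew = m3 / m2 ** 1.5
    exkurt = m4 / m2 ** 2 - 3.0
    return n * (skew ** 2 / 6.0 + exkurt ** 2 / 24.0)

# Compare against the upper 5% critical value of chi-squared(2), about 5.99.
rng = np.random.default_rng(0)
sk = sk_statistic(rng.standard_normal(1000))
```

Under i.i.d. Gaussian data the statistic is approximately χ₂² distributed, so the 5% test rejects when it exceeds roughly 5.99.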

In many empirical studies with time series data, though, the application of the skewness-kurtosis test is questionable. The reason is that the previous asymptotic variances are correct under the assumption that the model is correctly specified, which implies that the sequence under examination is uncorrelated. However, on many occasions the researcher might either specify the model incorrectly or not even be interested in modeling the serial correlation. In both cases, when the considered data are correlated, the asymptotic variances are no longer 6 and 24 but functions of all the autocorrelations. In this situation the skewness-kurtosis test is invalid because it does not control the type I error asymptotically.

In this paper we propose to employ the standard test statistic based on the sample skewness and sample kurtosis, but studentized by standard error estimators that are consistent under serial dependence of the observations. The standard error estimators are sample versions of the asymptotic quantities that do not incorporate any downweighting, and, hence, no smoothing parameter is needed. These standard error estimators are consistent even though the asymptotic standard errors involve infinite sums of terms that depend on all autocorrelations. The reason is that in the expression of the asymptotic standard errors, the autocorrelations enter raised to the cubic or fourth powers. Hence, the powers of the sample autocorrelations provide stochastic dampening factors, similar to the nonstochastic dampening factors that appear in the standard nonparametric approach. By contrast, Bai and Ng (2001) and Bontemps and Meddahi (2002) rely on smoothing with kernel methods.

Our test can employ either frequency domain or time domain estimators of the asymptotic variances of the sample skewness and the sample excess kurtosis. Although the proposed test is based on a time domain estimator, in the technical part of the paper (the Appendixes) we stress a frequency domain estimator because it is easier to handle theoretically. In addition, for conciseness of exposition, we only analyze the univariate case.

The plan of the paper is the following. Section 2 presents the framework. Section 3 introduces the proposed test statistic and studies its asymptotic theory. Section 4 discusses the proposed variance estimators. Section 5 examines the case where the considered series are the residuals of regression and time series models. Section 6 considers the finite sample performance of the proposed test in a brief Monte Carlo exercise. The technical material is included in the Appendixes.

2. FRAMEWORK

Notation. Let x_t be an ergodic strictly stationary process with mean μ and centered moments denoted by μ_k = E(x_t − μ)^k for natural k, with

$$\hat{\mu}_k = \frac{1}{n} \sum_{t=1}^{n} (x_t - \bar{x})^k$$

being the corresponding sample moments, where x̄ is the sample mean and n is the sample size. In addition, γ(j) = E[(x_t − μ)(x_{t+j} − μ)] denotes the population autocovariance of order j, and

$$\hat{\gamma}(j) = \frac{1}{n} \sum_{t=1}^{n-|j|} (x_t - \bar{x})(x_{t+|j|} - \bar{x})$$

is the corresponding sample autocovariance. Notice that μ₂ = γ(0). Let f(λ) be the spectral density function of x_t, defined by

$$\gamma(j) = \int_{\Pi} f(\lambda) e^{ij\lambda}\, d\lambda, \qquad (1)$$

where Π = [−π, π], and let I(λ) denote the periodogram, I(λ) = |w(λ)|², where

$$w(\lambda) = \frac{1}{\sqrt{2\pi n}} \sum_{t=1}^{n} (x_t - \bar{x}) e^{it\lambda}.$$

In addition, κ_q(j₁,…,j_{q−1}) denotes the qth-order cumulant of x₁, x_{1+j₁},…, x_{1+j_{q−1}}, and the marginal cumulant of order q is κ_q = κ_q(0,…,0).

Null and alternative hypotheses. The null hypothesis of interest is that the marginal distribution of x_t is normal. For the independent case, omnibus tests for this null hypothesis have been proposed, such as the Shapiro–Wilk test (Shapiro and Wilk, 1965), which is based on order statistics, or tests based on the distance between the empirical distribution function and the normal cumulative distribution function, such as the Kolmogorov–Smirnov, the Cramér–von Mises, or the Anderson–Darling test. A test based on the L2 distance between the Gaussian and empirical characteristic functions has been introduced by Epps and Pulley (1983) and developed by Henze and others. For more details see Mardia (1980), Henze (1997), Epps (1999), and references therein. For the independent case, the omnibus tests are consistent, but it has been shown that their finite sample performance can be very poor (see, e.g., Shapiro, Wilk, and Chen, 1968). For the weakly dependent case, no such analysis exists because inference with these omnibus test statistics is problematic: their asymptotic distributions are nonstandard and case dependent. Hence, the standard application of these tests to weakly dependent time series is invalid (see Gleser and Moore, 1983). The only developed test of which we are aware is the one by Epps (1987), which is based on the characteristic function. However, Epps's procedure is hard to implement because sensing functions g(x_t, v) have to be selected, a joint spectral density of the sensing functions has to be found, a matrix has to be inverted, and a quadratic form has to be minimized to estimate the marginal mean and variance. In addition, there is the disadvantage of having to choose the parameters v that enter g(·, v).

In practice, instead of the previous omnibus tests, the common procedure simply tests whether the third and fourth marginal moments coincide with those of the normal distribution. Equivalently, in terms of cumulants, one tests that the third and fourth marginal cumulants are zero instead of testing that all higher order marginal cumulants are zero. We follow this practice, and in this paper we test that the marginal distribution is normal by testing that μ₃ = 0 and μ₄ = 3μ₂². Of course, the derived tests are not consistent because they cannot detect deviations from normality that are not reflected in the third or fourth moments.

The skewness-kurtosis test. This test compares the skewness-kurtosis test statistic

$$SK = \frac{n\,\hat{\mu}_3^2}{6\,\hat{\mu}_2^3} + \frac{n\,(\hat{\mu}_4 - 3\hat{\mu}_2^2)^2}{24\,\hat{\mu}_2^4}$$

against upper critical values of a χ₂² distribution (see Bowman and Shenton, 1975). Apart from the fact that Jarque and Bera (1987) have shown the optimality of this test within the Pearson family of distributions, the popularity of this approach resides in its simplicity, as we mentioned previously. In fact, nowadays most econometrics packages customarily report the SK test, which is called the Jarque–Bera test.

The SK test procedure is justified on the following grounds. When the considered series x_t is an uncorrelated Gaussian process, the following limiting result holds:

$$SK \to_d \chi_2^2, \qquad (2)$$

where →_d denotes convergence in distribution. However, when x_t is a Gaussian process satisfying the weak dependence condition

$$\sum_{j=-\infty}^{\infty} |\gamma(j)| < \infty, \qquad (3)$$

the result (2) is replaced by

$$\sqrt{n}\,\hat{\mu}_3 \to_d N(0,\, 6F(3)) \quad \text{and} \quad \sqrt{n}\,(\hat{\mu}_4 - 3\hat{\mu}_2^2) \to_d N(0,\, 24F(4)), \qquad (4)$$

where

$$F(k) = \sum_{j=-\infty}^{\infty} \gamma(j)^k \qquad (5)$$

for k = 3,4 (see Lomnicki, 1961; Gasser, 1975). Notice that condition (3) guarantees that all F(k) are well defined because it entails that Σ_j |γ(j)|^r < ∞ for all natural r.

Hence, when the series exhibits serial correlation, the SK test is invalid because the denominators of its components do not estimate consistently the true asymptotic variances in (4), implying that asymptotically its rejection probabilities do not coincide with the desired nominal levels under the null hypothesis.
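To see how far the white-noise values can be off, consider a Gaussian AR(1) with unit innovation variance, for which γ(j) = φ^|j|/(1 − φ²) and the sums Σ_j γ(j)^k have closed forms. A small numerical sketch (a hypothetical illustration of the size of the distortion; the parameter value is ours):

```python
import numpy as np

# For a Gaussian AR(1) with unit innovation variance,
# gamma(j) = phi^|j| / (1 - phi^2), so F(k) = sum_j gamma(j)^k has the
# closed form gamma(0)^k * (1 + phi^k) / (1 - phi^k).
def F(k, phi, nlags=10_000):
    g0 = 1.0 / (1.0 - phi ** 2)          # gamma(0)
    j = np.arange(1, nlags + 1)
    return g0 ** k + 2.0 * np.sum((g0 * phi ** j) ** k)

phi = 0.5
g0 = 1.0 / (1.0 - phi ** 2)
ratio3 = F(3, phi) / g0 ** 3             # variance inflation for the skewness term
ratio4 = F(4, phi) / g0 ** 4             # variance inflation for the kurtosis term
```

For φ > 0 both ratios exceed 1, so the white-noise variances 6μ₂³ and 24μ₂⁴ understate the true variances 6F(3) and 24F(4), and the SK test overrejects.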

3. THE GENERALIZED SKEWNESS-KURTOSIS TEST

In the previous section we have seen that the SK test is invalid when the considered process xt exhibits serial correlation. One strategy to overcome this problem is to carry out a two-step test where the SK procedure is applied after testing that the considered series is uncorrelated. However, this solution is not simple because there is an obvious pretest problem in such a sequential procedure and, furthermore, testing for uncorrelatedness for non-Gaussian series is rather challenging (see Lobato, Nankervis, and Savin, 2002).

Looking at (4) two natural solutions appear. The first one consists of modifying the SK test statistic by including consistent estimators of F(3) and F(4) in the denominators of its components. This solution is proposed by Gasser (1975, Sect. 6), who suggested truncating the infinite sums that appear in the asymptotic variances. However, he did not provide any formal analysis or any recommendation about the selection of the truncation number. As we will see, our proposed procedure overcomes these difficulties because it does not require the selection of any truncation number. The second solution estimates the unknown asymptotic variances with the bootstrap; that is, it employs the SK test statistic with bootstrap-based critical values. Implementing the bootstrap in a time series context is problematic because generally valid bootstrap procedures require the introduction of an arbitrary user-chosen number, typically a block length (see, e.g., Davison and Hinkley, 1997, Ch. 8). Therefore in this paper we follow the first approach. Furthermore, in our case the bootstrap does not present a clear theoretical advantage because the SK statistic is not asymptotically pivotal.

Before introducing our test statistic, let us consider the following estimator of F(k), which is the sample analog of (5):

$$\hat{F}(k) = \sum_{j=1-n}^{n-1} \hat{\gamma}(j)^k. \qquad (6)$$

In the next section we consider alternative versions of this estimator and study their large sample properties; in particular, Lemma 1 establishes the consistency of F̂(k) for Gaussian processes that satisfy condition (3). Then, our proposed test statistic, the generalized SK statistic, is

$$G = \frac{n\,\hat{\mu}_3^2}{6\,\hat{F}(3)} + \frac{n\,(\hat{\mu}_4 - 3\hat{\mu}_2^2)^2}{24\,\hat{F}(4)}.$$

The G statistic does not require the introduction of any user-chosen number, and, in view of (4) and Lemma 1 in the next section, the proposed test consists of comparing the G test statistic against upper critical values from a χ₂² distribution.
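A minimal implementation sketch of the G statistic, assuming F̂(k) is the unweighted sum of the kth powers of all sample autocovariances as in (6); the function names are ours:

```python
import numpy as np

def sample_autocov(x):
    """gamma_hat(j), j = 0,...,n-1, with the usual 1/n scaling."""
    x = np.asarray(x, dtype=float)
    n = x.size
    c = x - x.mean()
    return np.array([np.dot(c[: n - j], c[j:]) / n for j in range(n)])

def g_statistic(x):
    """Generalized SK statistic: skewness and excess-kurtosis terms
    studentized by 6*Fhat(3) and 24*Fhat(4), with Fhat(k) the unweighted
    sum of the k-th powers of all sample autocovariances."""
    x = np.asarray(x, dtype=float)
    n = x.size
    c = x - x.mean()
    m2, m3, m4 = (np.mean(c ** p) for p in (2, 3, 4))
    g = sample_autocov(x)
    F3 = g[0] ** 3 + 2.0 * np.sum(g[1:] ** 3)
    F4 = g[0] ** 4 + 2.0 * np.sum(g[1:] ** 4)
    return n * m3 ** 2 / (6.0 * F3) + n * (m4 - 3.0 * m2 ** 2) ** 2 / (24.0 * F4)
```

As with SK, the test compares G against upper χ₂² critical values; note that no truncation number or bandwidth appears anywhere in the computation.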

In the next assumption we introduce the class of processes under the alternative hypothesis for which both F̂(3) and F̂(4) converge to bounded positive constants, and hence whenever μ₃ ≠ 0 or μ₄ ≠ 3μ₂² the G test rejects with probability tending to 1 as n tends to infinity. Notice that the conditions of Gasser (1975), which involve summability conditions on cumulants of all orders, are relaxed to cumulants up to order 16 using an extension of Theorem 3 in Rosenblatt (1985, p. 58).

Assumption A. The process x_t satisfies E|x_t|¹⁶ < ∞, and, for q = 2,3,…,16,

$$\sum_{j_1=-\infty}^{\infty} \cdots \sum_{j_{q-1}=-\infty}^{\infty} \left| \kappa_q(j_1, \ldots, j_{q-1}) \right| < \infty, \qquad (7)$$
and, for k = 3,4,

where

denotes the σ-field generated by xt, t ≤ −j, and, for k = 3,4,

Assumption A is a weak dependence assumption implying that the higher order spectral densities up to the sixteenth order are bounded and continuous. For the case q = 2, expression (7) implies that condition (3) holds. We require finite moments up to the sixteenth order because we need to evaluate the variance of the fourth power of the sample autocovariances. Notice that condition (9) ensures that the asymptotic variances of the estimates are positive.

The following theorem establishes the asymptotic properties of the G test.

THEOREM 1.

(i) Under the null hypothesis and for Gaussian processes that satisfy condition (3), G →_d χ₂².

(ii) Under Assumption A, the test statistic G diverges to infinity whenever μ₃ ≠ 0 or μ₄ ≠ 3μ₂².

The asymptotic null distribution is straightforward to derive given the consistency of F̂(k) for F(k), which is proved in Lemma 1 in the next section. The proof of (ii) is omitted because it follows easily from the fact that, under the alternative hypothesis, F̂(k) converges to a bounded positive constant (by (7) and (9)), whereas the numerator of G diverges.

4. CONSISTENT VARIANCE ESTIMATORS

Following the literature on nonparametric estimation of asymptotic covariance matrices, the standard approach to estimating F(k) consistently employs a smoothed estimator such as

$$\check{F}(k) = \sum_{j=1-n}^{n-1} w_j\, \hat{\gamma}(j)^k. \qquad (10)$$

In (10) the weights {w_j} are usually obtained through a lag window, w_j = w(j/M), such that the weight function w(·) satisfies some regularity properties and M is a smoothing number that grows slowly with n. Note that the introduction of the smoothing number leads to estimators whose rate of convergence is usually slower than the parametric rate. We stress that in this approach the weights {w_j} provide a nonstochastic dampening of the γ̂(j)^k for large j. Because of this dampening, the estimator in (10) is consistent for (5), just as in the case k = 1, where f(0) is consistently estimated by autocorrelation-robust estimators (see, e.g., Robinson and Velasco, 1997).

As mentioned in the introduction, the main problem with the smoothing approach is that statistical inference can be very sensitive to the selection of the user-chosen weights; in our context, the discussion in Section I in Robinson (1998) is especially relevant. In the absence of a clear and rigorously justified procedure to select the smoothing number in our testing framework, we prefer to analyze estimators that do not require any smoothing.
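For contrast, the smoothed approach in (10) can be sketched as follows, taking a Bartlett lag window as an illustrative choice of w(·); M is exactly the user-chosen number that the G test avoids:

```python
import numpy as np

def smoothed_F(x, k, M):
    """Smoothed estimator of F(k) with a Bartlett lag window
    w(j/M) = max(0, 1 - |j|/M); M is the user-chosen smoothing number."""
    x = np.asarray(x, dtype=float)
    n = x.size
    c = x - x.mean()
    gam = np.array([np.dot(c[: n - j], c[j:]) / n for j in range(n)])
    w = np.maximum(0.0, 1.0 - np.arange(n) / M)
    return w[0] * gam[0] ** k + 2.0 * np.sum(w[1:] * gam[1:] ** k)
```

Different choices of M produce different estimates, and hence different test outcomes, which is precisely the sensitivity the unsmoothed estimators (6) and (12) sidestep.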

Our first estimator F̂(k), introduced in equation (6), also admits a frequency domain version (see Appendix A). For technical reasons, in this paper we consider a second estimator, which can be motivated by writing F(k) in terms of the spectral density function of the x_t process using (1):

$$F(k) = 2\pi \int_\Pi \cdots \int_\Pi f(\lambda_1) \cdots f(\lambda_{k-1})\, f(\lambda_1 + \cdots + \lambda_{k-1})\, d\lambda_1 \cdots d\lambda_{k-1}. \qquad (11)$$

The sample analog of the previous equation renders the following alternative estimator for F(k):

$$\tilde{F}(k) = \frac{(2\pi)^k}{n^{k-1}} \sum_{j_1=1}^{n-1} \cdots \sum_{j_{k-1}=1}^{n-1} I(\lambda_{j_1}) \cdots I(\lambda_{j_{k-1}})\, I(\lambda_{j_1} + \cdots + \lambda_{j_{k-1}}), \qquad (12)$$

where λ_j = 2πj/n. The estimator F̃(k) can also be written in the time domain by plugging

$$I(\lambda) = \frac{1}{2\pi} \sum_{|j| < n} \hat{\gamma}(j) e^{-ij\lambda} \qquad (13)$$

into equation (12); after some algebra, the resulting time domain expression (14) is derived in Appendix A. Notice that both expressions for F̃(k) are numerically identical, but in the Appendixes, for technical reasons, we stress the frequency domain version (12). Expression (12) guarantees that F̃(k) is positive in finite samples.

The next lemma states the consistency of F̃(k) for F(k). This lemma is the substantive technical contribution of the paper. Its proof is in Appendix B.

LEMMA 1. Under the null hypothesis, for Gaussian time series that satisfy condition (3) and for k = 3,4, (i) F̃(k) →_p F(k) and (ii) F̂(k) − F̃(k) →_p 0.

At first look, consistency of F̃(k) could be surprising because no smoothing parameter has been introduced. Robinson (1998) analyzes a special regression model where smoothing is not necessary for establishing consistency of asymptotic covariance matrix estimators. The reason is that the specific form of the covariance matrix that he considers (see his equation (1.2)) allows for a stochastic dampening of some sample autocovariances by other sample autocovariances. The time domain versions (6) and (14) provide a similar intuition, where the powers of the sample autocovariances provide the stochastic dampening factors.

In the frequency domain, (11) provides a complementary explanation. Recall that in time series the standard problem is that the relevant asymptotic variance depends on the spectral density function evaluated at a unique point, typically the zero frequency, f (0). However, in our case (11) shows that the asymptotic variance, F(k), is a convolution of the spectral density function, instead of a single value. Intuitively, in the first case a user-chosen smoothing number is required to estimate the local quantity, f (0), whereas in our case no such number is needed because we are estimating a global quantity.
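The k = 2 analog of this convolution identity (not used by the test itself, but the case with the shortest algebra) can be checked numerically for an AR(1): Σ_j γ(j)² = 2π ∫_Π f(λ)² dλ. A sketch, with the AR(1) spectral density written in closed form (the parameter value is ours):

```python
import numpy as np

# Check sum_j gamma(j)^2 = 2*pi * integral over Pi of f(lambda)^2 for an
# AR(1) with unit innovation variance, where
# f(lambda) = (2*pi)^(-1) / (1 - 2*phi*cos(lambda) + phi^2).
phi = 0.6
g0 = 1.0 / (1.0 - phi ** 2)                          # gamma(0)
lhs = g0 ** 2 * (1.0 + phi ** 2) / (1.0 - phi ** 2)  # closed form of sum_j gamma(j)^2

lam = np.linspace(-np.pi, np.pi, 200_001)
f = (1.0 / (2.0 * np.pi)) / (1.0 - 2.0 * phi * np.cos(lam) + phi ** 2)
rhs = 2.0 * np.pi * np.sum(f ** 2) * (lam[1] - lam[0])  # Riemann sum of the integral
```

The left-hand side is a global functional of f, which is why no local smoothing number is needed to estimate it.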

5. RESIDUAL TESTING

The previous sections analyze the case where raw data are under examination. However, in practice the test is commonly applied to the residuals of regression or time series models. Again, two approaches can be used: first, the G test that we propose and, second, employing the SK statistic with bootstrap-based critical values. The bootstrap has been employed by Kilian and Demiroglu (2000). However, as mentioned in Section 3, application of the bootstrap is not an obvious task in a time series context. Kilian and Demiroglu perform a parametric bootstrap that could be justified if the model were correctly specified, although in this case the SK test would also be asymptotically valid. However, in the absence of knowledge of the true data generating process, a parametric bootstrap is invalid; that is, there is no guarantee that the type I error is controlled properly asymptotically. As mentioned previously, bootstrap procedures valid for time series require the introduction of a user-chosen number, typically a block length, complicating statistical inference in finite samples.

Next, we introduce a general assumption that validates the use of the G statistic applied to the residuals of many dynamic econometric models where the correlation structure is not correctly specified or is not specified at all. In this section, x̂_t denotes the residuals of the regression or time series model, and x_t denotes the true disturbances.

Assumption B. Let the Gaussian process x_t satisfy (3), and let the residuals x̂_t satisfy the two conditions in (15).

The first condition in (15) guarantees the consistency of the estimates of F(k) based on residuals, whereas the second guarantees that the residual SK test has the same asymptotic distribution as the original SK test. Assumption B is very general and covers many interesting cases, such as linear regressions with possibly trending stochastic and deterministic regressors that satisfy Grenander's conditions and weakly dependent errors. In this case x̂_t = x_t − (β̂ − β)′Z_t, where Z_t is a p-dimensional sequence of regressors, so (15) bounds the estimation error (β̂ − β)′Z_t while allowing for the components of β̂ to have different convergence rates. A leading example with stochastic Z_t is a regression between cointegrated variables. For stationary Z_t, another interesting application is when x̂_t are the residuals obtained through possibly misspecified AR(p) regressions; that is, x̂_t = y_t − β̂′Z_t with Z_t = (y_{t−1},…,y_{t−p})′, and β̂ →_p β for some vector β such that the polynomial β(L) = 1 − β₁L − ⋯ − β_pL^p has no roots on or inside the unit circle. For this case, if Assumption B holds for y_t, the limit process x_t = y_t − β′Z_t = β(L)y_t inherits the weak dependence properties of y_t, but notice that x_t is autocorrelated unless y_t follows an AR(q) process with q ≤ p.
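As a hypothetical numerical illustration of this AR(p) example (the coefficients and sample size are ours, not the paper's): fitting an AR(1) by OLS to data generated from an AR(2) leaves residuals x̂_t that are clearly autocorrelated, which is exactly the situation where the white-noise variances 6 and 24 are wrong but the G studentization remains valid:

```python
import numpy as np

# Fit an AR(1) by OLS to data from an AR(2); the residuals remain
# autocorrelated. (Illustrative simulation; coefficients are ours.)
rng = np.random.default_rng(1)
n = 2000
y = np.zeros(n + 100)
eps = rng.standard_normal(n + 100)
for t in range(2, n + 100):
    y[t] = 0.5 * y[t - 1] + 0.3 * y[t - 2] + eps[t]
y = y[100:]                                   # drop burn-in

# Misspecified AR(1) regression y_t = beta * y_{t-1} + x_t, estimated by OLS
Z, yy = y[:-1], y[1:]
beta_hat = np.dot(Z, yy) / np.dot(Z, Z)
resid = yy - beta_hat * Z                     # the x_hat_t series

c = resid - resid.mean()
rho1 = np.dot(c[:-1], c[1:]) / np.dot(c, c)   # first residual autocorrelation
```

Population algebra gives β ≈ 0.71 (the lag-one autocorrelation of y_t) and a residual lag-one autocorrelation of about −0.21; the sample quantities should be close for n = 2000.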

In Appendix C we prove the following lemma, which shows that the use of residuals does not affect the consistent studentization that we propose in this paper.

LEMMA 2. Under the null hypothesis and Assumption B, for k = 3,4, the estimate F̂(k) computed from the residuals x̂_t remains consistent for F(k).

Finally, using the previous lemma and Hölder's inequality, it is straightforward to prove the next theorem, which establishes that the asymptotic null distribution of the G test statistic applied to the residuals of many dynamic econometric models whose correlation structure is ignored or misspecified is still χ₂², and that whenever μ₃ ≠ 0 or μ₄ ≠ 3μ₂² the G test rejects with probability tending to 1 as n tends to infinity.

THEOREM 2. Let Ĝ be the test statistic G calculated from the residuals x̂_t. (i) Under the null hypothesis, condition (3), and Assumption B, Ĝ →_d χ₂². (ii) Under Assumptions A and B, Ĝ diverges to infinity whenever μ₃ ≠ 0 or μ₄ ≠ 3μ₂².

6. FINITE SAMPLE PERFORMANCE

This section compares briefly the finite sample behavior of the previous tests with the Epps (1987) test. Under the null hypothesis we generate data from an AR(1) process xt = φxt−1 + εt, where εt is independent and identically distributed N(0,1) and the autoregressive parameter φ takes eight values: −0.9,−0.5, 0, 0.5, 0.6, 0.7, 0.8, and 0.9. We report the results for a detailed grid of positive values of φ because positive autocorrelation is particularly relevant for many empirical applications.

Along with the null hypothesis of normality, we also consider testing the null that the skewness is zero by using the first components of the SK and G statistics. Namely, we compute the skewness test statistic

$$S = \frac{n\,\hat{\mu}_3^2}{6\,\hat{\mu}_2^3}$$

and the generalized skewness test statistic

$$GS = \frac{n\,\hat{\mu}_3^2}{6\,\hat{F}(3)}$$

and compare them with upper critical values from a χ₁² distribution. We have not reported the results of a kurtosis test because of the well-known slow convergence of the sample kurtosis to the normal asymptotic distribution even in the white noise case (see, e.g., Bowman and Shenton, 1975, p. 243). In Tables 1A and 1B we report the empirical rejection probabilities for the tests for three sample sizes, n = 100, 500, and 1,000, and three nominal levels, α = 0.10, 0.05, and 0.01. In these experiments 5,000 replications are carried out.
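A scaled-down version of this size experiment for the skewness tests can be sketched as follows (purely illustrative: 500 replications instead of 5,000, and only φ = 0.5; the statistic formulas follow our reading of S and GS above):

```python
import numpy as np

# Scaled-down size experiment: empirical rejection rates of the skewness
# test S (white-noise variance 6*m2^3) and the generalized version GS
# (variance 6*Fhat(3)) for Gaussian AR(1) data.
def s_and_gs(x):
    n = x.size
    c = x - x.mean()
    m2, m3 = np.mean(c ** 2), np.mean(c ** 3)
    gam = np.array([np.dot(c[: n - j], c[j:]) / n for j in range(n)])
    F3 = gam[0] ** 3 + 2.0 * np.sum(gam[1:] ** 3)
    return n * m3 ** 2 / (6.0 * m2 ** 3), n * m3 ** 2 / (6.0 * F3)

rng = np.random.default_rng(2)
phi, n, reps, crit = 0.5, 100, 500, 3.84      # 3.84 = upper 5% value of chi2(1)
rej_s = rej_gs = 0
for _ in range(reps):
    e = rng.standard_normal(n + 50)
    x = np.zeros(n + 50)
    for t in range(1, n + 50):
        x[t] = phi * x[t - 1] + e[t]
    s, gs = s_and_gs(x[50:])
    rej_s += s > crit
    rej_gs += gs > crit
# Per Table 1A, rej_s/reps should sit noticeably above 0.05 (S overrejects
# for phi > 0), while rej_gs/reps should be close to 0.05.
```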

Empirical rejection probabilities for 3 sample sizes and 3 nominal levels

The main conclusions derived from Table 1A are the following. For the case of testing symmetry, the S test is not reliable, since it severely underrejects when φ < 0 and substantially overrejects when φ > 0. This result could be expected because, when φ is negative, Σ_{j≠0} γ(j)³ is negative, leading to overestimation of the asymptotic variance and hence to underrejection of the S test, whereas when φ is positive the opposite effect occurs. The most interesting evidence is the magnitude of these distortions, which are very large for negative values of φ and all sample sizes, whereas for positive φ the distortions increase steadily with the sample size. On the contrary, for the GS test the empirical rejection probabilities are very close to the nominal levels for all parameter values and all sample sizes (the only exception is n = 100 with φ = 0.9).

Table 1B reports the results for testing normality with the three tests: SK, G, and Epps. The SK test, which is the sum of the skewness test and the kurtosis test, inherits their characteristics. Notice that for the cases where φ < 0, there is a fair amount of compensation between the skewness and kurtosis components, making the distortions of the SK test much smaller than those of its components. The G test inherits the slow convergence from its kurtosis component, but using the white noise case as benchmark, it appears to be robust to the presence of moderate serial correlation. When |φ| = 0.9, the G test is severely affected by its kurtosis component; in fact, even for n = 1,000 the G test appears to be very conservative. For the cases φ = 0.7 and φ = 0.8, a similar pattern can be observed. Similar to the G test, the Epps test is also insensitive to moderate serial correlation. However, for the case φ = −0.9, and also for the most interesting cases where φ ≥ 0.7, the Epps test appears to be too liberal.

We also conducted power experiments for data generated by the previous AR(1) model for six different distributions: standard log-normal, Student's t with 10 degrees of freedom, χ₁², χ₁₀², beta with parameters (1,1), and beta with parameters (2,1). Although distributions with bounded support are not that popular in econometrics, it is well known that in the independent and identically distributed (i.i.d.) setting the SK test performs very poorly against such alternatives; hence, it is of interest to examine the performance of the G test in these difficult cases. Table 2 reports the power results for the G and the Epps tests for three sample sizes, n = 100, 500, and 1,000, and for a 5% nominal level. In these experiments 2,000 replications are carried out. The main conclusions from these tables are the following. For both tests it appears that the sign of the autocorrelation has little relevance in terms of power (although generally the empirical power is slightly greater for positive φ). Using white noise as the reference case, higher values of |φ| lead to a decrease in the empirical power that in some cases is very pronounced. The empirical rejection probabilities for the G test are particularly high for heavily skewed distributions such as the log-normal or the χ₁²; for these cases the G test is clearly preferable to the Epps test. When the distribution is symmetric or slightly skewed, both tests are comparable. For the t₁₀ and the χ₁₀² distributions, the G test presents higher empirical power, especially for a moderate degree of serial correlation. For these cases and when |φ| = 0.9, the tests present very low empirical power even for n = 1,000. Notice that when n = 1,000 and φ = 0.7 or 0.8, for the χ₁₀² case, the empirical powers of both the Epps test and, especially, the G test are moderately high, but the power deteriorates suddenly for φ = 0.9.

For the beta distributions, both tests (and especially the G test) appear to be very sensitive to a high degree of serial correlation. In fact, when |φ| = 0.9, the power of both tests is very low even when n = 1,000. Here again, there is a sudden decrease in the empirical power when φ increases from 0.6 to 0.7 for n = 500 and when φ increases from 0.7 to 0.8 for n = 1,000.

Empirical rejection probabilities at the 0.05 nominal levels for the G and Epps (E) tests for 3 sample sizes

We end with a suggestion for further research. In this section we have seen that for small sample sizes, because of the slow convergence of the sample kurtosis coefficient, the G test presents significant size distortions even in the white noise case. One potential way of improving the finite sample performance is by using the bootstrap. Because the G test statistic is asymptotically pivotal, it can be expected that application of the bootstrap will deliver an asymptotic refinement. Hence, it would be interesting to study the implementation of the G statistic with bootstrap-based critical values.

APPENDIX A

This Appendix provides the alternative versions of F̂(k) and F̃(k). First, the F̂(k) estimator can be written in the frequency domain as follows:

$$\hat{F}(k) = \int_\Pi \cdots \int_\Pi I(\lambda_1) \cdots I(\lambda_k)\, D_n(\lambda_1 + \cdots + \lambda_k)\, d\lambda_1 \cdots d\lambda_k,$$

where the kernel D_n(v) = Σ_{|j|<n} e^{ijv} satisfies ∫_Π D_n(v) dv = 2π and D_n(v) → 2πδ(v = 0) as n → ∞, where δ represents Dirac's delta function. Hence, for large n we obtain the following approximate expression for F̂(k) in the frequency domain:

$$\hat{F}(k) \approx 2\pi \int_\Pi \cdots \int_\Pi I(\lambda_1) \cdots I(\lambda_{k-1})\, I(\lambda_1 + \cdots + \lambda_{k-1})\, d\lambda_1 \cdots d\lambda_{k-1}. \qquad (A.1)$$

Equation (12) is the natural discrete approximation of (A.1).
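The two stated properties of D_n — it integrates to 2π over Π and piles up at the origin as n grows — can be verified numerically, assuming D_n is the Dirichlet-type kernel D_n(v) = Σ_{|j|<n} e^{ijv} (our reading of the appendix):

```python
import numpy as np

# D_n(v) = sum over |j| < n of exp(i j v) = 1 + 2 * sum_{j=1}^{n-1} cos(j v).
# It should integrate to 2*pi over [-pi, pi] and satisfy D_n(0) = 2n - 1.
n = 50
v = np.linspace(-np.pi, np.pi, 40_001)
D = 1.0 + 2.0 * np.sum(np.cos(np.outer(np.arange(1, n), v)), axis=0)
integral = np.sum(D) * (v[1] - v[0])   # Riemann sum, approximately 2*pi
peak = D[v.size // 2]                  # value at v = 0, equals 2n - 1 = 99
```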

Second, to obtain the time domain expression of F̃(k), we plug (13) into equation (12). The resulting expression involves sums of products of sample autocovariances weighted by

$$\varphi_n(\lambda) = \sum_{t=1}^{n} e^{it\lambda}.$$

Finally, using that φ_n(λ_j) = 0 if λ_j = 2πj/n with j ≠ 0 mod n, that φ_n(0) = n, denoting the indicator function by 1, and using that γ̂(j) is even, the inner sums over the Fourier frequencies λ_{j_i}, i = 1,…,k − 1, can be evaluated, and (14) then follows immediately.
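The stated values of φ_n at the Fourier frequencies — φ_n(0) = n and φ_n(λ_j) = 0 for j ≠ 0 mod n — are quick to check numerically, assuming φ_n(λ) = Σ_{t=1}^n e^{itλ}:

```python
import numpy as np

# phi_n(lambda) = sum over t = 1..n of exp(i t lambda), evaluated at the
# Fourier frequencies lambda_j = 2*pi*j/n: equals n at j = 0, else 0.
n = 8
t = np.arange(1, n + 1)
vals = [np.sum(np.exp(1j * t * 2.0 * np.pi * j / n)) for j in range(n)]
```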

APPENDIX B

Proof of Lemma 1(i). We just report the analysis for F̃(3) because the analysis for F̃(4) is similar but notationally more involved. We prove consistency by checking the sufficient conditions that F̃(3) is asymptotically unbiased and that its variance goes to zero as n → ∞.

First, we consider the expectation of F̃(3), given in (B.1).

Using the definition of I(λ),

where the summation in ν runs for all possible partitions ν = ν1∪···∪νq, q = 1,2,3 of the 6-tuple

such that

and where cum(ν_i) stands for cum(w_{ν_i(1)},…,w_{ν_i(p_i)}) (see Brillinger, 1981, pp. 20–21).

To evaluate the expectation (B.1), by Gaussianity the only cumulants different from zero are second-order cumulants, κ2, with q = 3. Hence

where the sum in κ₂³ is over all the different 3-tuples ν₁∪ν₂∪ν₃ of pairs ν_i = (ν_i(1), ν_i(2)) formed with all the permutations of the coefficients in (B.2). In fact, following Brillinger (1981, Theorem 4.3.1), the only relevant combinations in the sum in κ₂³ are those for which ν_i(1) + ν_i(2) = 0 mod n, i = 1,2,3. Therefore, using that |φ_n(μ)| ≤ 2 min{|μ|⁻¹, n} (see Zygmund, 1977, pp. 49–51), and the continuity of f(λ) implied by (3), we obtain that (B.3) is

where Φ_n⁽²⁾(μ) = (2πn)⁻¹|φ_n(μ)|² and ∫_Π Φ_n⁽²⁾(μ) dμ = 1.

Second, we study the variance of F̃(3).

Now, we need to consider all the indecomposable partitions ν = ν1∪···∪νq, q = 1,…,6 of the following array with 12 elements:

By Gaussianity, the relevant partitions only involve six second-order cumulants, that is,

where the sum in κ₂⁶ is over all the different 6-tuples ν = ν₁∪···∪ν₆ of pairs ν_i = (ν_i(1), ν_i(2)) constructed in such a way that at least one ν_i in ν has elements in each of the rows of the array (B.5), to guarantee an indecomposable partition. Following the same arguments, the only terms that contribute to the leading term of the variance of F̃(3) are those in (B.6) characterized by a restriction ν_i(1) + ν_i(2) = 0 mod n for just one i ∈ {1,…,6} (e.g., j₁ = −j₁′). Then, taking into account all the possible partitions (6 × 3) and using the continuity of f, the variance of F̃(3) is o(1) as n → ∞, which is (B.7). Hence, from (B.4) and (B.7) we conclude that F̃(3) →_p F(3). █

Proof of Lemma 1(ii). Notice that F̂(k) − F̃(k) admits the decomposition (B.8). Then, setting M = n^{1/2}, the first element in (B.8) is analyzed as follows.

Now,

for 0 < t ≤ M, and using the same methods as in the proof of Lemma 1(i), it is easy to see that, for p = 2,4,6,

Hence, we obtain that for k = 3,4,

Next,

where

as n → ∞ for k = 3,4 and

.

Hence, both terms on the right-hand side of (B.9) are o_p(1). Similar reasoning shows that the remaining terms in (B.8) are also asymptotically negligible, and we conclude that F̂(k) − F̃(k) →_p 0. █

APPENDIX C

Proof of Lemma 2. Write

say. Thus,

Hence, using from Appendix B that

and the Cauchy–Schwarz inequality, we only need to show that

First,

where we have used Assumption B.

Second,

where we have employed the Cauchy–Schwarz inequality. The analysis of

is omitted because it is similar to that of

. █

REFERENCES

Bai, J. & S. Ng (2001) Tests for Skewness, Kurtosis, and Normality for Time Series Data. Preprint, Boston College.
Bontemps, C. & N. Meddahi (2002) Testing Normality: A GMM Approach. Preprint, Université de Montréal.
Bowman, K.O. & L.R. Shenton (1975) Omnibus test contours for departures from normality based on √b₁ and b₂. Biometrika 62, 243–250.
Brillinger, D.R. (1981) Time Series: Data Analysis and Theory. Holden Day.
Davison, A.C. & D.V. Hinkley (1997) Bootstrap Methods and Their Application. Cambridge University Press.
Epps, T.W. (1987) Testing that a stationary time series is Gaussian. Annals of Statistics 15, 1683–1698.
Epps, T.W. (1999) Limiting behavior of the integrated characteristic function test for normality under Gram–Charlier alternatives. Statistics and Probability Letters 42, 175–184.
Epps, T.W. & L.B. Pulley (1983) A test for normality based on the empirical characteristic function. Biometrika 70, 723–726.
Gasser, T. (1975) Goodness-of-fit tests for correlated data. Biometrika 62, 563–570.
Gleser, L.J. & D.S. Moore (1983) The effect of dependence on chi-squared and empiric distribution tests of fit. Annals of Statistics 11, 1100–1108.
Henze, N. (1997) A new approach to the BHEP tests for multivariate normality. Journal of Multivariate Analysis 62, 1–23.
Jarque, C.M. & A.K. Bera (1987) A test for normality of observations and regression residuals. International Statistical Review 55, 163–172.
Kilian, L. & U. Demiroglu (2000) Residual-based tests for normality in autoregressions: Asymptotic theory and simulation evidence. Journal of Business and Economic Statistics 18, 40–50.
Lobato, I.N., J.C. Nankervis, & N.E. Savin (2002) Testing for zero autocorrelation in the presence of statistical dependence. Econometric Theory 18, 730–743.
Lomnicki, Z.A. (1961) Tests for departure from normality in the case of linear stochastic processes. Metrika 4, 37–62.
Lütkepohl, H. (1991) Introduction to Multiple Time Series Analysis. Springer-Verlag.
Mardia, K.V. (1980) Tests of univariate and multivariate normality. In P.R. Krishnaiah (ed.), Handbook of Statistics, vol. 1, pp. 279–320. North-Holland.
Robinson, P.M. (1998) Inference-without-smoothing in the presence of nonparametric autocorrelation. Econometrica 66, 1163–1182.
Robinson, P.M. & C. Velasco (1997) Autocorrelation robust inference. In G.S. Maddala & C.R. Rao (eds.), Handbook of Statistics, vol. 15: Robust Inference, pp. 267–298. North-Holland.
Rosenblatt, M. (1985) Stationary Sequences and Random Fields. Birkhäuser.
Shapiro, S.S. & M.B. Wilk (1965) An analysis of variance test for normality (complete samples). Biometrika 52, 591–611.
Shapiro, S.S., M.B. Wilk, & H.J. Chen (1968) A comparative study of various tests for normality. Journal of the American Statistical Association 63, 1343–1372.
Zygmund, A. (1977) Trigonometric Series. Cambridge University Press.