- Research
- Open access
- Published:
Empirical likelihood inference for threshold autoregressive conditional heteroscedasticity model
Journal of Inequalities and Applications volume 2021, Article number: 56 (2021)
Abstract
This paper considers the parameter estimation problem of a first-order threshold autoregressive conditional heteroscedasticity model by using the empirical likelihood method. We obtain the empirical likelihood ratio statistic based on the estimating equation of the least squares estimation and construct the confidence region for the model parameters. Simulation studies indicate that the empirical likelihood method outperforms the normal approximation-based method in terms of coverage probability.
1 Introduction
Consider the following first-order threshold autoregressive conditional heteroscedasticity model:
where \(X_{t}^{+}=\max (X_{t},0)\), \(X_{t}^{-}=\min (X_{t},0)\), \(\varepsilon _{t}=\sqrt{h_{t}}e_{t}\), \(h_{t}=\alpha _{0}+\alpha _{1}(\varepsilon _{t-1}^{+})^{2}+\alpha _{2}( \varepsilon _{t-1}^{-})^{2}\), \(e_{t}\) is a sequence of independent and identically distributed random variables satisfying \(Ee_{t}=0\) and \(\operatorname{Var}(e_{t})=1\). \(\theta _{1}\), \(\theta _{2}\), \(\alpha _{0}\), \(\alpha _{1}\), and \(\alpha _{2}\) are the model parameters with \(\alpha _{0}>0\), \(0\leq \alpha _{j}<1\), \(j=1, 2\).
When \(\theta _{1}=\theta _{2}\), model (1) becomes the usual autoregressive model whose innovation is a conditional heteroscedasticity process. Threshold autoregressive model is a nonlinear time series model. Because the threshold autoregressive model can explain nonlinear features such as asymmetry and limit cycles, it is widely used in time series modeling (see Tong [1]). Petruccelli and Woolford [2] first defined model (1) and investigated its properties and the parameter estimation problems, but they assumed that the error sequence is a sequence of independent and identically distributed random variables. Brockwell et al. [3] and Hwang and Basawa [4] further generalized the model coefficients to be random variables. Hwang and Woo [5] first considered the parameter estimation problems when the error sequence is a conditional heteroscedasticity process and proposed to use the conditional least squares method to estimate the model parameters. In this paper, we use the empirical likelihood method to estimate the model parameters.
Similar to the parametric likelihood, Owen [6–8] introduced empirical likelihood method. It is a nonparametric likelihood method which establishes a likelihood function through placing positive probability on every one of the observed data values, but often makes no assumptions on the data-generating mechanism. Empirical likelihood has many advantages compared with the normal approximation method. For example, the limiting distribution of empirical likelihood ratio statistic is a chi-squared distribution. Therefore, we need not estimate the asymptotic variance when we construct the confidence region. Moreover, the confidence region is completely decided by the data themselves because we make no assumptions on the probability distribution of the data. These attract the attention of statisticians to make inference for all kinds of statistical models using the empirical likelihood method, such as linear regressive model [9–13], generalized linear models [14–17], and partially linear models [18–21]. In recent years, empirical likelihood method is also applied to make statistical inference about time series models, such as autoregressive model [22–24], random coefficient autoregressive model [25–28], and integer-valued autoregressive model [29–32].
In this paper, we obtain the limiting distribution of empirical log-likelihood ratio statistic and construct the confidence region for the parameters in model (1) by using the empirical likelihood method. Some simulation studies indicate that the empirical likelihood method has a higher coverage probability compared with the normal approximation-based method.
This paper is organized as follows. In Sect. 2, we present the main methods and results. Some simulation results and real data analysis are given in Sect. 3. Section 4 is concerned with the proofs of the main results. Moreover, the symbols “\(\stackrel{d}{\longrightarrow }\)” and “\(\stackrel{p}{\longrightarrow }\)” denote convergence in distribution and convergence in probability, respectively. \(O_{p}(1)\) means a term which is bounded in probability. \(o_{p}(1)\) means a term which converges to zero in probability. “Almost surely” and “independent identical distributed” are denoted by “a.s.” and “i.i.d.”, respectively.
2 Methods and main results
For model (1), Hwang and Woo [5] obtained the least square estimation of the model parameters and its limiting properties. Now we use the empirical likelihood method to estimate the model parameters. Before giving the main results, we assume that the following conditions are true:
- \((\mathbf{A1})\):
-
Probability density function \(f(\cdot )\) of \(e_{t}\) has its support on \((-\infty , +\infty )\). \(\theta _{\max }+\sqrt{\alpha _{\max }}<1 \), where \(\theta _{\max }=\max \{ \vert \theta _{1} \vert , \vert \theta _{2} \vert \}\), \(\alpha _{\max }=\max \{\alpha _{1}, \alpha _{2}\}\).
- \((\mathbf{A2})\):
-
\(E(X_{t}^{6})<\infty \).
According to Theorem 1 in Hwang and Woo [5], if \((\mathbf{A1})\) holds, then \(\{X_{t}, t\geq 1\}\) is geometrically ergodic, and the sequence \(\{X_{t}\}\) has a unique stationary distribution.
Hwang and Woo [5] used the least square method to estimate the model parameters. Let \(\theta =(\theta _{1},\theta _{2})^{\tau } \). Based on the observation data \(\{X_{0}, X_{1},\ldots , X_{n}\}\), the least square estimation \(\theta ^{*}\) of θ can be obtained by minimizing \(Q(\theta )=\sum_{t=1}^{n}(X_{t}-E(X_{t}\mid \mathscr{F}_{t-1}))^{2} =\sum_{t=1}^{n}(X_{t}-\theta _{1}X_{t-1}^{+}-\theta _{2}X_{t-1}^{-})^{2} \) with respect to θ. Solving
for θ, we know that
Let \(\mathscr{X}_{t}=(X_{t-1}^{+}, X_{t-1}^{-})^{\tau } \). Then the estimating equation (2) can be written as
Further, let \(H_{t}(\theta )=(X_{t}-\mathscr{X}_{t}^{\tau }\theta )\mathscr{X}_{t} \). By (3), we can obtain the following empirical likelihood ratio statistic:
By using the Lagrange multiplier method, it is easy to know that
where the Lagrange multiplier \(b(\theta )\) satisfies
Therefore, we have
The following theorem indicates that the limiting distribution of \(-2\log (L(\theta ))\) is a chi-squared distribution.
Theorem 2.1
If \((\mathbf{A1})\) and \((\mathbf{A2})\) hold, then when \(n \rightarrow \infty \),
where \(\chi ^{2}{(2)} \) is the chi-squared distribution with two degrees of freedom.
Using the above theorem, we can construct the empirical likelihood ratio confidence region for the parameter θ. For \(0<\delta <1\), the \(100(1-\delta )\%\) asymptotic confidence region for the parameter θ is
where \(\chi ^{2}_{\delta }(2)\) is the upper δ quantile of chi-squared distribution with two degrees of freedom.
3 Simulation studies
In this section, we carry out some simulation studies to compare the performances of our empirical likelihood (EL) method with the least square (LS) method proposed by Hwang and Woo [5] through random simulation. Consider the simulation results of model (1) in the following error sequence:
Sequence I: \(\{e_{t}\}\) is a sequence of independent and identically distributed (i.i.d.) standard normal distribution \(N(0,1)\) random variables.
Sequence II: \(\{e_{t}\}\) is an independent noise sequence with ϵ-contamination distribution, and the distribution function of \(\{e_{t}\} \) is
where \(\sigma _{i}>0\) (\(i=1, 2\)), ϵ is a fixed constant satisfying \(0 <\epsilon < 1\) and \(\Phi (x) \) is the distribution function of the standard normal random variable.
Sequence III: \(\{e_{t}\} \) is a sequence of independent and identically distributed (i.i.d.) mixing random variable sequence, and the distribution function of \(\{e_{t}\} \) is
where \(\sigma >0 \), \(0<\epsilon <1\), \(T_{k}(x)\) is the distribution function of T distribution with k degrees of freedom, \(\Phi (x) \) is the distribution function of standard normal random variable.
We calculate the coverage probabilities of the empirical likelihood and the least square methods for different model parameters. The nominal confidence level \(1-\delta \) is chosen to be 0.90. All simulation studies are based on 1000 repetitions, and the sample sizes considered in these simulations are \(n =100, 300\), and 500. The simulation results for sequence I are presented in Table 1. For sequence II, we simulate \((\epsilon , \sigma _{1}, \sigma _{2})=(0.9, 1,3)\) and \((\epsilon , \sigma _{1}, \sigma _{2})=(0.75, 1, \sqrt{7})\), and the simulation results are presented in Table 2 and Table 3, respectively. For sequence III, we simulate \((\epsilon , \sigma , k)=(0.2, 1, 6)\) and \((\epsilon , \sigma , k)=(0.5, 1, 3)\), and the simulation results are presented in Table 4 and Table 5, respectively. The first figures in parentheses are the simulation results obtained by the empirical likelihood method, and the second figures are the simulation results obtained by the least square method.
From the simulation results in Tables 1–5, it can be seen that, for different error distribution, the confidence region constructed by the empirical likelihood method has a higher coverage probabilities for different parameters, sample sizes, pollution levels, and pollution distributions. Moreover, the confidence region constructed by the empirical likelihood method is closer to the confidence level 0.90. This shows that the empirical likelihood method is more robust than the least square method.
4 Real data analysis
In this section, we use our method to fit student teacher ratio (number of teachers = 1) data in Chinese universities, which are provided by the website of China National Bureau of Statistics (http://data.stats.gov.cn/easyquery.htm?cn=C01&zb=A060E01&sj=2019). Student teacher ratio is an important index to measure the level of universities. There are 70 available observations, which are denoted by \(X_{1}, X_{2},\ldots , X_{70}\). The observations represent the yearly counts of student teacher ratio in China over the period from 1949 to 2018. Let \(Y_{t} ={X_{t}}-{X_{t-1}}\). The plot of sample path, autocorrelation function (ACF), and partial autocorrelation function (PACF) for the series \(\{Y_{t}\}\) are given in Figs. 1, 2, and 3, respectively. The corresponding plots of sample autocorrelation function (ACF) and partial autocorrelation function (PACF) indicate an \(\operatorname{AR}(1)\)-like autocorrelation structure.
In what follows, based on the observation data \(\{Y_{t}\}\), we give the figure of the empirical likelihood ratio confidence region when the confidence level is 0.95 (see Fig. 4). After a simple calculation, we know that the least square estimation \(\theta ^{*}=( 0.1616, 0.6191)\), and it is denoted by ∗ in Fig. 4. From Fig. 4, we can see that the least square estimation \(\theta ^{*}\) is in the empirical likelihood ratio confidence region. Moreover, the empirical likelihood ratio confidence region is relatively small although the confidence level is 0.95.
5 Proofs
In order to establish Theorem 2.1, we first prove the following lemmas.
Lemma 5.1
If \((\mathbf{A1})\) and \((\mathbf{A2})\) hold, then
where
Proof
Note that
Therefore we have
By the ergodic theorem, we have
According to the result of Lemma 1 in Hwang and Woo [5], we have
Combining with (11), we know that Lemma 5.1 holds. □
Lemma 5.2
If \((\mathbf{A1})\) and \((\mathbf{A2})\) hold, then
Proof
Notice that
Therefore, according to the ergodic theorem, Lemma 5.2 is established. □
Lemma 5.3
If \((\mathbf{A1})\) and \((\mathbf{A2})\) hold, then
Proof
Based on assumption \((\mathbf{A2})\), we know that \(E(H_{t}(\theta )H^{\tau }_{t}(\theta ))<\infty \), which implies that
Further by assumption \((\mathbf{A1})\), we know that the model is stationary, from which we can conclude that
By using the Borel–Cantelli lemma, we know that \(\Vert H_{t}(\theta ) \Vert >n^{\frac{1}{2}}\) holds for finite n, which implies that \(\max_{1\leq t\leq n} \Vert H_{t}(\theta ) \Vert >n^{\frac{1}{2}} \) holds for finite n. Similarly, we can obtain that for \(\forall \varepsilon >0\), \(\max_{1\leq t\leq n} \Vert H_{t}(\theta ) \Vert >\varepsilon n^{\frac{1}{2}} \) holds for finite n. Thus, Lemma 5.3 is established. □
Lemma 5.4
If \((\mathbf{A1})\) and \((\mathbf{A2})\) hold, then
Proof
Let \(b= \Vert b \Vert \varsigma \). By (6), we have
Hence we have
where
Let \({D}_{n}=\frac{1}{n}\sum_{t=1}^{n}H_{t}(\theta )H^{\tau }_{t}( \theta )\). From (5), we can see that \(1+b^{\tau }(\theta )H_{t}(\theta )>0\). Thus we have
which implies that
According to Lemma 5.1, we have
which implies that
Further, by Lemma 5.3, we obtain
Note that D is a positive definite matrix. Thus we have
and
where \(\sigma _{\max }\) and \(\sigma _{\min }\) are the smallest and the largest eigenvalue of D, respectively. Combining with (18)–(23), we can obtain that
Combined with (22), we know that \(\Vert b(\theta ) \Vert =O_{p}(n^{-\frac{1}{2}})\). Lemma 5.4 is established. □
Lemma 5.5
If \((\mathbf{A1})\) and \((\mathbf{A2})\) hold, then
where
and
Proof
By (6), we have
Thus, (25) can be established.
In what follows, we consider (27). Note that
So (27) holds. □
Lemma 5.6
If \((\mathbf{A1})\) and \((\mathbf{A2})\) hold, then
where
Proof
We expand
After a simple algebraic operation, we know that (29) holds.
Next, we consider (30). Note that
which implies that (30) holds.
Last, we consider (31). For this, we first prove that there exists a finite real number \(Q>0\) such that
Consider the third-order Taylor expansion of \(\log (1+x)\) at \(x=0\):
where \(\frac{\varpi (x)}{x^{3}}\rightarrow 0\) as \(x\rightarrow 0\). Therefore, there exists \(\iota >0\) such that, for any \(\vert x \vert <\iota \), \(\vert \frac{\varpi (x)}{x^{3}} \vert <\frac{1}{6}\). In addition, note that
Therefore, we have
Let \(A_{n}=\{\omega : \max_{1\leq t\leq n} \vert b^{\tau }(\theta ) H_{t}( \theta )\vert ^{3}<\iota ^{3}\}\). It is easy to prove that, for any \(\omega \in A_{n}\) and \(1\leq t\leq n\),
Thus we have
where \(Q=\frac{1}{2}\). This implies that
□
Proof of Theorem 2.1
By Lemma 5.6, we can conclude that \(-2\log (L(\theta ))\) and \((\sum_{t=1}^{n}H_{t}(\theta ))^{\tau }(\sum_{t=1}^{n}H_{t}( \theta )H^{\tau }_{t}(\theta ))^{-1}\sum_{t=1}^{n}H_{t}( \theta )\) have the same limit distribution. By Lemma 5.1 and Lemma 5.2, we can conclude that
Hence Theorem 2.1 holds. □
Availability of data and materials
The data used to support the findings of this study are available from the corresponding author upon request.
References
Tong, H.: Nonlinear Time Series. Oxford University Press, Oxford (1990)
Petruccelli, J.D., Woolford, S.W.: A threshold \(\operatorname{AR}(1)\) model. J. Appl. Probab. 21, 270–286 (1984)
Brockwell, P.J., Liu, J., Tweedie, R.L.: On the existence of stationary threshold autoregressive moving-average processes. J. Time Ser. Anal. 13, 95–107 (1992)
Hwang, S.Y., Basawa, I.V.: Large sample inference for conditional exponential families with applications to nonlinear time series. J. Stat. Plan. Inference 38, 141–158 (1994)
Hwang, S.Y., Woo, M.J.: Threshold \(\operatorname{ARCH}(1)\) processes: asymptotic inference. Stat. Probab. Lett. 55, 11–20 (2001)
Owen, A.B.: Empirical likelihood ratio confidence intervals for a single functional. Biometrika 75, 237–249 (1988)
Owen, A.B.: Empirical likelihood confidence regions. Ann. Stat. 18, 90–120 (1990)
Owen, A.B.: Empirical Likelihood. Chapman & Hall, New York (2001)
Owen, A.B.: Empirical likelihood for linear models. Ann. Stat. 19, 1725–1747 (1991)
Chen, S.X.: Empirical likelihood confidence intervals for linear regression coefficients. J. Multivar. Anal. 49, 24–40 (1994)
Xue, L.G.: Empirical likelihood for linear models with missing responses. J. Multivar. Anal. 100, 1353–1366 (2009)
Qin, Y.S., Li, Y.H.: Empirical likelihood for linear models under negatively associated errors. J. Syst. Sci. Complex. 102, 153–163 (2011)
Qin, J., Lawless, J.: Empirical likelihood and general estimating equations. Ann. Stat. 22, 300–325 (1994)
Kolaczyk, E.D.: Empirical likelihood for generalized linear models. Stat. Sin. 4, 199–218 (1994)
Chen, S.X., Cui, H.: An extended empirical likelihood for generalized linear models. Stat. Sin. 13, 69–81 (2003)
Xue, D., Xue, L.G., Cheng, W.H.: Empirical likelihood for generalized linear models with missing responses. J. Stat. Plan. Inference 141, 2007–2020 (2011)
Bai, Y., Fung, W.K., Zhu, Z.: Weighted empirical likelihood for generalized linear models with longitudinal data. J. Stat. Plan. Inference 140, 3446–3456 (2010)
Shi, J., Lau, T.S.: Empirical likelihood for partially linear models. J. Multivar. Anal. 72, 132–148 (2000)
Li, G.R., Lin, L., Zhu, L.X.: Empirical likelihood for a varying coefficient partially linear model with diverging number of parameters. J. Multivar. Anal. 105, 85–111 (2012)
Lu, X.W.: Empirical likelihood for heteroscedastic partially linear models. J. Multivar. Anal. 100, 387–396 (2009)
Yan, L., Chen, X.: Empirical likelihood for partly linear models with errors in all variables. J. Multivar. Anal. 130, 275–288 (2014)
Li, J.Y., Liang, W., He, S.Y., Wu, X.B.: Empirical likelihood for the smoothed LAD estimator in infinite variance autoregressive models. Stat. Probab. Lett. 80, 1420–1430 (2010)
Zhao, Z.W., Wang, D.H.: Empirical likelihood for an autoregressive model with explanatory variables. Commun. Stat., Theory Methods 40, 559–570 (2011)
Liu, X.H., Peng, L.: Asymptotic theory and unified confidence region for an autoregressive model. J. Time Ser. Anal. 40, 43–65 (2019)
Zhao, Z.W., Wang, D.H.: Statistical inference for generalized random coefficient autoregressive model. Math. Comput. Model. 56, 152–166 (2012)
Zhao, Z.W., Wang, D.H., Peng, C.X.: Coefficient constancy test in generalized random coefficient autoregressive model. Appl. Math. Comput. 219, 10283–10292 (2013)
Zhao, Z.W., Wang, D.H., Peng, C.X., Zhang, M.L.: Empirical likelihood-based inference for stationary-ergodicity of the generalized random coefficient autoregressive model. Commun. Stat., Theory Methods 44, 2586–2599 (2015)
Zhao, Z.W., Li, Y., Peng, C.X., Fu, Z.H.: Empirical likelihood-based inference in generalized random coefficient autoregressive model with conditional moment restrictions. J. Comput. Appl. Math. 348, 146–160 (2019)
Zhang, H.X., Wang, D.H., Zhu, F.K.: Empirical likelihood inference for random coefficient \(\operatorname{INAR}(p)\) process. J. Time Ser. Anal. 32, 195–203 (2011)
Peng, C.X., Wang, D.H., Zhao, Z.W.: Empirical likelihood-based inference in Poisson autoregressive model with conditional moment restrictions. J. Inequal. Appl. 2015, 218 (2015)
Ding, X., Wang, D.H.: Empirical likelihood inference for \(\operatorname{INAR}(1)\) model with explanatory variables. J. Korean Stat. Soc. 45, 623–632 (2016)
Zhao, Z.W., Wang, D.H., Peng, C.X.: Conditional heteroscedasticity test for Poisson autoregressive model. Commun. Stat., Theory Methods 46, 4437–4448 (2017)
Acknowledgements
We thank the editor and the anonymous referee for their constructive comments and suggestions, which greatly improved this paper.
Funding
This work is supported by the National Natural Science Foundation of China (No. 11571138, 11671054, 11301137, 11271155, 11371168, J1310022, 11501241). The national social science fund of China (16BTJ020).
Author information
Authors and Affiliations
Contributions
All authors contributed equally and significantly in this manuscript, and they read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Peng, C., Zhao, Z. Empirical likelihood inference for threshold autoregressive conditional heteroscedasticity model. J Inequal Appl 2021, 56 (2021). https://doi.org/10.1186/s13660-021-02581-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13660-021-02581-3