Maximum likelihood estimators in linear regression models with Ornstein-Uhlenbeck process

Abstract

The paper studies the linear regression model

$$y_t = x_t^T\beta + \varepsilon_t,\qquad t=1,2,\ldots,n,$$

where

$$d\varepsilon_t = \lambda(\mu - \varepsilon_t)\,dt + \sigma\,dB_t,$$

with parameters $\lambda,\sigma\in\mathbb{R}^+$, $\mu\in\mathbb{R}$, and $\{B_t, t\ge 0\}$ the standard Brownian motion. First, the maximum likelihood (ML) estimators of $\beta$, $\lambda$ and $\sigma^2$ are derived. Second, under general conditions, the asymptotic properties of the ML estimators are investigated. Limiting distributions of the likelihood ratio test statistics for the hypothesis are also given. Finally, the validity of the method is illustrated by two real examples.

MSC: 62J05, 62M10, 60J60.

1 Introduction

Consider the following linear regression model

$$y_t = x_t^T\beta + \varepsilon_t,\qquad t=1,2,\ldots,n,$$
(1.1)

where the $y_t$'s are scalar response variables, the $x_t$'s are explanatory variables, $\beta$ is an $m$-dimensional unknown parameter, and $\{\varepsilon_t\}$ is an Ornstein-Uhlenbeck process, which satisfies the linear stochastic differential equation (SDE)

$$d\varepsilon_t = \lambda(\mu - \varepsilon_t)\,dt + \sigma\,dB_t$$
(1.2)

with parameters $\lambda,\sigma\in\mathbb{R}^+$, $\mu\in\mathbb{R}$, and $\{B_t, t\ge 0\}$ the standard Brownian motion.

It is well known that the linear regression model is among the most important and popular models in the statistical literature, and it has attracted many researchers. For the ordinary linear regression model (when the errors are independent and identically distributed (i.i.d.) random variables), Wang and Zhou [1], Anatolyev [2], Bai and Guo [3], Chen [4], Gil et al. [5], Hampel et al. [6], Cui [7], Durbin [8] and Li and Yang [9] used various estimation methods to obtain estimators of the unknown parameters in (1.1) and discussed large- or small-sample properties of these estimators. Recently, linear regression with serially correlated errors has attracted increasing attention from statisticians and economists. One case of considerable interest is that of autoregressive errors: Hu [10], Wu [11], and Fox and Taqqu [12] established asymptotic normality with the usual $\sqrt{n}$-normalization in the case of long-memory stationary Gaussian observation errors. Giraitis and Surgailis [13] extended this result to non-Gaussian linear sequences. Koul and Surgailis [14] established the asymptotic normality of the Whittle estimator in linear regression models with non-Gaussian long-memory moving average errors. Shiohama and Taniguchi [15] estimated the regression parameters in a linear regression model with autoregressive errors. Fan [16] investigated moderate deviations for M-estimators in linear models with ϕ-mixing errors.

The Ornstein-Uhlenbeck process was originally introduced by Ornstein and Uhlenbeck [17] as a model for particle motion in a fluid. In physical sciences, the Ornstein-Uhlenbeck process is a prototype of a noisy relaxation process, whose probability density function f(x,t) can be described by the Fokker-Planck equation (see Janczura et al. [18], Debbasch et al. [19], Gillespie [20], Ditlevsen and Lansky [21], Garbaczewski and Olkiewicz [22], Plastino and Plastino [23]):

$$\frac{\partial f(x,t)}{\partial t} = \frac{\partial}{\partial x}\bigl(\lambda(x-\mu)f(x,t)\bigr) + \frac{\sigma^2}{2}\,\frac{\partial^2 f(x,t)}{\partial x^2}.$$

This process is now widely used in many areas of application. The main characteristic of the Ornstein-Uhlenbeck process is the tendency to return towards the long-term equilibrium μ. This property, known as mean-reversion, is found in many real life processes, e.g., in commodity and energy price processes (see Fasen [24], Yu [25], Geman [26]). There are a number of papers concerned with the Ornstein-Uhlenbeck process, for example, Janczura et al. [18], Zhang et al. [27], Rieder [28], Iacus [29], Bishwal [30], Shimizu [31], Zhang and Zhang [32], Chronopoulou and Viens [33], Lin and Wang [34] and Xiao et al. [35]. It is well known that the solution of model (1.2) is an autoregressive process. For a constant or functional or random coefficient autoregressive model, many people (for example, Magdalinos [36], Andrews and Guggenberger [37], Fan and Yao [38], Berk [39], Goldenshluger and Zeevi [40], Liebscher [41], Baran et al. [42], Distaso [43] and Harvill and Ray [44]) used various estimation methods to obtain estimators and discussed some asymptotic properties of these estimators, or investigated hypotheses testing.

By (1.1) and (1.2), we obtain that the more general process $\{y_t\}$ satisfies the SDE

$$dy_t = \lambda\bigl(L(t,\lambda,\mu,\beta) - y_t\bigr)\,dt + \sigma\,dB_t,$$
(1.3)

where $L(t,\lambda,\mu,\beta)$ is a time-dependent mean reversion level with three parameters. Thus, model (1.3) is a general Ornstein-Uhlenbeck process. Its special cases have gained much attention and have been applied to many fields such as economics, physics, geography, geology, biology and agriculture. Dehling et al. [45] considered maximum likelihood estimation for this model and proved strong consistency and asymptotic normality of the estimator. Lin and Wang [34] established the existence of a successful coupling for a class of stochastic differential equations given by (1.3). Bishwal [30] investigated the uniform rate of weak convergence of the minimum contrast estimator in the Ornstein-Uhlenbeck process (1.3).

The solution of model (1.2) is given by

$$\varepsilon_t = e^{-\lambda t}\varepsilon_0 + \mu\bigl(1 - e^{-\lambda t}\bigr) + \sigma\int_0^t e^{\lambda(s-t)}\,dB_s,$$
(1.4)

where $\int_0^t e^{\lambda(s-t)}\,dB_s \sim N\bigl(0, \frac{1-e^{-2\lambda t}}{2\lambda}\bigr)$.

The process observed in discrete time is more relevant in statistics and economics. Therefore, by (1.4), the Ornstein-Uhlenbeck time series for $t=1,2,\ldots,n$ is given by

$$\varepsilon_t = e^{-\lambda d}\varepsilon_{t-1} + \mu\bigl(1 - e^{-\lambda d}\bigr) + \sigma\sqrt{\frac{1-e^{-2\lambda d}}{2\lambda}}\,\eta_t,$$
(1.5)

where the $\eta_t \sim N(0,1)$ are i.i.d. random errors and $d$ is an equidistant time lag fixed in advance. Models (1.1) and (1.5) include many special cases, such as the linear regression model with constant-coefficient autoregressive errors (when $\mu=0$; see Hu [10], Wu [11], Maller [46], Pere [47] and Fuller [48]), Ornstein-Uhlenbeck time series or processes (when $\beta=0$; see Rieder [28], Iacus [29], Bishwal [30], Shimizu [31] and Zhang and Zhang [32]), and constant-coefficient autoregressive processes (when $\mu=0$, $\beta=0$; see Chambers [49], Hamilton [50], Brockwell and Davis [51] and Abadir and Lucas [52], etc.).
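
To make the discretization (1.5) concrete, the following sketch simulates models (1.1) and (1.5). It is only an illustration; the function name, the parameter values and the use of NumPy are our own choices and are not prescribed by the paper.

```python
import numpy as np

def simulate_ou_regression(x, beta, lam, sigma, mu=0.0, d=1.0, eps0=0.0, seed=None):
    """Simulate y_t = x_t' beta + eps_t with eps_t following the discretized
    Ornstein-Uhlenbeck recursion (1.5)."""
    rng = np.random.default_rng(seed)
    n = x.shape[0]
    a = np.exp(-lam * d)                                               # AR coefficient exp(-lambda*d)
    s = sigma * np.sqrt((1.0 - np.exp(-2.0 * lam * d)) / (2.0 * lam))  # innovation std. dev.
    eps = np.empty(n)
    prev = eps0
    for t in range(n):
        prev = a * prev + mu * (1.0 - a) + s * rng.standard_normal()
        eps[t] = prev
    return x @ beta + eps

# Illustrative values only (not taken from the paper).
x = np.column_stack([np.ones(200), np.linspace(0.0, 1.0, 200)])
y = simulate_ou_regression(x, beta=np.array([1.0, 0.5]), lam=0.8, sigma=1.0, seed=0)
```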

This paper discusses models (1.1) and (1.5). The organization of the paper is as follows. In Section 2, estimators of $\beta$, $\lambda$ and $\sigma^2$ are obtained by the quasi-maximum likelihood (QML) method. Under general conditions, the existence, consistency and asymptotic normality of the quasi-maximum likelihood estimators are investigated in Section 3. Hypothesis testing is discussed in Section 4. Some preliminary lemmas are presented in Section 5. The main proofs of the theorems are given in Section 6, and two real examples are analyzed in Section 7.

2 Estimation method

Without loss of generality, we assume that $\mu=0$ and $\varepsilon_0=0$ in the sequel. Write the ‘true’ model as

$$y_t = x_t^T\beta_0 + e_t,\qquad t=1,2,\ldots,n$$
(2.1)

and

$$e_t = \exp(-\lambda_0 d)\,e_{t-1} + \sigma_0\sqrt{\frac{1-\exp(-2\lambda_0 d)}{2\lambda_0}}\,\eta_t,$$
(2.2)

where the $\eta_t \sim N(0,1)$ are i.i.d.

By (2.2), we have

$$e_t = \sigma_0\sqrt{\frac{1-\exp(-2\lambda_0 d)}{2\lambda_0}}\sum_{j=1}^{t}\exp\{-\lambda_0 d(t-j)\}\,\eta_j.$$
(2.3)

Thus $e_t$ is measurable with respect to the $\sigma$-field $\mathcal{H}_t$ generated by $\eta_1,\eta_2,\ldots,\eta_t$, and

$$E e_t = 0,\qquad \mathrm{Var}(e_t) = \sigma_0^2\,\frac{1-\exp(-2\lambda_0 d)}{2\lambda_0}\sum_{j=1}^{t}\exp\{-2\lambda_0 d(t-j)\} = \sigma_0^2\,\frac{1-\exp(-2\lambda_0 dt)}{2\lambda_0}.$$
(2.4)

Using arguments similar to those of Rieder [28] or Maller [46], we obtain the log-likelihood of $y_2, y_3, \ldots, y_n$ conditional on $y_1$:

$$\Psi_n(\beta,\lambda,\sigma^2) = \log L_n = -\tfrac{1}{2}(n-1)\log\Bigl(\frac{\pi\sigma^2}{\lambda}\Bigr) - \tfrac{1}{2}(n-1)\log\bigl(1-\exp(-2\lambda d)\bigr) - \frac{\lambda}{\sigma^2(1-\exp(-2\lambda d))}\sum_{t=2}^{n}\bigl(\varepsilon_t - \exp(-\lambda d)\varepsilon_{t-1}\bigr)^2.$$
(2.5)

We maximize (2.5) to obtain QML estimators, denoted by $\hat\sigma_n^2$, $\hat\beta_n$, $\hat\lambda_n$ (when they exist). The first derivatives of $\Psi_n$ may be written as

$$\frac{\partial\Psi_n}{\partial\sigma^2} = -\frac{n-1}{2\sigma^2} + \frac{\lambda}{\sigma^4(1-\exp(-2\lambda d))}\sum_{t=2}^{n}\bigl(\varepsilon_t-\exp(-\lambda d)\varepsilon_{t-1}\bigr)^2,$$
(2.6)
$$\frac{\partial\Psi_n}{\partial\lambda} = \frac{n-1}{2\lambda} - \frac{(n-1)d\exp(-2\lambda d)}{1-\exp(-2\lambda d)} - \frac{2d\lambda\exp(-\lambda d)}{\sigma^2(1-\exp(-2\lambda d))}\sum_{t=2}^{n}\bigl(\varepsilon_t-\exp(-\lambda d)\varepsilon_{t-1}\bigr)\varepsilon_{t-1} - \frac{1-(1+2d\lambda)\exp(-2\lambda d)}{\sigma^2(1-\exp(-2\lambda d))^2}\sum_{t=2}^{n}\bigl(\varepsilon_t-\exp(-\lambda d)\varepsilon_{t-1}\bigr)^2$$
(2.7)

and

$$\frac{\partial\Psi_n}{\partial\beta} = \frac{2\lambda}{\sigma^2(1-\exp(-2\lambda d))}\sum_{t=2}^{n}\bigl(\varepsilon_t-\exp(-\lambda d)\varepsilon_{t-1}\bigr)\bigl(x_t-\exp(-\lambda d)x_{t-1}\bigr).$$
(2.8)

Thus $\hat\sigma_n^2$, $\hat\beta_n$, $\hat\lambda_n$ satisfy the following estimating equations:

$$\hat\sigma_n^2 = \frac{2\hat\lambda_n}{(n-1)(1-\exp(-2\hat\lambda_n d))}\sum_{t=2}^{n}\bigl(\hat\varepsilon_t-\exp(-\hat\lambda_n d)\hat\varepsilon_{t-1}\bigr)^2,$$
(2.9)
$$\frac{\hat\sigma_n^2\bigl(1-(1+2d\hat\lambda_n)\exp(-2\hat\lambda_n d)\bigr)}{2\hat\lambda_n} - \frac{2d\hat\lambda_n\exp(-\hat\lambda_n d)}{n-1}\sum_{t=2}^{n}\bigl(\hat\varepsilon_t-\exp(-\hat\lambda_n d)\hat\varepsilon_{t-1}\bigr)\hat\varepsilon_{t-1} - \frac{1-(1+2d\hat\lambda_n)\exp(-2\hat\lambda_n d)}{(1-\exp(-2\hat\lambda_n d))(n-1)}\sum_{t=2}^{n}\bigl(\hat\varepsilon_t-\exp(-\hat\lambda_n d)\hat\varepsilon_{t-1}\bigr)^2 = 0$$
(2.10)

and

$$\sum_{t=2}^{n}\bigl(\hat\varepsilon_t-\exp(-\hat\lambda_n d)\hat\varepsilon_{t-1}\bigr)\bigl(x_t-\exp(-\hat\lambda_n d)x_{t-1}\bigr) = 0,$$
(2.11)

where

$$\hat\varepsilon_t = y_t - x_t^T\hat\beta_n.$$
(2.12)
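
In practice the estimating equations (2.9)-(2.11) can be solved numerically. The sketch below instead maximizes the conditional log-likelihood (2.5) directly, with $\sigma^2$ profiled out through (2.9); the use of SciPy's optimizer, the log-parametrization of $\lambda$ and the function names are implementation choices of ours, not part of the paper.

```python
import numpy as np
from scipy.optimize import minimize

def neg_profile_loglik(params, y, x, d=1.0):
    """Minus the conditional log-likelihood (2.5) with sigma^2 profiled out via (2.9).
    params = (beta_1, ..., beta_m, log_lambda); the log keeps lambda > 0."""
    m = x.shape[1]
    beta, lam = params[:m], np.exp(params[m])
    eps = y - x @ beta
    a = np.exp(-lam * d)
    rss = np.sum((eps[1:] - a * eps[:-1]) ** 2)
    n = len(y)
    sigma2 = 2.0 * lam * rss / ((n - 1) * (1.0 - np.exp(-2.0 * lam * d)))  # (2.9)
    return 0.5 * (n - 1) * (np.log(np.pi * sigma2 / lam)
                            + np.log(1.0 - np.exp(-2.0 * lam * d)) + 1.0)

def fit_qml(y, x, d=1.0, lam0=1.0):
    """Numerical QML fit: returns (beta_hat, lambda_hat, sigma2_hat)."""
    m = x.shape[1]
    start = np.concatenate([np.linalg.lstsq(x, y, rcond=None)[0], [np.log(lam0)]])
    res = minimize(neg_profile_loglik, start, args=(y, x, d), method="Nelder-Mead")
    beta_hat, lam_hat = res.x[:m], np.exp(res.x[m])
    eps = y - x @ beta_hat
    rss = np.sum((eps[1:] - np.exp(-lam_hat * d) * eps[:-1]) ** 2)
    sigma2_hat = 2.0 * lam_hat * rss / ((len(y) - 1) * (1.0 - np.exp(-2.0 * lam_hat * d)))
    return beta_hat, lam_hat, sigma2_hat
```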

To obtain our results, the following conditions are sufficient (see Maller [46]).

(A1) $X_n = \sum_{t=2}^{n} x_t x_t^T$ is positive definite for sufficiently large $n$, and

$$\lim_{n\to\infty}\max_{1\le t\le n} x_t^T X_n^{-1} x_t = 0.$$
(2.13)

(A2)

$$\limsup_{n\to\infty} |\tilde\lambda|_{\max}\bigl(X_n^{-1/2} Z_n X_n^{-T/2}\bigr) < 1,$$
(2.14)

where $Z_n = \frac{1}{2}\sum_{t=2}^{n}(x_t x_{t-1}^T + x_{t-1}x_t^T)$ and $|\tilde\lambda|_{\max}(\cdot)$ denotes the maximum absolute value of the eigenvalues of a symmetric matrix.

For ease of exposition, we shall introduce the following notations which will be used later in the paper.

Let $\theta=(\beta,\lambda)$ denote the $(m+1)$-vector of parameters. Define

$$S_n(\theta) = \sigma^2\frac{\partial\Psi_n}{\partial\theta} = \sigma^2\Bigl(\frac{\partial\Psi_n}{\partial\beta},\frac{\partial\Psi_n}{\partial\lambda}\Bigr),\qquad F_n(\theta) = -\sigma^2\frac{\partial^2\Psi_n}{\partial\theta\,\partial\theta^T}.$$
(2.15)

By (2.7) and (2.8), we get the components of F n (θ)

$$-\sigma^2\frac{\partial^2\Psi_n}{\partial\beta\,\partial\beta^T} = \frac{2\lambda}{1-\exp(-2\lambda d)}\sum_{t=2}^{n}\bigl(x_t-\exp(-\lambda d)x_{t-1}\bigr)\bigl(x_t-\exp(-\lambda d)x_{t-1}\bigr)^T =: \frac{2\lambda}{1-\exp(-2\lambda d)}\,X_n(\lambda),$$
(2.16)
$$-\sigma^2\frac{\partial^2\Psi_n}{\partial\beta\,\partial\lambda} = \frac{2d\lambda\exp(-\lambda d)}{1-\exp(-2\lambda d)}\sum_{t=2}^{n}\bigl(\varepsilon_{t-1}x_t+\varepsilon_t x_{t-1}-2\exp(-\lambda d)x_{t-1}\varepsilon_{t-1}\bigr) - \frac{1-(1+2d\lambda)\exp(-2\lambda d)}{(1-\exp(-2\lambda d))^2}\sum_{t=2}^{n}\bigl(\varepsilon_t-\exp(-\lambda d)\varepsilon_{t-1}\bigr)\bigl(x_t-\exp(-\lambda d)x_{t-1}\bigr)$$
(2.17)

and

$$-\sigma^2\frac{\partial^2\Psi_n}{\partial\lambda^2} = \frac{\sigma^2(n-1)}{2\lambda^2} - \frac{2\sigma^2(n-1)d^2\exp(-2\lambda d)}{(1-\exp(-2\lambda d))^2} + \frac{2d^2\lambda\exp(-2\lambda d)}{1-\exp(-2\lambda d)}\sum_{t=2}^{n}\varepsilon_{t-1}^2 + \frac{2d\bigl(1-d\lambda-(1+d\lambda)\exp(-2\lambda d)\bigr)\exp(-\lambda d)}{(1-\exp(-2\lambda d))^2}\sum_{t=2}^{n}\bigl(\varepsilon_t-\exp(-\lambda d)\varepsilon_{t-1}\bigr)\varepsilon_{t-1} + \frac{2d\exp(-\lambda d)\bigl[1-(1+2d\lambda)\exp(-2\lambda d)\bigr]}{(1-\exp(-2\lambda d))^2}\sum_{t=2}^{n}\bigl(\varepsilon_t-\exp(-\lambda d)\varepsilon_{t-1}\bigr)\varepsilon_{t-1} + \frac{4d\exp(-2\lambda d)\bigl[d\lambda-1+(1+d\lambda)\exp(-2\lambda d)\bigr]}{(1-\exp(-2\lambda d))^3}\sum_{t=2}^{n}\bigl(\varepsilon_t-\exp(-\lambda d)\varepsilon_{t-1}\bigr)^2.$$
(2.18)

Hence we have

$$F_n(\theta) = \begin{pmatrix} \dfrac{2\lambda}{1-\exp(-2\lambda d)}X_n(\lambda) & -\sigma^2\dfrac{\partial^2\Psi_n}{\partial\beta\,\partial\lambda} \\[2mm] * & -\sigma^2\dfrac{\partial^2\Psi_n}{\partial\lambda^2} \end{pmatrix},$$
(2.19)

where the asterisk indicates that the corresponding element is filled in by symmetry. By (2.18), we have

$$E\Bigl\{-\sigma^2\frac{\partial^2\Psi_n}{\partial\lambda^2}\Big|_{\theta=\theta_0}\Bigr\} = (n-1)\sigma_0^2\Bigl\{\frac{1}{2\lambda_0^2} + \frac{2d\exp(-2\lambda_0 d)\bigl[-1+(1+d\lambda_0)\exp(-2\lambda_0 d)\bigr]}{\lambda_0(1-\exp(-2\lambda_0 d))^2}\Bigr\} + \frac{2d^2\lambda_0\exp(-2\lambda_0 d)}{1-\exp(-2\lambda_0 d)}\sum_{t=2}^{n}E e_{t-1}^2 = \frac{(n-1)\sigma_0^2\bigl[1-(1+2d\lambda_0)\exp(-2\lambda_0 d)\bigr]^2}{2\lambda_0^2(1-\exp(-2\lambda_0 d))^2} + \frac{2d^2\lambda_0\exp(-2\lambda_0 d)}{1-\exp(-2\lambda_0 d)}\sum_{t=2}^{n}E e_{t-1}^2 =: \Delta_n(\theta_0,\sigma_0) = O(n).$$
(2.20)

Thus,

$$D_n = E\bigl(F_n(\theta_0)\bigr) = \begin{pmatrix} \dfrac{2\lambda_0}{1-\exp(-2\lambda_0 d)}X_n(\lambda_0) & 0 \\[2mm] 0 & \Delta_n(\theta_0,\sigma_0) \end{pmatrix}.$$
(2.21)

3 Large sample properties of the estimators

Theorem 3.1 Suppose that conditions (A1)-(A2) hold. Then there is a sequence $A_n \to 0$ such that, for each $A>0$, as $n\to\infty$,

$$P\bigl\{\text{there are estimators } \hat\theta_n, \hat\sigma_n^2 \text{ with } S_n(\hat\theta_n)=0 \text{ and } (\hat\theta_n,\hat\sigma_n^2)\in N_n^*(A)\bigr\} \to 1.$$
(3.1)

Furthermore,

$$(\hat\theta_n,\hat\sigma_n^2)\xrightarrow{p}(\theta_0,\sigma_0^2),\qquad n\to\infty,$$
(3.2)

where, for each $n=1,2,\ldots$, $A>0$ and $A_n\in(0,\sigma_0^2)$, the neighborhoods are defined by

$$N_n(A) = \bigl\{\theta\in\mathbb{R}^{m+1} : (\theta-\theta_0)^T D_n(\theta-\theta_0) \le A^2\bigr\}$$
(3.3)

and

$$N_n^*(A) = N_n(A)\times\bigl\{\sigma^2\in[\sigma_0^2-A_n,\,\sigma_0^2+A_n]\bigr\}.$$
(3.4)

Theorem 3.2 Suppose that conditions (A1)-(A2) hold. Then

$$\frac{1}{\hat\sigma_n}\,F_n^{T/2}(\hat\theta_n)(\hat\theta_n-\theta_0) \xrightarrow{D} N(0, I_{m+1}),\qquad n\to\infty.$$
(3.5)

In the following, we will investigate some special cases in models (1.1) and (1.5). From Theorem 3.1 and Theorem 3.2, we obtain the following results. Here we omit their proofs.

Corollary 3.1 If $\beta=0$, then

$$\frac{\sqrt{\Delta_n(\theta_0,\sigma_0)}}{\hat\sigma_n}\,(\hat\lambda_n-\lambda_0) \xrightarrow{D} N(0,1),\qquad n\to\infty.$$
(3.6)

Corollary 3.2 If $\beta=0$, then

$$\sqrt{n}\,(\hat\lambda_n-\lambda_0) \xrightarrow{D} N\bigl(0,\sigma_0^2\bigr),\qquad n\to\infty.$$
(3.7)
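
A small Monte Carlo experiment can illustrate the corollaries in the pure Ornstein-Uhlenbeck case $\beta=0$: repeatedly simulate (1.5) with $\mu=0$, re-estimate $\lambda$ by quasi-maximum likelihood, and inspect the standardized estimation errors. The parameter values and tooling (NumPy/SciPy) below are illustrative assumptions, not results reported in the paper.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def ou_lambda_qml(eps, d=1.0):
    """QML estimate of lambda for a pure OU series (beta = 0), sigma^2 profiled out."""
    n = len(eps)
    def nll(log_lam):
        lam = np.exp(log_lam)
        a = np.exp(-lam * d)
        rss = np.sum((eps[1:] - a * eps[:-1]) ** 2)
        sigma2 = 2.0 * lam * rss / ((n - 1) * (1.0 - np.exp(-2.0 * lam * d)))
        return 0.5 * (n - 1) * (np.log(np.pi * sigma2 / lam)
                                + np.log(1.0 - np.exp(-2.0 * lam * d)) + 1.0)
    return np.exp(minimize_scalar(nll, bounds=(-5.0, 5.0), method="bounded").x)

rng = np.random.default_rng(1)
lam0, sigma0, d, n, reps = 0.8, 1.0, 1.0, 500, 200
a = np.exp(-lam0 * d)
s = sigma0 * np.sqrt((1.0 - np.exp(-2.0 * lam0 * d)) / (2.0 * lam0))
errors = []
for _ in range(reps):
    eps = np.zeros(n)
    for t in range(1, n):
        eps[t] = a * eps[t - 1] + s * rng.standard_normal()
    errors.append(np.sqrt(n) * (ou_lambda_qml(eps, d) - lam0))
print(np.mean(errors), np.std(errors))  # roughly centred at 0, spread stabilizing as n grows
```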

4 Hypothesis testing

In order to fit a data set $\{y_t, t=1,2,\ldots,n\}$, we may use model (1.3) or an Ornstein-Uhlenbeck process with a constant mean level,

$$dy_t = \lambda(\mu - y_t)\,dt + \sigma\,dB_t.$$
(4.1)

If $\beta\neq 0$, then we use model (1.3), namely models (1.1) and (1.2). If $\beta=0$, then we use model (4.1). How do we decide whether $\beta=0$ or $\beta\neq 0$? In this section, we consider this hypothesis testing problem and obtain the limiting distribution of the likelihood ratio (LR) test statistic (see Fan and Jiang [53]).

Under the null hypothesis

$$H_0:\ \beta_0 = 0,\quad \lambda_0>0,\quad \sigma_0>0,$$
(4.2)

let $\hat\beta_{0n}$, $\hat\lambda_{0n}$, $\hat\sigma_{0n}^2$ be the corresponding ML estimators of $\beta$, $\lambda$, $\sigma^2$. Also let

$$\hat L_n = -2\Psi_n\bigl(\hat\beta_n,\hat\lambda_n,\hat\sigma_n^2\bigr)$$
(4.3)

and

$$\hat L_{0n} = -2\Psi_n\bigl(\hat\beta_{0n},\hat\lambda_{0n},\hat\sigma_{0n}^2\bigr).$$
(4.4)

By (2.9) and (2.5), we have that

$$\hat L_n = (n-1)\log\Bigl(\frac{\pi\hat\sigma_n^2}{\hat\lambda_n}\Bigr) + (n-1)\log\bigl(1-\exp(-2\hat\lambda_n d)\bigr) + \frac{2\hat\lambda_n}{\hat\sigma_n^2(1-\exp(-2\hat\lambda_n d))}\sum_{t=2}^{n}\bigl(\hat\varepsilon_t-\exp(-\hat\lambda_n d)\hat\varepsilon_{t-1}\bigr)^2 = (n-1)\log\Bigl(\frac{\pi\hat\sigma_n^2}{\hat\lambda_n}\Bigr) + (n-1)\log\bigl(1-\exp(-2\hat\lambda_n d)\bigr) + (n-1) = (n-1)(\log\pi+1) + (n-1)\log\hat\sigma_n^2 + (n-1)\bigl(\log(1-\exp(-2\hat\lambda_n d)) - \log\hat\lambda_n\bigr).$$
(4.5)

And similarly,

$$\hat L_{0n} = (n-1)(\log\pi+1) + (n-1)\log\hat\sigma_{0n}^2 + (n-1)\bigl(\log(1-\exp(-2\hat\lambda_{0n}d)) - \log\hat\lambda_{0n}\bigr).$$
(4.6)

By (4.5) and (4.6), we have

$$\tilde d(n) = \hat L_{0n} - \hat L_n = (n-1)\log\Bigl(\frac{\hat\sigma_{0n}^2}{\hat\sigma_n^2}\Bigr) + (n-1)\Bigl(\log\frac{1-\exp(-2\hat\lambda_{0n}d)}{1-\exp(-2\hat\lambda_n d)} - \log\frac{\hat\lambda_{0n}}{\hat\lambda_n}\Bigr) = (n-1)\Bigl(\frac{\hat\sigma_{0n}^2}{\hat\sigma_n^2}-1\Bigr) + (n-1)\Bigl(\frac{1-\exp(-2\hat\lambda_{0n}d)}{1-\exp(-2\hat\lambda_n d)} - \frac{\hat\lambda_{0n}}{\hat\lambda_n}\Bigr) + o_p(1) = (n-1)\,\frac{\hat\sigma_{0n}^2-\hat\sigma_n^2}{\sigma_0^2} + (n-1)\Bigl(\frac{1-\exp(-2\hat\lambda_{0n}d)}{1-\exp(-2\hat\lambda_n d)} - \frac{\hat\lambda_{0n}}{\hat\lambda_n}\Bigr) + o_p(1) = (n-1)\,\frac{\hat\sigma_{0n}^2-\hat\sigma_n^2}{\sigma_0^2} + o_p(1).$$
(4.7)

Large values of d ˜ (n) suggest rejection of the null hypothesis.

Theorem 4.1 Suppose that conditions (A1)-(A2) hold. If H 0 holds, then

$$\tilde d(n) \xrightarrow{D} \chi^2(m),\qquad n\to\infty.$$
(4.8)
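
Operationally, the test only needs the fitted $(\hat\sigma^2, \hat\lambda)$ under the full and the null model, since (4.5) and (4.6) give $-2$ times the maximized log-likelihoods in closed form. The following sketch computes $\tilde d(n)$ and compares it with the $\chi^2(m)$ quantile; the numerical inputs are placeholders of ours, not results from the paper, and SciPy is assumed for the quantile.

```python
import numpy as np
from scipy.stats import chi2

def lr_statistic(n, sigma2_full, lam_full, sigma2_null, lam_null, d=1.0):
    """LR statistic (4.7), built from the closed forms (4.5)-(4.6) for -2 times
    the maximized log-likelihood under the full and the null model."""
    def L_hat(sigma2, lam):
        return ((n - 1) * (np.log(np.pi) + 1.0) + (n - 1) * np.log(sigma2)
                + (n - 1) * (np.log(1.0 - np.exp(-2.0 * lam * d)) - np.log(lam)))
    return L_hat(sigma2_null, lam_null) - L_hat(sigma2_full, lam_full)

# Placeholder inputs: reject H0: beta = 0 at level alpha when d_tilde exceeds the
# upper chi^2(m) quantile, m being the dimension of beta.
m, alpha = 2, 0.01
d_tilde = lr_statistic(n=100, sigma2_full=0.9, lam_full=0.8, sigma2_null=1.6, lam_null=0.7)
print(d_tilde, chi2.ppf(1.0 - alpha, df=m), d_tilde > chi2.ppf(1.0 - alpha, df=m))
```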

5 Some lemmas

Throughout this paper, let $C$ denote a generic positive constant which may take different values at each occurrence. To prove our main results, we first introduce the following lemmas.

Lemma 5.1 If condition (A1) holds, then for any $\lambda\in\mathbb{R}^+$ the matrix $X_n(\lambda)$ is positive definite for large enough $n$, and

$$\lim_{n\to\infty}\max_{1\le t\le n} x_t^T X_n^{-1}(\lambda)\,x_t = 0.$$

Proof Let $\tilde\lambda_1$ and $\tilde\lambda_m$ be the smallest and largest roots of $|Z_n - \tilde\lambda X_n| = 0$. Then, from Ex. 22.1 of Rao [54],

$$\tilde\lambda_1 \le \frac{u^T Z_n u}{u^T X_n u} \le \tilde\lambda_m$$

for unit vectors $u$. Thus, by (2.14), there are some $\delta\in(0,1)$ and $n_0(\delta)$ such that $n\ge n_0(\delta)$ implies

$$|u^T Z_n u| \le (1-\delta)\,u^T X_n u.$$
(5.1)

By (2.16) and (5.1), we have

u T X n ( λ ) u = t = 2 n ( u T ( x t exp ( λ d ) x t 1 ) ) 2 t = 2 n ( u T x t ) 2 + min λ exp ( 2 λ d ) t = 2 n ( u T x t 1 ) 2 max λ exp ( λ d ) u T Z n u u T X n u + min λ exp ( 2 λ d ) u T X n u u T Z n u ( 1 + min λ exp ( 2 λ d ) ( 1 δ ) ) u T X n u = ( min λ exp ( 2 λ d ) + δ ) u T X n u = C ( λ , δ ) u T X n u .
(5.2)

By Rao [[54], p.60] and (2.13), we have

$$\frac{(u^T x_t)^2}{u^T X_n u} \to 0.$$
(5.3)

From (5.3) and C(λ,δ)>0,

$$x_t^T X_n^{-1}(\lambda)\,x_t = \sup_u\frac{(u^T x_t)^2}{u^T X_n(\lambda)u} \le \sup_u\frac{(u^T x_t)^2}{C(\lambda,\delta)\,u^T X_n u} \to 0.$$
(5.4)

 □

Lemma 5.2 The matrix $D_n$ is positive definite for large enough $n$, $E(S_n(\theta_0))=0$ and $\mathrm{Var}(S_n(\theta_0))=\sigma_0^2 D_n$.

Proof Note that X n ( λ 0 ) is positive definite and Δ n ( θ 0 , σ 0 )>0. It is easy to show that the matrix D n is positive definite for large enough n. By (2.8), we have

σ 0 2 E ( Ψ n β | θ = θ 0 ) = 2 λ 0 1 exp ( 2 λ 0 d ) t = 2 n E ( e t exp ( λ 0 d ) e t 1 ) ( x t exp ( λ 0 d ) x t 1 ) = 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 1 exp ( 2 d λ 0 ) 2 λ 0 t = 2 n ( x t exp ( λ 0 d ) x t 1 ) E η t = 0 .
(5.5)

Note that e t 1 and η t are independent, so we have E( η t e t 1 )=0. Thus, by (2.7) and E η t =0, we have

E ( Ψ n λ | θ = θ 0 ) = n 1 2 λ 0 ( n 1 ) d exp ( 2 λ 0 d ) 1 exp ( 2 λ 0 d ) 0 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) σ 0 2 ( 1 exp ( 2 λ 0 d ) ) 2 σ 0 2 1 exp ( 2 d λ 0 ) 2 λ 0 t = 2 n E η t 2 = n 1 2 λ 0 ( n 1 ) d exp ( 2 λ 0 d ) 1 exp ( 2 λ 0 d ) 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) 2 λ 0 ( 1 exp ( 2 λ 0 d ) ) ( n 1 ) = 0 .
(5.6)

Hence, from (5.5) and (5.6),

E ( S n ( θ 0 ) ) = σ 0 2 E ( Ψ n β | θ = θ 0 , Ψ n λ | θ = θ 0 ) =0.
(5.7)

By (2.8) and (2.20), we have

Var ( σ 0 2 Ψ n β | θ = θ 0 ) = Var { 2 λ 0 1 exp ( 2 λ 0 d ) t = 2 n ( e t exp ( λ 0 d ) e t 1 ) ( x t exp ( λ 0 d ) x t 1 ) } = 2 σ 0 2 λ 0 1 exp ( 2 λ 0 d ) Var { t = 2 n ( x t exp ( λ 0 d ) x t 1 ) η t } = 2 σ 0 2 λ 0 1 exp ( 2 λ 0 d ) X n ( λ 0 ) .
(5.8)

Note that $\{\eta_t e_{t-1}, \mathcal{H}_t\}$ is a martingale difference sequence with

$$\mathrm{Var}(\eta_t e_{t-1}) = E\eta_t^2\,E e_{t-1}^2 = E e_{t-1}^2,$$

so

Var ( σ 0 2 Ψ n λ | θ = θ 0 ) = E { σ 0 d exp ( λ 0 d ) 2 λ 0 1 exp ( λ 0 d ) t = 2 n η t e t 1 } 2 + E { σ 0 2 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 d λ 0 ) ] 2 λ 0 ( 1 exp ( 2 λ 0 d ) ) t = 2 n ( η t 2 1 ) } 2 + 2 σ 0 3 d exp ( λ 0 d ) [ 1 ( 1 + 2 d λ 0 ) exp ( 2 d λ 0 ) ] λ 0 ( 1 exp ( 2 λ 0 d ) ) 3 2 E { t = 2 n η t e t 1 t = 2 n ( η t 2 1 ) } = 2 λ 0 σ 0 2 d 2 exp ( 2 λ 0 d ) 1 exp ( λ 0 d ) t = 2 n E e t 1 2 + { σ 0 2 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 d λ 0 ) ] 2 λ 0 ( 1 exp ( 2 λ 0 d ) ) } 2 ( n 1 ) ( E η t 4 1 ) + 2 σ 0 3 d exp ( λ 0 d ) [ 1 ( 1 + 2 d λ 0 ) exp ( 2 d λ 0 ) ] λ 0 ( 1 exp ( 2 λ 0 d ) ) 3 2 ( t = 2 n E ( ( η t 2 1 ) η t e t 1 ) + t k E ( η t e t 1 ( η k 2 1 ) ) ) = 2 λ 0 σ 0 2 d 2 exp ( 2 λ 0 d ) 1 exp ( λ 0 d ) t = 2 n E e t 1 2 + 2 ( n 1 ) { σ 0 2 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 d λ 0 ) ] 2 λ 0 ( 1 exp ( 2 λ 0 d ) ) } 2 = σ 0 2 Δ n ( θ 0 , σ 0 ) .
(5.9)

By (2.7), (2.8), and noting that e t 1 and η t are independent, we have

Cov ( σ 0 2 Ψ n β , σ 0 2 Ψ n λ ) | θ = θ 0 = σ 0 3 1 ( 1 + 2 d λ ) exp ( 2 λ d ) 2 λ 0 ( 1 exp ( 2 λ d ) ) 3 2 E ( t = 2 n η t 2 t = 2 n η t ( x t exp ( λ d ) x t 1 ) ) = σ 0 3 1 ( 1 + 2 d λ ) exp ( 2 λ d ) 2 λ 0 ( 1 exp ( 2 λ d ) ) 3 2 E η t 3 t = 2 n ( x t exp ( λ d ) x t 1 ) = 0 .
(5.10)

From (5.8)-(5.10), it follows that Var( S n ( θ 0 ))= σ 0 2 D n . The proof is completed. □

Lemma 5.3 (Maller [55])

Let $W_n$ be a symmetric random matrix with eigenvalues $\tilde\lambda_j(n)$, $1\le j\le d$. Then

$$W_n \xrightarrow{p} I \iff \tilde\lambda_j(n)\xrightarrow{p} 1\ (1\le j\le d),\qquad n\to\infty.$$

Lemma 5.4 For each A>0,

$$\sup_{\theta\in N_n(A)}\bigl\| D_n^{-1/2}F_n(\theta)D_n^{-T/2} - \Phi_n\bigr\| \xrightarrow{p} 0,\qquad n\to\infty$$
(5.11)

and also

$$\Phi_n \xrightarrow{D} \Phi,$$
(5.12)
$$\lim_{c\to 0}\limsup_{A\to\infty}\limsup_{n\to\infty} P\Bigl\{\inf_{\theta\in N_n(A)}\lambda_{\min}\bigl(D_n^{-1/2}F_n(\theta)D_n^{-T/2}\bigr)\le c\Bigr\} = 0,$$
(5.13)

where

$$\Phi_n = \begin{pmatrix}\dfrac{\lambda(1-\exp(-2d\lambda_0))}{\lambda_0(1-\exp(-2d\lambda))}\,I_m & 0\\[2mm] 0 & \dfrac{-\sigma^2\,\partial^2\Psi_n/\partial\lambda^2\big|_{\theta=\theta_0}}{\Delta_n(\theta_0,\sigma_0)}\end{pmatrix},\qquad \Phi = I_{m+1}.$$
(5.14)

Proof Let X n ( λ 0 )= X n 1 2 ( λ 0 ) X n T 2 ( λ 0 ) be a square root decomposition of X n ( λ 0 ). Then

D n = ( 2 λ 0 1 exp ( 2 d λ 0 ) X n 1 2 ( λ 0 ) 0 0 Δ n ( θ 0 , σ 0 ) ) ( 2 λ 0 1 exp ( 2 d λ 0 ) X n T 2 ( λ 0 ) 0 0 Δ n ( θ 0 , σ 0 ) ) = D n 1 2 D n T 2 .
(5.15)

Let θ N n (A). Then

( θ θ 0 ) T D n ( θ θ 0 ) = 2 λ 0 1 exp ( 2 d λ 0 ) ( β β 0 ) T X n ( λ 0 ) ( β β 0 ) + ( λ λ 0 ) 2 Δ n ( θ 0 , σ 0 ) A 2 .
(5.16)

From (2.20), (2.21) and (5.14),

D n 1 2 F n (θ) D n T 2 Φ n =( W 11 W 12 W 22 ),
(5.17)

where

W 11 = λ ( 1 exp ( 2 d λ 0 ) ) λ 0 ( 1 exp ( 2 d λ ) ) { X n 1 2 ( λ 0 ) X n ( λ ) X n T 2 ( λ 0 ) I m } ,
(5.18)
W 12 = 1 exp ( 2 d λ 0 ) 2 λ 0 X n 1 2 ( λ 0 ) ( σ 2 2 Ψ n β λ ) Δ n ( θ 0 , σ 0 )
(5.19)

and

W 22 = σ 2 2 Ψ n λ 2 σ 2 2 Ψ n λ 2 | θ = θ 0 Δ n ( θ 0 , σ 0 ) .
(5.20)

Let

N n β (A)= { β : 2 λ 0 1 exp ( 2 d λ 0 ) | ( β β 0 ) T X n 1 2 ( λ 0 ) | 2 A 2 }
(5.21)

and

N n λ (A)= { θ : | λ λ 0 | A Δ n ( θ 0 , σ 0 ) } .
(5.22)

As the first step, we will show that, for each A>0,

sup θ N n θ ( A ) W 11 0,n.
(5.23)

In fact, note that

W 11 = λ ( 1 exp ( 2 d λ 0 ) ) λ 0 ( 1 exp ( 2 d λ ) ) X n 1 2 ( λ 0 ) ( X n ( λ ) X n ( λ 0 ) ) X n T 2 ( λ 0 ) = λ ( 1 exp ( 2 d λ 0 ) ) λ 0 ( 1 exp ( 2 d λ ) ) X n 1 2 ( λ 0 ) ( T 1 + T 2 T 3 ) X n T 2 ( λ 0 ) ,
(5.24)

where

T 1 = t = 2 n ( exp ( d λ 0 ) exp ( d λ ) ) x t 1 ( x t exp ( d λ 0 ) x t 1 ) T , T 2 = t = 2 n ( exp ( d λ 0 ) exp ( d λ ) ) ( x t exp ( d λ 0 ) x t 1 ) x t T

and

T 3 = t = 2 n ( exp ( d λ ) exp ( d λ 0 ) ) 2 x t 1 x t 1 T .

Let u,v R d , |u|=|v|=1, and let u n T = u T X n 1 2 ( λ 0 ), v n T = X n T 2 ( λ 0 )v. By the Cauchy-Schwarz inequality, Lemma 5.1 and noting N n λ (A), we have

| u n T T 1 v n | = | ( exp ( d λ 0 ) exp ( d λ ) ) t = 2 n u n T x t 1 ( x t exp ( d λ 0 ) x t 1 ) T v n | max | exp ( d λ 0 ) exp ( d λ ) | ( t = 2 n u n T x t x t T u n ) 1 2 ( t = 2 n v n T ( x t exp ( d λ 0 ) x t 1 ) ( x t exp ( d λ 0 ) x t 1 ) T v n ) 1 2 d | λ 0 λ | n max 1 t n ( x t T X n 1 ( λ 0 ) x t ) 1 C n Δ n ( θ 0 , σ 0 ) o ( 1 ) 0 .
(5.25)

Similar to the proof of T 1 , we easily obtain

| u n T T 2 v n |0.
(5.26)

By the Cauchy-Schwarz inequality, Lemma 5.1 and noting N n λ (A), we have

| u n T T 3 v n | = | u n T t = 2 n ( exp ( d λ 0 ) exp ( d λ ) ) 2 x t 1 x t 1 T v n | max | exp ( d λ 0 ) exp ( d λ ) | 2 ( t = 2 n u n T x t x t T u n t = 2 n v n T x t x t T v n ) 1 2 n | λ 0 λ | 2 max 1 t n ( x t T X n 1 ( λ 0 ) x t ) n A 2 Δ n ( θ 0 , σ 0 ) o ( 1 ) 0 .
(5.27)

Hence, (5.23) follows from (5.24)-(5.27).

For the second step, we will show that

W 12 p 0.
(5.28)

Note that

ε t = y t x t T β= x t T ( β 0 β)+ e t
(5.29)

and

ε t exp(d λ 0 ) ε t 1 = ( x t exp ( d λ 0 ) x t 1 ) T ( β 0 β)+ σ 0 1 exp ( 2 d λ 0 ) 2 λ 0 η t .
(5.30)

Write

J = 1 exp ( 2 d λ 0 ) 2 λ 0 X n 1 2 ( λ 0 ) ( σ 2 2 Ψ n β λ ) = 1 exp ( 2 d λ 0 ) 2 λ 0 X n 1 2 ( λ 0 ) 2 d λ exp ( λ d ) 1 exp ( 2 λ d ) t = 2 n ( ε t 1 x t + ε t x t 1 2 exp ( λ d ) x t 1 ε t 1 ) 1 exp ( 2 d λ 0 ) 2 λ 0 X n 1 2 ( λ 0 ) 1 ( 1 + 2 d λ ) exp ( 2 λ d ) ( 1 exp ( 2 λ d ) ) 2 t = 2 n ( ε t exp ( λ d ) ε t 1 ) ( x t exp ( λ d ) x t 1 ) = 1 exp ( 2 d λ 0 ) 2 λ 0 2 d λ exp ( λ d ) 1 exp ( 2 λ d ) X n 1 2 ( λ 0 ) ( T 1 + T 2 + 2 T 3 + 2 T 4 + 2 T 5 ) 1 exp ( 2 d λ 0 ) 2 λ 0 1 ( 1 + 2 d λ ) exp ( 2 λ d ) ( 1 exp ( 2 λ d ) ) 2 X n 1 2 ( λ 0 ) T 6 ,
(5.31)

where

T 1 = t = 2 n x t 1 T ( β 0 β ) ( x t exp ( λ 0 d ) x t 1 ) , T 2 = t = 2 n ( x t exp ( λ 0 d ) x t 1 ) T ( β 0 β ) x t 1 , T 3 = t = 2 n ( exp ( λ 0 d ) exp ( λ d ) ) x t 1 T ( β 0 β ) x t 1 , T 4 = σ 1 exp ( 2 λ d ) 2 λ t = 2 n η t x t 1 , T 5 = t = 2 n e t 1 x t 1 , T 6 = σ 1 exp ( 2 λ d ) 2 λ t = 2 n η t ( x t exp ( λ d ) x t 1 ) .

For β N n β (A) and each A>0, we have

| ( β 0 β ) T x t | 2 = ( β 0 β ) T X n 1 2 ( λ 0 ) X n 1 2 ( λ 0 ) x t x t T X n T 2 ( λ 0 ) X n T 2 ( λ 0 ) ( β 0 β ) max 1 t n ( x t T X n 1 ( λ 0 ) x t ) ( β 0 β ) T X n ( λ 0 ) ( β 0 β ) A 2 max 1 t n ( x t T X n 1 ( λ 0 ) x t ) .
(5.32)

By (5.32) and Lemma 5.1, we have

sup β N n β ( A ) max 1 t n | ( β 0 β ) T x t |0,n,A>0.
(5.33)

Using the Cauchy-Schwarz inequality and (5.33), we obtain

u n T T 1 = t = 2 n u n T x t 1 T ( β 0 β ) ( x t exp ( λ 0 d ) x t 1 ) { t = 2 n ( x t 1 T ( β 0 β ) ) 2 } 1 2 { t = 2 n u n T ( x t exp ( λ 0 d ) x t 1 ) ( x t exp ( λ 0 d ) x t 1 ) T u n } 1 2 n max 1 t n | ( β 0 β ) T x t | = o ( n ) .
(5.34)

Using a similar argument as T 1 , we obtain that

u n T T 2 = o p ( n ).
(5.35)

By the Cauchy-Schwarz inequality and (5.33), (5.25), we get

u n T T 3 = t = 2 n ( exp ( λ 0 d ) exp ( λ d ) ) x t 1 T ( β 0 β ) u n T x t 1 { t = 2 n ( exp ( λ 0 d ) exp ( λ d ) ) 2 ( x t 1 T ( β 0 β ) ) 2 t = 2 n ( u n T x t 1 ) 2 } 1 2 C | λ 0 λ | { t = 2 n ( x t 1 T ( β 0 β ) ) 2 t = 2 n ( u n T x t 1 ) 2 } 1 2 C A Δ n ( θ 0 , σ 0 ) n o ( 1 ) o ( n ) = o ( n ) .
(5.36)

By (5.25), we have

Var ( u n T T 4 ) = σ 2 1 exp ( 2 λ d ) 2 λ t = 2 n ( u n T x t 1 ) 2 =o(n).
(5.37)

Thus, by the Chebychev inequality and (5.37),

u n T T 4 = o p ( n ).
(5.38)

By Lemma 5.1 and (2.3), we have

Var ( u n T T 5 ) = Var ( t = 2 n u n T x t e t 1 ) = σ 0 2 1 exp ( 2 λ 0 d ) 2 λ 0 Var { j = 1 n 1 ( t = j + 1 n u n T x t exp { λ 0 d ( t 1 j ) } ) η j } = σ 0 2 1 exp ( 2 λ 0 d ) 2 λ 0 j = 1 n 1 ( t = j + 1 n u n T x t exp { λ 0 d ( t 1 j ) } ) 2 σ 0 2 1 exp ( 2 λ 0 d ) 2 λ 0 max 2 t n | u n T x t | j = 1 n 1 ( t = j + 1 n exp { λ 0 d ( t 1 j ) } ) 2 C max 2 t n | u n T x t | n = o ( n ) .
(5.39)

Thus, by the Chebychev inequality and (5.39),

u n T T 5 = o p ( n ).
(5.40)

Using a similar argument as T 4 , we obtain

u n T T 6 = o p ( n ).
(5.41)

Thus (5.28) follows immediately from (5.31), (5.34)-(5.36), (5.38), (5.40) and (5.41).

For the third step, we will show that

W 22 p 0.
(5.42)

Write that

J = σ 2 2 Ψ n λ 2 σ 2 2 Ψ n λ 2 | θ = θ 0 = σ 2 ( n 1 ) 2 λ 2 2 σ 2 ( n 1 ) d 2 exp ( 2 λ d ) ( 1 exp ( 2 λ d ) ) 2 + 2 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) t = 2 n ε t 1 2 + 2 d exp ( λ d ) [ ( 2 d λ ) d λ exp ( 2 λ d ) ] ( 1 exp ( 2 λ d ) ) 2 t = 2 n ( ε t exp ( λ d ) ε t 1 ) ε t 1 + 4 d exp ( 2 λ d ) [ d λ 1 + ( 1 + d λ ) exp ( 2 λ d ) ] ( 1 exp ( 2 λ d ) ) 3 t = 2 n ( ε t exp ( λ d ) ε t 1 ) 2 σ 0 2 ( n 1 ) 2 λ 0 2 + 2 σ 0 2 ( n 1 ) d 2 exp ( 2 λ 0 d ) ( 1 exp ( 2 λ 0 d ) ) 2 2 d 2 λ 0 exp ( 2 λ 0 d ) 1 exp ( 2 λ 0 d ) t = 2 n e t 1 2 2 d exp ( λ 0 d ) [ ( 2 d λ 0 ) d λ 0 exp ( 2 λ 0 d ) ] ( 1 exp ( 2 λ 0 d ) ) 2 t = 2 n ( e t exp ( λ 0 d ) e t 1 ) e t 1 4 d exp ( 2 λ 0 d ) [ d λ 0 1 + ( 1 + d λ 0 ) exp ( 2 λ 0 d ) ] ( 1 exp ( 2 λ 0 d ) ) 3 t = 2 n ( e t exp ( λ 0 d ) e t 1 ) 2 .
(5.43)

By (3.3) and (3.4), we obtain that

T 1 = σ 2 ( n 1 ) 2 λ 2 σ 0 2 ( n 1 ) 2 λ 0 2 = n 1 2 λ 2 λ 0 2 ( σ 2 ( λ 0 2 λ 2 ) + λ 2 ( σ 2 σ 0 2 ) ) = o ( n )
(5.44)

and

T 2 = 2 σ 0 2 ( n 1 ) d 2 exp ( 2 λ 0 d ) ( 1 exp ( 2 λ 0 d ) ) 2 2 σ 2 ( n 1 ) d 2 exp ( 2 λ d ) ( 1 exp ( 2 λ d ) ) 2 = 2 d 2 ( n 1 ) ( 1 exp ( 2 λ 0 d ) ) 2 ( 1 exp ( 2 λ d ) ) 2 { σ 0 ( exp ( λ 0 d ) exp ( λ d ) ) + exp ( λ d ) ( σ 0 σ ) + exp ( λ d λ 0 d ) [ σ ( exp ( λ 0 d ) exp ( λ d ) ) + exp ( λ d ) ( σ σ 0 ) ] } ( σ 0 exp ( λ 0 d ) ( 1 exp ( 2 λ d ) ) + σ exp ( λ d ) ( 1 exp ( 2 λ 0 d ) ) ) = o ( n ) .
(5.45)

By (5.29), we have

T 3 = 2 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) t = 2 n ε t 1 2 2 d 2 λ 0 exp ( 2 λ 0 d ) 1 exp ( 2 λ 0 d ) t = 2 n e t 1 2 = 2 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) t = 2 n { ( x t T ( β 0 β ) ) 2 + 2 x t T ( β 0 β ) e t + e t 2 } 2 d 2 λ 0 exp ( 2 λ 0 d ) 1 exp ( 2 λ 0 d ) t = 2 n e t 1 2 = 2 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) t = 2 n ( x t T ( β 0 β ) ) 2 + 2 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) t = 2 n 2 x t T ( β 0 β ) e t + { 2 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) 2 d 2 λ 0 exp ( 2 λ 0 d ) 1 exp ( 2 λ 0 d ) } t = 2 n e t 1 2 = 2 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) T 31 + 4 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) T 32 + T 33 .
(5.46)

By (5.32), it is easy to show that

T 31 =o(n).
(5.47)

By Lemma 5.1, (2.3) and (5.32), we have

Var ( T 32 ) = Var ( t = 2 n x t T ( β 0 β ) e t ) = Var { j = 1 n 1 ( t = j + 1 n x t T ( β 0 β ) exp { λ 0 d ( t 1 j ) } ) η j } = j = 1 n 1 ( t = j + 1 n x t T ( β 0 β ) exp { λ 0 d ( t 1 j ) } ) 2 max 2 t n | x t T ( β 0 β ) | j = 1 n 1 ( t = j + 1 n exp { λ 0 d ( t 1 j ) } ) 2 C max 2 t n | x t T ( β 0 β ) | n = o ( n ) .
(5.48)

Thus by the Chebychev inequality and (5.48),

T 32 = o p ( n ).
(5.49)

Write

2 d 2 λ exp ( 2 λ d ) 1 exp ( 2 λ d ) 2 d 2 λ 0 exp ( 2 λ 0 d ) 1 exp ( 2 λ 0 d ) = 2 d 2 ( 1 exp ( 2 λ d ) ) ( 1 exp ( 2 λ 0 d ) ) U ,
(5.50)

where

U=λexp(2λd) ( 1 exp ( 2 λ 0 d ) ) λ 0 exp(2 λ 0 d) ( 1 exp ( 2 λ d ) ) .

Note that

U = λ exp ( 2 λ d ) ( exp ( 2 λ d ) exp ( 2 λ 0 d ) ) + ( λ ( exp ( 2 λ d ) exp ( 2 λ 0 d ) ) + ( λ λ 0 ) exp ( 2 λ 0 d ) ) ( 1 exp ( 2 λ d ) ) = o ( 1 ) ,
(5.51)

so we have

T 33 =o(n).
(5.52)

Thus, by (5.46), (5.47), (5.49) and (5.52), we have

T 3 =o(n).
(5.53)

By (5.29), we have

T 4 = 2 d exp ( λ d ) [ ( 2 d λ ) d λ exp ( 2 λ d ) ] ( 1 exp ( 2 λ d ) ) 2 t = 2 n ( ε t exp ( λ d ) ε t 1 ) ε t 1 2 d exp ( λ 0 d ) [ ( 2 d λ 0 ) d λ 0 exp ( 2 λ 0 d ) ] ( 1 exp ( 2 λ 0 d ) ) 2 t = 2 n ( e t exp ( λ 0 d ) e t 1 ) e t 1 = 2 d exp ( λ d ) [ ( 2 d λ ) d λ exp ( 2 λ d ) ] ( 1 exp ( 2 λ d ) ) 2 σ 1 exp ( 2 λ d ) 2 λ t = 2 n x t 1 T ( β 0 β ) η t + { 2 d exp ( λ d ) [ ( 2 d λ ) d λ exp ( 2 λ d ) ] ( 1 exp ( 2 λ d ) ) 2 σ 1 exp ( 2 λ d ) 2 λ 2 d exp ( λ 0 d ) [ ( 2 d λ 0 ) d λ 0 exp ( 2 λ 0 d ) ] ( 1 exp ( 2 λ 0 d ) ) 2 σ 1 exp ( 2 λ d ) 2 λ } t = 2 n η t e t 1 = T 41 + T 42 .
(5.54)

It is easy to show that

T 41 =o(n).
(5.55)

Note that { η t e t 1 , H t } is a martingale difference sequence, so we have

Var ( t = 2 n η t e t 1 ) = t = 2 n E e t 1 2 = Δ n ( θ 0 , σ 0 ).

Hence,

T 42 =o(n).
(5.56)

By (5.54)-(5.56), we have

T 4 =o(n).
(5.57)

It is easily proved that

T 5 = 4 d exp ( 2 λ d ) [ d λ 1 + ( 1 + d λ ) exp ( 2 λ d ) ] ( 1 exp ( 2 λ d ) ) 3 t = 2 n ( ε t exp ( λ d ) ε t 1 ) 2 4 d exp ( 2 λ 0 d ) [ d λ 0 1 + ( 1 + d λ 0 ) exp ( 2 λ 0 d ) ] ( 1 exp ( 2 λ 0 d ) ) 3 t = 2 n ( e t exp ( λ 0 d ) e t 1 ) 2 = { 4 d exp ( 2 λ d ) [ d λ 1 + ( 1 + d λ ) exp ( 2 λ d ) ] ( 1 exp ( 2 λ d ) ) 3 σ 1 exp ( 2 λ d ) 2 λ 4 d exp ( 2 λ 0 d ) [ d λ 0 1 + ( 1 + d λ 0 ) exp ( 2 λ 0 d ) ] ( 1 exp ( 2 λ 0 d ) ) 3 σ 0 1 exp ( 2 λ 0 d ) 2 λ 0 } t = 2 n η t 2 = o ( n ) .
(5.58)

Hence, (5.42) follows immediately from (5.43)-(5.45), (5.53), (5.57) and (5.58). This completes the proof of (5.11) from (5.17), (5.23), (5.28) and (5.42).

It is well known that $\frac{\lambda(1-\exp(-2d\lambda_0))}{\lambda_0(1-\exp(-2d\lambda))}\to 1$ as $n\to\infty$. To prove (5.12), we need to show that

$$\frac{-\sigma^2\,\partial^2\Psi_n/\partial\lambda^2\big|_{\theta=\theta_0}}{\Delta_n(\theta_0,\sigma_0)} \xrightarrow{p} 1,\qquad n\to\infty.$$

This follows immediately from (2.20) and the Markov inequality.

Finally, we will prove (5.13). By (5.11) and (5.12), we have

D n 1 2 F(θ) D n T 2 p I m ,n
(5.59)

uniformly in θ N n (A) for each A>0. Thus, by Lemma 5.3,

λ min ( D n 1 2 F ( θ ) D n T 2 ) p 1,n.
(5.60)

This implies (5.13). □

Lemma 5.5 (Hall and Heyde [56])

Let $\{S_{ni}, \mathcal{F}_{ni}, 1\le i\le k_n, n\ge 1\}$ be a zero-mean, square-integrable martingale array with differences $X_{ni}$, and let $\eta^2$ be an a.s. finite random variable. Suppose that $\sum_i E\{X_{ni}^2 I(|X_{ni}|>\varepsilon)\mid \mathcal{F}_{n,i-1}\}\xrightarrow{p}0$ for all $\varepsilon>0$, and $\sum_i E\{X_{ni}^2\mid\mathcal{F}_{n,i-1}\}\xrightarrow{p}\eta^2$. Then

$$S_{nk_n} = \sum_i X_{ni} \xrightarrow{D} Z,$$

where the r.v. $Z$ has the characteristic function $E\{\exp(-\frac{1}{2}\eta^2 t^2)\}$.

6 Proof of theorems

Proof of Theorem 3.1 Take A>0, let

M n (A)= { θ R m + 1 : ( θ θ 0 ) T D n ( θ θ 0 ) = A 2 }
(6.1)

be the boundary of N n (A), and let θ M n (A). Using (2.19) and the Taylor expansion, for each σ 2 >0, we have

Ψ n ( θ , σ 2 ) = Ψ n ( θ 0 , σ 2 ) + ( θ θ 0 ) T Ψ n ( θ 0 , σ 2 ) θ + 1 2 ( θ θ 0 ) T 2 Ψ n ( θ 0 , σ 2 ) θ θ T ( θ θ 0 ) = 1 σ 2 Ψ n ( θ 0 , σ 2 ) + ( θ θ 0 ) T S n ( θ 0 ) 1 2 σ 2 ( θ θ 0 ) T F n ( θ ˜ ) ( θ θ 0 ) ,
(6.2)

where θ ˜ =aθ+(1a) θ 0 for some 0a1.

Let Q n (θ)= 1 2 ( θ θ 0 ) T F n ( θ ˜ )(θ θ 0 ) and v n (θ)= 1 A D n T 2 (θ θ 0 ). Take c>0 and θ M n (A), and by (6.2), we obtain that

P { Ψ n ( θ , σ 2 ) Ψ n ( θ 0 , σ 2 )  for some  θ M n ( A ) } P { ( θ θ 0 ) T S n ( θ 0 ) Q n ( θ ) , Q n ( θ ) > c A 2  for some  θ M n ( A ) } + P { Q n ( θ ) c A 2  for some  θ M n ( A ) } P { v n T ( θ ) D n 1 2 S n ( θ 0 ) > c A  for some  θ M n ( A ) } + P { v n T ( θ ) D n 1 2 F n ( θ ˜ ) D n T 2 v n ( θ ) c  for some  θ M n ( A ) } P { | D n 1 2 S n ( θ 0 ) | > c A } + P { inf θ N n ( A ) λ min ( D n 1 2 F n ( θ ˜ ) D n T 2 ) c } .
(6.3)

By Lemma 5.2 and the Chebychev inequality, we obtain

P { | D n 1 2 S n ( θ 0 ) | > c A } Var ( D n 1 2 S n ( θ 0 ) ) c 2 A 2 = σ 0 2 c 2 A 2 .
(6.4)

Let A, then c0, and using (5.13), we have

P { inf φ N n ( A ) λ min ( D n 1 2 F n ( θ ˜ ) D n T 2 ) c } 0.
(6.5)

By (6.3)-(6.5), we have

lim A lim inf n P { Ψ n ( θ , σ 2 ) < Ψ n ( θ 0 , σ 2 )  for all  θ M n ( A ) } =1.
(6.6)

By Lemma 5.3, λ min ( X n ( θ 0 )) as n. Hence λ min ( D n ). Moreover, from (5.13), we have

inf θ N n ( A ) λ min ( F n ( θ ) ) p .

This implies that Ψ n (θ, σ 2 ) is concave on N n (A). Noting this fact and (6.6), we get

lim A lim inf n P { sup θ M n ( A ) Ψ n ( θ , σ 2 ) < Ψ n ( θ 0 , σ 2 ) , Ψ n ( θ , σ 2 )  is concave on  N n ( A ) } = 1 .
(6.7)

On the event in the brackets, the continuous function Ψ n (θ, σ 2 ) has a unique maximum in θ over the compact neighborhood N n (A). Hence

lim A lim inf n P { S n ( θ ˆ n ( A ) ) = 0  for a unique  θ ˆ n ( A ) N n ( A ) } =1.

Moreover, there is a sequence A n such that θ ˆ n = θ ˆ ( A n ) satisfies

lim inf n P { S n ( θ ˆ n ) = 0  and  θ ˆ n  maximizes  Ψ n ( θ , σ 2 )  uniquely in  N n ( A ) } =1.

This θ ˆ n =( β ˆ n , λ ˆ n ) is a QML estimator for θ 0 . It is clearly consistent, and

lim A lim inf n P { θ ˆ n N n ( A ) } =1.

Since θ ˆ n =( β ˆ n , λ ˆ n ) are ML estimators for θ 0 , σ ˆ n 2 is an ML estimator for σ 0 2 from (2.9).

To complete the proof, we will show that σ ˆ n 2 σ 0 2 as n. If θ ˆ n N n (A), then β ˆ n N n β (A) and λ ˆ n N n λ (A).

By (2.12) and (2.1), we have

ε ˆ t exp( λ ˆ n d) ε ˆ t 1 = ( x t exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n )+ ( e t exp ( λ ˆ n d ) e t 1 ) .
(6.8)

By (2.9), (2.11) and (6.8), we have

( n 1 ) σ ˆ n 2 = 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ε ˆ t exp ( λ ˆ n d ) ε ˆ t 1 ) 2 = 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ε ˆ t exp ( λ ˆ n d ) ε ˆ t 1 ) { ( x t exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n ) + ( e t exp ( λ ˆ n d ) e t 1 ) } = 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ε ˆ t exp ( λ ˆ n d ) ε ˆ t 1 ) ( x t exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n ) + 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ε ˆ t exp ( λ ˆ n d ) ε ˆ t 1 ) ( e t exp ( λ ˆ n d ) e t 1 ) = 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ε ˆ t exp ( λ ˆ n d ) ε ˆ t 1 ) ( e t exp ( λ ˆ n d ) e t 1 ) .
(6.9)

From (6.8), it follows that

t = 2 n { ( x t exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n ) } 2 = t = 2 n ( ε ˆ t exp ( λ ˆ n d ) ε ˆ t 1 ) 2 2 t = 2 n ( ε ˆ t exp ( λ ˆ n d ) ε ˆ t 1 ) ( e t exp ( λ ˆ n d ) e t 1 ) + t = 2 n ( e t exp ( λ ˆ n d ) e t 1 ) 2 .
(6.10)

From (2.2), we get

t = 2 n ( e t exp ( λ ˆ n d ) e t 1 ) 2 = t = 2 n ( exp ( λ 0 d ) e t 1 + σ 0 1 exp ( 2 λ 0 d ) 2 λ 0 η t exp ( λ ˆ n d ) e t 1 ) 2 = σ 0 2 1 exp ( 2 λ 0 d ) 2 λ 0 t = 2 n η t 2 + t = 2 n ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) 2 e t 1 2 + 2 σ 0 1 exp ( 2 λ 0 d ) 2 λ 0 t = 2 n ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) η t e t 1 .
(6.11)

By (6.9)-(6.11), we have

( n 1 ) σ ˆ n 2 = 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( e t exp ( λ ˆ n d ) e t 1 ) 2 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ( x t exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n ) ) 2 = 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) σ 0 2 1 exp ( 2 λ 0 d ) 2 λ 0 t = 2 n η t 2 + 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) 2 e t 1 2 + 2 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) σ 0 1 exp ( 2 λ 0 d ) 2 λ 0 t = 2 n ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) η t e t 1 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ( x t exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n ) ) 2 = T 1 + T 2 + 2 T 3 T 4 .
(6.12)

By the law of large numbers and λ ˆ n p λ, we have

1 n 1 T 1 = 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) 1 exp ( 2 λ 0 d ) 2 λ 0 σ 0 2 1 n 1 t = 2 n η t 2 p σ 0 2 2 λ n 1 exp ( 2 λ n d ) 1 exp ( 2 λ 0 d ) 2 λ 0 = σ 0 2 ( n ) .
(6.13)

By the Markov inequality, and noting that E T 2 C A 2 , we obtain

1 n 1 T 2 p 0(n).
(6.14)

Since {(exp( λ 0 d)exp( λ ˆ n d)) η t e t 1 , H t 1 } is a martingale difference sequence with

Var { ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) η t e t 1 } = ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) 2 E e t 1 2 ,

so we have

Var ( T 3 ) = t = 2 n E ( ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) η t e t 1 ) 2 = t = 2 n ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) 2 E e t 1 2 C ( λ 0 λ ˆ n ) 2 t = 2 n E e t 1 2 C A 2 .
(6.15)

By the Chebychev inequality, we have

1 n 1 T 3 p 0(n).
(6.16)

By (5.33), we have

T 4 = t = 2 n ( ( x t T ( β 0 β ˆ n ) exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n ) ) 2 2 t = 2 n ( x t T ( β 0 β ˆ n ) ) 2 + t = 2 n ( exp ( λ ˆ n d ) x t 1 T ( β 0 β ˆ n ) ) 2 = o ( n ) .
(6.17)

From (6.12)-(6.14), (6.16) and (6.17), we have σ ˆ n 2 σ 0 2 .

We therefore complete the proof of Theorem 3.1. □

Proof of Theorem 3.2 From Theorem 3.1, $S_n(\hat\theta_n)=0$ and $F_n(\hat\theta_n)$ is nonsingular. By the Taylor expansion, we have

$$0 = S_n(\hat\theta_n) = S_n(\theta_0) - F_n(\tilde\theta_n)(\hat\theta_n - \theta_0).$$
(6.18)

Since θ ˆ n N n (A), also θ ˜ n N n (A). By (5.11), we have

$$F_n(\tilde\theta_n) = D_n^{1/2}(\Phi_n + \tilde A_n)D_n^{T/2},$$
(6.19)

where $\tilde A_n$ is a symmetric matrix with $\tilde A_n\xrightarrow{p}0$. By (6.18) and (6.19), we have

$$D_n^{T/2}(\hat\theta_n-\theta_0) = D_n^{T/2}F_n^{-1}(\tilde\theta_n)S_n(\theta_0) = (\Phi_n+\tilde A_n)^{-1}D_n^{-1/2}S_n(\theta_0).$$
(6.20)

Similar to (6.20), we have

F n ( θ ˆ n ) = D n 1 2 ( Φ n + A ˆ n ) D n T 2 = ( D n 1 2 ( Φ n + A ˆ n ) 1 2 ) ( ( Φ n + A ˆ n ) T 2 D n T 2 ) = F n 1 2 ( θ ˆ n ) F n T 2 ( θ ˆ n ) .
(6.21)

Here A ˆ n p 0. By (6.20), (6.21), and noting that σ ˆ n 2 p σ 0 2 and D n 1 2 S n ( θ 0 )= O p (1), we obtain that

F n T 2 ( θ ˆ n ) ( θ ˆ n θ 0 ) / σ ˆ n = ( Φ n + A ˆ n ) 1 2 ( Φ n + A ˜ n ) 1 D n 1 2 S n ( θ 0 ) / σ ˆ n = Φ n 1 2 D n 1 2 S n ( θ 0 ) / σ 0 + o p ( 1 ) .
(6.22)

From (2.7) and (2.8), we have

S n ( θ 0 ) σ 0 = 2 λ 0 1 exp ( 2 λ 0 d ) { t = 2 n η t ( x t exp ( λ 0 d ) x t 1 ) , d exp ( λ 0 d ) t = 2 n η t e t 1 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) ] 4 λ 0 2 t = 2 n ( η t 2 1 ) } .
(6.23)

From (5.14) and (5.15), we have

Φ n 1 2 D n 1 2 = ( ( λ ( 1 exp ( 2 d λ 0 ) ) λ 0 ( 1 exp ( 2 d λ ) ) ) 1 2 I d 0 0 Δ n ( θ 0 , σ 0 ) σ 2 2 Ψ n λ 2 | θ = θ 0 ) ( ( 2 λ 0 1 exp ( 2 d λ 0 ) ) 1 2 X n 1 2 ( λ 0 ) 0 0 1 Δ n ( θ 0 , σ 0 ) ) = ( ( 2 λ 1 exp ( 2 d λ ) ) 1 2 X n 1 2 ( λ 0 ) 0 0 1 σ 2 2 Ψ n λ 2 | θ = θ 0 ) .
(6.24)

By (6.23) and (6.24), we have

Φ n 1 2 D n 1 2 S n ( θ 0 ) / σ 0 = 2 λ 0 1 exp ( 2 λ 0 d ) { ( 2 λ 1 exp ( 2 d λ ) ) 1 2 t = 2 n η t X n 1 2 ( θ 0 ) ( x t exp ( λ 0 d ) x t 1 ) , 1 σ 2 2 Ψ n λ 2 | θ = θ 0 [ d exp ( λ 0 d ) t = 2 n η t e t 1 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) ] 4 λ 0 2 t = 2 n ( η t 2 1 ) ] } .
(6.25)

Let u R d with |u|=1, and

a t n =u ( 2 λ 1 exp ( 2 d λ ) ) 1 2 X n 1 2 ( λ 0 ) ( x t exp ( λ 0 d ) x t 1 ) .

Then max 2 t n a t n =o(1), and we will consider the limiting distribution of the following 2-vector

2 λ 0 1 exp ( 2 λ 0 d ) { t = 2 n a t n η t , 1 σ 2 2 Ψ n λ 2 | θ = θ 0 [ d exp ( λ 0 d ) t = 2 n η t e t 1 + 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) ] 4 λ 0 2 t = 2 n ( η t 2 1 ) ] } .
(6.26)

Note that

σ 2 2 Ψ n λ 2 | θ = θ 0 = O p ( Δ n ( θ 0 , σ 0 ) ) = O p (n).

Hence, by the Cramer-Wold device, it will suffice to find the asymptotic distribution of the following random

2 λ 0 1 exp ( 2 λ 0 d ) t = 2 n { u 1 a t n η t u 2 Δ n ( θ 0 , σ 0 ) [ d exp ( λ 0 d ) η t e t 1 + 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) ] 4 λ 0 2 ( η t 2 1 ) ] } = t = 2 n ζ t ,
(6.27)

where ( u 1 , u 2 ) R 2 with u 1 2 + u 2 2 =1. Note that

E { ζ t | H t 1 } = 2 λ 0 1 exp ( 2 λ 0 d ) { u 1 a t n E ( η t ) u 2 Δ n ( θ 0 , σ 0 ) [ d exp ( λ 0 d ) E ( η t ) e t 1 + 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) ] 4 λ 0 2 E ( η t 2 1 ) ] } = 0 , a.s. ,
(6.28)

so the sums in (6.27) are partial sums of a martingale triangular array to H t , and we will verify the Lindeberg conditions for their convergence to normality.

By (6.27), and noting that E η t 3 =0, E η t 4 =3 and λ N n λ (A), we have

t = 2 n E ( ζ t 2 | H t 1 ) = 2 λ 0 1 exp ( 2 λ 0 d ) { u 1 2 t = 2 n a t n 2 + u 2 2 1 Δ n ( θ 0 , σ 0 ) [ d 2 exp ( 2 λ 0 d ) t = 2 n e t 1 2 + 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 2 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) ] 2 16 λ 0 2 t = 2 n E ( η t 2 1 ) 2 ] 2 t = 2 n u 1 u 2 n d exp ( λ 0 d ) a t n e t 1 + 2 u 2 2 n d exp ( λ 0 d ) 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) ] 4 λ 0 2 E ( η t ( η t 2 1 ) ) e t 1 2 u 1 a t n u 2 n 2 λ 0 1 exp ( 2 λ 0 d ) σ 0 [ 1 ( 1 + 2 d λ 0 ) exp ( 2 λ 0 d ) ] 4 λ 0 2 E ( η t ( η t 2 1 ) ) } = u 1 2 u 2 2 λ 0 1 exp ( 2 λ 0 d ) ( 2 λ 1 exp ( 2 d λ ) ) 1 + u 2 2 + o p ( 1 ) + 0 + 0 = u 1 2 + u 2 2 + o p ( 1 ) = 1 + o p ( 1 ) .
(6.29)

Let a ˜ t n =min{ a t n , 1 Δ n ( θ 0 , σ 0 ) } and ζ t = a ˜ t n ζ ˜ t . Then a ˜ t n =o(1).

For any c>0,

t = 2 n E { ζ t 2 I ( | ζ t | > c ) | H t 1 } = t = 2 c y 2 d P { | a ˜ t n ζ ˜ t | y | H t 1 } = t = 2 n a ˜ t n 2 c a ˜ t n y 2 d p { | ζ ˜ t | y | H t 1 } = o ( 1 ) t = 2 n a ˜ t n 2 = o ( 1 ) O p ( 1 ) 0 , n .
(6.30)

This verifies the Lindeberg conditions, and by Lemma 5.5, we have

t = 2 n ζ t D N(0,1).

Thus we complete the proof of Theorem 3.2. □

Proof of Theorem 4.1 Note that λ ˆ 0 n λ 0 , λ ˆ n λ 0 . Similarly to the proof of Theorem 4.1(3) in Maller [55], by (6.12) and Theorem 3.2, we have

d ˜ ( n ) = 1 σ 0 2 { ( 2 λ ˆ 0 n 1 exp ( 2 λ ˆ 0 n d ) 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) ) σ 0 2 1 exp ( 2 λ 0 d ) 2 λ 0 t = 2 n η t 2 + t = 2 n { 2 λ ˆ 0 n 1 exp ( 2 λ ˆ 0 n d ) ( exp ( λ 0 d ) exp ( λ ˆ 0 n d ) ) 2 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) 2 } e t 1 2 + 2 σ 0 1 exp ( 2 λ 0 d ) 2 λ 0 t = 2 n { 2 λ ˆ 0 n 1 exp ( 2 λ ˆ 0 n d ) ( exp ( λ 0 d ) exp ( λ ˆ 0 n d ) ) 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) ( exp ( λ 0 d ) exp ( λ ˆ n d ) ) } η t e t 1 + 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ( x t exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n ) ) 2 } = 1 σ 0 2 2 λ ˆ n 1 exp ( 2 λ ˆ n d ) t = 2 n ( ( x t exp ( λ ˆ n d ) x t 1 ) T ( β 0 β ˆ n ) ) 2 + o ( 1 ) D χ 2 ( m ) .
(6.31)

 □

7 Empirical examples

In this section, we consider two empirical examples. The first (where $\beta$ is a one-dimensional unknown parameter, namely $m=1$) concerns the water flow of the Kootenay River in January, taken from Hampel et al. [[6], p.310]. The second (where $\beta$ is a four-dimensional unknown parameter, namely $m=4$) concerns the consumption of spirits in the United Kingdom, taken from Fuller [48].

7.1 Water flow of the Kootenay River

By the ordinary least squares method, we obtain that

$$\hat y_t = 9.51371 + 0.47476\,x_t + \hat\varepsilon_t$$
(7.1)

and

$$\varepsilon_t = 0.2077\,\varepsilon_{t-1} + \eta_t,\qquad t=1,2,\ldots,13,$$
(7.2)

where $\{\eta_t\}$ is a sequence of uncorrelated $(0, 1.5013^2)$ random variables.

By the Huber-Dutter (HD) method, we obtain the following model (see Hu [10]):

$$\hat y_t = 9.51371 + 0.4745\,x_t + \hat\varepsilon_t$$
(7.3)

and

$$\varepsilon_t = 0.3024\,\varepsilon_{t-1} + \eta_t,$$
(7.4)

where $\{\eta_t\}$ is a sequence of uncorrelated $(0, 1.0988^2)$ random variables.

By the ML method (taking $d=1$ and starting values $\lambda^{(0)}=1$, $(\sigma^2)^{(0)}=1.5$, $\beta^{(0)}=0.5$; here we use pattern search algorithms), we obtain the following model:

$$\hat y_t = 9.51371 + 0.48039\,x_t + \hat\varepsilon_t$$
(7.5)

and

$$\varepsilon_t = \exp(-1.80089)\,\varepsilon_{t-1} + 0.5184\,\eta_t,$$
(7.6)

where $\{\eta_t\}$ is a sequence of uncorrelated $(0,1)$ random variables.

By model (1.3), we obtain a general process { y t } satisfying the following SDE:

$$d(y_t - 9.51371) = \bigl(1.3455\,x_t + 9.51371 - y_t\bigr)\,dt + 0.9976\,dB_t.$$
(7.7)
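
The diffusion coefficient in (7.7) can be recovered from the fitted discrete recursion (7.6) by inverting the innovation-variance relation in (1.5), i.e. innovation s.d. $= \sigma\sqrt{(1-e^{-2\lambda d})/(2\lambda)}$. A minimal sketch of this conversion (NumPy assumed; the function name is ours):

```python
import numpy as np

def diffusion_coefficient(lam, innov_sd, d=1.0):
    """Recover sigma in the SDE from the fitted discrete recursion
    eps_t = exp(-lam*d)*eps_{t-1} + innov_sd*eta_t, using
    innov_sd = sigma * sqrt((1 - exp(-2*lam*d)) / (2*lam)) from (1.5)."""
    return innov_sd / np.sqrt((1.0 - np.exp(-2.0 * lam * d)) / (2.0 * lam))

print(diffusion_coefficient(1.80089, 0.5184))  # approximately 0.998, as in (7.7)
print(diffusion_coefficient(0.25319, 0.0196))  # approximately 0.022, as in (7.12) of Section 7.2
```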

Since $1.5013^2 > 1.0988^2 > 0.5184^2$, our results outperform those of the HD and least squares methods in terms of mean squared error (MSE).

By (4.7), we obtain $\tilde d(13) = 362.4137 > 6.63 = \chi^2_{1-0.01}(1)$. This shows that $\beta\neq 0$ at the significance level $\alpha=0.01$. Thus we should apply the linear regression model (1.1) with an Ornstein-Uhlenbeck error process, rather than the Ornstein-Uhlenbeck process alone, to these data.

This shows that our estimation method and testing approach work in the case $m=1$. The following example illustrates the case of a multidimensional parameter $\beta$.

7.2 Consumption of spirits in the UK

We will use the data studied by Fuller [48]. The data pertain to the consumption of spirits in the United Kingdom from 1870 to 1938. The dependent variable $y_t$ is the annual per capita consumption of spirits in the United Kingdom. The explanatory variables $x_{t1}$ and $x_{t2}$ are per capita income and the price of spirits, respectively, both deflated by a general price index. All data are in logarithms. The model suggested by Prest can be written as follows:

$$y_t = \beta_0 + \beta_1 x_{t1} + \beta_2 x_{t2} + \beta_3 x_{t3} + \beta_4 x_{t4} + \varepsilon_t,$$
(7.8)

where 1869 is the origin for $t$, $x_{t3} = t/100$, $x_{t4} = (t-35)^2/10^4$, and $\varepsilon_t$ is assumed to be a stationary time series.

Fuller [48] obtained the estimated generalized least squares equation

$$\hat y_t = 2.36 + 0.72\,x_{t1} - 0.80\,x_{t2} - 0.81\,x_{t3} - 0.92\,x_{t4}$$
(7.9)

and

$$\varepsilon_t = 0.7633\,\varepsilon_{t-1} + \eta_t,$$

where $\{\eta_t\}$ is a sequence of uncorrelated $(0, 0.000417)$ random variables.

Take $d=1$ and starting values

$$\lambda^{(0)} = 0.3,\qquad (\sigma^2)^{(0)} = 0.0004,\qquad \beta^{(0)} = (0.72,\,-0.80,\,-0.81,\,-0.92)^T.$$

Using our method, we obtain the following models:

$$\hat y_t = 2.36 + 0.73251\,x_{t1} - 0.80024\,x_{t2} - 0.86286\,x_{t3} - 0.60774\,x_{t4}$$
(7.10)

and

$$\varepsilon_t = \exp(-0.25319)\,\varepsilon_{t-1} + 0.0196\,\eta_t,$$
(7.11)

where $\{\eta_t\}$ is a sequence of uncorrelated $(0,1)$ random variables; or

$$d\varepsilon_t = -0.25319\,\varepsilon_t\,dt + 0.0221\,dB_t.$$
(7.12)

Since $0.000417 > 0.00038461$, our results outperform those of Fuller [48] in terms of MSE.

By (4.7), we obtain $\tilde d(69) = 100.2777 > 13.3 = \chi^2_{1-0.01}(4)$. This shows that $\beta\neq 0$ at the significance level $\alpha=0.01$.

References

1. Wang XM, Zhou W: Bootstrap approximation to the distribution of M-estimates in a linear model. Acta Math. Sin. Engl. Ser. 2004, 20(1): 93-104.
2. Anatolyev S: Inference in regression models with many regressors. J. Econom. 2012, 170: 368-382.
3. Bai ZD, Guo M: A paradox in least-squares estimation of linear regression models. Stat. Probab. Lett. 1999, 42: 167-174.
4. Chen X: Consistency of LS estimates of multiple regression under a lower order moment condition. Sci. China Ser. A 1995, 38(12): 1420-1431.
5. Gil GR, Engela B, Norberto C, Ana C: Least squares estimation of linear regression models for convex compact random sets. Adv. Data Anal. Classif. 2007, 1: 67-81.
6. Hampel FR, Ronchetti EM, Rousseeuw PJ, Stahel WA: Robust Statistics. Wiley, New York; 1986.
7. Cui H: On asymptotics of t-type regression estimation in multiple linear model. Sci. China Ser. A 2004, 47(4): 628-639.
8. Durbin L: A note on regression when there is extraneous information about one of the coefficients. J. Am. Stat. Assoc. 1953, 48: 799-808.
9. Li Y, Yang H: A new stochastic mixed ridge estimator in linear regression model. Stat. Pap. 2010, 51(2): 315-323.
10. Hu HC: Asymptotic normality of Huber-Dutter estimators in a linear model with AR(1) processes. J. Stat. Plan. Inference 2013, 143(3): 548-562.
11. Wu WB: M-estimation of linear models with dependent errors. Ann. Stat. 2007, 35(2): 495-521.
12. Fox R, Taqqu MS: Large sample properties of parameter estimates for strongly dependent stationary Gaussian time series. Ann. Stat. 1986, 14: 517-532.
13. Giraitis L, Surgailis D: A central limit theorem for quadratic forms in strongly dependent linear variables and its application to asymptotic normality of Whittle’s estimate. Probab. Theory Relat. Fields 1990, 86: 87-104.
14. Koul HL, Surgailis D: Asymptotic normality of the Whittle estimator in linear regression models with long memory errors. Stat. Inference Stoch. Process. 2000, 3: 129-147.
15. Shiohama T, Taniguchi M: Sequential estimation for time series regression models. J. Stat. Plan. Inference 2004, 123: 295-312.
16. Fan J: Moderate deviations for M-estimators in linear models with ϕ-mixing errors. Acta Math. Sin. Engl. Ser. 2012, 28(6): 1275-1294.
17. Ornstein LS, Uhlenbeck GE: On the theory of Brownian motion. Phys. Rev. 1930, 36: 823-841.
18. Janczura J, Orzel S, Wylomanska A: Subordinated α-stable Ornstein-Uhlenbeck process as a tool for financial data description. Physica A 2011, 390: 4379-4387.
19. Debbasch F, Mallick K, Rivet JP: Relativistic Ornstein-Uhlenbeck process. J. Stat. Phys. 1997, 88: 945-966.
20. Gillespie D: Exact numerical simulation of the Ornstein-Uhlenbeck process and its integral. Phys. Rev. E 1996, 54(2): 2084-2091.
21. Ditlevsen S, Lansky P: Estimation of the input parameters in the Ornstein-Uhlenbeck neuronal model. Phys. Rev. E 2005, 71(1): Article ID 011907.
22. Garbaczewski P, Olkiewicz R: Ornstein-Uhlenbeck-Cauchy process. J. Math. Phys. 2000, 41(10): 6843-6860.
23. Plastino AR, Plastino A: Non-extensive statistical mechanics and generalized Fokker-Planck equation. Physica A 1995, 222: 347-354.
24. Fasen V: Statistical estimation of multivariate Ornstein-Uhlenbeck processes and applications to co-integration. J. Econom. 2012. doi:10.1016/j.jeconom.2012.08.019
25. Yu J: Bias in the estimation of the mean reversion parameter in continuous time models. J. Econom. 2012, 169: 114-122.
26. Geman H: Commodities and Commodity Derivatives. Wiley, Chichester; 2005.
27. Zhang B, Grzelak LA, Oosterlee CM: Efficient pricing of commodity options with early-exercise under the Ornstein-Uhlenbeck process. Appl. Numer. Math. 2012, 62: 91-111.
28. Rieder S: Robust parameter estimation for the Ornstein-Uhlenbeck process. Stat. Methods Appl. 2012. doi:10.1007/s10260-012-0195-2
29. Iacus S: Simulation and Inference for Stochastic Differential Equations. Springer, New York; 2008.
30. Bishwal JPN: Uniform rate of weak convergence of the minimum contrast estimator in the Ornstein-Uhlenbeck process. Methodol. Comput. Appl. Probab. 2010, 12: 323-334.
31. Shimizu Y: Local asymptotic mixed normality for discretely observed non-recurrent Ornstein-Uhlenbeck processes. Ann. Inst. Stat. Math. 2012, 64: 193-211.
32. Zhang S, Zhang X: A least squares estimator for discretely observed Ornstein-Uhlenbeck processes driven by symmetric α-stable motions. Ann. Inst. Stat. Math. 2012. doi:10.1007/s10463-012-0362-0
33. Chronopoulou A, Viens FG: Estimation and pricing under long-memory stochastic volatility. Ann. Finance 2012, 8: 379-403.
34. Lin H, Wang J: Successful couplings for a class of stochastic differential equations driven by Levy processes. Sci. China Math. 2012, 55(8): 1735-1748.
35. Xiao W, Zhang W, Zhang X: Minimum contrast estimator for fractional Ornstein-Uhlenbeck processes. Sci. China Math. 2012, 55(7): 1497-1511.
36. Magdalinos T: Mildly explosive autoregression under weak and strong dependence. J. Econom. 2012, 169: 179-187.
37. Andrews DWK, Guggenberger P: Asymptotics for LS, GLS, and feasible GLS statistics in an AR(1) model with conditional heteroskedasticity. J. Econom. 2012, 169: 196-210.
38. Fan J, Yao Q: Nonlinear Time Series: Nonparametric and Parametric Methods. Springer, New York; 2005.
39. Berk KN: Consistent autoregressive spectral estimates. Ann. Stat. 1974, 2: 489-502.
40. Goldenshluger A, Zeevi A: Non-asymptotic bounds for autoregressive time-series modeling. Ann. Stat. 2001, 29: 417-444.
41. Liebscher E: Strong convergence of estimators in nonlinear autoregressive models. J. Multivar. Anal. 2003, 84: 247-261.
42. Baran S, Pap G, Zuijlen MV: Asymptotic inference for unit roots in spatial triangular autoregression. Acta Appl. Math. 2007, 96: 17-42.
43. Distaso W: Testing for unit root processes in random coefficient autoregressive models. J. Econom. 2008, 142: 581-609.
44. Harvill JL, Ray BK: Functional coefficient autoregressive models for vector time series. Comput. Stat. Data Anal. 2008, 50: 3547-3566.
45. Dehling H, Franke B, Kott T: Drift estimation for a periodic mean reversion process. Stat. Inference Stoch. Process. 2010, 13: 175-192.
46. Maller RA: Asymptotics of regressions with stationary and nonstationary residuals. Stoch. Process. Appl. 2003, 105: 33-67.
47. Pere P: Adjusted estimates and Wald statistics for the AR(1) model with constant. J. Econom. 2000, 98: 335-363.
48. Fuller WA: Introduction to Statistical Time Series. 2nd edition. Wiley, New York; 1996.
49. Chambers MJ: Jackknife estimation of stationary autoregressive models. J. Econom. 2012. doi:10.1016/j.jeconom.2012.09.003
50. Hamilton JD: Time Series Analysis. Princeton University Press, Princeton; 1994.
51. Brockwell PJ, Davis RA: Time Series: Theory and Methods. Springer, New York; 1987.
52. Abadir KM, Lucas A: A comparison of minimum MSE and maximum power for the nearly integrated non-Gaussian model. J. Econom. 2004, 119: 45-71.
53. Fan JQ, Jiang JC: Nonparametric inference with generalized likelihood ratio tests. Test 2007, 16: 409-444.
54. Rao CR: Linear Statistical Inference and Its Applications. Wiley, New York; 1973.
55. Maller RA: Quadratic negligibility and the asymptotic normality of operator normed sums. J. Multivar. Anal. 1993, 44: 191-219.
56. Hall P, Heyde CC: Martingale Limit Theory and Its Application. Academic Press, New York; 1980.


Acknowledgements

This work was supported by the Natural Science Foundation of China (No. 41374017), and Science and Technology Research Projects of the Educational Department of Hubei Province (No. Q20142501).

Author information

Correspondence to Xiong Pan.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors contributed equally to the writing of this paper. All authors read and approved the final manuscript.

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

Cite this article

Hu, H., Pan, X. & Xu, L. Maximum likelihood estimators in linear regression models with Ornstein-Uhlenbeck process. J Inequal Appl 2014, 301 (2014). https://doi.org/10.1186/1029-242X-2014-301