The Berry-Esséen bounds for kernel density estimator under dependent sample

Yang, Wenzhi; Hu, Shuhe

doi:10.1186/1029-242X-2012-287

Research
Open access
Published: 07 December 2012

The Berry-Esséen bounds for kernel density estimator under dependent sample

Wenzhi Yang¹ &
Shuhe Hu¹

Journal of Inequalities and Applications volume 2012, Article number: 287 (2012) Cite this article

2033 Accesses
5 Citations
Metrics details

Abstract

Let ${X_{n}}_{n \geq 1}$ be a φ-mixing sequence with an unknown common probability density function $f (x)$ and the mixing coefficients satisfy $φ (n) = O (n^{- 18 / 5})$ . By using some inequalities for φ-mixing random variables and selecting some positive bandwidths $h_{n}$ , we investigate the Berry-Esséen bounds of the estimator $f_{n} (x)$ for $f (x)$ and its bounds are presented as $O (n^{- 1 / 6} \cdot log n \cdot log log n)$ and $O (n^{- 1 / 6} \cdot log n \cdot log log n) + O (h_{n}^{δ}) + O (h_{n}^{13 (1 - δ) / 5})$ , where $0 < δ < 1$ .

MSC:62G05, 62G07.

1 Introduction

The most popular nonparametric estimator of a distribution based on a sample of observations is the empirical distribution, and the most popular method of nonparametric density estimation is the kernel method. For an introduction and applications of this field, the books by Prakasa Rao [1] and Silverman [2] provide the basic methods for density estimation. For the nonparametric curve estimation from time series such as φ-mixing, ρ-mixing and α-mixing, Györfi et al. [3] studied the density estimator and hazard function estimator for these mixing sequences. It is known that φ-mixing ⇒ ρ-mixing ⇒ α-mixing, and its converse is not true. Although, φ-mixing is stronger than α-mixing, some properties of φ-mixing such as moment inequality, exponential inequality, etc., are better than those of α-mixing to use. For the properties and examples of mixing, we can read the book of Doukhan [4]. In this paper, we only give the definition of a φ-mixing sequence. For the basic properties of φ-mixing, one can refer to Billingsley [5].

Denote $F_{n}^{m} = σ (X_{i}, n \leq i \leq m)$ and define the coefficients as follows:

φ (n) = sup_{m \geq 1} sup_{A \in F_{1}^{m}, B \in F_{m + n}^{\infty}, P (A) \neq 0} | P (B | A) - P (B) | .

If $φ (n) ↓ 0$ as $n \to \infty$ , then ${X_{n}}_{n \geq 1}$ is said to be a φ-mixing sequence.

Many works have been done for the kernel density estimation. For example, Masry [6] gave the recursive probability density estimation under a mixing-dependent sample, Fan and Yao [7] summarized the nonparametric and parametric methods including a nonparametric density estimator for nonlinear time series such as φ-mixing, α-mixing, etc. For an independent sample, Cao [8] investigated the bootstrap approximations in a nonparametric density estimator and obtained Berry-Esséen bounds for the kernel density estimation. Under φ-mixing dependence errors, Li et al. [9] obtained the asymptotic normality of a wavelet estimator of the regression model. Li et al. [10] also gave the Berry-Esséen bound of a wavelet estimator of the regression model. Meanwhile, Yang et al. [11] studied the Berry-Esséen bound of sample quantiles for φ-mixing random variables. In this paper, we will investigate the Berry-Esséen bounds for a kernel density estimator under a φ-mixing dependent sample.

Let ${X_{n}}_{n \geq 1}$ be a φ-mixing sequence with an unknown common probability density function $f (x)$ and the mixing coefficients satisfy $φ (n) = O (n^{- 18 / 5})$ . With the help of techniques of inequalities such as moment inequality, exponential inequality and the Bernstein’s big-block and small-block procedure, by selecting some positive bandwidths $h_{n}$ , which do not depend on the mixing coefficients and the lengths of Bernstein’s big-block and small-block, we investigate the Berry-Esséen bounds of the estimator $f_{n} (x)$ for $f (x)$ and its bounds are presented as $O (n^{- 1 / 6} \cdot log n \cdot log log n)$ and $O (n^{- 1 / 6} \cdot log n \cdot log log n) + O (h_{n}^{δ}) + O (h_{n}^{13 (1 - δ) / 5})$ , where $0 < δ < 1$ . Particularly, if $δ = 13 / 18$ and $h_{n} = n^{- 16 / 69}$ , the bound is presented as $O (n^{- 1 / 6} \cdot log n \cdot log log n)$ . For details, please see our results in Section 3. Some assumptions and lemmas are presented in Section 2. Regarding the technique of Bernstein’s big-block and small-block procedure, the reader can refer to Masry [6, 12], Fan and Yao [7], Roussas [13] and the references therein.

For the kernel density estimator under association and a negatively associated sample, one can refer to Roussas [13] and Liang and Baek [14] obtained for asymptotic normality, Wei [15] for the consistences, Henriques and Oliveira [16] for exponential rates, Liang and Baek [17] for the Berry-Esséen bounds, etc. Regarding other works about the Berry-Esséen bounds, we can refer to Chang and Rao [18] for the Kaplan-Meier estimator, Cai and Roussas [19] for the smooth estimator of a distribution function, Yang [20] for the regression weighted estimator, Dedecker and Prieur [21] for some new dependence coefficients, examples and applications to statistics, Yang et al. [22] for sample quantiles under negatively associated sample, Herve et al. [23] for M-estimators of geometrically ergodic Markov chains, and so on. On the other hand, Härdle et al. [24] summarized the Berry-Esséen bounds of partially linear models (see Chapter 5 of Härdle et al. [24]).

Throughout the paper, $c, c_{1}, c_{2}, \dots, C, M_{0}$ denote some positive constants not depending on n, which may be different in various places, $⌊ x ⌋$ means the largest integer not exceeding x and $I (A)$ is the indicator function of the set A. Let $c (x)$ be some positive constant depending only on x. For convenience, we denote $c = c (x)$ in this paper, whose value may vary at different places.

2 Some assumptions and lemmas

For the unknown common probability density function $f (x)$ , we assume that

f (x) \in C_{s, α},

(2.1)

where α is a positive constant and $C_{s, α}$ is a family of probability density functions having derivatives of s th order, $f^{(s)} (x)$ are continuous and $| f^{(s)} (x) | \leq α$ , $s = 0, 1, 2, \dots$ .

Let $K (\cdot)$ be a kernel function in R and satisfy the following condition (A₁):

(A₁) Assume that $K (\cdot)$ is a bounded probability density function and $K (\cdot) \in H_{s}$ , where $H_{s}$ is a class of functions $K (\cdot)$ with the properties

\int_{- \infty}^{\infty} u^{r} K (u) d u = 0, r = 1, 2, \dots, s - 1, \int_{- \infty}^{\infty} u^{s} K (u) d u = A \neq 0 .

(2.2)

Here A is a finite constant and s is a positive integer for $s \geq 2$ .

Obviously, the probability density functions Gaussian kernel $K (x) = {(2 π)}^{- \frac{1}{2}} exp {- \frac{x^{2}}{2}}$ and Epanechnikov kernel $K (x) = \frac{3}{20 \sqrt{5}} (5 - x^{2}) I$ ( $| x | \leq \sqrt{5}$ ) belong to $H_{2}$ . For more details, one can refer to Chapter 2 of Prakasa Rao [1].

For a fixed x, the kernel-type estimator of $f (x)$ is defined as

f_{n} (x) = \frac{1}{n h_{n}} \sum_{i = 1}^{n} K (\frac{x - X_{i}}{h_{n}}),

(2.3)

where $h_{n}$ is a sequence of positive bandwidths tending to zero as $n \to \infty$ .

Similar to the proof of Theorem 2.2 of Wei [15], we have, by using Taylor’s expansion for $f (x - h_{n} u)$ , that

f (x - h_{n} u) = f (x) - f^{'} (x) h_{n} u + \dots + \frac{f^{(s - 1)} (x)}{(s - 1)!} {(- h_{n} u)}^{s - 1} + \frac{f^{(s)} (x - ξ h_{n} u)}{s!} {(- h_{n} u)}^{s},

where $0 < ξ < 1$ . By (2.1) and (2.2), it follows

| E f_{n} (x) - f (x) | \leq \int_{- \infty}^{\infty} | K (u) h_{n}^{s} u^{s} | \cdot | \frac{f^{(s)} (x - ξ h_{n} u)}{s!} | d u \leq c h_{n}^{s},

which yields

| E f_{n} (x) - f (x) | = O (h_{n}^{s}) .

For $s \geq 2$ , one can get the ‘bias’ term rate as

\sqrt{n h_{n}} | E f_{n} (x) - f (x) | \leq c n^{1 / 2} h_{n}^{(2 s + 1) / 2},

by providing $n^{1 / 2} h_{n}^{(2 s + 1) / 2} \to 0$ .

It can be checked that $K (x) = {(2 π)}^{- \frac{1}{2}} exp {- \frac{x^{2}}{2}}$ and $K (x) = \frac{3}{20 \sqrt{5}} (5 - x^{2}) I$ ( $| x | \leq \sqrt{5}$ ) belong to $H_{2}$ . So, with $s = 2$ , one can see that $h_{n} = n^{- 1 / 4}$ satisfies the conditions $0 < h_{n} \to 0$ and $n^{1 / 2} h_{n}^{(2 s + 1) / 2} \to 0$ as $n \to \infty$ . Consequently, we pay attention to the Berry-Esséen bound of the centered variate as

\sqrt{n h_{n}} (f_{n} (x) - E f_{n} (x))

in this paper.

Similar to Masry [6] and Roussas [13], we give the following assumption.

(A₂) Assume that $f (x, y, k)$ are the joint p.d.f. of the random variables $X_{j}$ and $X_{j + k}$ , $j = 1, 2, \dots$ , which satisfy

sup_{x, y} | f (x, y, k) - f (x) f (y) | \leq M_{0}, for k \geq 1 .

Under the assumption (A₂) and other conditions, Masry [6] gave the asymptotic normality for the density estimator under a mixing dependent sample and Roussas [13] obtained the asymptotic normality for the kernel density estimator under an association sample. Unlike the mixing case, association and negatively associated random variables $X_{1}, X_{2}, \dots, X_{n}$ are subject to the transformation $K (\frac{x - X_{i}}{h_{n}})$ , $i = 1, 2, \dots, n$ , losing in the process the association or negatively associated property, i.e., the kernel weights $K (\frac{x - X_{i}}{h_{n}})$ , $i = 1, 2, \dots, n$ , are not necessarily association or negatively associated random variables (see Roussas [13] and Liang and Baek [14, 17]). In addition, if $K (x) = \frac{1}{2} I$ ( $- 1 \leq x \leq 1$ ), which is a function of bounded variation, then $K (x) = K_{1} (x) - K_{2} (x)$ , where $K_{1} (x) = \frac{1}{2} I$ ( $x \leq 1$ ) and $K_{2} (x) = \frac{1}{2} I$ ( $x < - 1$ ) are bounded and monotone nonincreasing functions. Although the transformations ${K_{1} (\frac{x - X_{i}}{h_{n}}), 1 \leq i \leq n}$ and ${K_{2} (\frac{x - X_{i}}{h_{n}}), 1 \leq i \leq n}$ are also the association or negatively associated random variables, $K_{1} (x)$ and $K_{2} (x)$ are not integrable in R. So, there are some difficulties in investigating the kernel density estimator under these dependent samples. Meanwhile, the nonparametric estimation and nonparametric tests for association and negatively associated random variables can be found in Prakasa Rao [25].

In order to obtain the Berry-Esséen bounds for the kernel density estimator under a φ-mixing sample, we give some useful inequalities such as covariance inequality, moment inequality, characteristic function inequality and exponential inequality for a φ-mixing sequence.

Lemma 2.1 (Billingsley [5], inequality (20.28), p.171)

If $E | ξ | < \infty$ and $P (| η | > C) = 0$ (ξ measurable $M_{- \infty}^{k}$ and η measurable $M_{k + n}^{\infty}$ ), then

| E (ξ η) - E ξ E η | \leq 2 C φ (n) E | ξ | .

Lemma 2.2 (Yang [26], Lemma 2)

Let ${X_{n}}_{n \geq 1}$ be a mean zero φ-mixing sequence with $\sum_{n = 1}^{\infty} φ^{1 / 2} (n) < \infty$ . Assume that there exists some $p \geq 2$ such that $E {| X_{n} |}^{p} < \infty$ for all $n \geq 1$ . Then

E | \sum_{i = 1}^{n} X_{i} |^{p} \leq C {\sum_{i = 1}^{n} E {| X_{i} |}^{p} + {(\sum_{i = 1}^{n} E X_{i}^{2})}^{p / 2}}, n \geq 1,

where C is a positive constant depending only on $φ (\cdot)$ .

Lemma 2.3 (Li et al. [9], Lemma 3.4)

Let ${X_{n}}_{n \geq 1}$ be a φ-mixing sequence. Suppose that p and q are two positive integers. Set $η_{l} = \sum_{j = (l - 1) (p + q) + 1}^{(l - 1) (p + q) + p} X_{j}$ for $1 \leq l \leq k$ . Then

| E exp {i t \sum_{l = 1}^{k} η_{l}} - \prod_{l = 1}^{k} E exp {i t η_{l}} | \leq C | t | φ (q) \sum_{l = 1}^{k} E | η_{l} | .

Lemma 2.4 Let X and Y be random variables. Then for any $a > 0$ ,

sup_{t} | P (X + Y \leq t) - Φ (t) | \leq sup_{t} | P (X \leq t) - Φ (t) | + \frac{a}{\sqrt{2 π}} + P (| Y | > a) .

Remark 2.1 Lemma 2.4 is due to Petrov (Petrov [27], Lemma 1.9, p.20 and p.36, lines 19-20). It can also be found in Lemma 2 of Chang and Rao [18].

Lemma 2.5 (Yang et al. [11], Corollary A.1)

Let ${X_{n}}_{n \geq 1}$ be a mean zero φ-mixing sequence with $| X_{n} | \leq d < \infty$ , a.s., for all $n \geq 1$ . For $0 < λ < 1$ , let $m = ⌊ n^{λ} ⌋$ and $Δ_{2} = \sum_{i = 1}^{n} E X_{i}^{2}$ . Then for $\forall ε > 0$ and $n \geq 2$ ,

P (| \sum_{i = 1}^{n} X_{i} | \geq ε) \leq 2 e C_{1} exp {- \frac{ε^{2}}{2 C_{2} (2 Δ_{2} + n^{λ} d ε)}},

where $C_{1} = exp {2 e n^{1 - λ} φ (m)}$ , $C_{2} = 4 [1 + 4 \sum_{i = 1}^{2 m} φ^{1 / 2} (i)]$ .

3 Main results

Theorem 3.1 For $s \geq 2$ , let the condition (A₁) hold true. Assume that ${X_{n}}_{n \geq 1}$ is a sequence of identically distributed φ-mixing random variables with the mixing coefficients $φ (n) = O (n^{- 18 / 5})$ . If $h_{n}^{- 1 / 2} \leq c n^{8 / 69}$ , $0 < h_{n} \to 0$ as $n \to \infty$ and ${lim inf}_{n \to \infty} {n h_{n} Var (f_{n} (x))} = σ_{1}^{2} (x) > 0$ , then

(3.1)

where $Φ (\cdot)$ is the standard normal distribution function.

Proof It can be found that

\frac{\sqrt{n h_{n}} (f_{n} (x) - E f_{n} (x))}{\sqrt{Var (\sqrt{n h_{n}} f_{n} (x))}} = \frac{\sum_{i = 1}^{n} Z_{n, i} (x)}{\sqrt{Var (\sum_{i = 1}^{n} Z_{n, i} (x))}},

(3.2)

where $Z_{n, i} (x) = \frac{1}{\sqrt{h_{n}}} [K (\frac{x - X_{i}}{h_{n}}) - E K (\frac{x - X_{i}}{h_{n}})]$ . We employ the Bernstein’s big-block and small-block procedure to prove (3.1). Denote

μ = μ_{n} = ⌊ n^{2 / 3} ⌋, ν = ν_{n} = ⌊ n^{1 / 6} ⌋, k = k_{n} = ⌊ \frac{n}{μ_{n} + ν_{n}} ⌋ = ⌊ n^{1 / 3} ⌋,

(3.3)

and ${\tilde{Z}}_{n, i} (x) = Z_{n, i} (x) / \sqrt{Var (\sum_{i = 1}^{n} Z_{n, i} (x))}$ . Define $η_{j}$ , $ξ_{j}$ , $ζ_{k}$ as follows:

(3.4)

(3.5)

(3.6)

By (3.2), (3.4), (3.5) and (3.6), one has

S_{n} = \frac{\sum_{i = 1}^{n} Z_{n, i} (x)}{\sqrt{Var (\sum_{i = 1}^{n} Z_{n, i} (x))}} = \sum_{j = 0}^{k - 1} η_{j} + \sum_{j = 0}^{k - 1} ξ_{j} + ζ_{k} = S_{n}^{'} + S_{n}^{''} + S_{n}^{'''} .

(3.7)

From (3.5) and (3.7), it follows

E {[S_{n}^{''}]}^{2} = Var [\sum_{j = 0}^{k - 1} ξ_{j}] = \sum_{j = 0}^{k - 1} Var [ξ_{j}] + 2 \sum_{0 \leq i < j \leq k - 1} Cov (ξ_{i}, ξ_{j}) : = I_{1} + I_{2} .

(3.8)

We have by (2.1) and (A₁) that

E Z_{n, i}^{2} (x) = E Z_{n, 1}^{2} (x) \leq c_{1} h_{n}^{- 1} E K^{2} (\frac{x - X_{1}}{h_{n}}) = c_{1} h_{n}^{- 1} \int_{- \infty}^{\infty} K^{2} (\frac{x - u}{h_{n}}) f (u) d u \leq c_{2} .

So, by the conditions ${lim inf}_{n \to \infty} {n h_{n} Var (f_{n} (x))} = {lim inf}_{n \to \infty} {n^{- 1} Var (\sum_{i = 1}^{n} Z_{n, i} (x))} = σ_{1}^{2} (x) > 0$ , $φ (n) = O (n^{- 18 / 5})$ and $E Z_{n, i} (x) = 0$ , we apply Lemma 2.2 with $p = 2$ and obtain that

Var [ξ_{j}] = E {[\sum_{i = j (μ + ν) + 1}^{(j + 1) (μ + ν)} {\tilde{Z}}_{n, i} (x)]}^{2} \leq \frac{c_{3}}{n} E {[\sum_{i = j (μ + ν) + 1}^{(j + 1) (μ + ν)} Z_{n, i} (x)]}^{2} \leq \frac{c_{4}}{n} ν_{n} .

Consequently,

I_{1} = \sum_{j = 0}^{k - 1} Var [ξ_{j}] \leq \frac{c_{3} k_{n} ν_{n}}{n} = O (n^{- 1 / 2}) .

(3.9)

Meanwhile, one has $| {\tilde{Z}}_{n, i} (x) | \leq c_{1} n^{- 1 / 2} h_{n}^{- 1 / 2}$ , $E | {\tilde{Z}}_{n, i} (x) | \leq c_{2} n^{- 1 / 2} h_{n}^{1 / 2}$ , $1 \leq i \leq n$ . With $λ_{j} = j (μ_{n} + ν_{n}) + μ_{n}$ ,

I_{2} = 2 \sum_{0 \leq i < j \leq k - 1} Cov (ξ_{i}, ξ_{j}) = 2 \sum_{0 \leq i < j \leq k - 1} \sum_{l_{1} = 1}^{ν_{n}} \sum_{l_{2} = 1}^{ν_{n}} Cov [{\tilde{Z}}_{n, λ_{i} + l_{1}} (x), {\tilde{Z}}_{n, λ_{j} + l_{2}} (y)],

but since $i \neq j$ , $| λ_{i} - λ_{j} + l_{1} - l_{2} | \geq μ_{n}$ , we have, by applying Lemma 2.1 with $φ (n) = O (n^{- 18 / 5})$ and (3.3), that

\begin{array}{rcl} | I_{2} | & \leq & 2 \sum_{\begin{array}{c} 1 \leq i < j \leq n \\ j - i \geq μ_{n} \end{array}} | Cov [{\tilde{Z}}_{n, i} (x), {\tilde{Z}}_{n, j} (x)] | \leq 4 c_{1} c_{2} \sum_{\begin{array}{c} 1 \leq i < j \leq n \\ j - i \geq μ_{n} \end{array}} n^{- 1 / 2} h_{n}^{- 1 / 2} n^{- 1 / 2} h_{n}^{1 / 2} φ (j - i) \\ \leq & c_{3} \sum_{k \geq μ_{n}} k^{- 18 / 5} \leq c_{4} μ_{n}^{- 13 / 5} = O (n^{- 26 / 15}) . \end{array}

(3.10)

So, by (3.8), (3.9) and (3.10), one has

E {[S_{n}^{''}]}^{2} = O (n^{- 1 / 2}) .

(3.11)

On the other hand, by $φ (n) = O (n^{- 18 / 5})$ , $E Z_{n, i} (x) = 0$ and Lemma 2.1 with $p = 2$ , we obtain that

\begin{array}{rcl} E {[S_{n}^{'''}]}^{2} & \leq & \frac{c_{7}}{n} E {(\sum_{i = k (μ + ν) + 1}^{n} Z_{n, i})}^{2} \leq \frac{c_{8}}{n} (n - k_{n} (μ_{n} + ν_{n})) \\ \leq & \frac{c_{9} (μ_{n} + ν_{n})}{n} = O (n^{- 1 / 3}) . \end{array}

(3.12)

Now, we turn to estimate ${sup}_{- \infty < t < \infty} | P (S_{n}^{'} \leq t) - Φ (t) |$ . Define

s_{n}^{2} = \sum_{j = 0}^{k - 1} Var (η_{j}), Γ_{n} = \sum_{0 \leq i < j \leq k - 1} Cov (η_{i}, η_{j}) .

Since $E S_{n}^{2} = 1$ , one has

E {(S_{n}^{'})}^{2} = E {[S_{n} - (S_{n}^{''} + S_{n}^{'''})]}^{2} = 1 + E {(S_{n}^{''} + S_{n}^{'''})}^{2} - 2 E [S_{n} (S_{n}^{''} + S_{n}^{'''})] .

Combining (3.11) with (3.12), one can check that

\begin{array}{rcl} | E {(S_{n}^{'})}^{2} - 1 | & = & | E {(S_{n}^{''} + S_{n}^{'''})}^{2} - 2 E [S_{n} (S_{n}^{''} + S_{n}^{'''})] | \\ \leq & E {(S_{n}^{''})}^{2} + E {(S_{n}^{'''})}^{2} + 2 {[E {(S_{n}^{''})}^{2}]}^{1 / 2} {[E {(S_{n}^{'''})}^{2}]}^{1 / 2} \\ + 2 {[E (S_{n}^{2})]}^{1 / 2} {[E {(S_{n}^{''})}^{2}]}^{1 / 2} + 2 {[E (S_{n}^{2})]}^{1 / 2} {[E {(S_{n}^{'''})}^{2}]}^{1 / 2} \\ = & O (n^{- 1 / 4}) + O (n^{- 1 / 6}) = O (n^{- 1 / 6}) . \end{array}

(3.13)

With $λ_{j} = j (μ_{n} + ν_{n})$ , $i \neq j$ , $| λ_{i} - λ_{j} + l_{1} - l_{2} | \geq ν_{n}$ , one has

2 Γ_{n} = 2 \sum_{0 \leq i < j \leq k - 1} Cov (η_{i}, η_{j}) = 2 \sum_{0 \leq i < j \leq k - 1} \sum_{l_{1} = 1}^{μ_{n}} \sum_{l_{2} = 1}^{μ_{n}} Cov [{\tilde{Z}}_{n, λ_{i} + l_{1}} (x), {\tilde{Z}}_{n, λ_{j} + l_{2}} (x)] .

So, similar to the proof of (3.10), by Lemma 2.1 with $φ (n) = O (n^{- 18 / 5})$ , $| {\tilde{Z}}_{n, i} (x) | \leq c_{1} n^{- 1 / 2} h_{n}^{- 1 / 2}$ and $E | {\tilde{Z}}_{n, j} (x) | \leq c_{2} n^{- 1 / 2} h_{n}^{1 / 2}$ , we have that

\begin{array}{rcl} | Γ_{n} | & \leq & 2 \sum_{\begin{array}{c} 1 \leq i < j \leq n \\ j - i \geq ν_{n} \end{array}} | Cov [{\tilde{Z}}_{n, i} (x), {\tilde{Z}}_{n, j} (x)] | \leq 4 c_{1} c_{2} \sum_{\begin{array}{c} 1 \leq i < j \leq n \\ j - i \geq ν_{n} \end{array}} n^{- 1 / 2} h_{n}^{- 1 / 2} n^{- 1 / 2} h_{n}^{1 / 2} φ (j - i) \\ \leq & c_{3} \sum_{k \geq ν_{n}} k^{- 18 / 5} \leq c_{4} ν_{n}^{- 13 / 5} = O (n^{- 13 / 30}) . \end{array}

(3.14)

Obviously,

s_{n}^{2} = E {[S_{n}^{'}]}^{2} - 2 Γ_{n},

(3.15)

by (3.13), (3.14) and (3.15), we obtain that

| s_{n}^{2} - 1 | = O (n^{- 1 / 6}) .

(3.16)

Let $η_{j}^{'}$ , $j = 0, 1, \dots, k - 1$ , be the independent random variables and $η_{j}^{'}$ have the same distribution as $η_{j}$ for $j = 0, 1, \dots, k - 1$ . Put $B_{n} = \sum_{j = 0}^{k - 1} η_{j}^{'}$ . It can be seen that

\begin{array}{rcl} sup_{- \infty < t < \infty} | P (S_{n}^{'} \leq t) - Φ (t) | & \leq & sup_{- \infty < t < \infty} | P (S_{n}^{'} \leq t) - P (B_{n} \leq t) | \\ + sup_{- \infty < t < \infty} | P (B_{n} \leq t) - Φ (t / s_{n}) | \\ + sup_{- \infty < t < \infty} | Φ (t / s_{n}) - Φ (t) | : = F_{1} + F_{2} + F_{3} . \end{array}

(3.17)

Denote the characteristic functions of $S_{n}^{'}$ and $B_{n}$ by $φ (t)$ and $ψ (t)$ , respectively. Using the Esséen inequality (Petrov [27], Theorem 5.3), for any $T > 0$ , we have

\begin{array}{rcl} F_{1} & \leq & \int_{- T}^{T} | \frac{φ (t) - ψ (t)}{t} | d t + T sup_{- \infty < t < \infty} \int_{| u | \leq \frac{C}{T}} | P (B_{n} \leq u + t) - P (B_{n} \leq t) | d u \\ : = & F_{1 n} + F_{2 n} . \end{array}

(3.18)

It is a simple fact that

\begin{array}{rcl} E {| Z_{n, i} (x) |}^{3} & \leq & c_{1} h_{n}^{- 3 / 2} E K^{3} (\frac{x - X_{1}}{h_{n}}) \\ = & c_{1} h_{n}^{- 3 / 2} \int_{- \infty}^{\infty} K^{3} (\frac{x - u}{h_{n}}) f (u) d u \leq c_{2} h_{n}^{- 1 / 2}, 1 \leq i \leq n \end{array}

and $E Z_{n, i}^{2} (x) \leq c_{3}$ , $1 \leq i \leq n$ . Applying Lemma 2.2 with $p = 3$ , we obtain by $h_{n}^{- 1 / 2} \leq c n^{8 / 69}$ and ${lim inf}_{n \to \infty} {n^{- 1} Var (\sum_{i = 1}^{n} Z_{n, i} (x))} = σ_{1}^{2} (x) > 0$ that

\begin{array}{rcl} E {| η_{j} |}^{3} & = & E | \sum_{i = j (μ + ν) + 1}^{j (μ + ν) + μ} {\tilde{Z}}_{n, i} |^{3} \leq \frac{c_{1}}{n^{3 / 2}} E | \sum_{i = j (μ + ν) + 1}^{j (μ + ν) + μ} Z_{n, i} (x) |^{3} \\ \leq & \frac{c_{2}}{n^{3 / 2}} {\sum_{i = j (μ + ν) + 1}^{j (μ + ν) + μ} E {| Z_{n, i} (x) |}^{3} + {(\sum_{i = j (μ + ν) + 1}^{j (μ + ν) + μ} E Z_{n, i}^{2} (x))}^{3 / 2}} \\ \leq & \frac{c_{3}}{n^{3 / 2}} (μ h_{n}^{- 1 / 2} + μ^{3 / 2}) \leq \frac{c_{4} n}{n^{3 / 2}} = O (n^{- 1 / 2}) . \end{array}

(3.19)

Consequently, by Lemma 2.3, the Jensen inequality, $φ (n) = O (n^{- 18 / 5})$ , (3.3), (3.4) and (3.19), one can see that

\begin{array}{rcl} | ϕ (t) - ψ (t) | & = & | E exp (i t \sum_{j = 0}^{k - 1} η_{j}) - \prod_{j = 0}^{k - 1} E exp (i t η_{j}) | \\ \leq & c_{1} | t | φ (ν) \sum_{j = 0}^{k - 1} E | η_{j} | \leq c_{1} | t | φ (ν) \sum_{j = 0}^{k - 1} {(E {| η_{j} |}^{3})}^{1 / 3} \\ \leq & c_{2} | t | k n^{- 1 / 6} φ (ν) \leq c_{2} | t | n^{- 13 / 30} . \end{array}

(3.20)

Combining (3.18) with (3.20), we obtain, by taking $T = n^{13 / 60}$ , that

F_{1 n} = \int_{- T}^{T} | \frac{φ (t) - ψ (t)}{t} | d t \leq c n^{- 13 / 30} \cdot T = O (n^{- 13 / 60}) .

(3.21)

From (3.16), it follows $s_{n} \to 1$ . Thus, by the Berry-Esséen inequality (Petrov [27], Theorem 5.7), (3.3) and (3.19), one has that

sup_{- \infty < t < \infty} | P (B_{n} / s_{n} \leq t) - Φ (t) | \leq \frac{c}{s_{n}^{3}} \sum_{j = 0}^{k - 1} E {| η_{j}^{'} |}^{3} = \frac{c}{s_{n}^{3}} \sum_{j = 0}^{k - 1} E {| η_{j} |}^{3} = O (n^{- 1 / 6}),

(3.22)

which implies

(3.23)

By (3.18) and (3.23), take $T = n^{13 / 60}$ , we obtain that

F_{2 n} = T sup_{- \infty < t < \infty} \int_{| u | \leq C / T} | P (B_{n} \leq u + t) - P (B_{n} \leq t) | d u \leq \frac{c_{1}}{n^{1 / 6}} + \frac{c_{2}}{T} = O (n^{- 1 / 6}) .

(3.24)

Therefore, similar to the proof of (2.28) in Yang et al. [11], by (3.16), one has

F_{3} = sup_{- \infty < t < \infty} | Φ (t / s_{n}) - Φ (t) | \leq c_{1} | s_{n}^{2} - 1 | = O (n^{- 1 / 6}),

(3.25)

and from (3.22), it follows

F_{2} = sup_{- \infty < t < \infty} | P (B_{n} / s_{n} \leq t / s_{n}) - Φ (t / s_{n}) | = O (n^{- 1 / 6}) .

(3.26)

Consequently, by (3.17), (3.18), (3.21), (3.24), (3.25) and (3.26), one has that

sup_{- \infty < t < \infty} | P (S_{n}^{'} \leq t) - Φ (t) | = O (n^{- 1 / 6}) + O (n^{- 7 / 24}) = O (n^{- 1 / 6}) .

(3.27)

On the other hand, let $ε_{n} = n^{- 1 / 6} \cdot log n \cdot log log n$ . By (3.7), we apply Lemma 2.4 with $a = 2 ε_{n}$ and obtain that

\begin{array}{rcl} sup_{- \infty < t < \infty} | P (S_{n} \leq t) - Φ (t) | & \leq & sup_{- \infty < t < \infty} | P (S_{n}^{'} \leq t) - Φ (t) | + \frac{2 ε_{n}}{\sqrt{2 π}} \\ + P (| S_{n}^{''} | > ε_{n}) + P (| S_{n}^{'''} | > ε_{n}) . \end{array}

(3.28)

Obviously, by (3.11) and Markov’s inequality, we have

P (| S_{n}^{''} | > ε_{n}) \leq n^{1 / 3} {(log n \cdot log log n)}^{- 2} \cdot E {[S_{n}^{''}]}^{2} = O (n^{- 1 / 6} {(log n \cdot log log n)}^{- 2}) .

(3.29)

It is time to estimate $P (| S_{n}^{'''} | > ε_{n})$ . By $h_{n}^{- 1 / 2} \leq c n^{8 / 69}$ and (3.12), one has

| {\tilde{Z}}_{n, i} | \leq C_{3} n^{- 1 / 2} h_{n}^{- 1 / 2} \leq C_{4} n^{- 53 / 138}, \sum_{i = k (μ + ν) + 1}^{n} E {\tilde{Z}}_{n, i}^{2} \leq C_{5} n^{- 1 / 3} .

So, we have, by Lemma 2.5 with $λ = 5 / 23$ and $m = ⌊ n^{5 / 23} ⌋ = ⌊ n^{λ} ⌋$ , that for n large enough,

\begin{array}{rcl} P (| S_{n}^{'''} | > ε_{n}) & = & P (| \sum_{i = k (μ + ν) + 1}^{n} Z_{n, i} | > n^{- 1 / 6} \cdot log n \cdot log log n) \\ \leq & 2 e C_{1} exp {- \frac{n^{- 1 / 3} \cdot {log}^{2} n \cdot {(log log n)}^{2}}{2 C_{2} (2 C_{5} n^{- 1 / 3} + n^{5 / 23} C_{4} n^{- 53 / 138} n^{- 1 / 6} \cdot log n \cdot log log n)}} \\ \leq & \frac{C_{12}}{n}, \end{array}

(3.30)

where

Finally, the desired result (3.1) follows from (3.2), (3.7), (3.27), (3.28), (3.29) and (3.30) immediately. □

Theorem 3.2 For $s \geq 2$ , let the conditions (A₁) and (A₂) hold true. Assume that ${X_{n}}_{n \geq 1}$ is a sequence of identically distributed φ-mixing random variables with the mixing coefficients $φ (n) = O (n^{- 18 / 5})$ , and $f (x)$ satisfies a Lipschitz condition. If $h_{n}^{- 1 / 2} \leq c n^{8 / 69}$ , $0 < h_{n} \to 0$ , then for any $δ \in (0, 1)$ ,

(3.31)

where $σ^{2} (x) = f (x) \int_{- \infty}^{\infty} K^{2} (u) d u$ with $f (x) > 0$ and $Φ (\cdot)$ is the standard normal distribution function.

Proof By the condition (A₁), $\int_{- \infty}^{\infty} u K (u) d u = 0$ implies that $\int_{- \infty}^{\infty} | u | K (u) d u < \infty$ . Thus, by the Lipschitz condition of $f (x)$ , we obtain that

(3.32)

Obviously, one has

\frac{1}{h_{n}} {[E K (\frac{x - X_{1}}{h_{n}})]}^{2} = \frac{1}{h_{n}} {[\int_{- \infty}^{\infty} K (\frac{x - u}{h_{n}}) f (u) d u]}^{2} \leq c h_{n} .

(3.33)

Thus, we obtain by combining (3.32) with (3.33) that

\begin{array}{rcl} | Var (Z_{n, i} (x)) - σ^{2} (x) | & = & | Var (Z_{n, 1} (x)) - σ^{2} (x) | \\ \leq & \frac{1}{h_{n}} {[E K (\frac{x - X_{1}}{h_{n}})]}^{2} + | \frac{1}{h_{n}} E K^{2} (\frac{x - X_{1}}{h_{n}}) - σ^{2} (x) | \\ \leq & c_{3} h_{n}, 1 \leq i \leq n . \end{array}

(3.34)

Meanwhile, for $i \neq j$ , one has by the condition (A₂) that

(3.35)

By (3.35), we take $r_{n} = h_{n}^{δ - 1}$ and obtain that

\frac{2}{n} \sum_{\begin{array}{c} 1 \leq i < j \leq n \\ 1 \leq j - i \leq r_{n} \end{array}} | Cov [Z_{n, i} (x), Z_{n, j} (y)] | \leq c_{4} h_{n} r_{n} = c_{4} h_{n}^{δ} .

(3.36)

Applying Lemma 2.2 with $| Z_{n, i} (x) | \leq c_{1} h_{n}^{- 1 / 2}$ , $E | Z_{n, j} (x) | \leq c_{2} h_{n}^{1 / 2}$ and $φ (n) = O (n^{- 18 / 5})$ , we obtain that

\frac{2}{n} \sum_{\begin{array}{c} 1 \leq i < j \leq n \\ j - i > r_{n} \end{array}} | Cov [Z_{n, i} (x), Z_{n, j} (y)] | \leq c_{5} \sum_{k > r_{n}} φ (k) \leq c_{6} h_{n}^{13 (1 - δ) / 5} .

(3.37)

Define

σ_{n}^{2} (x) = Var [\sum_{i = 1}^{n} Z_{n, i} (x)], σ_{n, 0}^{2} (x) = n σ^{2} (x), n \geq 1 .

(3.38)

Consequently, by (3.34), (3.36), (3.37) and (3.38), it can be checked that

\begin{array}{rcl} | σ_{n}^{2} (x) - σ_{n, 0}^{2} (x) | & \leq & n | Var (Z_{n, 1} (x)) - σ^{2} (x) | + 2 \sum_{1 \leq i < j \leq n} | Cov [Z_{n, i} (x), Z_{n, j} (y)] | \\ \leq & c_{7} n (h_{n} + h_{n}^{δ} + h_{n}^{13 (1 - δ) / 5}) . \end{array}

(3.39)

We obtain, by (3.2), (3.31) and (3.38), that

(3.40)

From (3.38) and (3.39), it follows ${lim}_{n \to \infty} σ_{n}^{2} (x) / σ_{n, 0}^{2} (x) = 1$ , since $h_{n} \to 0$ as $n \to \infty$ and $δ \in (0, 1)$ . Thus, by applying Theorem 3.1, we establish that

Q_{1} = O (n^{- 1 / 6} \cdot log n \cdot log log n) .

(3.41)

On the other hand, similar to the proof of (2.34) in Yang et al. [11], it follows by (3.39) again that

Q_{2} \leq c_{2} | \frac{σ_{n}^{2} (x)}{σ_{n, 0}^{2} (x)} - 1 | = \frac{c_{2}}{σ_{n, 0}^{2} (x)} | σ_{n}^{2} (x) - σ_{n, 0}^{2} (x) | = O (h_{n}^{δ}) + O (h_{n}^{13 (1 - δ) / 5}) .

(3.42)

Finally, by (3.40), (3.41) and (3.42), (3.31) holds true. □

Remark 3.1 Under an independent sample, Cao [8] studied the bootstrap approximations in nonparametric density estimation and obtained Berry-Esséen bounds as $O_{p} (n^{- 1 / 5})$ and $O_{p} (n^{- 2 / 9})$ (see Theorem 1 and Theorem 2 of Cao [8]). Under a negatively associated sample, Liang and Baek [17] studied the Berry-Esséen bound and obtained the rate $O ({(\frac{log n}{n})}^{1 / 6})$ under some conditions (see Remark 3.1 of Liang and Baek [17]). In our Theorem 3.1 and Theorem 3.2, under the mixing coefficients condition $φ (n) = O (n^{- 18 / 5})$ and other simple assumptions, we obtain the Berry-Esséen bounds of the centered variate as $O (n^{- 1 / 6} \cdot log n \cdot log log n)$ and $O (n^{- 1 / 6} \cdot log n \cdot log log n) + O (h_{n}^{δ}) + O (h_{n}^{13 (1 - δ) / 5})$ , where $0 < δ < 1$ . Particularly, by taking $δ = 13 / 18$ and $h_{n} = n^{- 16 / 69}$ in Theorem 3.2, the Berry-Esséen bound of the centered variate is presented as

sup_{- \infty < t < \infty} | P (\frac{\sqrt{n h_{n}} (f_{n} (x) - E f_{n} (x))}{σ (x)} \leq t) - Φ (t) | = O (n^{- 1 / 6} \cdot log n \cdot log log n), n \to \infty,

where $σ (x)$ and $Φ (\cdot)$ are defined in Theorem 3.2.

References

Prakasa Rao BLS: Nonparametric Function Estimation. Academic Press, New York; 1983.
Google Scholar
Silverman BW: Density Estimation for Statistics and Data Analysis. Chapman & Hall, New York; 1986.
Book Google Scholar
Györfi L, Härdle W, Sarda P, Vieu P: Nonparametric Curve Estimation from Time Series. Springer, New York; 1989.
Book Google Scholar
Doukhan P Lecture Notes in Statistics. In Mixing. Properties and Examples. Springer, Berlin; 1995.
Google Scholar
Billingsley P: Convergence of Probability Measures. Wiley, New York; 1968.
Google Scholar
Masry E: Recursive probability density estimation for weakly dependent stationary processes. IEEE Trans. Inf. Theory 1986, 32(2):254–267. 10.1109/TIT.1986.1057163
Article MathSciNet Google Scholar
Fan JQ, Yao QW: Nonlinear Time Series: Nonparametric and Parametric Methods. Springer, New York; 2005.
Google Scholar
Cao AR: Ordenes de convergencia para las aproximaciones nornal y bootstrap en estimacion no parametrica de la funcion de densidad. Trab. Estad. 1990, 5(2):23–32. 10.1007/BF02863645
Article Google Scholar
Li YM, Yin CM, Wei CD: On the asymptotic normality for φ -mixing dependent errors of wavelet regression function estimator. Acta Math. Appl. Sin. 2008, 31(6):1046–1055.
MathSciNet Google Scholar
Li YM, Wei CD, Xing GD: Berry-Esseen bounds for wavelet estimator in a regression model with linear process errors. Stat. Probab. Lett. 2011, 81(1):103–110. 10.1016/j.spl.2010.09.024
Article MathSciNet Google Scholar
Yang WZ, Wang XJ, Li XQ, Hu SH: Berry-Esséen bound of sample quantiles for φ -mixing random variables. J. Math. Anal. Appl. 2012, 388(1):451–462. 10.1016/j.jmaa.2011.10.058
Article MathSciNet Google Scholar
Masry E: Nonparametric regression estimation for dependent functional data: asymptotic normality. Stoch. Process. Appl. 2005, 115(1):155–177. 10.1016/j.spa.2004.07.006
Article MathSciNet Google Scholar
Roussas GG: Asymptotic normality of the kernel estimate of a probability density function under association. Stat. Probab. Lett. 2000, 50(1):1–12. 10.1016/S0167-7152(00)00072-9
Article MathSciNet Google Scholar
Liang HY, Baek J: Asymptotic normality of recursive density estimates under some dependent assumptions. Metrika 2004, 60(2):155–166. 10.1007/s001840300302
Article MathSciNet Google Scholar
Wei LS: The consistencies for the Kernel-type density estimation in the case of NA samples. J. Syst. Sci. Math. Sci. 2001, 21(1):79–87.
Google Scholar
Henriques C, Oliveira PE: Exponential rates for kernel density estimation under association. Stat. Neerl. 2005, 59(4):448–466. 10.1111/j.1467-9574.2005.00302.x
Article MathSciNet Google Scholar
Liang HY, Baek J: Berry-Esseen bounds for density estimates under NA assumption. Metrika 2008, 68(3):305–322. 10.1007/s00184-007-0159-y
Article MathSciNet Google Scholar
Chang MN, Rao PV: Berry-Esseen bound for the Kaplan-Meier estimator. Commun. Stat., Theory Methods 1989, 18(12):4647–4664. 10.1080/03610928908830180
Article MathSciNet Google Scholar
Cai ZW, Roussas GG: Berry-Esseen bounds for smooth estimator of a distribution function under association. J. Nonparametr. Stat. 1999, 11(1):79–106. 10.1080/10485259908832776
Article MathSciNet Google Scholar
Yang SC: Uniformly asymptotic normality of the regression weighted estimator for negatively associated samples. Stat. Probab. Lett. 2003, 62(2):101–110. 10.1016/S0167-7152(02)00427-3
Article Google Scholar
Dedecker J, Prieur C: New dependence coefficients. Examples and applications to statistics. Probab. Theory Relat. Fields 2005, 132(2):203–236. 10.1007/s00440-004-0394-3
Article MathSciNet Google Scholar
Yang WZ, Hu SH, Wang XJ, Zhang QC: Berry-Esséen bound of sample quantiles for negatively associated sequence. J. Inequal. Appl. 2011., 2011: Article ID 83
Google Scholar
Herve L, Ledoux J, Patilea V: A uniform Berry-Esseen theorem on M-estimators for geometrically ergodic Markov chains. Bernoulli 2012, 18(2):703–734. 10.3150/10-BEJ347
Article MathSciNet Google Scholar
Härdle W, Liang H, Gao JT Springer Series in Economics and Statistics. In Partially Linear Models. Physica-Verlag, New York; 2000.
Chapter Google Scholar
Prakasa Rao BLS: Associated Sequences, Demimartingales and Nonparametric Inference. Birkhäuser, Basel; 2012.
Book Google Scholar
Yang SC: Almost sure convergence of weighted sums of mixing sequences. J. Syst. Sci. Math. Sci. 1995, 15(3):254–265.
Google Scholar
Petrov VV: Limit Theorems of Probability Theory: Sequences of Independent Random Variables. Oxford University Press, New York; 1995.
Google Scholar

Download references

Acknowledgements

The authors are grateful to associate editor prof. Andrei Volodin and two anonymous referees for their careful reading and insightful comments. This work was supported by the National Natural Science Foundation of China (11171001, 11201001, 11126176), HSSPF of the Ministry of Education of China (10YJA910005), the Natural Science Foundation of Anhui Province (1208085QA03) and the Provincial Natural Science Research Project of Anhui Colleges (KJ2010A005).

Author information

Authors and Affiliations

School of Mathematical Science, Anhui University, Hefei, 230039, P.R. China
Wenzhi Yang & Shuhe Hu

Authors

Wenzhi Yang
View author publications
You can also search for this author in PubMed Google Scholar
Shuhe Hu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shuhe Hu.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Yang, W., Hu, S. The Berry-Esséen bounds for kernel density estimator under dependent sample. J Inequal Appl 2012, 287 (2012). https://doi.org/10.1186/1029-242X-2012-287

Download citation

Received: 18 August 2012
Accepted: 21 November 2012
Published: 07 December 2012
DOI: https://doi.org/10.1186/1029-242X-2012-287

The Berry-Esséen bounds for kernel density estimator under dependent sample

Abstract

1 Introduction

2 Some assumptions and lemmas

3 Main results

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords