- Research
- Open access
- Published:
Berry-Esséen bound of sample quantiles for negatively associated sequence
Journal of Inequalities and Applications volume 2011, Article number: 83 (2011)
Abstract
In this paper, we investigate the Berry-Esséen bound of the sample quantiles for the negatively associated random variables under some weak conditions. The rate of normal approximation is shown as O(n-1/9).
2010 Mathematics Subject Classification: 62F12; 62E20; 60F05.
1 Introduction
Assume that {X n }n≥1is a sequence of random variables defined on a fixed probability space with a common marginal distribution function F(x) = P(X1 ≤ x). F is a distribution function (continuous from the right, as usual). For 0 < p < 1, the p th quantile of F is defined as
and is alternately denoted by F-1(p). The function F-1(t), 0 < t < 1, is called the inverse function of F. It is easy to check that ξ p possesses the following properties:
-
(i)
F(ξ p -) ≤ p ≤ F(ξ p );
-
(ii)
if ξ p is the unique solution x of F (x-) ≤ p ≤ F(x), then for any ε > 0,
For a sample X1, X2, ⋯, X n , n ≥ 1, let F n represent the empirical distribution function based on X1, X2,..., X n , which is defined as , x ∈ ℝ, where I(A) denotes the indicator function of a set A and ℝ is the real line. For 0 < p < 1, we define as the p th quantile of sample.
Recall that a finite family {X1,..., X n } is said to be negatively associated (NA) if for any disjoint subsets A, B ⊂ {1, 2,..., n}, and any real coordinatewise nondecreasing functions f on RA , g on RB ,
A sequence of random variables {X i }i≥1is said to be NA if for every n ≥ 2, X1, X2,..., X n are NA.
From 1960s, many authors have obtained the asymptotic results for the sample quantiles, including the well-known Bahadur representation. Bahadur [1] firstly introduced an elegant representation for the sample quantiles in terms of empirical distribution function based on independent and identically distributed (i.i.d.) random variables. Sen [2], Babu and Singh [3] and Yoshihara [4] gave the Bahadur representation for the sample quantiles under ϕ-mixing sequence and α-mixing sequence, respectively. Sun [5] established the Bahadur representation for the sample quantiles under α-mixing sequence with polynomially decaying rate. Ling [6] investigated the Bahadur representation for the sample quantiles under NA sequence. Li et al. [7] investigated the Bahadur representation of the sample quantile based on negatively orthant-dependent (NOD) sequence, which is weaker than NA sequence. Xing and Yang [8] also studied the Bahadur representation for the sample quantiles under NA sequence. Wang et al. [9] revised the results of Sun [5] and got a better bound. For more details about Bahadur representation, one can refer to Serfling [10].
For a fixed p ∈ (0, 1), let ξ p = F-1(p), and Φ(t) be the distribution function of a standard normal variable. In [[10], p. 81], the Berry-Esséen bound of the sample quantiles for i.i.d. random variables is given as follows:
Theorem A Let 0 < p < 1 and {X n }n≥1be a sequence of i.i.d. random variables. Suppose that in a neighborhood of ξ p , F possesses a positive continuous density f and a bounded second derivative F″. Then
In this paper, we investigate the Berry-Esséen bound of the sample quantiles for NA random variables under some weak conditions. The rate of normal approximation is shown as O(n-1/9).
Berry-Esséen theorem, which is known as the rate of convergence in the central limit theorem, can be found in many monographs such as Shiryaev [11], Petrov [12]. For the case of i.i.d. random variables, the optimal rate is , and for the case of martingale, the rate is [[13], Chapter 3]. For other papers about Berry-Esséen bound, for example, under the association sample, Cai and Roussas [14, 15] studied the Berry-Esséen bounds for the smooth estimator of quantiles and the smooth estimator of a distribution function, respectively; Yang [16] obtained the Berry-Esséen bound of the regression weighted estimator for NA sequence; Wang and Zhang [17] provided the Berry-Esséen bound for linear negative quadrant-dependent (LNQD) sequence; Liang and Baek [18] gave the Berry-Esséen bounds for density estimates under NA sequence; Liang and Uña-Álvarez [19] studied the Berry-Esséen bound in kernel density estimation for α-mixing censored sample; Lahiri and Sun [20] obtained the Berry-Esséen bound of the sample quantiles for α-mixing random variables, etc.
Throughout the paper, C, C1, C2, C3,..., d denote some positive constants not depending on n, which may be different in various places. ⌊x⌋ denotes the largest integer not exceeding x, and the second-order stationarity means that
Inspired by Serfling [10], Cai and Roussas [14, 15], Yang [16], Liang and Uña-Álvarez [19], Lahiri and Sun [20], etc., we obtain Theorem 1.1 in Section 1. Two preliminary lemmas are given in Section 2, and the proof of Theorem 1.1 is given in Section 3. Next, we give the main result as follows:
Theorem 1.1 Let 0 < p < 1 and {X n }n≥1be a second-order stationary NA sequence with common marginal distribution function F and EX n = 0 for n = 1, 2, . . .. Assume that in a neighborhood of ξ p , F possesses a positive continuous density f and a bounded second derivative F″. If there exists an ε0> 0 such that for × ∈ [ξ p - ε0, ξ p + ε0],
and
then
Remark 1.1 Assumption (1.2) is a general condition, see for example Cai and Roussas [14]. For the stationary sequences of associated and negatively associated, Cai and Roussas [15] gave the notation and supposed that μ(1) < ∞. In addition, they supposed that μ(n) = O(n-α) for some α > 0 or δ(1) < ∞, where , then obtained the Berry-Esséen bounds for smooth estimator of a distribution function. Under the assumptions for some r > 1 or , Chaubey et al. [21] studied the smooth estimation of survival and density functions for a stationary-associated process using Poisson weights. In this paper, for x ∈ [ξ p - ε0, ξ p + ε0], the assumption (1.1) has some restriction on the covariances of Cov[I(X1 ≤ x), I(X j ≤ x)] in the neighborhood of ξ p .
2 Preliminaries
Lemma 2.1 Let {X n }n≥1be a stationary NA sequence with EX n = 0, |X n | ≤ d < ∞ for n = 1, 2, . . .. There exists some β ≥ 1 such thatfor all 0 < b n → ∞ as n → ∞. If
then
Proof We employ Bernstein's big-block and small-block procedure. Partition the set {1, 2,..., n} into 2k n + 1 subsets with large blocks of size μ = μ n and small block of size υ = υ n . Define
and . Let η j , ξ j , ζ j be defined as follows:
Write
By Lemma A.3, we can see that
Firstly, we estimate and , which will be used to estimate and in (2.7). By the conditions |X i | ≤ d and , it is easy to see that . And E(ξ j )2 ≤ Cυ n /n follows from EZn,i= 0 and Lemma A.1. Combining the definition of NA with the definition of ξ j , j = 0, 1, ⋯, k - 1, we can easily prove that {ξ0, ξ1, ⋯, ξk-1} is NA. Therefore, it follows from (2.2), (2.4), (2.6) and Lemma A.1 that
On the other hand, we can get that
from (2.5), , |X i | ≤ d and Lemma A.1. Consequently, by Markov's inequality, (2.8) and (2.9),
In the following, we will estimate . Define
Here, we first estimate the growth rate . Since and
by (2.8) and (2.9), it has
Notice that
With λ j = j(μ n + υ n ),
but since i ≠ j, |λ i - λ j + l1 - l2| ≥ υ n , it has that
following from (2.2) and the conditions of stationary, and , β ≥ 1. So, by (2.12), (2.13) and (2.14), we can get that
For j = 0, 1,..., k - 1, let be the independent random variables and have the same distribution as η j , j = 0, 1,..., k - 1. Define . It can be found that
Let ϕ(t) and ψ(t) be the characteristic functions of and H n , respectively. By Esséen inequality [[12], Theorem 5.3], for any T > 0,
With λ j = j(μ n + υ n ) and similar to the proof of Lemma 3.4 of Yang [16], we have that
by (2.2) and the conditions of stationary, and . Set T = n(3β - 1)/18for β ≥ 1, we have by (2.18) that
It follows from the Berry-Esséen inequality [[12], Theorem 5.7], that
By (2.3) and Lemma A.1,
Combining (2.20) with (2.21), we obtain that
since s n → 1 as n → ∞ by (2.15). It follows from (2.22) that
which implies that
where T = n(3β - 1)/18. It is known that [[12], Lemma 5.2],
Thus, by (2.15),
and by (2.22),
Therefore, it follows from (2.16), (2.17), (2.19), (2.23), (2.24) and (2.25) that
Finally, by (2.7), (2.10), (2.11) and (2.26), (2.1) holds true. □
Lemma 2.2 Let {X n }n≥1be a second-order stationary NA sequence with common marginal distribution function and EX n = 0, |X n | ≤ d< ∞, n = 1,2,.... We give an assumption such that. If, then
Proof Define , and γ(k) = Cov (Xi+k, X i ) for k = 0, 1, 2,.... For the second-order stationarity process {X n }n≥1with common marginal distribution function, it can be found by the condition that
On the other hand,
Obviously, if b n → ∞ as n → ∞, then it follows from that
(2.28) and the fact yield that . Thus, by Lemma 2.1,
By (2.28) again and similar to the proof of (2.24), it follows
Finally, by (2.29), (2.30) and (2.31), (2.27) holds true. □
Remark 2.1 Under the conditions of Lemma 2.2, we have (27). Furthermore, by the proof of Lemma 2.2, we can obtain that
where is a positive constant depending only on .
3 Proof of the main result
Proof of Theorem 1.1 The proof is inspired by the proofs of Theorem A and Theorem C of Serfling [[10], pp. 77-84]. Denote A = σ (ξ p ) / f (ξ p ) and
Let L n = (log n log log n)1/2, we have
Since , x > 0 it follows
Let ε n = (A - ε0) (log n log log n)1/2n-1/2, where 0 < ε0 < A. Seeing that
and
by Lemma A.4 (iii), we obtain
where V i = I (X i > ξ p + ξ n ) and δn 1= F(ξ p + ε n ) - p. Likewise,
where W i = I (X i > ξ p - ξn) and δn 2= p - F(ξ p - ε n ). It is easy to see that {V i - EV i }1≤i≤n. and {W i - EV i }1≤i≤nare still NA sequences. Obviously, |V i - EV i | ≤ 1, , |W i - EW i | ≤ 1, . By Lemma A.2, we have that
Consequently,
Since F (x) is continuous at ξ p with F' (ξ p ) > 0, ξ p is the unique solution of F (x-) ≤ p ≤ F (x) and F (ξ p ) = p. By the assumption on f'(x) and Taylor's expansion,
Therefore, we can get that for n large enough,
Note that max(δn 1, δn 2) → 0. as n → ∞. So with (3), for n large enough,
Next, we define
where Z i = I [X i ≤ ξ p + tAn-1/2] - EI [X i ≤ ξ p + tAn-1/2]. Seeing that
we will estimate the convergence rate of |σ2 (n, t) - σ2 (ξ p )|. By the condition (1.1), we can see that σ2 (ξ p ) < ∞. Since that F possesses a positive continuous density f and a bounded second derivative F', for |t| ≤ L n = (log n log log n)1/2, we will obtain by Taylor's expansion that
Similarly, for j ≥ 2 and |t| ≤ L n ,
Therefore, by a similar argument, for j ≥ 2 and |t| ≤ L n ,
Consequently, by the conditions (1.1) and (3.5), (3.6), for |t| ≤ L n ,
By Lemma A.4 (iii) again, it has
Thus,
where
It is easy to check that
By (3.7), it has that as n → ∞, which implies that 0 < σ2 (n, t) for |t| ≤ L n and n large enough. Obviously, {Z i } is a second-order stationary NA sequence. Thus, for a fixed t, |t| ≤ L n , by the Lemma 2.2, (2.32) in Remark 2.1 and (3.7), it has for n large enough that
where C1 does not depend on t for |t| ≤ L n . Therefore, for n large enough, we have
By (3.8) and the inequality above, we can get that for n large enough,
On the other hand,
By (3.7) again and similar to the proof of (2.31), we have
By Taylor's expansion again, we obtain that
where ξp,tlies between ξ p and ξ p + Atn-1/2. It is known that [[12], Lemma 5.2],
Therefore, by (3.12), (3.13) and the condition that F' is bounded in a neighborhood of ξ p , we get for n large enough that
since σ2 (ξ p ) < ∞ and for |t| ≤ L n . Therefore, it follows from (3.9), (3.10), (3.11) and (3.14) that
Finally, the desired result (1.3) follows from (3.1), (3.2), (3.4) and (3.15) immediately. □
Appendix
Lemma A.1 [[22], Theorem 1] Let {X n }n≥1be NA random variables, EX i = 0, E|X i | p < ∞, where i = 1, 2,..., n and p ≥ 2. Then, there exists some constant c p depending only on p such that
Lemma A.2 [[16], Lemma 3.5] Let {X n }n≥1be a NA sequence with EX i = 0, |X i | ≤ b, a.s. i = 1, 2,..., Denote. Then for ∀ ε > 0,
Lemma A.3 [[23], Lemma 2] Let × and Y be random variables, then for any a > 0,
Lemma A.4 [[10], Lemma 1.1.4] Let F(x) be a right-continuous distribution function. The inverse function F-1(t), 0 < t < 1, is nondecreasing and left-continuous, and satisfies
-
(i)
F -1 (F(x)) ≤ x, - ∞ < x < ∞;
-
(ii)
F (F -1 (t)) ≥ t, 0 < t < 1;
-
(ii)
F (x) ≥ t if and only if × ≥ F -1 (t).
References
Bahadur RR: A note on quantiles in large samples. Ann. Math. Stat 1966, 37: 577–580. 10.1214/aoms/1177699450
Sen PK: On Bahadur representation of sample quantile for sequences of ϕ -mixing random variables. J. Multivar. Anal 1972, 2: 77–95. 10.1016/0047-259X(72)90011-5
Babu GJ, Singh K: On deviations between empirical and quantile processes for mixing random variables. J. Multivar. Anal 1978, 8: 532–549. 10.1016/0047-259X(78)90031-3
Yoshihara K: The Bahadur representation of sample quantile for sequences of strongly mixing random variables. Stat. Probab. Lett 1995, 24: 299–304. 10.1016/0167-7152(94)00187-D
Sun SX: The Bahadur representation for sample quantiles under weak dependence. Stat. Probab. Lett 2006, 76: 1238–1244. 10.1016/j.spl.2005.12.021
Ling NX: The Bahadur representation for sample quantiles under negatively associated sequence. Stat. Probab. Lett 2008, 78: 2660–2663. 10.1016/j.spl.2008.03.026
Li XQ, Yang WZ, Hu SH, Wang XJ: The Bahadur representation for sample quantile under NOD sequence. J. Nonparametric Stat 2011,23(1):59–65. 10.1080/10485252.2010.486033
Xing GD, Yang SC: A remark on the Bahadur representation of sample quantiles for negatively associated sequences. J. Korean Stat. Soc 2011,40(3):277–280. 10.1016/j.jkss.2010.10.006
Wang XJ, Hu SH, Yang WZ: The Bahadur representation for sample quantiles under strongly mixing sequence. J. Stat. Plan. Inf 2010, 141: 655–662.
Serfling RJ: Approximation Theorems of Mathematical Statistics. Wiley, New York; 1980.
Shiryaev AN: Probability, 2nd edn. Springer, New York; 1989.
Petrov VV: Limit Theorems of Probability Theory: Sequences of Independent Random Variables. Oxford University Press Inc., New York; 1995.
Hall P: Martingale Limit Theory and Its Application. Academic Press, Inc., New York; 1980.
Cai ZW, Roussas GG: Smooth estimate of quantiles under association. Stat. Probab. Lett 1997, 36: 275–287. 10.1016/S0167-7152(97)00074-6
Cai ZW, Roussas GG: Berry-Esséen bounds for smooth estimator of a distribution function under association. J. Nonparametric Stat 1999, 11: 79–106. 10.1080/10485259908832776
Yang SC: Uniformly asymptotic normality of the regression weighted estimator for negatively associated samples. Stat. Probab. Lett 2003, 62: 101–110.
Wang JF, Zhang LX: A Berry-Esséen theorem for weakly negatively dependent random variables and its applications. Acta Math. Hung 2006,110(4):293–308. 10.1007/s10474-006-0024-x
Liang HY, Baek J: Berry-Esséen bounds for density estimates under NA assumption. Metrika 2008, 68: 305–322. 10.1007/s00184-007-0159-y
Liang HY, De Uña-Álvarez J: A Berry-Esséen type bound in kernel density estimation for strong mixing censored samples. J. Multivar. Anal 2009, 100: 1219–1231. 10.1016/j.jmva.2008.11.001
Lahiri SN, Sun S: A Berry-Esséen theorem for samples quantiles under weak dependent. Ann. Appl. Probab 2009,19(1):108–126. 10.1214/08-AAP533
Chaubey YP, Dewan I, Li J: Smooth estimation of survival and density functions for a stationary associated process using Poisson weights. Stat. Probab. Lett 2011, 81: 267–276. 10.1016/j.spl.2010.10.010
Su C, Zhao LC, Wang YB: Moment inequalities and weak convergence for negatively associated sequences. Sci. China A 1997, 40: 172–182. 10.1007/BF02874436
Chang MN, Rao PV: Berry-Esséen bound for the Kaplan-Meier estimator. Commun. Stat. Theory Methods 1989,18(12):4647–4664. 10.1080/03610928908830180
Acknowledgements
The authors are most grateful to the Editor Charles E. Chidume and an anonymous referee for the careful reading of the manuscript and valuable suggestions which helped in significantly improving an earlier version of this paper. Supported by the NNSF of China (11171001, 61075009), HSSPF of the Ministry of Education of China (10YJA910005), Provincial Natural Science Research Project of Anhui Colleges (KJ2010A005), Talents youth Fund of Anhui Province Universities (2010SQRL016ZD) and Youth Science Research Fund of Anhui University (2009QN011A).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
Under some weak conditions, the Berry-Esséen bound of the sample quantiles for NA sequence is presented as O (n-1/9). All authors read and approved the final manuscript.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Yang, W., Hu, S., Wang, X. et al. Berry-Esséen bound of sample quantiles for negatively associated sequence. J Inequal Appl 2011, 83 (2011). https://doi.org/10.1186/1029-242X-2011-83
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/1029-242X-2011-83