- Research
- Open access
- Published:
Wavelet estimations for densities and their derivatives with Fourier oscillating noises
Journal of Inequalities and Applications volume 2014, Article number: 236 (2014)
Abstract
By developing the classical kernel method, Delaigle and Meister provide a nice estimation for a density function with some Fourier-oscillating noises over a Sobolev ball and over risk (Delaigle and Meister in Stat. Sin. 21:1065-1092, 2011). The current paper extends their theorem to Besov ball and risk with by using wavelet methods. We firstly show a linear wavelet estimation for densities in over risk, motivated by the work of Delaigle and Meister. Our result reduces to their theorem, when . Because the linear wavelet estimator is not adaptive, a nonlinear wavelet estimator is then provided. It turns out that the convergence rate is better than the linear one for . In addition, our conclusions contain estimations for density derivatives as well.
1 Introduction and preliminary
One of the fundamental deconvolution problems is to estimate a density function of a random variable X, when the available data are independent and identically distributed (i.i.d.) with
We assume that all and are independent and the density function of the noise δ is known.
Let the Fourier transform of be defined by in this paper. When satisfies
with and , there exist lots of optimal estimations for [1–6]. However, many noise densities have zeros in the Fourier transform domain, i.e., the inequality (1) does not hold. For example, Sun et al. described an experiment where data on the velocity of halo stars in the Milky Way are collected, and where the measurement errors are assumed to be uniformly distributed [7]. The classical kernel method provides a slower convergence rate in that case [8–10]. Delaigle and Meister [11] developed a new method for a density with
Here and (non-negative integer set). Such noises are called Fourier-oscillating. Clearly, (2) allows having zeros for . When , (2) reduces to (1) (non-zero case).
Delaigle and Meister defined a kernel estimator for a density in a Sobolev space and prove that with EX denoting the expectation of a random variable X,
under the assumption (2) (Theorem 4.1 in [11]). Here, stands for the Sobolev ball with the radius L. This above convergence rate attains the same one as in the non-zero case [1–4, 6]. In particular, it does not depend on the parameter v.
It seems that many papers deal with estimations. However, estimations () are important [5, 12]. On the other hand, Besov spaces contain many classical spaces (e.g., Sobolev spaces and Hölder spaces) as special examples. The current paper extends (3) from to the Besov ball , and from to risk estimations. In addition, our results contain estimations for d th derivatives of . The next section provides a linear wavelet estimation for over a Besov ball and over risk (). It turns out that our estimation reduces to (3), when , . Moreover, we show a nonlinear wavelet estimation which improves the linear one for in the last part.
1.1 Wavelet basis
The fundamental method to construct a wavelet basis comes from the concept of multiresolution analysis (MRA [13]). It is defined as a sequence of closed subspaces of the square integrable function space satisfying the following properties:
-
(i)
, (the integer set);
-
(ii)
if and only if for each ;
-
(iii)
(the space is dense in );
-
(iv)
There exists (scaling function) such that forms an orthonormal basis of .
With the standard notation in wavelet analysis, we can find a corresponding wavelet function
such that, for a fixed , constitutes an orthonormal basis of the orthogonal complement of in [13]. Then each has an expansion in sense,
where , .
A family of important examples are Daubechies wavelets , which are compactly supported in time domain [14]. They can be smooth enough with increasing supports as N gets large, although do not have analytic formulas except for .
As usual, let and be the orthogonal projections from to and , respectively,
The following simple lemma is fundamental in our discussions. We use to denote norm for , and do norm for , where
By using Proposition 8.3 in [15], we have the following conclusion.
Lemma 1.1 Let h be a Daubechies scaling function or the corresponding wavelet. Then there exists such that, for and ,
1.2 Besov spaces
One of the advantages of wavelet bases is that they can characterize Besov spaces. To introduce those spaces (see [15]), we need the Sobolev spaces with integer order and . Then can be considered as .
For , with and , the Besov spaces are defined by
with the associated norm , where
stands for the smoothness modulus of f. It should be pointed out that .
According to Theorem 9.6 in [15], the following result holds.
Lemma 1.2 Let be a Daubechies scaling function with large N and ψ be the corresponding wavelet. If , , , , and , then the following assertions are equivalent:
-
(i)
;
-
(ii)
;
-
(iii)
, where is the projection operator to .
In each case,
Here and throughout, denotes for some constant ; means ; we use standing for both and .
Note that is continuously embedded into for . Then the above lemma implies that
2 Linear wavelet estimation
We shall provide a linear wavelet estimation for a compactly supported density function and its derivatives under Fourier-oscillating noises in this section, motivated by the work of Delaigle and Meister. It turns out that our result generalizes their theorem.
As in [11], we define
Then and . Delaigle and Meister found that
where J and depend only on v and the support length of .
Let be the Daubechies scaling function with N large enough. Since both and φ have compact supports, the set is finite and the cardinality . Then with ,
It is easy to see . This with (5) and the Plancherel formula leads to
Note that and . Then the identity (6) reduces to
where . Since the empirical estimator for is , it is natural to define a linear wavelet estimator
with
When , and . Then our estimator reduces to the classical linear estimator for the case having no zeros (see e.g., [2–5]).
Let ψ be the Daubechies wavelet function corresponding to the scaling function and . Similar to (9), we define
Then it can easily be seen that , and .
For , , we consider the subset of ,
in this paper.
Lemma 2.1 Let (N large enough), ψ be the corresponding wavelet and satisfy (2). If , , and , then, for ,
Proof One shows only the first inequality; the second one is similar. Define
Then and
Clearly, . One estimates and in order to use the Rosenthal inequality: By the assumption (2), ≲ ≲ = ≲ . Because , the last integration is finite for large N. Hence,
Since (), and . This with the Parseval identity shows
Furthermore, one obtains thanks to (2). Hence,
According to (11) and the Rosenthal inequality,
Combining this with (13), one obtains for , which is the desired conclusion. When ,
due to (12) and (13). Moreover, . Since ,
Finally, . This completes the proof of the first part of Lemma 2.1. □
Now, we are in a position to state our first theorem.
Theorem 2.1 Let be defined by (8)-(9), , and . Then with and ,
Proof It is easy to see that for , because is continuously embedded into . Moreover, thanks to Jensen’s inequality.
When , and . Then
Therefore, it suffices to prove the theorem, for ,
If , then and
due to Lemma 1.2. On the other hand, and
because of Lemma 1.1 and . This with Lemma 2.1 leads to . Combining this with (15), one obtains
Take . Then the inequality (14) follows, and the proof of Theorem 2.1 is finished. □
Remark 2.1 If and , then , , Theorem 2.1 reduces to Theorem 4.1 in [11].
Remark 2.2 From the choice in the proof of Theorem 2.1, we find that our estimator is not adaptive, because it depends on the parameter s of . In order to avoid that shortcoming, we study a nonlinear estimation in the next part.
3 Nonlinear estimation
This section is devoted to an adaptive nonlinear estimation, which also improves the convergence rate of the linear one in some cases. The idea of proof comes from [12]. Choose ,
Let and be defined by (9) and (10), respectively, and
where the constant T will be determined in the proof of Theorem 3.1. Then we define a nonlinear wavelet estimator
where , and . Clearly, the cardinality since both and ψ have compact supports.
Lemma 3.1 If , then there exists such that, for each ,
Proof By the definitions of and , , where
Then and with ,
thanks to the classical Bernstein inequality in [15]. On the other hand, because of (12), (13), and . Hence, , and (17) reduces to
with . This completes the proof of Lemma 2.1. □
Theorem 3.1 Under the assumptions of Theorem 2.1, there exist and such that, for ,
Proof Similar to [12], one defines
and
It is easy to check that holds if and only if , and as well as if and only if , and . Then the conclusion of Theorem 3.1 can be rewritten as
Choose and such that
Then it can easily be shown by (16) that . Clearly,
where
By the assumption , and is continuously embedded into . Since , and thanks to Lemma 1.2. This with and the definition of μ leads to
Note that . Then due to Lemma 1.1, and Lemma 2.1. By and the choice ,
According to (21)-(23), it is sufficient to prove : Define
Then = + − + = , where
In order to conclude Theorem 3.1, one needs only to show that
By (16), and . This with Lemma 1.1 shows that, for , there exists such that
To estimate , one takes . Then, for each , and . This with the Hölder inequality shows
Clearly, . Furthermore, using , Lemma 2.1, and Lemma 3.1, one obtains . Moreover, by (25),
Choose . Then , and . Similar to (23), one has
In the proof of (26), one needs to choose .
Now, one considers : For , and . By Lemma 3.1, . Since , and
Moreover, it follows from the definition of and (25) that
By (16), one can choose (independent of n) so that . Hence, this above inequality reduces to . Similar arguments to (22) lead to
In this above proof, one needs to choose .
For , one uses , (25), and Lemma 2.1 to find
Recall that . Then , which reduces to by similar arguments of (23). It remains to show
By Lemma 2.1 and the definition of , ≲ ≲ . According to Lemma 1.2, . Hence,
Combining this above inequality with (25), one obtains
where as defined in (19).
When , . By the choice ,
Since for , and . Then one obtains the desired inequality (28).
When , and . Take
Then and in that case. With , one knows from Lemma 2.1 and the definition of that
Since , . This with (25) leads to
Note that due to the definition of , as well as implies . Then
Finally, one estimates : Define . Then
due to and the definition of . Using (25) and in (19), one obtains
When , . Recall that holds if and only if and . Then it can be checked that . Hence,
On the other hand, Lemma 1.1 tells that
Since , and . Moreover,
where the last inequality comes from the choice . Combining this with (31), one has, for ,
It remains to show for . By Lemma 1.1,
This with Lemma 1.2 shows that
When ,
When ,
For , by the Hölder inequality,
Using (34) and the choice , one obtains
On the other hand,
thanks to (30). According to the choice of and
for , one finds by direct computations. Hence, in each case, which is (33). Now, (24) follows from (26), (27), (29), and (33). The proof is done. □
Remark 3.1 We find easily from Theorem 2.1 and Theorem 3.1 that the nonlinear wavelet estimator converges faster than the linear one for . Moreover, the nonlinear estimator is adaptive, while the linear one is not.
Remark 3.2 This paper studies wavelet estimations of a density and its derivatives with Fourier-oscillating noises. The remaining problems include the optimality of the above estimations, numerical experiments as well as the corresponding regression problems. We shall investigate those problems in the future.
References
Walter GG: Density estimation in the presence of noise. Stat. Probab. Lett. 1999, 41: 237–246. 10.1016/S0167-7152(98)00160-6
Pensky M, Vidakovic B: Adaptive wavelet estimator for nonparametric density deconvolution. Ann. Stat. 1999, 27: 2033–2053. 10.1214/aos/1017939249
Fan J: On the optimal rates of convergence for nonparametric deconvolution problems. Ann. Stat. 1991, 19: 1257–1272. 10.1214/aos/1176348248
Fan J, Koo J: Wavelet deconvolution. IEEE Trans. Inf. Theory 2002, 48: 734–747. 10.1109/18.986021
Lounici K, Nickl R: Global uniform risk bounds for wavelet deconvolution estimators. Ann. Stat. 2011, 39: 201–231. 10.1214/10-AOS836
Li R, Liu Y: Wavelet optimal estimations for a density with some additive noises. Appl. Comput. Harmon. Anal. 2014, 36: 416–433. 10.1016/j.acha.2013.07.002
Sun, J, Morrison, H, Harding, P, Woodroofe, M: Density and mixture estimation from data with measurement errors. Technical report (2002)
Devroye L: Consistent deconvolution in density estimation. Can. J. Stat. 1989, 17: 235–239. 10.2307/3314852
Hall P, Meister A: A ridge-parameter approach to deconvolution. Ann. Stat. 2007, 35: 1535–1558. 10.1214/009053607000000028
Meister A: Deconvolution from Fourier-oscillating error densities under decay and smoothness restrictions. Inverse Probl. 2008, 24: 1–14.
Delaigle A, Meister A: Nonparametric function estimation under Fourier-oscillating noise. Stat. Sin. 2011, 21: 1065–1092. 10.5705/ss.2009.082
Donoho DL, Johnstone IM, Kerkyacharian G, Picard D: Density estimation by wavelet thresholding. Ann. Stat. 1996, 24: 508–539.
Hernández E, Weiss G: A First Course on Wavelets. CRC Press, Boca Raton; 1996.
Daubechies I: Ten Lectures on Wavelets. SIAM, Philadelphia; 1992.
Härdle W, Kerkyacharian G, Picard D, Tsybakov A Lecture Notes in Statist. In Wavelets, Approximation, and Statistical Applications. Springer, New York; 1998.
Acknowledgements
This paper is supported by the National Natural Science Foundation of China (No. 11271038).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
All authors contributed equally and significantly in writing this article. All authors read and approved the final manuscript.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Guo, H., Liu, Y. Wavelet estimations for densities and their derivatives with Fourier oscillating noises. J Inequal Appl 2014, 236 (2014). https://doi.org/10.1186/1029-242X-2014-236
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/1029-242X-2014-236