Nonlinear wavelet density estimation for biased data in Sobolev spaces

Wang, Jinru; Wang, Meng; Zhou, Yuan

doi:10.1186/1029-242X-2013-308

Research
Open access
Published: 03 July 2013

Nonlinear wavelet density estimation for biased data in Sobolev spaces

Jinru Wang¹,
Meng Wang¹ &
Yuan Zhou¹

Journal of Inequalities and Applications volume 2013, Article number: 308 (2013) Cite this article

1650 Accesses
42 Citations
Metrics details

Abstract

In this paper, we consider the density estimation problem from independent and identically distributed (i.i.d.) biased observations. We develop an adaptive wavelet hard thresholding rule and evaluate its performance by considering $L_{p}$ risk over Sobolev balls. We prove that our estimation attains a sharp rate of convergence and show the optimality.

MSC:49K40, 90C29, 90C31.

1 Introduction

In practice, it usually happens that drawing a direct sample from a random variable X is impossible. In this paper, we consider the problem of estimating the density functions $f^{X} (x)$ without observing directly the i.i.d. sample $X_{1}, X_{2}, \dots, X_{n}$ . We observe the samples $Y_{1}, Y_{2}, \dots, Y_{n}$ from biased data with the following density function:

f^{Y} (x) = \frac{g (x) f^{X} (x)}{μ},

(1.1)

where $g (x)$ is the so-called weight or bias function, $μ = E (g (X))$ . The purpose of this paper is to estimate the density function $f^{X} (x)$ from the samples $Y_{1}, Y_{2}, \dots, Y_{n}$ .

Several examples of this biased data can be found in the literature. For instance, in paper [1], it is shown that the distribution of the concentration of alcohol in the blood of intoxicated drivers is of interest, since the drunken driver has a larger chance of being arrested, the collected data are size-biased.

The density estimation problem for biased data (1.1) has been discussed in several papers. In 1982, Vardi [2] considered the nonparametric maximum likelihood estimation for $f^{X} (x)$ . In 1991, Jones [3] discussed the mean squared error properties of the kernel density estimation. In 2004, Efromovich [4] developed the Efromovich-Pinsker adaptive Fourier estimator. It was based on a blockwise shrinkage algorithm and achieved the minimax rate of convergence under the $L_{2}$ risk over a Besov class $B_{2, 2}^{s}$ .

In 2010, Ramírez and Vidakovic [5] proposed a linear wavelet estimator and discussed the consistency of a function in $L_{2} [0, 1]$ under the mean integrated squared error (MISE) sense. But the wavelet estimator in paper [5] contained the unknown parameter μ. In the same year, Christophe [6] constructed a nonlinear wavelet estimator and evaluated the $L_{p}$ risk in the Besov space $B_{r, q}^{s}$ . However, Sobolev spaces $W_{r}^{N}$ ( $N \in N^{+}$ ) except $r = 2$ is not a special case in the Besov space $B_{r, q}^{s}$ .

In this paper, we consider the nonlinear hard thresholding wavelet density estimation for biased data in Sobolev spaces $W_{r}^{N}$ ( $N \in N^{+}$ ). We mainly give the upper bound of minimax rate of convergence under the $L_{p}$ risk without particular restriction on the parameters r and p, and the convergence rate is optimal.

2 Preliminaries

In this section, we shall recall some well-known concepts and lemmas.

2.1 Wavelets

In this paper, we always assume that the scaling wavelet φ is orthonormal, compactly supported and $N + 1$ regular.

Definition 2.1 The scaling function $φ (x)$ is called m regular if $φ (x)$ has continuous derivatives of order m and its corresponding wavelet $ψ (x)$ has vanishing moments of order m, i.e., $\int x^{k} ψ (x) d x = 0$ , $k = 0, 1, \dots, m - 1$ .

The following conditions about the scaling function φ and the kernel function $K (x, y)$ will be very useful in the third section.

Condition (θ)

The function $θ_{φ} (x) = \sum_{k \in Z} | φ (x - k) |$ is such that $ess {sup}_{x \in R} θ_{φ} (x) < \infty$ .

Condition $H (N)$

There exists an integrable function $F (x)$ such that for any $x, y \in R$ , $| K (x, y) | \leq F (x - y)$ , where $\int {| x |}^{N} F (x) d x < \infty$ .

Condition $M (N)$

Condition $H (N)$ is satisfied and $\int K (x, y) {(y - x)}^{k} d y = δ_{0 k}$ , $k = 0, \dots, N$ , $x \in R$ .

For any $x \in R$ , $j, k \in Z$ , denoted by $φ_{j k} (x) : = 2^{\frac{j}{2}} φ (2^{j} x - k)$ , $ψ_{j k} (x) : = 2^{\frac{j}{2}} ψ (2^{j} x - k)$ , then for any $f (x) \in L_{r} (R) : = {f (x) | \int_{R} {| f (x) |}^{r} d x < \infty}$ , where $1 \leq r < \infty$ , we have the following equation [7]:

f (x) = \sum_{k \in Z} α_{J, k} φ_{J, k} (x) + \sum_{j \geq J} \sum_{k \in Z} β_{j, k} ψ_{j, k} (x), a.e.,

(2.1)

where

α_{J, k} = \int_{R} f (x) φ_{J, k} (x) d x, β_{j, k} = \int_{R} f (x) ψ_{j, k} (x) d x .

2.2 Sobolev space

The Sobolev space $W_{r}^{N} (R)$ ( $N \in N^{+}$ ) is defined by $W_{r}^{N} (R) : = {f : f \in L_{r} (R), f^{(N)} \in L_{r} (R)}$ , which is equipped with the norm ${∥ f ∥}_{W_{r}^{N}} : = {∥ f ∥}_{r} + {∥ f^{(N)} ∥}_{r}$ . The Sobolev balls ${\tilde{W}}_{r}^{N} (A, L)$ are defined as follows:

\begin{array}{rcl} {\tilde{W}}_{r}^{N} (A, L) & : = & {f \in W_{r}^{N} (R) : f is a probability density function, supp f \leq A, \\ {∥ f^{(N)} ∥}_{r} \leq L} . \end{array}

Between a Sobolev space and a Besov space, the following embedding conclusions are established.

Lemma 2.1 [8]

Let $s > 0$ , $1 \leq p, q, r \leq \infty$ , then

(i)
$W_{r}^{N} ↪ B_{r \infty}^{N} ↪ B_{\infty \infty}^{N - 1 / r}$ , $\forall N > 1 / r$ ;
(ii)
$B_{r q}^{s} ↪ B_{p q}^{s^{'}}$ , $\forall r < p$ , $s^{'} = s - 1 / r + 1 / p$ ,

where $A ↪ B$ denotes that the Banach space A is continuously embedding in the Banach space B, i.e., there exists a constant $c \geq 0$ such that for any $u \in A$ , we have ${∥ u ∥}_{B} \leq c {∥ u ∥}_{A}$ .

2.3 Auxiliary lemmas

The following lemmas given by [9] will be used in the next section.

Lemma 2.2 If the scaling function φ satisfies Condition (θ), then for any sequence ${λ_{k}}_{k \in Z}$ satisfying ${∥ λ ∥}_{l_{p}} : = {(\sum_{k} {| λ_{k} |}^{p})}^{\frac{1}{p}} < \infty$ , we have $C_{1} {∥ λ ∥}_{l_{p}} 2^{(\frac{j}{2} - \frac{j}{p})} \leq {∥ \sum_{k} λ_{k} φ_{j, k} ∥}_{p} \leq C_{2} {∥ λ ∥}_{l_{p}} 2^{(\frac{j}{2} - \frac{j}{p})}$ , where $C_{1} = {({∥ θ_{φ} ∥}_{\infty}^{\frac{1}{p}} {∥ φ ∥}_{1}^{\frac{1}{q}})}^{- 1}$ , $C_{2} = {({∥ θ_{φ} ∥}_{\infty}^{\frac{1}{q}} {∥ φ ∥}_{1}^{\frac{1}{p}})}^{- 1}$ , $1 \leq p \leq \infty$ , $\frac{1}{p} + \frac{1}{q} = 1$ .

Lemma 2.3 For some integer $N \geq 0$ , if the kernel function $K (x, y)$ satisfies Conditions $M (N)$ and $H (N + 1)$ , $f \in B_{p q}^{s} (R)$ , where $1 \leq p, q \leq \infty$ , $0 < s < N + 1$ , then we have ${∥ K_{j} f - f ∥}_{p} = 2^{- j s} ε_{j}$ , where $ε_{j} \in l_{q}$ .

Lemma 2.4 (Rosenthal inequality)

Let $X_{1}, \dots, X_{n}$ be independent random variables such that $E (X_{i}) = 0$ and $E ({| X_{i} |}^{p}) < \infty$ , then there exists a constant $C (p) > 0$ such that

\begin{matrix} E ({| \sum_{i = 1}^{n} X_{i} |}^{p}) \leq C (p) (\sum_{i = 1}^{n} E ({| X_{i} |}^{p}) + {(\sum_{i = 1}^{n} E (X_{i}^{2}))}^{p / 2}), p > 2, \\ E ({| \sum_{i = 1}^{n} X_{i} |}^{p}) \leq {(\sum_{i = 1}^{n} E (X_{i}^{2}))}^{p / 2}, 0 < p \leq 2 . \end{matrix}

Lemma 2.5 (Bernstein inequality)

Let $X_{1}, X_{2}, \dots, X_{n}$ be independent random variables such that $E (X_{i}) = 0$ , $E (X_{i}^{2}) \leq σ^{2}$ , $| X_{i} | \leq M < \infty$ . Then

P (| \frac{1}{n} \sum_{i = 1}^{n} X_{i} | > λ) \leq 2 exp (- \frac{n λ^{2}}{2 (σ^{2} + M λ / 3)}), \forall λ > 0 .

Remark In this paper, we often use the notation $A ≲ B$ to indicate that $A ⩽ c B$ with a positive constant c, which is independent of A and B. If $A ≲ B$ and $B ≲ A$ , we write $A \sim B$ .

3 Main results

In this paper, our hard thresholding wavelet density estimator is defined as follows:

{\hat{f}}_{n}^{X non} (x) = \sum_{k} {\hat{α}}_{j_{0} k} φ_{j_{0} k} (x) + \sum_{j = j_{0}}^{j_{1}} \sum_{k} {\hat{β}}_{j k}^{*} ψ_{j k} (x),

(3.1)

where

{\hat{α}}_{j_{0} k} : = \frac{\hat{μ}}{n} \sum_{i = 1}^{n} \frac{φ_{j_{0} k} (Y_{i})}{g (Y_{i})}, {\hat{β}}_{j k} : = \frac{\hat{μ}}{n} \sum_{i = 1}^{n} \frac{ψ_{j k} (Y_{i})}{g (Y_{i})}, \hat{μ} : = \frac{n}{\sum_{i = 1}^{n} \frac{1}{g (Y_{i})}} .

The hard thresholding wavelet coefficients are ${\hat{β}}_{j k}^{*} : = {\hat{β}}_{j k} I {| {\hat{β}}_{j k} | \geq λ}$ , where

I {| {\hat{β}}_{j k} | \geq λ} : = {\begin{matrix} 1, & | {\hat{β}}_{j k} | \geq λ, \\ 0, & | {\hat{β}}_{j k} | < λ . \end{matrix}

Suppose that the parameters $j_{0}$ , $j_{1}$ , λ of the wavelet thresholding estimator (3.1) satisfy the assumptions:

2^{j_{0}} \sim {\begin{matrix} {({(ln n)}^{\frac{p - r}{r}} n)}^{\frac{1}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ n^{\frac{1 - 2 / p}{2 (N - 1 / r) + 1}}, & r \leq \frac{p}{2 N + 1}, \end{matrix}

(3.2)

2^{j_{1}} \sim {\begin{matrix} n^{\frac{N}{N^{'} (2 N + 1)}}, & r > \frac{p}{2 N + 1}, \\ {(\frac{n}{ln n})}^{\frac{1}{2 (N - 1 / r) + 1}}, & r \leq \frac{p}{2 N + 1}, \end{matrix}

(3.3)

λ = c \sqrt{\frac{j}{n}},

(3.4)

where c is a suitably chosen positive constant.

Lemma 3.1 Suppose that there exist two constants $g_{1}$ and $g_{2}$ such that $0 < g_{1} \leq g (x) \leq g_{2} < \infty$ for $x \in R$ . Let $α_{j k}$ , $β_{j k}$ be the coefficients in the expansion (2.1) and let ${\hat{α}}_{j k}$ , ${\hat{β}}_{j k}$ be defined by estimator in (3.1). If $2^{j} \leq n$ , then for any $1 \leq p < \infty$ , we have

(i)
$E {| α_{j k} - {\hat{α}}_{j k} |}^{p} ≲ n^{- \frac{p}{2}}$ ;
(ii)
$E {| β_{j k} - {\hat{β}}_{j k} |}^{p} ≲ n^{- \frac{p}{2}}$ .

Proof (i) From the definition of ${\hat{α}}_{j k}$ and the triangular inequality, we have

\begin{array}{rcl} | {\hat{α}}_{j, k} - α_{j, k} | & = & | \frac{\hat{μ}}{n} \sum_{i = 1}^{n} \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k} | \\ = & | \frac{\hat{μ}}{μ} (\frac{μ}{n} \sum_{i = 1}^{n} \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k}) + \hat{μ} α_{j, k} (\frac{1}{μ} - \frac{1}{\hat{μ}}) | \\ \leq & | \frac{\hat{μ}}{μ} | | \frac{μ}{n} \sum_{i = 1}^{n} \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k} | + | \hat{μ} α_{j, k} | | \frac{1}{\hat{μ}} - \frac{1}{μ} | . \end{array}

Since $g_{1} \leq g (y) \leq g_{2}$ , we have

\hat{μ} = \frac{n}{\sum_{i = 1}^{n} \frac{1}{g (Y_{i})}} \leq g_{2}, μ = E g (X) \geq g_{1},

and

| α_{j, k} | \leq \int | f^{X} (y) | | φ_{j, k} (y) | d y \leq {(\int {| f^{X} (y) |}^{2} d y)}^{\frac{1}{2}} {(\int {| φ_{j, k} (y) |}^{2} d y)}^{\frac{1}{2}} \leq A^{1 / 2} {∥ f ∥}_{\infty} .

Furthermore, a Sobolev space and a Besov space have the following embedding theorem, $W_{r}^{N} ↪ B_{r \infty}^{N} ↪ B_{\infty \infty}^{N - 1 / r}$ , for any integer $N > 1 / r$ , then we have ${∥ f ∥}_{\infty} \leq {∥ f ∥}_{B_{\infty \infty}^{N - 1 / r}} \leq {∥ f ∥}_{W_{r}^{N}} = c$ . Therefore, by the convexity inequality, we get

\begin{matrix} E {| {\hat{α}}_{j, k} - α_{j, k} |}^{p} \\ \leq E {(\frac{g_{2}}{g_{1}} | \frac{μ}{n} \sum_{i = 1}^{n} \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k} | + g_{2} A^{1 / 2} c | \frac{1}{\hat{μ}} - \frac{1}{μ} |)}^{p} \\ \leq 2^{p - 1} max {\frac{g_{2}}{g_{1}}, g_{2} A^{1 / 2} c}^{p} E ({| \frac{μ}{n} \sum_{i = 1}^{n} \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k} |}^{p} + {| \frac{1}{\hat{μ}} - \frac{1}{μ} |}^{p}) \\ ≲ E {| \frac{μ}{n} \sum_{i = 1}^{n} \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k} |}^{p} + E {| \frac{1}{\hat{μ}} - \frac{1}{μ} |}^{p} \\ = : T_{1} + T_{2}, \end{matrix}

where $T_{1} : = E {| \frac{μ}{n} \sum_{i = 1}^{n} \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k} |}^{p}$ , $T_{2} : = E {| \frac{1}{\hat{μ}} - \frac{1}{μ} |}^{p}$ .

The term $T_{i}$ is estimated as follows. Firstly, let $ξ_{i} : = μ \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k}$ , we can see that they are i.i.d., and $E (ξ_{i}) = 0$ . Moreover, for any $m \geq 2$ ,

\begin{array}{rcl} E {| ξ_{i} |}^{m} & = & E {| μ \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} - α_{j, k} |}^{m} \\ \leq & 2^{m - 1} (E {| μ \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} |}^{m} + {| α_{j, k} |}^{m}), \end{array}

where

\begin{array}{rcl} E {| μ \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} |}^{m} & = & μ^{m - 1} \frac{φ_{j, k}^{m - 2} (Y_{i})}{g {(Y_{i})}^{m - 1}} E | μ \frac{φ_{j, k}^{2} (Y_{i})}{g (Y_{i})} | \\ \leq & g_{2}^{m - 1} g_{1}^{- m + 1} 2^{\frac{j}{2} (m - 2)} {∥ φ ∥}_{\infty}^{m - 2} E | μ \frac{φ_{j, k}^{2} (Y_{i})}{g (Y_{i})} |, \end{array}

and

\begin{array}{rcl} E | μ \frac{φ_{j, k}^{2} (Y_{i})}{g (Y_{i})} | & = & \int μ \frac{φ_{j, k}^{2} (y)}{g (y)} f^{Y} (y) d y = \int μ \frac{φ_{j, k}^{2} (y)}{g (y)} \frac{g (y) f^{X} (y)}{μ} d y \\ \leq & {∥ f ∥}_{\infty} \leq {∥ f ∥}_{W_{r}^{N}} = c . \end{array}

So, we have

E {| μ \frac{φ_{j, k} (Y_{i})}{g (Y_{i})} |}^{m} \leq g_{2}^{m - 1} g_{1}^{- m + 1} 2^{\frac{j}{2} (m - 2)} {∥ φ ∥}_{\infty}^{m - 2} c .

Since $2^{j} \leq n$ , we obtain

E {| ξ_{i} |}^{m} \leq C 2^{j (m - 2) / 2} ≲ n^{(m - 2) / 2} .

By Rosenthal’s inequality, we have

T_{1} = E {| \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} |}^{p} ≲ n^{- p} (n E {| ξ_{i} |}^{p} + n^{p / 2} {(E {| ξ_{i} |}^{2})}^{p / 2}) ≲ n^{- p / 2} .

(3.5)

To estimate the term $T_{2}$ , let $η_{i} = \frac{1}{g (Y_{i})} - \frac{1}{μ}$ . We can compute $E (η_{i}) = 0$ easily, and for any $m \geq 2$ , $E {| η_{i} |}^{m} \leq C$ .

If $p \geq 2$ , i.e., $1 - p < - p / 2$ , using Rosenthal’s inequality, we have

T_{2} = E {| \frac{1}{n} \sum_{i = 1}^{n} η_{i} |}^{p} ≲ n^{- p} (n E {| η_{i} |}^{p} + n^{p / 2} {(E {| η_{i} |}^{2})}^{p / 2}) ≲ n^{- p + 1} + n^{- p / 2} ≲ n^{- p / 2} .

(3.6)

If $1 \leq p < 2$ , we get

T_{2} = E {| \frac{1}{n} \sum_{i = 1}^{n} η_{i} |}^{p} \leq n^{- p} (n^{p / 2} {(E {| η_{i} |}^{2})}^{p / 2}) \leq n^{- p / 2} .

(3.7)

By (3.5), (3.6) and (3.7), we obtain

E {| {\hat{α}}_{j, k} - α_{j, k} |}^{p} ≲ T_{1} + T_{2} ≲ n^{- p / 2} .

(ii)
It is similar to (i), we omit it. □

Lemma 3.2 If $j 2^{j} \leq n$ , then for any $ω > 0$ , there exists a constant $c > 0$ such that

P (| {\hat{β}}_{j k} - β_{j k} | > λ = c \sqrt{\frac{j}{n}}) ≲ 2^{- ω j} .

(3.8)

Proof We can easily get

\begin{matrix} \hat{μ} \leq g_{2}, μ \geq g_{1}, \frac{1}{μ} \leq g_{1}^{- 1}, \\ | β_{j, k} | \leq A^{1 / 2} {∥ f ∥}_{\infty} \leq A^{1 / 2} {∥ f ∥}_{W_{r}^{N}} . \end{matrix}

Therefore,

\begin{array}{rcl} | {\hat{β}}_{j, k} - β_{j, k} | & = & | \frac{\hat{μ}}{μ} (\frac{μ}{n} \sum_{i = 1}^{n} \frac{ψ_{j, k} (Y_{i})}{g (Y_{i})} - β_{j, k}) + \hat{μ} β_{j, k} (\frac{1}{μ} - \frac{1}{\hat{μ}}) | \\ \leq & \frac{g_{2}}{g_{1}} | \frac{1}{n} \sum_{i = 1}^{n} (μ \frac{ψ_{j, k} (Y_{i})}{g (Y_{i})} - β_{j, k}) | + g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}} | \frac{1}{n} \sum_{i = 1}^{n} (\frac{1}{g (Y_{i})} - \frac{1}{μ}) | \\ = : & \frac{g_{2}}{g_{1}} | \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} | + g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}} | \frac{1}{n} \sum_{i = 1}^{n} η_{i} |, \end{array}

where $ξ_{i} = μ \frac{ψ_{j, k} (Y_{i})}{g (Y_{i})} - β_{j, k}$ , $η_{i} = \frac{1}{g (Y_{i})} - \frac{1}{μ}$ . So, we get

\begin{array}{rcl} P (| {\hat{β}}_{j, k} - β_{j, k} | > λ) & \leq & P (\frac{g_{2}}{g_{1}} | \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} | + g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}} | \frac{1}{n} \sum_{i = 1}^{n} η_{i} | > λ) \\ \leq & P (\frac{g_{2}}{g_{1}} | \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} | > λ / 2) + P (g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}} | \frac{1}{n} \sum_{i = 1}^{n} η_{i} | > λ / 2) \\ = & P (| \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} | > \frac{λ g_{1}}{2 g_{2}}) + P (| \frac{1}{n} \sum_{i = 1}^{n} η_{i} | > \frac{λ}{2 g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}}) \\ = : & P_{1} + P_{2}, \end{array}

where $P_{1} : = P (| \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} | > \frac{λ g_{1}}{2 g_{2}})$ , $P_{2} : = P (| \frac{1}{n} \sum_{i = 1}^{n} η_{i} | > \frac{λ}{2 g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}})$ .

Now, we estimate $P_{1}$ . Clearly, $E ξ_{i} = 0$ , and

\begin{array}{rcl} E ξ_{i}^{2} & = & E {(μ \frac{ψ_{j, k} (Y_{i})}{g (Y_{i})} - β_{j, k})}^{2} \\ \leq & 2 (E {| μ \frac{ψ_{j, k} (Y_{i})}{g (Y_{i})} |}^{2} + β_{j, k}^{2}) \\ = & 2 (E | μ \frac{ψ_{j, k}^{2} (Y_{i})}{g (Y_{i})} | \frac{μ}{g (Y_{i})} + β_{j, k}^{2}) \\ \leq & 2 (\frac{g_{2}}{g_{1}} {∥ f^{X} ∥}_{W_{r}^{N}} + A {∥ f^{X} ∥}_{W_{r}^{N}}^{2}) \\ : = & σ^{2} . \end{array}

Furthermore, we have

| ξ_{i} | = | μ \frac{ψ_{j, k} (Y_{i})}{g (Y_{i})} - E (μ \frac{ψ_{j, k} (Y_{i})}{g (Y_{i})}) | \leq 2 \cdot 2^{j / 2} g_{2} g_{1}^{- 1} {∥ ψ ∥}_{\infty} .

By Bernstein’s inequality, we obtain

\begin{array}{rcl} P (| \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} | > \frac{λ g_{1}}{2 g_{2}}) & \leq & 2 exp (- \frac{n λ^{2} g_{1}^{2} / 4 g_{2}^{2}}{2 (σ^{2} + \frac{λ g_{1}}{2 g_{2}} 2 \cdot 2^{j / 2} g_{2} g_{1}^{- 1} {∥ ψ ∥}_{\infty} / 3)}) \\ = & 2 exp (- \frac{n \cdot c^{2} \frac{j}{n} \cdot g_{1}^{2} / 4 g_{2}^{2}}{2 (σ^{2} + g_{2} c \sqrt{j / n} 2^{j / 2} g_{2} g_{1}^{- 1} {∥ ψ ∥}_{\infty} g_{1} / 3)}) \\ = & 2 exp (- \frac{c^{2} j g_{1}^{2} / 4 g_{2}^{2}}{2 (σ^{2} + \sqrt{j 2^{j} / n} {∥ ψ ∥}_{\infty} c / 3)}) . \end{array}

Since $j 2^{j} \leq n$ , then

\begin{array}{rcl} P (| \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} | > \frac{λ g_{1}}{2 g_{2}}) & \leq & 2 exp (- \frac{c^{2} j g_{1}^{2} / 4 g_{2}^{2}}{2 (σ^{2} + {∥ ψ ∥}_{\infty} c / 3)}) \\ = & 2 exp (- \frac{c^{2} g_{1}^{2} / 4 g_{2}^{2}}{2 (σ^{2} + {∥ ψ ∥}_{\infty} c / 3)} j) . \end{array}

Taking $c_{1} > 0$ such that $\frac{c_{1}^{2} g_{1}^{2} / 4 g_{2}^{2}}{2 (σ^{2} + {∥ ψ ∥}_{\infty} c_{1} / 3)} \geq ω$ , then

P_{1} = P (| \frac{1}{n} \sum_{i = 1}^{n} ξ_{i} | > \frac{λ g_{1}}{2 g_{2}}) \leq 2 e^{- ω j} ≲ 2^{- ω j} .

(3.9)

Next, we estimate $P_{2}$ . We compute that $E η_{i} = 0$ , i.e.,

\begin{array}{rcl} E η_{i} & = & E (\frac{1}{g (Y_{i})}) - E (\frac{1}{μ}) = \int \frac{1}{g (y)} f^{Y} (y) d y - \frac{1}{μ} \\ = & \int \frac{1}{g (y)} \frac{g (y) f^{X} (y)}{μ} d y - \frac{1}{μ} \\ = & \frac{1}{μ} \int f^{X} (y) d y - \frac{1}{μ} = 0, \end{array}

and

\begin{matrix} E η_{i}^{2} = E {(\frac{1}{g (Y_{i})} - \frac{1}{μ})}^{2} \leq 2 (E {| \frac{1}{g (Y_{i})} |}^{2} + \frac{1}{μ^{2}}) \leq \frac{4}{g_{1}^{2}}, \\ | η_{i} | = | \frac{1}{g (Y_{i})} - \frac{1}{μ} | = | \frac{1}{g (Y_{i})} - E \frac{1}{g (Y_{i})} | \leq 2 g_{1}^{- 1} . \end{matrix}

By Bernstein’s inequality, we obtain

\begin{matrix} P (| \frac{1}{n} \sum_{i = 1}^{n} η_{i} | > \frac{λ}{2 g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}}) \\ \leq 2 exp (- \frac{n {(λ / (2 g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}))}^{2}}{2 (\frac{4}{g_{1}^{2}} + λ g_{1}^{- 1} / (3 g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}))}) \\ = 2 exp (- \frac{n c^{2} j / (4 n g_{2}^{2} A {∥ f^{X} ∥}_{W_{r}^{N}}^{2})}{2 (\frac{4}{g_{1}^{2}} + c \sqrt{j / n} / (3 g_{1} g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}))}) . \end{matrix}

Since $j \leq n$ , then

\begin{matrix} P (| \frac{1}{n} \sum_{i = 1}^{n} η_{i} | > \frac{λ}{2 g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}}) \\ \leq 2 exp (- \frac{c^{2} / (4 g_{2}^{2} A {∥ f^{X} ∥}_{W_{r}^{N}}^{2})}{2 (\frac{4}{g_{1}^{2}} + c / (3 g_{1} g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}))} j) . \end{matrix}

Taking $c_{2} > 0$ such that $\frac{c_{2}^{2} / (4 g_{2}^{2} A {∥ f^{X} ∥}_{W_{r}^{N}}^{2})}{2 (\frac{4}{g_{1}^{2}} + c_{2} / (3 g_{1} g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}))} \geq ω$ , we have

P_{2} = P (| \frac{1}{n} \sum_{i = 1}^{n} η_{i} | > \frac{λ}{2 g_{2} A^{1 / 2} {∥ f^{X} ∥}_{W_{r}^{N}}}) \leq 2 e^{- ω j} ≲ 2^{- ω j} .

(3.10)

Taking $c = max {c_{1}, c_{2}}$ , by (3.9) and (3.10), we have

P (| {\hat{β}}_{j, k} - β_{j, k} | > λ) \leq P_{1} + P_{2} ≲ 2^{- ω j} .

□

Lemma 3.3 Suppose that there exist two constants $g_{1}$ and $g_{2}$ such that $0 < g_{1} \leq g (x) \leq g_{2} < \infty$ , for $x \in R$ , and ${\hat{β}}_{j k}$ , ${\hat{β}}_{j k}^{*}$ are given by (3.1). Then

E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} ≲ {\begin{matrix} {(ln n)}^{c_{3}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(ln n)}^{c_{4}} {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r = \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r < \frac{p}{2 N + 1}, \end{matrix}

where $c_{3}$ , $c_{4}$ are constants.

Proof By Lemma 2.2, we obtain

\begin{array}{rcl} E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} & \leq & E \sum_{j = j_{0}}^{j_{1}} {∥ \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} \\ ≲ & E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k}^{*} - β_{j k} |}^{p})}^{\frac{1}{p}} . \end{array}

Furthermore, since ${\hat{β}}_{j k}^{*} = {\hat{β}}_{j k} I {| {\hat{β}}_{j k} | > λ}$ , we have

\begin{matrix} E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} \\ ≲ E (\sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p})}^{\frac{1}{p}} \\ \times (I {| {\hat{β}}_{j k} | > λ, | β_{j k} | \geq \frac{λ}{2}} + I {| {\hat{β}}_{j k} | > λ, | β_{j k} | < \frac{λ}{2}}) \\ + \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| β_{j k} |}^{p})}^{\frac{1}{p}} (I {| {\hat{β}}_{j k} | \leq λ, | β_{j k} | \leq 2 λ} + I {| {\hat{β}}_{j k} | \leq λ, | β_{j k} | > 2 λ})) . \end{matrix}

Note that

\begin{matrix} I {| {\hat{β}}_{j k} | > λ, | β_{j k} | < \frac{λ}{2}} \leq I {| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}}, \\ I {| {\hat{β}}_{j k} | \leq λ, | β_{j k} | > 2 λ} \leq I {| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}}, \end{matrix}

and if $| {\hat{β}}_{j k} | \leq λ$ , $| β_{j k} | > 2 λ$ , we get $| {\hat{β}}_{j k} - β_{j k} | \geq | β_{j k} | - | {\hat{β}}_{j k} | > \frac{| β_{j k} |}{2}$ , i.e., $| β_{j k} | < 2 | {\hat{β}}_{j k} - β_{j k} |$ ; therefore, we have

\begin{array}{rcl} E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} & ≲ & E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| β_{j k} | \geq \frac{λ}{2}})}^{\frac{1}{p}} \\ + E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}})}^{\frac{1}{p}} \\ + \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| β_{j k} |}^{p} I {| β_{j k} | \leq 2 λ})}^{\frac{1}{p}} \\ = : & W_{1} + W_{2} + W_{3}, \end{array}

where

\begin{matrix} W_{1} : = E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| β_{j k} | \geq \frac{λ}{2}})}^{\frac{1}{p}}, \\ W_{2} : = E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}})}^{\frac{1}{p}}, \\ W_{3} : = \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| β_{j k} |}^{p} I {| β_{j k} | \leq 2 λ})}^{\frac{1}{p}} . \end{matrix}

(i)
Firstly, we estimate
$W_{1} : = E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| β_{j k} | \geq \frac{λ}{2}})}^{\frac{1}{p}} .$

By Lemma 3.1, we have

E {| {\hat{β}}_{j k} - β_{j k} |}^{p} ≲ n^{- \frac{p}{2}} .

Using $I {| β_{j k} | \geq \frac{λ}{2}} \leq {(\frac{| β_{j k} |}{\frac{λ}{2}})}^{r}$ and Jensen’s inequality, we obtain

\begin{array}{rcl} W_{1} & = & E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| β_{j k} | \geq \frac{λ}{2}})}^{\frac{1}{p}} \\ \leq & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} E {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| β_{j k} | > \frac{λ}{2}})}^{\frac{1}{p}} \\ ≲ & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} n^{- \frac{p}{2}} {(\frac{| β_{j k} |}{λ / 2})}^{r})}^{\frac{1}{p}} \\ = & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} n^{- \frac{1}{2}} λ^{- \frac{r}{p}} {∥ β_{j \cdot} ∥}_{r}^{\frac{r}{p}} . \end{array}

By ${∥ β_{j \cdot} ∥}_{r} ≲ 2^{- j (N + \frac{1}{2} - \frac{1}{r})}$ and $λ \sim c \sqrt{\frac{ln n}{n}}$ , we have

\begin{aligned} W_{1} & ≲ \sum_{j = j_{0}}^{j_{1}} n^{- \frac{1}{2}} 2^{j (\frac{1}{2} - \frac{1}{p})} 2^{- j (N + \frac{1}{2} - \frac{1}{r}) \frac{r}{p}} {(\frac{n}{ln n})}^{\frac{r}{2 p}} \\ = n^{- \frac{1}{2}} {(\frac{n}{ln n})}^{\frac{r}{2 p}} \sum_{j = j_{0}}^{j_{1}} 2^{- j ξ} \\ \leq n^{\frac{r - p}{2 p}} ln n^{- \frac{r}{2 p}} (2^{- j_{0} ξ} I {ξ > 0} + 2^{- j_{1} ξ} I {ξ < 0} + (j_{1} - j_{0} + 1) I {ξ = 0}) . \end{aligned}

Using Lemma 3.1 and (3.4), we obtain

W_{1} ≲ {\begin{matrix} {(ln n)}^{c_{3}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(ln n)}^{c_{4}} {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r = \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r < \frac{p}{2 N + 1}, \end{matrix}

(3.11)

where $c_{3}$ , $c_{4}$ are constants.

(ii)
For
$W_{3} : = \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| β_{j k} |}^{p} I {| β_{j k} | \leq 2 λ})}^{\frac{1}{p}},$

let $ξ : = \frac{1}{2} (\frac{r}{p} (2 N + 1) - 1)$ . By $I {| β_{j k} | \leq 2 λ} \leq {(\frac{2 λ}{| β_{j k} |})}^{p - r}$ ( $r < p$ ), we have

\begin{array}{rcl} W_{3} & ≲ & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| β_{j k} |}^{p} {(\frac{2 λ}{| β_{j k} |})}^{p - r})}^{\frac{1}{p}} \\ = & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(2 λ)}^{\frac{p - r}{p}} {(\sum_{k} {| β_{j k} |}^{r})}^{\frac{1}{p}} \\ = & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(2 λ)}^{\frac{p - r}{p}} {∥ β_{j \cdot} ∥}_{r}^{\frac{r}{p}} . \end{array}

Since $f^{X} \in W_{r}^{N} (R)$ , then ${∥ β_{j \cdot} ∥}_{r} ≲ 2^{- j (N + \frac{1}{2} - \frac{1}{r})}$ . Taking $λ = c \sqrt{\frac{j}{n}} \sim c \sqrt{\frac{ln n}{n}}$ , $j_{1} - j_{0} \sim C (ln n)$ , we have

\begin{array}{rcl} W_{3} & ≲ & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} λ^{\frac{p - r}{p}} 2^{- j (N + \frac{1}{2} - \frac{1}{r}) \frac{r}{p}} \\ ≲ & {(\frac{ln n}{n})}^{\frac{p - r}{2 p}} \sum_{j = j_{0}}^{j_{1}} 2^{- j \frac{1}{2} [\frac{r}{p} (2 N + 1) - 1]} \\ = & {(\frac{ln n}{n})}^{\frac{p - r}{2 p}} \sum_{j = j_{0}}^{j_{1}} 2^{- j ξ} \\ ≲ & {(\frac{ln n}{n})}^{\frac{p - r}{2 p}} (2^{- j_{0} ξ} I {ξ > 0} + 2^{- j_{1} ξ} I {ξ < 0} + (j_{1} - j_{0} + 1) I {ξ = 0}) . \end{array}

Note that $ξ > 0$ if and only if $r > \frac{p}{2 N + 1}$ . When $ξ = 0$ , i.e., $p = r (2 N + 1)$ , we can compute $\frac{N^{'}}{2 (N - \frac{1}{r}) + 1} = \frac{p - r}{2 p}$ . Using (3.2), (3.3), we obtain

W_{3} ≲ {\begin{matrix} {(\frac{ln n}{n})}^{\frac{p - r}{2 p}} 2^{- j_{0} ξ} = {(ln n)}^{\frac{p - r}{2 r (2 N + 1)}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{p - r}{2 p}} (j_{1} - j_{0} + 1) ≲ {(\frac{ln n}{n})}^{\frac{p - r}{2 p}}, & r = \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{p - r}{2 p}} 2^{- j_{1} ξ} = {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r < \frac{p}{2 N + 1} . \end{matrix}

(3.12)

(iii)
Finally, we estimate
$W_{2} : = E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}})}^{\frac{1}{p}} .$

Let $1 < q^{'}, q < \infty$ , and $\frac{1}{q} + \frac{1}{q^{'}} = 1$ . Using Jensen’s inequality and Hölder’s inequality, we have

\begin{array}{rcl} W_{2} & = & E \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}})}^{\frac{1}{p}} \\ \leq & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} E ({| {\hat{β}}_{j k} - β_{j k} |}^{p} I {| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}}))}^{\frac{1}{p}} \\ \leq & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {(E {| {\hat{β}}_{j k} - β_{j k} |}^{q p})}^{\frac{1}{q}} {(E I^{q^{'}} {| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}})}^{\frac{1}{q^{'}}})}^{\frac{1}{p}} \\ \leq & \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} {(E {| {\hat{β}}_{j k} - β_{j k} |}^{q p})}^{\frac{1}{q}} {(P (| {\hat{β}}_{j k} - β_{j k} | > \frac{λ}{2}))}^{\frac{1}{q^{'}}})}^{\frac{1}{p}} . \end{array}

By Lemma 3.1 and Lemma 3.2, we obtain

W_{2} \leq \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{1}{p})} {(2^{j} n^{- \frac{p}{2}} 2^{- \frac{ω j}{q^{'}}})}^{\frac{1}{p}} = n^{- \frac{1}{2}} \sum_{j = j_{0}}^{j_{1}} 2^{j (\frac{1}{2} - \frac{ω}{p q^{'}})} .

Taking large enough ω such that $\frac{1}{2} < \frac{ω}{p q^{'}}$ , we get

W_{2} ≲ n^{- \frac{1}{2}} 2^{j_{0} (\frac{1}{2} - \frac{ω}{p q^{'}})} \leq \sqrt{n^{- 1} 2^{j_{0}}} .

Taking $2^{j_{0}}$ as in (3.2), we have

W_{2} ≲ {\begin{matrix} {(ln n)}^{\frac{p - r}{2 r (2 N + 1)}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ n^{- \frac{N^{'}}{2 (N - 1 / r) + 1}}, & r \leq \frac{p}{2 N + 1} . \end{matrix}

(3.13)

Putting (3.11), (3.12) and (3.13) together, we can obtain

E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} ≲ {\begin{matrix} {(ln n)}^{c_{3}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(ln n)}^{c_{4}} {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r = \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r < \frac{p}{2 N + 1}, \end{matrix}

where $c_{3}$ , $c_{4}$ are constants. □

Theorem 3.4 Let the scaling function $φ (x)$ be orthonormal, compactly supported and $N + 1$ regular. There exist two positive constants $g_{1}$ and $g_{2}$ such that $g_{1} \leq g (x) \leq g_{2}$ , $x \in R$ . If ${\hat{f}}_{n}^{X non}$ is the nonlinear wavelet estimator in (3.1), and assumptions (3.2), (3.3) and (3.4) are satisfied, then for any $f^{X} \in {\tilde{W}}_{r}^{N} (A, L)$ , where $1 \leq r < p < \infty$ , $N > \frac{1}{r}$ , we have

sup_{f^{X} \in {\tilde{W}}_{r}^{N} (A, L)} E {∥ {\hat{f}}_{n}^{X non} - f^{X} ∥}_{p} ≲ {\begin{matrix} {(ln n)}^{c_{3}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(ln n)}^{c_{4}} {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r = \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r < \frac{p}{2 N + 1}, \end{matrix}

where $N^{'} = N - 1 / r + 1 / p$ , $c_{3}$ , $c_{4}$ are constants.

Proof By the definition of ${\hat{f}}_{n}^{X non}$ in (3.1) and the expansion of $f^{X}$ in (2.1), one has

{\hat{f}}_{n}^{X non} - f^{X} = \sum_{k} ({\hat{α}}_{j_{0} k} - α_{j_{0} k}) φ_{j_{0} k} + \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} + P_{j_{1} + 1} f^{X} - f^{X} .

Then

\begin{matrix} E {∥ {\hat{f}}_{n}^{X non} - f^{X} ∥}_{p} \\ \leq E {∥ \sum_{k} ({\hat{α}}_{j_{0} k} - α_{j_{0} k}) φ_{j_{0} k} ∥}_{p} + E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} + {∥ P_{j_{1} + 1} f^{X} - f^{X} ∥}_{p} \\ = : I_{1} + I_{2} + I_{3}, \end{matrix}

where

\begin{matrix} I_{1} : = E {∥ \sum_{k} ({\hat{α}}_{j_{0} k} - α_{j_{0} k}) φ_{j_{0} k} ∥}_{p}, \\ I_{2} : = E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p}, \\ I_{3} : = {∥ P_{j_{1} + 1} f^{X} - f^{X} ∥}_{p} . \end{matrix}

Firstly, we estimate

I_{1} : = E {∥ \sum_{k} ({\hat{α}}_{j_{0} k} - α_{j_{0} k}) φ_{j_{0} k} ∥}_{p} .

By Lemma 2.2 and Jensen’s inequality,

I_{1} ≲ 2^{j_{0} (\frac{1}{2} - \frac{1}{p})} E {(\sum_{k} {| {\hat{α}}_{j_{0} k} - α_{j_{0} k} |}^{p})}^{\frac{1}{p}} \leq 2^{j_{0} (\frac{1}{2} - \frac{1}{p})} {(\sum_{k} E {| {\hat{α}}_{j_{0} k} - α_{j_{0} k} |}^{p})}^{\frac{1}{p}} .

Since $f^{X} (x)$ and $φ (x)$ are compactly supported, then the number of elements in ${k : α_{j_{0} k} \neq 0}$ is $O (2^{j_{0}})$ . By Lemma 3.1, we have $E {| {\hat{α}}_{j_{0} k} - α_{j_{0} k} |}^{p} ≲ n^{- \frac{p}{2}}$ .

Therefore

I_{1} ≲ 2^{j_{0} (\frac{1}{2} - \frac{1}{p})} {(2^{j_{0}} n^{- \frac{p}{2}})}^{\frac{1}{p}} = \sqrt{n^{- 1} 2^{j_{0}}} .

Using (3.2), we have

I_{1} ≲ {\begin{matrix} {(ln n)}^{\frac{p - r}{2 r (2 N + 1)}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ n^{- \frac{N^{'}}{2 (N - 1 / r) + 1}}, & r \leq \frac{p}{2 N + 1}, \end{matrix}

(3.14)

where $N^{'} = N - \frac{1}{r} + \frac{1}{p}$ .

Next, we estimate

I_{3} : = {∥ P_{j_{1} + 1} f^{X} - f^{X} ∥}_{p} .

In reference [9], it turns out that if the scaling function $φ (x)$ is orthonormal, compactly supported and $N + 1$ regular, then the associated kernel function $K (x, y) : = \sum_{k} φ (x - k) φ (y - k)$ satisfies Conditions $H (N + 1)$ and $M (N)$ , and $K_{j} f (x) = P_{j} f (x)$ .

Since a Sobolev space and a Besov space have the following embedding theorem: ${\tilde{W}}_{r}^{N} ↪ {\tilde{B}}_{r \infty}^{N} ↪ {\tilde{B}}_{p \infty}^{N^{'}}$ , where $N^{'} = N - \frac{1}{r} + \frac{1}{p}$ , then $f^{X} \in {\tilde{B}}_{p \infty}^{N^{'}}$ . By Lemma 2.3, we have

{∥ P_{j_{1} + 1} f^{X} - f^{X} ∥}_{p} ≲ 2^{- j_{1} N^{'}} .

Taking $2^{j_{1}}$ as in (3.3), we have

I_{3} ≲ {\begin{matrix} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r \leq \frac{p}{2 N + 1} . \end{matrix}

(3.15)

Finally, we estimate

I_{2} : = E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} .

Using Lemma 3.3, we obtain

E {∥ \sum_{j = j_{0}}^{j_{1}} \sum_{k} ({\hat{β}}_{j k}^{*} - β_{j k}) ψ_{j k} ∥}_{p} ≲ {\begin{matrix} {(ln n)}^{c_{3}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(ln n)}^{c_{4}} {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r = \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r < \frac{p}{2 N + 1} . \end{matrix}

(3.16)

By (3.14), (3.15) and (3.16), we obtain

sup_{f^{X} \in {\tilde{W}}_{r}^{N} (A, L)} E {∥ {\hat{f}}_{n}^{X non} - f^{X} ∥}_{p} ≲ {\begin{matrix} {(ln n)}^{c_{3}} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(ln n)}^{c_{4}} {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r = \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r < \frac{p}{2 N + 1} . \end{matrix}

□

4 Optimality

Now, we discuss the optimality of the rates of convergence. Using similar techniques as those in reference [10], we can obtain the following lower bound theorem.

Theorem 4.1 Let the scaling function $φ (x)$ be orthonormal, compactly supported and $N + 1$ regular, $f^{X} \in {\tilde{W}}_{r}^{N} (A, L)$ . If there exist two positive constants $g_{1}$ and $g_{2}$ such that $g_{1} \leq g (x) \leq g_{2}$ , $x \in R$ , then for any estimator ${\hat{f}}_{n}^{X}$ , we have

inf_{{\hat{f}}_{n}^{X}} sup_{f^{X} \in {\tilde{W}}_{r}^{N} (A, L)} E {∥ {\hat{f}}_{n}^{X} - f^{X} ∥}_{p} ≳ {\begin{matrix} n^{- \frac{N}{2 N + 1}}, & r > \frac{p}{2 N + 1}, \\ {(\frac{ln n}{n})}^{\frac{N^{'}}{2 (N - 1 / r) + 1}}, & r \leq \frac{p}{2 N + 1}, \end{matrix}

where $1 \leq r, p < \infty$ , $N > \frac{1}{r}$ .

Remark The proof is very similar to that in reference [10], in which the author studied the lower bound of the convergence rates in Besov spaces for the samples without bias data.

According to Theorem 4.1, we can see that:

(i)
When $r < \frac{p}{2 N + 1}$ , our nonlinear estimator can attain the optimal rate.
(ii)
When $r = \frac{p}{2 N + 1}$ , our convergence rate and the optimal rate of convergence differ in a logarithmic. So, it is sub-optimal.
(iii)
When $r > \frac{p}{2 N + 1}$ , the logarithmic factor is an extra penalty for the chosen wavelet thresholding, our convergence rate is sub-optimal.

References

Efromovich S Springer Series in Statistics. In Nonparametric Curve Estimation. Methods, Theory, and Applications. Springer, New York; 1999.
Google Scholar
Vardi Y: Nonparametric estimation in the presence of length bias. Ann. Stat. 1982, 10(2):616–620.
Article MathSciNet MATH Google Scholar
Jones MC: Kernel density estimation for length-biased data. Biometrika 1991, 78(3):511–519.
Article MathSciNet MATH Google Scholar
Efromovich S: Density estimation for biased data. Ann. Stat. 2004, 32: 1137–1161.
Article MathSciNet MATH Google Scholar
Ramirez P, Vidakovic B: Wavelet density estimation for stratified size-biased sample. J. Stat. Plan. Inference 2010, 140(2):419–432.
Article MathSciNet MATH Google Scholar
Christophe C: Wavelet block thresholding for density estimation in the presence of bias. J. Korean Stat. Soc. 2010, 39: 43–53.
Article MathSciNet MATH Google Scholar
Kelly C, Kon MA, Rapheal LA: Local convergence for wavelet expansion. J. Funct. Anal. 1994, 126: 102–138.
Article MathSciNet Google Scholar
Triebel H: Theory of Function Spaces. Birkhäuser, Basel; 1983.
Book MATH Google Scholar
Härdle W, Kerkyacharian G, Picard D, Tsybakov A: Wavelets, Approximation and Statistical Applications. Springer, Berlin; 1997.
MATH Google Scholar
Wang HY: Convergence rates of density estimation in Besov spaces. Appl. Math. 2011, 2(10):1258–1262.
Article MathSciNet Google Scholar

Download references

Acknowledgements

This paper is supported by the National Natural Science Foundation of China (No. 11271038) and Foundation of BJUT (No. 006000542213501).

Author information

Authors and Affiliations

Department of Applied Mathematics, Beijing University of Technology, Beijing, 100124, China
Jinru Wang, Meng Wang & Yuan Zhou

Authors

Jinru Wang
View author publications
You can also search for this author in PubMed Google Scholar
Meng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jinru Wang.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

WJR participated in the sequence alignment and drafted the manuscript. WM participated in the design of the study and performed the statistical analysis. ZY conceived of the study and participated in its design and coordination. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Wang, J., Wang, M. & Zhou, Y. Nonlinear wavelet density estimation for biased data in Sobolev spaces. J Inequal Appl 2013, 308 (2013). https://doi.org/10.1186/1029-242X-2013-308

Download citation

Received: 12 January 2013
Accepted: 14 June 2013
Published: 03 July 2013
DOI: https://doi.org/10.1186/1029-242X-2013-308

Nonlinear wavelet density estimation for biased data in Sobolev spaces

Abstract

1 Introduction

2 Preliminaries

2.1 Wavelets

2.2 Sobolev space

2.3 Auxiliary lemmas

3 Main results

4 Optimality

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords