# Wavelet estimation of copula function based on censored data

## Abstract

In this paper, we consider wavelet analysis to obtain an estimator of a copula function based on censored data. We show that optimal convergence rates for the mean integrated squared error (MISE) of linear wavelet-based function estimators are exact under right censoring model. Moreover, we derive asymptotic formulae for MISE. Finally, the simulation results and the analysis of real data validate the proposed procedure.

## 1 Introduction

Recently, copulas and their applications in statistics have become a rather popular phenomenon. For a long time, statisticians have been interested in the relationship between a multivariate distribution function and its lower-dimensional margins. When it comes to analyzing the dependence between random variables, the copula function becomes a very useful tool. According to Sklar’s theorem , if H is a bivariate distribution function with margins $$F(x)$$ and $$G(y)$$, then there exists a copula C such that $$H(x,y) = C(F(x),G(y))$$. Statisticians are interested in copulas for two reasons: (1) Inspecting how the measures of dependence are scale-free and (2) constructing families of bivariate distributions. For more information about this flexible tool and dependence modeling, we refer to  and . A detailed survey on copula models can be found in . Copula models have many applications in finance and insurance, some of which are considered in [2, 7, 10, 11, 13]. For more studies on copulas in econometrics, we refer to  and .

In recent years, the analysis of censored data has become very popular. In many sciences such as statistics, engineering, finances, and other areas, censoring is a condition in which observation is partially known, that is, censoring occurs when a value is outside the range of a measuring instrument, particularly in life-testing data in medicine, where observations often survive or fail at the end of the test (see, e.g.,  for good examples of the censoring method). Let $$T_{1},T_{2},\ldots,T_{n}$$ be i.i.d. survival times with probability density function f and common distribution function F. Typically, instead of observing the variables of interest, we are able to observe the i.i.d. censoring times $$C_{1},C_{2},\ldots,C_{n}$$. Let G be the common distribution function of the latter. Also, it is assumed that $$T_{i}$$ and $$C_{i}$$ are independent. Let $$Y_{i}= \min (T_{i},C_{i})$$ for $$i=1,\ldots,n$$, and define the indicator function

$$\delta _{i}= \textstyle\begin{cases} 1 & \text{if } T_{i}\leq C_{i}, \\ 0 & \text{if } T_{i} > C_{i}. \end{cases}$$

This definition denotes the right-censoring. Several approaches have been proposed to deal with the estimation of copulas under uncensored or censored data. See  for nonparametric estimation and also . In  consistent estimators are considered under some restrictions on the dependence structure of the censored data. For right-censored data, less work has been done on nonparametric copula estimators. Recently, in  a new class of nonparametric estimators of copula function for bivariate censoring is described. The aim of this paper is to estimate the copula function via wavelets based on censoring.

The theory of wavelets and their applications in statistics and other sciences have become an important technique. We can find several applications of wavelet estimators for copula functions in different contexts. In  smoothed empirical copula estimators considered using ranks and wavelets. See  for more information about the procedure to estimate copula functions using wavelets, where the problem of multivariate copula density estimation by wavelet methods has been described. In  the wavelet approaches were used, basically following the same route as in . In  multiwavelets for estimating copula density were used, and in  the same work was prepared based on a Legendre multiwavelets. There are many different papers that considered wavelets, and we proposed some of the most important.

Currently, an extension of wavelets and copulas is increasingly popular as an alternative to many methods such as those briefly discussed. In this paper, we propose a linear wavelet-based estimation for copulas with right censored data in the observation $$T_{1}$$ or $$T_{2}$$. Let $$(T_{11},T_{21}),\ldots,(T_{1n},T_{2n})$$ be independent and have a joint distribution function (d.f.) $$F(t_{1},t_{2})$$. As usual, assume that $$T_{1}$$ and $$C_{1}$$ are independent, where $$C_{1}$$ is the censoring variable associated with the variable $$T_{1}$$. Rather than observing the variables of interest $$(T_{11},T_{21}),\ldots,(T_{1n},T_{2n})$$, in the randomly right censor model, $$(Y_{i},\delta _{i},T_{2i})$$ is observed. We apply the method of , which provides a MISE expansion similar to a density function over $$(-\infty ,T]$$ for any fixed $$T< \tau _{H(\cdot ,t)}$$, where $$\tau _{H(\cdot ,t)} = \inf \{u: H(u,t) = 1\} \leq \infty$$ is the least upper bound for the support of $$H(\cdot ,t)$$, the distribution function of $$(Z_{1},t)$$.

The rest of the paper is as follows. In Sect. 2, we define the elements of copula density and wavelet transform and provide the wavelet estimators for the copula based on the censored data. The main results are described in Sect. 3, and the simulation study for the proposed estimator is provided in Sect. 4. The proofs are presented in the Appendix.

## 2 Preliminary notations

In this section, we provide some necessary concepts about the general framework of this paper. Section 2.1 presents preliminary assumptions and some notations about copula functions. All the major definitions and facts about the wavelets used in the paper are presented in Sect. 2.2. At the end of the section we introduce the estimators for the main result.

### 2.1 Copula function

Copula had been first established in  and improved very fast in many spaces. Here we give a brief definition of a copula function in the two-dimensional case. Any extension of the results to higher dimensions is straightforward. Consider a random vector $$(T_{1},T_{2})$$ with cumulative distribution function $$F(t_{1},t_{2})= P(T_{1}\leq t_{1},T_{2}\leq t_{2})$$ and margin distribution functions $$F_{1}$$ and $$F_{2}$$. The relation between these variables is of interest, so the copula function C can be formed as follows:

\begin{aligned} F(t_{1},t_{2})= C\bigl(F_{1}(t_{1}),F_{2}(t_{2}) \bigr). \end{aligned}

The marginals $$F_{1}$$ and $$F_{2}$$ have uniform distribution on (0,1), and if they are continuous, then C is unique and coincides with the distribution function of the pair $$(U,V)=(F_{1}(T_{1}),F_{2}(T_{2}))$$. In practice, F is unknown. The advantage of using copula is that the joint distribution function ($$F(t_{1},t_{2})$$) can be constructed by using marginals ($$F_{1}$$ and $$F_{2}$$) when they are from different classes of distributions. Let $$(T_{11},T_{21}),\ldots,(T_{1n},T_{2n})$$ be a random sample from the unknown distribution F. Denote by $$F_{1n}$$ and $$F_{2n}$$ the empirical distributions associated with $$F_{1}$$ and $$F_{2}$$. A first step in selecting an appropriate class of copulas consists of plotting the pairs $$(\frac{R_{i}}{n}, \frac{S_{i}}{n}) = (F_{1n}(T_{1i}),F_{2n}(T_{2i}))$$, $$i=\{1,\ldots,n\}$$.

Here $$R_{i}$$ is the rank of $$T_{1i}$$ among $$T_{11},\ldots,T_{1n}$$, and $$S_{i}$$ is the rank of $$T_{2i}$$ among $$T_{21},\ldots,T_{2n}$$. The motivation behind this approach is that the pseudo-observations $$(R_{i}/n,S_{i}/n)$$ are close substitutes to the unobservable pairs $$(U_{i},V_{i})= (F_{1} (T_{1i}),F_{2} (T_{2i}))$$ forming a random sample from C. We denote by $$c(u,v)$$ the density of $$C(u,v)$$,

\begin{aligned} c(u,v)= \frac{\partial ^{2} C(u,v)}{\partial u \partial v}, \quad u,v\in (0,1). \end{aligned}

It is obvious that in real analysis one of our variables (or even both of them) may be subject to be censored, and one observes a minimum between it and another (censoring) random variable, denoted by $$C_{j}$$, so $$Y_{j} = \min (T_{j},C_{j})$$ and $$\delta _{j} = I_{T_{j} \leq C_{j}}$$ for $$j=1,2$$. The i.i.d. replication vector $$(Y_{1i},Y _{2i},\delta _{1i},\delta _{2i})_{1\leq i\leq n}$$ denotes the random sample of $$(Y_{1},Y_{2},\delta _{1},\delta _{2})$$. Clearly, many different estimators can be considered for a distribution function based on censoring. The one used in this paper has the form

\begin{aligned} \hat{F}_{n}(t_{1},t_{2}) = \frac{1}{n} \sum _{i=1}^{n} W_{in}^{(\cdot )} I (Y _{1i}\leq t_{1},T_{2i} \leq t_{2}), \end{aligned}

where $$W_{in}$$ are random weights designed to compensate asymptotically the bias caused by censoring and can be assumed in three different forms; see . The weight considered in the present paper is

\begin{aligned} W_{in} = \frac{\delta _{1i}}{1-\hat{G}(Y_{1i}^{-})} . \end{aligned}

In this form only $$T_{1}$$ is assumed to be censored, and then $$Y_{2} = T_{2}$$, $$\delta _{2} = 1$$ a.s., and $$C_{1}$$ is independent from $$T_{1}$$. Also Ĝ, the Kaplan–Meier estimator of the censoring variable, is defined as $$\hat{G}(t) = 1- \prod_{i:Y_{1i}\leq t} (1- \frac{1}{ \sum_{j=1}^{n}I(Y_{1j}\geq Y_{1i})})^{1-\delta _{1i}}$$. Introducing $$G(t)= P(C_{1}\leq t)$$, the weight $$W_{in}$$ can be seen as an approximation of $$W_{i} = \frac{\delta _{1i}}{1-G(Y_{1i}^{-})}$$. The other weights are discussed in . They took $$W_{in}= \delta _{1i} \hat{g}(Y_{1i})$$, where ĝ is a consistent estimator of limit function g, estimated from the data, where g satisfies the condition $$E[\delta _{1} g(Y_{1}) \phi (Y_{1},T_{2})] = E[ \phi (T_{1},T_{2})]$$ for all $$\phi \in L^{1}$$.

### 2.2 Wavelets

Wavelets and their applications are still an important subject in statistics. The term wavelet is used to refer to a set of orthonormal basis functions generated by dilation and translation of a compactly supported scaling function (father wavelet) ϕ and a mother wavelet ψ associated with an r-regular ($$r > 0$$) multiresolution analysis of $$L_{2}(\mathbb{R})$$, the space of square-integrable functions on the line. Define $$\phi _{j,k}(x)= 2^{j/2} \phi (2^{j}x-k)$$ and $$\psi _{j,k}(x)= 2^{j/2} \psi (2^{j}x-k)$$ for $$j\in \mathbb{N}$$ and $$k=(k_{1},k_{2})\in {\mathbb{Z}}^{2}$$. It is assumed that $$j\geq j _{o}$$ for some coarse scale $$j_{o}\in \mathbb{N}$$, which we take as l. We suppose that ϕ and ψ are bounded and compactly supported. For more on wavelets, see  and . The wavelet expansion for $$f(x,y)$$ can be written as

\begin{aligned} f(x,y)=f_{j_{0}}(x,y)+D_{j_{0}}f(x,y), \quad x,y \in R, \end{aligned}
(1)

where $$f_{j_{0}}(x,y)= \sum_{k \in \mathbb{Z}^{2}}\alpha _{j_{0}k} \phi _{j_{0}k}(x,y)$$ is a trend of an approximation, and

\begin{aligned} D_{j_{0}}f(x,y)= \sum_{j=j_{0}}^{\infty } \biggl(\sum_{k\in \mathbb{Z}^{2}} \beta _{j_{0}k}^{(1)} \psi _{j_{0}k}^{(1)}(x,y)+\sum_{k\in \mathbb{Z} ^{2}} \beta _{j_{0}k}^{(2)}\psi _{j_{0}k}^{(2)}(x,y)+ \sum _{k\in \mathbb{Z}^{2}}\beta _{j_{0}k}^{(3)}\psi _{j_{0}k}^{(3)}(x,y)\biggr). \end{aligned}

For more details about $$D_{j_{0}} f(x,y)$$ and the functions $$\psi _{j_{0}k}$$, see . The coefficients $$\alpha _{j _{0} k}$$ and $$\beta _{j_{0} k}^{(1)}$$, $$\beta _{j_{0} k}^{(2)}$$, $$\beta _{j_{0} k}^{(3)}$$ are unique for every $$j_{0} \in \mathbb{N}$$. The function $$\phi _{j_{0} k}$$ is defined as $$\phi _{j_{0} k_{1} k_{2}} (x,y) = \phi _{j_{0} k_{1}} (x)\phi _{j_{0} k_{2}} (y)$$.

Some important cases of wavelets are the Haar, Daubechies, Shannon, Meyer, and Morlet wavelets (see ). We used the Haar and Daubechies wavelets in our simulation studies. Accordingly, by equation (1) the copula density c can be expanded with

\begin{aligned} \alpha _{j_{0}k} = \int _{{(0,1)}^{2}} c(u,v) \phi _{j_{0}k}(u,v) \,du \,dv, \quad k \in \mathbb{Z}^{2}. \end{aligned}

By the change of variables $$u=F_{1}(t_{1})$$ and $$v=F_{2}(t_{2})$$ we get

\begin{aligned} \alpha _{j_{0}k} = \int \phi _{j_{0}k}\bigl(F_{1}(t_{1}),F_{2}(t_{2}) \bigr) f(t _{1},t_{2}) \,dt_{1} \,dt_{2} = E_{f} \bigl(\phi _{j_{0}k}\bigl(F_{1}(T_{1}),F_{2}(T _{2})\bigr)\bigr). \end{aligned}
(2)

Assuming that $$I=I(Y_{1i}\leq T,T_{2i}\leq T)$$, when $$F_{1}$$ and $$F_{2}$$ (the marginal distributions) are known, a moment-based estimator of $$\alpha _{j_{0}k}$$ based on censored data is then given by

\begin{aligned} \hat{\alpha }_{j_{0} k} =& \int I \phi _{j_{0}k} \bigl(F_{1}(y_{1}),F_{2}(t _{2})\bigr) \hat{F}(dy_{1},\,dt_{2}) \\ =& \frac{1}{n} \sum_{i=1}^{n} W_{in}^{(\cdot )} I(Y_{1i}\leq t_{1},T _{2i}\leq t_{2}) \phi _{j_{0}k} \bigl(F_{1}(Y_{1i}),F_{2}(T_{2i}) \bigr). \end{aligned}
(3)

Then the wavelet-based estimator of c is given by

$$\hat{c_{j_{0}}}(u,v) = \sum_{k\in \mathbb{Z}^{2}} \hat{\alpha }_{j _{0}k} \phi _{j_{0}k}(u,v),\quad u,v\in (0,1).$$

When $$F_{1}$$ and $$F_{2}$$ are unknown, their empirical distribution functions $$F_{1n}$$ and $$F_{2n}$$ are used. So the rank-based estimator is as follows:

\begin{aligned} \tilde{\alpha }_{j_{0}k} =& \int I \phi _{j_{0}k} \bigl(F_{1n}(y_{1}),F _{2n}(t_{2})\bigr) \hat{F}(dy_{1}, \,dt_{2}) \\ =& \frac{1}{n} \sum_{i=1}^{n} W_{in}^{(\cdot )} I(Y_{1i}\leq t_{1},T_{2i} \leq t_{2}) \phi _{j_{0}k} \bigl(F_{1n}(Y_{1i}),F_{2n}(T_{2i}) \bigr). \end{aligned}
(4)

Now we can introduce the linear wavelet-based estimator of c based on the ranks as

\begin{aligned} \tilde{c}_{j_{0}} (u,v) = \sum _{k\in \mathbb{Z}^{2}} \tilde{\alpha } _{j_{0}k} \phi _{j_{0}k}(u,v),\quad u,v\in (0,1). \end{aligned}
(5)

When we have no censoring, the definition reduces to that of the copula estimator introduced in . We further denote by K any constant that may change from one line to another, does not depend on j, k, n, but depends on the wavelet basis and on $$\Vert c \Vert _{\infty } = \sup_{(u,v)\in (0,1)} \vert c(u,v) \vert$$ and $$\Vert c \Vert _{2}= \int c(u,v)^{2} \,du\,dv$$.

Since we deal with the wavelet method, it is very common to consider Besov spaces as functional spaces because they are characterized in terms of wavelet coefficients as follows. Besov spaces depend on three parameters $$s > 0$$, $$1< p<\infty$$, and $$1< q<\infty$$ and are denoted by $$B^{s}_{pq}$$. Let $$f\in L_{2}({\mathbb{R}}^{d})$$ (that is, $$L_{2}( {\mathbb{R}}^{2})$$ in our paper), and let $$s< r$$ (wavelet regularity). Define the sequence norm of the wavelet coefficients of a function $$f\in B^{s}_{pq}$$ by

\begin{aligned} \vert f_{B^{s}_{pq}} \vert = {\biggl(\sum_{k\in \mathbb{Z}^{2}} \vert \alpha _{j_{0}k} \vert ^{p}\biggr)} ^{1/p}+ { \biggl(\sum_{j\geq j_{0}}\biggl[2^{j_{0}(s+d(1/2-1/p))} \biggl( \sum _{k\in \mathbb{Z}^{2}} \vert \beta _{j,k} \vert ^{p}\biggr)^{1/p}\biggr]^{q}\biggr)}^{1/q}, \end{aligned}

where $$( \vert \beta _{j,k} \vert ^{p})^{1/p} = (\sum_{k\in \mathbb{Z}^{2}} \sum_{\epsilon \in S_{2}} \vert \beta _{j,k}^{\epsilon } \vert ^{p})^{1/p}$$. We assume that the copula function c belongs to a Besov space.

## 3 Main results

In this section, we present the main results. The following theorems show that the wavelet-based estimators based on censoring attain nearly optimal convergence rates over a large range of Besov function classes. We also show that our estimator obtains the optimal convergence rates under mean integrated squared error (MISE) by accepting some mild conditions. The mean integrated squared error (MISE) is defined by

\begin{aligned} \operatorname{MISE}(\tilde{c}_{j_{0}},c) = E_{f}\biggl[ \int _{0}^{1} \int _{0}^{1} {\bigl\{ \tilde{c}_{j_{0}}(u,v)-c(u,v) \bigr\} }^{2} \,dv\,du\biggr]. \end{aligned}

In view of decomposition (1) for c, it is obvious that

\begin{aligned} \operatorname{MISE}(\tilde{c}_{j_{0}},c) = \operatorname{MISE}( \tilde{c}_{j_{0}},c_{j_{0}})+ \int _{0}^{1} \int _{0}^{1} {\bigl\{ D_{j_{0}} c(u,v)\bigr\} }^{2} \,dv\,du . \end{aligned}

The bias term in the equation can be bounded as in the proof of Lemma 1 in ,

\begin{aligned} \int _{0}^{1} \int _{0}^{1} {\bigl\{ D_{j_{0}} c(u,v)\bigr\} }^{2} \,dv\,du \leq M 2^{-2j _{0} s^{*}}. \end{aligned}

Precisely, suppose that c belongs to the ball of radius $$M >0$$ in the Besov space $$B_{p,q}^{s}(M)$$, where the parameters $$s >0$$ and $$p\geq 2$$, $$1\leq q\leq \infty$$ or $$s > 2/p -1$$ and $$p\in [1,2]$$, $$1 \leq q\leq \infty$$; also, $$s^{*}$$ can be defined as $$s^{*}=s+1-2/p$$ if $$p\in [1,2]$$ and $$s^{*}=s$$ otherwise. Also, by defining $$\hat{c}_{j _{0}}$$ based on Eq. (3), we can easily show that

\begin{aligned} \operatorname{MISE}(\tilde{c}_{j_{0}},c_{j_{0}})\leq 2 \operatorname{MISE}(\tilde{c}_{j_{0}}, \hat{c}_{j_{0}}) + 2 \operatorname{MISE}(\hat{c}_{j_{0}},c_{j_{0}}). \end{aligned}

We now look for an optimal upper bound for each of the above sequences.

### Lemma 1

Letα̂be as in (3), and let $$j_{0}$$be an arbitrary integer in $$\mathbb{N}$$. Then

\begin{aligned} S_{1} \equiv E\sum_{k\in z^{2}} (\hat{\alpha }_{j_{0} k}-\alpha _{j _{0} k})^{2} \leq K_{1} \frac{ 2^{2j_{0}}}{n} \end{aligned}

for some constant $$K_{1} > 0$$depending only onϕand either

\begin{aligned} \Vert c \Vert _{2} = \int c(u,v)^{2} \,du \,dv \quad \textit{or} \quad \Vert c \Vert _{\infty } = \sup_{u,v\in (0,1)} \bigl\vert c(u,v) \bigr\vert . \end{aligned}

Also note that the proposed linear wavelet estimator of $$c_{j_{0}}$$ is $$\hat{c}_{j_{0}}(x,y) = \sum_{k \in \mathbb{Z}^{2}}\hat{\alpha }_{j _{0}k} \phi _{j_{0}k}(x,y)$$. Then

\begin{aligned} \operatorname{MISE}(\hat{c}_{j_{0}},c_{j_{0}}) =O\biggl( \frac{2^{2j_{0}}}{n} \biggr). \end{aligned}

Note that $$j_{0}$$ must be chosen so that $$2^{j_{0}}\ll \sqrt{n}$$.

### Proof of Lemma 1

The proof is similar to optimality results in Sect. 5 of . □

### Theorem 1

Assume that the functionϕis m-differentiable, and let $$\tilde{c}_{j0}$$be the copula density estimator ofcdefined in (5). Then there exists a constant $$K_{1} > 0$$such that, for a given $$(u,v) \in {(0,1)}^{2}$$and any level $$j_{0}$$satisfying $$2^{j_{0}} \simeq {(n/\log n)}^{1/2-1/{2m}}$$,

$$\operatorname{MISE}(\tilde{c}_{j_{0}},\hat{c}_{j_{0}}) \leq K_{1} \frac{2^{2j_{0}}}{n} \biggl(2^{2j_{0}} \frac{\log n}{n}+2^{-j_{0}} \log n \biggr).$$

The error term associated with the use of ranks is negligible with respect to the usual error term as soon as $$2^{j_{0}}\gg \log (n)$$. For the proof, see the Appendix.

Now, combining the results of Lemma 1 and Theorem 1, we are in a position to express the main theorem of this section.

### Theorem 2

Letϕbe a scaling function mentioned in Sect2.2havingmderivatives with $$m>1+1/s^{*}$$, and for arbitrary $$j_{0}\in \mathbb{N}$$, let $$\tilde{c}_{j_{0}}$$be the estimator in (5). Then there exists a constant $$K_{2}>0$$such that, for all $$M\in (0, \infty )$$, $$s>2/p-1$$, and $$p,q\in [1,\infty )$$, if $$j_{0}$$satisfies $$2^{j_{0}} \simeq n^{1/2+2s^{*}}$$, then

\begin{aligned} \sup_{c\in B_{p,q}^{s}(M)} \operatorname{MISE}(\tilde{c}_{j_{0}},c) \leq K_{2} n^{-s ^{*}/1+s^{*}}. \end{aligned}

Note that the procedure of estimation described is optimal on the Besov space. The simulation studies are applied in the next section. For the proof, see the Appendix.

### Remark 1

This study can be regarded as an extension from complete data to randomly right censored data. If we assume that there is no censoring, that is, $$G\equiv 0$$ on $$(-\infty ,+\infty )$$, then $$\delta _{1i}=1$$ for all $$i=1,2,\ldots,n$$, and the estimators defined by (4) and (3) are the same as those of  for the complete data case. Therefore our estimators can be regarded as an extension of those of  from complete data to randomly right censored data.

## 4 Simulation studies and analysis for real data

In this section, we conduct simulation studies to investigate the performance of the proposed estimator (5) for censored data and compare it with the kernel estimator by an average squared error. Wavelet estimators for the copula, for noncensored data, have been proposed in .

### 4.1 Simulation study

The simulation scheme consisted in three steps.

Step 1: We simulate a random sample of size $$n=2000$$ to display some scatter and rank–rank plots according to the following scheme.

1. (i)

The marginal distribution of random variables $$T_{1}$$ and $$T_{2}$$ ($$F_{1}$$ and $$F_{2}$$) are simulated from three distributions: (a) $$F_{1}$$ is the B distribution with shape parameter $$\alpha _{1}=0.5$$ and scale parameter $$\alpha _{2}=0.7$$, $$F_{2}$$ is the exponential distribution with mean $$1/3$$; (b) $$F_{1}$$ is $$N (0,1)$$, and $$F_{2}$$ is as the same as in part (a); and (c) both $$F_{1}$$ and $$F_{2}$$ are uniform distributions.

2. (ii)

For the dependence structure, we consider two copula families, Gumbel and Clayton with Kendall’s tau $$\tau =0.5$$.

3. (iii)

Censoring variables were simulated from $$\operatorname{Exp}(\nu (x))$$ where $$\nu (x) = x+0.7$$ with results in approximately 0.40 censoring.

The plots are shown in Figs. 1 and 2. In both figures the scatter plots in the top row show what happens when $$F_{2}$$ is exponential with mean $$1/3$$ and $$F_{1}$$ is either $$B (0.5,0.7)$$ or $$N (0,1)$$. In the bottom row the left one shows the same scatter plot, but the margins are considered as $$U (0,1)$$, and the right one shows the pairs of normalized ranks. It is worth mentioning that Fig. 1 displays scatter plots generated from the Gumbel copula, and Fig. 2 displays the same ones from the Clayton copula. As the number of observations is increased, the rank–rank plots are more unreadable. It is clear that for a random sample of size $$n=2000$$ or more, the square in these plots could be completed filled, and all features of distribution had been lost. As a suggestion, we would consider the plots of the empirical copula function of the pairs $$(R_{i}/n,S_{i}/n)$$. For the same data as in panel (d, bottom right) in Figs. 1 and 2, 3d-histograms of the relative frequencies of the pseudo-observations $$(R_{i}/n,S_{i}/n)$$ are illustrated in Fig. 3. The plots show the relative frequency of $$n=2000$$ pairs $$(R_{i}/n,S_{i}/n)$$ in a $$32 \times 32$$ regular partitions of the unit square for Gumbel (left column) and Clayton (right column) copulas in two parts, full data (top row) and censored data (bottom row) with Kendall’s tau $$\tau =1/2$$.

Step 2: Marginal distribution of $$T_{1}$$ and $$T_{2}$$ are simulated from $$B(0.5,0.7)$$ and $$\operatorname{Exp}(3)$$, respectively. Then for $$N=2^{J}\leq \sqrt{(}n) < 2^{J+1}$$, the empirical scaling coefficients at resolution level J are computed as follows:

$$\tilde{\alpha }_{Jk_{1}k_{2}} = \frac{1}{n} \sum _{i=1}^{n} w_{in} I\biggl\{ \frac{k _{1}-1}{N} < \frac{R_{i}}{n}\leq \frac{k_{1}}{N} , \frac{k_{2}-1}{N} < \frac{S_{i}}{n}\leq \frac{k_{2}}{N}\biggr\}$$

for all $$k_{1},k_{2} \in \{1,\ldots,N\}$$. Finally, we obtain the wavelet estimators for Gumbel and Clayton copulas (equation (5)) and consider their plots at levels $$J-1$$ and $$J-2$$. The graphs are in Fig. 4.

Step 3: We find the average squared error (ASE) calculated for several wavelet copula estimators, including (5) for different sample sizes n. The results for ASE and the total number of replications $$N=100$$ for two different dependence copula parameters and for wavelet and kernel methods are shown in Table 1. The ASE criterion is defined as

$$\mathit{ASE}= \frac{1}{N}\sum_{l=1}^{N} \Biggl(\frac{1}{n^{2}}\sum_{j=1}^{n}\sum _{i=1} ^{n} {\bigl({\tilde{c}}^{(l)}(u_{i},v_{j})-c(u_{i},v_{j}) \bigr)}^{2}\Biggr),$$

where $${\tilde{c}}^{(l)}$$ denotes an estimator of c at the lth replication. In this simulation study, we have used Daubechies’s compactly supported wavelet symmlet8 and level $$j_{0}=5$$.

The codes are in MATLAB environment using the Wavelab software. Our simulation study is done for sampling in three sizes $$n=100$$, $$n=300$$, and $$n=500$$. The main result in the table is consistent with the conclusions of . In both of these examples the simulation results show that the wavelet estimator performs better than kernel estimators in terms of AMSE criterion. We can do the same work for some other copula functions like Gaussian, Frank, and Student copulas or for other wavelets such as Haar or Adelson wavelets.

### 4.2 Real data

Here we present an application of the proposed methodology for the data of , where the first observation is censored. The data consist of the indemnity payment (LOSS) and the allocated loss adjustment expense (ALAE) for 1500 general liability claims. The graphical representation of this data is considered in Fig. 5. Panels (a) and (b) show logarithm scale of the original data and the rank–rank plot, respectively. In panel (c) the 3D-histogram of Loss (censored) and Alae data is shown based on $$16 \times 16$$ grid. Panel (d) shows the wavelet-based estimator described in Sect. 2.

Due to our simulation results, the nonparametric procedure tries to provide a smooth copula estimate, and the wavelet-based estimator is a good estimator under the fact that a considerable copula family is Gumbel. Many authors who used this data in their researches claimed that the best representation of Loss and Alae data is the Gumbel copula.

## 5 Conclusions

Here we consider wavelet-based identification and estimation of a censored copula density function under a rank-based producer. We proposed a linear wavelet estimator and provided its asymptotic formulae for mean integrated square error. The wavelet methods offer fast computations and easy updating in addition to being easily adapted to the design. We derived an analog of the asymptotic formula of the mean integrated square error in the context of kernel density estimators for censored data, admitting an expansion with distinct squared bias and variance components.

The numerical performance of the proposed linear wavelet density estimators was illustrated on simulated datasets. Comparisons between full data and censored data for some different sample sizes were also given. Although using wavelet-based estimator of a copula function is very useful for underlying dependence structure, it does not cover the main conditions of parametric models. In future work we might also consider using a nonlinear wavelet-based copula density estimator for randomly censored data or using for goodness-of-fit testing.

## References

1. Autin, F., Lepennec, E., Tribouley, K.: Thresholding methods to estimate the copula density. J. Multivar. Anal. 101, 200–222 (2010). http://doi.org/10.1016/j.jmva.2009.07.009

2. Bouye, E., Durrleman, V., Nikeghbali, A., Riboulet, G., Roncalli, T.: Copulas for finance: a reading guide and some applications. In: Groupe de Recherche Operationelle. Credit Lyonnais, Paris (2000)

3. Bouye, E., Durrleman, V., Nikeghbali, A., Riboulet, G., Roncalli, T.: Copulas for finance—A reading guide and some applications. Social Science Research Network Working Paper Series (2007). https://doi.org/10.2139/ssrn.1032533

4. Butucea, C., Tribouley, K.: Nonparametric homogeneity tests. J. Stat. Plan. Inference 136, 597–639 (2006). http://doi.org/10.1016/j.jspi.2004.08.003

5. Chatrabgoun, O., Parham, G.: Copula density estimation using multiwavelets based on the multiresolution analysis. Commun. Stat., Simul. Comput. 45, 3350–3372 (2014)

6. Chatrabgoun, O., Parham, G., Chinipardaz, R.: A Legendre multiwavelets approach to copula density estimation. Stat. Pap. 58, 673–690 (2017). http://doi.org/10.1007/s00362-015-0720-0

7. Cherubini, U., Luciano, E., Vecchiato, W.: Copula Methods in Finance. Wiley Finance Series. Wiley, Chichester (2004)

8. Daubechies, I.: Ten Lectures on Wavelets. SIAM, Philadelphia (1992)

9. Deheuvels, P.: La fonction de dépendance empirique et ses propriétés: Un test non paramétrique d’indépendance. Bull. Cl. Sci., Acad. R. Belg. (5) 65, 274–292 (1979)

10. Embrechts, P., Kluppelberg, C., Mikosch, T.: Modeling Extremal Events for Insurance and Finance. Springer, Berlin (1997)

11. Fermanian, J.-D., Wegkamp, M.: Time dependent copulas. Preprint (2004). http://doi.org/10.1.1.332.6192

12. Fleming, T.R., Harrington, D.P.: Counting Processes and Survival Analysis. Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics. Wiley, New York (1991). http://doi.org/10.1002/9781118150672

13. Frees, E., Valdez, E.A.: Understanding relationships using copulas. N. Am. Actuar. J. 2, 1–25 (1998). http://doi.org/10.1080/10920277.1998.10595667

14. Genest, C., Masiello, E., Tribouley, K.: Estimating copula densities through wavelets. Insur. Math. Econ. 44, 170–181 (2009). http://doi.org/10.1016/j.insmatheco.2008.07.006

15. Gribkova, S., Lopez, O.: Non-parametric copula estimation under bivariate censoring. Scand. J. Stat. 42, 925–946 (2015). http://doi.org/10.1111/sjos.12144

16. Joe, H.: Multivariate Models and Dependence Concepts. Chapman & Hall, London (1997). https://doi.org/10.1002/(SICI)1097-0258(19980930)17:18<2154::AID-SIM913>3.0.CO;2-R

17. Li, L.: Non-linear wavelet-based density estimators under random censorship. J. Stat. Plan. Inference 117, 35–58 (2003). http://doi.org/10.1016/so378-3758(02)00366-x

18. Meyer, Y.: Wavelets: Algorithms and Applications. SIAM, Philadelphia (1993). http://doi.org/10.1137/1036136

19. Morettin, P.A., Toloi, C.M.C., Chiann, C., de Miranda, J.C.S.: Wavelet smoothed empirical copula estimators. Braz. Rev. Finance 8, 263–281 (2010)

20. Nelsen, R.B.: An Introduction to Copulas, 2nd edn. Springer, New York (2006)

21. Patton, A.J.: A review of copula models for economic time series. J. Multivar. Anal. 110, 4–18 (2012). http://doi.org/10.1016/j.jmva.2012.02.021

22. Sklar, A.: Fonctions de répartition à n dimensions et leurs marges. Publ. Inst. Stat. Univ. Paris 8, 229–231 (1959)

23. Van der Laan, M.J.: Efficient estimation in the bivariate censoring model and repairing NPMLE. Ann. Stat. 24(2), 596–627 (1996). http://jstor.org/stable/2242663

24. Wang, W., Wells, M.T.: Nonparametric estimators of the bivariate survival function under simplified censoring conditions. Biometrika 84(4), 863–880 (1997). http://doi.org/10.1093/biomet/84.4.863

## Availability of data and materials

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Not applicable.

## Author information

Authors

### Contributions

This work was carried out in collaboration among all authors. All authors read and approved the final manuscript.

## Ethics declarations

### Competing interests

The authors declare that they have no competing interests.

## Appendix:  Proofs

### 1.1 A.1 Proof of Theorem 1

The proofs follow along with the lines of that in  for the density function. Compared to the uncensored case, the main difficulty here is to handle the weights $$W_{in}$$. For this purpose, we use some assumptions of . This is the direct approach to the copula function estimation problem under censoring. To complete the proof, we need some preparations. Suppose that Assumption 4 in  is satisfied. Here this assumption for our results is as follows.

### Assumption 1

Assume that $$E{(\frac{\delta _{1}}{1-G})}^{2} =E{(\delta _{1} g(T_{1}))} ^{2} < \infty$$, and assume that there exist i.i.d. random variables ($$Z_{i}$$) such that $$\sup \vert W_{in}-W_{i} \vert \leq B_{n} Z_{i}$$, where $$B_{n}=\sup \vert \frac{\hat{G}-G}{1-\hat{G}} \vert = O(n^{-1/2})$$ and $$E[Z_{i}]=E[\frac{\delta _{1}}{1-G(Y_{1i})}] <\infty$$.

Also, suppose that the function ϕ defined in Sect. 2.2 is m-differentiable. We define

$$\xi _{k} (Y_{1i},T_{2i}) = \bigl(\phi _{j_{0} k} \bigl(F_{1n}(Y_{1i}),F_{2n}(T _{2i})\bigr) - \phi _{j_{0} k} \bigl(F_{1}(Y_{1i}),F_{2}(T_{2i}) \bigr)\bigr)I_{(Y_{1i} \leq A_{1},Y_{2i}\leq A_{2})}.$$

In addition, $$\tilde{\alpha }_{j_{0} k} - \hat{\alpha }_{j_{0} k} = \frac{1}{n} \sum_{i=1}^{n} W_{in} \xi _{k} (Y_{1i},T_{2i})$$. So we obtain

\begin{aligned} E\biggl(\sum_{k} (\tilde{\alpha }_{j_{0} k} - \hat{\alpha }_{j_{0} k})^{2}\biggr) =& \sum _{k} E \Biggl(\frac{1}{n} \sum _{i=1}^{n} W_{in} \xi _{k} (Y_{1i},T _{2i})\Biggr)^{2} \\ =&\sum_{k} E \Biggl(\frac{1}{n} \sum _{i=1}^{n} W_{i} \xi _{k} (Y_{1i},T_{2i})+ \frac{1}{n} \sum _{i=1}^{n} (W_{in}-W_{i}) \xi _{k} (Y_{1i},T_{2i})\Biggr)^{2} \\ \leq & 2 \Biggl(\sum_{k} E \Biggl( \frac{1}{n} \sum_{i=1}^{n} W_{i} \xi _{k} (Y _{1i},T_{2i}) \Biggr)^{2} \\ &{}+\sum_{k} E \Biggl( \frac{1}{n} \sum_{i=1}^{n} (W_{in}-W _{i}) \xi _{k} (Y_{1i},T_{2i}) \Biggr)^{2} \Biggr) \\ =:& 2(T_{1}+T_{2}). \end{aligned}

First, following the proof of Proposition 1 in , we obtain a bound for $$T_{1}$$:

\begin{aligned} T_{1} \leq & \frac{1}{n^{2}} \sum_{k} \sum_{i\in I(j,0)\cup I_{1}(j,\epsilon )} E\bigl(W_{i} \xi _{k} (Y_{1i},T _{2i}) \bigr)^{2} \\ &{}+ \frac{1}{n^{2}}\sum_{k} \sum _{i\neq l\in I(j,0)\cup I_{1}(j,\epsilon )} \bigl\vert E\bigl(W_{i} W_{l} \xi _{k} (Y_{1i},T_{2i}) \xi _{k} (Y_{1l},T_{2l})\bigr) \bigr\vert +K2^{2j} n^{-\delta } E(W _{i})^{2}. \end{aligned}

Due to the last sentence in Sect. 2.1 and because of the limit function g, we get

\begin{aligned} E\bigl(W_{i} \xi _{k} (Y_{1i},T_{2i}) \bigr)^{2} = gE\bigl(\xi _{k} (T_{1i},T_{2i}) \bigr)^{2} \leq K_{1} E\bigl(\xi _{k} (T_{1i},T_{2i})\bigr)^{2} \end{aligned}

and also

\begin{aligned} E\bigl(W_{i} W_{l} \xi _{k} (Y_{1i},T_{2i}) \xi _{k} (Y_{1l},T_{2l}) \bigr) =& gE\bigl( \xi _{k} (T_{1i},T_{2i}) \xi _{k} (T_{1l},T_{2l})\bigr) \\ \leq &K_{1}E \bigl(\xi _{k} (T _{1i},T_{2i}) \xi _{k} (T_{1l},T_{2l})\bigr). \end{aligned}

Similarly, we have $$E(W_{i})^{2}= \frac{1}{1-G}=g\leq K_{1}$$.

Since the support of the scaling function is compact, we finally obtain the bound for $$T_{1}$$:

\begin{aligned} T_{1} \leq & K K_{1} \frac{1}{n^{2}} 2^{2j} \bigl(n2^{-j}\bigr) 2^{3j}\frac{ \log (n)}{n} + K K_{1} \frac{1}{n^{2}} 2^{2j} \bigl(n2^{-j} \bigr)^{2} 2^{3j}\frac{ \log (n)}{n} +K K_{1} n^{-\delta +1} \\ \leq & K \frac{2^{2j}}{n} \biggl(2^{2j} \frac{\log (n)}{n} + 2^{-j} \log (n) \biggr). \end{aligned}

In addition, according to Assumption 1, we have

\begin{aligned} E\bigl((W_{in}-W_{i})\xi _{k} (Y_{1i},T_{2i})\bigr)^{2} =& E\biggl( \frac{\delta _{1} ( \hat{G}-G)}{(1-\hat{G})(1-G)}\xi _{k} (Y_{1i},T_{2i}) \biggr)^{2} \\ =&\frac{(\hat{G}-G)^{2}}{(1-\hat{G})^{2}} E\bigl(Z_{i}\xi _{k} (T_{1i},T _{2i})\bigr)^{2} \\ \leq & Kn^{-1}E\bigl(\xi _{k} (T_{1i},T_{2i}) \bigr)^{2}. \end{aligned}

Similarly, the bound for $$(W_{in}-W_{i})\xi _{k} (Y_{1i},T_{2i})$$ follows:

\begin{aligned}& E\bigl((W_{in}-W_{i}) (W_{l n}-W_{l}) \xi _{k} (Y_{1i},T_{2i}) \xi _{k} (Y_{1l},T _{2l})\bigr) \\& \quad = \frac{(\hat{G}-G)^{2}}{(1-\hat{G})^{2}} E \bigl(Z_{i}^{2} \xi _{k} (T_{1i},T_{2i}) \xi _{k} (T_{1l},T_{2l})\bigr) \\& \quad \leq K n^{-1} E\bigl(\xi _{k} (T_{1i},T_{2i}) \xi _{k} (T_{1l},T_{2l})\bigr). \end{aligned}

It remains to find the bound for the last part of $$T_{2}$$:

$$E(W_{in}-W_{i})^{2} = E\biggl( \frac{\delta _{1} (\hat{G}-G)}{(1-\hat{G})(1-G)} \biggr)^{2} = \frac{(\hat{G}-G)^{2}}{(1- \hat{G})^{2}} E(Z_{i})^{2} \leq K n^{-1},$$

where the last equality follows from Assumption 1. At the end, the sharper bound for $$C_{2}$$ is

\begin{aligned} T_{2} \leq & \frac{1}{n^{2}} \sum_{k} \sum_{i\in I(j,0)\cup I_{1}(j,\epsilon )} \frac{(\hat{G}-G)^{2}}{(1- \hat{G})^{2}} E\bigl(Z_{i} \xi _{k} (T_{1i},T_{2i})\bigr)^{2} \\ &{}+ \frac{1}{n^{2}} \sum_{k} \sum _{i\neq l\in I(j,0)\cup I_{1}(j,\epsilon )} \frac{(\hat{G}-G)^{2}}{(1- \hat{G})^{2}} \bigl\vert E\bigl(Z_{i}^{2} \bigr) \xi _{k} (T_{1i},T_{2i}) \xi _{k} (T_{1l},T _{2l}) \bigr\vert \\ &{}+ K n^{-\delta } 2^{2j} \frac{(\hat{G}-G)^{2}}{(1-\hat{G})^{2}} E(Z_{i})^{2} \\ \leq & K K_{1} n^{-1} \frac{1}{n^{2}} 2^{2j} \bigl(n2^{-j}\bigr) 2^{3j}\frac{ \log (n)}{n} + K K_{1} n^{-1} \frac{1}{n^{2}} 2^{2j} \bigl(n2^{-j}\bigr)^{2} 2^{3j}\frac{ \log (n)}{n} +K K_{1} n^{-1} n^{-\delta +1} \\ \leq & K \frac{2^{2j}}{n^{2}} \biggl(2^{2j} \frac{\log (n)}{n}+2^{-j} \log (n)\biggr). \end{aligned}

Therefore with the bounds for $$T_{1}$$ and $$T_{2}$$ the proof of Theorem 1 is complete.

### 1.2 A.2 Proof of the Theorem 2

Combining the result of Lemma 1 and Theorem 1 and the fact that the error term associated with the use of ranks in the proof of Theorem 1 is negligible with respect to the usual error term as soon as $$2^{j_{0}} \gg \log (n)$$, the proof is complete.

## Rights and permissions 