Skip to main content

Wavelet estimation of copula function based on censored data

Abstract

In this paper, we consider wavelet analysis to obtain an estimator of a copula function based on censored data. We show that optimal convergence rates for the mean integrated squared error (MISE) of linear wavelet-based function estimators are exact under right censoring model. Moreover, we derive asymptotic formulae for MISE. Finally, the simulation results and the analysis of real data validate the proposed procedure.

1 Introduction

Recently, copulas and their applications in statistics have become a rather popular phenomenon. For a long time, statisticians have been interested in the relationship between a multivariate distribution function and its lower-dimensional margins. When it comes to analyzing the dependence between random variables, the copula function becomes a very useful tool. According to Sklar’s theorem [22], if H is a bivariate distribution function with margins \(F(x)\) and \(G(y)\), then there exists a copula C such that \(H(x,y) = C(F(x),G(y))\). Statisticians are interested in copulas for two reasons: (1) Inspecting how the measures of dependence are scale-free and (2) constructing families of bivariate distributions. For more information about this flexible tool and dependence modeling, we refer to [16] and [20]. A detailed survey on copula models can be found in [24]. Copula models have many applications in finance and insurance, some of which are considered in [2, 7, 10, 11, 13]. For more studies on copulas in econometrics, we refer to [3] and [21].

In recent years, the analysis of censored data has become very popular. In many sciences such as statistics, engineering, finances, and other areas, censoring is a condition in which observation is partially known, that is, censoring occurs when a value is outside the range of a measuring instrument, particularly in life-testing data in medicine, where observations often survive or fail at the end of the test (see, e.g., [12] for good examples of the censoring method). Let \(T_{1},T_{2},\ldots,T_{n}\) be i.i.d. survival times with probability density function f and common distribution function F. Typically, instead of observing the variables of interest, we are able to observe the i.i.d. censoring times \(C_{1},C_{2},\ldots,C_{n}\). Let G be the common distribution function of the latter. Also, it is assumed that \(T_{i}\) and \(C_{i}\) are independent. Let \(Y_{i}= \min (T_{i},C_{i})\) for \(i=1,\ldots,n\), and define the indicator function

$$ \delta _{i}= \textstyle\begin{cases} 1 & \text{if } T_{i}\leq C_{i}, \\ 0 & \text{if } T_{i} > C_{i}. \end{cases} $$

This definition denotes the right-censoring. Several approaches have been proposed to deal with the estimation of copulas under uncensored or censored data. See [9] for nonparametric estimation and also [23]. In [24] consistent estimators are considered under some restrictions on the dependence structure of the censored data. For right-censored data, less work has been done on nonparametric copula estimators. Recently, in [15] a new class of nonparametric estimators of copula function for bivariate censoring is described. The aim of this paper is to estimate the copula function via wavelets based on censoring.

The theory of wavelets and their applications in statistics and other sciences have become an important technique. We can find several applications of wavelet estimators for copula functions in different contexts. In [14] smoothed empirical copula estimators considered using ranks and wavelets. See [1] for more information about the procedure to estimate copula functions using wavelets, where the problem of multivariate copula density estimation by wavelet methods has been described. In [19] the wavelet approaches were used, basically following the same route as in [14]. In [5] multiwavelets for estimating copula density were used, and in [6] the same work was prepared based on a Legendre multiwavelets. There are many different papers that considered wavelets, and we proposed some of the most important.

Currently, an extension of wavelets and copulas is increasingly popular as an alternative to many methods such as those briefly discussed. In this paper, we propose a linear wavelet-based estimation for copulas with right censored data in the observation \(T_{1}\) or \(T_{2}\). Let \((T_{11},T_{21}),\ldots,(T_{1n},T_{2n})\) be independent and have a joint distribution function (d.f.) \(F(t_{1},t_{2})\). As usual, assume that \(T_{1}\) and \(C_{1}\) are independent, where \(C_{1}\) is the censoring variable associated with the variable \(T_{1}\). Rather than observing the variables of interest \((T_{11},T_{21}),\ldots,(T_{1n},T_{2n})\), in the randomly right censor model, \((Y_{i},\delta _{i},T_{2i})\) is observed. We apply the method of [17], which provides a MISE expansion similar to a density function over \((-\infty ,T]\) for any fixed \(T< \tau _{H(\cdot ,t)}\), where \(\tau _{H(\cdot ,t)} = \inf \{u: H(u,t) = 1\} \leq \infty \) is the least upper bound for the support of \(H(\cdot ,t)\), the distribution function of \((Z_{1},t)\).

The rest of the paper is as follows. In Sect. 2, we define the elements of copula density and wavelet transform and provide the wavelet estimators for the copula based on the censored data. The main results are described in Sect. 3, and the simulation study for the proposed estimator is provided in Sect. 4. The proofs are presented in the Appendix.

2 Preliminary notations

In this section, we provide some necessary concepts about the general framework of this paper. Section 2.1 presents preliminary assumptions and some notations about copula functions. All the major definitions and facts about the wavelets used in the paper are presented in Sect. 2.2. At the end of the section we introduce the estimators for the main result.

2.1 Copula function

Copula had been first established in [22] and improved very fast in many spaces. Here we give a brief definition of a copula function in the two-dimensional case. Any extension of the results to higher dimensions is straightforward. Consider a random vector \((T_{1},T_{2})\) with cumulative distribution function \(F(t_{1},t_{2})= P(T_{1}\leq t_{1},T_{2}\leq t_{2})\) and margin distribution functions \(F_{1}\) and \(F_{2}\). The relation between these variables is of interest, so the copula function C can be formed as follows:

$$\begin{aligned} F(t_{1},t_{2})= C\bigl(F_{1}(t_{1}),F_{2}(t_{2}) \bigr). \end{aligned}$$

The marginals \(F_{1}\) and \(F_{2}\) have uniform distribution on (0,1), and if they are continuous, then C is unique and coincides with the distribution function of the pair \((U,V)=(F_{1}(T_{1}),F_{2}(T_{2}))\). In practice, F is unknown. The advantage of using copula is that the joint distribution function (\(F(t_{1},t_{2})\)) can be constructed by using marginals (\(F_{1}\) and \(F_{2}\)) when they are from different classes of distributions. Let \((T_{11},T_{21}),\ldots,(T_{1n},T_{2n})\) be a random sample from the unknown distribution F. Denote by \(F_{1n}\) and \(F_{2n}\) the empirical distributions associated with \(F_{1}\) and \(F_{2}\). A first step in selecting an appropriate class of copulas consists of plotting the pairs \((\frac{R_{i}}{n}, \frac{S_{i}}{n}) = (F_{1n}(T_{1i}),F_{2n}(T_{2i}))\), \(i=\{1,\ldots,n\}\).

Here \(R_{i}\) is the rank of \(T_{1i}\) among \(T_{11},\ldots,T_{1n}\), and \(S_{i}\) is the rank of \(T_{2i}\) among \(T_{21},\ldots,T_{2n}\). The motivation behind this approach is that the pseudo-observations \((R_{i}/n,S_{i}/n)\) are close substitutes to the unobservable pairs \((U_{i},V_{i})= (F_{1} (T_{1i}),F_{2} (T_{2i}))\) forming a random sample from C. We denote by \(c(u,v)\) the density of \(C(u,v)\),

$$\begin{aligned} c(u,v)= \frac{\partial ^{2} C(u,v)}{\partial u \partial v}, \quad u,v\in (0,1). \end{aligned}$$

It is obvious that in real analysis one of our variables (or even both of them) may be subject to be censored, and one observes a minimum between it and another (censoring) random variable, denoted by \(C_{j}\), so \(Y_{j} = \min (T_{j},C_{j})\) and \(\delta _{j} = I_{T_{j} \leq C_{j}}\) for \(j=1,2\). The i.i.d. replication vector \((Y_{1i},Y _{2i},\delta _{1i},\delta _{2i})_{1\leq i\leq n}\) denotes the random sample of \((Y_{1},Y_{2},\delta _{1},\delta _{2})\). Clearly, many different estimators can be considered for a distribution function based on censoring. The one used in this paper has the form

$$\begin{aligned} \hat{F}_{n}(t_{1},t_{2}) = \frac{1}{n} \sum _{i=1}^{n} W_{in}^{(\cdot )} I (Y _{1i}\leq t_{1},T_{2i} \leq t_{2}), \end{aligned}$$

where \(W_{in}\) are random weights designed to compensate asymptotically the bias caused by censoring and can be assumed in three different forms; see [15]. The weight considered in the present paper is

$$\begin{aligned} W_{in} = \frac{\delta _{1i}}{1-\hat{G}(Y_{1i}^{-})} . \end{aligned}$$

In this form only \(T_{1}\) is assumed to be censored, and then \(Y_{2} = T_{2}\), \(\delta _{2} = 1\) a.s., and \(C_{1}\) is independent from \(T_{1}\). Also Ĝ, the Kaplan–Meier estimator of the censoring variable, is defined as \(\hat{G}(t) = 1- \prod_{i:Y_{1i}\leq t} (1- \frac{1}{ \sum_{j=1}^{n}I(Y_{1j}\geq Y_{1i})})^{1-\delta _{1i}}\). Introducing \(G(t)= P(C_{1}\leq t)\), the weight \(W_{in}\) can be seen as an approximation of \(W_{i} = \frac{\delta _{1i}}{1-G(Y_{1i}^{-})} \). The other weights are discussed in [15]. They took \(W_{in}= \delta _{1i} \hat{g}(Y_{1i})\), where ĝ is a consistent estimator of limit function g, estimated from the data, where g satisfies the condition \(E[\delta _{1} g(Y_{1}) \phi (Y_{1},T_{2})] = E[ \phi (T_{1},T_{2})]\) for all \(\phi \in L^{1}\).

2.2 Wavelets

Wavelets and their applications are still an important subject in statistics. The term wavelet is used to refer to a set of orthonormal basis functions generated by dilation and translation of a compactly supported scaling function (father wavelet) ϕ and a mother wavelet ψ associated with an r-regular (\(r > 0\)) multiresolution analysis of \(L_{2}(\mathbb{R})\), the space of square-integrable functions on the line. Define \(\phi _{j,k}(x)= 2^{j/2} \phi (2^{j}x-k)\) and \(\psi _{j,k}(x)= 2^{j/2} \psi (2^{j}x-k)\) for \(j\in \mathbb{N}\) and \(k=(k_{1},k_{2})\in {\mathbb{Z}}^{2}\). It is assumed that \(j\geq j _{o}\) for some coarse scale \(j_{o}\in \mathbb{N}\), which we take as l. We suppose that ϕ and ψ are bounded and compactly supported. For more on wavelets, see [8] and [18]. The wavelet expansion for \(f(x,y)\) can be written as

$$\begin{aligned} f(x,y)=f_{j_{0}}(x,y)+D_{j_{0}}f(x,y), \quad x,y \in R, \end{aligned}$$
(1)

where \(f_{j_{0}}(x,y)= \sum_{k \in \mathbb{Z}^{2}}\alpha _{j_{0}k} \phi _{j_{0}k}(x,y)\) is a trend of an approximation, and

$$\begin{aligned} D_{j_{0}}f(x,y)= \sum_{j=j_{0}}^{\infty } \biggl(\sum_{k\in \mathbb{Z}^{2}} \beta _{j_{0}k}^{(1)} \psi _{j_{0}k}^{(1)}(x,y)+\sum_{k\in \mathbb{Z} ^{2}} \beta _{j_{0}k}^{(2)}\psi _{j_{0}k}^{(2)}(x,y)+ \sum _{k\in \mathbb{Z}^{2}}\beta _{j_{0}k}^{(3)}\psi _{j_{0}k}^{(3)}(x,y)\biggr). \end{aligned}$$

For more details about \(D_{j_{0}} f(x,y)\) and the functions \(\psi _{j_{0}k}\), see [14]. The coefficients \(\alpha _{j _{0} k}\) and \(\beta _{j_{0} k}^{(1)}\), \(\beta _{j_{0} k}^{(2)}\), \(\beta _{j_{0} k}^{(3)}\) are unique for every \(j_{0} \in \mathbb{N} \). The function \(\phi _{j_{0} k}\) is defined as \(\phi _{j_{0} k_{1} k_{2}} (x,y) = \phi _{j_{0} k_{1}} (x)\phi _{j_{0} k_{2}} (y)\).

Some important cases of wavelets are the Haar, Daubechies, Shannon, Meyer, and Morlet wavelets (see [8]). We used the Haar and Daubechies wavelets in our simulation studies. Accordingly, by equation (1) the copula density c can be expanded with

$$\begin{aligned} \alpha _{j_{0}k} = \int _{{(0,1)}^{2}} c(u,v) \phi _{j_{0}k}(u,v) \,du \,dv, \quad k \in \mathbb{Z}^{2}. \end{aligned}$$

By the change of variables \(u=F_{1}(t_{1})\) and \(v=F_{2}(t_{2})\) we get

$$\begin{aligned} \alpha _{j_{0}k} = \int \phi _{j_{0}k}\bigl(F_{1}(t_{1}),F_{2}(t_{2}) \bigr) f(t _{1},t_{2}) \,dt_{1} \,dt_{2} = E_{f} \bigl(\phi _{j_{0}k}\bigl(F_{1}(T_{1}),F_{2}(T _{2})\bigr)\bigr). \end{aligned}$$
(2)

Assuming that \(I=I(Y_{1i}\leq T,T_{2i}\leq T)\), when \(F_{1}\) and \(F_{2}\) (the marginal distributions) are known, a moment-based estimator of \(\alpha _{j_{0}k}\) based on censored data is then given by

$$\begin{aligned} \hat{\alpha }_{j_{0} k} =& \int I \phi _{j_{0}k} \bigl(F_{1}(y_{1}),F_{2}(t _{2})\bigr) \hat{F}(dy_{1},\,dt_{2}) \\ =& \frac{1}{n} \sum_{i=1}^{n} W_{in}^{(\cdot )} I(Y_{1i}\leq t_{1},T _{2i}\leq t_{2}) \phi _{j_{0}k} \bigl(F_{1}(Y_{1i}),F_{2}(T_{2i}) \bigr). \end{aligned}$$
(3)

Then the wavelet-based estimator of c is given by

$$ \hat{c_{j_{0}}}(u,v) = \sum_{k\in \mathbb{Z}^{2}} \hat{\alpha }_{j _{0}k} \phi _{j_{0}k}(u,v),\quad u,v\in (0,1). $$

When \(F_{1}\) and \(F_{2}\) are unknown, their empirical distribution functions \(F_{1n}\) and \(F_{2n}\) are used. So the rank-based estimator is as follows:

$$\begin{aligned} \tilde{\alpha }_{j_{0}k} =& \int I \phi _{j_{0}k} \bigl(F_{1n}(y_{1}),F _{2n}(t_{2})\bigr) \hat{F}(dy_{1}, \,dt_{2}) \\ =& \frac{1}{n} \sum_{i=1}^{n} W_{in}^{(\cdot )} I(Y_{1i}\leq t_{1},T_{2i} \leq t_{2}) \phi _{j_{0}k} \bigl(F_{1n}(Y_{1i}),F_{2n}(T_{2i}) \bigr). \end{aligned}$$
(4)

Now we can introduce the linear wavelet-based estimator of c based on the ranks as

$$\begin{aligned} \tilde{c}_{j_{0}} (u,v) = \sum _{k\in \mathbb{Z}^{2}} \tilde{\alpha } _{j_{0}k} \phi _{j_{0}k}(u,v),\quad u,v\in (0,1). \end{aligned}$$
(5)

When we have no censoring, the definition reduces to that of the copula estimator introduced in [9]. We further denote by K any constant that may change from one line to another, does not depend on j, k, n, but depends on the wavelet basis and on \(\Vert c \Vert _{\infty } = \sup_{(u,v)\in (0,1)} \vert c(u,v) \vert \) and \(\Vert c \Vert _{2}= \int c(u,v)^{2} \,du\,dv\).

Since we deal with the wavelet method, it is very common to consider Besov spaces as functional spaces because they are characterized in terms of wavelet coefficients as follows. Besov spaces depend on three parameters \(s > 0\), \(1< p<\infty \), and \(1< q<\infty \) and are denoted by \(B^{s}_{pq}\). Let \(f\in L_{2}({\mathbb{R}}^{d})\) (that is, \(L_{2}( {\mathbb{R}}^{2})\) in our paper), and let \(s< r\) (wavelet regularity). Define the sequence norm of the wavelet coefficients of a function \(f\in B^{s}_{pq}\) by

$$\begin{aligned} \vert f_{B^{s}_{pq}} \vert = {\biggl(\sum_{k\in \mathbb{Z}^{2}} \vert \alpha _{j_{0}k} \vert ^{p}\biggr)} ^{1/p}+ { \biggl(\sum_{j\geq j_{0}}\biggl[2^{j_{0}(s+d(1/2-1/p))} \biggl( \sum _{k\in \mathbb{Z}^{2}} \vert \beta _{j,k} \vert ^{p}\biggr)^{1/p}\biggr]^{q}\biggr)}^{1/q}, \end{aligned}$$

where \(( \vert \beta _{j,k} \vert ^{p})^{1/p} = (\sum_{k\in \mathbb{Z}^{2}} \sum_{\epsilon \in S_{2}} \vert \beta _{j,k}^{\epsilon } \vert ^{p})^{1/p}\). We assume that the copula function c belongs to a Besov space.

3 Main results

In this section, we present the main results. The following theorems show that the wavelet-based estimators based on censoring attain nearly optimal convergence rates over a large range of Besov function classes. We also show that our estimator obtains the optimal convergence rates under mean integrated squared error (MISE) by accepting some mild conditions. The mean integrated squared error (MISE) is defined by

$$\begin{aligned} \operatorname{MISE}(\tilde{c}_{j_{0}},c) = E_{f}\biggl[ \int _{0}^{1} \int _{0}^{1} {\bigl\{ \tilde{c}_{j_{0}}(u,v)-c(u,v) \bigr\} }^{2} \,dv\,du\biggr]. \end{aligned}$$

In view of decomposition (1) for c, it is obvious that

$$\begin{aligned} \operatorname{MISE}(\tilde{c}_{j_{0}},c) = \operatorname{MISE}( \tilde{c}_{j_{0}},c_{j_{0}})+ \int _{0}^{1} \int _{0}^{1} {\bigl\{ D_{j_{0}} c(u,v)\bigr\} }^{2} \,dv\,du . \end{aligned}$$

The bias term in the equation can be bounded as in the proof of Lemma 1 in [4],

$$\begin{aligned} \int _{0}^{1} \int _{0}^{1} {\bigl\{ D_{j_{0}} c(u,v)\bigr\} }^{2} \,dv\,du \leq M 2^{-2j _{0} s^{*}}. \end{aligned}$$

Precisely, suppose that c belongs to the ball of radius \(M >0\) in the Besov space \(B_{p,q}^{s}(M)\), where the parameters \(s >0\) and \(p\geq 2\), \(1\leq q\leq \infty \) or \(s > 2/p -1\) and \(p\in [1,2]\), \(1 \leq q\leq \infty \); also, \(s^{*}\) can be defined as \(s^{*}=s+1-2/p\) if \(p\in [1,2]\) and \(s^{*}=s\) otherwise. Also, by defining \(\hat{c}_{j _{0}}\) based on Eq. (3), we can easily show that

$$\begin{aligned} \operatorname{MISE}(\tilde{c}_{j_{0}},c_{j_{0}})\leq 2 \operatorname{MISE}(\tilde{c}_{j_{0}}, \hat{c}_{j_{0}}) + 2 \operatorname{MISE}(\hat{c}_{j_{0}},c_{j_{0}}). \end{aligned}$$

We now look for an optimal upper bound for each of the above sequences.

Lemma 1

Letα̂be as in (3), and let \(j_{0}\)be an arbitrary integer in \(\mathbb{N}\). Then

$$\begin{aligned} S_{1} \equiv E\sum_{k\in z^{2}} (\hat{\alpha }_{j_{0} k}-\alpha _{j _{0} k})^{2} \leq K_{1} \frac{ 2^{2j_{0}}}{n} \end{aligned}$$

for some constant \(K_{1} > 0\)depending only onϕand either

$$\begin{aligned} \Vert c \Vert _{2} = \int c(u,v)^{2} \,du \,dv \quad \textit{or} \quad \Vert c \Vert _{\infty } = \sup_{u,v\in (0,1)} \bigl\vert c(u,v) \bigr\vert . \end{aligned}$$

Also note that the proposed linear wavelet estimator of \(c_{j_{0}}\) is \(\hat{c}_{j_{0}}(x,y) = \sum_{k \in \mathbb{Z}^{2}}\hat{\alpha }_{j _{0}k} \phi _{j_{0}k}(x,y)\). Then

$$\begin{aligned} \operatorname{MISE}(\hat{c}_{j_{0}},c_{j_{0}}) =O\biggl( \frac{2^{2j_{0}}}{n} \biggr). \end{aligned}$$

Note that \(j_{0}\) must be chosen so that \(2^{j_{0}}\ll \sqrt{n}\).

Proof of Lemma 1

The proof is similar to optimality results in Sect. 5 of [14]. □

Theorem 1

Assume that the functionϕis m-differentiable, and let \(\tilde{c}_{j0}\)be the copula density estimator ofcdefined in (5). Then there exists a constant \(K_{1} > 0\)such that, for a given \((u,v) \in {(0,1)}^{2}\)and any level \(j_{0}\)satisfying \(2^{j_{0}} \simeq {(n/\log n)}^{1/2-1/{2m}} \),

$$ \operatorname{MISE}(\tilde{c}_{j_{0}},\hat{c}_{j_{0}}) \leq K_{1} \frac{2^{2j_{0}}}{n} \biggl(2^{2j_{0}} \frac{\log n}{n}+2^{-j_{0}} \log n \biggr). $$

The error term associated with the use of ranks is negligible with respect to the usual error term as soon as \(2^{j_{0}}\gg \log (n)\). For the proof, see the Appendix.

Now, combining the results of Lemma 1 and Theorem 1, we are in a position to express the main theorem of this section.

Theorem 2

Letϕbe a scaling function mentioned in Sect2.2havingmderivatives with \(m>1+1/s^{*}\), and for arbitrary \(j_{0}\in \mathbb{N}\), let \(\tilde{c}_{j_{0}}\)be the estimator in (5). Then there exists a constant \(K_{2}>0\)such that, for all \(M\in (0, \infty )\), \(s>2/p-1\), and \(p,q\in [1,\infty )\), if \(j_{0}\)satisfies \(2^{j_{0}} \simeq n^{1/2+2s^{*}}\), then

$$\begin{aligned} \sup_{c\in B_{p,q}^{s}(M)} \operatorname{MISE}(\tilde{c}_{j_{0}},c) \leq K_{2} n^{-s ^{*}/1+s^{*}}. \end{aligned}$$

Note that the procedure of estimation described is optimal on the Besov space. The simulation studies are applied in the next section. For the proof, see the Appendix.

Remark 1

This study can be regarded as an extension from complete data to randomly right censored data. If we assume that there is no censoring, that is, \(G\equiv 0\) on \((-\infty ,+\infty )\), then \(\delta _{1i}=1\) for all \(i=1,2,\ldots,n\), and the estimators defined by (4) and (3) are the same as those of [14] for the complete data case. Therefore our estimators can be regarded as an extension of those of [14] from complete data to randomly right censored data.

4 Simulation studies and analysis for real data

In this section, we conduct simulation studies to investigate the performance of the proposed estimator (5) for censored data and compare it with the kernel estimator by an average squared error. Wavelet estimators for the copula, for noncensored data, have been proposed in [14].

4.1 Simulation study

The simulation scheme consisted in three steps.

Step 1: We simulate a random sample of size \(n=2000\) to display some scatter and rank–rank plots according to the following scheme.

  1. (i)

    The marginal distribution of random variables \(T_{1}\) and \(T_{2}\) (\(F_{1}\) and \(F_{2}\)) are simulated from three distributions: (a) \(F_{1}\) is the B distribution with shape parameter \(\alpha _{1}=0.5\) and scale parameter \(\alpha _{2}=0.7\), \(F_{2}\) is the exponential distribution with mean \(1/3\); (b) \(F_{1}\) is \(N (0,1)\), and \(F_{2}\) is as the same as in part (a); and (c) both \(F_{1}\) and \(F_{2}\) are uniform distributions.

  2. (ii)

    For the dependence structure, we consider two copula families, Gumbel and Clayton with Kendall’s tau \(\tau =0.5\).

  3. (iii)

    Censoring variables were simulated from \(\operatorname{Exp}(\nu (x))\) where \(\nu (x) = x+0.7\) with results in approximately 0.40 censoring.

The plots are shown in Figs. 1 and 2. In both figures the scatter plots in the top row show what happens when \(F_{2}\) is exponential with mean \(1/3\) and \(F_{1}\) is either \(B (0.5,0.7)\) or \(N (0,1)\). In the bottom row the left one shows the same scatter plot, but the margins are considered as \(U (0,1)\), and the right one shows the pairs of normalized ranks. It is worth mentioning that Fig. 1 displays scatter plots generated from the Gumbel copula, and Fig. 2 displays the same ones from the Clayton copula. As the number of observations is increased, the rank–rank plots are more unreadable. It is clear that for a random sample of size \(n=2000\) or more, the square in these plots could be completed filled, and all features of distribution had been lost. As a suggestion, we would consider the plots of the empirical copula function of the pairs \((R_{i}/n,S_{i}/n)\). For the same data as in panel (d, bottom right) in Figs. 1 and 2, 3d-histograms of the relative frequencies of the pseudo-observations \((R_{i}/n,S_{i}/n)\) are illustrated in Fig. 3. The plots show the relative frequency of \(n=2000\) pairs \((R_{i}/n,S_{i}/n)\) in a \(32 \times 32\) regular partitions of the unit square for Gumbel (left column) and Clayton (right column) copulas in two parts, full data (top row) and censored data (bottom row) with Kendall’s tau \(\tau =1/2\).

Figure 1
figure 1

Scatter and rank-rank plots for a sample of size 2000 from the Gumbel copula based on censored data with \(\tau = 1/2\), once the margins have been transformed to \(F_{1}\) and \(F_{2}\). (a, top left): \(F_{1} = B (0.5,0.7)\) and \(F_{2} = \varepsilon (3)\); (b, top right): \(F_{1} = N (0,1)\) and \(F_{2} = \varepsilon (3)\); (c, bottom left): \(F_{1} = F_{2} = U (0,1)\); (d, bottom right): the rank-rank plot for Gumbel copula

Figure 2
figure 2

Scatter and rank-rank plots for a sample of size 2000 from the Clayton copula based on censored data with \(\tau = 1/2\), once the margins have been transformed to \(F_{1}\) and \(F_{2}\). (a, top left): \(F_{1} = B (0.5,0.7)\) and \(F_{2} = \varepsilon (3)\); (b, top right): \(F_{1} = N (0,1)\) and \(F_{2} = \varepsilon (3)\); (c, bottom left): \(F_{1} = F_{2} = U (0,1)\); (d, bottom right): the rank-rank plot for Clayton copula

Figure 3
figure 3

3d-histograms. (a, top left): Gumbel (full data), (b, top right): Clayton (full data), (c, bottom left): Gumbel (censored data), (d, bottom right): Clayton (censored data)

Step 2: Marginal distribution of \(T_{1}\) and \(T_{2}\) are simulated from \(B(0.5,0.7)\) and \(\operatorname{Exp}(3)\), respectively. Then for \(N=2^{J}\leq \sqrt{(}n) < 2^{J+1}\), the empirical scaling coefficients at resolution level J are computed as follows:

$$ \tilde{\alpha }_{Jk_{1}k_{2}} = \frac{1}{n} \sum _{i=1}^{n} w_{in} I\biggl\{ \frac{k _{1}-1}{N} < \frac{R_{i}}{n}\leq \frac{k_{1}}{N} , \frac{k_{2}-1}{N} < \frac{S_{i}}{n}\leq \frac{k_{2}}{N}\biggr\} $$

for all \(k_{1},k_{2} \in \{1,\ldots,N\}\). Finally, we obtain the wavelet estimators for Gumbel and Clayton copulas (equation (5)) and consider their plots at levels \(J-1\) and \(J-2\). The graphs are in Fig. 4.

Figure 4
figure 4

Wavelet smoothing of \(n=2000\) pairs of normalized ranks from Gumbel (left column) and Clayton (right column). (a, top left) and (b, top right) show wavelet estimators under censored data at level \(j=J-1\) with \(J=5\) for Gumbel and Clayton copula, respectively. (c, bottom left) and (d, bottom right) show wavelet estimators under censored data at level \(j=J-2\) with \(J=5\) for Gumbel and Clayton copulas, respectively

Step 3: We find the average squared error (ASE) calculated for several wavelet copula estimators, including (5) for different sample sizes n. The results for ASE and the total number of replications \(N=100\) for two different dependence copula parameters and for wavelet and kernel methods are shown in Table 1. The ASE criterion is defined as

$$ \mathit{ASE}= \frac{1}{N}\sum_{l=1}^{N} \Biggl(\frac{1}{n^{2}}\sum_{j=1}^{n}\sum _{i=1} ^{n} {\bigl({\tilde{c}}^{(l)}(u_{i},v_{j})-c(u_{i},v_{j}) \bigr)}^{2}\Biggr), $$

where \({\tilde{c}}^{(l)}\) denotes an estimator of c at the lth replication. In this simulation study, we have used Daubechies’s compactly supported wavelet symmlet8 and level \(j_{0}=5\).

Table 1 Computed value for ASE compared between the proposed wavelet estimator in our paper and kernel estimator (defined in lopez(2015)) based on two copulas for various sample sizes

The codes are in MATLAB environment using the Wavelab software. Our simulation study is done for sampling in three sizes \(n=100\), \(n=300\), and \(n=500\). The main result in the table is consistent with the conclusions of [14]. In both of these examples the simulation results show that the wavelet estimator performs better than kernel estimators in terms of AMSE criterion. We can do the same work for some other copula functions like Gaussian, Frank, and Student copulas or for other wavelets such as Haar or Adelson wavelets.

4.2 Real data

Here we present an application of the proposed methodology for the data of [13], where the first observation is censored. The data consist of the indemnity payment (LOSS) and the allocated loss adjustment expense (ALAE) for 1500 general liability claims. The graphical representation of this data is considered in Fig. 5. Panels (a) and (b) show logarithm scale of the original data and the rank–rank plot, respectively. In panel (c) the 3D-histogram of Loss (censored) and Alae data is shown based on \(16 \times 16\) grid. Panel (d) shows the wavelet-based estimator described in Sect. 2.

Figure 5
figure 5

(a, top left): original data on the logarithm scale, (b, top right): rank–rank plot, (c, bottom left): 3D-histogram, panel (d, bottom right): wavelet-based copula density estimate

Due to our simulation results, the nonparametric procedure tries to provide a smooth copula estimate, and the wavelet-based estimator is a good estimator under the fact that a considerable copula family is Gumbel. Many authors who used this data in their researches claimed that the best representation of Loss and Alae data is the Gumbel copula.

5 Conclusions

Here we consider wavelet-based identification and estimation of a censored copula density function under a rank-based producer. We proposed a linear wavelet estimator and provided its asymptotic formulae for mean integrated square error. The wavelet methods offer fast computations and easy updating in addition to being easily adapted to the design. We derived an analog of the asymptotic formula of the mean integrated square error in the context of kernel density estimators for censored data, admitting an expansion with distinct squared bias and variance components.

The numerical performance of the proposed linear wavelet density estimators was illustrated on simulated datasets. Comparisons between full data and censored data for some different sample sizes were also given. Although using wavelet-based estimator of a copula function is very useful for underlying dependence structure, it does not cover the main conditions of parametric models. In future work we might also consider using a nonlinear wavelet-based copula density estimator for randomly censored data or using for goodness-of-fit testing.

References

  1. Autin, F., Lepennec, E., Tribouley, K.: Thresholding methods to estimate the copula density. J. Multivar. Anal. 101, 200–222 (2010). http://doi.org/10.1016/j.jmva.2009.07.009

    Article  MathSciNet  Google Scholar 

  2. Bouye, E., Durrleman, V., Nikeghbali, A., Riboulet, G., Roncalli, T.: Copulas for finance: a reading guide and some applications. In: Groupe de Recherche Operationelle. Credit Lyonnais, Paris (2000)

    Google Scholar 

  3. Bouye, E., Durrleman, V., Nikeghbali, A., Riboulet, G., Roncalli, T.: Copulas for finance—A reading guide and some applications. Social Science Research Network Working Paper Series (2007). https://doi.org/10.2139/ssrn.1032533

  4. Butucea, C., Tribouley, K.: Nonparametric homogeneity tests. J. Stat. Plan. Inference 136, 597–639 (2006). http://doi.org/10.1016/j.jspi.2004.08.003

    Article  MathSciNet  Google Scholar 

  5. Chatrabgoun, O., Parham, G.: Copula density estimation using multiwavelets based on the multiresolution analysis. Commun. Stat., Simul. Comput. 45, 3350–3372 (2014)

    Article  MathSciNet  Google Scholar 

  6. Chatrabgoun, O., Parham, G., Chinipardaz, R.: A Legendre multiwavelets approach to copula density estimation. Stat. Pap. 58, 673–690 (2017). http://doi.org/10.1007/s00362-015-0720-0

    Article  MathSciNet  Google Scholar 

  7. Cherubini, U., Luciano, E., Vecchiato, W.: Copula Methods in Finance. Wiley Finance Series. Wiley, Chichester (2004)

    Book  Google Scholar 

  8. Daubechies, I.: Ten Lectures on Wavelets. SIAM, Philadelphia (1992)

    Book  Google Scholar 

  9. Deheuvels, P.: La fonction de dépendance empirique et ses propriétés: Un test non paramétrique d’indépendance. Bull. Cl. Sci., Acad. R. Belg. (5) 65, 274–292 (1979)

    MATH  Google Scholar 

  10. Embrechts, P., Kluppelberg, C., Mikosch, T.: Modeling Extremal Events for Insurance and Finance. Springer, Berlin (1997)

    Book  Google Scholar 

  11. Fermanian, J.-D., Wegkamp, M.: Time dependent copulas. Preprint (2004). http://doi.org/10.1.1.332.6192

  12. Fleming, T.R., Harrington, D.P.: Counting Processes and Survival Analysis. Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics. Wiley, New York (1991). http://doi.org/10.1002/9781118150672

    MATH  Google Scholar 

  13. Frees, E., Valdez, E.A.: Understanding relationships using copulas. N. Am. Actuar. J. 2, 1–25 (1998). http://doi.org/10.1080/10920277.1998.10595667

    Article  MathSciNet  Google Scholar 

  14. Genest, C., Masiello, E., Tribouley, K.: Estimating copula densities through wavelets. Insur. Math. Econ. 44, 170–181 (2009). http://doi.org/10.1016/j.insmatheco.2008.07.006

    Article  MathSciNet  Google Scholar 

  15. Gribkova, S., Lopez, O.: Non-parametric copula estimation under bivariate censoring. Scand. J. Stat. 42, 925–946 (2015). http://doi.org/10.1111/sjos.12144

    Article  MathSciNet  Google Scholar 

  16. Joe, H.: Multivariate Models and Dependence Concepts. Chapman & Hall, London (1997). https://doi.org/10.1002/(SICI)1097-0258(19980930)17:18<2154::AID-SIM913>3.0.CO;2-R

    Book  Google Scholar 

  17. Li, L.: Non-linear wavelet-based density estimators under random censorship. J. Stat. Plan. Inference 117, 35–58 (2003). http://doi.org/10.1016/so378-3758(02)00366-x

    Article  MathSciNet  Google Scholar 

  18. Meyer, Y.: Wavelets: Algorithms and Applications. SIAM, Philadelphia (1993). http://doi.org/10.1137/1036136

    MATH  Google Scholar 

  19. Morettin, P.A., Toloi, C.M.C., Chiann, C., de Miranda, J.C.S.: Wavelet smoothed empirical copula estimators. Braz. Rev. Finance 8, 263–281 (2010)

    Article  Google Scholar 

  20. Nelsen, R.B.: An Introduction to Copulas, 2nd edn. Springer, New York (2006)

    MATH  Google Scholar 

  21. Patton, A.J.: A review of copula models for economic time series. J. Multivar. Anal. 110, 4–18 (2012). http://doi.org/10.1016/j.jmva.2012.02.021

    Article  MathSciNet  Google Scholar 

  22. Sklar, A.: Fonctions de répartition à n dimensions et leurs marges. Publ. Inst. Stat. Univ. Paris 8, 229–231 (1959)

    MATH  Google Scholar 

  23. Van der Laan, M.J.: Efficient estimation in the bivariate censoring model and repairing NPMLE. Ann. Stat. 24(2), 596–627 (1996). http://jstor.org/stable/2242663

    Article  MathSciNet  Google Scholar 

  24. Wang, W., Wells, M.T.: Nonparametric estimators of the bivariate survival function under simplified censoring conditions. Biometrika 84(4), 863–880 (1997). http://doi.org/10.1093/biomet/84.4.863

    Article  MathSciNet  Google Scholar 

Download references

Availability of data and materials

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

This work was carried out in collaboration among all authors. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Masoud Yarmohammadi.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Appendix:  Proofs

Appendix:  Proofs

1.1 A.1 Proof of Theorem 1

The proofs follow along with the lines of that in [14] for the density function. Compared to the uncensored case, the main difficulty here is to handle the weights \(W_{in}\). For this purpose, we use some assumptions of [15]. This is the direct approach to the copula function estimation problem under censoring. To complete the proof, we need some preparations. Suppose that Assumption 4 in [15] is satisfied. Here this assumption for our results is as follows.

Assumption 1

Assume that \(E{(\frac{\delta _{1}}{1-G})}^{2} =E{(\delta _{1} g(T_{1}))} ^{2} < \infty \), and assume that there exist i.i.d. random variables (\(Z_{i}\)) such that \(\sup \vert W_{in}-W_{i} \vert \leq B_{n} Z_{i}\), where \(B_{n}=\sup \vert \frac{\hat{G}-G}{1-\hat{G}} \vert = O(n^{-1/2})\) and \(E[Z_{i}]=E[\frac{\delta _{1}}{1-G(Y_{1i})}] <\infty \).

Also, suppose that the function ϕ defined in Sect. 2.2 is m-differentiable. We define

$$ \xi _{k} (Y_{1i},T_{2i}) = \bigl(\phi _{j_{0} k} \bigl(F_{1n}(Y_{1i}),F_{2n}(T _{2i})\bigr) - \phi _{j_{0} k} \bigl(F_{1}(Y_{1i}),F_{2}(T_{2i}) \bigr)\bigr)I_{(Y_{1i} \leq A_{1},Y_{2i}\leq A_{2})}. $$

In addition, \(\tilde{\alpha }_{j_{0} k} - \hat{\alpha }_{j_{0} k} = \frac{1}{n} \sum_{i=1}^{n} W_{in} \xi _{k} (Y_{1i},T_{2i})\). So we obtain

$$\begin{aligned} E\biggl(\sum_{k} (\tilde{\alpha }_{j_{0} k} - \hat{\alpha }_{j_{0} k})^{2}\biggr) =& \sum _{k} E \Biggl(\frac{1}{n} \sum _{i=1}^{n} W_{in} \xi _{k} (Y_{1i},T _{2i})\Biggr)^{2} \\ =&\sum_{k} E \Biggl(\frac{1}{n} \sum _{i=1}^{n} W_{i} \xi _{k} (Y_{1i},T_{2i})+ \frac{1}{n} \sum _{i=1}^{n} (W_{in}-W_{i}) \xi _{k} (Y_{1i},T_{2i})\Biggr)^{2} \\ \leq & 2 \Biggl(\sum_{k} E \Biggl( \frac{1}{n} \sum_{i=1}^{n} W_{i} \xi _{k} (Y _{1i},T_{2i}) \Biggr)^{2} \\ &{}+\sum_{k} E \Biggl( \frac{1}{n} \sum_{i=1}^{n} (W_{in}-W _{i}) \xi _{k} (Y_{1i},T_{2i}) \Biggr)^{2} \Biggr) \\ =:& 2(T_{1}+T_{2}). \end{aligned}$$

First, following the proof of Proposition 1 in [14], we obtain a bound for \(T_{1}\):

$$\begin{aligned} T_{1} \leq & \frac{1}{n^{2}} \sum_{k} \sum_{i\in I(j,0)\cup I_{1}(j,\epsilon )} E\bigl(W_{i} \xi _{k} (Y_{1i},T _{2i}) \bigr)^{2} \\ &{}+ \frac{1}{n^{2}}\sum_{k} \sum _{i\neq l\in I(j,0)\cup I_{1}(j,\epsilon )} \bigl\vert E\bigl(W_{i} W_{l} \xi _{k} (Y_{1i},T_{2i}) \xi _{k} (Y_{1l},T_{2l})\bigr) \bigr\vert +K2^{2j} n^{-\delta } E(W _{i})^{2}. \end{aligned}$$

Due to the last sentence in Sect. 2.1 and because of the limit function g, we get

$$\begin{aligned} E\bigl(W_{i} \xi _{k} (Y_{1i},T_{2i}) \bigr)^{2} = gE\bigl(\xi _{k} (T_{1i},T_{2i}) \bigr)^{2} \leq K_{1} E\bigl(\xi _{k} (T_{1i},T_{2i})\bigr)^{2} \end{aligned}$$

and also

$$\begin{aligned} E\bigl(W_{i} W_{l} \xi _{k} (Y_{1i},T_{2i}) \xi _{k} (Y_{1l},T_{2l}) \bigr) =& gE\bigl( \xi _{k} (T_{1i},T_{2i}) \xi _{k} (T_{1l},T_{2l})\bigr) \\ \leq &K_{1}E \bigl(\xi _{k} (T _{1i},T_{2i}) \xi _{k} (T_{1l},T_{2l})\bigr). \end{aligned}$$

Similarly, we have \(E(W_{i})^{2}= \frac{1}{1-G}=g\leq K_{1}\).

Since the support of the scaling function is compact, we finally obtain the bound for \(T_{1}\):

$$\begin{aligned} T_{1} \leq & K K_{1} \frac{1}{n^{2}} 2^{2j} \bigl(n2^{-j}\bigr) 2^{3j}\frac{ \log (n)}{n} + K K_{1} \frac{1}{n^{2}} 2^{2j} \bigl(n2^{-j} \bigr)^{2} 2^{3j}\frac{ \log (n)}{n} +K K_{1} n^{-\delta +1} \\ \leq & K \frac{2^{2j}}{n} \biggl(2^{2j} \frac{\log (n)}{n} + 2^{-j} \log (n) \biggr). \end{aligned}$$

In addition, according to Assumption 1, we have

$$\begin{aligned} E\bigl((W_{in}-W_{i})\xi _{k} (Y_{1i},T_{2i})\bigr)^{2} =& E\biggl( \frac{\delta _{1} ( \hat{G}-G)}{(1-\hat{G})(1-G)}\xi _{k} (Y_{1i},T_{2i}) \biggr)^{2} \\ =&\frac{(\hat{G}-G)^{2}}{(1-\hat{G})^{2}} E\bigl(Z_{i}\xi _{k} (T_{1i},T _{2i})\bigr)^{2} \\ \leq & Kn^{-1}E\bigl(\xi _{k} (T_{1i},T_{2i}) \bigr)^{2}. \end{aligned}$$

Similarly, the bound for \((W_{in}-W_{i})\xi _{k} (Y_{1i},T_{2i})\) follows:

$$\begin{aligned}& E\bigl((W_{in}-W_{i}) (W_{l n}-W_{l}) \xi _{k} (Y_{1i},T_{2i}) \xi _{k} (Y_{1l},T _{2l})\bigr) \\& \quad = \frac{(\hat{G}-G)^{2}}{(1-\hat{G})^{2}} E \bigl(Z_{i}^{2} \xi _{k} (T_{1i},T_{2i}) \xi _{k} (T_{1l},T_{2l})\bigr) \\& \quad \leq K n^{-1} E\bigl(\xi _{k} (T_{1i},T_{2i}) \xi _{k} (T_{1l},T_{2l})\bigr). \end{aligned}$$

It remains to find the bound for the last part of \(T_{2}\):

$$ E(W_{in}-W_{i})^{2} = E\biggl( \frac{\delta _{1} (\hat{G}-G)}{(1-\hat{G})(1-G)} \biggr)^{2} = \frac{(\hat{G}-G)^{2}}{(1- \hat{G})^{2}} E(Z_{i})^{2} \leq K n^{-1}, $$

where the last equality follows from Assumption 1. At the end, the sharper bound for \(C_{2}\) is

$$\begin{aligned} T_{2} \leq & \frac{1}{n^{2}} \sum_{k} \sum_{i\in I(j,0)\cup I_{1}(j,\epsilon )} \frac{(\hat{G}-G)^{2}}{(1- \hat{G})^{2}} E\bigl(Z_{i} \xi _{k} (T_{1i},T_{2i})\bigr)^{2} \\ &{}+ \frac{1}{n^{2}} \sum_{k} \sum _{i\neq l\in I(j,0)\cup I_{1}(j,\epsilon )} \frac{(\hat{G}-G)^{2}}{(1- \hat{G})^{2}} \bigl\vert E\bigl(Z_{i}^{2} \bigr) \xi _{k} (T_{1i},T_{2i}) \xi _{k} (T_{1l},T _{2l}) \bigr\vert \\ &{}+ K n^{-\delta } 2^{2j} \frac{(\hat{G}-G)^{2}}{(1-\hat{G})^{2}} E(Z_{i})^{2} \\ \leq & K K_{1} n^{-1} \frac{1}{n^{2}} 2^{2j} \bigl(n2^{-j}\bigr) 2^{3j}\frac{ \log (n)}{n} + K K_{1} n^{-1} \frac{1}{n^{2}} 2^{2j} \bigl(n2^{-j}\bigr)^{2} 2^{3j}\frac{ \log (n)}{n} +K K_{1} n^{-1} n^{-\delta +1} \\ \leq & K \frac{2^{2j}}{n^{2}} \biggl(2^{2j} \frac{\log (n)}{n}+2^{-j} \log (n)\biggr). \end{aligned}$$

Therefore with the bounds for \(T_{1}\) and \(T_{2}\) the proof of Theorem 1 is complete.

1.2 A.2 Proof of the Theorem 2

Combining the result of Lemma 1 and Theorem 1 and the fact that the error term associated with the use of ranks in the proof of Theorem 1 is negligible with respect to the usual error term as soon as \(2^{j_{0}} \gg \log (n)\), the proof is complete.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ghanbari, B., Yarmohammadi, M., Hosseinioun, N. et al. Wavelet estimation of copula function based on censored data. J Inequal Appl 2019, 188 (2019). https://doi.org/10.1186/s13660-019-2140-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13660-019-2140-5

Keywords