Bounds for the approximation of Poisson-binomial distribution by Poisson distribution

Hung, Tran Loc; Thao, Vu Thi

doi:10.1186/1029-242X-2013-30

Research
Open access
Published: 24 January 2013

Bounds for the approximation of Poisson-binomial distribution by Poisson distribution

Tran Loc Hung¹ &
Vu Thi Thao¹

Journal of Inequalities and Applications volume 2013, Article number: 30 (2013) Cite this article

3902 Accesses
6 Citations
Metrics details

Abstract

Let ( $X_{n k}$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ ) be a row-wise triangular array of independent Bernoulli random variables with success probabilities $P (X_{n k} = 1) = 1 - P (X_{n k} = 0) = p_{n k} \in [0, 1]$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ . For every $n = 1, 2, \dots$ , the random variables $S_{n} = \sum_{k = 1}^{n} X_{n k}$ have probability distributions with complicated structure and therefore they are used to being approximated by Poisson distribution. Well-known Le Cam’s inequality is established for providing information on the quality of a Poisson approximation. The main aim of this paper is to re-establish the Le Cam-type inequalities via a linear operator. The operator method used in this paper is quite elementary and it also could be applied for the probability distributions of random sums $S_{N_{n}} = \sum_{k = 1}^{N_{n}} X_{n k}$ in the Poisson approximation, where $N_{n}$ , $n = 1, 2, \dots$ , are positive integer-valued random variables, independent of all $X_{n k}$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ .

MSC:60F05, 60G50, 41A36.

1 Introduction

Throughout this paper, let ( $X_{n k}$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ ) be a row-wise triangular array of independent Bernoulli random variables with success probabilities $P (X_{n k} = 1) = 1 - P (X_{n k} = 0) = p_{n k} \in [0, 1]$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ . The random variables $S_{n} = \sum_{k = 1}^{n} X_{n k}$ , $n = 1, 2, \dots$ , are often called the Poisson-binomial random variables. And it is easily seen that the mean, variance, and characteristic function of $S_{n}$ , $n = 1, 2, \dots$ , are $E (S_{n}) = \sum_{k = 1}^{n} p_{n k}$ , $D (S_{n}) = \sum_{k = 1}^{n} p_{n k} (1 - p_{n k})$ , and $f_{S_{n}} (t) = E (e^{i t S_{n}}) = \prod_{k = 1}^{n} (1 - p_{n k} + p_{n k} e^{i t})$ , respectively.

The probability distributions of $S_{n}$ , $n = 1, 2, \dots$ , have many applications in various areas of mathematics and statistics such as reliability, survival analysis, survey sampling, econometrics, and so on (the reader is referred to [1, 2] and [3] for full development). However, since the probability distributions of $S_{n}$ , $n \geq 1$ , have the complicated structure (see, for instance, [3]), they are used to being approximated by the distribution of Poisson random variables $Z_{λ_{n}}$ with a positive parameter $λ_{n} = E (S_{n}) = \sum_{k = 1}^{n} p_{n k}$ . More specifically, assume that

lim_{n \to \infty} λ_{n} = λ (0 < λ < + \infty),

(1)

then

S_{n} \overset{d}{\to} Z_{λ}, as n \to \infty,

(2)

where, and from now on, the notation $\overset{d}{\to}$ means the convergence in distribution (see, for instance, [4]). Moreover, remarkable Le Cam’s inequality for the Poisson-binomial distribution [5] is widely considered in literature as follows:

\sum_{k = 0}^{\infty} | P (S_{n} = k) - P (Z_{λ_{n}} = k) | \leq 2 \sum_{k = 1}^{n} p_{n k}^{2}

(3)

(we refer the reader to the results of Le Cam [5], Barbour, Holst, and Janson [6], Steele [7], Chen [8], Chen and Liu [1], Neammanee [9], and Ross [10] for more details).

It should be noted that in [6, 7], and [9] various powerful tools (such as the method of matrix analysis, the semi-group method, the coupling method, and the Chen-Stein method) for providing Le Cam’s inequality have been demonstrated. The main objective of this paper is to obtain the bounds for well-known Le Cam’s inequality in (3) using the operator method, introduced by Renyi [4]. In the third section, we use the operator method from [4] to establish the bounds for the approximation of Poisson-binomial distribution by Poisson distribution. The operator method in this paper is quite elementary and it also could be applied for random sums $S_{N_{n}} = \sum_{k = 1}^{N_{n}} X_{n k}$ , $S_{0} = 0$ , where $N_{n}$ , $n = 1, 2, \dots$ are positive integer-valued random variables, independent of all $X_{n k}$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ . This will be taken up in the last section. We refer the reader to the works of Trotter [11], Renyi [4], and Hung [12] for a deeper discussion of this operator method. Based on the operator method, the received results of this paper are analogues of Le Cam’s inequality in classical literature (we refer the reader to Steele [7], Le Cam [5], Chen [8], Neammanee [9], and Wang [3] for a complete treatment of the problem).

2 Preliminaries

In the sequel we will need the operator method, which has been used for a long time in various studies of classical limit theorems for sums of independent random variables (see Trotter [11], Renyi [4], and Hung [12] for the complete bibliography).

We recall some definitions and notations. We denote by K the set of all real-valued bounded functions $f (x)$ , defined on the set of non-negative integers $Z_{+} = {0, 1, 2, \dots}$ . The norm of a function $f \in K$ is defined by $∥ f ∥ = {sup}_{x \in Z_{+}} | f (x) |$ .

Definition 2.1 We define a linear operator associated with a positive discrete random variable X, $A_{X} : K \to K$ , by setting

(A_{X} f) (x) : = E (f (X + x)) = \sum_{k = 0}^{\infty} f (x + k) P (X = k), \forall f \in K, x \in Z_{+} .

(4)

It is to be noticed that the linear operator defined in (4) is actually a discrete form of Trotter’s operator (we refer the readers to Trotter [11], Renyi [4], and Hung [12] for a more general and detailed discussion of this operator method).

We will need some properties of the operator in (4) in the sequel. Let $A_{X}$ , $A_{Y}$ be operators associated with two discrete random variables X and Y for $f, g \in K$ . Suppose that α and β are two real numbers, then we easily get the following linear property of the operator in (4):

A_{X} (α f + β g) = α A_{X} (f) + β A_{X} (g) .

We define the operator $(A_{X} + A_{Y})$ by $(A_{X} + A_{Y}) f = A_{X} f + A_{Y} f$ , $\forall f \in K$ , and the product of two operators $A_{X}$ and $A_{Y}$ is $(A_{X} A_{Y}) f = A_{X} (A_{Y} f)$ , $\forall f \in K$ .

It is obvious that

1.
$∥ A_{X} f ∥ ⩽ ∥ f ∥$ for all $f \in K$ .
2.
$∥ A_{X} f + A_{Y} f ∥ ⩽ ∥ A_{X} f ∥ + ∥ A_{Y} f ∥$ for all $f \in K$ .
3.
Suppose that $A_{X}$ and $A_{Y}$ are operators associated with two independent random variables X, Y and $f \in K$ . Then $A_{X} A_{Y} f = A_{Y} A_{X} f = A_{X + Y} f$ .

In fact, for all $f \in K$ and $x \in Z_{+}$ ,

\begin{array}{rcl} A_{X} A_{Y} f (x) & = & A_{X} (A_{Y} f (x)) = A_{X} (\sum_{k = 0}^{\infty} f (x + k) P (Y = k)) \\ = & \sum_{r, k = 0}^{\infty} f (x + k + r) P (Y = k) P (X = r) \\ = & \sum_{l = 0}^{\infty} f (x + l) P (X + Y = l) \\ = & A_{X + Y} f (x) \end{array}

by an argument analogous to that used for the proof of $A_{Y} A_{X} f = A_{X + Y} f$ .

4.
Suppose that $A_{X_{1}}, A_{X_{2}}, \dots, A_{X_{n}}$ are the operators associated with the independent random variables $X_{1}, X_{2}, \dots, X_{n}$ . Then $A_{S_{n}} = A_{X_{1}} A_{X_{2}} \dots A_{X_{n}}$ is the operator associated with the partial sum $S_{n} = X_{1} + X_{2} + \dots + X_{n}$ .
5.
Suppose that $A_{X_{1}}, A_{X_{2}}, \dots, A_{X_{n}}$ and $A_{Y_{1}}, A_{Y_{2}}, \dots, A_{Y_{n}}$ are operators associated with independent random variables $X_{1}, X_{2}, \dots, X_{n}$ and $Y_{1}, Y_{2}, \dots, Y_{n}$ . Moreover, assume that all $X_{i}$ and $Y_{j}$ are independent for $i, j = 1, 2, \dots, n$ . Then, for every $f \in K$ ,
$∥ A_{\sum_{k = 1}^{n} X_{k}} f - A_{\sum_{k = 1}^{n} Y_{k}} f ∥ ⩽ \sum_{k = 1}^{n} ∥ A_{X_{k}} f - A_{Y_{k}} f ∥ .$
(5)

Clearly,

A_{X_{1}} A_{X_{2}} \dots A_{X_{n}} - A_{Y_{1}} A_{Y_{2}} \dots A_{Y_{n}} = \sum_{k = 1}^{n} A_{X_{1}} A_{X_{2}} \dots A_{X_{k - 1}} (A_{X_{k}} - A_{Y_{k}}) A_{Y_{k + 1}} \dots A_{Y_{n}} .

It deduces that

\begin{array}{rcl} ∥ A_{\sum_{k = 1}^{n} X_{k}} f - A_{\sum_{k = 1}^{n} Y_{k}} f ∥ & ⩽ & \sum_{k = 1}^{n} ∥ A_{X_{1}} \dots A_{X_{k - 1}} (A_{X_{k}} - A_{Y_{k}}) A_{Y_{k + 1}} \dots A_{Y_{n}} f ∥ \\ ⩽ & \sum_{k = 1}^{n} ∥ A_{Y_{k + 1}} \dots A_{Y_{n}} (A_{X_{k}} - A_{Y_{k}}) f ∥ \\ ⩽ & \sum_{k = 1}^{n} ∥ A_{X_{k}} f - A_{Y_{k}} f ∥ . \end{array}

6.
It is to be noticed that $∥ A_{X}^{n} f - A_{Y}^{n} f ∥ \leq n ∥ A_{X} f - A_{Y} f ∥$ .
7.
Suppose that $X_{1}, \dots, X_{n}$ and $Y_{1}, \dots, Y_{n}$ are independent random variables (in each group), and let ${N_{n}, n = 1, 2, \dots}$ be a sequence of positive integer-valued random variables independent of all $X_{k}$ and $Y_{k}$ , $k = 1, 2, \dots$ . Then, for every $f \in K$ ,
$∥ A_{\sum_{k = 1}^{N_{n}} X_{k}} f - A_{\sum_{k = 1}^{N_{n}} Y_{k}} f ∥ \leq \sum_{n = 1}^{\infty} P (N_{n} = n) \sum_{k = 1}^{n} ∥ A_{X_{k}} f - A_{Y_{k}} f ∥ .$

Lemma 2.1 The equation $A_{X} f (x) = A_{Y} f (x)$ for $f \in K$ , $x \in Z_{+}$ , provided that X and Y are identically distributed random variables.

Let $A_{X_{1}}, A_{X_{2}}, \dots, A_{X_{n}}, \dots$ be a sequence of operators associated with the independent discrete random variables $X_{1}, X_{2}, \dots, X_{n}, \dots$ , and $A_{X}$ be the operator associated with the discrete random variable X. The following lemma states one of the most important properties of the operator $A_{X}$ .

Lemma 2.2 A sufficient condition for a sequence of random variables $X_{1}, X_{2}, \dots, X_{n} \dots$ converging in distribution to a random variable X is that

lim_{n \to \infty} ∥ A_{X_{n}} f - A_{X} f ∥ = 0 for all f \in K .

Proof Since ${lim}_{n \to \infty} ∥ A_{X_{n}} f - A_{X} f ∥ = 0$ , for all $f \in K$ , we get

lim_{n \to \infty} | \sum_{k = 0}^{\infty} f (x + k) (P (X_{n} = k) - P (X = k)) | = 0 for all f \in K and x \in Z_{+} .

If we choose

f (x) = {\begin{matrix} 1, & if 0 ⩽ x ⩽ t, \\ 0, & if x > t . \end{matrix}

Then

lim_{n \to \infty} | \sum_{k = 0}^{t} (P (X_{n} = k) - P (X = k)) | = 0 .

It follows that $P (X_{n} ⩽ t) - P (X ⩽ t) \to 0$ as n tends to +∞.

In other words, $X_{n} \overset{d}{\to} X$ as $n \to + \infty$ . □

3 A bound of Poisson-binomial approximation

Let $A_{X_{n k}}$ , $k = 1, \dots, n$ ; $n = 1, 2, \dots$ be the operators associated with the random variables $X_{n k}$ , $k = 1, \dots, n$ ; $n = 1, 2, \dots$ , and let $A_{Z_{p_{n k}}}$ , $k = 1, \dots, n$ ; $n = 1, 2, \dots$ , be the operators associated with the Poisson random variables with parameters $p_{n k}$ , $k = 1, \dots, n$ ; $n = 1, 2, \dots$ . On the assumption that $Z_{λ_{n}}$ is a Poisson random variable with a positive parameter $λ_{n} = \sum_{k = 1}^{n} p_{n k}$ , we can perform that $Z_{λ_{n}} \overset{d}{=} \sum_{k = 1}^{n} Z_{p_{n k}}$ , where $Z_{p_{n 1}}, Z_{p_{n 2}}, \dots, Z_{p_{n n}}$ are independent Poisson random variables with positive parameters $p_{n 1}, p_{n 2}, \dots, p_{n n}$ , and the notation $\overset{d}{=}$ denotes coincidence of distributions. We will now state an analogue of Le Cam’s inequality [5] via the linear operator in (4) as follows.

Theorem 3.1 Let ( $X_{n k}$ , $1 \leq k \leq n$ ; $n = 1, 2, \dots$ ) be a row-wise triangular array of independent, Bernoulli random variables with success probabilities $P (X_{n k} = 1) = 1 - P (X_{n k} = 0) = p_{n k}$ , $p_{n k} \in [0, 1]$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ . Let us write $S_{n} = \sum_{k = 1}^{n} X_{n k}$ and $λ_{n} = \sum_{k = 1}^{n} p_{n k}$ . We denote by $Z_{λ_{n}}$ the Poisson random variable with the parameter $λ_{n}$ . Then, for all real-valued bounded functions $f \in K$ , we have

∥ A_{S_{n}} f - A_{Z_{λ_{n}}} f ∥ ⩽ 2 ∥ f ∥ \sum_{k = 1}^{n} p_{n k}^{2} .

(6)

Proof Applying the inequality in (5), we have

∥ A_{S_{n}} f - A_{Z_{λ_{n}}} f ∥ ⩽ \sum_{k = 1}^{n} ∥ A_{X_{n k}} f - A_{Z_{p_{n k}}} f ∥ .

Moreover, for all $f \in K$ and for all $x \in Z_{+}$ , we conclude that

\begin{array}{rcl} A_{X_{n k}} f (x) - A_{Z_{p_{n k}}} f (x) & = & \sum_{r = 0}^{\infty} f (x + r) (P (X_{n k} = r) - P (Z_{p_{n k}} = r)) \\ = & \sum_{r = 0}^{\infty} f (x + r) (P (X_{n k} = r) - \frac{e^{- p_{n k}} p_{n k}^{r}}{r!}) \\ = & f (x) (1 - p_{n k} - e^{- p_{n k}}) + f (x + 1) (p_{n k} - p_{n k} e^{- p_{n k}}) \\ - \sum_{r = 2}^{\infty} f (x + r) \frac{e^{- p_{n k}} p_{n k}^{r}}{r!} . \end{array}

Since $\sum_{r = 2}^{\infty} \frac{e^{- p_{n k}} p_{n k}^{r}}{r!} = 1 - e^{- p_{n k}} - p_{n k} e^{- p_{n k}}$ , for all $f \in K$ and $x \in Z_{+}$ , it may be concluded that

\begin{array}{rcl} | A_{X_{n k}} (f) - A_{Z_{p_{n k}}} (f) | & = & | f (x) (1 - p_{n k} - e^{- p_{n k}}) + f (x + 1) (p_{n k} - p_{n k} e^{- p_{n k}}) \\ - \sum_{r = 2}^{\infty} f (x + r) \frac{e^{- p_{n k}} p_{n k}^{r}}{r!} | \\ ⩽ & | f (x) (1 - p_{n k} - e^{- p_{n k}}) | + | f (x + 1) (p_{n k} - p_{n k} e^{- p_{n k}}) | \\ + | \sum_{r = 2}^{\infty} f (x + r) \frac{e^{- p_{n k}} p_{n k}^{r}}{r!} | \\ ⩽ & sup_{x \in Z^{+}} | f (x) | (e^{- p_{n k}} - 1 + p_{n k} + p_{n k} - p_{n k} e^{- p_{n k}} + 1 - e^{- p_{n k}} - p_{n k} e^{- p_{n k}}) \\ ⩽ & 2 ∥ f ∥ p_{n k} (1 - e^{- p_{n k}}) ⩽ 2 ∥ f ∥ p_{n k}^{2} . \end{array}

Therefore, applying (5), we can assert that

∥ A_{S_{n}} (f) - A_{Z_{λ_{n}}} (f) ∥ ⩽ 2 ∥ f ∥ \sum_{k = 1}^{n} p_{n k}^{2} .

This completes the proof. □

Remark 3.1 According to Theorem 3.1 and assumption (1), using the definition of the norm of the operator A, we get following inequality:

∥ A_{S_{n}} - A_{Z_{λ}} ∥ \leq 2 (\sum_{k = 1}^{n} p_{n k}^{2}) .

The following corollaries are immediate consequences from Theorem 3.1.

Corollary 3.1 Under the stated assumptions of Theorem 3.1, for all $k = 0, 1, 2, \dots$ ,

| P (S_{n} = k) - P (Z_{λ_{n}} = k) | ⩽ 2 \sum_{j = 1}^{n} p_{n j}^{2} .

(7)

Proof Choose the particular function $f (x)$ , $x \in Z_{+}$ , such that

f (x + m) = {\begin{matrix} 1 & if m = k, \\ 0 & if m \neq k . \end{matrix}

Set $y = x + m$ . Since $x, m \in Z^{+}$ , it follows that $y \in Z_{+}$ . Then we have

∥ f ∥ = sup_{x} | f (x) | = sup_{y} | f (y) | = 1 .

Thus, according to Theorem 3.1, we conclude that

∥ A_{S_{n}} (f) - A_{Z_{λ_{n}}} (f) ∥ \leq 2 \sum_{j = 1}^{n} p_{n j}^{2} .

(8)

On the other hand, by choosing the function $f (x)$ as above, we have

\begin{array}{rcl} ∥ A_{S_{n}} f - A_{Z_{λ_{n}}} f ∥ & = & sup_{x} | A_{S_{n}} f (x) - A_{Z_{λ_{n}}} f (x) | \\ = & sup_{x} | \sum_{m = 0}^{\infty} f (x + m) [P (S_{n} = m) - P (Z_{λ_{n}} = m)] | \\ = & sup_{x} | f (x) [P (S_{n} = 0) - P (Z_{λ_{n}} = 0)] + \dots \\ + f (x + k) [P (S_{n} = k) - P (Z_{λ_{n}} = k) + \dots] | \\ = & | P (S_{n} = k) - P (Z_{λ_{n}} = k) | . \end{array}

Applying (8) we can assert that

| P (S_{n} = k) - P (Z_{λ_{n}} = k) | ⩽ 2 \sum_{j = 1}^{n} p_{n j}^{2} .

The proof is complete. □

Corollary 3.2 Let condition (1) hold. Under the hypotheses of Theorem 3.1, if moreover

lim_{n \to \infty} max_{1 \leq k \leq n} p_{n k} = 0,

(9)

then the distribution of $S_{n}$ converges to the Poisson distribution with mean λ, i.e., $S_{n} \overset{d}{\to} Z_{λ}$ as $n \to \infty$ .

Proof

The proof is based on the following observation:

\sum_{k = 1}^{n} p_{n k}^{2} \leq max_{1 \leq k \leq n} p_{n k} \times \sum_{k = 1}^{n} p_{n k} .

According to the inequality in (6) for all $f \in K$ and (9), we conclude that

lim_{n \to \infty} ∥ A_{S_{n}} (f) - A_{Z_{λ_{n}}} (f) ∥ = 0 .

As an argument analogous to the one used for the proof of Corollary 3.1, on account of Lemma 2.2, we get

lim_{n \to \infty} [P (S_{n} = k) - P (Z_{λ_{n}} = k)] = 0 .

Then, on account of (1), we have

lim_{n \to \infty} P (S_{n} = k) = lim_{n \to \infty} \frac{e^{- λ_{n}} {(λ_{n})}^{k}}{k!} = \frac{e^{- λ} λ^{k}}{k!}, k = 0, 1, 2 \dots .

Thus, the proof is straightforward. □

4 A bound of random Poisson-binomial approximation

Throughout this section, we begin with assuming that $N_{n}$ , $n = 1, 2, \dots$ , are positive integer-valued random variables independent of all $X_{n k}$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ , which are supposed to obey the relation

N_{n} \overset{P}{\to} + \infty as n \to + \infty .

(10)

Here and subsequently, $\overset{P}{\to}$ denotes the convergence in probability. For every $n = 1, 2, \dots$ , we denote by $S_{N_{n}}$ the random sums $S_{N_{n}} = \sum_{k = 1}^{N_{n}} X_{n k}$ ( $S_{0} = 0$ by convention). Therefore, the random sums $S_{N_{n}}$ could be said to be the random Poisson-binomial random variables. In this section, we establish Le Cam-type inequalities related to the Poisson approximation for distributions of random Poisson-binomial variables. It is to be noticed that many various results concerning the random summations have already been included in the textbooks of probability theory; see, e.g., [4, 13, 14]).

Let $A_{X_{n 1}}, A_{X_{n 2}}, \dots, A_{X_{n N_{n}}}$ be operators associated with the independent triangular array of random variables $X_{n 1}, X_{n 2}, \dots, X_{n N_{n}}$ , and let $A_{Z_{p_{n 1}}}, A_{Z_{p_{n 2}}}, \dots, A_{Z_{p_{n N_{n}}}}$ be operators associated with the independent Poisson distributed random variables with positive parameters $p_{n 1}, p_{n 2}, \dots, p_{n N_{n}}$ . According to the properties of the linear operator in (4), we have $A_{S_{N_{n}}} = A_{X_{n 1}} A_{X_{n 2}} \dots A_{X_{n N_{n}}}$ and $A_{Z_{λ_{N_{n}}}} = A_{Z_{p_{n 1}}} A_{Z_{p_{n 2}}} \dots A_{Z_{p_{n N_{n}}}}$ are the respective operators associated with the random sums $S_{N_{n}} = \sum_{k = 1}^{N_{n}} X_{n k}$ and $Z_{λ_{N_{n}}} = \sum_{k = 1}^{N_{n}} Z_{p_{n k}}$ .

Theorem 4.1 Let ( $X_{n k}$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ ) be a row-wise triangular array of independent, non-identically distributed Bernoulli random variables with success probabilities $P (X_{n k} = 1) = 1 - P (X_{n k} = 0) = p_{n k}$ , $p_{n k} \in [0, 1]$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ . Moreover, we suppose that $N_{n}$ , $n = 1, 2, \dots$ are independent positive integer-valued random variables, independent of all $X_{n k}$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ . Then, for all real-valued bounded functions $f \in K$ and for all $x \in Z_{+}$ , we have

∥ A_{S_{N_{n}}} (f) - A_{Z_{λ_{N_{n}}}} (f) ∥ ⩽ 2 ∥ f ∥ E (\sum_{k = 1}^{N_{n}} p_{n k}^{2}) .

Proof According to the assumptions on the random variables $N_{n}$ , $X_{n k}$ , $Z_{λ_{n}}$ , $k = 1, 2, \dots, n$ ; $n = 1, 2, \dots$ , we can write

A_{S_{N_{n}}} f (x) = \sum_{m = 1}^{\infty} P (N_{n} = m) \sum_{k = 0}^{\infty} f (x + k) P (S_{m} = k),

and

A_{Z_{λ_{N_{n}}}} f (x) = \sum_{m = 1}^{\infty} P (N_{n} = m) \sum_{k = 0}^{\infty} f (x + k) \frac{e^{- λ_{m}} λ_{m}^{k}}{k!} .

Therefore, by an argument analogous to that used for the proof of Theorem 3.1, for all real-valued function $f \in K$ , $x \in Z_{+}$ , we have

\begin{array}{rcl} ∥ A_{S_{N_{n}}} (f) - A_{Z_{λ_{N_{n}}}} (f) ∥ & = & ∥ \sum_{m = 1}^{\infty} P (N_{n} = m) (A_{X_{n 1}} \dots A_{X_{n m}} (f) - A_{Z_{p_{n 1}}} \dots A_{Z_{p_{n m}}} (f)) ∥ \\ ⩽ & \sum_{m = 1}^{\infty} P (N_{n} = m) ∥ A_{X_{n 1}} \dots A_{X_{n m}} (f) - A_{Z_{p_{n 1}}} \dots A_{Z_{p_{n m}}} (f) ∥ \\ ⩽ & \sum_{m = 1}^{\infty} P (N_{n} = m) \sum_{k = 1}^{m} 2 ∥ f ∥ p_{n k}^{2} \\ ⩽ & 2 ∥ f ∥ E (\sum_{k = 1}^{N_{n}} p_{n k}^{2}) . \end{array}

The proof is complete. □

Note that the following remarks are immediate consequences from Theorem 4.1.

Remark 4.1 According to Theorem 4.1 and assumption (1), using the definition of the norm of the operator A, we conclude that

∥ A_{S_{N_{n}}} - A_{Z_{λ}} ∥ \leq 2 E (\sum_{k = 1}^{N_{n}} p_{n k}^{2}) .

Remark 4.2 By an argument analogous to that used for the proof of Corollary 3.1, under the stated assumptions of Theorem 4.1, for all $k = 0, 1, 2, \dots$ , we have

| P (S_{N_{n}} = k) - P (Z_{λ_{N_{n}}} = k) | ⩽ 2 E (\sum_{k = 1}^{N_{n}} p_{n k}^{2}) .

When the success probability is identical, $p_{n k} = p_{n} \in [0, 1]$ , $k = 1, 2, \dots, n$ ; for $n = 1, 2, \dots$ , we obtain the following remark.

Remark 4.3 Suppose that the $N_{n}$ , $n = 1, 2, \dots$ are positive integer-valued random variables independent of all independent identically distributed random variables $X_{n k}$ , and assume that $P (X_{n k} = 1) = 1 - P (X_{n k} = 0) = p_{n} \in [0, 1]$ , $k = 1, 2, \dots, N_{n}$ ; $n = 1, 2, \dots$ . Then, for all $k = 0, 1, 2, \dots$ , we get the following inequality:

| P (S_{N_{n}} = k) - P (Z_{λ_{N_{n}}} = k) | ⩽ 2 E (N_{n}) p_{n}^{2} .

It is worth noticing that when the positive integer-valued random variables $N_{n}$ , $n = 1, 2, \dots$ take on the value n with probability one, i.e., $P (N_{n} = n) = 1$ , the results concerning the probability distributions of the random sums $S_{N_{n}}$ in the Poisson approximation in this section return to the ones in Section 3.

We conclude this paper with the following comments. The linear operator in this paper introduced by Renyi [4] essentially is a discrete form of Trotter’s operator [11] which has been used in the theory of limit theorems. The proofs of theorems in this paper by the operator method are very elementary and elegant. The received results in this article allow us to think about a new approach method to the Poisson approximation problems for the distributions of the sums of the discrete independent random variables like Poisson-binomial, geometric, and negative binomial variables.

References

Chen SX, Liu JS: Statistical applications of the Poisson-binomial and conditional Bernoulli distributions. Stat. Sin. 1997, 7: 875–892.
MATH Google Scholar
Tejada A, Dekker AJ: The role of Poisson’s binomial distribution in the analysis of TEM images. Ultramicroscopy 2011, 111: 1553–1556. 10.1016/j.ultramic.2011.08.010
Article Google Scholar
Wang YH: On the number of successes in independent trials. Stat. Sin. 1993, 3: 295–312.
MATH Google Scholar
Renyi A: Probability Theory. Akad. Kiadó, Budapest; 1970.
Google Scholar
Le Cam L: An approximation theorem for the Poisson binomial distribution. Pac. J. Math. 1960, 10(4):1181–1197. 10.2140/pjm.1960.10.1181
Article MATH MathSciNet Google Scholar
Barbour AD, Holst L, Janson S: Poisson Approximation. Clarendon, Oxford; 1992.
MATH Google Scholar
Steele JM: Le Cam’s inequality and Poisson approximations. Am. Math. Mon. 1994, 101(1):48–54. 10.2307/2325124
Article MATH MathSciNet Google Scholar
Chen LHY: On the convergence of Poisson binomial to Poisson distribution. Ann. Probab. 1974, 2(1):178–180. 10.1214/aop/1176996766
Article MATH Google Scholar
Neammanee K: A nonuniform bound for the approximation of Poisson binomial by Poisson distribution. Int. J. Math. Math. Sci. 2003, 2003(48):3041–3046. 10.1155/S0161171203212229
Article MATH MathSciNet Google Scholar
Ross SM: Introduction to Probability Models. 9th edition. Elsevier, Amsterdam; 2007.
Google Scholar
Trotter HF: An elementary proof of the central limit theorem. Arch. Math. 1959, 10: 226–234. 10.1007/BF01240790
Article MATH MathSciNet Google Scholar
Hung TL: On a probability metric based on Trotter operator. Vietnam J. Math. 2007, 35(1):22–33.
Google Scholar
Kruglov VM, Korolev VY: Limit Theorems for Random Sums. Moscow University Press, Moscow; 1990. (in Russian)
MATH Google Scholar
Gnedenko BV, Korolev VY: Random Summation: Limit Theorems and Applications. CRC Press, Boca Raton; 1996.
MATH Google Scholar

Download references

Acknowledgements

Dedicated to Professor Nguyen Duy Tien on the occasion of his 70th birthday.

The authors wish to express their gratitude to the referees for valuable remarks and comments improving the previous version of this paper. This work is supported by the Vietnam National Foundation For Science and Technology Development (NAFOSTED, Vietnam), grant 101.01-2010.02.

Author information

Authors and Affiliations

Faculty of Basic Science, University of Finance & Marketing (UFM), 306 Nguyen Trong Tuyen St., Tan Binh Dist., Ho Chi Minh City, Vietnam
Tran Loc Hung & Vu Thi Thao

Authors

Tran Loc Hung
View author publications
You can also search for this author in PubMed Google Scholar
Vu Thi Thao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tran Loc Hung.

Additional information

Competing interests

The authors declare that they have no completing interests.

Authors’ contributions

All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Hung, T.L., Thao, V.T. Bounds for the approximation of Poisson-binomial distribution by Poisson distribution. J Inequal Appl 2013, 30 (2013). https://doi.org/10.1186/1029-242X-2013-30

Download citation

Received: 15 August 2012
Accepted: 09 January 2013
Published: 24 January 2013
DOI: https://doi.org/10.1186/1029-242X-2013-30

Bounds for the approximation of Poisson-binomial distribution by Poisson distribution

Abstract

1 Introduction

2 Preliminaries

3 A bound of Poisson-binomial approximation

4 A bound of random Poisson-binomial approximation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords