A maximum principle for optimal control system with endpoint constraints

Wang, Weifeng; Liu, Bin

doi:10.1186/1029-242X-2012-231

Research
Open access
Published: 12 October 2012

A maximum principle for optimal control system with endpoint constraints

Weifeng Wang¹ &
Bin Liu¹

Journal of Inequalities and Applications volume 2012, Article number: 231 (2012) Cite this article

2381 Accesses
Metrics details

Abstract

Pontryagin’s maximum principle for an optimal control system governed by an ordinary differential equation with endpoint constraints is proved under the assumption that the control domain has no linear structure. We also obtain the variational equation, adjoint equation and Hamilton system for our problem.

MSC:65K10, 34H05, 93C15.

1 Introduction

Optimal control problems have been studied for a long time and have a lot of practical applications in the fields such as physics, biology and economics, etc. Hamilton systems are derived from Pontryagin’s maximum principle, which is known as a necessary condition for optimality. Many results have been obtained both for finite and infinite dimensional control systems such as [1–5]. Regarding the state constraint problems, lots of results are also obtained. For example, readers can refer to [6, 7] and the references therein.

To our best knowledge, to derive the necessary conditions of Pontryagin’s maximum principle type for optimal control problems, there are two main perturbation methods. When the control domain is convex, we often use the convex perturbation. When the control domain is non-convex and does not have any linear structure, we usually use the spike perturbation. Many relevant results have been obtained; see [3, 6, 8–10] and the references therein. The two methods have their advantages and disadvantages. The convex variational needs the control domain being convex, but in reality it is not always satisfied. And the spike variational needs more regularity for the coefficients and the solutions to the state equations, especially in the stochastic case.

In 2010, Lou [7] introduced a new method to study the necessary and sufficient conditions of optimal control problems in the absence of linear structure for the deterministic case. The author gave a local linearization of the optimal control problem along the optimal control, and transformed the original problem into a new relaxed control problem. Moreover, he proved the equivalence of the two problems in some sense. Being directly inspired by [7], we are also interested in applying this method to an endpoint constraints optimal control system, which is also in the absence of linear structure. Also, Pontryagin’s maximum principle is obtained for our problem.

The rest of this paper is organized as follows. Section 2 begins with a general formulation of our state constraints optimal control problem and the local linearization of the problem is given. In Section 3, we give our main result and its proof. Moreover, we obtain the variational equation, adjoint equation and Hamilton system for our optimal control system.

2 Preliminaries

We consider the controlled ordinary differential equation in $R^{n}$

{\begin{matrix} \dot{x} (t) = f (t, x (t), u (t)), in [0, T], \\ x (0) = x_{0}, \end{matrix}

(2.1)

with the cost functional

J (x_{0}, u (\cdot)) = \int_{0}^{T} l (t, x (t), u (t)) d t,

(2.2)

where $T > 0$ is a given constant, $x_{0} \in R^{n}$ is a decision variable and $u (\cdot) \in U [0, T]$ with

U [0, T] = {v : [0, T] \to V | v (\cdot) is measurable},

where V is a non-convex set in $R^{k}$ .

Let $U_{ad} [0, T]$ be the set of all elements $(x_{0}, u (\cdot)) \in S_{1} \times U [0, T]$ satisfying

x (T) \in S_{2},

where $S_{1}$ and $S_{2}$ are closed convex subsets of $R^{n}$ . Let $S = S_{1} \times S_{2}$ .

Then the optimal control problem can be stated as follows.

Problem 2.1 Find a pair $({\bar{x}}_{0}, \bar{u} (\cdot)) \in U_{ad} [0, T]$ such that

J ({\bar{x}}_{0}, \bar{u} (\cdot)) = inf_{(x_{0}, u (\cdot)) \in U_{ad} [0, T]} J (x_{0}, u (\cdot)) .

(2.3)

Any $({\bar{x}}_{0}, \bar{u} (\cdot)) \in U_{ad} [0, T]$ satisfying the above identity is called an optimal control, and the corresponding state $x (\cdot; {\bar{x}}_{0}, \bar{u} (\cdot)) ≜ \bar{x} (\cdot)$ is called an optimal trajectory; $(\bar{x} (\cdot), \bar{u} (\cdot))$ is called an optimal pair.

Let the following hypotheses hold:

( $H_{1}$ ) The metric space $(V, d)$ is separable, and d is the usual metric in $R^{k}$ .

( $H_{2}$ ) Functions $f = {(f^{1}, f^{2}, \dots, f^{n})}^{⊤} : [0, T] \times R^{n} \times V \to R^{n}$ and $l : [0, T] \times R^{n} \times V \to R$ are measurable in t, continuous in $(x, u)$ and continuously differentiable in x, where ⊤ denotes the transpose of a matrix. Moreover, there exists a constant $L > 0$ such that

{\begin{matrix} | f (t, x_{1}, u) - f (t, x_{2}, u) | \leq L | x_{1} - x_{2} |, \\ | l (t, x_{1}, u) - l (t, x_{2}, u) | \leq L | x_{1} - x_{2} |, \\ | f (t, 0, u) | \leq L, | l (t, 0, u) | \leq L, \\ \forall (t, x_{1}, x_{2}, u) \in [0, T] \times R^{n} \times R^{n} \times V, \end{matrix}

(2.4)

where $| \cdot |$ denotes the usual Euclidean norm.

From the above conditions, it is easy to know that the state equation (2.1) has a unique solution $x (\cdot; x_{0}, u (\cdot))$ for any $(x_{0}, u (\cdot)) \in U [0, T]$ .

Let $({\bar{x}}_{0}, \bar{u} (\cdot)) \in U_{ad} [0, T]$ be a minimizer of $J (x_{0}, u (\cdot))$ over $U_{ad} [0, T]$ , and we linearize $U_{ad} [0, T]$ along $(x_{0}, \bar{u} (\cdot))$ (for any fixed $x_{0}$ ) in the following manner. Define

M_{ad} [0, T] ≜ {x_{0} \times ((1 - α) δ_{\bar{u} (\cdot)} + α δ_{u (\cdot)}) | α \in [0, 1], (x_{0}, u (\cdot)) \in U_{ad} [0, T]},

where $δ_{u (\cdot)}$ denotes the Dirac measure at $u (\cdot)$ on $U [0, T]$ . Let $σ (\cdot) = (1 - α) δ_{\bar{u} (\cdot)} + α δ_{u (\cdot)}$ , then it is easy to see that $(x_{0}, σ (\cdot)) \in M_{ad} [0, T]$ . Now, we define

f (t, x, σ (t)) \equiv \int_{U} f (t, x, v) σ (t) d v = (1 - α) f (t, x, \bar{u} (t)) + α f (t, x, u (t)),

(2.5)

and similarly,

l (t, x, σ (t)) \equiv \int_{U} l (t, x, v) σ (t) d v = (1 - α) l (t, x, \bar{u} (t)) + α l (t, x, u (t)) .

Then we define $x (\cdot) = x (\cdot; x_{0}, σ (\cdot))$ as the solution of the following equation:

{\begin{matrix} \dot{x} (t) = f (t, x (t), σ (t)), in [0, T], \\ x (0) = x_{0}, \end{matrix}

(2.6)

and the corresponding cost functional is

J (x_{0}, σ (\cdot)) = \int_{0}^{T} l (t, x (t, σ (t)), σ (t)) d t .

We can easily find that $x (\cdot; x_{0}, u (\cdot))$ and $J (x_{0}, u (\cdot))$ coincide with $x (\cdot; x_{0}, δ_{u (\cdot)})$ and $J (x_{0}, δ_{u (\cdot)})$ respectively. Thus, $U_{ad} [0, T]$ can be viewed as a subset of $M_{ad} [0, T]$ in the sense of identifying $(x_{0}, u (\cdot)) \in U_{ad} [0, T]$ and $(x_{0}, δ_{u (\cdot)}) \in M_{ad} [0, T]$ . Because the elements of $M_{ad} [0, T]$ are very simple, we need neither pose additional assumptions like that the control domain is compact as Warga [11] did nor introduce the relaxed control defined by finite-additive probability measure as Fattorini [12] did. Now, we can see that $M_{ad} [0, T]$ already has a linear structure at $({\bar{x}}_{0}, \bar{u} (\cdot))$ . First, we give some lemmas to show that $({\bar{x}}_{0}, δ_{\bar{u} (\cdot)})$ is a minimizer of $J (x_{0}, σ (\cdot))$ over $M_{ad} [0, T]$ .

Lemma 2.2 (Lou [7])

Let $(H_{1})$ - $(H_{2})$ hold. Then there exists a positive constant $C > 0$ , such that for any $(x_{0}, σ (\cdot)) \in M_{ad} [0, T]$ and $t_{1}, t_{2} \in [0, T]$ ,

{\begin{matrix} {∥ x (\cdot; x_{0}, σ (\cdot)) ∥}_{C [0, T]} \leq C, \\ | x (t_{1}; x_{0}, σ (\cdot)) - x (t_{2}; x_{0}, σ (\cdot)) | \leq C | t_{1} - t_{2} | . \end{matrix}

(2.7)

Lemma 2.3 (Lou [7])

Let $(H_{1})$ - $(H_{2})$ hold and $({\bar{x}}_{0}, \bar{u} (\cdot))$ be a minimizer of $J (x_{0}, u (\cdot))$ over $U_{ad} [0, T]$ . Then $({\bar{x}}_{0}, δ_{\bar{u} (\cdot)})$ is a minimizer of $J (x_{0}, σ (\cdot))$ over $M_{ad} [0, T]$ .

Remark 2.4 We can see that Lemma 2.3 shows the equivalence of the above two problems. And this lemma is essential for our following maximum principle.

3 Pontryagin’s maximum principle

In this section, we give our main result first and then prove it. Let $f_{x}$ denote the derivative of f on x, and others can be defined in the same way. $〈 \cdot, \cdot 〉$ denotes the inner product in $R^{n}$ .

Theorem 3.1 (Pontryagin’s maximum principle)

We assume ( $H_{1}$ )-( $H_{2}$ ) hold. Let $(\bar{x} (\cdot), \bar{u} (\cdot))$ be a solution of the optimal control problem (2.1). Then there exists a nontrivial pair $(ψ^{0}, p (\cdot)) \in R \times C^{1} ([0, T]; R^{n})$ , i.e., $(ψ^{0}, p (\cdot)) \neq 0$ such that

(3.1)

(3.2)

(3.3)

(3.4)

where for any $(t, x, u, p, ψ^{0}) \in [0, T] \times R^{n} \times V \times R^{n} \times R$ ,

H (t, x, u, p, ψ^{0}) = 〈 f (t, x, u), p 〉 + ψ^{0} l (t, x, u),

(3.5)

and we also have the following Hamilton system:

{\begin{matrix} \dot{x} (t) = H_{p} (t, x (t), u (t), p (t), ψ^{0}), a.e. t \in [0, T], \\ \dot{p} (t) = - H_{x} (t, x (t), u (t), p (t), ψ^{0}), a.e. t \in [0, T], \\ x (0) = x_{0}, p (T) = - \bar{ψ}, \\ H (t, x (t), u (t), p (t), ψ^{0}) \\ = {max}_{(x_{0}, u (\cdot)) \in U_{ad} [0, T]} H (t, x (t), u (t), p (t), ψ^{0}), a.e. t \in [0, T], \end{matrix}

(3.6)

where $\bar{ψ}$ will be defined in the following part.

We recall that under the conditions $(H_{1})$ - $(H_{2})$ , for any $(x_{0}, u (\cdot)) \in R^{n} \times U [0, T]$ , the state equation (2.1) admits a unique solution $x (\cdot)$ with $x (0) = x_{0}$ . So, the cost functional is uniquely determined by $(x_{0}, u (\cdot))$ . In the sequel, we denote the unique solution of (2.1) with $x (0) = x_{0}$ by $x (\cdot; x_{0}, u (\cdot))$ . Now, let $({\bar{x}}_{0}, \bar{u} (\cdot)) \in U_{ad} [0, T]$ be an optimal control. We denote $\bar{x} (0) = {\bar{x}}_{0}$ . Without loss of generality, we may assume that $J ({\bar{x}}_{0}, \bar{u} (\cdot)) = 0$ ; otherwise, we may consider the optimal control problem with a cost functional of $J (x_{0}, u (\cdot)) - J ({\bar{x}}_{0}, \bar{u} (\cdot))$ .

Now, we give some definitions and lemmas for Theorem 3.1.

First, we define a penalty functional, via which, for convenience, we can transform the original problem to another one called the approximate problem, which has no endpoint constraint.

Let us introduce some notations. For all $(x_{1}, u_{1} (\cdot)), (x_{2}, u_{2} (\cdot)) \in R^{n} \times U [0, T]$ , we define

\bar{d} ((x_{1}, u_{1} (\cdot)), (x_{2}, u_{2} (\cdot))) = | x_{1} - x_{2} | + d (u_{1} (\cdot), u_{2} (\cdot)),

(3.7)

where $d (u_{1} (\cdot), u_{2} (\cdot))$ denote the measure of ${t \in [0, T] | u_{1} (t) \neq u_{2} (t)}$ (it obviously is a metric).

For $\forall ε > 0$ and $\forall (x_{0}, u (\cdot)) \in R^{n} \times U [0, T]$ , the penalty functional is defined as follows:

J_{ε} (x_{0}, u (\cdot)) = {d_{S}^{2} (x_{0}, x (T; x_{0}, u (\cdot))) + {[{(J (x_{0}, u (\cdot)) + ε)}^{+}]}^{2}}^{\frac{1}{2}},

(3.8)

where ${(a)}^{+} = max {0, a}$ , and for any $(x_{0}, x_{1}) \in R^{n} \times R^{n}$ ,

d_{S} (x_{0}, x_{1}) = d ((x_{0}, x_{1}), S) ≜ inf_{(y_{0}, y_{1}) \in S} {{| y_{0} - x_{0} |}^{2} + {| y_{1} - x_{1} |}^{2}}^{\frac{1}{2}} .

(3.9)

Obviously, $d_{S}$ is a convex function and it is Lipschitz continuous with the Lipschitz constant being 1. We define the subdifferential of the function $d_{S}$ as follows:

\begin{array}{rcl} \partial d_{S} (x_{0}, x_{1}) & = & {(a, b) \in R^{n} \times R^{n} | d_{S} (x_{0}^{'}, x_{1}^{'}) - d_{S} (x_{0}, x_{1}) \\ \geq 〈 a, x_{0}^{'} - x_{0} 〉 + 〈 b, x_{1}^{'} - x_{1} 〉, \forall (x_{0}^{'}, x_{1}^{'}) \in R^{n} \times R^{n}}, \end{array}

(3.10)

where $〈 \cdot, \cdot 〉$ denotes the inner product in $R^{n}$ . For more properties of subdifferential $\partial d_{S}$ , one can see p.146 in [6].

Lemma 3.2 Let $(H_{1})$ - $(H_{2})$ hold. Then there exists a constant $C > 0$ such that for all $(x_{0}, u (\cdot)), ({\hat{x}}_{0}, \hat{u} (\cdot)) \in R^{n} \times U [0, T]$ ,

{\begin{matrix} {sup}_{t \in [0, T]} | x (t; x_{0}, u (\cdot)) - x (t; {\hat{x}}_{0}, \hat{u} (\cdot)) | \\ \leq C (1 + | x_{0} | \lor | {\hat{x}}_{0} |) \bar{d} ((x_{0}, u (\cdot)), ({\hat{x}}_{0}, \hat{u} (\cdot))), \\ | J (x_{0}, u (\cdot)) - J ({\hat{x}}_{0}, \hat{u} (\cdot)) | \\ \leq C (1 + | x_{0} | \lor | {\hat{x}}_{0} |) \bar{d} ((x_{0}, u (\cdot)), ({\hat{x}}_{0}, \hat{u} (\cdot))) . \end{matrix}

(3.11)

Proof Denote $x (\cdot) = x (t; x_{0}, u (\cdot))$ and $\hat{x} (\cdot) = x (t; {\hat{x}}_{0}, \hat{u} (\cdot))$ . From the state equation (2.1) and condition $(H_{2})$ , we have

\begin{array}{rcl} | x (t) | & \leq & | x_{0} | + \int_{0}^{t} | f (s, x (s), u (s)) | d s \\ \leq & | x_{0} | + \int_{0}^{t} (| f (s, 0, u (s)) | + | f (s, x (s), u (s)) - f (s, 0, u (s)) |) d s \\ \leq & | x_{0} | + \int_{0}^{t} (L + L | x (s) |) d s . \end{array}

By Gronwall’s inequality, it follows that

| x (t) | \leq C (1 + | x_{0} |), \forall t \in [0, T] .

Similarly,

| \hat{x} (t) | \leq C (1 + | {\hat{x}}_{0} |), \forall t \in [0, T],

where the constant C is independent of controls $u (\cdot)$ and $\hat{u} (\cdot)$ , and may be different at different places throughout this paper. Further, noting the definition of $\bar{d}$ , we have

\begin{array}{rcl} | x (t) - \hat{x} (t) | & \leq & C | x_{0} - {\hat{x}}_{0} | + C \int_{0}^{t} | x (s) - \hat{x} (s) | d s \\ + \int_{0}^{t} | f (s, x (s), u (s)) - f (s, x (s), \hat{u} (s)) | d s \\ \leq & C | x_{0} - {\hat{x}}_{0} | + C (1 + | x_{0} | \lor | {\hat{x}}_{0} |) d (u (\cdot), \hat{u} (\cdot)) + C \int_{0}^{t} | x (s) - \hat{x} (s) | d s \\ \leq & C (1 + | x_{0} | \lor | {\hat{x}}_{0} |) \bar{d} ((x_{0}, u (\cdot)), ({\hat{x}}_{0}, \hat{u} (\cdot))) + C \int_{0}^{t} | x (s) - \hat{x} (s) | d s . \end{array}

Thus, by Gronwall’s inequality, we get

\begin{array}{rcl} | x (t) - \hat{x} (t) | & \leq & C (1 + | x_{0} | \lor | {\hat{x}}_{0} |) \bar{d} ((x_{0}, u (\cdot)), ({\hat{x}}_{0}, \hat{u} (\cdot))) e^{C t} \\ \leq & C (1 + | x_{0} | \lor | {\hat{x}}_{0} |) \bar{d} ((x_{0}, u (\cdot)), ({\hat{x}}_{0}, \hat{u} (\cdot))) e^{C T}, \forall t \in [0, T] . \end{array}

Taking supremum in the above inequality, the first inequality in (3.11) is obtained. The second inequality can be proved similarly. □

By the definition of $J_{ε} (x_{0}, u (\cdot))$ and Lemma 3.2, we can easily obtain the following result.

Corollary 3.3 The functional $J_{ε} (x_{0}, u (\cdot))$ is continuous on the space $(R^{n} \times U [0, T], \bar{d})$ .

Remark 3.4 By the definition of $J_{ε} (x_{0}, u (\cdot))$ (3.8) and Corollary 3.3, we can see that

{\begin{matrix} J_{ε} (x_{0}, u (\cdot)) > 0, \forall (x_{0}, u (\cdot)) \in R^{n} \times U [0, T], \\ J_{ε} ({\bar{x}}_{0}, \bar{u} (\cdot)) = ε \leq {inf}_{R^{n} \times U [0, T]} J_{ε} (x_{0}, u (\cdot)) + ε . \end{matrix}

(3.12)

Thus, by the Ekeland variational principle (see p.135 in [6] for details), there exists a pair $(x_{0}^{ε}, u^{ε} (\cdot)) \in R^{n} \times U [0, T]$ , such that

(3.13)

(3.14)

The above implies that if we let $x^{ε} (\cdot) = x (\cdot; x_{0}^{ε}, u^{ε} (\cdot))$ , then $(x^{ε} (\cdot), u^{ε} (\cdot))$ is an optimal pair for the problem where the state equation is (2.1) and the cost functional is $J_{ε} (x_{0}, u (\cdot))$ .

Now, we derive the necessary conditions for $(x^{ε} (\cdot), u^{ε} (\cdot))$ .

Lemma 3.5 Let $(x^{ε} (\cdot), u^{ε} (\cdot))$ be an optimal pair for the problem where the state equation is (2.1) and the cost functional is $J_{ε} (x_{0}, u (\cdot))$ , then there exists a nontrivial triple $({\bar{ψ_{ε}}}^{0}, \bar{φ_{ε}}, \bar{ψ_{ε}}) \in R \times R^{n} \times R^{n}$ such that

{| \bar{φ_{ε}} |}^{2} + {| \bar{ψ_{ε}} |}^{2} + {| {\bar{ψ_{ε}}}^{0} |}^{2} = 1,

and

- \sqrt{ε} (| η | + C) \leq 〈 \bar{φ_{ε}}, η 〉 + 〈 \bar{ψ_{ε}}, X_{ε} (T) 〉 + {\bar{ψ_{ε}}}^{0} Y_{ε},

where η, $X_{ε} (\cdot)$ , $Y_{ε}$ will be defined in the following proof.

Proof Similar to Section 2, we linearize $U [0, T]$ along $u^{ε} (\cdot)$ and denote $σ^{α, ε} (\cdot) = (1 - α) δ_{u^{ε} (\cdot)} + α δ_{u (\cdot)}$ , $x^{α, ε} (\cdot) = x (\cdot; x_{0}^{ε} + α η, σ^{α, ε} (\cdot))$ , where $η \in B_{1} (0) \subset R^{n}$ ( $B_{1} (0)$ denotes the ball whose center is 0 and radius is 1). Recall that $x^{ε} (\cdot) = x (\cdot; x_{0}^{ε}, u^{ε} (\cdot))$ , then we derive the variational equation.

From (2.5), we have

\begin{array}{rcl} X^{α, ε} (t) & ≜ & \frac{x^{α, ε} (t) - x^{ε} (t)}{α} \\ = & \int_{0}^{t} [\frac{f (s, x^{α, ε} (s), u^{ε} (s)) - f (s, x^{ε} (s), u^{ε} (s))}{α} \\ + f (s, x^{α, ε} (s), u (s)) - f (s, x^{α, ε} (s), u^{ε} (s))] d s \\ = & \int_{0}^{t} [\int_{0}^{1} f_{x} {(s, x^{ε} (s) + τ (x^{α, ε} (s) - x^{ε} (s)), u^{ε} (s))}^{⊤} d τ X^{α, ε} (s) \\ + f (s, x^{α, ε} (s), u (s)) - f (s, x^{α, ε} (s), u^{ε} (s))] d s, \end{array}

where $f_{x} (t, x, u)$ denotes the transpose of the Jacobi matrix of f on x. By virtue of $(H_{2})$ , and using the convergence of $x^{α, ε} (t) \to x^{ε} (t)$ (see the proof of Lemma 2.2 in [7]), we can easily obtain

X^{α, ε} (t) \to X^{ε} (t), as α \to 0, \forall t \in [0, T],

where $X^{ε} (t)$ is the solution of the following variational equation:

{\begin{matrix} {\dot{X}}^{ε} (t) = f_{x} {(t, x^{ε} (t), u^{ε} (t))}^{⊤} X^{ε} (t) + f (t, x^{ε} (t), u (t)) - f (t, x^{ε} (t), u^{ε} (t)), \\ X^{ε} (0) = η . \end{matrix}

(3.15)

From the definition of $σ^{α, ε} (\cdot)$ , by Lemma 2.3, for any fixed α, we can define

\tilde{d} (σ^{α, ε} (\cdot), δ_{u^{ε} (\cdot)}) = α d (u (\cdot), u^{ε} (\cdot)),

and

\tilde{d} ((x_{1}, σ^{α, ε} (\cdot)), (x_{2}, δ_{u^{ε} (\cdot)})) = | x_{1} - x_{2} | + \tilde{d} (σ^{α, ε} (\cdot), δ_{u^{ε} (\cdot)}) .

So, by virtue of (3.14) and Lemma 2.3, we have

J_{ε} (x_{0}^{ε} + α η, σ^{α, ε} (\cdot)) + \sqrt{ε} \tilde{d} ((x_{0}^{ε} + α η, σ^{α, ε} (\cdot)), (x_{0}^{ε}, δ_{u^{ε} (\cdot)})) \geq J_{ε} (x_{0}^{ε}, δ_{u^{ε} (\cdot)}) .

(3.16)

It means that when $α \to 0$ ,

\begin{array}{rcl} - \sqrt{ε} (| η | + d (u (\cdot), u^{ε} (\cdot))) & \leq & lim_{α \to 0} \frac{1}{α} (J_{ε} (x_{0}^{ε} + α η, σ^{α, ε} (\cdot)) - J_{ε} (x_{0}^{ε}, δ_{u^{ε} (\cdot)})) \\ = & \frac{1}{2 J_{ε} (x_{0}^{ε}, δ_{u^{ε} (\cdot)})} lim_{α \to 0} \frac{1}{α} {d_{S}^{2} (x_{0}^{ε} + α η, x^{α, ε} (T)) - d_{S}^{2} (x_{0}^{ε}, x^{ε} (T)) \\ + {[{(J (x_{0}^{ε} + α η, σ^{α, ε} (\cdot)) + ε)}^{+}]}^{2} - {[{(J (x_{0}^{ε}, δ_{u^{ε} (\cdot)}) + ε)}^{+}]}^{2}} . \end{array}

Note that the map $(x_{0}, x_{1}) \to d_{S}^{2} (x_{0}, x_{1})$ is continuously differentiable on $R^{n} \times R^{n}$ , then we get

(3.17)

with $(a_{ε}, b_{ε}) \in \partial d_{S} (x_{0}^{ε}, x^{ε} (T))$ and

{| a_{ε} |}^{2} + {| b_{ε} |}^{2} = 1 .

(3.18)

Similarly, we can obtain

(3.19)

where

\begin{array}{rcl} Y_{ε} & = & lim_{α \to 0} \frac{J ((1 - α) δ_{u^{ε} (\cdot)} + α δ_{u (\cdot)}) - J (δ_{u^{ε} (\cdot)})}{α} \\ = & lim_{α \to 0} \int_{0}^{T} [\frac{l (t, x^{α, ε} (t), u^{ε} (t)) - l (t, x^{ε} (t), u^{ε} (t))}{α} \\ + l (t, x^{α, ε} (t), u (t)) - l (t, x^{α, ε} (t), u^{ε} (t))] d t \\ = & lim_{α \to 0} \int_{0}^{T} [\int_{0}^{1} 〈 l_{x} (t, x^{ε} (t) + s (x^{α, ε} (t) - x^{ε} (t)), u^{ε} (t)), X^{α, ε} (t) 〉 d s \\ + l (t, x^{α, ε} (t), u (t)) - l (t, x^{α, ε} (t), u^{ε} (t))] d t \\ = & \int_{0}^{T} [〈 l_{x} (t, x^{ε} (t), u^{ε} (t)), X^{ε} (t) 〉 + l (t, x^{ε} (t), u (t)) - l (t, x^{ε} (t), u^{ε} (t))] d t . \end{array}

Combining (3.17) and (3.19), by sending $α \to 0$ , we obtain

- \sqrt{ε} (| η | + C) \leq 〈 \bar{φ_{ε}}, η 〉 + 〈 \bar{ψ_{ε}}, X_{ε} (T) 〉 + {\bar{ψ_{ε}}}^{0} Y_{ε},

(3.20)

where

{\begin{matrix} (\bar{φ_{ε}}, \bar{ψ_{ε}}) = \frac{d_{S} (x_{0}^{ε}, x^{ε} (T))}{J_{ε} (x_{0}^{ε}, δ_{u^{ε} (\cdot)})} (a_{ε}, b_{ε}), \\ {\bar{ψ_{ε}}}^{0} = \frac{{(J (x_{0}^{ε}, δ_{u^{ε} (\cdot)}) + ε)}^{+}}{J_{ε} (x_{0}^{ε}, δ_{u^{ε} (\cdot)})} . \end{matrix}

(3.21)

From (3.18), it is obvious that

{| \bar{φ_{ε}} |}^{2} + {| \bar{ψ_{ε}} |}^{2} + {| {\bar{ψ_{ε}}}^{0} |}^{2} = 1 .

(3.22)

□

Remark 3.6 By the definition of subdifferential of the function $d_{S} (\cdot)$ , for any $(x_{1}, x_{2}) \in S$ , we have

〈 \bar{φ_{ε}}, x_{1} - x_{0}^{ε} 〉 + 〈 \bar{ψ_{ε}}, x_{2} - x^{ε} (T) 〉 \leq 0 .

(3.23)

In order to pass to the limit as $ε \to 0$ , we give the following lemma first, which is necessary for the derivation of our maximum principle.

Lemma 3.7 It holds that

lim_{ε \to 0} [| x_{0}^{ε} - {\bar{x}}_{0} | + sup_{t \in [0, T]} | X^{ε} (t) - X (t) | + | Y_{ε} - Y |] = 0,

where $X (t)$ and Y satisfy the following equations:

(3.24)

(3.25)

Proof By (3.13) and the definition of $\bar{d}$ , it is easy to see that

lim_{ε \to 0} | x_{0}^{ε} - {\bar{x}}_{0} | = 0 .

(3.26)

From (3.15) and (3.24), we have

\begin{array}{rcl} | X^{ε} (t) - X (t) | & \leq & \int_{0}^{t} | f_{x} {(s, x^{ε} (s), u^{ε} (s))}^{⊤} X^{ε} (s) + f (s, x^{ε} (s), u (s)) - f (s, x^{ε} (s), u^{ε} (s)) \\ - (f_{x} {(s, \bar{x} (s), \bar{u} (s))}^{⊤} X (s) + f (s, \bar{x} (s), u (s)) - f (s, \bar{x} (s), \bar{u} (s))) | d s \\ \leq & \int_{0}^{t} [| f_{x} {(s, x^{ε} (s), u^{ε} (s))}^{⊤} | | X^{ε} (s) - X (s) | \\ + | f_{x} {(s, x^{ε} (s), u^{ε} (s))}^{⊤} - f_{x} {(s, \bar{x} (s), \bar{u} (s))}^{⊤} | | X (s) | \\ + | f (s, x^{ε} (s), u (s)) - f (s, \bar{x} (s), u (s)) | \\ + | f (s, x^{ε} (s), u^{ε} (s)) - f (s, \bar{x} (s), \bar{u} (s)) |] d s . \end{array}

(3.27)

By virtue of Lemma 3.2, then

(3.28)

Now, combining the condition ( $H_{2}$ ) and (3.27), we deduce that

(3.29)

Then by Gronwall’s inequality, we obtain

lim_{ε \to 0} sup_{t \in [0, T]} | X^{ε} (t) - X (t) | = 0 .

(3.30)

Similarly, we can prove that

lim_{ε \to 0} | Y_{ε} - Y | = 0 .

(3.31)

Thus, the proof of this lemma is completed. □

Remark 3.8 Now, we can let $ε \to 0$ . By (3.23), it is obvious that for any $(x_{1}, x_{2}) \in S$ , we have

〈 \bar{φ_{ε}}, x_{1} - {\bar{x}}_{0} 〉 + 〈 \bar{ψ_{ε}}, x_{2} - \bar{x} (T) 〉 \leq {({| x_{0}^{ε} - {\bar{x}}_{0} |}^{2} + {| x^{ε} (T) - \bar{x} (T) |}^{2})}^{\frac{1}{2}} ≜ δ_{ε} \to 0 .

(3.32)

Hence, combining (3.20) and (3.32), for any $(x_{1}, x_{2}) \in S$ , we obtain

(3.33)

From (3.22), we can find a subsequence (still denoted by itself) such that

(\bar{φ_{ε}}, \bar{ψ_{ε}}, {\bar{ψ_{ε}}}^{0}) \to (\bar{φ}, \bar{ψ}, {\bar{ψ}}^{0}) \neq 0 .

(3.34)

Now, by Lemma 3.7 and sending $ε \to 0$ in (3.33), for any $(x_{1}, x_{2}) \in S$ , $u (\cdot) \in U [0, T]$ and $η \in B_{1} (0)$ , we have

〈 \bar{φ}, η - (x_{1} - {\bar{x}}_{0}) 〉 + 〈 \bar{ψ}, X (T) - (x_{2} - \bar{x} (T)) 〉 + {\bar{ψ}}^{0} Y \geq 0 .

(3.35)

Based on the above preparation, now we start to prove Theorem 3.1 by the duality relations.

Proof of Theorem 3.1 Let $p (t)$ solve the adjoint equation

{\begin{matrix} \dot{p} (t) = - f_{x} (t, \bar{x} (t), \bar{u} (t)) p (t) - ψ^{0} l_{x} (t, \bar{x} (t), \bar{u} (t)), \\ p (T) = - \bar{ψ}, \end{matrix}

(3.36)

where $ψ^{0} = - {\bar{ψ}}^{0}$ .

So, by virtue of (3.35), we get

〈 \bar{φ}, x_{1} - {\bar{x}}_{0} - η 〉 + 〈 p (T), X (T) - (x_{2} - \bar{x} (T)) 〉 + ψ^{0} Y \leq 0 .

(3.37)

Simultaneously, we have the following duality equality:

(3.38)

where H is the Hamilton function defined in (3.5). Now, setting $η = 0$ and $(x_{1}, x_{2}) = ({\bar{x}}_{0}, \bar{x} (T))$ in (3.37), we obtain

\int_{0}^{T} [H (t, \bar{x} (t), u (t), p (t), ψ^{0}) - H (t, \bar{x} (t), \bar{u} (t), p (t), ψ^{0})] d t \leq 0 .

As U is separable and $(x_{0}, u (\cdot)) \in U_{ad} [0, T]$ is arbitrary, the above inequality can be written as

H (t, \bar{x} (t), u (t), p (t), ψ^{0}) - H (t, \bar{x} (t), \bar{u} (t), p (t), ψ^{0}) \leq 0, a.e. t \in [0, T] .

(3.39)

Thus,

(3.40)

Next, by taking $u (t) = \bar{u} (t)$ and $(x_{1}, x_{2}) = ({\bar{x}}_{0}, \bar{x} (T))$ in (3.37), using the duality equality (3.38), we get

〈 \bar{φ}, η 〉 \geq 〈 p (T), X (T) 〉 + ψ^{0} Y = 〈 p (0), η 〉, \forall η \in B_{1} (0) .

(3.41)

Thus, $\bar{φ} = p (0)$ . Then taking $η = 0$ and $u (t) = \bar{u} (t)$ in (3.37), combining with the duality equality (3.38), we obtain the transversality condition

〈 p (0), x_{1} - {\bar{x}}_{0} 〉 - 〈 p (T), x_{2} - \bar{x} (T) 〉 \leq 0, \forall (x_{1}, x_{2}) \in S .

(3.42)

Finally, we claim that $(ψ^{0}, p (\cdot)) \neq 0$ . Otherwise, in particular, we have that

\bar{ψ} = - p (T) = 0, \bar{φ} = p (0) = 0 .

(3.43)

Note that $ψ^{0} = 0$ in this case, so this gives a contradiction to (3.22). The Hamilton system (3.6) is obvious. Then the proof of the maximum principle is completed. □

Remark 3.9 We give some important special cases of our control problem.

(i)
The control problem with fixed endpoints. In this case, the constraint set is $S = {(x_{0}, x_{T})}$ and the endpoint constraint becomes of the following form:
$x (0) = x_{0}, x (T) = x_{T} .$
(ii)
The control problem with a terminal state constraint, i.e.,
$x (0) = x_{0}, x (T) \in S_{2} .$

References

Barbu V: Optimal of Variational Inequalities. Pitman, London; 1984.
MATH Google Scholar
Barbu V, Da Prato G: Hamilton Jacobi Equations in Hilbert Spaces. Pitman, London; 1983.
MATH Google Scholar
Barbu V, Precupanu T: Convexity and Optimization in Banach Spaces. Reidel, Dordrecht; 1986.
MATH Google Scholar
Capuzzo-Dolcetta I, Evans LC: Optimal switching for ordinary differential equations. SIAM J. Control Optim. 1984, 22: 143–161. 10.1137/0322011
Article MathSciNet MATH Google Scholar
Pontryagin LS: The maximum principle in the theory of control processes. Proc. 1st. Congress IFAC, Moscow 1960.
Google Scholar
Li XJ, Yong YM: Optimal Control Theory for Infinite Dimensional Systems. Birkhäuser, Boston; 1995.
Book Google Scholar
Lou HW: Second-order necessary/sufficient conditions for optimal control problems in the absence of linear structure. Discrete Contin. Dyn. Syst., Ser. B 2010, 14: 1445–1464.
Article MathSciNet MATH Google Scholar
Casas E: Control of an elliptic problem with pointwise state constraints. SIAM J. Control Optim. 1986, 24: 1309–1318. 10.1137/0324078
Article MathSciNet MATH Google Scholar
Casas E, Fernández LA: Optimal control of semilinear elliptic equations with pointwise constraints on the gradient of the state. Appl. Math. Optim. 1993, 27: 35–56. 10.1007/BF01182597
Article MathSciNet MATH Google Scholar
Yong JM, Zhou XY: Stochastic Controls: Hamiltonian System and HJB Equations. Springer, New York; 1999.
Book MATH Google Scholar
Warga J: Optimal Control of Differential and Functional Equations. Academic Press, New York; 1972.
MATH Google Scholar
Fattorini HO: Relaxed controls in infinite dimensional systems. Int. Ser. Numer. Math. 1991, 100: 115–128.
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

We would like to thank the referees for the useful suggestions which helped to improve the previous version of this paper. This work was partially supported by NNSF of China (Grant No. 11171122).

Author information

Authors and Affiliations

School of Mathematics and Statistics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, P.R. China
Weifeng Wang & Bin Liu

Authors

Weifeng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Bin Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bin Liu.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors considered this problem and carried out the proof. We all also read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Wang, W., Liu, B. A maximum principle for optimal control system with endpoint constraints. J Inequal Appl 2012, 231 (2012). https://doi.org/10.1186/1029-242X-2012-231

Download citation

Received: 30 May 2012
Accepted: 28 September 2012
Published: 12 October 2012
DOI: https://doi.org/10.1186/1029-242X-2012-231

A maximum principle for optimal control system with endpoint constraints

Abstract

1 Introduction

2 Preliminaries

3 Pontryagin’s maximum principle

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords