- Research
- Open access
- Published:
A global QP-free algorithm for mathematical programs with complementarity constraints
Journal of Inequalities and Applications volume 2020, Article number: 213 (2020)
Abstract
In this paper, a primal–dual interior point QP-free algorithm for mathematical programs with complementarity constraints is presented. Firstly, based on Fischer–Burmeister function and smoothing techniques, the investigated problem is approximated by a smooth nonlinear constrained optimization problem. Secondly, combining with an effective penalty function technique and working set, a QP-free algorithm is proposed to solve the smooth constrained optimization problem. At each iteration, only two reduced linear equations with the same coefficient matrix are solved to obtain the search direction. Under some mild conditions, the proposed algorithm possesses global convergence. Finally, some numerical results are reported.
1 Introduction
In this paper, we discuss the following mathematical programming problem with complementarity constraints (MPCC for short):
where \(f:\mathbb{R}^{n+m}\rightarrow\mathbb{R}\), \(g=(g_{1},\ldots ,g_{m_{g}})^{\mathrm{T}}:\mathbb{R}^{n+m}\rightarrow\mathbb {R}^{{m_{g}}}\), \(F=(F_{1},\ldots,F_{m})^{\mathrm{T}}:\mathbb {R}^{n+m}\rightarrow\mathbb{R}^{m}\) are continuously differentiable functions. “\(F(x,y)\perp y\)” means that the vectors \(F(x,y)\) and y are perpendicular to each other.
MPCC (1) has a broad of applications in real world, such as engineering design, traffic transportation, game theory and so on. A detailed overview of MPCC applications can be found in [1] and the monographs [2–4].
Since MPCC (1) is a nonconvex optimization problem and the standard Mangasarian–Fromovitz constraint qualification (MFCQ) is violated at any feasible point, the well-developed algorithms for the standard nonlinear programs (for example, [5–18]) typically have severe difficulties if they are directly used to solve the MPCC (1). Hence, MPCC-tailed algorithms are desired.
It is well known that the QP-free method is one of the efficient methods for nonlinear programming (see [14, 15, 19–25] ). The nice properties of the QP-free method are as follows: (a) The search directions are determined only by solving systems of linear equations rather than solving QP-subproblems. As a consequence, the computational cost is decreased greatly. (b) Compared with the SQP method, the size of the investigated problem solved by QP-free method is larger. It is worth mentioning that the primal–dual interior point QP-free algorithm in [21] has an improvement, that is, the strict restriction that the Lagrangian Hessian estimate is uniformly positive is relaxed. It is noticed that the active set identifying technique is incorporated into the QP-free algorithms in [22, 23]. Consequently, the computational cost is further decreased and the numerical performance in [22] is encouraging.
Although there are many OP-free algorithms for nonlinear programming, to the best of our knowledge, there are few MPCC-tailed QP-free algorithms for the MPCC (1). Motivated by the ideas of the algorithms in [21, 24–27] and combining with smoothing techniques, we propose a primal–dual interior point QP-free algorithm for the MPCC (1). The proposed algorithm possesses the following nice properties:
-
(a)
The technique of active set is incorporated into the algorithm. As a consequence, the size of SLEs becomes smaller and the computational cost is decreased.
-
(b)
At each iteration, the search direction is obtained by solving two SLEs with the same coefficient matrix, which further decreases the computational cost.
-
(c)
The uniformly positive definiteness on the Lagrangian Hessian estimate \({H_{k}}\) is relaxed.
-
(d)
The algorithm possesses global convergence without assuming that the stationary points are isolated.
2 Preliminaries and reformulation
In this section, for completeness, we first restate some definitions and results about the MPCC (1), then deduce an approximation problem of the MPCC (1) by Fischer–Burmeister function and smoothing techniques.
Denoted by \(X_{0}\) the feasible set of the MPCC (1), and denoted by \({I^{\mathcal{C}}}=\{1,2,\ldots,m\}\) and \(I^{g}=\{ 1,2,\ldots,{m_{g}}\}\) are the index sets of the complementarity constraints and the inequality constraints, respectively.
Definition 1
([2])
Given \((x^{*},y^{*})\in X_{0}\), if
then we call that the lower-level strict complementarity (LLSC) is satisfied at \((x^{*},y^{*})\).
Definition 2
([2])
A point \((x^{*},y^{*})\in X_{0}\) is said to be a KKT stationary point of the MPCC (1) if there exists a KKT multiplier vector \((\lambda^{*},\omega^{*},\mu^{*})\in\mathbb{R}^{m+m+m_{g}}\) such that
where \(E_{m}\in\mathbb{R}^{m\times m}\) is an mth-order identity matrix. The vector \(((x^{*},y^{*}),\lambda^{*},\omega^{*},\mu^{*})\) is said to be a KKT stationary pair of the MPCC (1).
Proposition 1
([2])
Suppose that \((x^{*}, y^{*})\in X_{0}\)satisfies LLSC, then \(((x^{*},y^{*}),\lambda^{*},\omega^{*},\mu^{*})\)is a KKT stationary pair of the MPCC (1) if and only if there exists a vector \(\iota ^{*}=(\iota^{*}_{i},i\in I^{\mathcal{C}})\in\mathbb{R}^{m}\), such that
where \(V^{*}=\operatorname{diag}(F_{i}(x^{*},y^{*}), i\in I^{\mathcal{C}})\), \(Y^{*}=\operatorname{diag}(y^{*}_{i},i\in I^{\mathcal{C}})\).
It is well known that the Fischer–Burmeister function is a complementarity function, which is defined by
Obviously, ψ satisfies the following basic property:
And ψ is continuous differentiable in \(\{(a,b)\in\mathbb{R}^{2}| (a,b)\neq(0,0)\}\), namely
Let \(v=F(x,y)\), according to the property (3), the MPCC (1) is equivalently transformed into the following nonsmooth nonlinear optimization problem:
where \(\varPsi(y,v)=(\psi(y_{i},v_{i}),i\in{I^{\mathcal{C}}})\).
Obviously, the function ψ is not differentiable at the point \((0,0)\). Borrowing ideas from [28], we define the functions as follows:
where the index set \({I^{\mathcal{C}}}(y,v,\varepsilon)= \{i\in {I^{\mathcal{C}}}\mid\sqrt{y^{2}_{i}+v^{2}_{i}} <\varepsilon, \varepsilon >0 \}\). Moreover, define
so we have
For any \(\varepsilon>0\), the function \(\psi_{\varepsilon}(y_{i},v_{i})\) is differentiable, and it follows that
So the smooth problem
will be used as an approximation of the MPCC (1). Obviously, if \(I^{\mathcal{C}}(y^{*},v^{*},\varepsilon)=\emptyset\), then the problem (6) is equivalent with the problem (4). Under some mild conditions, \(I^{\mathcal {C}}(y^{*},v^{*},\varepsilon)=\emptyset\) is guaranteed, where \((x^{*},y^{*},v^{*})\) is an accumulation point of the iterative sequence \(\{ (x^{k},y^{k},v^{k})\}\) (see the proof of Theorem 1 in the sequel).
Define a penalty function \(f_{r}(x,y)\) for the problem (6) by
where \(r>0\) is a penalty parameter. Similar to [29], we can convert the problem (6) to the following smoothing optimization only with inequality constraints:
For simplicity, we use the following notations throughout this paper:
The following proposition shows the equivalence between the problem (6) and the problem (7).
Proposition 2
If \((z,\lambda^{r})\)is a KKT pair of the problem (7), and \(v-F(x,y)=0\), \(\varPsi_{\varepsilon}(y,v)=0\), then \((z,\lambda)\)with multiplier \(\lambda=\lambda^{r}-r\tilde{e}\)is a KKT pair of the problem (6), where \(\lambda ^{r}:=(\lambda^{r}_{F,I^{\mathcal{C}}},\lambda^{r}_{{{\varPsi_{\varepsilon}}},I^{\mathcal{C}}},\lambda^{r}_{g,I^{g}})\), \(\lambda:=(\lambda_{F,I^{\mathcal{C}}},\lambda_{{{\varPsi_{\varepsilon}}},I^{\mathcal{C}}},\lambda_{g,I^{g}})\), \(\tilde{e}=(1,\ldots,1_{{ m\textit{th}}},1,\ldots,1_{(2m)\textit{th}},0,\ldots ,0_{(2m+m_{g})\textit{th}})^{\mathrm{T}}\).
Proof
Since \((z,\lambda^{r})\) is a KKT pair of the problem (7), the vector pair \((z,\lambda ^{r})\) satisfies the following relations:
Note that
we get
Let \(\lambda_{F,I^{\mathcal{C}}}=\lambda^{r}_{F,I^{\mathcal{C}}}-re_{I^{\mathcal{C}}}\), \(\lambda_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}}=\lambda^{r}_{{\varPsi _{\varepsilon}},I^{\mathcal{C}}}-re_{I^{\mathcal{C}}}\), \(\lambda _{g,I^{g}}=\lambda^{r}_{g,I^{g}}\), we obtain
In view of \(v-F(x,y)=0\), \(\varPsi_{\varepsilon}(y,v)=0\), \(g(x,y)\leq0\), and
Let \(\lambda:=(\lambda_{F,I^{C}},\lambda_{\varPsi_{\varepsilon },I^{C}},\lambda_{g,I^{g}})=\lambda^{r}-r\tilde{e}\), so \((z,\lambda)\) is a KKT pair of the problem (6). □
3 Description of the algorithm
For the sake of theoretical analysis, we make a basic assumption throughout this paper.
Assumption 1
-
(1)
For any \((x,y)\in\mathbb{R}^{n+m}\), the matrix \(\nabla_{y}F(x,y)\) is a \(P_{0}\) matrix, i.e., all the principal minors of \(\nabla_{y}F(x,y)\) are nonnegative.
-
(2)
For any \((x,y)\in\mathbb{R}^{n+m}\), the submatrix \((\nabla _{y}F(x,y))_{J^{*}J^{*}}\) of the matrix \(\nabla_{y}F(x,y)\) is nonsingular, where the index set \(J^{*}=\{i\in I^{C} \mid\psi'_{\varepsilon ,y_{i}}(y_{i},v_{i})=0\}\).
In order to construct the coefficient matrix of linear equations conveniently, denote
and denote by \(A_{J}\) the gradient matrix of \(\varOmega_{J}\), that is,
where \(J\subseteq I^{g}\), the diagonal matrix
Define the matrix
Similar to the proof in [30, 31], we can prove that the following proposition is true.
Proposition 3
-
(1)
Suppose that the matrix U is nonsingular. Then the matrix \(A_{I(x,y)}\)is full of column rank, if and only if the matrix
$$\begin{aligned} \nabla_{x}g_{I(x,y)}(x,y)-\nabla_{x}F(x,y) \bigl(U^{-1} \bigr)_{m}\nabla_{y}g_{I(x,y)}(x,y) \end{aligned}$$is full of column rank, where \((U^{-1})_{m}\)is the mth-order principal submatrix of \(U^{-1}\), which consists of the first m rows and m columns of \(U^{-1}\).
-
(2)
Suppose that Assumption 1holds, then the matrix U is nonsingular.
Assumption 2
For any \(z\in\mathbb{R}^{n+2m}\), the matrix
is full of column rank.
Remark 1
According to Proposition 3, if Assumption 1 is true, then Assumption 2 is equivalent to the LICQ of the problem (7).
Based on Proposition 2, we know that if one can construct an efficient algorithm for the problem (7) and adjust penalty parameter r to force the iterate to asymptotically satisfy \(v-F(x,y)=0\), \(\varPsi _{\varepsilon}(y,v)=0\), then the solution to the problem (6) can be yielded.
Define the following optimal identification function:
where
and \(\lambda=(\lambda_{F,I^{\mathcal{C}}},\lambda_{{{\varPsi _{\varepsilon}}},I^{\mathcal{C}}},\lambda_{g,I^{g}})\), the parameter \(\vartheta\in(0,1)\), \(\|\cdot\|\) is the Euclidean norm, and we have the Lagrangian function
Obviously, ρ is nonnegative and continuous. It follows from [32] that \(\rho(z^{*},\lambda^{*})=0\) if and only if \((z^{*},\lambda^{*})\) is a KKT pair of the problem (6).
For the current iterate \(z^{k}\in\widetilde{X}_{0}\), we define the corresponding multiplier vector \(\lambda^{k}=(\lambda^{k}_{F,I^{\mathcal{C}}},\lambda^{k}_{{{\varPsi_{\varepsilon}}},I^{\mathcal{C}}},\lambda^{k}_{g,I^{g}})\) in (9) by
where \(s^{0}>0\), and \((\widehat{\lambda}^{k-1},r_{k-1})\) is computed in the previous \((k-1)\)th iteration, we construct the working set \(I_{k}\) as follows:
When \((z^{k},\lambda^{k})\) is sufficiently close to a KKT pair \((z^{*},\lambda^{*})\) of the problem (6), and the second-order sufficient conditions and the MFCQ hold at \((z^{*},\lambda ^{*})\), the set \(I_{k}\) equals the precise identification set \(I^{g}_{0}(z^{*})\). Therefore, only inequality constraints associated with the working set \(I_{k}\) need to be considered.
For the current iterate \(z^{k}\in\mathbb{R}^{n+2m}\), the first SLE in our algorithm is constructed as follows:
where the coefficient matrix \(M_{I_{k}}\) is defined by
with
the matrix \(H_{k}\in\mathbb{R}^{n+2m}\) is an approximation of the Lagrangian Hessian, the matrix \(S_{I_{k}}:=\operatorname{diag}(s^{k}_{I_{k}})\), where the vector \(s^{k}_{I_{k}}:=(s^{k}_{F,I^{\mathcal{C}}}, s^{k}_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}}, s^{k}_{g,I_{k}})\in\mathbb{R}^{2m+|I_{k}|}\) is an approximation of \(\lambda^{k}_{I_{k}}:=(\lambda^{k}_{F,I^{\mathcal {C}}},\lambda^{k}_{\varPsi_{\varepsilon},I^{\mathcal{C}}},\lambda^{k}_{g,I_{k}})\).
It is not difficult to prove that the following result is true.
Lemma 1
Suppose that Assumptions 1–2hold, and the symmetric matrix \(H_{k}\)satisfies the following relation:
then the coefficient matrix \(M_{I_{k}}\)defined by (13) is invertible, where the matrix order \(A\succ B\)means that \(A-B\)is positive definite.
We now describe our algorithm as follows.
Algorithm A
Step 0. (Initialization)
Choose an initial point \(z^{0}\in\widetilde{X}_{0}\); \(\alpha\in (0,\frac{1}{2})\); \(\vartheta, \beta, \theta\in(0,1)\); \(r_{0}, \varepsilon_{0}, C, \nu, \gamma_{1}, \gamma_{2}, \gamma_{3}, \gamma _{4}>0\); \({\tau>1}\); \(q>2\); termination accuracy \(\epsilon>0\); the vector \(s^{0}:=(s^{0}_{F,I^{\mathcal{C}}},s^{0}_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}},s^{0}_{g,I^{g}})\in\mathbb{R}^{2m+m_{g}}\) with \(s^{0}_{i}\in[s_{\min },s_{\max}]\), where \(s_{\max}>s_{\min}>0\). Set \(k:=0\).
Step 1. (Compute the working set)
Yield \(\lambda^{k}\) by (10), then compute \(\rho (z^{k},\lambda^{k})\) by (9). If \(\rho(z^{k},\lambda^{k})\leq\epsilon\), and \(I^{\mathcal{C}}(y^{k},v^{k},\varepsilon_{k})=\emptyset\), then \(z^{k}\) is a KKT point of the MPCC (1), stop; otherwise, compute the working set \(I_{k}\) by (11).
Step 2. (Yield a matrix \(H_{k}\))
Generate a symmetric matrix \(H_{k}\) such that it is an approximation of the Lagrangian Hessian of the problem (7) and satisfies the condition (14).
Step 3. (Generate a search direction)
-
(1)
Solve the first SLE1 (12). Let its solution be \((\widehat{dz}^{k},\widehat{\lambda}^{k}_{I_{k}})\), with \(\widehat{\lambda}^{k}_{I_{k}}:=(\widehat{\lambda}^{k}_{F,I^{\mathcal{C}}},\widehat{\lambda}^{k}_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}}, \widehat{\lambda}^{k}_{g,I_{k}})\in\mathbb{R}^{2m+|I_{k}|}\). Set \(\widehat{\lambda}^{k} :=(\widehat{\lambda}^{k}_{F,I^{\mathcal{C}}}, \widehat{\lambda}^{k}_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}}, \widehat{\lambda}^{k}_{g,I^{g}})\), where \(\widehat{\lambda}^{k}_{g,I^{g}} :=(\widehat{\lambda}^{k}_{g,I_{k}},0_{I^{g}\setminus I_{k}})\).
-
(2)
If the following conditions hold:
-
(i)
\(\|\widehat{dz}^{k}\|\leq\gamma_{1}\),
-
(ii)
\(\widehat{\lambda}^{k}\geq-\gamma_{2}e_{I}\),
-
(iii)
\(\widehat{\lambda}^{k}_{F,I^{\mathcal{C}}}\ngtr\gamma _{3}e_{I^{\mathcal{C}}}\),
-
(iv)
\(\widehat{\lambda}^{k}_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}}\ngtr\gamma_{4}e_{I^{\mathcal{C}}}\),
-
(v)
\(I^{\mathcal{C}}(y^{k},v^{k},\varepsilon_{k})\neq\emptyset\), then update the parameter \(r_{k}\) by \(r_{k+1}=\tau r_{k}\), set \(z^{k+1}=z^{k}\), \(s^{k+1}=s^{k}\), \(H_{k+1}=H_{k}\), \(I_{k+1}=I_{k}\), \(\varepsilon _{k+1}=\frac{1}{2}\varepsilon_{k}\), and go back to Step 3(1); otherwise, set \(r_{k+1}=r_{k}\), go to Step 3(3).
-
(i)
-
(3)
Find \(\phi^{k}_{{F},I^{\mathcal{C}}}=(\phi^{k}_{{F},i})\in\mathbb{R}^{m}\), \(D^{k}_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}}=(D^{k}_{{\varPsi _{\varepsilon}},i})\in\mathbb{R}^{m}\), \(Q^{k}_{g,I_{k}}=(Q^{k}_{g,i})\in\mathbb{R}^{|I_{k}|}\), respectively, as follows:
$$\begin{aligned} &\begin{aligned}[b]\phi^{k}_{F,i}& :=\phi_{i} \bigl(z^{k},\widehat{\lambda}^{k}_{F,i} \bigr) \\&=\min \bigl\{ 0,- \bigl(\max \bigl\{ -\widehat{\lambda}^{k}_{F,i},0 \bigr\} \bigr)^{\nu} -C \bigl(v^{k}_{i}-F_{i} \bigl(x^{k},y^{k} \bigr) \bigr) \bigr\} ,\quad i\in I^{\mathcal{C}},\end{aligned} \end{aligned}$$(15)$$\begin{aligned} &D^{k}_{{\varPsi_{\varepsilon}},i} :=D_{i} \bigl(z^{k}, \widehat{\lambda}^{k}_{{\varPsi_{\varepsilon}},i} \bigr) =\min \bigl\{ 0,- \bigl( \max \bigl\{ -\widehat{\lambda}^{k}_{{\varPsi_{\varepsilon }},i},0 \bigr\} \bigr)^{\nu} -C\psi_{\varepsilon} \bigl(y^{k}_{i},v^{k}_{i} \bigr) \bigr\} ,\quad i\in I^{\mathcal {C}}, \end{aligned}$$(16)$$\begin{aligned} &Q^{k}_{g,i} :=Q_{i} \bigl(z^{k}, \widehat{\lambda}^{k}_{g,i} \bigr) =\min \bigl\{ 0,- \bigl( \max \bigl\{ -\widehat{\lambda}^{k}_{g,i},0 \bigr\} \bigr)^{\nu }-Cg_{i} \bigl(x^{k},y^{k} \bigr) \bigr\} ,\quad i\in I_{k}. \end{aligned}$$(17)Denote \(\varpi^{k}=(\phi^{k}_{F, I^{C}}, D^{k}_{\varPsi_{\varepsilon}, I^{C}}, Q^{k}_{g, I_{k} })\), then compute
$$\begin{aligned} &\zeta^{k}=\nabla_{z} f_{r_{k}} \bigl(x^{k},y^{k} \bigr)^{\mathrm{T}}\, \widehat{dz}^{k} -\sum_{i\in I^{\mathcal{C}}} \frac{\widehat{\lambda }^{k}_{F,i}}{s^{k}_{F,i}}\phi^{k}_{F,i} -\sum _{i\in I^{\mathcal{C}}}\frac{\widehat{\lambda}^{k}_{\varPsi _{\varepsilon},i}}{s^{k}_{\varPsi_{\varepsilon},i}}D^{k}_{\varPsi _{\varepsilon},i} -\sum_{i\in I_{k}}\frac{\widehat{\lambda }^{k}_{g,i}}{s^{k}_{g,i}}Q^{k}_{g,i}, \end{aligned}$$(18)$$\begin{aligned} &b_{k}= \bigl( \bigl\Vert \widehat{dz}^{k} \bigr\Vert ^{q}+ \bigl\Vert \varpi^{k} \bigr\Vert \bigr) \biggl(\sum_{i\in I^{\mathcal{C}}}\widehat{\lambda}^{k}_{F,i} +\sum_{i\in I^{\mathcal{C}}}\widehat{\lambda}^{k}_{\varPsi _{\varepsilon}i} +\sum_{i\in I_{k}}\widehat{\lambda}^{k}_{g,i} \biggr) \\ &\phantom{b_{k}=}{} +\sum_{i\in I^{\mathcal{C}}}\frac{\widehat{\lambda }^{k}_{F,i}}{s^{k}_{F,i}} \phi^{k}_{F,i} +\sum_{i\in I^{\mathcal{C}}} \frac{\widehat{\lambda}^{k}_{{\varPsi _{\varepsilon}},i}}{s^{k}_{{\varPsi_{\varepsilon}},i}}D^{k}_{{\varPsi _{\varepsilon}},i} +\sum _{i\in I_{k}}\frac{\widehat{\lambda }^{k}_{g,i}}{s^{k}_{g,i}}Q^{k}_{g,i}, \end{aligned}$$(19)$$\begin{aligned} &\varrho_{k}= \left \{ \textstyle\begin{array}{l@{\quad}l} 1, &\mbox{if } b_{k}\leq0,\\ \min\{\frac{(1-\theta) \vert \zeta^{k} \vert }{b_{k}},1\}, &\mbox{if } b_{k}>0. \end{array}\displaystyle \right . \end{aligned}$$(20) -
(4)
Solve the second SLE as follows:
$$ \mathrm{SLE2}{:}\quad M_{I_{k}} \left ( \textstyle\begin{array}{c} dz \\ \lambda_{I_{k}} \end{array}\displaystyle \right ) = \left ( \textstyle\begin{array}{c} -\nabla_{z} f_{r_{k}}(x^{k},y^{k}) \\ \eta^{k}_{I_{k}} \end{array}\displaystyle \right ), $$(21)where the perturbation vector \(\eta^{k}_{I_{k}}=(\eta^{k}_{F,I^{\mathcal{C}}},\eta^{k}_{\varPsi_{\varepsilon},I^{\mathcal{C}}},\eta^{k}_{g,I_{k}})\) is determined by the following convex combination:
$$\begin{aligned} \eta^{k}_{I_{k}}=(1- \varrho_{k})\varpi^{k}+\varrho_{k} \bigl(- \bigl\Vert \widehat{dz}^{k} \bigr\Vert - \bigl\Vert \varpi^{k} \bigr\Vert \bigr)s^{k}_{I_{k}} \end{aligned}$$(22)with \(s^{k}_{I_{k}}:=(s^{k}_{F,I^{\mathcal{C}}},s^{k}_{{\varPsi_{\varepsilon }},I^{\mathcal{C}}},s^{k}_{g,I_{k}})\in\mathbb{R}^{2m+|I_{k}|}\).
Let its solution be \((dz^{k},\lambda^{k}_{I_{k}})\) with \(\lambda^{k}_{I_{k}}:=(\lambda^{k}_{F,I^{\mathcal{C}}},\lambda^{k}_{{\varPsi _{\varepsilon}},I^{\mathcal{C}}}, \lambda^{k}_{g,I_{k}})\in\mathbb{R}^{2m+|I_{k}|}\). Set \(\lambda^{k}:= (\lambda^{k}_{F,I^{\mathcal{C}}},\lambda^{k}_{{\varPsi_{\varepsilon }},I^{\mathcal{C}}},\lambda^{k}_{g,I^{g}})\) with \({\lambda}^{k}_{g,I^{g}} =({\lambda}^{k}_{g,I_{k}},0_{I^{g}\setminus I_{k}})\).
Step 4. (Perform line search) Compute the step size \(t_{k}\), which is the first number t of the sequence \(\{1,\beta,\beta^{2},\ldots \}\) satisfying
Step 5. (Update) Let \(z^{k+1}=z^{k}+t_{k} \,dz^{k}\), and compute
Step 6. Let \(k:=k+1\), and go back to Step 1.
Remark 2
The correction technique of penalty parameter \(r_{k}\) in Step 3(2) is from [33], when the conditions (i)–(ii) in Step 3(2) are satisfied, the current iteration point \(z^{k}\) is close to a KKT point of the problem (7). However, the conditions (iii)–(iv) in Step 3(2) indicate that \(z^{k}\) is far away from the feasible area of the problem (6), and the penalty parameter needs to be increased.
For convenience, denote
Lemma 2
For the directions \(\widehat{dz}^{k}\)and \(dz^{k}\)found in Step 3(1)(4), the following conclusions hold:
Furthermore, when iteration goes into Step 3(3)(4), we have \(\widehat {dz}^{k}\neq0\)and \(\zeta^{k}<0\), so \(dz^{k}\)is a feasible descent direction of the problem (7) at \(z^{k}\), hence, Algorithm Ais well defined.
Proof
It follows from the \(\mathrm{SLE1}\) (12) that
which indicates \(\nabla_{z} f_{r_{k}}(x^{k},y^{k})^{\mathrm{T}}\,\widehat{dz}^{k}= -(\widehat{dz}^{k})^{\mathrm{T}}\varLambda_{k}\,\widehat{dz}^{k}\leq0\).
It follows from (18) and (32) that
Taking into account SLE1 (12) and SLE2 (21), we have
It follows from (34) and (36) as well as the symmetry of the matrix \(H_{k}\) that
In view of (20), if \(b_{k}\leq0\), then \(\varrho _{k}b_{k}=b_{k}\leq0\). Furthermore, we have \(\zeta^{k}+\varrho _{k}b_{k}\leq\zeta^{k}\leq\theta\zeta^{k}\). If \(b_{k}>0\), then we have \(\zeta^{k}+\varrho_{k}b_{k}=\zeta^{k}+\min\{(1-\theta)|\zeta ^{k}|,b_{k}\}\leq\zeta^{k}+(\theta-1)\zeta^{k}=\theta\zeta^{k}\). In conclusion, we obtain \(\zeta^{k}+\varrho_{k}b_{k}\leq\theta\zeta ^{k}\). From (33) and (38), it follows that
which indicates that the inequality (29) is true.
If \(\widehat{dz}^{k}=0\), taking into account SLE1 (12), from \(v^{k}-F(x^{k},y^{k})<0\), \(\varPsi_{\varepsilon}(y^{k},v^{k})<0\), \(g(x^{k},y^{k})<0\), we have \(\widehat{\lambda}^{k}_{I_{k}}=0\), so the iterate k does not go into Step 3(3)(4). Thus, \(\widehat{dz}^{k}\neq0\) when the iterative process goes into Step 3(3)(4).
By \(\nabla_{z} f_{r_{k}}(x^{k},y^{k})^{\mathrm{T}}\,dz^{k}\leq0\), we know that \(dz^{k}\) is a feasible descent direction of the problem (7) at point \(z^{k}\). Together with \(v^{k}-F(x^{k},y^{k})<0\), \(\varPsi_{\varepsilon}(y^{k},v^{k})<0\), \(g(x^{k},y^{k})<0\) and \(\zeta^{k}<0\), Step 4 in Algorithm A can be finished by finite calculations, hence Algorithm A is well defined. □
4 Global convergence
In this paper, we assume that Algorithm A generates an infinite iteration sequence \(\{z^{k}\}\). For the sake of convenience, let \(z^{*}\) be an accumulation point of the iteration sequence \(\{z^{k}\}\). First, we show that the penalty parameter \(r_{k}\) can be augmented only in a finite number of steps. Then we prove that \(z^{*}\) is a KKT point of the problem (7). Finally, we show that \(z^{*}\) is a KKT point of the MPCC (1). More precisely, we show that Algorithm A is globally convergent. For this purpose, the following assumptions are necessary.
Assumption 3
Suppose that the sequences \(\{H_{k}\}\) and \(\{z^{k}\}\) are all bounded, each accumulation point of \(\{z^{k}\}\) satisfies LLSC (2). Assume that there exists a positive constant σ such that
Assumption 4
For any \(z\in\widetilde{X}\), if \(z\notin X\), then no scalars \(\lambda _{F,i}\geq0\), \(i\in I^{F}_{0}(z)\); \(\lambda_{\varPsi_{\varepsilon},i}\geq 0\), \(i\in I^{\varPsi_{\varepsilon}}_{0}(z)\) and \(\lambda_{g,i}\geq0\), \(i\in I^{g}_{0}(z)\), exist such that
Lemma 3
Suppose that Assumptions 1–4hold, then the penalty parameter \(r_{k}\)can be augmented only in a finite number of steps.
Proof
By contradiction, suppose that \(r_{k}\) is increased infinitely many times, i.e., there exists an infinite index set K such that \(r_{k+1}>r_{k}\) for any \(k\in K\). The increase of \(r_{k}\) must satisfy the following conditions for any \(k\in K\):
Denote
and \(S^{k}_{I^{g}}=\operatorname{diag}(s^{k}_{I^{g}})\), where \(s^{k}_{I^{g}}=(s^{k}_{F,I^{\mathcal{C}}},s^{k}_{\varPsi_{\varepsilon},I^{\mathcal{C}}},s^{k}_{g,I^{g}})\). According to the SLE
we have
Since \(\{r_{k}\}\) tends to infinity, and \(\lambda^{k}=\widehat {\lambda}^{k-1}-r_{k-1}\tilde{e}\), \(k\geq1\), such that \(\{\|\widehat {\lambda}^{k}\|_{\infty}\}\) tends to infinity as well, we have
Define
for k (∈K) large enough, it follows that
Since the sequence \(\{z^{k}\}_{k\in K}\) is bounded by Assumption 3, there exists an infinite index set \(K'\subseteq K\), \(z^{*}\in\mathbb{R}^{n+2m}\), and \((\bar{\lambda}^{*}_{F,I^{\mathcal{C}}}, \bar{\lambda}^{*}_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}}, \bar{\lambda}^{*}_{g,I^{g}})\neq0\), for all \(k\in K'\), such that
In view of the boundedness of \(\{z^{k}\}\) and the continuity of the functions, it follows that
are bounded. Further, \(\{S^{k}_{I^{g}}\}\) are bounded by construction. Multiplying both sides of (41) by \(\beta^{-1}_{k}\), we have
Letting \(k\stackrel{K'}{\longrightarrow}\infty\), the condition (i) shows that \(\beta^{-1}_{k}G^{k}_{I^{g}}\widehat{\lambda}^{k}\stackrel{K'}{\longrightarrow}0\), implying that \(\beta^{-1}_{k}\widehat{\lambda }^{k}_{F,i}\stackrel{K'}{\longrightarrow}0\) for \(i\in I^{\mathcal{C}}\setminus I^{F}_{0}(z^{*})\). Similarly, \(\beta^{-1}_{k}\widehat {\lambda}^{k}_{\varPsi_{\varepsilon},i}\stackrel{K'}{\longrightarrow}0\) for \(i\in I^{\mathcal{C}}\setminus I^{\varPsi_{\varepsilon}}_{0}(z^{*})\).
From (10) and (42), we know that \(\{ \frac{r_{k}}{\beta_{k}}\}\) converges to \(a\geq0\) as \(k\stackrel{K'}{\longrightarrow}\infty\), with
Then it follows from the condition (ii) and (42) that
And from (41), we obtain
furthermore, multiplying both sides of this equation by \(\beta ^{-1}_{k}\), together with the condition (i) and taking the limit as \(k\stackrel{K'}{\longrightarrow}\infty\), we obtain
so \(\widehat{\lambda}^{*}_{g,i}=0\) for all \(i\in I^{g}\setminus I^{g}_{0}(z^{*})\).
In view of the condition (i) and the continuity of the functions, multiplying (40) by \(\beta^{-1}_{k}\) and taking the limit as \(k\stackrel{K'}{\rightarrow}\infty\), we get \(\beta ^{-1}_{k}A^{k}_{I^{g}}\widehat{\lambda}^{k}\stackrel{K'}{\longrightarrow}0\), i.e.,
Since \(\bar{\lambda}^{*}_{F,I^{\mathcal{C}}}\), \(\bar{\lambda}^{*}_{\varPsi _{\varepsilon},I^{\mathcal{C}}}\), and \(\bar{\lambda}^{*}_{g,I^{g}}\) are not zero, combining with (45) and Assumption 2, it follows that
i.e., \(z^{*}\notin X\) and \(a>0\).
Based on (43), dividing both sides of (45) by a, we obtain
Adding both sides of the above equality to
we obtain
where \(\lambda_{F,i}=1-\frac{\bar{\lambda}^{*}_{F,i}}{a}\), \(i\in I^{F}_{0}(z^{*})\); \(\lambda_{{\varPsi_{\varepsilon}},i}=1-\frac{\bar {\lambda}^{*}_{{\varPsi_{\varepsilon}},i}}{a}\), \(i\in I^{\varPsi_{\varepsilon }}_{0}(z^{*})\); \(\lambda_{g,i}=-\frac{\bar{\lambda}^{*}_{g,i}}{a}\), \(i\in I^{g}_{0}(z^{*})\). In view of (44) and \(z^{*}\notin X\). This contradicts Assumption 4, so the penalty parameter \(r_{k}\) is updated at most a finite number of times. □
Lemma 4
Suppose that Assumptions 1–4hold, then there exists an integer \(k_{0}\), such that \(\varepsilon_{k}\equiv\varepsilon_{k_{0}}\)for \(k\geq k_{0}\).
Proof
By Step 3(2) and Step 5 of Algorithm A, the sequence \(\{\varepsilon_{k}\}\) is monotonically decreasing and bounded from below, so the sequence \(\{\varepsilon_{k}\}\) is convergent. Let
where \(K=\{0,1,2,\ldots\}\). Now we turn to prove that \(K_{1}\) and \(K_{2}\) are finite sets.
By contradiction, if \(K_{1}\) is infinite set, then \(\lim_{k\rightarrow\infty}\varepsilon_{k}=0\). There exist \(i\in I^{\mathcal{C}}\) and \(\overline{K}_{1}\subseteq K_{1}\), such that
Together with the boundedness of \(\{z^{k+1}\}_{\overline{K}_{1}}\), we suppose that \(z^{k+1}\stackrel{\overline{K}_{1}}{\longrightarrow}z^{*}\). From (47), we have \((y^{*}_{i})^{2}+(v^{*}_{i})^{2}=0\), which contradicts Assumption 3. So \(K_{1}\) is a finite set. Similarly, we show that \(K_{2}\) is finite. Based on the finiteness of \(K_{1}\) and \(K_{2}\), from the update rule (27) of \(\varepsilon_{k}\) and Step 3(2), we can conclude that Lemma 4 holds. □
Based on Lemmas 3–4, in the following analysis, we suppose, without loss of generality, that \(r_{k}\equiv\widehat{r}\), \(\varepsilon_{k}\equiv \varepsilon\).
Lemma 5
Suppose that Assumptions 1–4hold. Then
-
(1)
The coefficient matrix sequence \(\{M_{I_{k}}\}\)is unified invertible, and there exists a positive constant L such that \(\| (M_{I_{k}})^{-1}\|\leq L\), \(\forall k\geq0\).
-
(2)
The sequences \(\{(\widehat{dz}^{k},\widehat{\lambda}^{k})\}\)and \(\{ ({dz}^{k},{\lambda}^{k})\}\)are bounded.
Proof
(1) By contradiction, suppose that there exists an infinite subset K such that \(\|(M_{I_{k}})^{-1}\|\stackrel{K}{\longrightarrow}\infty\). Since \(\{z^{k}\}\) and \(\{H_{k}\}\) are bounded, and the finite choice of \(I_{k}\), without loss of generality, for all \(k\in K\), let
Denote
then
where the matrix \(S^{*}_{I'}=\operatorname{diag}(s^{*}_{I'})\) with \(s^{*}_{I'}=(s^{*}_{F,I^{\mathcal{C}}}, s^{*}_{\varPsi_{\varepsilon},I^{\mathcal{C}}},s^{*}_{g,I'})\).
Under the Assumptions, it can be shown easily that \(M^{*}_{I'}\) is nonsingular. So we obtain
which contradicts \(\|(M_{I_{k}})^{-1}\|\stackrel{K}{\longrightarrow}\infty\).
(2) In view of SLE1 (12), and Lemma 3 as well as \(r_{k}\equiv\widehat{r}\), we see that \(\{(\widehat{dz}^{k},\widehat{\lambda}^{k})\}\) is bounded. By (15)–(22) and the boundedness of \(\{(\widehat {dz}^{k},\widehat{\lambda}^{k})\}\) as well as the boundedness of \(\{s^{k}\} \), we know that \(\{\eta^{k}_{I_{k}}\}\) is bounded. Hence, the boundedness of \(\{(dz^{k},\lambda^{k})\}\) follows by SLE2 (21). □
Lemma 6
Suppose that Assumptions 1–4hold, \(z^{*}\)is an accumulation point of the infinite sequence \(\{z^{k}\}\)generated by Algorithm A, and \(\{z^{k}\} _{K}\)converges to \(z^{*}\). If \(\{\zeta^{k}\}_{K}\rightarrow0\), then \(z^{*}\)is a KKT point of the problem (7), and both \(\{\widehat{\lambda}^{k}\}_{K}\)and \(\{\lambda ^{k}\}_{K}\)converge to the unique multiplier vector \(\lambda^{*}\)corresponding to \(z^{*}\).
Proof
Let \((\widehat{\lambda}^{*};\lambda^{*})\) be any accumulation point of \(\{(\widehat{\lambda}^{k};\lambda^{k})\}_{K}\). First, we verify that \((z^{*},\widehat{\lambda}^{*})\) is a KKT pair of the problem (7). Taking into account Assumption 3, Lemma 5, and the finite choice of \(I_{k}\), there exists an infinite index \(K'\subseteq K\) such that
It follows from (33), (28), Assumption 3 and \(\{\zeta^{k}\} _{K}\rightarrow0\) that \(\widehat{dz}^{*}=0\). Further, taking the limit in SLE1 (12) for \(k\in K'\), one obtains
Next we will prove \(\widehat{\lambda}^{*}_{I'}=(\widehat{\lambda}^{*}_{F,I^{\mathcal{C}}}, \widehat{\lambda}^{*}_{{\varPsi_{\varepsilon}},I^{\mathcal{C}}}, \widehat{\lambda}^{*}_{g,I'})\geq0\), and \(\widehat{\lambda}^{*} =(\widehat{\lambda}^{*}_{I'}, 0_{I^{g}\setminus{I'}})\geq0\). From \(\widehat{\lambda}^{*}_{F,i}(v^{*}_{i}-F_{i}(x^{*},y^{*}))=0\), for \(i\in I^{\mathcal{C}}\setminus I^{F}_{0}(z^{*})\), we have \(\widehat{\lambda }^{*}_{F,I^{\mathcal{C}}\setminus I^{F}_{0}(z^{*})}=0\). Similarly,
By (18) and \((\zeta^{k},\widehat{dz}^{k})\stackrel{K'}{\longrightarrow}(0,0)\), one obtains
Furthermore, by (32), we have \(\frac{\widehat{\lambda}^{k}_{F,i}\phi^{k}_{F,i}}{s^{k}_{F,i}}\geq 0\) for every \(i\in I^{\mathcal{C}}\). Combining with (49), we have \(\widehat{\lambda}^{k}_{F,i}\phi^{k}_{F,i}\stackrel{K'}{\longrightarrow}0\), \(i\in I^{\mathcal{C}}\). So, in view of (15), one has
which implies that \(\widehat{\lambda}^{*}_{F,i}\geq0\) for \(i\in I^{\mathcal{C}}\cap I^{F}_{0}(z^{*})\). Similarly, one obtains
So we can claim that \(\widehat{\lambda}^{*}_{I'}\geq0\). Obviously, \(\widehat{\lambda}^{*}=(\widehat{\lambda}^{*}_{I'},0_{I^{g}\setminus {I'}})\geq0\) is immediate.
Therefore, in view of \(z^{*}\in\widetilde{X}\) and (50), we see that \((z^{*},\widehat{\lambda}^{*})\) is a KKT pair of the problem (7). Moreover, the above analysis indicates that the sequence \(\{\widehat{\lambda}^{k}\}_{K}\) possesses a unique accumulation point \(\widehat{\lambda}^{*}\), so \(\lim_{k\in K}\widehat{\lambda}^{k}=\widehat{\lambda}^{*}\).
Last, noticing that \((\widehat{dz}^{k},\widehat{\lambda}^{k})\stackrel{K'}{\longrightarrow}(0,\widehat{\lambda}^{*})\geq0\), (15)–(17) and (22), we have
Denote
so it follows from the above SLE and SLE1 (12) that
which together with Lemma 5(1) gives
□
Based on the lemmas above, we now present the global convergence of Algorithm A.
Theorem 1
Suppose that Assumptions 1–4hold. \(z^{*}\)is an accumulation point of \(\{z^{k}\}\)generated by Algorithm A, then \(z^{*}\)is a KKT point of the MPCC (1).
Proof
A. We first show that \(z^{*}\) is a KKT point of the problem (6) (called Conclusion A).
Let \(K'\) be an infinite index set such that
Denote
where the matrix \(S^{*}_{I'}=\operatorname{diag}(s^{*}_{I'})\) with \(s^{*}_{I'}=(s^{*}_{F,I^{\mathcal{C}}},s^{*}_{\varPsi_{\varepsilon},I^{\mathcal {C}}},s^{*}_{g,I'})\).
By contradiction, suppose that \(z^{*}\) is not a KKT point of the problem (6). From Lemma 5, without loss of generality, suppose that \(\lambda ^{k}=\widehat{\lambda}^{k-1}-\widehat{r}\tilde{e}\stackrel{K'}{\longrightarrow}\widehat{\lambda}'\). Therefore, it follows that \((z^{*},\widehat{\lambda}')\) is not a KKT pair of the problem (6), which implies \(\rho(z^{*},\widehat{\lambda}')>0\) and \(I^{g}_{0}(z^{*})\subseteq I_{k}\), \(k\in K'\) large enough. The following two cases are discussed.
Case I. \(z^{*}\) is a KKT point of the problem (7). Then there exists a multiplier \(\widehat {\lambda}''=(\widehat{\lambda}''_{I_{*}},0_{I^{g}\setminus I'})\geq 0\), such that the KKT condition of the problem (7) is satisfied at \((z^{*},\widehat{\lambda }''_{I_{*}})\). For \(k\in K'\) large enough, we have \(I^{g}_{0}(z^{*})\subseteq I_{k}\equiv I'\). It is easy to see that \((0,\widehat{\lambda }''_{I_{*}})\) is a solution to the following SLE:
Passing to the limit in SLE1 (12), we know that \((\widehat {dz}^{*},\widehat{\lambda}^{*}_{I_{*}})\) is the solution to (53) above. Let \(\widehat{\lambda}^{*}=(\widehat{\lambda}^{*}_{I_{*}},0_{I^{g}\setminus I'})\). In view of Lemma 5 (1), taking into account the nonsingularity of the matrix \(M^{*}_{I'}\), we see that the solution of (53) is unique. Hence, we obtain \(\widehat{dz}^{*}=0\), \(\widehat{\lambda}^{*}_{I_{*}}=\widehat{\lambda }''_{I_{*}}\geq0\), which implies \(\widehat{\lambda}^{*}=\widehat{\lambda}''\geq0\). The conditions (i) and (ii) in Step 3 (2) are satisfied for \(k\in K'\) large enough. Moreover, in view of \(r_{k}\equiv\widehat{r}<\infty\) for k large enough, Step 3 (2) implies \(\widehat{\lambda}^{k}_{F,I^{\mathcal{C}}} >\gamma_{3}e_{I^{\mathcal{C}}}\), \(\widehat{\lambda}^{k}_{{\varPsi _{\varepsilon}},I^{\mathcal{C}}} >\gamma_{4}e_{I^{\mathcal{C}}}\) for \(k\in K'\) large enough, which further implies that
Since the pair \((z^{*},\widehat{\lambda}'')\) satisfies the complementary slackness, one gets
So \(v^{*}-F(x^{*},y^{*})=0\), \(\varPsi_{\varepsilon}(y^{*},v^{*})=0\), which together with Proposition 2 shows that \(z^{*}\) is a KKT point of the problem (6), which contradicts the assumption that \(z^{*}\) is not a KKT point of the problem (6).
Case II. \(z^{*}\) is not a KKT point of the problem (7). It follows from Lemma 6 and \(\zeta^{k}\leq0\) that \(\zeta^{k}\stackrel{K'}{\rightarrow}\bar{\zeta}<0\). By (18), (28) and Assumption 3, we have
so there exist a subset \(K^{''}\subseteq K'\) and a positive constant c̄ such that
In the following we first show that there exists a positive constant t̄, such that \(t_{k}\geq\bar{t}\) for all \(k\in K''\).
Based on the boundedness of \(\{dz^{k}\}_{K''}\) and the continuity of functions, for all \(k\in K''\) large enough and \(t>0\) sufficiently small, one gets
In view of \(i\in I^{F}_{0}(z^{*})\), we have \(v^{*}_{i}-F_{i}(u^{*})=0\). Similarly, \(\psi_{\varepsilon}(w^{*}_{i})=0\) for \(i\in I^{\varPsi_{\varepsilon }}_{0}(z^{*})\), and \(g_{i}(u^{*})=0\) for \(i\in I^{g}_{0}(z^{*})\). From (22) and SLE2 (21), we have
that is,
By Taylor’s expansion and \(\|\widehat{dz}^{k}\|\geq0\), for \(t>0\) sufficient small, we get
Furthermore, we consider \(\zeta^{k}\leq\frac{\bar{\zeta}}{2}<0\), by (19), (20), and the boundedness of \(b_{k}\) by Lemma 5, we see that there exists a constant \(\bar{a}>0\) such that \(\varrho_{k}\geq\bar{a}>0\), \(k\in K''\). For \(k\in K''\) large enough and \(t>0\) small enough, we have
Similarly, for \(k\in K''\) large enough and \(t>0\) small enough, we have
(ii) Consider the inequality (23). By (29) and Taylor expansion, for \(k\in K''\) large enough and \(t>0\) small enough, it follows that
By summarizing the above discussion, we can conclude that the inequalities (23)–(26) hold for \(k\in K''\) large enough and \(t>0\) small enough, that is, \(\bar{t}=\inf\{t_{k}:k\in K''\}>0\).
In what follows, based on \(t_{k}\geq\bar{t}>0\) (\(k\in K''\)) we will deduce a contradiction. In view of the monotone decreasing property of \(\{f_{\widehat{r}}(z^{k})\}\) and \(\lim_{k\in K''}f_{\widehat {r}}(z^{k})=f_{\widehat{r}}(z^{*})\), we have \(\lim_{k\rightarrow\infty }f_{\widehat{r}}(z^{k})=f_{\widehat{r}}(z^{*})\). Moreover, from (23) and (29), for \(k\in K''\) large enough, we have
Taking the limit \(k\stackrel{K''}{\rightarrow}\infty\) in the inequality above, we obtain a contradiction.
From the above analysis, we can conclude that \(z^{*}\) is a KKT point of the problem (6).
B. We now show that \(I^{\mathcal{C}}(y^{*},v^{*},\varepsilon )=\emptyset\) (called Conclusion B).
From Lemma 4 and the updating rule of \(\varepsilon_{k}\) in Step 5, we see that
which implies that \(I^{\mathcal{C}}(y^{*},v^{*},\varepsilon)=\emptyset\).
Based on Conclusions A and B, and combining with Proposition 1, we can conclude that \(z^{*}\) is a KKT point of the MPCC (1). □
5 Numerical experiments
In this section, a preliminary implementation of Algorithm A is given on a Intel(R) Core(TM) i5-7200U CPU (@ 2.50 GHz, 2.71 GHz) and RAM (4 GB). A Matlab code (Version R2014a) is written corresponding to this implementation.
In our tests, referring to [21], we initialize \(H_{0}=E_{n}\), and for \(k=1,2,\dots\), the approximate Hessian matrix \(H_{k}\) is determined by the following equation:
where \(E_{n+2m}\) is an \((n+2m)\)th-order identity matrix, \(h_{k}\) is chosen as follows:
where \(s_{\min}>0\) and \(s_{\max}>0\) are sufficiently small and sufficiently large, respectively, \(\lambda_{\min}\) is the smallest eigenvalue of the following matrix:
In the numerical experiments, the parameters are chosen as follows:
Algorithm A stops if one of the following termination criteria is satisfied:
(1) \(\rho(z^{k},\lambda^{k})\leq10^{-5}\) and \(I^{\mathcal{C}}(y^{k},v^{k},\varepsilon_{k})=\emptyset\).
(2) \(\|\widehat{dz}^{k}\|\leq10^{-5}\), \(\max\{-\widehat{\lambda }^{k}_{g,i},i\in I^{g}\}<10^{-5}\) and \(I^{\mathcal{C}}(y^{k},v^{k},\varepsilon _{k})=\emptyset\).
The following test problems are selected from [34]. Algorithm A can find their solutions within a small number of iterations. We compared Algorithm A with four algorithms, i.e., filtermpec, snopt, loqo, and knitro given in [35]. The computational results are given in Table 1. The meaning of some notations in Table 1 are as follows:
Problem: The problem in [34].
6 Conclusions
In this paper, based on Fischer–Burmeister function and working set techniques, a primal–dual interior point QP-free algorithm for mathematical programs with complementarity constraints is presented. At each iteration, only two reduced linear equations with the same coefficient matrix are solved to yield the search direction. The use of the working set decreases the computational cost. Moreover, the uniformly positive definiteness on the Lagrangian Hessian estimate \({H_{k}}\) is relaxed. Under some mild conditions, the proposed algorithm is globally convergent. The preliminary numerical results show that the proposed algorithm is effective.
References
Ferris, M.C., Pang, J.S.: Engineering and economic applications of complementarity problems. SIAM Rev. 39, 669–713 (1997)
Luo, Z.Q., Pang, J.S., Ralph, D.: Mathematical Programs with Equilibrium Constraints. Cambridge University Press, Cambridge (1996)
Kocvara, M., Outrata, J., Zowe, J.: Nonsmooth Approach to Optimization Problems with Equilibrium Constraints: Theory, Applications and Numerical Results. Nonconvex Optimization and Its Applications, vol. 28. Kluwer Academic, Dordrecht (1998)
Dempe, S.: Foundations of Bilevel Programming. Nonconvex Optimization and Its Applications, vol. 61. Kluwer Academic, Dordrecht (2002)
Fukushima, M.: A successive quadratic programming algorithm with global and superlinear convergence properties. Math. Program. 35, 253–264 (1986)
Panier, E.R., Tits, A.L.: A superlinearly convergent feasible method for the solution of inequality constrained optimization problems. SIAM J. Control Optim. 25, 934–950 (1987)
Anitescu, M.: A superlinearly convergent sequential quadratically constrained quadratic programming algorithm for degenerate nonlinear programming. SIAM J. Optim. 12, 949–978 (2002)
Fukushima, M., Luo, Z.Q., Tseng, P.: A sequential quadratically constrained quadratic programming method for differentiable convex minimization. SIAM J. Optim. 13, 1098–1119 (2003)
Jian, J.B.: New sequential quadratically constrained quadratic programming method of feasible directions and its convergence rate. J. Optim. Theory Appl. 129, 109–130 (2006)
Solodov, M.V.: Global convergence of an SQP method without boundedness assumptions on any of iterative sequences. Math. Program. 118, 1–12 (2009)
Jian, J.B., Tang, C.M., Zheng, H.Y.: Sequential quadratically constrained quadratic programming norm-relaxed algorithm of strongly sub-feasible directions. Eur. J. Oper. Res. 200, 645–657 (2010)
Jian, J.B., Liu, Y.: A superlinearly convergent method of quasi-strongly sub-feasible directions with active set identifying for constrained optimization. Nonlinear Anal., Real World Appl. 12, 2717–2729 (2011)
Jian, J.B., Hu, Q.J., Tang, C.M.: Superlinearly convergent norm-relaxed SQP method based on active set identification and new line search for constrained minimax problems. J. Optim. Theory Appl. 163, 859–883 (2014)
Panier, E.R., Tits, A.L., Herskovits, J.N.: A QP-free, globally convergent, locally superlinearly convergent algorithm for inequality constrained optimization. SIAM J. Control Optim. 26, 788–811 (1988)
Gao, Z.Y., He, G.P., Wu, F.: Sequential systems of linear equations algorithm for nonlinear optimization problems with general constraints. J. Optim. Theory Appl. 95, 371–397 (1997)
Takahashi, W.: The split feasibility problem in Banach spaces. J. Nonlinear Convex Anal. 15, 1349–1355 (2014)
Shang, Y.: A new filter QP-free method for the nonlinear inequality constrained optimization problem. J. Inequal. Appl. 2018, Article ID 278 (2018)
Takahahsi, W., Yao, J.C.: The split common fixed point problem for two finite families of nonlinear mappings in Hilbert spaces. J. Nonlinear Convex Anal. 20, 173–195 (2019)
Kanzow, C., Qi, H.D.: A QP-free constrained Newton-type method for variational inequality problems. Math. Program. 85, 81–106 (1999)
Liu, W.A., Shen, C.G., Zhu, X.J., Pu, D.G.: An infeasible QP-free algorithm without a penalty function or a filter for nonlinear inequality constrained optimization. Optim. Methods Softw. 168, 1–28 (2005)
Bakhtiari, S., Tits, A.L.: A simple primal–dual feasible interior-point method for nonlinear programming with monotone descent. Comput. Optim. Appl. 25, 17–38 (2003)
Yang, Y.F., Li, D.H., Qi, L.Q.: A feasible sequential linear equation method for inequality constrained optimization. SIAM J. Optim. 13, 1222–1244 (2003)
Chen, L., Wang, Y.L., He, G.P.: A feasible active set QP-free method for nonlinear programming. SIAM J. Optim. 17, 401–429 (2006)
Jian, J.B., Pan, H.Q., Tang, C.M., Li, J.L.: A strongly sub-feasible primal–dual quasi interior-point algorithm for nonlinear inequality constrained optimization. Appl. Math. Comput. 266, 560–578 (2015)
Jian, J.B., Zeng, H.J., Ma, G.D.: Primal–dual interior point QP-free algorithm for nonlinear constrained optimization. J. Inequal. Appl. 2017, Article ID 239 (2017)
Li, J.L., Jian, J.B.: A superlinearly convergent SSLE algorithm for optimization problems with linear complementarity constraints. J. Glob. Optim. 33, 477–510 (2005)
Li, J.L., Huang, R.S., Jian, J.B.: A superlinearly convergent QP-free algorithm for mathematical programs with equilibrium constraints. Appl. Math. Comput. 269, 885–903 (2015)
Ma, C.F., Liang, G.P.: A new successive approximation damped Newton method for nonlinear complementarity problems. J. Math. Res. Expo. 23, 1–6 (2003)
Mayne, D.Q., Polak, E.: Feasible directions algorithms for optimization problems with equality and inequality constraints. Math. Program. 11, 67–80 (1976)
Fukushima, M., Luo, Z.Q., Pang, J.S.: A globally convergent sequential quadratic programming algorithm for mathematical programs with linear complementarity constraints. Comput. Optim. Appl. 10, 5–34 (1998)
Jiang, H.Y., Ralph, D.: Smooth SQP methods for mathematical programs with nonlinear complementarity constraints. SIAM J. Optim. 10, 779–808 (2000)
Facchinei, F., Fischer, A., Kanzow, C.: On the accurate identification of active constraints. SIAM J. Optim. 9, 14–32 (1998)
Tits, A.L., Wachter, A., Bakhtiari, S., Urban, T.J., Lawrence, C.T.: A primal–dual interior-point method for nonlinear programming with strong global and local convergence properties. SIAM J. Optim. 14, 173–199 (2003)
Leyffer, S.: MacMPEC: AMPL collection of MPECS (2000). http://www.mcs.anl.go/~Leyffer/MacMPEC
Fletcher, R., Leyffer, S.: Solving mathematical programs with complementarity constraints as nonlinear programs. Optim. Methods Softw. 19, 15–40 (2004)
Availability of data and materials
Not applicable.
Funding
This research is supported by the National Natural Science Foundation (11561005) of China.
Author information
Authors and Affiliations
Contributions
The authors completed the paper. The authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Li, J.L., Zhang, Q. A global QP-free algorithm for mathematical programs with complementarity constraints. J Inequal Appl 2020, 213 (2020). https://doi.org/10.1186/s13660-020-02479-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13660-020-02479-6