A relational-theoretic approach to get solution of nonlinear matrix equations

Nashine, Hemant Kumar; Jain, Reena; Parvaneh, Vahid

doi:10.1186/s13660-022-02817-w

Research
Open access
Published: 13 June 2022

A relational-theoretic approach to get solution of nonlinear matrix equations

Journal of Inequalities and Applications volume 2022, Article number: 79 (2022) Cite this article

1548 Accesses
2 Citations
Metrics details

Abstract

In this study, we consider a nonlinear matrix equation of the form $\mathcal{X}= \mathcal{Q} + \sum_{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G} (\mathcal{X})\mathcal{A}_{i}$, where $\mathcal{Q}$ is a Hermitian positive definite matrix, $\mathcal{A}_{i}^{*}$ stands for the conjugate transpose of an $n\times n$ matrix $\mathcal{A}_{i}$, and $\mathcal{G}$ is an order-preserving continuous mapping from the set of all Hermitian matrices to the set of all positive definite matrices such that $\mathcal{G}(O)=O$. We discuss sufficient conditions that ensure the existence of a unique positive definite solution of the given matrix equation. For this, we derive some fixed point results for Suzuki-FG contractive mappings on metric spaces (not necessarily complete) endowed with arbitrary binary relation (not necessarily a partial order). We provide adequate examples to validate the fixed-point results and the importance of related work, and the convergence analysis of nonlinear matrix equations through an illustration with graphical representations.

1 Introduction

The study of nonlinear matrix equations (NME) appeared first in the literature concerned with an algebraic Riccati equation. These equations occur in a large number of problems in control theory, dynamical programming, ladder network, stochastic filtering, queuing theory, statistics, and many other applicable areas.

Let $\mathcal{H}(n)$ (resp. $\mathcal{K}(n)$, $\mathcal{P}(n)$) denote the set of all $n\times n$ Hermitian (resp. positive semi-definite, positive definite) matrices over $\mathbb{C}$ and $\mathcal{M}(n)$ the set of all $n\times n$ matrices over $\mathbb{C}$. In [1], Ran and Reurings discussed the existence of solutions of the following equation:

$$\begin{aligned} \mathcal{X} + \mathcal{B}^{*} F(\mathcal{X})\mathcal{B} = \mathcal{Q} \end{aligned}$$

(1)

in $\mathcal{K}(n)$, where $\mathcal{B} \in \mathcal{M}(n)$, $\mathcal{Q}$ is positive definite, and F is a mapping from $\mathcal{K}(n)$ into $\mathcal{M}(n)$. Note that $\mathcal{X}$ is a solution of (1) if and only if it is a fixed point of the mapping $\mathcal{G}(\mathcal{X}) = \mathcal{Q} - \mathcal{B}^{*} F( \mathcal{X})\mathcal{B}$. In [2], authors used the notion of partial ordering and established a modification of the Banach contraction principle, which they applied for solving a class of NMEs of the form $\mathcal{X} = \mathcal{Q} + \sum_{i=1}^{m} \mathcal{B}_{i}^{*} F( \mathcal{X})\mathcal{B}_{i}$ using the Ky Fan norm in $\mathcal{M}(n)$.

Theorem 1.1

([2])

Let $F: \mathcal{H}(n)\rightarrow \mathcal{H}(n)$ be an order-preserving, continuous mapping which maps $\mathcal{P}(n)$ into itself and $\mathcal{Q}\in \mathcal{P}(n)$. If $\mathcal{B}_{i},\mathcal{B}_{i}^{*}\in \mathcal{P}(n)$ and $\sum_{i=1}^{m}\mathcal{B}_{i}\mathcal{B}_{i}^{*}< M\cdot \mathcal{I}_{n}$ for some $M>0$ ($\mathcal{I}_{n}$ – the unit matrix in $\mathcal{M}(n)$) and if $|\operatorname{tr}({F(\mathcal{Y}) - F(\mathcal{X})})|\leq \frac{1}{M}|\operatorname{tr}(\mathcal{Y-X})|$ for all $\mathcal{X,Y}\in \mathcal{H}(n)$ with $\mathcal{X}\leq{\mathcal{Y}}$, then the equation $\mathcal{X}=\mathcal{Q}+\sum_{i=1}^{m}\mathcal{B}_{i}^{*}F( \mathcal{X})\mathcal{B}_{i}$ has a unique positive definite solution (PDS).

In recent years, a number of mathematicians have obtained fixed point results for contraction type mappings in metric spaces equipped with partial order. Some early results in this direction were established by Turinici in [3, 4]; one may note that their starting points were “amorphous” contributions in the area due to Matkowski [5, 6]. These types of results have been reinvestigated by Ran and Reurings [1] and also by Nieto and Ródríguez-López [7, 8]. Samet and Turinici [9] established fixed point theorem for nonlinear contraction under symmetric closure of an arbitrary relation. Ahmadullah et al. [10–12] and Alam and Imdad [13] employed an amorphous relation to prove a relation-theoretic analogue of the Banach contraction principle which in turn unifies a lot of well-known relevant order-theoretic fixed point theorems. Recently, Hasanuzzaman and Imdad [14] used the concept of simulation function and proved the relation theoretic metrical fixed point results for Suzuki type $\mathcal{Z}_{R}$-contraction and discussed application in solving nonlinear matrix equations.

Motivated by the above reference work, we introduce the notion of Suzuki-FG contractive mapping on metric spaces endowed with an arbitrary binary relation (not necessarily partial order), and then we prove existence and uniqueness fixed point results under weaker conditions. We justify our work by some illustrative examples and demonstrate the genuineness of Suzuki-FG contraction over Suzuki contraction, generalized Suzuki contraction, and implicit type contraction mapping. Further, we apply this result to NMEs and discuss its convergence behavior with respect to three different initial values with graphical representations and solutions by the surface plot. The experiment was run on a macOS Mojave version 10.14.6 CPU @1.6 GHz intel core i5 8GB with MATLAB R2020b as the programming language (Online).

2 Preliminaries

Throughout this article, the notations $\mathbb{Z}, \mathbb{N}, \mathbb{R}, \mathbb{R}^{+}$ have their usual meanings, and $\mathbb{N}^{*}=\mathbb{N} \cup \{0\}$.

We call $({\mathcal{E}},\mathfrak{R})$ a relational set if (i) ${\mathcal{E}} \neq \emptyset $ is a set and (ii) $\mathfrak{R}$ is a binary relation on $\mathcal{E}$.

In addition, if $({\mathcal{E}},d)$ is a metric space, we call $({\mathcal{E}},d,\mathfrak{R})$ a relational metric space (RMS, for short).

The following are some standard terms used in the theory of relational sets (see, e.g., [9, 13, 15–17]).

Let $({\mathcal{E}},\mathfrak{R})$ be a relational set, $(\mathcal{E}, d, \mathfrak{R})$ be an RMS, and let ℑ be a self-mapping on $\mathcal{E}$. Then:

1.
$\nu \in \mathcal{E}$ is $\mathfrak{R}$-related to $\vartheta \in \mathcal{E}$ if and only if $(\nu, \vartheta ) \in \mathfrak{R}$.
2.
The set $({\mathcal{E}},\mathfrak{R})$ is said to be comparable if for all $\nu,\vartheta \in \mathcal{E}, [\nu,\vartheta ]\in \mathfrak{R}$, where $[\nu,\vartheta ]\in \mathfrak{R}$ means that either $(\nu, \vartheta )\in \mathfrak{R}$ or $(\vartheta,\nu )\in \mathfrak{R}$.
3.
A sequence $(\nu _{n})$ in $\mathcal{E}$ is said to be $\mathfrak{R}$-preserving if $(\nu _{n}, \nu _{n+1})\in \mathfrak{R}$, $\forall n\in \mathbb{N}\cup \{0\}$.
4.
$(\mathcal{E}, d, \mathfrak{R})$ is said to be $\mathfrak{R}$-complete if every $\mathfrak{R}$-preserving Cauchy sequence converges in $\mathcal{E}$.
5.
$\mathfrak{R}$ is said to be ℑ-closed if $(\nu, \vartheta ) \in \mathfrak{R} \Rightarrow (\Im \nu, \Im \vartheta )\in \mathfrak{R}$. It is said to be weakly ℑ-closed if $(\nu, \vartheta ) \in \mathfrak{R} \Rightarrow [\Im \nu,\Im \vartheta ] \in \mathfrak{R}$.
6.
$\mathfrak{R}$ is said to be d-self-closed if for every $\mathfrak{R}$-preserving sequence with $\nu _{n}\rightarrow \nu $, there is a subsequence $(\nu _{n_{k}})$ of $(\nu _{n})$ such that $[\nu _{n_{k}},\nu ]\in \mathfrak{R}$ for all $k\in \mathbb{N}\cup \{0\}$.
7.
A subset $\mathfrak{Z}$ of $\mathcal{E}$ is called $\mathfrak{R}$-directed if for each $\nu,\vartheta \in \mathfrak{Z}$, there exists $\mu \in \mathcal{E}$ such that $(\nu,\mu )\in \mathfrak{R}$ and $(\vartheta,\mu )\in \mathfrak{R}$. It is called $(\Im,\mathfrak{R})$-directed if for each $\nu, \vartheta \in \mathfrak{Z}$ there exists $\mu \in \mathcal{E}$ such that $(\nu,\Im \mu )\in \mathfrak{R}$ and $(\vartheta, \Im \mu )\in \mathfrak{R}$.
8.
ℑ is said to be $\mathfrak{R}$-continuous at ν if for every $\mathfrak{R}$-preserving sequence $(\nu _{n})$ converging to ν, we get $\Im (\nu _{n})\rightarrow \Im (\nu )$ as $n\rightarrow \infty $. Moreover, ℑ is said to be $\mathfrak{R}$-continuous if it is $\mathfrak{R}$-continuous at every point of $\mathcal{E}$.
9.
For $\nu,\vartheta \in \mathcal{E}$, a path of length k (where k is a natural number) in $\mathfrak{R}$ from ν to ϑ is a finite sequence $\{\mu _{0}, \mu _{1}, \mu _{2}, \ldots, \mu _{k}\}\subset \mathcal{E}$ satisfying the following conditions:
1. (i)
  $\mu _{0}=\nu $ and $\mu _{k}=\vartheta $,
2. (ii)
  $(\mu _{i}, \mu _{i+1})\in \mathfrak{R}$ for each i $(0\leq i\leq k-1)$,
then this finite sequence is called a path of length k joining ν to ϑ in $\mathfrak{R}$.
10.
If, for a pair of $\nu,\vartheta \in \mathcal{E}$, there is a finite sequence $\{\mu _{0}, \mu _{1}, \mu _{2}, \ldots, \mu _{k}\}\subset \mathcal{E}$ satisfying the following conditions:
1. (i)
  $\Im \mu _{0}=\nu $ and $\Im \mu _{k}=\vartheta $,
2. (ii)
  $(\Im \mu _{i}, \Im \mu _{i+1})\in \mathfrak{R}$ for each i $(0\leq i\leq k-1)$,
then this finite sequence is called a ℑ-path of length k joining ν to ϑ in $\mathfrak{R}$.

Notice that a path of length k involves $k+1$ elements of $\mathcal{E}$ although they are not necessarily distinct.

We fix the following notation for a relational metric space $({\mathcal{E}},d, \mathfrak{R})$, a self-mapping ℑ on $\mathcal{E}$, and an $\mathfrak{R}$-directed subset $\mathfrak{D}$ of $\mathcal{E}$:

(i)
$\mathrm{Fix}(\Im ):=$ the set of all fixed points of ℑ,
(ii)
$\mathfrak{X}(\Im,\mathfrak{R}):=\{\nu \in \mathcal{E}: (\nu, \Im \nu )\in {\mathfrak{R}}\}$,
(iii)
$\mathfrak{P}(\nu,\vartheta,\mathfrak{R}):=$ the class of all paths in $\mathfrak{R}$ from ν to ϑ in $\mathfrak{R}$, where $\nu,\vartheta \in \mathcal{E}$.

3 Results on Suzuki-FG contractive mappings

Definition 3.1

([18])

The collection of all functions $\mathcal{F}: \mathbb{R}\mathbbm{_{+}}\to \mathbb{R}$ satisfying:

($\mathbb{F}_{1}$):: $\mathcal{F}$ is continuous and strictly increasing;
($\mathbb{F}_{2}$):: for each $\{\xi _{n}\}\subseteq \mathbb{R}\mathbbm{_{+}}$, $\lim_{n \to \infty}\xi _{n}=0$ iff $\lim_{n \to \infty}\mathcal{F}(\xi _{n})=- \infty $,

will be denoted by $\mathbb{F}$.

The collection of all pairs of mappings $(\mathcal{G},\beta )$, where $\mathcal{G}: \mathbb{R}\mathbbm{_{+}} \to \mathbb{R}$, $\beta: \mathbb{R}_{+} \to [0,1)$, satisfying:

($\mathbb{F}_{3}$):: for each $\{\xi _{n}\}\subseteq \mathbb{R}\mathbbm{_{+}}$, $\limsup_{n \to \infty}\mathcal{G}(\xi _{n})\ge 0$ iff $\limsup_{n \to \infty}\xi _{n}\ge 1$;
($\mathbb{F}_{4}$):: for each $\{\xi _{n}\}\subseteq \mathbb{R}_{+}$, $\limsup_{n \to \infty}\beta (\xi _{n})=1$ implies $\lim_{n \to \infty}\xi _{n}=0$;
($\mathbb{F}_{5}$):: for each $\{\xi _{n}\}\subseteq \mathbb{R}\mathbbm{_{+}}$, $\sum_{n=1}^{\infty} \mathcal{G}(\beta (\xi _{n}))=-\infty $,

will be denoted by $\mathbb{G}_{\beta}$.

Definition 3.2

Let $(\mathcal{E}, d, \mathfrak{R})$ be an RMS and $\mathcal{P} \colon \mathcal{E} \to \mathcal{E}$ be a given mapping. A mapping $\mathcal{P} $ is said to be a Suzuki-FG contractive mapping if there exist $\mathcal{F}\in \mathbb{F} $ and $(\mathcal{G},\beta )\in \mathbb{G}_{\beta}$ such that, for $(\nu,\vartheta ) \in \mathcal{E}$ with $(\nu,\vartheta ) \in \mathfrak{R}^{*}$,

$$\begin{aligned} \textstyle\begin{cases} \frac{1}{2} d(\nu,\mathcal{P} \nu ) \leq d(\nu, \vartheta ) \quad\text{implies } \\ \mathcal{F}(d(\mathcal{P} \nu,\mathcal{P} \vartheta ))\leq \mathcal{F}(\mathfrak{N}(\nu, \vartheta ))+ \mathcal{G}(\beta ( \mathfrak{N}(\nu, \vartheta ))), \end{cases}\displaystyle \end{aligned}$$

(2)

where

$$\begin{aligned} \begin{aligned} &\mathfrak{N}(\nu, \vartheta )= \max \biggl\{ d(\nu,\vartheta ),d( \nu,\mathcal{P} \nu ),d(\vartheta,\mathcal{P} \vartheta ), \frac{d(\nu,\mathcal{P} \vartheta )+ d(\vartheta,\mathcal{P} \nu )}{2} \biggr\} , \\ &\mathfrak{R}^{*} = \bigl\{ (\nu, \vartheta ) \in \mathfrak{R} \mid \mathcal{P}\nu \neq \mathcal{P}\vartheta \bigr\} . \end{aligned} \end{aligned}$$

(3)

We denote by $(\mathrm{SFG})_{\mathfrak{R}}$ the collection of all Suzuki-FG contractive mappings on $(\mathcal{E},d,\mathfrak{R})$.

Now, we are equipped to state and prove our first main result as follows.

Theorem 3.3

Let $(\mathcal{E}, d, \mathfrak{R})$ be an RMS and $\mathcal{P} \colon \mathcal{E} \to \mathcal{E}$. Suppose that the following conditions hold:

($C_{1}$):: $\mathfrak{X}(\mathcal{P},\mathfrak{R}) \neq \emptyset $;
($C_{2}$):: $\mathfrak{R}$ is $\mathcal{P} $-closed and $\mathcal{P} $-transitive;
($C_{3}$):: $\mathcal{E}$ is $\mathfrak{R}$-complete;
($C_{4}$):: $\mathcal{P} \in (\mathrm{SFG})_{\mathfrak{R}}$;
($C_{5}$):: ℑ is $\mathfrak{R}$-continuous or
($C'_{5}$):: $\mathfrak{R}$ is d-self-closed.

Then there exists a point $\nu _{*} \in \mathrm{Fix}(\mathcal{P} )$.

Proof

Starting with $\nu _{0}\in \mathcal{E}$ given by ($C_{1}$), we construct a sequence $\{\nu _{n}\}$ of Picard iterates $\nu _{n+1} = \mathcal{P}^{n}(\nu _{0})$ for all $n\in \mathbb{N}^{*}$.

Using ($C_{1}$)–($C_{2}$), we have that $(\mathcal{P}\nu _{0}, \mathcal{P}^{2} \nu _{0}) \in \mathfrak{R}$. Continuing this process inductively, we obtain

$$\begin{aligned} \bigl(\mathcal{P}^{n} \nu _{0}, \mathcal{P}^{n+1} \nu _{0} \bigr) \in \mathfrak{R} \end{aligned}$$

(4)

for any $n \in \mathbb{N}^{*}$. Hence, $\{\nu _{n}\}$ is an $\mathfrak{R}$-preserving sequence.

Now, if there exists some $n_{0}\in \mathbb{N}^{*}$ such that $d(\nu _{n_{0}}, \mathcal{P}\nu _{n_{0}})=0$, then the result follows immediately. Otherwise, for all $n\in \mathbb{N}^{*}, d(\nu _{n}, \mathcal{P}\nu _{n})>0$ so that $\mathcal{P}\nu _{n}\neq \mathcal{P}\nu _{n+1}$ which implies that $(\nu _{n}, \nu _{n+1})\in \mathfrak{R}^{*}$ and $\frac{1}{2}d(\nu _{n}, \mathcal{P}\nu _{n})< d(\nu _{n}, \mathcal{P}\nu _{n})$. Therefore, using $(C_{4})$ for $\nu = \nu _{n}$, $\vartheta =\nu _{n+1}$, we have

$$\begin{aligned} \mathcal{F} \bigl(d(\mathcal{P}\nu _{n},\mathcal{P}\nu _{n+1}) \bigr)\leq \mathcal{F} \bigl(\mathfrak{N}(\nu _{n},\nu _{n+1}) \bigr)+ \mathcal{G} \bigl(\beta \bigl( \mathfrak{N}(\nu _{n},\nu _{n+1}) \bigr) \bigr), \end{aligned}$$

where

$$\begin{aligned} \mathfrak{N}(\nu _{n},\nu _{n+1}) & = \max \begin{Bmatrix} d(\nu _{n},\nu _{n+1}),d(\nu _{n}, \mathcal{P}\nu _{n}),d(\nu _{n+1}, \mathcal{P}\nu _{n+1}), \\ \frac{d(\nu _{n},\mathcal{P}\nu _{n+1})+d(\nu _{n+1},\mathcal{P}\nu _{n})}{2} \end{Bmatrix} \\ & = \max \begin{Bmatrix} d(\nu _{n},\nu _{n+1}),d(\nu _{n},\nu _{n+1}),d(\nu _{n+1},\nu _{n+2}), \\ \frac{ d(\nu _{n},\nu _{n+2})}{2} \end{Bmatrix} \\ & \leq \max \begin{Bmatrix} d(\nu _{n},\nu _{n+1}),d(\nu _{n},\nu _{n+1}),d(\nu _{n+1},\nu _{n+2}), \\ \frac{ d(\nu _{n},\nu _{n+1})+d(\nu _{n+1},\nu _{n+2})}{2} \end{Bmatrix} \\ &=\max \bigl\{ d(\nu _{n},\nu _{n+1}),d(\nu _{n+1},\nu _{n+2}) \bigr\} . \end{aligned}$$

If $\mathfrak{N}(\nu _{n},\nu _{n+1})= d(\nu _{n+1},\nu _{n+2})$, then

$$\begin{aligned} \mathcal{F} \bigl(d(\nu _{n+1},\nu _{n+2}) \bigr) \leq \mathcal{F} \bigl(d(\nu _{n+1}, \nu _{n+2}) \bigr)+ \mathcal{G} \bigl(\beta \bigl(d(\nu _{n+1},\nu _{n+2}) \bigr) \bigr), \end{aligned}$$

which implies $\mathcal{G}(\beta (d(\nu _{n+1},\nu _{n+2})))\geq 0$, i.e., $\beta (d(\nu _{n+1},\nu _{n+2}))\geq 1$, a contradiction. Therefore

$$\begin{aligned} d(\nu _{n+1},\nu _{n+2})\leq d(\nu _{n},\nu _{n+1}) \quad\text{for all } n \in \mathbb{N}, \end{aligned}$$

(5)

and so

$$\begin{aligned} \mathcal{F} \bigl(d(\nu _{n+1},\nu _{n+2}) \bigr) \leq \mathcal {\mathcal{F}} \bigl(d(\nu _{n}, \nu _{n+1}) \bigr)+ \mathcal{G} \bigl(\beta \bigl(d(\nu _{n},\nu _{n+1}) \bigr) \bigr) \end{aligned}$$

for all $n\in \mathbb{N}$. Consequently,

$$\begin{aligned} \mathcal{F} \bigl(d(\nu _{n},\nu _{n+1}) \bigr)&\leq \mathcal{F} \bigl(d(\nu _{n-1}, \nu _{n}) \bigr)+ \mathcal{G} \bigl(\beta \bigl(d(\nu _{n-1},\nu _{n}) \bigr) \bigr) \\ &\vdots \\ &\leq \mathcal{F} \bigl(d(\nu _{0},\nu _{1}) \bigr)+ \sum _{i=1}^{i=n} \mathcal{G} \bigl(\beta \bigl(d( \nu _{i},\nu _{i-1}) \bigr) \bigr). \end{aligned}$$

(6)

Letting $n\to \infty $ gives $\lim_{n\to \infty} \mathcal{F}(d(\nu _{n},\nu _{n+1}))=- \infty $ and $\mathcal{F} \in \mathbb{F}$ gives

$$\begin{aligned} \lim_{n\to \infty} d(\nu _{n},\nu _{n+1})=0. \end{aligned}$$

(7)

We will now show that the sequence $\{\nu _{n}\}$ is an $\mathfrak{R}$-preserving Cauchy sequence in $(\mathcal{E},d)$. On the contrary, we suppose that there exist $\zeta >0$ and two subsequences $\{\nu _{n(j)} \}$ and $\{\nu _{m(j)} \}$ of $\{\nu _{n}\}$ such that $m(j)$ is the smallest index for which $m(j)>n(j)>j$ and

$$\begin{aligned} d(\nu _{m(j)},\nu _{n(j)})\geq \zeta. \end{aligned}$$

(8)

This means that $m(j)>n(j)>j$ and

$$\begin{aligned} d(\nu _{m(j)-1},\nu _{n(j)})< \zeta. \end{aligned}$$

(9)

On the other hand,

$$\begin{aligned} \zeta \leq d(\nu _{m(j)},\nu _{n(j)})\leq d(\nu _{m(j)}, \nu _{m(j)-1}) +d(\nu _{m(j)-1},\nu _{nj)})\leq d(\nu _{m(j)},\nu _{m(j)-1})+ \zeta. \end{aligned}$$

Taking $j\rightarrow \infty $ and using (7), we get

$$\begin{aligned} \lim_{j\rightarrow \infty} d(\nu _{m(j)},\nu _{n(j)})=\zeta, \end{aligned}$$

(10)

and hence

$$\begin{aligned} \lim_{j\rightarrow \infty}d(\nu _{m(j)+1},\nu _{n(j)+1})= \zeta. \end{aligned}$$

(11)

Then, from (7) and (10), one can select a positive integer $N\in \mathbb{N}$ such that

$$\begin{aligned} \frac{1}{2}d(\nu _{m(j)},\mathcal{P}\nu _{m(j)})< \frac{1}{2}\zeta < d( \nu _{m(j)},\nu _{n(j)}) \quad\text{for all }j\geq N. \end{aligned}$$

As the sequence $\{\nu _{n}\}$ is $\mathfrak{R}$-preserving and $\mathfrak{R}$ is $\mathcal{P}$-transitive, therefore $(\nu _{m(j)},\nu _{n(j)})\in \mathfrak{R}^{*}$, and we get

$$\begin{aligned} &\mathcal{F} \Bigl(\limsup_{j \to \infty}d(\nu _{m(j)+1},\nu _{n(j)+1}) \Bigr) \\ &\quad \le \mathcal{F} \Bigl(\limsup_{j \to \infty}\mathfrak{N}(\nu _{m(j)}, \nu _{n(j)}) \Bigr)+ \limsup_{\jmath \to \infty} \mathcal{G} \bigl( \beta \bigl(\mathfrak{N}(\nu _{m(j)},\nu _{n(j)}) \bigr) \bigr), \end{aligned}$$

(12)

where

$$\begin{aligned} &\mathfrak{N}(\nu _{m(j)},\nu _{n(j)}) \\ &\quad= \max \begin{Bmatrix} d(\nu _{m(j)},\nu _{n(j)}),d(\nu _{m(j)},\mathcal{P} \nu _{m(j)}), d( \nu _{n(j)}, \mathcal{P} \nu _{n(j)}), \\ \frac{d(\nu _{n(j)},\mathcal{P} \nu _{m(j)})+d(\nu _{m(j)},\mathcal{P} \nu _{n(j)})}{2} \end{Bmatrix} \\ &\quad= \max \begin{Bmatrix} d(\nu _{m(j)},\nu _{n(j)}),d(\nu _{m(j)},\nu _{m(j)+1}), d(\nu _{n(j)}, \nu _{n(j)+1}), \\ \frac{d(\nu _{n(j)},\nu _{m(j)+1})+d(\nu _{m(j)},\nu _{n(j)+1})}{2} \end{Bmatrix} \\ &\quad\leq \max \begin{Bmatrix} d(\nu _{m(j)},\nu _{n(j)}),d(\nu _{m(j)},\nu _{m(j)+1}), d(\nu _{n(j)}, \nu _{n(j)+1}), \\ \frac{d(\nu _{n(j)},\nu _{m(j)})+d(\nu _{m(j)},\nu _{m(j)+1})+d(\nu _{m(j)},\nu _{n(j)})+d(\nu _{n(j)},\nu _{n(j)+1})}{2} \end{Bmatrix}. \end{aligned}$$

Taking upper limit as $j\to \infty $ and making use of (7), (10), and (11), we get

$$\begin{aligned} \limsup_{j \to \infty} \mathfrak{N}(\nu _{m(j)}, \nu _{n(j)})= \limsup_{j \to \infty} d(\nu _{m(j)},\nu _{n(j)}). \end{aligned}$$

(13)

Therefore, from (12), (11), and (13), we have

$$\begin{aligned} \mathcal{F}(\zeta )& =\mathcal{F} \Bigl(\limsup_{j \to \infty}d( \nu _{m(j)+1},\nu _{n(j)+1}) \Bigr) \\ & \le \mathcal{F} \Bigl(\limsup_{j \to \infty}d(\nu _{m(j)},\nu _{n(j)}) \Bigr)+ \limsup_{\jmath \to \infty} \mathcal{G} \bigl(\beta \bigl(d(\nu _{m(j)}, \nu _{n(j)}) \bigr) \bigr) \\ & = \mathcal{F}(\zeta )+\limsup_{\jmath \to \infty} \mathcal{G}(\beta \bigl(d(\nu _{m(j)},\nu _{n(j)}) \bigr), \end{aligned}$$

which implies that $\limsup_{j\to \infty} \mathcal{G}(\beta (d(\nu _{m(j)}, \nu _{n(j)})))\geq 0$, which gives $\limsup_{j \to \infty} \beta (d (\nu _{m(j)}, \nu _{n(j)})) \ge 1$, and taking into account that $\beta (\xi )<1$ for all $\xi \ge 0$, we have $\limsup_{j\to \infty} \beta (d(\nu _{m(j)}, \nu _{n(j)}))= 1$. Therefore, $\limsup_{j\to \infty} d(\nu _{m(j)},\nu _{n(j)})= 0$, a contradiction. Hence, $\{\nu _{n}\}$ is an $\mathfrak{R}$ preserving Cauchy sequence in $\mathcal{E}$.

The $\mathfrak{R}$-completeness of $\mathcal{E}$ implies that there exists $\nu ^{*} \in \mathcal{E}$ such that $\lim_{n\to \infty}\nu _{n}=\nu ^{*}$. Now, first by (C5), we have

$$\begin{aligned} \nu ^{*}=\lim_{n\to \infty}\nu _{n+1}=\lim _{n\to \infty}\mathcal{P}\nu _{n}= \mathcal{P}\nu _{*}, \end{aligned}$$

(14)

and hence $\nu ^{*}$ is a fixed point of $\mathcal{P}$.

Alternatively, suppose that $\mathfrak{R}$ is d-self-closed. Then there exists a subsequence $\{\nu _{n_{k}}\}$ of $\{\nu _{n}\}$ with $[\nu _{n_{k}}, \nu _{*}]\in \mathfrak{R}$ for all $k \in \mathbb{N}^{*}$. Now, we assert that

$$\begin{aligned} \frac{1}{2}d(\nu _{n_{k}}, \mathcal{P}\nu _{n_{k}})< d( \nu _{n_{k}}, \nu _{*})\quad\text{or}\quad \frac{1}{2}d \bigl( \mathcal{P}\nu _{n_{k}}, \mathcal{P}^{2} \nu _{n_{k}} \bigr)< d(\mathcal{P} \nu _{n_{k}}, \nu _{*}) \end{aligned}$$

(15)

for all $k\in \mathbb{N}^{*}$.

Let, to the contrary, there exist $\varsigma \in \mathbb{N}$ such that

$$\begin{aligned} \frac{1}{2}d(\nu _{n(\varsigma )}, \mathcal{P}\nu _{n(\varsigma )})\geq d(\nu _{n(\varsigma )}, \nu _{*}) \quad\text{and}\quad \frac{1}{2}d \bigl(\mathcal{P}\nu _{n(\varsigma )}, \mathcal{P}^{2} \nu _{n(\varsigma )} \bigr)\geq d(\mathcal{P}\nu _{n( \varsigma )}, \nu _{*}), \end{aligned}$$

(16)

so that

$$\begin{aligned} 2d(\nu _{n(\varsigma )}, \nu _{*})\leq d(\nu _{n(\varsigma )}, \mathcal{P}\nu _{n(\varsigma )})\leq d(\nu _{n(\varsigma )}, \nu _{*})+d( \nu _{*}, \mathcal{P}\nu _{n(\varsigma )}), \end{aligned}$$

and

$$\begin{aligned} d(\nu _{n(\varsigma )}, \nu _{*})\leq d(\nu _{*}, \mathcal{P}\nu _{n(\varsigma )})\leq \frac{1}{2}d \bigl( \mathcal{P}\nu _{n( \varsigma )}, \mathcal{P}^{2}\nu _{n(\varsigma )} \bigr). \end{aligned}$$

(17)

Now, from (5) and using (16), (17), we have

$$\begin{aligned} d \bigl(\mathcal{P}\nu _{n(\varsigma )}, \mathcal{P}^{2}\nu _{n(\varsigma )} \bigr) &< d(\nu _{n(\varsigma )}, \mathcal{P}\nu _{n(\varsigma )}) \\ &\leq d(\nu _{n(\varsigma )}, \nu _{*})+d(\nu _{*}, \mathcal{P}\nu _{n( \varsigma )}) \\ &\leq \frac{1}{2}d \bigl(\mathcal{P}\nu _{n(\varsigma )}, \mathcal{P}^{2} \nu _{n(\varsigma )} \bigr)+\frac{1}{2}d \bigl( \mathcal{P}\nu _{n(\varsigma )}, \mathcal{P}^{2}\nu _{n(k)} \bigr) \\ &= d \bigl(\mathcal{P}\nu _{n(\varsigma )}, \mathcal{P}^{2}\nu _{n(\varsigma )} \bigr), \end{aligned}$$

a contradiction, and therefore (15) remains true.

Now, we distinguish two cases for $\Lambda =\{k\in \mathbb{N}: \mathcal{P} \nu _{n_{k}}= \mathcal{P} \nu _{*}\}$. If Λ is finite, then there exists $k_{0}\in \mathbb{N}$ such that $\mathcal{P}\nu _{n_{k}}\neq \mathcal{P}\nu _{*}$ for all $k>k_{0}$. It follows from (15) (for all $k>k_{0}$) that either

$$\begin{aligned} \mathcal{F} \bigl(d(\mathcal{P}\nu _{n_{k}},\mathcal{P}\nu _{*}) \bigr) \le \mathcal{F} \bigl(\mathfrak{N}(\nu _{n_{k}}, \nu _{*}) \bigr)+ \mathcal{G} \bigl( \beta \bigl(\mathfrak{N}(\nu _{n_{k}}, \nu _{*}) \bigr) \bigr), \end{aligned}$$

where

$$\begin{aligned} \mathfrak{N}(\nu _{n_{k}}, \nu _{*}) = \max \begin{Bmatrix} d(\nu _{n_{k}}, \nu _{*}),d(\nu _{n_{k}}, \mathcal{P}\nu _{n_{k}}), d( \nu _{*}, \mathcal{P}\nu _{*}), \\ \frac{d(\nu _{n_{k}},\mathcal{P}\nu _{*})+d(\nu _{*},\mathcal{P}\nu _{n_{k}})}{2} \end{Bmatrix}. \end{aligned}$$

Applying limit as $n \to \infty $, we get $\lim_{n \to \infty} \mathfrak{N}(\nu _{n_{k}}, \nu _{*})= d( \nu _{*},\mathcal{P}\nu _{*})$, which implies that $\limsup_{n\to \infty} \mathcal{G}(\beta (\mathfrak{N}(\nu _{n_{k}}, \nu _{*}))\geq 0$, which gives $\limsup_{n \to \infty} \beta (\mathfrak{N}(\nu _{n_{k}}, \nu _{*}))\ge 1$, and taking into account that $\beta (\xi )<1$ for all $\xi \ge 0$, we have $\limsup_{n \to \infty} \beta (\mathfrak{N}(\nu _{n_{k}}, \nu _{*}))= 1$. Therefore, $\limsup_{n \to \infty} \mathfrak{N}(\nu _{n_{k}}, \nu _{*})= 0$. Hence, $d(\nu _{*},\mathcal{P}\nu _{*})=0$, we get $\nu _{*}=\mathcal{P}\nu _{*}$.

Otherwise, if Λ is not finite, then there is a subsequence $\{\nu _{n(k(\varsigma ))}\}$ of $\{\nu _{n_{k}}\}$ such that

$$\begin{aligned} \nu _{n(k(\varsigma ))+1}=\mathcal{P}\nu _{n(k(\varsigma ))}= \mathcal{P} \nu _{*},\quad \forall \varsigma \in \mathbb{N}. \end{aligned}$$

As $\nu _{n_{k}}\rightarrow ^{d} \nu _{*}$, therefore $\mathcal{P}\nu _{*}=\nu _{*}$. □

Theorem 3.4

In addition to the assumptions of Theorem 3.3, let $\mathfrak{P}(\nu,\vartheta;\mathfrak{R}|_{\mathcal{P} (\mathcal{E})}) \neq \emptyset $ for all $\nu,\vartheta \in \mathcal{P} (\mathcal{E})$. Then $\mathcal{P} $ has a unique fixed point.

Proof

In view of Theorem 3.3, $\mathrm{Fix} (\mathcal{P} ) \neq \emptyset $. If $\mathrm{Fix} (\mathcal{P} )$ is a singleton, then we concluded the proof. Otherwise, let $\nu _{*} \neq \varpi \in \mathrm{Fix} (\mathcal{P} )$. Since $\mathfrak{P}(\nu,\vartheta; \mathfrak{R}|_{\mathcal{P} ( \mathcal{E})}) \neq \emptyset $ for all $\vartheta, \nu \in \mathcal{P} (\mathcal{E})$, there exists a path $\{\mathcal{P} z_{0}, \mathcal{P} z_{1}, \ldots, \mathcal{P} z_{k} \}$ of some length k in $\mathfrak{R}|_{\mathcal{P} (\mathcal{E})}$ such that $\mathcal{P} z_{0}=\nu _{*}, \mathcal{P} z_{k}=\varpi $ and $(\mathcal{P} z_{j}, \mathcal{P} z_{j+1})\in \mathfrak{R}|_{ \mathcal{P} (\mathcal{E})} $ for each $j=0,1,2, \ldots, k-1$. Since $\mathfrak{R}$ is $\mathcal{P} $-transitive, we have

$$\begin{aligned} (\nu _{*}, \mathcal{P} z_{1})\in \mathfrak{R}, (\mathcal{P} z_{1}, \mathcal{P} z_{2})\in \mathfrak{R}, \ldots, ( \mathcal{P} z_{k-1}, \varpi )\in \mathfrak{R} \Rightarrow (\nu _{*}, \varpi )\in \mathfrak{R}. \end{aligned}$$

Also, due to the fact $\frac{1}{2}d(\nu _{*}, \mathcal{P} \nu _{*})< d(\nu _{*}, \varpi )$ and $(\nu _{*}, \varpi ) \in \mathfrak{R^{*}}$, we have

$$\begin{aligned} \mathcal{F} \bigl(d(\mathcal{P}\nu _{*},\mathcal{P} \varpi ) \bigr)\leq \mathcal{F} \bigl(\mathfrak{N}(\varpi,\nu _{*}) \bigr)+ \mathcal{G} \bigl(\beta \bigl( \mathfrak{N}(\varpi,\nu _{*}) \bigr) \bigr), \end{aligned}$$

(18)

where

$$\begin{aligned} \mathfrak{N}(\varpi,\nu _{*}) & = \max \biggl\{ d(\nu _{*}, \varpi ),d( \nu _{*},\mathcal{P}\nu _{*}),d(\varpi, \mathcal{P} \varpi ), \frac{d(\nu _{*},\mathcal{P}\varpi )+d(\varpi,\mathcal{P}\nu _{*})}{2} \biggr\} \\ & = d(\nu _{*},\varpi ), \end{aligned}$$

which on substituting in (18) gives

$$\begin{aligned} \mathcal{F} \bigl(d(\nu _{*},\varpi ) \bigr)\leq \mathcal{F} \bigl(d( \nu _{*},\varpi ) \bigr)+ \mathcal{G} \bigl(\beta \bigl(d(\nu _{*}, \varpi ) \bigr) \bigr), \end{aligned}$$

which gives $\mathcal{G}(\beta (d(\nu _{*},\varpi )))\geq 0$ implies $\beta (d(\nu _{*},\varpi ))\geq 1$, a contradiction. Therefore $d(\nu _{*},\varpi )=0$. □

Theorem 3.5

In addition to the hypotheses of Theorem 3.3 (or Theorem 3.4), if any of the following conditions is fulfilled:

(I)
for all $u, v \in \mathcal{E}$, there exists $z \in \mathcal{E}$ such that
$$\begin{aligned} \bigl\{ (z,\mathcal{P}z),(z, u),(z,v) \bigr\} \subseteq \mathfrak{R}; \end{aligned}$$
(19)
(II)
the set $\mathcal{P}(\mathcal{E})$ is $\mathfrak{R}$-directed;
(III)
$\mathfrak{R}|_{\mathcal{P(E)}}$ is complete;
(IV)
$\mathcal{Y}(u,v,\mathrm{Fix}(\mathcal{P}),\mathfrak{R}^{s})$ is nonempty for each $u,v\in \mathrm{Fix}(\mathcal{P})$,

then $\mathcal{P}$ has a unique fixed point.

Proof

In view of Theorem 3.3 (or Theorem 3.4), $\mathrm{Fix}(\mathcal {P}) \neq \emptyset $.

Assume (I). Suppose that there exist distinct fixed points u and v of $\mathcal{P}$. We will consider the following two cases.

Case (A): We have $(u,v) \in \mathfrak{R}$, then $\mathcal{P}^{n}u = u$ and $\mathcal{P}^{n}v =v $ such that $(\mathcal{P}^{n}u,\mathcal{P}^{n}v) \in \mathcal{R^{*}}$ for $n = 0, 1,\ldots $ . Now, we assert that
$$\begin{aligned} \frac{1}{2}d \bigl(\mathcal{P}^{n} u, \mathcal{P}^{n+1} u \bigr)< d \bigl(\mathcal{P}^{n}u, \mathcal{P}^{n}v \bigr)\quad\text{or}\quad \frac{1}{2}d \bigl( \mathcal{P}^{n+1} u, \mathcal{P}^{n+2} u \bigr)< d \bigl( \mathcal{P}^{n+1} u,\mathcal{P}^{n}v \bigr). \end{aligned}$$
(20)

Let, to the contrary, there exist $\varsigma \in \mathbb{N}$ such that
$$\begin{aligned} \frac{1}{2}d \bigl(\mathcal{P}^{n} u_{\varsigma}, \mathcal{P}^{n+1} u_{\varsigma}\bigr)\geq d \bigl( \mathcal{P}^{n} u_{\varsigma}, \mathcal{P}^{n} v_{\varsigma}\bigr) \end{aligned}$$
(21)
and
$$\begin{aligned} \frac{1}{2}d \bigl(\mathcal{P}^{n+1} u_{\varsigma}, \mathcal{P}^{n+2} u_{\varsigma}\bigr)\geq d \bigl( \mathcal{P}^{n+1} u_{\varsigma}, \mathcal{P}^{n} v_{\varsigma}\bigr). \end{aligned}$$
(22)
These imply that
$$\begin{aligned} 2d \bigl(\mathcal{P}^{n} u_{\varsigma}, \mathcal{P}^{n} v_{\varsigma}\bigr)\leq d \bigl( \mathcal{P}^{n} u_{\varsigma}, \mathcal{P}^{n+1} u_{\varsigma}\bigr)\leq d \bigl( \mathcal{P}^{n} u_{\varsigma}, \mathcal{P}^{n} v_{\varsigma}\bigr)+d \bigl( \mathcal{P}^{n} v_{\varsigma}, \mathcal{P}^{n+1} u_{\varsigma}\bigr), \end{aligned}$$
and so
$$\begin{aligned} d \bigl(\mathcal{P}^{n} u_{\varsigma}, \mathcal{P}^{n} v_{\varsigma}\bigr)\leq d \bigl( \mathcal{P}^{n} v_{\varsigma}, \mathcal{P}^{n+1} u_{\varsigma}\bigr)\leq \frac{1}{2}d \bigl(\mathcal{P}^{n+1} u_{\varsigma}, \mathcal{P}^{n+2} u_{\varsigma}\bigr). \end{aligned}$$
(23)
Now, from (5) and using (21)–(23), we have
$$\begin{aligned} d \bigl(\mathcal{P}^{n+1} u_{\varsigma}, \mathcal{P}^{n+2} u_{\varsigma}\bigr) &< d \bigl(\mathcal{P}^{n} u_{\varsigma}, \mathcal{P}^{n+1} u_{\varsigma}\bigr) \\ &\leq d \bigl(\mathcal{P}^{n} u_{\varsigma}, \mathcal{P}^{n} v_{\varsigma}\bigr)+d \bigl( \mathcal{P}^{n} v_{\varsigma}, \mathcal{P}^{n+1} u_{\varsigma}\bigr) \\ &\leq \frac{1}{2}d \bigl(\mathcal{P}^{n+1} u_{\varsigma}, \mathcal{P}^{n+2} u_{\varsigma}\bigr)+\frac{1}{2}d \bigl( \mathcal{P}^{n+1} u_{\varsigma}, \mathcal{P}^{n+2} u_{\varsigma}\bigr) \\ &= d \bigl(\mathcal{P}^{n+1} u_{\varsigma}, \mathcal{P}^{n+2} u_{\varsigma}\bigr), \end{aligned}$$
a contradiction, and therefore (20) remains true. Therefore, using condition (2),
$$\begin{aligned} \mathcal{F}(d \bigl(\mathcal{P}^{n+1} u,\mathcal{P}^{n+1} v \bigr) \le \mathcal{F} \bigl(\mathfrak{N} \bigl(\mathcal{P}^{n}u, \mathcal{P}^{n}v \bigr) \bigr)+ \mathcal{G} \bigl(\beta \bigl( \mathfrak{N} \bigl(\mathcal{P}^{n}u,\mathcal{P}^{n}v \bigr) \bigr) \bigr), \end{aligned}$$
where
$$\begin{aligned} \mathfrak{N} \bigl(\mathcal{P}^{n}u,\mathcal{P}^{n}v \bigr))=\max \begin{Bmatrix} d \bigl(\mathcal{P}^{n}u, \mathcal{P}^{n}v \bigr),d \bigl(\mathcal{P}^{n}u, \mathcal{P}^{n+1}u \bigr),d \bigl( \mathcal{P}^{n}v, \mathcal{P}^{n+1}v \bigr), \\ \frac{d(\mathcal{P}^{n}u,\mathcal{P}^{n+1} v)+d(\mathcal{P}^{n} v,\mathcal{P}^{n+1} u)}{2} \end{Bmatrix}. \end{aligned}$$
Since u and v are fixed points of $\mathcal{P}$, we have
$$\begin{aligned} \mathfrak{N} \bigl(\mathcal{P}^{n}u,\mathcal{P}^{n}v \bigr)=d(u,v), \end{aligned}$$
and so we get
$$\begin{aligned} \mathcal{F} \bigl(d(u,v ) \bigr) & \le \mathcal{F} \bigl(d(u,v) \bigr)+ \mathcal{G} \bigl(\beta \bigl(d(u,v) \bigr) \bigr), \end{aligned}$$
which gives $\mathcal{G}(\beta (d(u,v)))\geq 0$, and so $\beta (d(u,v))\geq 1$, a contradiction. Therefore the fixed point is unique.

Case (B): By assumption (I), there exists $z \in \mathcal{E}$ satisfying condition (19). Due to the $\mathcal{P}$-closedness of $\mathfrak{R}$, we get
$$\begin{aligned} \bigl(\mathcal{P}^{n-1}z,u \bigr) \in \mathfrak{R},\qquad \bigl( \mathcal{P}^{n-1}z,v \bigr) \in \mathfrak{R}. \end{aligned}$$
Now, we assert that
$$\begin{aligned} \frac{1}{2}d \bigl(\mathcal{P}^{n-1} z, \mathcal{P}^{n} z \bigr)< d \bigl(\mathcal{P}^{n-1}z,u \bigr) \quad\text{or}\quad \frac{1}{2}d \bigl(\mathcal{P}^{n} z, \mathcal{P}^{n+1} z \bigr)< d \bigl( \mathcal{P}^{n}z,u \bigr). \end{aligned}$$
(24)

Let, to the contrary, there exist $\varsigma \in \mathbb{N}$ such that
$$\begin{aligned} \frac{1}{2}d \bigl(\mathcal{P}^{n-1} z_{\varsigma}, \mathcal{P}^{n} z_{\varsigma}\bigr) \geq d \bigl( \mathcal{P}^{n-1}z_{\varsigma},u_{\varsigma}\bigr) \end{aligned}$$
(25)
and
$$\begin{aligned} \frac{1}{2}d \bigl(\mathcal{P}^{n} z_{\varsigma}, \mathcal{P}^{n+1} z_{\varsigma}\bigr) \geq d \bigl( \mathcal{P}^{n}z_{\varsigma},u_{\varsigma}\bigr). \end{aligned}$$
(26)
These imply that
$$\begin{aligned} 2d \bigl(\mathcal{P}^{n-1} z_{\varsigma}, u_{\varsigma}\bigr) \leq d \bigl(\mathcal{P}^{n-1} z_{\varsigma}, \mathcal{P}^{n} z_{\varsigma}\bigr)\leq d \bigl(\mathcal{P}^{n-1} z_{\varsigma}, u_{\varsigma}\bigr)+d \bigl(u_{\varsigma}, \mathcal{P}^{n} z_{\varsigma}\bigr), \end{aligned}$$
which implies that (using (26))
$$\begin{aligned} d \bigl(\mathcal{P}^{n-1} z_{\varsigma}, u_{\varsigma}\bigr)\leq d \bigl(u_{\varsigma}, \mathcal{P}^{n} z_{\varsigma}\bigr)\leq \frac{1}{2}d \bigl(\mathcal{P}^{n} z_{\varsigma}, \mathcal{P}^{n+1} z_{\varsigma}\bigr). \end{aligned}$$
(27)
Now, from (5) and using (25)–(27), we have
$$\begin{aligned} d \bigl(\mathcal{P}^{n} z_{\varsigma}, \mathcal{P}^{n+1} z_{\varsigma}\bigr) &< d \bigl(\mathcal{P}^{n-1} z_{\varsigma}, \mathcal{P}^{n} z_{\varsigma}\bigr) \\ &\leq d \bigl(\mathcal{P}^{n-1} z_{\varsigma}, u_{\varsigma}\bigr)+d \bigl(u_{\varsigma}, \mathcal{P}^{n} z_{\varsigma}\bigr) \\ &\leq \frac{1}{2}d \bigl(\mathcal{P}^{n} z_{\varsigma}, \mathcal{P}^{n+1} z_{\varsigma}\bigr)+\frac{1}{2}d \bigl( \mathcal{P}^{n} z_{\varsigma}, \mathcal{P}^{n+1} z_{\varsigma}\bigr) \\ &= d \bigl(\mathcal{P}^{n} z_{\varsigma}, \mathcal{P}^{n+1} z_{\varsigma}\bigr), \end{aligned}$$
a contradiction, and therefore (24) remains true. Therefore, using condition (2),
$$\begin{aligned} \mathcal{F} \bigl(d \bigl(\mathcal{P}^{n} z, u \bigr) \bigr) \leq \mathcal{F} \bigl(\mathfrak{N} \bigl( \mathcal{P}^{n-1} z, u \bigr) \bigr)+ \mathcal{G} \bigl(\beta \bigl(\mathfrak{N} \bigl( \mathcal{P}^{n-1} z, u \bigr) \bigr) \bigr), \end{aligned}$$
(28)
where
$$\begin{aligned} &\mathfrak{N} \bigl(\mathcal{P}^{n-1} z, u \bigr) \\ &\quad=\max \biggl\{ d \bigl(\mathcal{P}^{n-1} z,u \bigr),d \bigl( \mathcal{P}^{n-1} z, \mathcal{P}^{n} z \bigr),d(u, \mathcal{P}u), \frac{d(\mathcal{P}^{n-1}z,\mathcal{P}u)+ d(u,\mathcal{P}^{n} z)}{2} \biggr\} \\ &\quad\leq \max \biggl\{ d \bigl(\mathcal{P}^{n-1} z,u \bigr),d \bigl( \mathcal{P}^{n-1} z, \mathcal{P}^{n} z \bigr), d(u, \mathcal{P}u), \frac{2d(\mathcal{P}^{n-1}z,u)+ d(\mathcal{P}^{n-1}z,\mathcal{P}^{n} z)}{2} \biggr\} \\ &\quad\leq \max \bigl\{ d \bigl(\mathcal{P}^{n-1}z,u \bigr),d \bigl( \mathcal{P}^{n-1}z,\mathcal{P}^{n} z \bigr),d(u,\mathcal{P}u) \bigr\} . \end{aligned}$$
Using $(z,\mathcal{P}z)\in \mathfrak{R}$, similarly as in the proof of Theorem 3.3, it can be shown that $d(\mathcal{P}^{n-1}z,\mathcal{P}^{n}z)\to 0$ as $n\to \infty $. Therefore, for n sufficiently large,
$$\begin{aligned} \max \bigl\{ d \bigl(\mathcal{P}^{n-1}z,u \bigr),d \bigl( \mathcal{P}^{n-1}z,\mathcal{P}^{n} z \bigr), d(u,\mathcal{P}u) \bigr\} =d \bigl(\mathcal{P}^{n-1}z,u \bigr) \end{aligned}$$
and from (28) we have
$$\begin{aligned} \mathcal{F} \bigl(d \bigl(\mathcal{P}^{n} z, u \bigr) \bigr) \leq \mathcal{F} \bigl(d \bigl(\mathcal{P}^{n-1}z,u \bigr) \bigr)+ \mathcal{G} \bigl(\beta \bigl(d \bigl(\mathcal{P}^{n-1}z,u \bigr) \bigr) \bigr). \end{aligned}$$
As in the proof of Theorem 3.3, it can be shown that $d(\mathcal{P}^{n} z, u) \leq d(\mathcal{P}^{n-1}z, u)$. It follows that the sequence $\{d(\mathcal{P}^{n} z, u)\}$ is nonincreasing. As earlier, we have
$$\begin{aligned} \lim_{n \to \infty} d \bigl(\mathcal{P}^{n} z, u \bigr)=0. \end{aligned}$$
Also, since $(z,v)\in \mathfrak{R}$, proceeding as earlier, we can prove that
$$\begin{aligned} \lim_{n \to \infty} d \bigl(\mathcal{P}^{n} z, v \bigr)=0, \end{aligned}$$
and by using limit uniqueness, we infer that $u=v$; i.e., the fixed point of $\mathcal{P}$ is unique.
Assume (II). For any two fixed points $u,v$ of $\mathcal{P}$, there must be an element $z\in \mathcal{P}(\mathcal{E})$ such that
$$\begin{aligned} (z,u)\in \mathfrak{R}\quad \text{and}\quad (z,v)\in \mathfrak{R}. \end{aligned}$$
As $\mathfrak{R}$ is $\mathcal{P}$-closed, so for all $n \in \mathbb{N}\cup \{0\}$,
$$\begin{aligned} \bigl(\mathcal{P}^{n}z,u\bigr)\in \mathfrak{R} \quad\text{and}\quad \bigl(\mathcal{P}^{n}z,v\bigr)\in \mathfrak{R}. \end{aligned}$$
In the line of proof of Case(B) (I), we obtain $u=v$, i.e., $\mathcal{P}$ has a unique fixed point.
Assume (III). Suppose that $u,v$ are two fixed points of $\mathcal{P}$. Then we must have $(u,v)\in \mathfrak{R}$, and since ${u}\neq \mathcal{P} v$, we have $(v,u)\in \mathcal{R^{*}}$. Also we can get $\frac{1}{2}d(u,\mathcal{P} u) \leq d(u,v)$ following the lines of the proof of Case A (I). Therefore, using condition (2),
$$\begin{aligned} \mathcal{F}(d(\mathcal{P} u,\mathcal{P} v ) & \le \mathcal{F} ( \mathfrak{N}(u,v)+ \mathcal{G} \bigl(\beta \bigl(\mathfrak{N}(u,v) \bigr) \bigr), \end{aligned}$$
where
$$\begin{aligned} \mathfrak{N}(u,v) & = \max \biggl\{ d(u,v),d(u,\mathcal{P}u),d(v, \mathcal{P} v), \frac{d(u,\mathcal{P} v)+ d( v,\mathcal{P} u)}{2} \biggr\} \\ &=d(u,v), \end{aligned}$$
which gives $\mathcal{G}(\beta (d(u,v))\geq 0$, and so $\beta (d(u,v)\geq 1$, a contradiction. Therefore the fixed point is unique. In a similar way, if $(v,u)\in \mathfrak{R}$, we have $u=v$.
Assume (IV). Suppose that $u,v$ are two fixed points of $\mathcal{P}$. Let $\{z_{0},z_{1},\dots,z_{k}\}$ be an $\mathfrak{R}^{s}$-path in $\mathrm{Fix}(\mathcal{P})$ connecting u and v. As in Case(A) (I), it must be $z_{i-1}=z_{i}$ for each $i=1,2,\dots,k$, and it follows that $u=v$.

□

If we take $\mathfrak{R} =\{(\nu, \nu )\in \mathcal{E} \times \mathcal{E} \mid \nu \preceq \nu \}$, then we have more new results as consequences of Theorem 3.3.

Corollary 3.6

Let $(\mathcal{E}, d, \preceq )$ be an ordered complete metric space. Let $\mathcal{P} \colon \mathcal{E} \to \mathcal{E}$ be increasing and $(\mathrm{SFG})_{\mathfrak{R}}$ on $\mathcal{E}_{\preceq}$. Suppose that there exists $\nu _{0}\in \mathcal{E}$ such that $\nu _{0}\preceq \mathcal{P} \nu _{0}$. If $\mathcal{P} $ is $\mathcal{E}_{\preceq}$-continuous or $\mathcal{E}_{\preceq}$ is d-self-closed, then $\nu _{*} \in \mathrm{Fix}(\mathcal{P} )$. Moreover, for each $\nu _{0}\in \mathcal{E}$ with $\nu _{0}\preceq \mathcal{P} \nu _{0}$, the Picard sequence $\mathcal{P} ^{n}(\nu _{0})$ for all $n\in \mathbb{N}$ converges to a $\nu _{*} \in \mathrm{Fix}(\mathcal{P} )$.

Corollary 3.7

Let $(\mathcal{E}, d, \mathfrak{R})$ be an RMS and $\mathcal{P} \colon \mathcal{E} \to \mathcal{E}$. Suppose that the following conditions hold:

(I)
$\mathfrak{X}(\mathcal{P},\mathfrak{R}) \neq \emptyset $;
(II)
$\mathfrak{R}$ is $\mathcal{P} $-closed and $\mathcal{P} $-transitive;
(III)
$\mathcal{E}$ is $\mathfrak{R}$-complete;
(IV)
$\mathcal{P} $ is FG-contraction, that is, there exists $\mathcal{G} \in \mathfrak{G}$ such that, for $(\nu,\vartheta ) \in \mathcal{E}$ with $(\nu,\vartheta ) \in \mathfrak{R}$,
$$\begin{aligned} \mathcal{F} \bigl(d(\mathcal{P} \nu,\mathcal{P} \vartheta ) \bigr) \leq \mathcal{F} \bigl(\mathfrak{N}(\nu, \vartheta ) \bigr)+ \mathcal{G} \bigl(\beta \bigl( \mathfrak{N}(\nu, \vartheta ) \bigr) \bigr), \end{aligned}$$
(29)
where $\mathfrak{N}(\nu, \vartheta )$ is defined in (3);
(V)
$\mathcal{P} $ is $\mathfrak{R}$-continuous, or
(V’)
$\mathfrak{R}$ is d-self-closed.

Then there exists a point $\nu _{*} \in \mathrm{Fix}(\mathcal{P})$.

4 Illustrations

Example 4.1

Let $\mathcal{E} =[0,8)$ be equipped with usual metric d. Consider the binary relation on $\mathcal{E}$ as follows:

$$\begin{aligned} \mathfrak{R}= \bigl\{ (0,1), (1,3), (2,1), (2,2), (2,5), (3,1), (3,2), (3,3), (3,5), (5,1), (5,2), (5,5) \bigr\} . \end{aligned}$$

Define a mapping $\mathcal{P}: \mathcal{E} \rightarrow \mathcal{E}$ by

$$\begin{aligned} \mathcal{P}\nu =\textstyle\begin{cases} 1, &0\leq \nu < 1; \\ 3,& \nu =1; \\ 5,& 1< \nu < 8. \end{cases}\displaystyle \end{aligned}$$

Then $\mathcal{P}$ is not continuous while $\mathcal{P}$ is $\mathfrak{R}$-continuous, $\mathfrak{R}$ is $\mathcal{P}$-closed, and $\mathcal{P}$-transitive; $\mathcal{E}$ is $\mathfrak{R}$-complete. Also $\mathfrak{R}^{*}=\{(0,1), (1,3), (5,1)\}$ and $\mathfrak{X}(\mathcal{P}; \mathfrak{R})\neq \emptyset $ as $(5, \mathcal{P}5)=(5,5)\in \mathfrak{R}$.

Now we take $F(t)= - \frac{1}{\sqrt{ t}}$, $G(t)=\ln t$ ($t >0$) and $\beta (t)=\lambda \in (0,1)$, $\tau =-\ln \lambda >0$, then (2) converted to

$$\begin{aligned} \frac{1}{2} d(\nu,\mathcal{P} \nu ) \leq d(\nu, \vartheta ) \quad\text{implies } \\ d(\mathcal{P} \nu,\mathcal{P} \vartheta )\leq \frac{\mathfrak{N}(\nu, \vartheta )}{(1+ \tau \sqrt{\mathfrak{N}(\nu, \vartheta )})^{2}}, \end{aligned}$$

(30)

where $\mathfrak{N}(\nu, \vartheta )$ given in (3).

Consider $(\nu, \vartheta )=(5,1) \in \mathfrak{R}^{*}$ with $\frac{1}{2}d(\nu, \mathcal{P}\nu )=0< 4=d(\nu, \vartheta )$. Then $d(\mathcal{P}\nu,\mathcal{P}\vartheta ) = 2$ and $\mathfrak{N}(\nu, \vartheta )= 4$. Therefore, condition (30) reduces to $2 \leq \frac{4}{(1+ \tau \sqrt{4})^{2}}$, which is true for $\tau =0.1$. Similarly, we can check for $(\nu, \vartheta )=(1,3) \in \mathfrak{R}^{*}$. Thus all the conditions of Theorem 3.3 are satisfied, hence $\mathcal{P}$ has a fixed point. Moreover, ${\mathfrak{R}}|_{\mathcal{P}({\mathcal{E})}}$ is transitive, while ${\mathfrak{R}}$ is not, and for all $\nu, \vartheta \in \mathcal{P}(\mathcal{E})$, we have $(\nu, \vartheta )\in \mathfrak{R}$, so $\mathfrak{P}(\nu,\vartheta,\mathfrak{R}) |_{\mathcal{P}( \mathcal{E})})$ is nonempty for all $\nu, \vartheta \in \mathcal{P}(\mathcal{E})$. Following Theorem 3.4, $\mathcal{P}$ has a unique fixed point which is $\nu ^{*}=5$.

Now, for $(0,1)\in \mathfrak{R}$,

$$\begin{aligned} d(\mathcal{P}\nu,\mathcal{P}\vartheta ) = 2 \nleq 2k = k \max \biggl\{ d(\nu, \vartheta ),d(\nu,\mathcal{P}\nu ), d(\vartheta, \mathcal{P}\vartheta ), \frac{1}{2} \bigl[d(\nu,\mathcal{P}\vartheta )+d( \vartheta,\mathcal{P}\nu ) \bigr] \biggr\} , \end{aligned}$$

which is not true for any $k \in (0,1)$, and hence $\mathcal{P}$ is not an implicit type mapping on $(\mathcal{E},d,\mathfrak{R})$. Hence [10, Theorem 1 and Theorem 2] cannot be applied to the present example.

Also, as $1, 0\in \mathcal{E}$, $(1,0) \notin \mathfrak{R}$ with $\mathcal{P}1=3 \neq 1=\mathcal{P}0$ such that $\frac{1}{2}d(1, \mathcal{P}1)=d(1,0)$ but $d(\mathcal{P}1, \mathcal{P}0) \nleq k\ d(1,0)$ and

$$\begin{aligned} d(\mathcal{P}\nu,\mathcal{P}\vartheta ) = 2 \nleq 2k = k \max \biggl\{ d(\nu, \vartheta ),d(\nu,\mathcal{P}\nu ), d(\vartheta, \mathcal{P}\vartheta ), \frac{1}{2} \bigl[d(\nu,\mathcal{P}\vartheta )+d( \vartheta,\mathcal{P}\nu ) \bigr] \biggr\} , \end{aligned}$$

which shows that $\mathcal{P}$ is neither Suzuki-contraction nor generalized Suzuki-contraction for any $k\in [0,1)$. Hence the results of Suzuki [19] and Popescu [20] cannot be applied to the present example, while our Theorem 3.3 and Theorem 3.4 are applicable. This shows that our results are genuine improvements over the corresponding results contained in Suzuki [19], Popescu [20], and Ahmadullah et al. [10, Theorem 1 and Theorem 2].

Example 4.2

Consider the set $\mathcal{E} = [0,1]$ with the usual metric d. Define a binary relation $\mathfrak{R}$ by

$$\begin{aligned} \mathfrak{R} = \biggl\{ (0,0), (0,1) \biggl(\frac{1}{5},1 \biggr), \biggl( \frac{1}{5},0 \biggr), \biggl(0, \frac{1}{5} \biggr), \biggl( \frac{1}{5}, \frac{1}{5} \biggr) \biggr\} . \end{aligned}$$

Consider the self-mapping $\mathcal{P}$ on $\mathcal{E}$ given by

$$\begin{aligned} \mathcal{P}(\vartheta )=\textstyle\begin{cases} 0, & 0\leq \vartheta \leq \frac{1}{5}, \\ \frac{1}{5}, & \frac{1}{5} < \vartheta \leq 1. \end{cases}\displaystyle \end{aligned}$$

It is clear that $\mathcal{E}$ is $\mathfrak{R}$ is complete and $\mathfrak{R}$ is $\mathcal{P}$-closed. Also ${\mathfrak{R}}^{*}=\{(0,1), (\frac{1}{5},1)\}$ and $\mathfrak{X}(\mathcal{P}; \mathfrak{R})\neq \emptyset $ as $(0, \mathcal{P}0)=(0,0)\in \mathfrak{R}$.

We consider (30) of previous Example 4.1 to verify $\mathcal{P} \in (\mathrm{SFG})_{\mathfrak{R}}$.

Let $(\vartheta,\nu ) = (0,1)$ with $\frac{1}{2}d(\nu, \mathcal{P}\nu )=0< 1 = d(\nu, \vartheta )$. Then $d(\mathcal{P} \vartheta, \mathcal{P} \nu ) = \frac{1}{5}$ and $\mathfrak{N}(\nu, \vartheta )= 1$. Therefore, condition (30) reduces to $1/5 \leq \frac{1}{(1+ \tau )^{2}}$.
Let $(\vartheta,\nu ) = (\frac{1}{5},1)$ with $\frac{1}{2}d(\nu, \mathcal{P}\nu )=0< \frac{4}{5} =d(\nu, \vartheta )$. Then $d(\mathcal{P} \vartheta, \mathcal{P} \nu ) = \frac{1}{5}$ and $\mathfrak{N}(\nu, \vartheta )= 4/5$. Therefore, condition (30) reduces to $1/5 \leq \frac{4/5}{(1+ \tau \sqrt{4/5})^{2}}$.

It can be easily checked that the above cases hold true for $\tau >0$ (in particular $\tau =0.1$). Thus $\mathcal{P} \in (\mathrm{SFG})_{\mathfrak{R}}$.

Let $(\nu _{n})$ be an $\mathfrak{R}$-preserving sequence converging to ν as $n \rightarrow \infty $. Then we must have

$$\begin{aligned} (\nu _{n}, \nu _{n+1})\in \biggl\{ (0,0), \biggl( \frac{1}{5},0 \biggr), \biggl(0, \frac{1}{5} \biggr), \biggl( \frac{1}{5}, \frac{1}{5} \biggr) \biggr\} \end{aligned}$$

implies that

$$\begin{aligned} \nu _{n}\in \biggl\{ 0, \frac{1}{5} \biggr\} . \end{aligned}$$

This implies that either $\nu _{n}\rightarrow 0$ or $\nu _{n} \rightarrow \frac{1}{5}$ as $n \rightarrow \infty $, and clearly we have $[\nu _{n}, \nu ]\in \mathfrak{R}$ for all $n \in \mathbb{N}$, where $\nu =0$ and $\frac{1}{5}$. This shows that $\mathfrak{R}$ is d-self-closed. Thus all the conditions of Theorem 3.3 are satisfied, hence $\mathcal{P}$ has a fixed point ($\vartheta ^{*} = 1/5$).

5 Application to nonlinear matrix equations

For a matrix $\mathcal{B}\in \mathcal{H}(n)$, we will denote by $s(\mathcal{B})$ any of its singular values and by $s^{+}(\mathcal{B})$ the sum of all of its singular values, that is, the trace norm $\Vert \mathcal{B}\Vert _{\mathrm{tr}} = s^{+}(\mathcal{B})$. For $\mathcal{C},\mathcal{D}\in \mathcal{H}(n)$, $\mathcal{C}\succeq \mathcal{D}$ (resp. $\mathcal{C}\succ \mathcal{D}$) will mean that the matrix $\mathcal{C}-\mathcal{D}$ is positive semi-definite (resp. positive definite).

The following lemmas are needed in the subsequent discussion.

Lemma 5.1

([1])

If $\mathcal{A} \succeq {O}$ and $\mathcal{B}\succeq {O}$ are $n\times n$ matrices, then

$$\begin{aligned} 0\leq \operatorname{tr}(\mathcal{AB})\leq \Vert \mathcal{A} \Vert \operatorname{tr}(\mathcal{B}). \end{aligned}$$

Lemma 5.2

([1])

If $\mathcal{A}\in \mathcal{H}(n)$ such that $\mathcal{A} \prec I_{n}$, then $\Vert \mathcal{A}\Vert <1$.

We establish the existence and uniqueness of the solution of the nonlinear matrix equation (NME)

$$\begin{aligned} \mathcal{X} = \mathcal{Q} + \sum_{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G}(\mathcal{X}) \mathcal{A}_{i}, \end{aligned}$$

(31)

where $\mathcal{Q}$ is a Hermitian positive definite matrix, $\mathcal{A}_{i}^{*}$ stands for the conjugate transpose of an $n\times n$ matrix $\mathcal{A}_{i}$, and $\mathcal{G}$ is an order-preserving continuous mapping from the set of all Hermitian matrices to the set of all positive definite matrices such that $\mathcal{G}(O)=O$.

Theorem 5.3

Consider NME (31). Assume that there exists a positive real number η such that

$(H_{1})$:

There exists $\mathcal{Q} \in \mathcal{P}(n)$ such that $\sum_{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G} ( \mathcal{Q})\mathcal{A}_{i}\succ 0$;

$(H_{2})$:

$\sum_{i=1}^{m}\mathcal{A}_{i}\mathcal{A}_{i}^{*}\prec \eta I_{n}$.

$(H_{3})$:

For every $\mathcal{X}, \mathcal{Y}\in \mathcal{P}(n)$ such that $\mathcal{X}\preceq \mathcal{Y}$ with

$$\begin{aligned} \sum_{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G}(\mathcal{X}) \mathcal{A}_{i} \neq \sum _{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G}( \mathcal{Y})\mathcal{A}_{i} \end{aligned}$$

and if

$$\begin{aligned} \Biggl\vert s^{+} \Biggl(\mathcal{X}- \mathcal{Q} - \sum _{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G}(\mathcal{X})\mathcal{A}_{i} \Biggr) \Biggr\vert < 2 \bigl\vert s^{+}(\mathcal{X}-\mathcal{Y}) \bigr\vert \end{aligned}$$

holds, then for $\tau > 0$ we have

$$\begin{aligned} & \bigl\vert s^{+} \bigl(\mathcal{G}(\mathcal{X})-\mathcal{G}( \mathcal{Y}) \bigr) \bigr\vert \\ &\quad\leq \frac{1}{\eta} \times \max \begin{Bmatrix} \frac{ \vert s^{+}(\mathcal{X}-\mathcal{Y}) \vert }{[1+ \tau \vert s^{+}(\mathcal{X}-\mathcal{Y}) \vert ^{1/2}]^{2}}, \frac{ \vert s^{+} (\mathcal{X}-\mathcal{Q} - \sum_{i=1}^{m}\mathcal{A}_{i}^{*}\mathcal{G}(\mathcal{X})\mathcal{A}_{i} ) \vert }{[1+\tau \vert s^{+} (\mathcal{X}-\mathcal{Q} - \sum_{i=1}^{m}\mathcal{A}_{i}^{*}\mathcal{G}(\mathcal{X})\mathcal{A}_{i} ) \vert ^{1/2}]^{2}}, \\ \frac{ \vert s^{+} (\mathcal{Y}-\mathcal{Q} - \sum_{i=1}^{m}\mathcal{A}_{i}^{*}\mathcal{G}(\mathcal{Y})\mathcal{A}_{i} ) \vert }{[1+\tau \vert s^{+} (\mathcal{Y}-\mathcal{Q} - \sum_{i=1}^{m}\mathcal{A}_{i}^{*}\mathcal{G}(\mathcal{Y})\mathcal{A}_{i} ) \vert ^{1/2}]^{2}}, \\ \frac{ \vert s^{+} (\mathcal{X} - \mathcal{Q} - \sum_{i=1}^{m}\mathcal{A}_{i}^{*}\mathcal{G}(\mathcal{Y})\mathcal{A}_{i} ) \vert }{[1+\tau \vert s^{+} (\mathcal{X} - \mathcal{Q} - \sum_{i=1}^{m}\mathcal{A}_{i}^{*}\mathcal{G}(\mathcal{Y})\mathcal{A}_{i} ) \vert ^{1/2}]^{2}}, \\ \frac{ \vert s^{+} (\mathcal{Y}- \mathcal{Q} - \sum_{i=1}^{m}\mathcal{A}_{i}^{*}\mathcal{G}(\mathcal{X})\mathcal{A}_{i} ) \vert }{[1+\tau \vert s^{+} (\mathcal{Y}- \mathcal{Q} - \sum_{i=1}^{m}\mathcal{A}_{i}^{*}\mathcal{G}(\mathcal{X})\mathcal{A}_{i} ) \vert ^{1/2}]^{2}}. \end{Bmatrix} \end{aligned}$$

Then NME (31) has a unique solution. Moreover, the iteration

$$\begin{aligned} \mathcal{X}_{n}=\mathcal{Q}+\sum _{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G}(\mathcal{X}_{n-1}) \mathcal{A}_{i}, \end{aligned}$$

(32)

where $\mathcal{X}_{0}\in \mathcal{P}(n)$ satisfies

$$\begin{aligned} \mathcal{X}_{0}\preceq \mathcal{Q}+\sum_{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G}(\mathcal{X}_{0}) \mathcal{A}_{i}, \end{aligned}$$

converges in the sense of trace norm $\Vert\cdot\Vert _{\mathrm{tr}}$ to the solution of matrix equation (31).

Proof

Define a mapping $\mathcal{T}:\mathcal{P}(n)\rightarrow \mathcal{P}(n)$ by

$$\begin{aligned} \mathcal{T}(\mathcal{X})= \mathcal{Q} + \sum_{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G}(\mathcal{X}) \mathcal{A}_{i}\quad\text{for all } \mathcal{X} \in \mathcal{P}(n), \end{aligned}$$

and a binary relation

$$\begin{aligned} \mathfrak{R}= \bigl\{ (\mathcal{X}, \mathcal{Y})\in \mathcal{P}(n)\times \mathcal{P}(n): \mathcal{X} \preceq \mathcal{Y} \bigr\} . \end{aligned}$$

Then a fixed point of the mapping $\mathcal{T}$ is a solution of matrix equation (31). Notice that $\mathcal{T}$ is well defined, $\mathfrak{R}$-continuous, and $\mathfrak{R}$ is $\mathcal{T}$-closed. Since

$$\begin{aligned} \sum_{i=1}^{m} \mathcal{A}_{i}^{*} \mathcal{G}(\mathcal{Q}) \mathcal{A}_{i}\succ 0, \end{aligned}$$

for some $\mathcal{Q} \in \mathcal{P}(n)$, we have $(\mathcal{Q}, \mathcal{T}(\mathcal{Q}))\in \mathfrak{R}$, and hence $\mathcal{P}(n)(\mathcal{T};\mathfrak{R})\neq \emptyset $.

Now, let $(\mathcal{X}, \mathcal{Y})\in \mathfrak{R}^{*}=\{(\mathcal{X}, \mathcal{Y})\in \mathfrak{R}:\mathcal{T}(\mathcal{X})\neq \mathcal{T}( \mathcal{Y})\}$ such that

$$\begin{aligned} \frac{1}{2} \bigl\Vert \mathcal{X}-\mathcal{T}(\mathcal{X}) \bigr\Vert _{\mathrm{tr}}< \Vert \mathcal{X} - \mathcal{Y} \Vert _{\mathrm{tr}}. \end{aligned}$$

Then

$$\begin{aligned} & \bigl\Vert \mathcal{T}(\mathcal{X})-\mathcal{T}( \mathcal{Y}) \bigr\Vert _{\mathrm{tr}} \\ &\quad=s^{+} \bigl(\mathcal{T}(\mathcal{X})-\mathcal{T}(\mathcal{Y}) \bigr) \\ &\quad= s^{+} \Biggl(\sum_{i=1}^{m} \mathcal{A}_{i}^{*} \bigl(\mathcal{G}(\mathcal{X})- \mathcal{G}(\mathcal{Y}) \bigr)\mathcal{A}_{i} \Biggr) \\ &\quad=\sum_{i=1}^{m} s^{+} \bigl(\mathcal{A}_{i}^{*} \bigl(\mathcal{G}(\mathcal{X})- \mathcal{G}(\mathcal{Y}) \bigr)\mathcal{A}_{i} \bigr) \\ &\quad=\sum_{i=1}^{m} s^{+} \bigl(\mathcal{A}_{i}\mathcal{A}_{i}^{*} \bigl( \mathcal{G}(\mathcal{X})-\mathcal{G}(\mathcal{Y}) \bigr) \bigr) \\ &\quad=s^{+} \Biggl(\sum_{i=1}^{m} \mathcal{A}_{i}\mathcal{A}_{i}^{*} \Biggr) s^{+} \bigl( \mathcal{G}(\mathcal{X})-\mathcal{G}(\mathcal{Y}) \bigr) \\ &\quad\leq \frac{ \Vert \sum_{i=1}^{m}\mathcal{A}_{i}\mathcal{A}_{i}^{*} \Vert }{\eta } \times \max \begin{Bmatrix} \frac{ \Vert \mathcal{X}-\mathcal{Y} \Vert _{\mathrm{tr}}}{[1+\tau \Vert \mathcal{X}-\mathcal{Y} \Vert _{\mathrm{tr}}^{1/2}]^{2}}, \frac{ \Vert \mathcal{X}-\mathcal{T}\mathcal{X} \Vert _{\mathrm{tr}}}{[1+\tau \Vert \mathcal{X}-\mathcal{T}\mathcal{X} \Vert _{\mathrm{tr}}^{1/2}]^{2}}, \frac{ \Vert \mathcal{Y}-\mathcal{T}\mathcal{Y} \Vert _{\mathrm{tr}}}{[1+\tau \Vert \mathcal{Y}-\mathcal{T}\mathcal{Y} \Vert _{\mathrm{tr}}^{1/2}]^{2}}, \\ \frac{ \Vert \mathcal{X}-\mathcal{T}\mathcal{Y} \Vert _{\mathrm{tr}}}{[1 + \tau \Vert \mathcal{X}-\mathcal{T}\mathcal{Y} \Vert _{\mathrm{tr}}^{1/2}]^{2}}, \frac{ \Vert \mathcal{Y}-\mathcal{T}\mathcal{X} \Vert _{\mathrm{tr}}}{[1 + \tau \Vert \mathcal{Y}-\mathcal{T}\mathcal{X} \Vert _{\mathrm{tr}}^{1/2}]^{2}} \end{Bmatrix} \\ &\quad\leq \frac{\Theta (\mathcal{X,Y})}{[1+\tau (\Theta (\mathcal{X,Y}))^{1/2}]^{2}}, \end{aligned}$$

(33)

where

$$\begin{aligned} \Theta (\mathcal{X,Y}) = \max \begin{Bmatrix} \frac{ \Vert \mathcal{X}-\mathcal{Y} \Vert _{\mathrm{tr}}}{[1+\tau \Vert \mathcal{X}-\mathcal{Y} \Vert _{\mathrm{tr}}^{1/2}]^{2}}, \frac{ \Vert \mathcal{X}-\mathcal{T}\mathcal{X} \Vert _{\mathrm{tr}}}{[1+\tau \Vert \mathcal{X}-\mathcal{T}\mathcal{X} \Vert _{\mathrm{tr}}^{1/2}]^{2}}, \frac{ \Vert \mathcal{Y}-\mathcal{T}\mathcal{Y} \Vert _{\mathrm{tr}}}{[1+\tau \Vert \mathcal{Y}-\mathcal{T}\mathcal{Y} \Vert _{\mathrm{tr}}^{1/2}]^{2}}, \\ \frac{ \Vert \mathcal{X}-\mathcal{T}\mathcal{Y} \Vert _{\mathrm{tr}}}{[1 + \tau \Vert \mathcal{X}-\mathcal{T}\mathcal{Y} \Vert _{\mathrm{tr}}^{1/2}]^{2}}, \frac{ \Vert \mathcal{Y}-\mathcal{T}\mathcal{X} \Vert _{\mathrm{tr}}}{[1 + \tau \Vert \mathcal{Y}-\mathcal{T}\mathcal{X} \Vert _{\mathrm{tr}}^{1/2}]^{2}} \end{Bmatrix}. \end{aligned}$$

(34)

Consider $\mathcal{F}(t)= - \frac{1}{\sqrt{ t}}$, $\mathcal{G}(t)=\ln t$ ($t >0$) and $\beta (t)=\lambda \in (0,1)$, $\tau =-\ln \lambda >0$, then (33) converted to

$$\begin{aligned} &\frac{1}{2} \bigl\Vert \mathcal{X}-\mathcal{T}(\mathcal{X}) \bigr\Vert _{\mathrm{tr}}< \Vert \mathcal{X} - \mathcal{Y} \Vert _{\mathrm{tr}}\quad \text{implies } \\ & \mathcal{F} \bigl( \bigl\Vert \mathcal{T}(\mathcal{X})-\mathcal{T}( \mathcal{Y}) \bigr\Vert _{\mathrm{tr}} \bigr) \leq \mathcal{F} \bigl(\Theta ( \mathcal{X,Y}) \bigr) + \mathcal{G} \bigl( \beta \bigl(\Theta (\mathcal{X,Y}) \bigr) \bigr), \end{aligned}$$

where $\Theta (\mathcal{X,Y})$ is given in (34). Thus all the hypotheses of Theorem 3.3 are satisfied, therefore there exists $\hat{\mathcal{X}}\in \mathcal{P}(n)$ such that $\mathcal{T}(\hat{\mathcal{X}})=\hat{\mathcal{X}}$, and hence matrix equation (31) has a solution in $\mathcal{P}(n)$. Furthermore, due to the existence of least upper bound and greatest lower bound for each $\mathcal{X},\mathcal{Y}\in \mathcal{T}(\mathcal{P}(n))$, we have $\mathfrak{P}(\mathcal{X}, \mathcal{Y};\mathfrak{R}|_{\mathcal{T}( \mathcal{P}(n))})\neq \emptyset $ for all $\mathcal{X},\mathcal{Y}\in \mathcal{T}(\mathcal{P}(n))$. Hence, on using Theorem 3.4, $\mathcal{T}$ has a unique fixed point, and hence we conclude that matrix equation (31) has a unique solution in $\mathcal{P}(n)$. □

Example 5.4

Consider NME (31) for $m=3$, $\eta =4.5$, $n=3$ with $\mathcal{G}(\mathcal{X})=\mathcal{X}^{1/5}$, i.e.,

$$\begin{aligned} \mathcal{X}= \mathcal{Q} +\mathcal{A}_{1}^{*} \mathcal{X}^{1/5} \mathcal{A}_{1}+\mathcal{A}_{2}^{*} \mathcal{X}^{1/5} \mathcal{A}_{2}+ \mathcal{A}_{3}^{*} \mathcal{X}^{1/5} \mathcal{A}_{3}, \end{aligned}$$

(35)

where

$$\begin{aligned} &\mathcal{Q}= \begin{bmatrix} 11.699540782825979 & 0.914622941324684 & 1.507188535497828 \\ 0.914622941324684 & 10.833657911203609 & 1.249452950221198 \\ 1.507188535497828 & 1.249452950221198 & 12.080319343374171 \end{bmatrix} ,\\ &\mathcal{A}_{1}= \begin{bmatrix} 0.082250000000000 & 0.110600000000000 & 0.218400000000000 \\ 0.088900000000000 & 0.053900000000000 & 0.223300000000000 \\ 0.228900000000000 & 0.090300000000000 & 0.042700000000000 \end{bmatrix} , \\ &\mathcal{A}_{2}= \begin{bmatrix} 0.028000000000000 & 0.036250000000000 & 0.041250000000000 \\ 0.058750000000000 & 0.039250000000000 & 0.046000000000000 \\ 0.061250000000000 & 0.059750000000000 & 0.039750000000000 \end{bmatrix} , \\ &\mathcal{A}_{3}= \begin{bmatrix} 0.679012345679012 & 1.061728395061728 & 0.333333333333333 \\ 0.567901234567901 & 0.296296296296296 & 0.641975308641975 \\ 1.185185185185185 & 0.444444444444444 & 0.691358024691358 \end{bmatrix} . \end{aligned}$$

The conditions of Theorem 5.3 can be checked numerically, taking various special values for matrices involved. For example, they can be tested (and verified to be true) for

$$\begin{aligned} &\mathcal{X}= \begin{bmatrix} 1.699436061575979 & 0.914189910074684 & 1.507087334247828 \\ 0.914189910074684 & 0.822435604328608 & 1.248590153939948 \\ 1.507087334247828 & 1.248590153939948 & 2.080170705685109 \end{bmatrix} ,\\ &\mathcal{Y}= \begin{bmatrix} 10.000104721250000 & 0.000433031250000 & 0.000101201250000 \\ 0.000433031250000 & 10.011222306875000 & 0.000862796281250 \\ 0.000101201250000 & 0.000862796281250 & 10.000148637689062 \end{bmatrix} . \end{aligned}$$

To see the convergence of the sequence $\{\mathcal{X}_{n}\}$ defined in (32), we start with three different initial values

$$\begin{aligned} \mathcal{U}_{0}= \begin{bmatrix} 0.015970559290683 & 0.014219828729812 & 0.004760641350592 \\ 0.014219828729812 & 0.045823355744100 & 0.011986278815522 \\ 0.004760641350592 & 0.011986278815522 & 0.014342909184651 \end{bmatrix} \end{aligned}$$

with $\Vert \mathcal{U}_{0} \Vert =0.076136824219434$,

$$\mathcal{V}_{0}= \begin{bmatrix} 1 & 0 & 0 \cr 0 & 1 & 0 \cr 0 & 0 & 1 \end{bmatrix} $$

with $\Vert \mathcal{V}_{0} \Vert =1$,

$$\mathcal{W}_{0}= \begin{bmatrix} 64.303848221681193 & 14.585212879712167 & 16.765822087028965 \\ 14.585212879712167 & 54.844490660932415 & 11.815345676105265 \\ 16.765822087028969 & 11.815345676105263 & 57.346307431417692 \end{bmatrix} $$

with $\Vert \mathcal{W}_{0} \Vert =1.764946463140313\times 10^{2}$.

After 10 iterations, we have the following approximation of the unique positive definite solution of system (31):

$$\begin{aligned} &\widehat{\mathcal{U}}\approx \mathcal{U}_{10}= \begin{bmatrix} 15.825962055386070 & 3.646303219900028 & 4.191455521733169 \\ 3.646303219900028 & 13.461122665210109 & 2.953836419006667 \\ 4.191455521733170 & 2.953836419006668 & 14.086576857837056 \end{bmatrix} \\ &\widehat{\mathcal{V}}\approx \mathcal{V}_{10}= \begin{bmatrix} 15.825962055420298 & 3.646303219928042 & 4.191455521757241 \\ 3.646303219928042 & 13.461122665233104 & 2.953836419026316 \\ 4.191455521757242 & 2.953836419026316 & 14.086576857854423 \end{bmatrix} \\ &\widehat{ \mathcal{W}}\approx \mathcal{W}_{10}= \begin{bmatrix} 15.825962055444425 & 3.646303219947785 & 4.191455521774207 \\ 3.646303219947785 & 13.461122665249309 & 2.953836419040164 \\ 4.191455521774207 & 2.953836419040163 & 14.086576857866664 \end{bmatrix} \end{aligned}$$

Also, the elements of each sequence are order preserving. The graphical representation of convergence of a sequence and a surface plot of solution are shown in Figs. 1 and 2, respectively.

Availability of data and materials

Not applicable.

References

Ran, A.C.M., Reurings, M.C.B.: On the matrix equation $X + A^{*}F(X)A = Q$: solutions and perturbation theory. Linear Algebra Appl. 346, 15–26 (2002)
Article MathSciNet Google Scholar
Ran, A.C.M., Reurings, M.C.B.: A fixed point theorem in partially ordered sets and some applications to matrix equations. Proc. Am. Math. Soc. 132, 1435–1443 (2004)
Article MathSciNet Google Scholar
Turinici, M.: Abstract comparison principles and multivariable Gronwall–Bellman inequalities. J. Math. Anal. Appl. 117, 100–127 (1986)
Article MathSciNet Google Scholar
Turinici, M.: Fixed points for monotone iteratively local contractions. Demonstr. Math. 19, 171–180 (1986)
MathSciNet MATH Google Scholar
Matkowski, J.: Integrable solutions of functional equations. Diss. Math. 127, 1–68 (1975)
MathSciNet MATH Google Scholar
Matkowski, J.: Fixed point theorems for mappings with a contractive iterate at a point. Proc. Am. Math. Soc. 62, 344–348 (1977)
Article MathSciNet Google Scholar
Nieto, J.J., López, R.R.: Contractive mapping theorems in partially ordered sets and applications to ordinary differential equations. Order 22, 223–239 (2005)
Article MathSciNet Google Scholar
Nieto, J.J., López, R.R.: Fixed point theorems in ordered abstract spaces. Proc. Am. Math. Soc. 135, 2505–2517 (2007)
Article MathSciNet Google Scholar
Samet, B., Turinici, M.: Fixed point theorems on a metric space endowed with an arbitrary binary relation and applications. Commun. Math. Anal. 13, 82–97 (2012)
MathSciNet MATH Google Scholar
Ahmadullah, M., Ali, J., Imdad, M.: Unified relation-theoretic metrical fixed point theorems under an implicit contractive condition with an application. Fixed Point Theory Appl. 2016, 42 (2016)
Article MathSciNet Google Scholar
Ahmadullah, M., Imdad, M.: Unified relation-theoretic fixed point results via $F_{R}$-Suzuki-contractions with an application. Fixed Point Theory 21(1), 19–34 (2020)
Article MathSciNet Google Scholar
Ahmadullah, M., Imdad, M., Arif, M.: Relation-theoretic metrical coincidence and common fixed point theorems under nonlinear contractions. Appl. Gen. Topol. 19(1), 65–84 (2018)
Article MathSciNet Google Scholar
Alam, A., Imdad, M.: Relation-theoretic contraction principle. J. Fixed Point Theory Appl. 17(4), 693–702 (2015)
Article MathSciNet Google Scholar
Hasanuzzaman, M., Imdad, M.: Relation theoretic metrical fixed point results for Suzuki type $\mathcal{Z}_{R}$-contraction with an application. AIMS Math. 5(3), 2071–2087 (2019)
Article Google Scholar
Kolman, B., Busby, R.C., Ross, S.: Discrete Mathematical Structures, 3rd edn. PHI Pvt. Ltd., New Delhi (2000)
Google Scholar
Lipschutz, S.: Schaum’s Outlines of Theory and Problems of Set Theory and Related Topics. McGraw-Hill, New York (1964)
MATH Google Scholar
Maddux, R.D.: Relation Algebras. Studies in Logic and Foundations of Mathematics, vol. 150. Elsevier, Amsterdam (2006)
MATH Google Scholar
Parvaneh, V., Hussain, N., Kadelburg, Z.: Generalized Wardowski type fixed point theorems via α-admissible FG-contractions in b-metric spaces. Acta Math. Sci. 36(5), 1445–1456 (2016)
Article MathSciNet Google Scholar
Suzuki, T.: A generalized Banach contraction principle which characterizes metric completeness. Proc. Am. Math. Soc. 136, 1861–1869 (2008)
Article Google Scholar
Popescu, O.: Two fixed point theorems for generalized contractions with constants in complete metric space cent. Eur. J. Math. 7(3), 529–538 (2009)
MathSciNet MATH Google Scholar

Download references

Acknowledgements

The first author is thankful to the Science and Engineering Research Board, India, for providing funds under the project—CRG/2018/000615. We thank the editor for his kind support. We are also grateful to the learned referee for useful suggestions which helped us to improve the text.

Funding

Not applicable.

Author information

Authors and Affiliations

Department of Mathematics, School of Advanced Sciences, Vellore Institute of Technology, Vellore, 632014, TN, India
Hemant Kumar Nashine
Department of Mathematics and Applied Mathematics, University of Johannesburg, Kingsway Campus, Auckland Park 2006, Johannesburg, South Africa
Hemant Kumar Nashine
Mathematics Division, SASL, VIT Bhopal University, Madhya Pradesh, 466114, India
Reena Jain
Department of Mathematics, Gilan-E-Gharb Branch, Islamic Azad University, Gilan-E-Gharb, Iran
Vahid Parvaneh

Authors

Hemant Kumar Nashine
View author publications
You can also search for this author in PubMed Google Scholar
Reena Jain
View author publications
You can also search for this author in PubMed Google Scholar
Vahid Parvaneh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Investigation, HKN, RJ, and VP; Methodology, RJ and HKN; Supervision, HKN and VP; Writing–original draft, RJ and HKN; Writing–review and editing, HKN and VP; and Software, RJ and HKN; All authors contributed equally and significantly in writing this article. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Vahid Parvaneh.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nashine, H.K., Jain, R. & Parvaneh, V. A relational-theoretic approach to get solution of nonlinear matrix equations. J Inequal Appl 2022, 79 (2022). https://doi.org/10.1186/s13660-022-02817-w

Download citation

Received: 05 April 2021
Accepted: 01 June 2022
Published: 13 June 2022
DOI: https://doi.org/10.1186/s13660-022-02817-w

A relational-theoretic approach to get solution of nonlinear matrix equations

Abstract

1 Introduction

Theorem 1.1

2 Preliminaries

3 Results on Suzuki-FG contractive mappings

Definition 3.1

Definition 3.2

Theorem 3.3

Proof

Theorem 3.4

Proof

Theorem 3.5

Proof

Corollary 3.6

Corollary 3.7

4 Illustrations

Example 4.1

Example 4.2

5 Application to nonlinear matrix equations

Lemma 5.1

Lemma 5.2

Theorem 5.3

Proof

Example 5.4

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Rights and permissions

About this article

Cite this article

MSC

Keywords

A relational-theoretic approach to get solution of nonlinear matrix equations

Abstract

1 Introduction

Theorem 1.1

2 Preliminaries

3 Results on Suzuki-FG contractive mappings

Definition 3.1

Definition 3.2

Theorem 3.3

Proof

Theorem 3.4

Proof

Theorem 3.5

Proof

Corollary 3.6

Corollary 3.7

4 Illustrations

Example 4.1

Example 4.2

5 Application to nonlinear matrix equations

Lemma 5.1

Lemma 5.2

Theorem 5.3

Proof

Example 5.4

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Rights and permissions

About this article

Cite this article

Share this article

MSC

Keywords