Open Access

Sum of squared logarithms - an inequality relating positive definite matrices and their matrix logarithm

Journal of Inequalities and Applications20132013:168

DOI: 10.1186/1029-242X-2013-168

Received: 21 January 2013

Accepted: 28 March 2013

Published: 12 April 2013

Abstract

Let y 1 , y 2 , y 3 , a 1 , a 2 , a 3 ( 0 , ) be such that y 1 y 2 y 3 = a 1 a 2 a 3 and

y 1 + y 2 + y 3 a 1 + a 2 + a 3 , y 1 y 2 + y 2 y 3 + y 1 y 3 a 1 a 2 + a 2 a 3 + a 1 a 3 .

Then

( log y 1 ) 2 + ( log y 2 ) 2 + ( log y 3 ) 2 ( log a 1 ) 2 + ( log a 2 ) 2 + ( log a 3 ) 2 .

This can also be stated in terms of real positive definite 3 × 3 -matrices P 1 , P 2 : If their determinants are equal, det P 1 = det P 2 , then

tr P 1 tr P 2 and tr Cof P 1 tr Cof P 2 log P 1 F 2 log P 2 F 2 ,

where log is the principal matrix logarithm and P F 2 = i , j = 1 3 P i j 2 denotes the Frobenius matrix norm. Applications in matrix analysis and nonlinear elasticity are indicated.

MSC:26D05, 26D07.

Keywords

matrix logarithm elementary symmetric polynomials inequality characteristic polynomial positive definite matrices means

1 Introduction

Convexity is a powerful source for obtaining new inequalities; see, e.g., [1, 2]. In applications coming from nonlinear elasticity, we are faced, however, with variants of the squared logarithm function; see the last section. The function ( log ( x ) ) 2 is neither convex nor concave. Nevertheless, the sum of squared logarithms inequality holds. We will proceed as follows: In the first section, we will give several equivalent formulations of the inequality, for example, in terms of the coefficients of the characteristic polynomial (Theorem 1), in terms of elementary symmetric polynomials (Theorem 3), in terms of means (Theorem 5) or in terms of the Frobenius matrix norm (Theorem 7). A proof of the inequality will be given in Section 2, and some counterexamples for slightly changed variants of the inequality are discussed in Section 3. In the last section, an application of the sum of squared logarithms inequality in matrix analysis and in the mathematical theory of nonlinear elasticity is indicated.

2 Formulations of the problem

All theorems in this section are equivalent.

Theorem 1 For n = 2 or n = 3 let P 1 , P 2 R n × n be positive definite real matrices. Let the coefficients of the characteristic polynomials of P 1 and P 2 satisfy
tr P 1 tr P 2 and tr Cof P 1 tr Cof P 2 and det P 1 = det P 2 .
Then
log P 1 F 2 log P 2 F 2 .

For n = 3 , we will now give equivalent formulations of this statement. The case n = 2 can be treated analogously. For its proof, see Remark 15. By orthogonal diagonalization of P 1 and P 2 , the inequalities can be rewritten in terms of the eigenvalues y 1 , y 2 , y 3 and a 1 , a 2 , a 3 , respectively.

Theorem 2 Let the real numbers a 1 , a 2 , a 3 > 0 and y 1 , y 2 , y 3 > 0 be such that
y 1 + y 2 + y 3 a 1 + a 2 + a 3 , y 1 y 2 + y 2 y 3 + y 1 y 3 a 1 a 2 + a 2 a 3 + a 1 a 3 , y 1 y 2 y 3 = a 1 a 2 a 3 .
(1)
Then
( log y 1 ) 2 + ( log y 2 ) 2 + ( log y 3 ) 2 ( log a 1 ) 2 + ( log a 2 ) 2 + ( log a 3 ) 2 .
(2)
The elementary symmetric polynomials, see, e.g., [[3], p.178]
e 0 ( y 1 , y 2 , y 3 ) = 1 , e 1 ( y 1 , y 2 , y 3 ) = y 1 + y 2 + y 3 , e 2 ( y 1 , y 2 , y 3 ) = y 1 y 2 + y 1 y 3 + y 2 y 3 , e 3 ( y 1 , y 2 , y 3 ) = y 1 y 2 y 3

are known to have the Schur-concavity property (i.e., e k is Schur-convex) [1, 4]; see (16). It is possible to express the problem in terms of these elementary symmetric polynomials as follows.

Theorem 3 Let a 1 , a 2 , a 3 > 0 and y 1 , y 2 , y 3 > 0 satisfy
e 1 ( y 1 , y 2 , y 3 ) e 1 ( a 1 , a 2 , a 3 ) , e 2 ( y 1 , y 2 , y 3 ) e 2 ( a 1 , a 2 , a 3 ) , e 3 ( y 1 , y 2 , y 3 ) = e 3 ( a 1 , a 2 , a 3 ) .
Then
e 1 ( ( log y 1 ) 2 , ( log y 2 ) 2 , ( log y 3 ) 2 ) e 1 ( ( log a 1 ) 2 , ( log a 2 ) 2 , ( log a 3 ) 2 ) .
Because y 1 y 2 y 3 = a 1 a 2 a 3 > 0 , we have
y 1 y 2 + y 2 y 3 + y 1 y 3 a 1 a 2 + a 2 a 3 + a 1 a 3 1 y 1 + 1 y 2 + 1 y 3 1 a 1 + 1 a 2 + 1 a 3 .

Thus, we obtain the following theorem.

Theorem 4 Let the real numbers a 1 , a 2 , a 3 > 0 and y 1 , y 2 , y 3 > 0 be such that
y 1 + y 2 + y 3 a 1 + a 2 + a 3 , 1 y 1 + 1 y 2 + 1 y 3 1 a 1 + 1 a 2 + 1 a 3 , y 1 y 2 y 3 = a 1 a 2 a 3 .
(3)
Then
( log y 1 ) 2 + ( log y 2 ) 2 + ( log y 3 ) 2 ( log a 1 ) 2 + ( log a 2 ) 2 + ( log a 3 ) 2 .
(4)
The conditions (3) are also simple expressions in terms of arithmetic, harmonic and geometric and quadratic mean
A ( y 1 , y 2 , y 3 ) = y 1 + y 2 + y 3 3 , H ( y 1 , y 2 , y 3 ) = 3 1 y 1 + 1 y 2 + 1 y 3 , G ( y 1 , y 2 , y 3 ) = y 1 y 2 y 3 3 , Q ( y 1 , y 2 , y 3 ) = 1 3 ( y 1 2 + y 2 2 + y 3 2 ) .
Theorem 5 Let a 1 , a 2 , a 3 > 0 and y 1 , y 2 , y 3 > 0 . Then A ( y 1 , y 2 , y 3 ) A ( a 1 , a 2 , a 3 ) , H ( a 1 , a 2 , a 3 ) H ( y 1 , y 2 , y 3 ) (‘reverse!) and G ( y 1 , y 2 , y 3 ) = G ( a 1 , a 2 , a 3 ) imply
Q ( log y 1 , log y 2 , log y 3 ) Q ( log a 1 , log a 2 , log a 3 ) .
We denote by
a i = : d i 2 , y i = : x i 2

and arrive at

Theorem 6 Let the real numbers d i and x i be such that d 1 , d 2 , d 3 > 0 , x 1 , x 2 , x 3 > 0 and
x 1 2 + x 2 2 + x 3 2 d 1 2 + d 2 2 + d 3 2 , x 1 2 x 2 2 + x 2 2 x 3 2 + x 1 2 x 3 2 d 1 2 d 2 2 + d 2 2 d 3 2 + d 1 2 d 3 2 , x 1 x 2 x 3 = d 1 d 2 d 3 .
(5)
Then
( log x 1 ) 2 + ( log x 2 ) 2 + ( log x 3 ) 2 ( log d 1 ) 2 + ( log d 2 ) 2 + ( log d 3 ) 2 .
(6)

If we again view x i and d i as eigenvalues of positive definite matrices, an equivalent formulation of the problem can be given in terms of their Frobenius matrix norms:

Theorem 7 For n { 2 , 3 } , let P 1 , P 2 R n × n be positive definite real matrices. Let
P 1 F 2 P 2 F 2 and P 1 1 F 2 P 2 1 F 2 and det P 1 = det P 2 .
Then
log P 1 F 2 log P 2 F 2 .
Let us reconsider the formulation from Theorem 5. If we denote
c i : = log a i , z i : = log y i ,
from H ( a 1 , a 2 , a 3 ) H ( y 1 , y 2 , y 3 ) , we obtain
e z 1 + e z 2 + e z 3 e c 1 + e c 2 + e c 3 .
Theorem 8 Let the real numbers c 1 , c 2 , c 3 and z 1 , z 2 , z 3 be such that
e z 1 + e z 2 + e z 3 e c 1 + e c 2 + e c 3 , e z 1 + e z 2 + e z 3 e c 1 + e c 2 + e c 3 , z 1 + z 2 + z 3 = c 1 + c 2 + c 3 .
(7)
Then
z 1 2 + z 2 2 + z 3 2 c 1 2 + c 2 2 + c 3 2 .
(8)
In order to prove Theorem 8, one can assume without loss of generality that
z 1 + z 2 + z 3 = c 1 + c 2 + c 3 = 0 .
(9)

Thus, we have the equivalent formulation

Theorem 9 Let the real numbers c ¯ 1 , c ¯ 2 , c ¯ 3 and z ¯ 1 , z ¯ 2 , z ¯ 3 be such that
e z ¯ 1 + e z ¯ 2 + e z ¯ 3 e c ¯ 1 + e c ¯ 2 + e c ¯ 3 , e z ¯ 1 + e z ¯ 2 + e z ¯ 3 e c ¯ 1 + e c ¯ 2 + e c ¯ 3 , z ¯ 1 + z ¯ 2 + z ¯ 3 = c ¯ 1 + c ¯ 2 + c ¯ 3 = 0 .
(10)
Then
z ¯ 1 2 + z ¯ 2 2 + z ¯ 3 2 c ¯ 1 2 + c ¯ 2 2 + c ¯ 3 2 .
(11)
Let us prove that Theorem 8 can be reformulated as Theorem 9. Indeed, let us assume that Theorem 9 is valid and show that the statement of Theorem 8 also holds true. We denote by s the sum s = z 1 + z 2 + z 3 = c 1 + c 2 + c 3 and we designate
z ¯ i = z i s 3 , c ¯ i = c i s 3 ( i = 1 , 2 , 3 ) .
Then the real numbers z ¯ i and c ¯ i satisfy the hypotheses of Theorem 9 and we obtain z ¯ 1 2 + z ¯ 2 2 + z ¯ 3 2 c ¯ 1 2 + c ¯ 2 2 + c ¯ 3 2 . This inequality is equivalent to
i = 1 3 ( z i s 3 ) 2 i = 1 3 ( c i s 3 ) 2 ,
which, by virtue of the condition (7)3, reduces to
z 1 2 + z 2 2 + z 3 2 c 1 2 + c 2 2 + c 3 2 .

Thus, Theorem 8 is also valid.

By virtue of the logical equivalence
( A B C ) ( ¬ C ¬ A ¬ B )

for any statements A, B, C, we can formulate the inequality (11) (i.e., Theorem 9) in the following equivalent manner.

Theorem 10 Let the real numbers c 1 , c 2 , c 3 and z 1 , z 2 , z 3 be such that
z 1 + z 2 + z 3 = c 1 + c 2 + c 3 = 0 and z 1 2 + z 2 2 + z 3 2 < c 1 2 + c 2 2 + c 3 2 .
(12)
Then one of the following inequalities holds:
e z 1 + e z 2 + e z 3 < e c 1 + e c 2 + e c 3 or e z 1 + e z 2 + e z 3 < e c 1 + e c 2 + e c 3 .
(13)

We use the statement of Theorem 10 for the proof.

Before continuing, let us show that our new inequality is not a consequence of majorization and Karamata’s inequality [5]. Consider z = ( z 1 , , z n ) R + n and c = ( c 1 , , c n ) R + n arranged already in decreasing order z 1 z 2 z n and c 1 c 2 c n . If
i = 1 k z i i = 1 k c i ( 1 k n 1 ) , i = 1 n z i = i = 1 n c i ,
(14)
we say that z majorizes c, denoted by z c . The following result is well known [[6], p.89], [4, 5]. If f : R R is convex, then
z c i = 1 n f ( z i ) i = 1 n f ( c i ) .
(15)
A function g : R n R which satisfies
z c g ( z 1 , , z n ) g ( c 1 , , c n )
(16)
is called Schur-convex. In Theorem 8, the convex function to be considered would be f ( t ) = t 2 . Do conditions (7) (upon rearrangement of z , c R + 3 if necessary) yield already majorization z c ? This is not the case, as we explain now. Let the real numbers z 1 z 2 z 3 and c 1 c 2 c 3 be such that
e z 1 + e z 2 + e z 3 e c 1 + e c 2 + e c 3 , e z 1 + e z 2 + e z 3 e c 1 + e c 2 + e c 3 , z 1 + z 2 + z 3 = c 1 + c 2 + c 3 .
(17)
These conditions do not imply the majorization z c ,
z 1 c 1 , z 1 + z 2 c 1 + c 2 , z 1 + z 2 + z 3 = c 1 + c 2 + c 3 .
(18)

Therefore, our inequality (i.e., z 1 2 + z 2 2 + z 3 2 c 1 2 + c 2 2 + c 3 2 ) does not follow from majorization in disguise.

Indeed, let
z 1 = 1 2 + 0.95 2 3 , z 2 = 1 2 + 0.85 2 3 , z 3 = 1 0.9 3
and
c 1 = 1 2 + 1 2 3 , c 2 = 1 2 + 1 2 3 , c 3 = 1 3 .
Then we have z 1 > z 2 > z 3 and c 1 > c 2 > c 3 , together with
e z 1 + e z 2 + e z 3 = 4.49497 > 3.57137 = e c 1 + e c 2 + e c 3 , e z 1 + e z 2 + e z 3 = 5.50607 > 3.47107 = e c 1 + e c 2 + e c 3 , z 1 + z 2 + z 3 = c 1 + c 2 + c 3 = 0 ,

but the majorization inequalities (18) are not satisfied, since z 1 < c 1 .

3 Proof of the inequality

Of course, we may assume without loss of generality that c 1 c 2 c 3 and z 1 z 2 z 3 (and the same for a i , d i , x i , y i ).

The proof begins with the crucial lemma.

Lemma 11 Let the real numbers a b c and x y z be such that
a + b + c = x + y + z = 0 , a 2 + b 2 + c 2 = x 2 + y 2 + z 2 .
(19)
Then the inequality
e a + e b + e c e x + e y + e z
(20)
is satisfied if and only if the relation
a x
(21)
holds, or equivalently, if and only if
c z
(22)

holds.

Proof Let us denote by r : = 2 3 ( a 2 + b 2 + c 2 ) > 0 . Then, from (19), it follows
b + c = a , b 2 + c 2 = 3 2 r 2 a 2 , y + z = x , y 2 + z 2 = 3 2 r 2 x 2 ,
and we find
b = 1 2 ( a + 3 ( r 2 a 2 ) ) , c = 1 2 ( a 3 ( r 2 a 2 ) ) , y = 1 2 ( x + 3 ( r 2 x 2 ) ) , z = 1 2 ( x 3 ( r 2 x 2 ) ) .
(23)
In view of (19) and a b c , x y z , one can show that
a , x [ r 2 , r ] , b , y [ r 2 , r 2 ] , c , z [ r , r 2 ] .
(24)
Indeed, let us verify the relations (24). We have
r 2 a r 1 6 ( a 2 + b 2 + c 2 ) a 2 2 3 ( a 2 + b 2 + c 2 ) b 2 + c 2 5 a 2 and a 2 2 ( b 2 + c 2 ) b 2 + ( a + b ) 2 5 a 2 and ( b + c ) 2 2 ( b 2 + c 2 ) 4 a 2 2 a b 2 b 2 0 and b 2 + c 2 2 b c 2 ( a b ) ( 2 a + b ) 0 and ( b c ) 2 0 ,
which hold true since a b and 2 a + b a + b + c = 0 . Similarly, we have
r 2 b r 2 b 2 r 2 4 4 b 2 2 3 ( a 2 + b 2 + c 2 ) 5 b 2 a 2 + c 2 5 b 2 a 2 + ( a + b ) 2 2 a 2 + 2 a b 4 b 2 0 2 ( a b ) ( a + 2 b ) 0 ,
which holds true since a b and a + 2 b a + b + c = 0 . Also, we have
r c r 2 r 2 c 2 r 2 4 2 3 ( a 2 + b 2 + c 2 ) c 2 1 6 ( a 2 + b 2 + c 2 ) 2 ( a 2 + b 2 ) c 2 and 5 c 2 a 2 + b 2 2 ( a 2 + b 2 ) ( a + b ) 2 and 5 ( a + b ) 2 a 2 + b 2 ( a b ) 2 0 and 4 a 2 + 10 a b + 4 b 2 0 ( a b ) 2 0 and 2 ( a + 2 b ) ( 2 a + b ) 0 ,

which hold true since a + 2 b a + b + c = 0 and 2 a + b a + b + c = 0 . One can show in the same way that x [ r 2 , r ] , y [ r 2 , r 2 ] , z [ r , r 2 ] , so that (24) has been verified.

We prove now that the inequality (21) holds if and only if (22) holds. Indeed, using (23)2,4 and (24) we get
c z a 3 ( r 2 a 2 ) x 3 ( r 2 x 2 ) a r + 3 ( 1 ( a r ) 2 ) x r + 3 ( 1 ( x r ) 2 ) a x ,

since the function t t + 3 ( 1 t 2 ) is decreasing for t [ 1 2 , 1 ] .

Let us prove next that the inequalities (20) and (21) are equivalent. To accomplish this, we introduce the function f : [ r 2 , r ] R by
f ( x ) = e x + e ( x + 3 ( r 2 x 2 ) ) / 2 + e ( x 3 ( r 2 x 2 ) ) / 2 .
(25)
Taking into account (23) and (24)1, the inequality (20) can be written equivalently as
f ( a ) f ( x ) ,
(26)
which is equivalent to
a x ,
since the function f defined by (25) is monotone increasing on [ r 2 , r ] , as we show next. To this aim, we denote by
cos φ : = x r [ 1 2 , 1 ] , i.e. φ : = arccos ( x r ) [ 0 , π 3 ] .
Then the function (25) can be written as
f ( x ) = h ( r , φ ) , where  h : ( 0 , ) × [ 0 , π 3 ] R , h ( r , φ ) = e r cos φ + e r cos ( φ + 2 π / 3 ) + e r cos ( φ 2 π / 3 ) .
(27)
We have to show that h ( r , φ ) is decreasing with respect to φ [ 0 , π 3 ] . We compute the first derivative
φ h ( r , φ ) = r [ e r cos φ sin φ + e r cos ( φ + 2 π / 3 ) sin ( φ + 2 π 3 ) + e r cos ( φ 2 π / 3 ) sin ( φ 2 π 3 ) ] .
(28)
The function (28) has the same sign as the function
F ( r , φ ) : = 1 r e r cos φ φ h ( r , φ ) ,
(29)
i.e., the function F : ( 0 , ) × [ 0 , π 3 ] R given by
F ( r , φ ) = sin φ e r 3 sin ( φ + π / 3 ) sin ( φ + 2 π 3 ) e r 3 sin ( φ π / 3 ) sin ( φ 2 π 3 ) .
(30)
In order to show that F ( r , φ ) 0 for all ( r , φ ) ( 0 , ) × [ 0 , π 3 ] , we remark that lim r 0 F ( r , φ ) = 0 for fixed φ [ 0 , π 3 ] and we compute
r F ( r , φ ) = 3 [ e r 3 sin ( φ + π / 3 ) sin ( φ + π 3 ) sin ( φ + 2 π 3 ) e r 3 sin ( φ π / 3 ) sin ( φ π 3 ) sin ( φ 2 π 3 ) ] = 3 [ e r 3 sin ( φ + π / 3 ) 1 2 ( cos ( 2 φ + π ) + cos π 3 ) e r 3 sin ( φ π / 3 ) 1 2 ( cos ( 2 φ π ) + cos π 3 ) ] = 3 2 ( cos 2 φ + 1 2 ) [ e r 3 sin ( φ + π / 3 ) e r 3 sin ( φ π / 3 ) ] 0 ,

since φ [ 0 , π 3 ] implies cos 2 φ 1 2 and sin ( φ + π 3 ) sin ( φ π 3 ) .

Consequently, the function F ( r , φ ) is decreasing with respect to r and for any ( r , φ ) ( 0 , ) × [ 0 , π 3 ] we have that
F ( r , φ ) lim r 0 F ( r , φ ) = 0 .
(31)

From (29) and (31), it follows that h ( r , φ ) is decreasing with respect to φ [ 0 , π 3 ] . This means that f ( x ) is increasing as a function of x [ r 2 , r ] , i.e., the relation (26) is indeed equivalent to a x and the proof is complete. □

Consequence 12 Let the real numbers a b c and x y z be such that
a + b + c = x + y + z = 0 , a 2 + b 2 + c 2 = x 2 + y 2 + z 2 .
Then one of the following inequalities holds:
e a + e b + e c e x + e y + e z ,
(32)
or
e a + e b + e c e x + e y + e z .
(33)

The inequalities (32) and (33) are satisfied simultaneously if and only if a = x , b = y and c = z .

Proof According to Lemma 11, the inequality (32) is equivalent to
a x ,
(34)
while the inequality (33) is equivalent to
a x .
(35)

Since one of the relations (34) and (35) must hold, we have proved that one of the inequalities (32) and (33) is satisfied. They are simultaneously satisfied if and only if both (34) and (35) hold true, i.e., a = x (and consequently b = y , c = z ). □

Consequence 13 Let the real numbers a b c and x y z be such that
a + b + c = x + y + z = 0 , a 2 + b 2 + c 2 = x 2 + y 2 + z 2 and e a + e b + e c = e x + e y + e z .

Then we have a = x , b = y and c = z .

Proof Since by hypothesis e a + e b + e c e x + e y + e z holds, we can apply Lemma 11 to deduce a x and c z .

On the other hand, by virtue of the inverse inequality e x + e y + e z e a + e b + e c and Lemma 11, we obtain x a and z c . In conclusion, we get a = x , c = z and b = y . □

Proof of Theorem 10 In order to prove (13), we define the real numbers
t i = k z i ( i = 1 , 2 , 3 ) , where  k = c 1 2 + c 2 2 + c 3 2 z 1 2 + z 2 2 + z 3 2 > 1 .
(36)
Then we have
t 1 + t 2 + t 3 = c 1 + c 2 + c 3 = 0 and t 1 2 + t 2 2 + t 3 2 = c 1 2 + c 2 2 + c 3 2 .
(37)
If we apply the Consequence 12 for the numbers c 1 c 2 c 3 and t 1 t 2 t 3 , then we obtain that
e t 1 + e t 2 + e t 3 e c 1 + e c 2 + e c 3 or e t 1 + e t 2 + e t 3 e c 1 + e c 2 + e c 3 .
(38)
In what follows, let us show that
e z 1 + e z 2 + e z 3 < e t 1 + e t 2 + e t 3 .
(39)
Using the notations ρ : = 2 3 ( z 1 2 + z 2 2 + z 3 2 ) and
cos ζ : = z 1 ρ [ 1 2 , 1 ] , i.e. ζ : = arccos ( z 1 ρ ) [ 0 , π 3 ] ,
we have k ρ : = 2 3 ( t 1 2 + t 2 2 + t 3 2 ) and cos ζ = t 1 k ρ . With the help of the function h defined in (27), we can write the inequality (39) in the form
e ρ cos ζ + e ρ cos ( ζ + 2 π / 3 ) + e ρ cos ( ζ 2 π / 3 ) < e k ρ cos ζ + e k ρ cos ( ζ + 2 π / 3 ) + e k ρ cos ( ζ 2 π / 3 ) , or h ( ρ , ζ ) < h ( k ρ , ζ ) , ( ρ , ζ ) ( 0 , ) × [ 0 , π 3 ] , k > 1 .
(40)
The relation (40) asserts that the function h defined in (27) is increasing with respect to the first variable r ( 0 , ) . To show this, we compute the derivative
r h ( r , φ ) = e r cos φ cos φ + e r cos ( φ + 2 π / 3 ) cos ( φ + 2 π 3 ) + e r cos ( φ 2 π / 3 ) cos ( φ 2 π 3 ) .
(41)
By virtue of the Chebyshev’s sum inequality, we deduce from (41) that
r h ( r , φ ) > 0 .
(42)
Indeed, the Chebyshev’s sum inequality [[6], 2.17] asserts that: if a 1 a 2 a n and b 1 b 2 b n then
n k = 1 n a k b k ( k = 1 n a k ) ( k = 1 n b k ) .
In our case, we derive the following result: for any real numbers x, y, z such that x + y + z = 0 , the inequality
x e x + y e y + z e z 1 3 ( x + y + z ) ( e x + e y + e z ) = 0 ,
(43)

holds true, with equality if and only if x = y = z = 0 .

Applying the result (43) to the function (41), we deduce the relation (42). This means that h ( r , φ ) is an increasing function of r, i.e. the inequality (40) holds, and hence, we have proved (39).

One can show analogously that the inequality
e z 1 + e z 2 + e z 3 < e t 1 + e t 2 + e t 3
(44)

is also valid. From (38), (39) and (44), it follows that the assertion (13) holds true. Thus, the proof of Theorem 10 is complete. □

Since the statements of the Theorems 8 and 10 are equivalent, we have proved also the inequality (8).

Remark 14 The inequality (8) becomes an equality if and only if z i = c i , i = 1 , 2 , 3 .

Proof Indeed, assume that z 1 2 + z 2 2 + z 3 2 = c 1 2 + c 2 2 + c 3 2 . Then we can apply the Consequence 12 and we deduce that
e z 1 + e z 2 + e z 3 e c 1 + e c 2 + e c 3 or e z 1 + e z 2 + e z 3 e c 1 + e c 2 + e c 3 .
(45)
Taking into account (7)1,2 in conjunction with (45), we find
e z 1 + e z 2 + e z 3 = e c 1 + e c 2 + e c 3 or e z 1 + e z 2 + e z 3 = e c 1 + e c 2 + e c 3 .
(46)

By virtue of (46), we can apply the Consequence 13 to derive z 1 = c 1 , and consequently z 2 = c 2 , z 3 = c 3 . □

Let us prove the following version of the inequality (6) for two pairs of numbers d 1 , d 2 and x 1 , x 2 :

Remark 15 If the real numbers d 1 d 2 > 0 and x 1 x 2 > 0 are such that
x 1 2 + x 2 2 d 1 2 + d 2 2 and x 1 x 2 = d 1 d 2 = 1 ,
(47)
then the inequality
( log x 1 ) 2 + ( log x 2 ) 2 ( log d 1 ) 2 + ( log d 2 ) 2
(48)
holds true. Note that the additional condition
1 x 1 2 + 1 x 2 2 1 d 1 2 + 1 d 2 2

is automatically fulfilled.

Proof Since x 1 x 2 = d 1 d 2 = 1 and d 1 d 2 > 0 , x 1 x 2 > 0 , we have x 1 1 , d 1 1 and
log x 1 = log x 2 0 , log d 1 = log d 2 0 ,

so that the inequality (48) is equivalent to log x 1 log d 1 , i.e., we have to show that x 1 d 1 .

Indeed, if we insert x 2 = 1 x 1 and d 2 = 1 d 1 into the inequality (47)1 then we find
x 1 2 + 1 x 1 2 d 1 2 + 1 d 1 2 ,

which means that x 1 d 1 since the function t t 2 + 1 t 2 is increasing for t [ 1 , ) . This completes the proof. □

Alternative proof of Remark 15 Let x 3 = d 3 = 1 . Then (47) implies x 1 2 + x 2 2 + x 3 2 d 1 2 + d 2 2 + d 3 2 and x 1 x 2 x 3 = d 1 d 2 d 3 = 1 as well as
x 1 2 x 2 2 + x 2 2 x 3 2 + x 1 2 x 3 2 = 1 + x 2 2 + x 1 2 1 + d 2 2 + d 1 2 = d 1 2 d 2 2 + d 2 2 d 3 2 + d 1 2 d 3 2 ,
(49)

because x 1 2 x 2 2 = 1 = d 1 2 d 2 2 , and Theorem 6 provides the assertion. □

4 Some counterexamples for weakened assumptions

Example 16 Unlike in the 2D case in Remark 15, for two triples of numbers the second condition (18)2 of Theorem 2, namely y 1 y 2 + y 2 y 3 + y 1 y 3 a 1 a 2 + a 2 a 3 + a 1 a 3 , cannot be removed. Let
y 1 = e 6 , y 2 = 1 , y 3 = e 6 , a 1 = e 4 , a 2 = e 4 , a 3 = e 8 .
Then y 1 y 2 y 3 = a 1 a 2 a 3 = 1 and
y 1 + y 2 + y 3 > e 6 e 2 e 4 > 3 e 4 > a 1 + a 2 + a 3 ,
but
( log y 1 ) 2 + ( log y 2 ) 2 + ( log y 3 ) 2 = 36 + 0 + 36 < 16 + 16 + 64 = ( log a 1 ) 2 + ( log a 2 ) 2 + ( log a 3 ) 2 .
Example 17 The condition y 1 y 2 y 3 = a 1 a 2 a 3 cannot be weakened to y 1 y 2 y 3 a 1 a 2 a 3 . Indeed, let y 2 = y 3 = a 1 = a 2 = 1 , y 1 = e , a 3 = e 2 . Then
y 1 + y 2 + y 3 = e + 1 + 1 1 + 1 + e 2 = a 1 + a 2 + a 3 , y 1 y 2 + y 1 y 3 + y 2 y 3 = e + e + 1 1 + e 2 + e 2 = a 1 a 2 + a 1 a 3 + a 2 a 3 , y 1 y 2 y 3 = e e 2 = a 1 a 2 a 3 .
But nevertheless
( log y 1 ) 2 + ( log y 2 ) 2 + ( log y 3 ) 2 = 1 + 0 + 0 < 0 + 0 + 4 = ( log a 1 ) 2 + ( log a 2 ) 2 + ( log a 3 ) 2 .

A counterexample for the two variable case can be constructed analogously.

Example 18 Even with an analogous condition, the inequality (4) does not hold for n = 4 numbers (without further assumptions). Indeed, let
y 1 = e , y 2 = y 3 = e 7 , y 4 = e 15 , a 1 = a 2 = e 6 , a 3 = e 7 , a 4 = e 19 .
Then y 1 y 2 y 3 y 4 = a 1 a 2 a 3 a 4 = 1 . Also,
y 1 + y 2 + y 3 + y 4 = e + e 7 + e 7 + e 15 > 0 + e 7 + 2 e 6 + e 19 = a 1 + a 2 + a 3 + a 4 .
Furthermore,
y 1 y 2 + y 1 y 3 + y 1 y 4 + y 2 y 3 + y 2 y 4 + y 3 y 4 = e 8 + e 8 + e 14 + e 14 + e 8 + e 8
and
a 1 a 2 + a 1 a 3 + a 1 a 4 + a 2 a 3 + a 2 a 4 + a 3 a 4 = e 12 + e 13 + e 13 + e 13 + e 13 + e 12 .
Since e 2 > 2 e + 1 , we have e 14 > e 13 + e 13 + e 12 and, therefore,
y 1 y 2 + y 1 y 3 + y 1 y 4 + y 2 y 3 + y 2 y 4 + y 3 y 4 a 1 a 2 + a 1 a 3 + a 1 a 4 + a 2 a 3 + a 2 a 4 + a 3 a 4 .
Nevertheless, for the sum of squared logarithms, the ‘reverse’ inequality
( log y 1 ) 2 + ( log y 2 ) 2 + ( log y 3 ) 2 + ( log y 4 ) 2 = 1 + 49 + 49 + 225 = 324 < 482 = 36 + 36 + 49 + 361 = ( log a 1 ) 2 + ( log a 2 ) 2 + ( log a 3 ) 2 + ( log a 4 ) 2

holds true.

Example 19 The inequality (4) does not remain true either, if the function log ( y ) is replaced by its linearization ( y 1 ) . Indeed, let y 1 = 9 , y 2 = 5 , y 3 = 1 45 , a 1 = 10 , a 2 = 1 , a 3 = 1 10 . Then
y 1 + y 2 + y 3 > 14 > 11.1 = a 1 + a 2 + a 3
and
y 1 y 2 + y 1 y 3 + y 2 y 3 > 45 11.1 = a 1 a 2 + a 1 a 3 + a 2 a 3 .
But
( y 1 1 ) 2 + ( y 2 1 ) 2 + ( y 3 1 ) 2 = 64 + 16 + ( 44 45 ) 2 < 81 < 9 2 + 0 + ( 9 10 ) 2 = ( a 1 1 ) 2 + ( a 2 1 ) 2 + ( a 3 1 ) 2 .

5 Conjecture for arbitrary n

The structure of the inequality in dimensions n = 2 and n = 3 and extensive numerical sampling strongly suggest that the inequality holds for all n N if the n corresponding conditions are satisfied. More precisely, in terms of the elementary symmetric polynomials, we expect the following:

Conjecture 20 Let n N and y i , a i > 0 for i = 1 , , n . If for all i = 1 , , n 1 we have
e i ( y 1 , , y n ) e i ( a 1 , , a n ) and e n ( y 1 , , y n ) = e n ( a 1 , , a n ) ,
then
i = 1 n ( log y i ) 2 i = 1 n ( log a i ) 2 .

6 Applications

The investigation in this paper has been motivated by some recent applications. The new sum of squared logarithms inequality is one of the fundamental tools in deducing a novel optimality result in matrix analysis and the conditions in the form (3) had been deduced in the course of that work. Optimality in the matrix problem suggested the sum of squared logarithms inequality. Indeed, based on the present result in [7], it has been shown that for all invertible Z C 3 × 3 and for any definition of the matrix logarithm as possibly multivalued solution X C 3 × 3 of exp X = Z it holds
min Q Q = I log Q Z F 2 = log U p Z F 2 = log H F 2 , min Q Q = I sym log Q Z F 2 = sym log U p Z F 2 = log H F 2 ,
(50)
where sym X = 1 2 ( X + X ) is the Hermitian part of X C 3 × 3 and U p is the unitary factor in the polar decomposition of Z into unitary and Hermitian positive definite matrix H
Z = U p H .
(51)
This result (50) generalizes the fact that for any complex logarithm and for all z C { 0 }
min ϑ ( π , π ] | log C [ e i ϑ z ] | 2 = | log R | z | | 2 , min ϑ ( π , π ] | Re log C [ e i ϑ z ] | 2 = | log R | z | | 2 .
(52)
The optimality result (50) can now also be viewed as another characterization of the unitary factor in the polar decomposition. In addition, in a forthcoming contribution [8], we use (50) to calculate the geodesic distance of the isochoric part of the deformation gradient F det F 1 3 SL ( 3 , R ) to SO ( 3 , R ) in the canonical left-invariant Riemannian metric on SL ( 3 , R ) , to the effect that
dist geod 2 ( F det F 1 3 , SO ( 3 , R ) ) = dev 3 log F T F F 2 ,
(53)

where dev 3 X = X 1 3 ( tr X ) I is the orthogonal projection of X R 3 × 3 to trace free matrices. Thereby, we provide a rigorous geometric justification for the preferred use of the Hencky-strain measure log F T F F 2 in nonlinear elasticity and plasticity theory [9].

Declarations

Acknowledgements

The first author (MB) was supported by the German state grant: ‘Programm des Bundes und der Länder für bessere Studienbedingungen und mehr Qualität in der Lehre’.

Authors’ Affiliations

(1)
Lehrstuhl für Nichtlineare Analysis und Modellierung, Fakultät für Mathematik, Universität Duisburg-Essen
(2)
Department of Mathematics, University ‘A.I. Cuza’ of Iaşi

References

  1. Guan K: Schur-convexity of the complete elementary symmetric functions. J. Inequal. Appl. 2006., 2006: Article ID 67624. doi:10.1155/JIA/2006/67624Google Scholar
  2. Roventa I: A note on Schur-concave functions. J. Inequal. Appl. 2012, 159: 1–9.MathSciNetGoogle Scholar
  3. Steele JM: The Cauchy-Schwarz Master Class: an Introduction to the Art of Mathematical Inequalities. Cambridge University Press, Cambridge; 2004.View ArticleGoogle Scholar
  4. Khan AR, Latif N, Pečarić J: Exponential convexity for majorization. J. Inequal. Appl. 2012, 105: 1–13.Google Scholar
  5. Karamata J: Sur une inégalité relative aux fonctions convexes. Publ. Math. Univ. Belgrad 1932, 1: 145–148.Google Scholar
  6. Hardy GH, Littlewood JE, Pólya G: Inequalities. The University Press, Cambridge; 1934.Google Scholar
  7. Neff, P, Nagatsukasa, Y, Fischle, A: The unitary polar factor Q = U p minimizes Log ( Q Z ) 2 and sym Log ( Q Z ) 2 in the spectral norm in any dimension and the Frobenius matrix norm in three dimensions (2013, submitted)Google Scholar
  8. Neff, P, Eidel, B, Osterbrink, F, Martin, R: The isotropic Hencky strain energy log U 2 measures the geodesic distance of the deformation gradient F GL + ( n ) to SO ( n ) in the unique left-invariant Riemannian metric on GL + ( n ) which is also right O ( n ) -invariant (2013, in preparation)Google Scholar
  9. Hencky H: Über die Form des Elastizitätsgesetzes bei ideal elastischen Stoffen. Z. Techn. Physik 1928, 9: 215–220.Google Scholar

Copyright

© Bîrsan et al.; licensee Springer. 2013

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.