Skip to main content

Extremal problems related to convexity

Abstract

We consider the extremal problem of maximizing functions u in the class of real-valued biconvex functions satisfying a boundary condition ψ on a product of the unit ball with itself, with the \(\ell^{p}\)-norm. In 1986, Burkholder explicitly found the maximal function for \(p=2\). In this paper, we find some characterizations of such extremal functions. We establish that sufficiently smooth solutions to the convex extremal problems with given boundary values are affine on line segments and the domain D is foliated by such segments.

1 Introduction

In this paper, we consider properties of convex functions defined in a convex plane domain D that are maximal with respect to prescribed real boundary values. We show that if such an extremal function is in \(C^{3}\), then it is affine on certain line segments that foliate the domain D. This structural result represents a very preliminary step in a larger program of determining maximal ζ-convex functions in certain Banach spaces.

For precise definitions and background results we refer to Section 3. But let us note here that in Banach space theory one considers Banach spaces that are ζ-convex, that is, they support a biconvex function ζ satisfying (6). The question arises as to which Banach spaces are ζ-convex, and for them, what the largest biconvex function satisfying (6) is equal to. Banach spaces of the type \(\ell^{p}\) and \(L^{p}\) are known to be ζ-convex for \(1< p<\infty\). However, the extremal function ζ has been determined only for Hilbert spaces, by Burkholder in 1986. In 2008, Saksman drew the attention of the first author to the fact that the extremal function is not known in \(\ell^{p}\)- and \(L^{p}\)-spaces for \(p\neq 2\), even though it might be desirable to know it. This question provides the basic motivation for the present paper.

In order to get started with the study of such a question, we only consider the simplest non-trivial case, that of the Banach space \(\ell ^{p}\) in \({\mathbb{R}}^{2}\). Any results obtained in this case might then have generalizations to other cases. Let be the closed unit ball in \({\mathbb{R}}^{2}\) in the \(\ell^{p}\)-metric. We are looking for the largest biconvex function \(\zeta: \overline{B}\times\overline {B}\to{\mathbb{R}}\) such that \(\zeta(x,y)\leq|x+y|_{p}\) whenever \(|x|_{p}=|y|_{p}=1\). The Hilbert space case suggests that not much is lost if one fixes one variable, say y, and only uses initially the assumption that \(x\to\zeta(x,y)\) is a convex function of \(x\in \overline{B}\). In this spirit we consider the slightly more general problem of finding or characterizing the maximal function \(u:D\to {\mathbb{R}}\) defined in the closure of a bounded convex plane domain, such that \(u(x)\leq\psi(x)\) for all \(x\in\partial D\), for a given function ψ on ∂D. If the boundary function ψ is not sufficiently regular, it may of course happen that \(\psi_{0}(x)=u(x)\) for \(x\in\partial D\), for the extremal function u satisfies only \(\psi _{0}\leq\psi\) but not necessarily \(\psi_{0}=\psi\) on ∂D. However, the same extremal function u is then obtained for boundary values \(\psi_{0}\) and the structural results that we obtain allow us to express u in the interior of D in terms of \(\psi_{0}\) (which, in interesting applications, will often be the same as ψ) as soon as we know a certain foliation of D.

In applications it is likely that the extremal function is many times differentiable, perhaps even in \(C^{\infty}\), in the interior of D. Thus we see no significant loss of generality in assuming, in this paper, that u is in \(C^{3}\) and pursuing the implications of this assumption.

We first observe that the extremal function must be a solution of the Hopf differential equation. Indeed, we find that with \(x=(x_{1},x_{2})\in D\), the function \(A=u_{x_{1}x_{1}}/u_{x_{1}x_{2}}=u_{x_{1}x_{2}}/u_{x_{2}x_{2}}\) satisfies such an equation. The general form of such solutions is known, and they are given implicitly in terms of a parameter function Φ. We then ask what can be said about such functions A and, consequently, about the function u. We find that at each point x of D, the value of \(A(x)\) determines a line L through x and hence the line segment \(L_{1} = L\cap D\). We show that A is constant and that u is affine on each such segment \(L_{1}\). This shows that the domain D is foliated by such line segments: the segments are disjoint and their union is D. Hence on each segment \(L_{1}\), the function u, being affine, is determined by its values at the end points of \(L_{1}\), which are on ∂D, and these boundary values of u are the values of \(\psi_{0}\), hence often the same as the values of ψ.

Thus, if we knew what the foliation is explicitly, we could then compute the extremal function u. However, at this stage many foliations are still possible, and not all of them correspond to the extremal function. Further work shows that one can develop a differential equation for a parametrization of the foliation, and the solution of that equation will give the sought-for extremal function. The equation is complicated and we will leave the presentation of this work to another paper. When \(p=2\), we recover the extremal function u discovered by Burkholder. When \(p\neq 2\), the equation is complicated and it is not clear if explicit solutions can be found.

However, the structural results of this paper are of interest in their own right. In various papers of Burkholder, it has been pointed out that certain functions discovered there are affine on certain line segments, apparently without the realization that this is a general property of this type of extremal functions. Thus we gain a greater understanding of the nature of the solutions to these extremal problems. For example, in [1], p.687, (9.7), it is observed that a certain extremal biconcave function is affine on certain line segments. Similarly, Burkholder has noted that his extremal function (7) is affine on certain line segments. We now see that this is to be expected and that one can further approach all such problems by specifically looking for such foliations.

On the basis of preliminary studies, we feel that it is likely that our methods will work more generally in \({\mathbb{R}}^{n}\), for all \(n\geq2\), and it may be that similar results are valid in more general Banach spaces. This work will have to be the subject of a later paper.

It seems to be typical in applications that in fact, the convex domain D is divided by a line segment into two parts, in each of which the function A described above is well defined. Thus there is a foliation in each of the two parts of D and the separating segment does not belong to either foliation (or, in an extended sense, belongs to both). This is connected to certain second order partial derivatives of u being zero on the separating segment. The assumptions of our theorems exclude this case, but the phenomena described in the theorems of this paper are then valid in each of the two parts of D, each of them being a convex domain in its own right. When considering each part as a convex domain, the boundary values are not known on that part of the boundary corresponding to the separating segment, but this does not matter as that segment will not be involved in the foliation of the interior of either part of D.

2 Results

In Hilbert space, the greatest biconvex function in a class of biconvex functions satisfying certain boundary conditions was explicitly found by Burkholder [2] in 1986, see Theorem 3.1 in Section 3. Such a result would imply ζ-convexity which is one of the three equivalent conditions giving some geometric characterizations of Banach spaces, see [3]:

$$\zeta\mbox{-convex} \quad \Longleftrightarrow\quad \mbox{UMD}\quad \Longleftrightarrow\quad \mbox{HT-space}. $$

To study the extremal problems further in general Banach spaces, we may investigate the characterizations of such extremal functions. We consider the problem of finding the extremal function in the class of real-valued biconvex functions u satisfying a boundary condition ψ on a domain \(D\times D\) where D is a convex domain in \(\mathbb {R}^{2}\). In particular, we restrict the domain D to the unit ball in the \(\ell^{p} \)-norm or the unit disk when \(p=2\). We establish that sufficiently smooth solutions to the convex extremal problems with given boundary values are affine on line segments and the domain D is foliated by such segments.

In \(\ell^{p}(\mathbb{R}^{2})\), we denote the \(\ell^{p}\)-norm by \(\vert x\vert _{p}\) where \(x=(x_{1},x_{2})\) in \(\mathbb{R}^{2}\). Thus \(\vert x\vert _{p}^{p}=\vert x_{1}\vert ^{p}+\vert x_{2}\vert ^{p}\). We define the open and closed unit balls in \(\mathbb{R}^{2}\) for the \(\ell^{p}\)-metric by \(B= \{x\in\mathbb{R}^{2}:\vert x\vert _{p}<1 \}\) and \(\overline{B}= \{x\in\mathbb{R}^{2}:\vert x\vert _{p}\leq1 \}\).

Let \(u:\overline{B}\times\overline{B}\rightarrow\mathbb{R}\) be a continuous function that is in \(C^{3}\) in \(B\times B\). Fix \(y=(y_{1},y_{2})\in\overline{B}\). Then we can consider u as a function of \(x_{1}\) and \(x_{2}\), where \(x=(x_{1},x_{2})\) in . For \(x_{1}=\cos{\theta}\), \(x_{2}=\sigma(\theta)=\pm (1-|\cos(\theta )|^{p} )^{1/p}\) and \(y=(y_{1},y_{2})\in\overline{B}\), we define a boundary function ψ by

$$ \psi(\theta)=u\bigl(\cos(\theta),\sigma(\theta)\bigr)= \bigl(\bigl\vert \cos(\theta )+y_{1}\bigr\vert ^{p}+\bigl\vert \sigma(\theta)+y_{2}\bigr\vert ^{p} \bigr)^{1/p} = \vert x+y\vert _{p} . $$
(1)

When \(p=2\), this function ψ is the same boundary function as in Burkholder’s result [2].

We denote the partial derivatives of a function \(u=u(x_{1},x_{2})\) by using subscripts, for example,

$$u_{x_{1}x_{2}} = \frac{ \partial^{2} u }{ \partial x_{1}\, \partial x_{2} } . $$

We define a real-valued function A on by

$$ A(x_{1},x_{2})=\frac{u_{x_{1}x_{1}}}{u_{x_{1}x_{2}}}= \frac{u_{x_{1}x_{2}}}{u_{x_{2}x_{2}}}, $$
(2)

assuming that \(u_{x_{1}x_{1}}\), \(u_{x_{2}x_{2}}\) and \(u_{x_{1}x_{2}}\) do not vanish there (a milder requirement would be that \(u_{x_{1}x_{1}}\), \(u_{x_{2}x_{2}}\) and \(u_{x_{1}x_{2}}\) do not vanish on any open set, but we do not address that here). If \(u\in C^{3}\), then differentiation shows that we have the relation

$$AA_{x_{2}}=A_{x_{1}}, $$

which is a special case of the Hopf differential equation. Hence, the general solution \(A=A(x_{1},x_{2})\) of this equation is given implicitly by

$$ \Phi(A)=x_{2}+Ax_{1}, $$
(3)

where Φ is a suitable real-valued function (see [4]). Different admissible choices for Φ give rise to different solutions A.

From this perspective and using these notations, we obtain the following theorems.

Theorem 2.1

Let \(A=A(x_{1},x_{2})\) be a continuous real-valued function defined for \((x_{1},x_{2})\in D\), where D is the closure of a bounded convex plane domain. If Φ is a continuous real-valued function on an interval of the real axis that contains the set \(A(D)\), and if \(\Phi (A(x_{1},x_{2}))=x_{2}+x_{1}A(x_{1},x_{2})\) in D, then \(A(x_{1},x_{2})\) is constant on certain line segments that are maximal in the sense that each segment is the intersection of D with a straight line. The union of such line segments is D. Moreover, if the real-valued function u is in \(C^{2}\) in D and satisfies (2) there (which, in particular, means that \(u_{x_{2}x_{2}}\) and \(u_{x_{1}x_{2}}\) do not vanish in D), then u is affine on each such segment.

Remark 2.2

When \(D=\overline{B}\), Theorem 2.1 gives a foliation of the entire domain .

Theorem 2.3

Let u be a convex \(C^{2}\) real-valued function on a bounded convex plane domain D. If u is affine on an open line segment L in D, then at each point of L we have

$$ u_{x_{1}x_{1}}u_{x_{2}x_{2}}-u_{x_{1}x_{2}}^{2}=0. $$
(4)

Furthermore, at each point of L where \(u_{x_{1}x_{2}}\ne0\) and \(u_{x_{2}x_{2}}\ne0\), we have

$$\frac{u_{x_{1}x_{1}}}{u_{x_{1}x_{2}}}=\frac{u_{x_{1}x_{2}}}{u_{x_{2}x_{2}}} . $$

Let now D be the closure of a bounded convex plane domain, let ψ be a continuous real-valued function on ∂D, and let \({\mathcal{F}}\) be the set of real-valued convex functions v on D such that \(v\leq\psi\) on ∂D. Define

$$u(x) =\sup\bigl\{ v(x) : v\in{\mathcal{F}} \bigr\} . $$

Following Burkholder, we see that u is convex and \(u\in{\mathcal{F}}\), so that u is the maximal function in \({\mathcal{F}}\). Suppose that u is in \(C^{3}\) in the interior of D. The convexity of u implies that at each point we have \(u_{x_{1}x_{1}}\geq0\), \(u_{x_{2}x_{2}}\geq 0\), and \(u_{x_{1}x_{1}}u_{x_{2}x_{2}}-u_{x_{1}x_{2}}^{2}\geq0\). As noted by Burkholder, at each point equality must hold in at least one of these inequalities, for otherwise we may modify u slightly in a small neighborhood of the point and get an even larger element of \({\mathcal{F}}\). Now if \(u_{x_{1}x_{1}}= 0\) or \(u_{x_{2}x_{2}}= 0\), then from

$$u_{x_{1}x_{1}}u_{x_{2}x_{2}}-u_{x_{1}x_{2}}^{2} = - u_{x_{1}x_{2}}^{2} \geq0 $$

we deduce that \(u_{x_{1}x_{2}}=0\), so that

$$u_{x_{1}x_{1}}u_{x_{2}x_{2}}-u_{x_{1}x_{2}}^{2} = 0 . $$

Thus (4) holds at each point of the interior of D.

If we now assume that \(u_{x_{1}x_{2}}\ne0\) and \(u_{x_{2}x_{2}}\ne0\) in D, it follows that we may define the function A as in (2), and then A is given by (3) for a suitable function Φ. Now from Theorem 2.1 we see that u is affine on line segments that foliate D. Thus we obtain the following result.

Theorem 2.4

Let D be the closure of a bounded convex plane domain, let ψ be a continuous real-valued function on ∂D, and let u be the maximal real-valued convex function on D such that \(u\leq\psi\) on ∂D. Suppose that \(u\in C^{3}\). Then (4) holds at each point of the interior of D. Furthermore, if \(u_{x_{1}x_{2}}\ne0\) and \(u_{x_{2}x_{2}}\ne0\) in D, then we may define the function A as in (2), and then A is given by (3) for a suitable function Φ. Finally, u is affine on line segments that foliate D.

We conclude with the remark that the property of u being affine on line segments is equivalent to (4) in a suitable sense.

Theorem 2.5

Let D be a bounded convex plane domain. Let \(u:D \rightarrow\mathbb{R}\) be a convex function in the class \(C^{3}\). Then, on certain line segments that are maximal in the sense that each segment is the intersection of D with a straight line, we have

$$ \frac{u_{x_{1}x_{1}}}{u_{x_{1}x_{2}}}=\frac{u_{x_{1}x_{2}}}{u_{x_{2}x_{2}}} \quad \textit{if, and only if,} \quad u \textit{ is affine on the line segment}, $$
(5)

assuming that \(u_{x_{1}x_{2}}\ne0\) and \(u_{x_{2}x_{2}}\ne0\).

Namely, if u is affine on line segments as stated, then it follows from Theorem 2.3 that \(u_{x_{1}x_{1}}/u_{x_{1}x_{2}}=u_{x_{1}x_{2}}/u_{x_{2}x_{2}}\) on these segments if the denominators are assumed to be non-zero. Conversely, if the denominators are non-zero and \(u_{x_{1}x_{1}}/u_{x_{1}x_{2}}=u_{x_{1}x_{2}}/u_{x_{2}x_{2}}\), then we may define A as in (2), and it follows as explained above that u is affine on line segments that foliate D. This proves Theorem 2.5.

3 Some geometric characterizations of Banach spaces

To give some background, in order to motivate why we should study at all questions such as those addressed in the theorems of the previous section, we review some literature and results that give rise to questions on biconvex functions. Recall that a function \(u:D\to {\mathbb{R}}\), defined on a convex subset D of a Banach space, is said to be convex if \(u(tx+(1-t)y)\leq tu(x)+(1-t)u(y)\) whenever \(x,y\in D\) and \(0< t<1\) (note that then \(tx+(1-t)y\in D\)). A function \(u:D\times D\to{\mathbb{R}}\) is said to be biconvex if for each \(x\in D\), the function \(y\to u(x,y)\) is a convex function of \(y\in D\), and if for each \(y\in D\), the function \(x\to u(x,y)\) is a convex function of \(x\in D\). Taken together, the following results should motivate the quest for the best possible biconvex function in the definition of ζ-convexity, for \(L^{p}\)-spaces.

3.1 ζ-Convexity

A real or complex Banach space E with norm \(\Vert x\Vert \) is ζ-convex if there is a biconvex function \(\zeta:E\times E\rightarrow\mathbb{R}\) such that \(\zeta(0,0)>0\) and

$$ \zeta(x,y)\le \Vert x+y\Vert \quad \text{if } \Vert x\Vert =\Vert y\Vert =1. $$
(6)

Clearly the constant function \(\zeta\equiv0\) is biconvex and satisfies (6), so that it is the condition \(\zeta(0,0)>0\) that makes the requirements non-trivial. Burkholder has shown that \(\zeta (0,0)\le1\) and E is a (real or complex) Hilbert space if, and only if, it is possible to have \(\zeta(0,0)=1\). Thus for ζ-convex non-Hilbert Banach spaces we have \(0<\zeta(0,0)<1\).

It is shown in Lemma 3.1 of [2] that, in order to find the greatest biconvex function ζ satisfying (6), it is enough to have the function ζ defined and biconvex on the product of the closed unit ball of E with itself rather than on the whole space \(E\times E\).

Let be the closed unit ball in a real Hilbert space H with norm \(\Vert x\Vert \) and let \({\mathcal{F}}\) be the class of biconvex functions u on \(\overline{B}\times\overline{B}\) satisfying (6). Then we have the following theorem by Burkholder.

Theorem 3.1

Burkholder [2]

Let \(\zeta_{B}\) denote the greatest biconvex function u of \(\mathcal {F}\) and let \(\langle x,y \rangle\) be the inner product of x and y. Then

$$ \zeta_{B}(x,y) = \bigl(1+2 \langle x,y \rangle+\Vert x\Vert ^{2}\Vert y\Vert ^{2}\bigr)^{1/2}\quad \textit{for all }x,y\in\overline{B}. $$
(7)

Note that looking for a maximal ζ makes sense only when ζ is restricted to \(\overline{B}\times\overline{B}\), since outside this set one could always make ζ larger without violating any requirements.

3.2 UMD-unconditional for martingale differences

Let Ω be a probability space with a σ-algebra \({\mathcal{A}}\) of measurable sets for a measure P. A discrete-time E-valued martingale g on Ω is a sequence of E-valued functions \(g_{n}\) in \(L^{1}(\Omega)\) such that \(g_{n}\) is measurable with respect to a σ-algebra \({\mathcal{A}}_{n}\), where \({\mathcal{A}}_{n}\subset {\mathcal{A}}_{n+1}\subset{\mathcal{A}}\), such that, for each \(A\in {\mathcal{A}}_{n}\), we have \(\int_{A}(g_{n+1}-g_{n})\, dP=0\).

The Banach space E is said to be UMD (unconditional for martingale differences) if \(\beta_{p}(E)<\infty\) for some \(p\in(1,\infty)\); equivalently, for all \(p\in(1,\infty)\) by Maurey [5], 1975, where \(\beta_{p}(E)\) denotes the least \(\beta\in[1,\infty]\) such that

$$\Biggl\Vert \sum^{n}_{k=0} \varepsilon_{k}d_{k}\Biggr\Vert _{p}\leq\beta \Biggl\Vert \sum^{n}_{k=0}d_{k} \Biggr\Vert _{p} $$

for all E-valued martingale differences \(d_{k}=g_{k+1}-g_{k}\), all sequences \(\varepsilon= (\varepsilon_{1},\varepsilon_{2},\varepsilon _{3},\ldots )\) of numbers in \(\{1,-1 \}\), and all non-negative integers n.

The following theorem shows how the ζ-convexity property characterizes UMD-spaces.

Theorem 3.2

Burkholder [3]

A Banach space E is UMD if, and only if, it is ζ-convex.

3.3 HT-space

Let \(1< p< \infty\) and \(f:{\mathbb{R}}\to E\) belong to \(L_{E}^{p}(\mathbb {R})\). For each \(\varepsilon>0\), let \(H_{\varepsilon}\) denote the truncated Hilbert transform of f, that is,

$$H_{\varepsilon}f(x)=\frac{1}{\pi} \int_{\vert y\vert >\varepsilon}\frac {f(x-y)}{y}\, dy, $$

where \(x\in\mathbb{R}\), and we define \(\alpha_{p}(E)\) to be the least \(\alpha\in [0,\infty ]\) such that

$$\Vert Hf\Vert _{p}\le\alpha_{p}(E)\Vert f\Vert _{p} \quad \text{if } 1< p< \infty, \text{where } Hf(x)=\lim _{\varepsilon \downarrow0}H_{\varepsilon}f(x). $$

This limit exists for almost all \(x \in\mathbb{R}\), see M Riesz [6], 1928.

The Banach space E is said to be an HT-space if \(\alpha_{p}(E)\) is finite for some \(p\in(1,\infty)\); equivalently, for all \(p\in (1,\infty)\), see Schwartz [7], 1961 and Benedek, Calderón, and Panzone [8], 1962.

The following theorem shows the relation between HT-spaces and UMD-spaces.

Theorem 3.3

Bourgain [9], Burkholder [10], McConnell [10]

A Banach space E is an HT-space if, and only if, it is UMD.

By Theorem 3.2 and Theorem 3.3, we obtain the three equivalent conditions giving geometric characterizations of Banach spaces.

4 Proof of Theorem 2.1

Let the assumptions of Theorem 2.1 be satisfied. We separate the proof of Theorem 2.1 into two parts. First, we will geometrically prove that \(A(x_{1},x_{2})\) is constant on certain line segments that are maximal and then we use this fact to show that u is affine on each such segment.

4.1 The function \(A(x_{1},x_{2})\) is constant on certain line segments

Let \(O=c_{1}+ic_{2}\) be any point of D. Set \(a=A(O)\). This determines the line \(L_{1}\) containing all points \((x_{1},x_{2})=x_{1}+ix_{2}\) such that \(\Phi(a) = x_{2} + a x_{1}\), and the point O must be on this line, that is, by assumption, we have \(\Phi(a) = c_{2} + a c_{1}\).

Note that any point \((x_{1},x_{2})\) at which A takes the value a must be on line \(L_{1}\). Therefore, for each value of A, there is a corresponding line segment (the intersection of D and \(L_{1}\)), and the value a cannot be taken anywhere outside that line segment. At this stage there is no guarantee that A cannot take also values other than a on \(L_{1}\) since points other than O on \(L_{1}\) might lie also on relevant lines other than \(L_{1}\).

We seek to prove that A is constant on the intersection of D and \(L_{1}\). To get a contradiction, suppose that A is not constant on \(D\cap L_{1}\). Then there is a point \(P\in D\cap L_{1}\) such that \(A(P)\neq a\). We may assume that \(A(P) > a\), since the argument that follows would be similar in the case \(A(P) < a\).

Write \(P=d_{1}+id_{2}\) and \(A(P)=a'\). Let \(L_{2}\) denote the line of all points \((x_{1},x_{2})\) such that \(\Phi(a') = x_{2} + a' x_{1}\). The point P must be on this line, that is, \(\Phi(a') = d_{2} + a' d_{1}\). Since P also lies on \(L_{1}\), we further have \(\Phi(a) = d_{2} + a d_{1}\).

Since \(a' \neq a\), the lines \(L_{1}\) and \(L_{2}\) have different slopes and hence intersect at exactly one point, and therefore this point of intersection must be P. Note that the point O is not on \(L_{2}\).

The line \(L_{2}\) intersects the boundary of D at exactly two points, say \(P_{1}=a_{1}+ib_{1}\) and \(P_{2}=a_{2}+ib_{2}\). Let \(L_{3}\) and \(L_{4}\) denote the closed line segments from O to \(P_{1}\) and from O to \(P_{2}\), respectively.

Since A cannot take the value a at any point of \(L_{2}\) including P, by continuity, all values of A on \(L_{2}\) must be greater than a since \(A(P)>a\).

Let \(b=\min\{A(P_{1}),A(P_{2})\}\). Since \(A(P_{1})>a\) and \(A(P_{2})>a\), we have \(b>a\). Since A is real-valued and continuous, A must take all values belonging to the closed interval \([a,A(P_{1})]\) on the segment \(L_{3}\). Also A takes all values in the closed interval \([a,A(P_{2})]\) on the segment \(L_{4}\). Thus on each of \(L_{3}\) and \(L_{4}\), A takes all values on \([a,b]\).

For each value \(g\in(a,b)\), there is a point \(P'\) on \(L_{3}\) and a point \(P''\) on \(L_{4}\) such that \(A(P')=A(P'')=g\). Then both \(P'\) and \(P''\) must lie on the line of points \((x_{1},x_{2})\) such that \(\Phi(g) = x_{2} + g x_{1}\). Let this line be denoted by \(L_{g}\). Then \(L_{g}\) can intersect each of \(L_{3}\) and \(L_{4}\) only once, and all points where A takes the value g must lie on \(L_{g}\).

Concerning values \(\alpha< a\) that A might hypothetically take on \(L_{3}\), by continuity each such value α would have to be taken at least twice on \(L_{3}\). On the other hand, points where the value α are taken must lie on the line \(L_{\alpha}\), so that since two distinct points of \(L_{3}\) lie on \(L_{\alpha}\), it follows that the line \(L_{\alpha }\) contains the segment \(L_{3}\). This is a contradiction, since the line containing \(L_{3}\) must be equal to \(L_{\beta}\), where \(\beta= A(P_{1})>a>\alpha\). Thus A takes no values <a on \(L_{3}\). Similarly, A takes no values <a on \(L_{4}\). This argument implies that A is one-to-one on each of \(L_{3}\) and \(L_{4}\). This also means that \(P'\) and \(P''\) are unique, for each \(g\in(a,b)\). Thus A is strictly increasing on each of \(L_{3}\) and \(L_{4}\) when we move from O to points where A takes the value b.

Consider now values \(g>a\) that are arbitrarily close to a. Then \(P'\) and \(P''\) must also be arbitrarily close to O, by the continuity of A and the fact that A is one-to-one on these segments. Moreover, when g is close to a, also the line \(L_{g}\) must be close to \(L_{1}\), and in particular the slope of \(L_{g}\), that is, the slope of the line joining \(P'\) and \(P''\), must be close to the slope of \(L_{1}\).

Since the problem is invariant under translations and rotations when we now discuss only these slopes, we may disregard the connection of the slope to the value a, and assume that O is the origin and \(L_{1}\) is the real axis. Then P is a non-zero real number and we may assume that \(P>0\). The slope of \(L_{2}\) is non-zero, and we may assume that the slope is negative since the argument is similar in the other case. If \(P_{1}\) is the left end point, then \(a_{1}<0\) and \(b_{1}>0\), and then \(a_{2}>0\) and \(b_{2}<0\), see Figure 1.

Figure 1
figure 1

An illustration for the proof of Theorem  2.1 .

Now \(L_{3}\) is part of the line \(y = - k_{1} x\) and \(L_{4}\) is part of the line \(y = - k_{2} x\), where \(k_{1}=|b_{1}/a_{1}|\) and \(k_{2}=|b_{2}/a_{2}|\). When \(P'\) is close to 0, we have \(P'= -c + ik_{1}c\) for some small \(c>0\), and when \(P''\) is close to 0, we have \(P'' = d - ik_{2}d\) for some small \(d>0\). The slope of \(L_{g}\) is negative, and its absolute value is

$$(k_{1} c + k_{2} d)/(c+d)\geq\min\{k_{1},k_{2} \}>0 $$

and hence cannot be arbitrarily close to the slope of \(L_{1}\), which is zero.

This is the desired contradiction, and it follows that A had to be constant on the intersection of D and the line \(L_{1}\). This then shows that the lines \(\Phi(a)=x_{2}+ax_{1}\) for various real numbers a must foliate D, and on each of these lines, the function A takes the constant value a. Since a function cannot take two values at the same point, the intersections of these lines with D must be disjoint.

4.2 The function u is affine on line segments

Consider a point \((x_{1},x_{2})\) at which A takes a certain value a. Set \(b=\Phi(a)\). From Section 4.1, we see that A is equal to the constant a on the line \(b=x_{2}+ax_{1}\), denoted by L. Then L has a unit tangent vector \((c,s)\) where \(c=\cos\theta\) and \(s=\sin\theta\) for a certain θ, and then \(0=s+ac\).

We will show that the derivative of u in the direction \((c,s)\) is constant on L so that u is affine on L. To see this, we take the derivative of u in the direction of L twice and get

$$c(cu_{x_{1}}+su_{x_{2}})_{x_{1}} + s(cu_{x_{1}}+su_{x_{2}})_{x_{2}}=c^{2} u_{x_{1}x_{1}} + 2cs u_{x_{1}x_{2}} + s^{2} u_{x_{2}x_{2}}. $$

On the line L, we have \(a=A(x_{1},x_{2})=u_{x_{1}x_{1}}/u_{x_{1}x_{2}}=u_{x_{1}x_{2}}/u_{x_{2}x_{2}}\). Hence

$$u_{x_{1}x_{2}} = a u_{x_{2}x_{2}}\quad \text{and}\quad u_{x_{1}x_{1}} = a u_{x_{1}x_{2}} = a^{2} u_{x_{2}x_{2}}. $$

Therefore,

$$\begin{aligned} c^{2} u_{x_{1}x_{1}} + 2cs u_{x_{1}x_{2}} + s^{2} u_{x_{2}x_{2}} &= c^{2} a^{2} u_{x_{2}x_{2}} + 2cs a u_{x_{2}x_{2}} + s^{2} u_{x_{2}x_{2}} \\ &= (ca + s)^{2} u_{x_{2}x_{2}} \\ &=0. \end{aligned}$$

This completes the proof that u is affine on L. The proof of Theorem 2.1 is now complete.

5 Proof of Theorem 2.3

Let the assumptions of Theorem 2.3 be satisfied. Thus, let u be a convex function of \((x_{1},x_{2})\) such that u is affine on a certain line segment in the \((x_{1},x_{2})\)-plane and let \(v=(k,r)\) be a non-zero vector giving the direction at each point of the line segment on which u is affine.

Let \((a,b)\) be a point on a line segment on which u is affine. Then this line segment consists of points of the form

$$(a+sk,b+sr), $$

where s is real and runs through an interval of real axis.

On this line segment, \(S(s)=u(a+sk,b+sr)\) is a linear function of s, which means that \(S'(s)\) is a constant. We have

$$S'(s)=u_{x_{1}} k+u_{x_{2}} r, $$

where \(u_{x_{1}}\), \(u_{x_{2}}\) are evaluated at \((a+sk,b+sr)\).

Therefore, \(S''(s)=0\) for all s in an interval of real axis, and we have

$$S''(s)=k(u_{x_{1}})'+r(u_{x_{2}})', $$

where \((u_{x_{1}})'=(d/ds)u_{x_{1}}(a+sk,b+sr)\) and similarly for \(u_{x_{2}}\).

We have

$$(u_{x_{1}})'=u_{x_{1}x_{1}}k+u_{x_{1}x_{2}}r $$

and

$$(u_{x_{2}})'=u_{x_{2}x_{1}}k+u_{x_{2}x_{2}}r, $$

where \(u_{x_{1}x_{1}}\), \(u_{x_{1}x_{2}}\), \(u_{x_{2}x_{2}}\) are evaluated at \((a+sk,b+sr)\).

Hence

$$S''(s) = k (u_{x_{1}x_{1}} k + u_{x_{1}x_{2}} r) + r (u_{x_{1}x_{2}} k + u_{x_{2}x_{2}} r), $$

so, since \(u_{x_{2}x_{1}}=u_{x_{1}x_{2}}\), we have

$$S''(s) = k^{2} u_{x_{1}x_{1}} + 2 k r u_{x_{1}x_{2}} + r^{2} u_{x_{2}x_{2}} . $$

We may write this in the Hessian matrix notation of u as follows.

Let

$$H=\left ( \begin{matrix} u_{x_{1}x_{1}} & u_{x_{1}x_{2}} \\ u_{x_{2}x_{1}} & u_{x_{2}x_{2}} \end{matrix} \right ) . $$

By writing vectors generally as column vectors, let

$$v= \left ( \begin{matrix} k \\ r \end{matrix} \right ) $$

be a column vector and denote its transpose by \(v^{T}\). Then we have

$$S''(s) = v^{T} H v =0 . $$

By the convexity of u, we have \(\det H\ge0\), \(u_{x_{1}x_{1}}\ge0\) and \(u_{x_{2}x_{2}}\ge0\). Hence, \(\operatorname {tr}H=u_{x_{1}x_{1}}+u_{x_{2}x_{2}}\ge0\).

Thus H is a symmetric positive semi-definite matrix, and it has real non-negative eigenvalues. If H is positive definite, that is, if \(\det H >0\), then it is well known that \(v^{T} H v >0\) for all \(v\neq 0\). Thus \(\det H=0\), which is equivalent to (4). The last statement of Theorem 2.3 now follows immediately. The proof of Theorem 2.3 is complete.

6 Conclusion

We find some structural characterizations of the maximal convex functions \(u:D\rightarrow\mathbb{R}\) defined in the closure of a bounded convex plane domain, such that \(u(x)\le\psi(x)\) for all \(x\in \partial{D}\), for a given real-valued function ψ on ∂D. This is a more general problem than the one-variable version of finding or characterizing the largest biconvex function \(\zeta:\overline {B}\times\overline{B}\rightarrow\mathbb{R}\) such that \(\zeta(x,y)\le |x+y|_{p}\) whenever \(|x|_{p}=|y|_{p}=1\), where is the closed unit ball in \({\mathbb{R}}^{2}\) in the \(\ell^{p}\)-metric, obtained when the variable y is fixed. We show that if such an extremal function is in \(C^{3}\), then it is affine on certain line segments that foliate the domain D. Thus one can further approach all such problems by specifically looking for such foliations. In 1986, Burkholder explicitly found the maximal function for \(p=2\) without the realization of this structural property of this type of extremal functions. This paper provides a greater understanding of the nature of the solutions to these extremal problems.

References

  1. Burkholder, DL: Boundary value problems and sharp inequalities for martingale transforms. Ann. Probab. 12(3), 647-702 (1984)

    Article  MathSciNet  MATH  Google Scholar 

  2. Burkholder, DL: Martingales and Fourier analysis in Banach spaces. In: Probability and Analysis (Varenna, 1985). Lecture Notes in Math., vol. 1206, pp. 61-108. Springer, Berlin (1986). doi:10.1007/BFb0076300

    Chapter  Google Scholar 

  3. Burkholder, DL: A geometrical characterization of Banach spaces in which martingale difference sequences are unconditional. Ann. Probab. 9(6), 997-1011 (1981)

    Article  MathSciNet  MATH  Google Scholar 

  4. Hörmander, L: Lectures on Nonlinear Hyperbolic Differential Equations. Mathématiques & Applications (Berlin) [Mathematics & Applications], vol. 26. Springer, Berlin (1997)

    MATH  Google Scholar 

  5. Maurey, B: Système de Haar. In: Séminaire Maurey-Schwartz 1974–1975: Espaces L p, Applications Radonifiantes et Géométrie des Espaces de Banach, Exp. Nos. I et II, pp. 1-26. Centre Math., École Polytech., Paris (1975)

    Google Scholar 

  6. Riesz, M: Sur les fonctions conjuguées. Math. Z. 27(1), 218-244 (1928). doi:10.1007/BF01171098

    Article  MathSciNet  MATH  Google Scholar 

  7. Schwartz, J: A remark on inequalities of Calderon-Zygmund type for vector-valued functions. Commun. Pure Appl. Math. 14, 785-799 (1961)

    Article  MathSciNet  MATH  Google Scholar 

  8. Benedek, A, Calderón, A-P, Panzone, R: Convolution operators on Banach space valued functions. Proc. Natl. Acad. Sci. USA 48, 356-365 (1962)

    Article  MathSciNet  MATH  Google Scholar 

  9. Bourgain, J: Some remarks on Banach spaces in which martingale difference sequences are unconditional. Ark. Mat. 21(2), 163-168 (1983). doi:10.1007/BF02384306

    Article  MathSciNet  MATH  Google Scholar 

  10. Burkholder, DL: A geometric condition that implies the existence of certain singular integrals of Banach-space-valued functions. In: Conference on Harmonic Analysis in Honor of Antoni Zygmund (Chicago, Ill., 1981), vols. I, II. Wadsworth Math. Ser., pp. 270-286. Wadsworth, Belmont (1983)

    Google Scholar 

Download references

Acknowledgements

This material is supported by Faculty of Science, Silpakorn University under the grant SRF-PRG-2558-05.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sineenuch Suwannaphichat.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

The two authors contributed equally to all aspects of the paper, and read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hinkkanen, A., Suwannaphichat, S. Extremal problems related to convexity. J Inequal Appl 2016, 315 (2016). https://doi.org/10.1186/s13660-016-1258-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13660-016-1258-y

Keywords