Extremal problems related to convexity
- Aimo Hinkkanen^{1} and
- Sineenuch Suwannaphichat^{2}Email authorView ORCID ID profile
https://doi.org/10.1186/s13660-016-1258-y
© The Author(s) 2016
Received: 4 October 2016
Accepted: 22 November 2016
Published: 1 December 2016
Abstract
We consider the extremal problem of maximizing functions u in the class of real-valued biconvex functions satisfying a boundary condition ψ on a product of the unit ball with itself, with the \(\ell^{p}\)-norm. In 1986, Burkholder explicitly found the maximal function for \(p=2\). In this paper, we find some characterizations of such extremal functions. We establish that sufficiently smooth solutions to the convex extremal problems with given boundary values are affine on line segments and the domain D is foliated by such segments.
Keywords
1 Introduction
In this paper, we consider properties of convex functions defined in a convex plane domain D that are maximal with respect to prescribed real boundary values. We show that if such an extremal function is in \(C^{3}\), then it is affine on certain line segments that foliate the domain D. This structural result represents a very preliminary step in a larger program of determining maximal ζ-convex functions in certain Banach spaces.
For precise definitions and background results we refer to Section 3. But let us note here that in Banach space theory one considers Banach spaces that are ζ-convex, that is, they support a biconvex function ζ satisfying (6). The question arises as to which Banach spaces are ζ-convex, and for them, what the largest biconvex function satisfying (6) is equal to. Banach spaces of the type \(\ell^{p}\) and \(L^{p}\) are known to be ζ-convex for \(1< p<\infty\). However, the extremal function ζ has been determined only for Hilbert spaces, by Burkholder in 1986. In 2008, Saksman drew the attention of the first author to the fact that the extremal function is not known in \(\ell^{p}\)- and \(L^{p}\)-spaces for \(p\neq 2\), even though it might be desirable to know it. This question provides the basic motivation for the present paper.
In order to get started with the study of such a question, we only consider the simplest non-trivial case, that of the Banach space \(\ell ^{p}\) in \({\mathbb{R}}^{2}\). Any results obtained in this case might then have generalizations to other cases. Let B̅ be the closed unit ball in \({\mathbb{R}}^{2}\) in the \(\ell^{p}\)-metric. We are looking for the largest biconvex function \(\zeta: \overline{B}\times\overline {B}\to{\mathbb{R}}\) such that \(\zeta(x,y)\leq|x+y|_{p}\) whenever \(|x|_{p}=|y|_{p}=1\). The Hilbert space case suggests that not much is lost if one fixes one variable, say y, and only uses initially the assumption that \(x\to\zeta(x,y)\) is a convex function of \(x\in \overline{B}\). In this spirit we consider the slightly more general problem of finding or characterizing the maximal function \(u:D\to {\mathbb{R}}\) defined in the closure of a bounded convex plane domain, such that \(u(x)\leq\psi(x)\) for all \(x\in\partial D\), for a given function ψ on ∂D. If the boundary function ψ is not sufficiently regular, it may of course happen that \(\psi_{0}(x)=u(x)\) for \(x\in\partial D\), for the extremal function u satisfies only \(\psi _{0}\leq\psi\) but not necessarily \(\psi_{0}=\psi\) on ∂D. However, the same extremal function u is then obtained for boundary values \(\psi_{0}\) and the structural results that we obtain allow us to express u in the interior of D in terms of \(\psi_{0}\) (which, in interesting applications, will often be the same as ψ) as soon as we know a certain foliation of D.
In applications it is likely that the extremal function is many times differentiable, perhaps even in \(C^{\infty}\), in the interior of D. Thus we see no significant loss of generality in assuming, in this paper, that u is in \(C^{3}\) and pursuing the implications of this assumption.
We first observe that the extremal function must be a solution of the Hopf differential equation. Indeed, we find that with \(x=(x_{1},x_{2})\in D\), the function \(A=u_{x_{1}x_{1}}/u_{x_{1}x_{2}}=u_{x_{1}x_{2}}/u_{x_{2}x_{2}}\) satisfies such an equation. The general form of such solutions is known, and they are given implicitly in terms of a parameter function Φ. We then ask what can be said about such functions A and, consequently, about the function u. We find that at each point x of D, the value of \(A(x)\) determines a line L through x and hence the line segment \(L_{1} = L\cap D\). We show that A is constant and that u is affine on each such segment \(L_{1}\). This shows that the domain D is foliated by such line segments: the segments are disjoint and their union is D. Hence on each segment \(L_{1}\), the function u, being affine, is determined by its values at the end points of \(L_{1}\), which are on ∂D, and these boundary values of u are the values of \(\psi_{0}\), hence often the same as the values of ψ.
Thus, if we knew what the foliation is explicitly, we could then compute the extremal function u. However, at this stage many foliations are still possible, and not all of them correspond to the extremal function. Further work shows that one can develop a differential equation for a parametrization of the foliation, and the solution of that equation will give the sought-for extremal function. The equation is complicated and we will leave the presentation of this work to another paper. When \(p=2\), we recover the extremal function u discovered by Burkholder. When \(p\neq 2\), the equation is complicated and it is not clear if explicit solutions can be found.
However, the structural results of this paper are of interest in their own right. In various papers of Burkholder, it has been pointed out that certain functions discovered there are affine on certain line segments, apparently without the realization that this is a general property of this type of extremal functions. Thus we gain a greater understanding of the nature of the solutions to these extremal problems. For example, in [1], p.687, (9.7), it is observed that a certain extremal biconcave function is affine on certain line segments. Similarly, Burkholder has noted that his extremal function (7) is affine on certain line segments. We now see that this is to be expected and that one can further approach all such problems by specifically looking for such foliations.
On the basis of preliminary studies, we feel that it is likely that our methods will work more generally in \({\mathbb{R}}^{n}\), for all \(n\geq2\), and it may be that similar results are valid in more general Banach spaces. This work will have to be the subject of a later paper.
It seems to be typical in applications that in fact, the convex domain D is divided by a line segment into two parts, in each of which the function A described above is well defined. Thus there is a foliation in each of the two parts of D and the separating segment does not belong to either foliation (or, in an extended sense, belongs to both). This is connected to certain second order partial derivatives of u being zero on the separating segment. The assumptions of our theorems exclude this case, but the phenomena described in the theorems of this paper are then valid in each of the two parts of D, each of them being a convex domain in its own right. When considering each part as a convex domain, the boundary values are not known on that part of the boundary corresponding to the separating segment, but this does not matter as that segment will not be involved in the foliation of the interior of either part of D.
2 Results
To study the extremal problems further in general Banach spaces, we may investigate the characterizations of such extremal functions. We consider the problem of finding the extremal function in the class of real-valued biconvex functions u satisfying a boundary condition ψ on a domain \(D\times D\) where D is a convex domain in \(\mathbb {R}^{2}\). In particular, we restrict the domain D to the unit ball in the \(\ell^{p} \)-norm or the unit disk when \(p=2\). We establish that sufficiently smooth solutions to the convex extremal problems with given boundary values are affine on line segments and the domain D is foliated by such segments.
In \(\ell^{p}(\mathbb{R}^{2})\), we denote the \(\ell^{p}\)-norm by \(\vert x\vert _{p}\) where \(x=(x_{1},x_{2})\) in \(\mathbb{R}^{2}\). Thus \(\vert x\vert _{p}^{p}=\vert x_{1}\vert ^{p}+\vert x_{2}\vert ^{p}\). We define the open and closed unit balls in \(\mathbb{R}^{2}\) for the \(\ell^{p}\)-metric by \(B= \{x\in\mathbb{R}^{2}:\vert x\vert _{p}<1 \}\) and \(\overline{B}= \{x\in\mathbb{R}^{2}:\vert x\vert _{p}\leq1 \}\).
From this perspective and using these notations, we obtain the following theorems.
Theorem 2.1
Let \(A=A(x_{1},x_{2})\) be a continuous real-valued function defined for \((x_{1},x_{2})\in D\), where D is the closure of a bounded convex plane domain. If Φ is a continuous real-valued function on an interval of the real axis that contains the set \(A(D)\), and if \(\Phi (A(x_{1},x_{2}))=x_{2}+x_{1}A(x_{1},x_{2})\) in D, then \(A(x_{1},x_{2})\) is constant on certain line segments that are maximal in the sense that each segment is the intersection of D with a straight line. The union of such line segments is D. Moreover, if the real-valued function u is in \(C^{2}\) in D and satisfies (2) there (which, in particular, means that \(u_{x_{2}x_{2}}\) and \(u_{x_{1}x_{2}}\) do not vanish in D), then u is affine on each such segment.
Remark 2.2
When \(D=\overline{B}\), Theorem 2.1 gives a foliation of the entire domain B̅.
Theorem 2.3
If we now assume that \(u_{x_{1}x_{2}}\ne0\) and \(u_{x_{2}x_{2}}\ne0\) in D, it follows that we may define the function A as in (2), and then A is given by (3) for a suitable function Φ. Now from Theorem 2.1 we see that u is affine on line segments that foliate D. Thus we obtain the following result.
Theorem 2.4
Let D be the closure of a bounded convex plane domain, let ψ be a continuous real-valued function on ∂D, and let u be the maximal real-valued convex function on D such that \(u\leq\psi\) on ∂D. Suppose that \(u\in C^{3}\). Then (4) holds at each point of the interior of D. Furthermore, if \(u_{x_{1}x_{2}}\ne0\) and \(u_{x_{2}x_{2}}\ne0\) in D, then we may define the function A as in (2), and then A is given by (3) for a suitable function Φ. Finally, u is affine on line segments that foliate D.
We conclude with the remark that the property of u being affine on line segments is equivalent to (4) in a suitable sense.
Theorem 2.5
Namely, if u is affine on line segments as stated, then it follows from Theorem 2.3 that \(u_{x_{1}x_{1}}/u_{x_{1}x_{2}}=u_{x_{1}x_{2}}/u_{x_{2}x_{2}}\) on these segments if the denominators are assumed to be non-zero. Conversely, if the denominators are non-zero and \(u_{x_{1}x_{1}}/u_{x_{1}x_{2}}=u_{x_{1}x_{2}}/u_{x_{2}x_{2}}\), then we may define A as in (2), and it follows as explained above that u is affine on line segments that foliate D. This proves Theorem 2.5.
3 Some geometric characterizations of Banach spaces
To give some background, in order to motivate why we should study at all questions such as those addressed in the theorems of the previous section, we review some literature and results that give rise to questions on biconvex functions. Recall that a function \(u:D\to {\mathbb{R}}\), defined on a convex subset D of a Banach space, is said to be convex if \(u(tx+(1-t)y)\leq tu(x)+(1-t)u(y)\) whenever \(x,y\in D\) and \(0< t<1\) (note that then \(tx+(1-t)y\in D\)). A function \(u:D\times D\to{\mathbb{R}}\) is said to be biconvex if for each \(x\in D\), the function \(y\to u(x,y)\) is a convex function of \(y\in D\), and if for each \(y\in D\), the function \(x\to u(x,y)\) is a convex function of \(x\in D\). Taken together, the following results should motivate the quest for the best possible biconvex function in the definition of ζ-convexity, for \(L^{p}\)-spaces.
3.1 ζ-Convexity
Clearly the constant function \(\zeta\equiv0\) is biconvex and satisfies (6), so that it is the condition \(\zeta(0,0)>0\) that makes the requirements non-trivial. Burkholder has shown that \(\zeta (0,0)\le1\) and E is a (real or complex) Hilbert space if, and only if, it is possible to have \(\zeta(0,0)=1\). Thus for ζ-convex non-Hilbert Banach spaces we have \(0<\zeta(0,0)<1\).
It is shown in Lemma 3.1 of [2] that, in order to find the greatest biconvex function ζ satisfying (6), it is enough to have the function ζ defined and biconvex on the product of the closed unit ball of E with itself rather than on the whole space \(E\times E\).
Let B̅ be the closed unit ball in a real Hilbert space H with norm \(\Vert x\Vert \) and let \({\mathcal{F}}\) be the class of biconvex functions u on \(\overline{B}\times\overline{B}\) satisfying (6). Then we have the following theorem by Burkholder.
Theorem 3.1
Burkholder [2]
Note that looking for a maximal ζ makes sense only when ζ is restricted to \(\overline{B}\times\overline{B}\), since outside this set one could always make ζ larger without violating any requirements.
3.2 UMD-unconditional for martingale differences
Let Ω be a probability space with a σ-algebra \({\mathcal{A}}\) of measurable sets for a measure P. A discrete-time E-valued martingale g on Ω is a sequence of E-valued functions \(g_{n}\) in \(L^{1}(\Omega)\) such that \(g_{n}\) is measurable with respect to a σ-algebra \({\mathcal{A}}_{n}\), where \({\mathcal{A}}_{n}\subset {\mathcal{A}}_{n+1}\subset{\mathcal{A}}\), such that, for each \(A\in {\mathcal{A}}_{n}\), we have \(\int_{A}(g_{n+1}-g_{n})\, dP=0\).
The following theorem shows how the ζ-convexity property characterizes UMD-spaces.
3.3 HT-space
This limit exists for almost all \(x \in\mathbb{R}\), see M Riesz [6], 1928.
The Banach space E is said to be an HT-space if \(\alpha_{p}(E)\) is finite for some \(p\in(1,\infty)\); equivalently, for all \(p\in (1,\infty)\), see Schwartz [7], 1961 and Benedek, Calderón, and Panzone [8], 1962.
The following theorem shows the relation between HT-spaces and UMD-spaces.
Theorem 3.3
Bourgain [9], Burkholder [10], McConnell [10]
A Banach space E is an HT-space if, and only if, it is UMD.
By Theorem 3.2 and Theorem 3.3, we obtain the three equivalent conditions giving geometric characterizations of Banach spaces.
4 Proof of Theorem 2.1
Let the assumptions of Theorem 2.1 be satisfied. We separate the proof of Theorem 2.1 into two parts. First, we will geometrically prove that \(A(x_{1},x_{2})\) is constant on certain line segments that are maximal and then we use this fact to show that u is affine on each such segment.
4.1 The function \(A(x_{1},x_{2})\) is constant on certain line segments
Let \(O=c_{1}+ic_{2}\) be any point of D. Set \(a=A(O)\). This determines the line \(L_{1}\) containing all points \((x_{1},x_{2})=x_{1}+ix_{2}\) such that \(\Phi(a) = x_{2} + a x_{1}\), and the point O must be on this line, that is, by assumption, we have \(\Phi(a) = c_{2} + a c_{1}\).
Note that any point \((x_{1},x_{2})\) at which A takes the value a must be on line \(L_{1}\). Therefore, for each value of A, there is a corresponding line segment (the intersection of D and \(L_{1}\)), and the value a cannot be taken anywhere outside that line segment. At this stage there is no guarantee that A cannot take also values other than a on \(L_{1}\) since points other than O on \(L_{1}\) might lie also on relevant lines other than \(L_{1}\).
We seek to prove that A is constant on the intersection of D and \(L_{1}\). To get a contradiction, suppose that A is not constant on \(D\cap L_{1}\). Then there is a point \(P\in D\cap L_{1}\) such that \(A(P)\neq a\). We may assume that \(A(P) > a\), since the argument that follows would be similar in the case \(A(P) < a\).
Write \(P=d_{1}+id_{2}\) and \(A(P)=a'\). Let \(L_{2}\) denote the line of all points \((x_{1},x_{2})\) such that \(\Phi(a') = x_{2} + a' x_{1}\). The point P must be on this line, that is, \(\Phi(a') = d_{2} + a' d_{1}\). Since P also lies on \(L_{1}\), we further have \(\Phi(a) = d_{2} + a d_{1}\).
Since \(a' \neq a\), the lines \(L_{1}\) and \(L_{2}\) have different slopes and hence intersect at exactly one point, and therefore this point of intersection must be P. Note that the point O is not on \(L_{2}\).
The line \(L_{2}\) intersects the boundary of D at exactly two points, say \(P_{1}=a_{1}+ib_{1}\) and \(P_{2}=a_{2}+ib_{2}\). Let \(L_{3}\) and \(L_{4}\) denote the closed line segments from O to \(P_{1}\) and from O to \(P_{2}\), respectively.
Since A cannot take the value a at any point of \(L_{2}\) including P, by continuity, all values of A on \(L_{2}\) must be greater than a since \(A(P)>a\).
Let \(b=\min\{A(P_{1}),A(P_{2})\}\). Since \(A(P_{1})>a\) and \(A(P_{2})>a\), we have \(b>a\). Since A is real-valued and continuous, A must take all values belonging to the closed interval \([a,A(P_{1})]\) on the segment \(L_{3}\). Also A takes all values in the closed interval \([a,A(P_{2})]\) on the segment \(L_{4}\). Thus on each of \(L_{3}\) and \(L_{4}\), A takes all values on \([a,b]\).
For each value \(g\in(a,b)\), there is a point \(P'\) on \(L_{3}\) and a point \(P''\) on \(L_{4}\) such that \(A(P')=A(P'')=g\). Then both \(P'\) and \(P''\) must lie on the line of points \((x_{1},x_{2})\) such that \(\Phi(g) = x_{2} + g x_{1}\). Let this line be denoted by \(L_{g}\). Then \(L_{g}\) can intersect each of \(L_{3}\) and \(L_{4}\) only once, and all points where A takes the value g must lie on \(L_{g}\).
Concerning values \(\alpha< a\) that A might hypothetically take on \(L_{3}\), by continuity each such value α would have to be taken at least twice on \(L_{3}\). On the other hand, points where the value α are taken must lie on the line \(L_{\alpha}\), so that since two distinct points of \(L_{3}\) lie on \(L_{\alpha}\), it follows that the line \(L_{\alpha }\) contains the segment \(L_{3}\). This is a contradiction, since the line containing \(L_{3}\) must be equal to \(L_{\beta}\), where \(\beta= A(P_{1})>a>\alpha\). Thus A takes no values <a on \(L_{3}\). Similarly, A takes no values <a on \(L_{4}\). This argument implies that A is one-to-one on each of \(L_{3}\) and \(L_{4}\). This also means that \(P'\) and \(P''\) are unique, for each \(g\in(a,b)\). Thus A is strictly increasing on each of \(L_{3}\) and \(L_{4}\) when we move from O to points where A takes the value b.
Consider now values \(g>a\) that are arbitrarily close to a. Then \(P'\) and \(P''\) must also be arbitrarily close to O, by the continuity of A and the fact that A is one-to-one on these segments. Moreover, when g is close to a, also the line \(L_{g}\) must be close to \(L_{1}\), and in particular the slope of \(L_{g}\), that is, the slope of the line joining \(P'\) and \(P''\), must be close to the slope of \(L_{1}\).
This is the desired contradiction, and it follows that A had to be constant on the intersection of D and the line \(L_{1}\). This then shows that the lines \(\Phi(a)=x_{2}+ax_{1}\) for various real numbers a must foliate D, and on each of these lines, the function A takes the constant value a. Since a function cannot take two values at the same point, the intersections of these lines with D must be disjoint.
4.2 The function u is affine on line segments
Consider a point \((x_{1},x_{2})\) at which A takes a certain value a. Set \(b=\Phi(a)\). From Section 4.1, we see that A is equal to the constant a on the line \(b=x_{2}+ax_{1}\), denoted by L. Then L has a unit tangent vector \((c,s)\) where \(c=\cos\theta\) and \(s=\sin\theta\) for a certain θ, and then \(0=s+ac\).
5 Proof of Theorem 2.3
Let the assumptions of Theorem 2.3 be satisfied. Thus, let u be a convex function of \((x_{1},x_{2})\) such that u is affine on a certain line segment in the \((x_{1},x_{2})\)-plane and let \(v=(k,r)\) be a non-zero vector giving the direction at each point of the line segment on which u is affine.
Thus H is a symmetric positive semi-definite matrix, and it has real non-negative eigenvalues. If H is positive definite, that is, if \(\det H >0\), then it is well known that \(v^{T} H v >0\) for all \(v\neq 0\). Thus \(\det H=0\), which is equivalent to (4). The last statement of Theorem 2.3 now follows immediately. The proof of Theorem 2.3 is complete.
6 Conclusion
We find some structural characterizations of the maximal convex functions \(u:D\rightarrow\mathbb{R}\) defined in the closure of a bounded convex plane domain, such that \(u(x)\le\psi(x)\) for all \(x\in \partial{D}\), for a given real-valued function ψ on ∂D. This is a more general problem than the one-variable version of finding or characterizing the largest biconvex function \(\zeta:\overline {B}\times\overline{B}\rightarrow\mathbb{R}\) such that \(\zeta(x,y)\le |x+y|_{p}\) whenever \(|x|_{p}=|y|_{p}=1\), where B̅ is the closed unit ball in \({\mathbb{R}}^{2}\) in the \(\ell^{p}\)-metric, obtained when the variable y is fixed. We show that if such an extremal function is in \(C^{3}\), then it is affine on certain line segments that foliate the domain D. Thus one can further approach all such problems by specifically looking for such foliations. In 1986, Burkholder explicitly found the maximal function for \(p=2\) without the realization of this structural property of this type of extremal functions. This paper provides a greater understanding of the nature of the solutions to these extremal problems.
Declarations
Acknowledgements
This material is supported by Faculty of Science, Silpakorn University under the grant SRF-PRG-2558-05.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Authors’ Affiliations
References
- Burkholder, DL: Boundary value problems and sharp inequalities for martingale transforms. Ann. Probab. 12(3), 647-702 (1984) MathSciNetView ArticleMATHGoogle Scholar
- Burkholder, DL: Martingales and Fourier analysis in Banach spaces. In: Probability and Analysis (Varenna, 1985). Lecture Notes in Math., vol. 1206, pp. 61-108. Springer, Berlin (1986). doi:10.1007/BFb0076300 View ArticleGoogle Scholar
- Burkholder, DL: A geometrical characterization of Banach spaces in which martingale difference sequences are unconditional. Ann. Probab. 9(6), 997-1011 (1981) MathSciNetView ArticleMATHGoogle Scholar
- Hörmander, L: Lectures on Nonlinear Hyperbolic Differential Equations. Mathématiques & Applications (Berlin) [Mathematics & Applications], vol. 26. Springer, Berlin (1997) MATHGoogle Scholar
- Maurey, B: Système de Haar. In: Séminaire Maurey-Schwartz 1974–1975: Espaces L ^{ p }, Applications Radonifiantes et Géométrie des Espaces de Banach, Exp. Nos. I et II, pp. 1-26. Centre Math., École Polytech., Paris (1975) Google Scholar
- Riesz, M: Sur les fonctions conjuguées. Math. Z. 27(1), 218-244 (1928). doi:10.1007/BF01171098 MathSciNetView ArticleMATHGoogle Scholar
- Schwartz, J: A remark on inequalities of Calderon-Zygmund type for vector-valued functions. Commun. Pure Appl. Math. 14, 785-799 (1961) MathSciNetView ArticleMATHGoogle Scholar
- Benedek, A, Calderón, A-P, Panzone, R: Convolution operators on Banach space valued functions. Proc. Natl. Acad. Sci. USA 48, 356-365 (1962) MathSciNetView ArticleMATHGoogle Scholar
- Bourgain, J: Some remarks on Banach spaces in which martingale difference sequences are unconditional. Ark. Mat. 21(2), 163-168 (1983). doi:10.1007/BF02384306 MathSciNetView ArticleMATHGoogle Scholar
- Burkholder, DL: A geometric condition that implies the existence of certain singular integrals of Banach-space-valued functions. In: Conference on Harmonic Analysis in Honor of Antoni Zygmund (Chicago, Ill., 1981), vols. I, II. Wadsworth Math. Ser., pp. 270-286. Wadsworth, Belmont (1983) Google Scholar