On Malfatti’s Marble Problem

Uuganbaatar Ninjbat The National University of Mongolia,
Ulaanbaatar Mongolia
uugnaa.ninjbat@gmail.com

Abstract

Consider the problem of finding three non-overlapping circles in a given triangle with the maximum total area. This is Malfatti’s marble problem, and it is known that the greedy arrangement is the solution. In this paper, we provide a simpler proof of this result by synthesizing earlier insights with more recent developments. We also discuss some related geometric extremum problems, and show that the greedy arrangement solves the problem of finding two non-overlapping circles in a tangential polygon with the maximum total radii and/or area. In the light of this discussion, we formulate a natural extension of Melissen’s conjecture.

Keywords

1 Introduction

In mathematics the art of proposing
a question must be
held higher than solving it.

Georg Cantor

Let $\triangle ABC$ be a given triangle in a plane, and let $n\in\mathbb{N}$ be a given number. Consider the following problem.

Problem 1.

Find $n$ non-overlapping circles inside of $\triangle ABC$ so that the sum of their areas is maximal.

When $n=3$, following a paper by Malfatti published in 1803, this problem is known as Malfatti’s marble problem; according to [Szabó et al.2007], this is one of the first examples of a packing problem appeared in European mathematics. Initially, G. Malfatti and others assumed that the solution would be three circles that are tangent to each other and each circle is tangent to two sides of $\triangle ABC$; these circles later became known as the Malfatti circles. However, [Lob and Richmond1930] discovered a case in which the Malfatti circles were not the solution to Malfatti’s marble problem, and [Goldberg1967] showed that they are never the optimal solution. Following this, [Zalgaller and Los1994] gave a complete solution to this problem by showing that the greedy arrangement is the optimal solution.

The greedy arrangement of $n$ circles in $\triangle ABC$ is the result of the $n$-step process where at each step one chooses the largest circle which does not overlap the previously selected circles and is contained by $\triangle ABC$. It is evident that, for $n=1$, the greedy arrangement solves Problem 1. It can be shown that the same is true for $n=2$ [see Theorem 1 in [Andreatta et al.2011], and also Sect. 2.3 in [Andreescu et al.2006]]. As mentioned above, [Zalgaller and Los1994] showed that one can extend this line of reasoning to the case of $n=3$. However, their proof is lengthy with the extensive usage of trigonometric methods.¹¹[Andreescu et al.2006] give a simple proof of [Zalgaller and Los1994]’s result for the case of an equilateral $\triangle ABC$.

In Sect. 2, we provide a simpler proof of the [Zalgaller and Los1994] result by synthesizing their insights with these in [Andreatta et al.2011]. The former paper shows that, for $n=3$, there are fourteen possible arrangements to be considered, and it eliminates each non-greedy arrangement as being non-optimal; the latter paper shows that not only there is an elegant and simple proof of the result for $n=2$, but also the same result holds for other regions including concave triangles. We connect the more difficult case of $n=3$ to the simple case of $n=2$, which, in turn, allows us to focus on seven groups of arrangements instead of fourteen cases, where each group is analyzed in a unified fashion. In addition to clarifying the proof, this approach also substantially reduces trigonometric calculations.

In Sect. 3, we discuss some other related geometric extremum problems and suggest a natural extension of Melissen’s conjecture (see Conjecture 3). We also show that the greedy arrangement solves the problem of finding two non-overlapping circles inscribed into a tangential convex polygon with the maximum total radii and/or area (see Theorem 5). This result generalizes some of the earlier results such as Theorem 1 in [Andreatta et al.2011].

2 The Proof

3 Some Related Extremum Problems

To prove and conjecture!

Paul Erdős

For any vector ${\mathbf{x}}\in\mathbb{R}^{n}$, let ${\mathbf{x}^{\prime}}\in\mathbb{R}^{n}$ be the vector obtained from ${\mathbf{x}}$ by reordering its components in a descending order. We say that ${\mathbf{x}}\in\mathbb{R}^{n}$ weakly majorizes ${\mathbf{y}}\in\mathbb{R}^{n}$, denoted as ${\mathbf{x}}\succeq{\mathbf{y}}$, if $\sum_{i=1}^{k}x^{\prime}_{i}\geq\sum_{i=1}^{k}y^{\prime}_{i}$ for all $k\in\{1,2,...,n\}$. Consider the following problem.

Problem 2.

Find an arrangement of $m\in\mathbb{N}$ circles in $\triangle ABC$ so that the sum of their radii is maximal.

The following result gives a direct connection between Problems 1 and 2.

Theorem 4.

Let $\triangle ABC$ be a given triangle in a plane, and let $n\in\mathbb{N}$ be a given number. If the greedy arrangement solves Problem 2 for all $1\leq m\leq n$, then it solves Problem 1.

Proof.

Lemma 4.

(Hardy-Littlewood-Pólya type inequality) Let ${\mathbf{x}},{\mathbf{y}}\in\mathbb{R}^{n}_{+}$ be such that ${\mathbf{x}}\succeq{\mathbf{y}}$. If $f:\mathbb{R}_{+}\rightarrow\mathbb{R}$ is an increasing and convex function, then $\sum_{i=1}^{n}f(x_{i})\geq\sum_{i=1}^{n}f(y_{i})$.

For a proof, see p. 92 in [Marshall et al.2011]. Let ${\mathbf{x}}=(r^{\star}_{1},...,r^{\star}_{n})\in\mathbb{R}^{n}_{+}$ be the vector of radii of $n$ circles arranged according to the greedy arrangement. Notice that, by definition, $r^{\star}_{1}>r^{\star}_{2}\geq...\geq r^{\star}_{n}$. Let ${\mathbf{y}}=(r_{1},...,r_{n})\in\mathbb{R}^{n}_{+}$ be the vector of radii of $n$ circles arranged arbitrarily. If there is any arrangement of $n$ circles in $\triangle ABC$, then any $k\leq n$ of them constitute an arrangement of $k$ circles in the same triangle. Then the condition that the greedy arrangement solves Problem 2 for all $1\leq m\leq n$ implies that ${\mathbf{x}}\succeq{\mathbf{y}}$. Since $f(r)=r^{2}$ is convex and increasing on $[0,\infty)$, by Lemma 4, we conclude that $\sum_{i=1}^{n}{r^{\star}_{i}}^{2}\geq\sum_{i=1}^{n}{r_{i}}^{2}$.$\square$

Notice that the objective function in Problem 2 is linear. Moreover, when $m\leq 2$, the solution of Problem 1 in [Andreatta et al.2011] directly applies to Problem 2. For $m=3$, the above solution of Problem 1 in Sect. 2 can be adapted to Problem 2 without much alteration if one makes the following observation: “the quadrilateral inequality that our proof is based on is strict when we restrict our attention to a triangular region, which, in turn, implies that, in a rigid arrangement, the sum of radii function is strictly convex.”

The analysis of all fourteen rigid arrangements in Problem 2, except arrangements 6 and 9 in Fig. 8, is the same as above. Only arrangements 6 and 9 need somewhat different approach. We should also note here that the idea of using majorization technique to connect optimization problems is rather classic, as stated in [Dahl and Margot1998]: “A general and important technique for finding inequalities in various fields is to discover some underlying majorization combined with a suitable Schur convex function.”

It is reported in [Andreatta et al.2011] that Melissen made the following conjecture in 1997.

Conjecture 1.

(Melissen) For all $n\in\mathbb{N}$, the greedy arrangement solves Problem 1.

The discussion above suggests that Problems 1 and 2 are likely to have the same solution. Probably, to solve Problem 2 is not much more difficult, if not easier, than to solve Problem 1, and Problem 2 also has broader implications. Therefore, it is natural to direct our attention to Problem 2, and update Conjecture 1 as

Conjecture 2.

For all $m\in\mathbb{N}$, the greedy arrangement solves Problem 2.

In the context of generalizing the Chebyshev center problem, [Enkhbat and Barsbold2013] studied the problem of inscribing two non-overlapping balls of the maximal total radii into a polytope. They formulated it as a bilevel programming problem, proposed a gradient based method, and demonstrated it by solving some test problems. Below, we show that there is a simple, elegant, and complete solution to this problem if we consider a certain class of polygons. From now on, we consider only convex polygons, and, as usual, a polygon is tangential if there is an inscribed circle that touches each of its sides, and two vertices of a tangential polygon are diagonally opposite if they are collinear with the incenter. Let us prove two useful lemmas.

Lemma 5.

Let $\omega$ be a circle, and $X$, $Y$ be two points disjoint from the region enclosed by $\omega$. Then any circle which passes through $X$ and $Y$ has an arc connecting these two points and disjoint from the region enclosed by $\omega$.

Proof.

An $XY$-circle is a circle that passes through the points $X$ and $Y$. An $XY$-line, $XY$-segment, and $XY$-arc are defined analogously. The plane is divided into two halves when we draw the $XY$-line. One of these halves we call the left half-plane, and the other one we call the right half-plane. It is well known (and can be easily proven) that locus of the centers of the $XY$-circles is the line perpendicular to the $XY$-segment, which divides each of the circles into two equal parts. We call this line the center line (see Fig. 12).

Figure 12: Locus of the centers of $XY$-circles

There are two cases: either $\omega$ intersects the $XY$-line, or it does not. If $\omega$ does not intersect the $XY$-line, we may assume, without loss of generality, that $\omega$ is located entirely in the left half-plane. Then, since every $XY$-circle has an $XY$-arc located in the right half-plane, this arc is disjoint from $\omega$ and its interior (see Fig. 13). Notice that this argument also applies if $\omega$ is tangent to the $XY$-line.

Figure 13: $\omega$ does not intersect the $XY$-line

If $\omega$ intersects the $XY$-line, there are again two possibilities: either $\omega$ intersects the $XY$-segment, or it does not. Assume that $\omega$ intersects the $XY$-segment. Then, there are exactly two $XY$-circles which are internally tangent to $\omega$. The existence of these $XY$-circles is assured by solving celebrated Apollonius problem for the triple $X$, $Y$, and $\omega$. Let their centers be $O_{1}$ and $O_{2}$ (see Fig. 14).

Figure 14: $\omega$ intersects the $XY$-segment

For any $XY$-circle whose center is located to the left of $O_{1}$ (or $O_{2}$), its $XY$-arc belonging to the left half-plane is disjoint from $\omega$ and its interior, since such a circle can be obtained as a continuous image of transforming $O_{1}$ (or $O_{2}$) to the left along the center line. The same argument applies to show that for any $XY$-circle whose center is located to the right of $O_{1}$ (or $O_{2}$), its $XY$-arc belonging to the right half-plane is disjoint from $\omega$ and its interior.

Assume now that $\omega$ intersects the $XY$-line, but does not intersect the $XY$-segment. Without loss of generality, we may assume also that the center of $\omega$ is located in the left half-plane. Again, by solving Apollonius problem, we can find two $XY$-circles which are externally tangent to $\omega$. Let their centers be $O_{1}$ and $O_{2}$ (see Fig. 15).

Figure 15: $\omega$ intersects the $XY$-line, but does not intersect the $XY$-segment

Then:

•

If an $XY$-circle has a center located to the left of $O_{1}$, then its $XY$-arc lying in the right half-plane is disjoint from $\omega$ and its interior;
•

If an $XY$-circle has a center located to the right of $O_{2}$, then its $XY$-arc lying in the left half-plane is disjoint from $\omega$ and its interior; and
•

If an $XY$-circle has a center located between $O_{1}$ and $O_{2}$, then it is entirely disjoint from $\omega$ and its interior.

Thus, in all cases, for any $XY$-circle, there is an $XY$-arc which is disjoint from $\omega$ and its interior. This proves Lemma 5. $\square$

Lemma 6.

Let $k\geq 3$, and let us consider a tangential $k$-gon and a circle inscribed into it. Then the circle touches two nonadjacent sides of the polygon if and only if it is the incircle.

Proof.

Consider the following problem.

Problem 3.

Let $k\geq 3$. Find an arrangement of two circles in a tangential $k$-gon such that the sum of their radii is maximal.

We already know that the greedy arrangement solves Problem 3 for $k=3$. If the polygon is a square, it solves also the closely related problem of maximizing the sum of the areas of two circles, as shown in Problem 2.3.1 in [Andreescu et al.2006]. Our next result is as follows.

Theorem 5.

Let $k\geq 3$. Then the greedy arrangement solves Problem 3.

Proof.

By arguments similar to those in the proof of Lemma 1A, we can focus only on rigid arrangements. We claim that, in any such arrangement, there exist two circles that are mutually tangent, and each of them touches two adjacent sides of the polygon. To see this, suppose that these circles are not externally tangent. Then one can enlarge one of them by moving its center toward the incenter of the polygon, while keeping the other circle fixed (see Fig. 17a). This contradicts rigidity. Thus, we may assume that some two circles are mutually tangent.

Figure 17: Violations of rigidity in a tangential polygon. Arrows indicate the directions of enlargement

Now suppose, by contradiction, that one of these circles (centered at $O_{1}$) does not touch any side of the polygon, and let $l$ be the inner tangent of the circles. As a consequence of the celebrated supporting hyperplane theorem [(see Chap. 2.5.2 in [Boyd and Vandenberghe2004]], $l$ divides the polygon into two small polygons, in one of which the circle centered at $O_{1}$ is inscribed in such a way that it touches only the side lying on $l$. Then the circle centered at $O_{1}$ can clearly be enlarged by moving its center along the direction orthogonal to $l$ until it touches another side of the polygon (see Fig. 17b). Since the other circle remains fixed throughout this enlargement, it contradicts rigidity. Thus, we may assume that each of the two mutually tangent circles is tangent to at least one side of the polygon.

Suppose, again by contradiction, that one of the circles (centered at $O_{1}$) is tangent to one side of the polygon ($AB$), but not to any of the two sides adjacent to this side. Draw the line $l$ described above. There are two possibilities: either $AB$ is not parallel to $l$, or it is. In the first case, one can enlarge the circle by moving its center along the bisector of the angle obtained by the intersection of $l$ with the line through $AB$ (see Fig. 18a). Such an enlargement is feasible as long as the circle does not touch any other side of the polygon, and it follows from Lemma 6 that this condition is indeed satisfied. But this enlargement does not affect the other circle; hence, it contradicts rigidity.

Figure 18: More violations of rigidity in a tangential polygon. Arrows indicate the directions of enlargement

Now, let $AB$ be parallel to $l$. Then one can displace the circle centered at $O_{1}$ by moving its center in a direction parallel to $l$; the other circle remains unaffected by this displacement (see Fig. 18b). Again, Lemma 6 ensures that such displacement is feasible. After this, we obtain two disjoint circles, one of which is the same as one of the original two circles, while the other one is obtained from the other of the original two circles by a parallel translation. But as we already showed, if we have two disjoint circles, we can always enlarge them, which contradicts rigidity. This proves our claim.

Consider any rigid arrangement, and let $V$ and $F$ be the two vertices of the polygon such that each of them is the common end point of a pair of adjacent sides corresponding to this arrangement. There are two cases: either $V$ and $F$ are diagonally opposite, or they are not. In the first case, the sum of radii of the two circles is a linear function. To see this, observe that if $V$ and $F$ are diagonally opposite, their bisectors coincide, which implies that the points $O_{1},O_{1}^{\prime},O_{2},O_{2}^{\prime}$ are collinear (see Fig. 19a).

Figure 19: Pair of rigid arrangements in a tangential polygon

Then, the quadrilateral inequality is an equality, which implies that

$R\left(\frac{r_{1}+r_{2}}{2}\right)=\frac{R(r_{1})+R(r_{2})}{2}.$

(1)

Equation (1) is called Jensen’s equality; it is known that any continuous function $R:[a,b]\rightarrow\mathbb{R}$ satisfying (1) is linear [(see p.43 in [Aczél1966]]. Since the sum of two linear functions is linear, this implies that our objective function $r_{1}+R(r_{1})$ is linear. Then, either it is a constant function, or it is not. In the first case, every point in its domain (which is a closed interval) is optimal; while in the second case, it attains its maximum at the end points of the domain. Thus, in either case, the greedy arrangement is optimal.

Figure 20: Cases in which the greedy arrangement is not optimal. In (a), the sum of the radii for the greedy arrangement is roughly the radius of the incircle which is equal to $|OV|$. The construction in (b) is inspired by Melissen’s pentagon

If $V$ and $F$ are not diagonally opposite, consider two circles whose centers lie on the bisectors of $\angle V^{\prime}VV^{\prime\prime}$ and $\angle F^{\prime}FF^{\prime\prime}$ (see Fig. 19b). Then one can repeat the argument in the proof of Lemma 3 to show that the function describing the sum of the radii of the two circles is strictly convex, which implies that any arrangement that does not contain the incircle is subject to a local improvement.³³It suffices to observe that if $V$ and $F$ are not diagonally opposite, the quadrilateral inequality on which our proof is based is strict. Thus, the sum function is strictly convex. Thus, we may conclude that an optimal arrangement must contain the incircle. Then it must be the greedy arrangement. This proves Theorem 5. $\square$

Let us add few remarks on Theorem 5. First, in the light of Theorem 4, it should be clear that the greedy arrangement solves the problem of inscribing two circles into a tangential polygon with the maximum total area. However, as mentioned above, the objective function for the problem of the sum of the radii can be constant over rigid arrangements centered on the main diagonal (indeed, this is the case when we consider regular $2k$-gons). This implies that for this problem there can be optimal arrangements other than the greedy arrangement. But this is not the case for the problem of the maximization of the sum of the areas as it has a strictly convex objective function. This is one important aspect where these two problems differ.

Second, one might attempt to generalize Theorem 5 for more than two circles. However, the example in Fig. 20a gives an arrangement of three circles in a regular 12-gon, which resembles an Apollonian gasket, which has a larger sum of the radii than the greedy arrangement.⁴⁴This construction is generic as it works for any $2k$-gon and, probably, for any $n>2$ circles. One might also look for a result analogous to Theorem 5 for cyclic polygons. But, again, there is a counterexample to such a claim (see Fig. 20b).

Finally, since a triangle is a tangential polygon, based on the above analysis, we suggest the following generalization of Conjecture 1.

Conjecture 3.

For all $n\in\mathbb{N}$ and $k\geq 3$, the greedy arrangement solves the problem of finding an arrangement of $n$ circles in a tangential $k$-gon with the maximal total area.

Notice that if we fix the radius of the incircle and let $k\rightarrow\infty$, we may think of the tangential polygon as a circle. Then, for any $n\in\mathbb{N}$, it is clear that the greedy arrangement is the only optimal solution for the problem of inscribing $n$ circles with the maximal total area into the limiting circle. This observation adds a credibility to Conjecture 3.

Acknowledgements

References

[Aczél1966] Aczél, J.: Lectures on Functional Equations and Their Applications. Academic, New York (1966)
[Andreatta et al.2011] Andreatta, M., Bezdek, A., Boroński, J.P.: The problem of Malfatti: two centuries of debate. Math. Intell. 33(1), 72–76 (2011)
[Andreescu et al.2006] Andreescu, T., Mushkarov, O., Stoyanov, L.: Geometric Problems on Maxima and Minima. Birkhäuser, Boston (2006)
[Boyd and Vandenberghe2004] Boyd, S., Vandenberghe, L.: Convex Optimization. Cambridge University Press, Cambridge (2004)
[Dahl and Margot1998] Dahl, G., Margot, F.: Weak $k$-majorization and polyhedra. Math. Program. 81(1), 37–53 (1998)
[Enkhbat and Barsbold2013] Enkhbat, R., Barsbold, B.: Optimal inscribing of two balls into polyhedral set. In: Chinchuluun, A., Pardalos, P.M., Enkhbat, R., Pistikopoulos, E.N. (eds.) Optimization, Simulation and Control, pp. 35–47. Springer, Heidelberg (2013)
[Goldberg1967] Goldberg, M.: On the original Malfatti’s problem. Math. Mag. 40(5), 241–247 (1967)
[Lob and Richmond1930] Lob, H., Richmond, H.W.: On the solution of Malfatti’s problem for a triangle. Proc. Lond. Math. Soc. 30(2), 287–304 (1930)
[Marshall et al.2011] Marshall, A.W., Olkin, I., Arnold, B.C.: Inequalities: Theory of Majorization and Its Applications, 2nd edn. Springer, Heidelberg (2011)
[Niculescu and Persson2006] Niculescu, C., Persson, L.-E.: Convex Functions and their Applications: A Contemporary Approach. Springer, Heidelberg (2006)
[Szabó et al.2007] Szabó, P.G., Markót, M.C., Csendes, T., Specht, E., Casado, L.G., García, I.: New Approaches to Circles Packing in a Square. Springer, Heidelberg (2007)
[Zalgaller and Los1994] Zalgaller, V.A., Los, G.A.: The solution of Malfatti’s problem. J. Math. Sci. 74(4), 3163–3177 (1994)

On Malfatti’s Marble Problem

Abstract

Keywords

1 Introduction

Problem 1.

2 The Proof

2.1 Preliminaries

Lemma 1.

Proof.

Lemma 2.

Theorem 1.

Proof.

Lemma 3.

Proof.

Theorem 2.

Proof.

2.2 Solution to Malfatti’s Marble Problem

Theorem 3.

Proof.

3 Some Related Extremum Problems

Problem 2.

Theorem 4.

Proof.

Lemma 4.

Conjecture 1.

Conjecture 2.

Lemma 5.

Proof.

Lemma 6.

Proof.

Problem 3.

Theorem 5.

Proof.

Conjecture 3.

Acknowledgements

References