Six-vertex model with rare corners and random restricted permutations

Vadim Gorin and Richard Kenyon Vadim Gorin
Departments of Statistics and Mathematics, UC Berkeley
[email protected] Richard Kenyon
Department of Mathematics, Yale University, New Haven, 06920
richard dot kenyon at yale.edu
(Date: August 26, 2024)
Abstract.

We study limit shapes in two equivalent models: the six-vertex model in the c0𝑐0c\to 0italic_c → 0 limit and the random Mallows permutation with restricted permutation matrix. We give the Euler-Lagrange equation for the limit shape and show how to solve it for a class of rectilinear polygonal domains. Its solutions are given by piecewise-algebraic functions with lines of discontinuities.

1. Introduction

The six-vertex model (see Figures 1, 2) is one of the most fundamental and well-studied models of statistical mechanics, with deep connections to combinatorics, probability, and quantum algebra, we refer to [LW72, Bax07, Res10, BL14, GN23] for extensive reviews. Despite spectacular progress on exact solutions by Lieb [Lie67], Sutherland-Yang-Yang [SYY67], Nolden [Nol92], Noh-Kim [NK96], and others, in the most general setting it remains unsolved: even closed expressions for the partition function on the torus or free energy as a function of the slope are not known.

In the combinatorics community, a six-vertex model setting of particular interest is given by “domain wall boundary conditions” (DWBC), as in Figure 2. This setup has a number of connections with algebraic and integrable combinatorics: special cases enumerate Alternating Sign Matrices, domino tilings of the Aztec Diamond, classes of symmetric polynomials, and so on, see e.g. [EKLP92, BP99, Gie09, ZJ09, DF18, ZJ24] and references therein.

In this paper we study the c0𝑐0c\to 0italic_c → 0 limit of the six-vertex model. For DWBC this is a point where the model is isomorphic to a random Mallows permutation, that is a permutation σ𝜎\sigmaitalic_σ weighted by qinv(σ)superscript𝑞inv𝜎q^{\mathrm{inv}(\sigma)}italic_q start_POSTSUPERSCRIPT roman_inv ( italic_σ ) end_POSTSUPERSCRIPT, where inv(σ)inv𝜎\mathrm{inv}(\sigma)roman_inv ( italic_σ ) is the number of inversions (and q=b2/a2𝑞superscript𝑏2superscript𝑎2q=b^{2}/a^{2}italic_q = italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is a positive parameter). The Mallows measure has been intensively studied since its introduction in [Mal57]; we refer to [HMV22, AK24] for two recent papers and many references to previous work. The closest to our text are papers [Sta09, SW18] which study the macroscopic limit shapes for random Mallows permutations.

In the present work we go beyond DWBC for the six-vertex model and obtain explicit limit shapes for a variety of natural domains, including polygonal subdomains for the domain-wall boundary conditions. In the random Mallows permutations language, we deal with permutations with restricted positions, as in [DGH01]. Our limit shapes satisfy a variational principle whose associated PDE is the hyperbolic Liouville equation

(logg)xy=2rg,subscript𝑔𝑥𝑦2𝑟𝑔(\log g)_{xy}=2rg,( roman_log italic_g ) start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT = 2 italic_r italic_g ,

for a function g=g(x,y)𝑔𝑔𝑥𝑦g=g(x,y)italic_g = italic_g ( italic_x , italic_y ), where r𝑟ritalic_r is a constant governing the speed of convergence of q𝑞qitalic_q to 1111 as the domain grows, see Proposition 3.11 and Remark 4.2. Surprisingly, our solutions are usually only piecewise analytic and in fact discontinuous, see Figures 5,6,7,8,9. We are in a remarkable situation where we can find explicit equations for nonanalytic limit shapes.

In the q=1𝑞1q=1italic_q = 1 case we prove that the variational problem has a unique maximizer, that is, limit shape, for any rectilinear domain, and give a robust procedure for computing it, based on the Sinkhorn-Knopp algorithm. For general q1𝑞1q\neq 1italic_q ≠ 1 (equivalently, r0𝑟0r\neq 0italic_r ≠ 0) we do not know whether the Euler-Lagrange equations associated to the variational problem have a unique solution. However, for a wide class of cases with q𝑞qitalic_q close to 1111, we are able to prove the uniqueness and further express the limit shape in terms of solutions to certain algebraic equations, see Theorems 3.15 and 4.9. As an outcome, in this situation the function g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) encoding the limit shape is a piecewise-rational function of erxsuperscript𝑒𝑟𝑥e^{rx}italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT, erysuperscript𝑒𝑟𝑦e^{ry}italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT, see Corollary 4.11 for the exact statement. We supplement our general theorems with explicit computations for a number of examples in Sections 4 and 5.

We treat our results as an important contribution to the long-standing open problem of finding the limit shapes for the six-vertex model, discussed, e.g., in [ZJ02, PR10, Res10]. There are only three previous special cases, where a full description of the set of limit shapes has been obtained. For the free-fermion subvariety in the space of the parameters, the model is equivalent to random domino tilings (cf. [FS06]) and in this language the model was analyzed in [CKP01, KO07, BK16, KP24], see also [Gor21]. In [dGKW21] and [KP22] limit shapes for the case ΔΔ\Delta\to\inftyroman_Δ → ∞ (the so-called five vertex model) were worked out. For stochastic weights and free boundary conditions, the model is equivalent to an interacting particle system, and this fact was used to compute limit shapes in [GS92, BCG16, BG19, Agg20b]. From this perspective, our key discovery is that there exists a fourth case, where the six-vertex model is equivalent to another system (random permutations) and its limit shapes can be found.

An interesting accompanying question, which we do not address in this work, is how to design efficient perfect samplers for configurations of the six-vertex model with c𝑐citalic_c close to 00 or for random Mallows restricted permutations. In the six-vertex literature a standard approach is to run a local Glauber dynamics until it mixes, as in [KS18, FR19, BR22, LKV23, PS23]; in the random permutations literature, sampling restricted permutations by multiplying with random transpositions was discussed in [DGH01, Bor11]. However, for general q𝑞qitalic_q and the domains we consider here the existing algorithms can be prohibitively slow.


Acknowledgments. We thank István Prause for helpful comments. Part of this research was performed while the authors were visiting the Institute for Pure and Applied Mathematics (IPAM), which is supported by the National Science Foundation (Grant No. DMS-1925919). V.G. was partially supported by NSF grant DMS - 2246449. R.K. was supported by NSF DMS-1940932 and the Simons Foundation grant 327929.

2. Problem setup

We present here two different points of view on the problem we study: limit shapes for the height function in an instance of the six-vertex model, permuton-type limits of restricted random permutations.

2.1. Six-vertex model with rare corners

A configuration in the six-vertex model in a finite domain Ω2Ωsuperscript2\Omega\subset\mathbb{Z}^{2}roman_Ω ⊂ blackboard_Z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is an assignment to the vertices of this domain of one of the six types of Figure 1 in a consistent way, i.e. so that the outcome is a collection of paths which are allowed to touch each other, but not to cross, and connecting points on the boundary of ΩΩ\Omegaroman_Ω. Let ΩΩ\partial\Omega∂ roman_Ω be the collection of all edges of the grid, connecting ΩΩ\Omegaroman_Ω to its complement 2Ωsuperscript2Ω\mathbb{Z}^{2}\setminus\Omegablackboard_Z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∖ roman_Ω. By boundary condition, we mean a choice of {0,1}01\{0,1\}{ 0 , 1 }-sequence of length |Ω|Ω|\partial\Omega|| ∂ roman_Ω |, where each 1111 indicates that the corresponding edge of ΩΩ\Omegaroman_Ω is occupied by a path and 00 indicates that the edge is not occupied. A configuration is consistent with boundary conditions, if the paths leave/enter domain ΩΩ\Omegaroman_Ω through the edges of ΩΩ\partial\Omega∂ roman_Ω labeled with 1111s and only through them.

Refer to caption
Figure 1. The six types of vertices and their weights.

As a simple example, take ΩΩ\Omegaroman_Ω to be the N×N𝑁𝑁N\times Nitalic_N × italic_N square, Ω={1,,N}×{1,,N}Ω1𝑁1𝑁\Omega=\{1,\dots,N\}\times\{1,\dots,N\}roman_Ω = { 1 , … , italic_N } × { 1 , … , italic_N } with |Ω|=4NΩ4𝑁|\partial\Omega|=4N| ∂ roman_Ω | = 4 italic_N and the boundary condition such that all the edges along the left and top boundaries are occupied and all the edges along the bottom and right boundaries are not occupied, see Figure 2. This situation is usually called “Domain Wall Boundary Conditions”.

Refer to caption
Refer to caption
Refer to caption
Figure 2. N×N𝑁𝑁N\times Nitalic_N × italic_N square with Domain Wall Boundary Conditions for N=5𝑁5N=5italic_N = 5 and two possible configurations of paths with c𝑐citalic_c–type vertices emphasized.

For a more general class of domains, we need an auxiliary definition.

Definition 2.1.

A k×𝑘k\times\ellitalic_k × roman_ℓ array [Iuv]1uk, 1vsubscriptdelimited-[]subscript𝐼𝑢𝑣formulae-sequence1𝑢𝑘1𝑣\bigl{[}I_{uv}\bigr{]}_{1\leq u\leq k,\,1\leq v\leq\ell}[ italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 ≤ italic_u ≤ italic_k , 1 ≤ italic_v ≤ roman_ℓ end_POSTSUBSCRIPT with 0/1010/10 / 1 entries is called convex, if it is horizontally and vertically convex, that is, if for each horizontal111We use the Cartesian coordinate system for enumerating elements of the array I𝐼Iitalic_I, rather than the usual matrix notation, so the origin of the array is in the lower left corner. coordinate u𝑢uitalic_u, the set {vIuv=1}conditional-set𝑣subscript𝐼𝑢𝑣1\{v\mid I_{uv}=1\}{ italic_v ∣ italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1 } is a non-empty connected segment of integers, and for each vertical coordinate v𝑣vitalic_v, the set {uIuv=1}conditional-set𝑢subscript𝐼𝑢𝑣1\{u\mid I_{uv}=1\}{ italic_u ∣ italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1 } is a non-empty connected segment of integers.

Remark 2.2.

Any convex array can be constructed by starting from k×𝑘k\times\ellitalic_k × roman_ℓ array of all 1111s, choosing four Young diagrams adjacent to the four corners, and converting the numbers inside those Young diagrams into 00s.

Example 2.3.

The array (110111010)matrix110111010\begin{pmatrix}1&1&0\\ 1&1&1\\ 0&1&0\end{pmatrix}( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) is convex, but (111110011)matrix111110011\begin{pmatrix}1&1&1\\ 1&1&0\\ 0&1&1\end{pmatrix}( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) is not.

We deal with domains parameterized by the following data:

  • Positive integers k𝑘kitalic_k, \ellroman_ℓ, and N𝑁Nitalic_N;

  • Integers 0=X0<X1<<Xk=N0subscript𝑋0subscript𝑋1subscript𝑋𝑘𝑁0=X_{0}<X_{1}<\dots<X_{k}=N0 = italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_N and 0=Y0<Y1<<Y=N0subscript𝑌0subscript𝑌1subscript𝑌𝑁0=Y_{0}<Y_{1}<\dots<Y_{\ell}=N0 = italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < italic_Y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_N;

  • A convex array [Iuv]1uk, 1vsubscriptdelimited-[]subscript𝐼𝑢𝑣formulae-sequence1𝑢𝑘1𝑣\bigl{[}I_{uv}\bigr{]}_{1\leq u\leq k,\,1\leq v\leq\ell}[ italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 ≤ italic_u ≤ italic_k , 1 ≤ italic_v ≤ roman_ℓ end_POSTSUBSCRIPT.

Definition 2.4.

We define ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT to be a subset of {1,,N}2superscript1𝑁2\{1,\dots,N\}^{2}{ 1 , … , italic_N } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT given by

ΩX,Y,I={(x,y)2Xu1<xXu,Yv1<yYv for some (u,v) such that Iuv=1}.superscriptΩ𝑋𝑌𝐼conditional-set𝑥𝑦superscript2formulae-sequencesubscript𝑋𝑢1𝑥subscript𝑋𝑢subscript𝑌𝑣1𝑦subscript𝑌𝑣 for some 𝑢𝑣 such that subscript𝐼𝑢𝑣1\Omega^{X,Y,I}=\{(x,y)\in\mathbb{Z}^{2}\mid X_{u-1}<x\leq X_{u},\,Y_{v-1}<y% \leq Y_{v}\text{ for some }(u,v)\text{ such that }I_{uv}=1\}.roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT = { ( italic_x , italic_y ) ∈ blackboard_Z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∣ italic_X start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT < italic_x ≤ italic_X start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT < italic_y ≤ italic_Y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT for some ( italic_u , italic_v ) such that italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1 } .

The boundary conditions are: horizontal paths enter into the left-most vertices in each of the N𝑁Nitalic_N rows; vertical paths leave the upper-most vertices in each of the N𝑁Nitalic_N columns.

We refer to Figure 3 for an example of ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT, and remark that for k==1𝑘1k=\ell=1italic_k = roman_ℓ = 1, this is exactly the N×N𝑁𝑁N\times Nitalic_N × italic_N square with Domain Wall Boundary Conditions, as in Figure 2.

Refer to caption
Refer to caption
Figure 3. Domain ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT with N=6𝑁6N=6italic_N = 6, k==3𝑘3k=\ell=3italic_k = roman_ℓ = 3, X=(0,2,4,6)𝑋0246X=(0,2,4,6)italic_X = ( 0 , 2 , 4 , 6 ), Y=(0,2,4,6)𝑌0246Y=(0,2,4,6)italic_Y = ( 0 , 2 , 4 , 6 ), I=(110111011)𝐼matrix110111011I=\begin{pmatrix}1&1&0\\ 1&1&1\\ 0&1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ), and one possible configuration of paths with c𝑐citalic_c–type vertices emphasized and the values of the height function in light gray.

Next, we introduce a weight wi>0subscript𝑤𝑖0w_{i}>0italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 for each vertex type i=1,2,,6𝑖126i=1,2,\dots,6italic_i = 1 , 2 , … , 6 and define the probability of a configuration σ𝜎\sigmaitalic_σ of the six-vertex model in domain ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT as

(1) Prob(σ)=1Zi=16wiNi(σ),Prob𝜎1𝑍superscriptsubscriptproduct𝑖16superscriptsubscript𝑤𝑖subscript𝑁𝑖𝜎\mathrm{Prob}(\sigma)=\frac{1}{Z}\prod_{i=1}^{6}w_{i}^{N_{i}(\sigma)},roman_Prob ( italic_σ ) = divide start_ARG 1 end_ARG start_ARG italic_Z end_ARG ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT ,

where Ni(σ)subscript𝑁𝑖𝜎N_{i}(\sigma)italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_σ ) is the number of vertices of Type i𝑖iitalic_i in the configuration σ𝜎\sigmaitalic_σ. The normalization Z𝑍Zitalic_Z in (1) is chosen so that the sum over all σ𝜎\sigmaitalic_σ in ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT is 1111. One can show that for these deterministically fixed boundary conditions, the probability measure (1) depends on two parameters, rather than six (there are four conservation laws, see e.g. [GN23, Lemma 2.1]). Hence, there is no loss of generality to consider six symmetric weights (a,a,b,b,c,c)𝑎𝑎𝑏𝑏𝑐𝑐(a,a,b,b,c,c)( italic_a , italic_a , italic_b , italic_b , italic_c , italic_c ), as in Figure 1.

We study random configurations of the six-vertex model inΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT via their height functions.

Definition 2.5.

The height function H(x,y)𝐻𝑥𝑦H(x,y)italic_H ( italic_x , italic_y ) is defined for x,y+12𝑥𝑦12x,y\in\mathbb{Z}+\frac{1}{2}italic_x , italic_y ∈ blackboard_Z + divide start_ARG 1 end_ARG start_ARG 2 end_ARG situated inside the domain as the total number of paths one crosses to get from (x,y)𝑥𝑦(x,y)( italic_x , italic_y ) to the bottom-right corner of the domain.

We note that along the boundary of ΩΩ\Omegaroman_Ω, the values of H(x,y)𝐻𝑥𝑦H(x,y)italic_H ( italic_x , italic_y ) do not depend on the choice of the configuration and can be seen as an alternative encoding of the boundary conditions, cf. Figure 3.

Question 2.6.

What is the asymptotic behavior of 1NH(Nx,Ny)1𝑁𝐻𝑁𝑥𝑁𝑦\frac{1}{N}H(Nx,Ny)divide start_ARG 1 end_ARG start_ARG italic_N end_ARG italic_H ( italic_N italic_x , italic_N italic_y ) for large N𝑁Nitalic_N?

One hopes to have a deterministic limit 𝔥(x,y)=limN1NH(Nx,Ny)𝔥𝑥𝑦subscript𝑁1𝑁𝐻𝑁𝑥𝑁𝑦\mathfrak{h}(x,y)=\lim_{N\to\infty}\frac{1}{N}H(Nx,Ny)fraktur_h ( italic_x , italic_y ) = roman_lim start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_N end_ARG italic_H ( italic_N italic_x , italic_N italic_y ), and we refer to it as the limit shape. For generic values of (a,b,c)𝑎𝑏𝑐(a,b,c)( italic_a , italic_b , italic_c ) the limit shape is unknown even for the N×N𝑁𝑁N\times Nitalic_N × italic_N square of Figure 2. The notable exception is on the so-called free fermionic subvariety: this means a choice of (a,b,c)𝑎𝑏𝑐(a,b,c)( italic_a , italic_b , italic_c ) satisfying a2+b2c2=0superscript𝑎2superscript𝑏2superscript𝑐20a^{2}+b^{2}-c^{2}=0italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_c start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0. In this situation a bijection of the model with random domino tilings (see e.g., [FS06] and references therein) brings in various tools that can be used to find the limit shape 𝔥(x,y)𝔥𝑥𝑦\mathfrak{h}(x,y)fraktur_h ( italic_x , italic_y ), e.g. [CKP01], [BK16], [KP24].

We do not have anything new to say at generic (a,b,c)𝑎𝑏𝑐(a,b,c)( italic_a , italic_b , italic_c ), but we study here the case c=0𝑐0c=0italic_c = 0. In the limit c0𝑐0c\to 0italic_c → 0, the only surviving configurations of the model have the minimal possible number of vertices of Types 5555 and 6666. The specific form of ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT and its boundary conditions guarantee that the minimum is achieved on the configurations which have N𝑁Nitalic_N Type 5555 vertices (one per row, one per column) and zero Type 6666 vertices. Hence, we can equivalently use the degenerate weights of Figure 1. In particular, the right configurations of Figures 2 and 3 have positive probabilities in this limit, while the middle configuration of Figure 2 has vanishing probability and disappears.

Remark 2.7.

In our degeneration only five vertices remain. However, these are not the same five vertices as in [dGKW21], and our model and limit shapes are very different from those.


Our task is to describe the limit shape 𝔥(x,y)𝔥𝑥𝑦\mathfrak{h}(x,y)fraktur_h ( italic_x , italic_y ) for ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT with degenerate weights.

2.2. Restricted random permutations

We restart and give a different definition of our objects of interest. We deal with random permutations σ𝔖N𝜎subscript𝔖𝑁\sigma\in\mathfrak{S}_{N}italic_σ ∈ fraktur_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT of N𝑁Nitalic_N letters. There are many ways to introduce a probability distribution on permutations. We fix real q>0𝑞0q>0italic_q > 0 and focus on the celebrated Mallows measure:

(2) Prob(σ)=1Zqinv(σ),inv(σ)=#{i<jσ(i)>σ(j)}.formulae-sequenceProb𝜎1𝑍superscript𝑞inv𝜎inv𝜎#𝑖inner-product𝑗𝜎𝑖𝜎𝑗\mathrm{Prob}(\sigma)=\frac{1}{Z}q^{\mathrm{inv}(\sigma)},\qquad\mathrm{inv}(% \sigma)=\#\bigl{\{}i<j\mid\sigma(i)>\sigma(j)\bigr{\}}.roman_Prob ( italic_σ ) = divide start_ARG 1 end_ARG start_ARG italic_Z end_ARG italic_q start_POSTSUPERSCRIPT roman_inv ( italic_σ ) end_POSTSUPERSCRIPT , roman_inv ( italic_σ ) = # { italic_i < italic_j ∣ italic_σ ( italic_i ) > italic_σ ( italic_j ) } .

Such a measure was originally introduced in [Mal57] and since that time it has been intensively studied in probability, statistics, and theoretical physics. For instance, [HMV22] contains one recent result and many references to previous work. Note that when q=1𝑞1q=1italic_q = 1 this is the uniform measure on permutations.

In (2) a further choice can be made: either σ𝜎\sigmaitalic_σ can be any of the N!𝑁N!italic_N ! permutations, or there can be additional restrictions on σ𝜎\sigmaitalic_σ. The restrictions we deal with are parameterized by the following data abbreviated ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT:

  • Positive integers k𝑘kitalic_k, \ellroman_ℓ, and N𝑁Nitalic_N;

  • Integers 0=X0<X1<<Xk=N0subscript𝑋0subscript𝑋1subscript𝑋𝑘𝑁0=X_{0}<X_{1}<\dots<X_{k}=N0 = italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_N and 0=Y0<Y1<<Y=N0subscript𝑌0subscript𝑌1subscript𝑌𝑁0=Y_{0}<Y_{1}<\dots<Y_{\ell}=N0 = italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < italic_Y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_N;

  • A 0/1010/10 / 1 array [Iuv]1uk, 1vsubscriptdelimited-[]subscript𝐼𝑢𝑣formulae-sequence1𝑢𝑘1𝑣\bigl{[}I_{uv}\bigr{]}_{1\leq u\leq k,\,1\leq v\leq\ell}[ italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 ≤ italic_u ≤ italic_k , 1 ≤ italic_v ≤ roman_ℓ end_POSTSUBSCRIPT.

Definition 2.8.

We say that a permutation σSN𝜎subscript𝑆𝑁\sigma\in S_{N}italic_σ ∈ italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT is restricted by ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT, if for each m=1,,N𝑚1𝑁m=1,\dots,Nitalic_m = 1 , … , italic_N, we have Iu(m),v(m)=1subscript𝐼𝑢𝑚𝑣𝑚1I_{u(m),v(m)}=1italic_I start_POSTSUBSCRIPT italic_u ( italic_m ) , italic_v ( italic_m ) end_POSTSUBSCRIPT = 1, where

  • u(m)𝑢𝑚u(m)italic_u ( italic_m ) is the index 1uk1𝑢𝑘1\leq u\leq k1 ≤ italic_u ≤ italic_k, such that Xu1<σ(m)Xusubscript𝑋𝑢1𝜎𝑚subscript𝑋𝑢X_{u-1}<\sigma(m)\leq X_{u}italic_X start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT < italic_σ ( italic_m ) ≤ italic_X start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT; and

  • v(m)𝑣𝑚v(m)italic_v ( italic_m ) is the index 1vk1𝑣𝑘1\leq v\leq k1 ≤ italic_v ≤ italic_k, such that Yv1<mYvsubscript𝑌𝑣1𝑚subscript𝑌𝑣Y_{v-1}<m\leq Y_{v}italic_Y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT < italic_m ≤ italic_Y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT.

Remark 2.9.

If one identifies σ𝜎\sigmaitalic_σ with a collection of points {(σ(m),m)}m=1Nsuperscriptsubscript𝜎𝑚𝑚𝑚1𝑁\{(\sigma(m),m)\}_{m=1}^{N}{ ( italic_σ ( italic_m ) , italic_m ) } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT, then the condition says that all these points must be in ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT of Definition 2.4.

In particular, if I𝐼Iitalic_I is a matrix of all ones, then there are no restrictions and all N!𝑁N!italic_N ! permutations are possible; other choices of I𝐼Iitalic_I introduce various constraints, see Figure 4 for an example. We fix (X,Y,I)𝑋𝑌𝐼(X,Y,I)( italic_X , italic_Y , italic_I ) and analyze random permutations restricted by ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT and sampled according to (2). Such restricted random permutations play an important role in developing various statistical tests; we refer to [DGH01] for a review.

Remark 2.10.

If I𝐼Iitalic_I is an array of all ones, i.e., there are no restrictions, then Z𝑍Zitalic_Z in (2) is explicit: Z=(1+q)(1+q+q2)(1+q+qN1)𝑍1𝑞1𝑞superscript𝑞21𝑞superscript𝑞𝑁1Z=(1+q)(1+q+q^{2})\cdots(1+q+\dots q^{N-1})italic_Z = ( 1 + italic_q ) ( 1 + italic_q + italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ⋯ ( 1 + italic_q + … italic_q start_POSTSUPERSCRIPT italic_N - 1 end_POSTSUPERSCRIPT ). For generic (X,Y,I)𝑋𝑌𝐼(X,Y,I)( italic_X , italic_Y , italic_I ) we do not expect any explicit formula for Z𝑍Zitalic_Z.

Refer to caption
Refer to caption
Figure 4. Possible permutations for two choices of ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT with N=6𝑁6N=6italic_N = 6, k==3𝑘3k=\ell=3italic_k = roman_ℓ = 3, X=(0,2,4,6)𝑋0246X=(0,2,4,6)italic_X = ( 0 , 2 , 4 , 6 ), Y=(0,2,4,6)𝑌0246Y=(0,2,4,6)italic_Y = ( 0 , 2 , 4 , 6 ). Points (σ(m),m)𝜎𝑚𝑚(\sigma(m),m)( italic_σ ( italic_m ) , italic_m ) are prohibited in red shaded regions. I=(111101111)𝐼matrix111101111I=\begin{pmatrix}1&1&1\\ 1&0&1\\ 1&1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) on the left and I=(110111011)𝐼matrix110111011I=\begin{pmatrix}1&1&0\\ 1&1&1\\ 0&1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) on the right with σ=(152653)𝜎152653\sigma=(152653)italic_σ = ( 152653 ) and σ=(546132)𝜎546132\sigma=(546132)italic_σ = ( 546132 ), respectively.

We recall that a permuton is a probability measure μ𝜇\muitalic_μ on [0,1]2superscript012[0,1]^{2}[ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT with uniform marginals, i.e. such that μ([0,1]×[0,y])=y𝜇010𝑦𝑦\mu([0,1]\times[0,y])=yitalic_μ ( [ 0 , 1 ] × [ 0 , italic_y ] ) = italic_y and μ([0,x]×[0,1])=x𝜇0𝑥01𝑥\mu([0,x]\times[0,1])=xitalic_μ ( [ 0 , italic_x ] × [ 0 , 1 ] ) = italic_x for all 0x,y1formulae-sequence0𝑥𝑦10\leq x,y\leq 10 ≤ italic_x , italic_y ≤ 1. Each permutation σ𝔖N𝜎subscript𝔖𝑁\sigma\in\mathfrak{S}_{N}italic_σ ∈ fraktur_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT can be encoded by a permuton μσsubscript𝜇𝜎\mu_{\sigma}italic_μ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT of density

(3) μσ(x,y)=Nm=1N𝟏σ(m)1<xσ(m)𝟏m1<ym.subscript𝜇𝜎𝑥𝑦𝑁superscriptsubscript𝑚1𝑁subscript1𝜎𝑚1𝑥𝜎𝑚subscript1𝑚1𝑦𝑚\mu_{\sigma}(x,y)=N\sum_{m=1}^{N}\mathbf{1}_{\sigma(m)-1<x\leq\sigma(m)}% \mathbf{1}_{m-1<y\leq m}.italic_μ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ( italic_x , italic_y ) = italic_N ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT bold_1 start_POSTSUBSCRIPT italic_σ ( italic_m ) - 1 < italic_x ≤ italic_σ ( italic_m ) end_POSTSUBSCRIPT bold_1 start_POSTSUBSCRIPT italic_m - 1 < italic_y ≤ italic_m end_POSTSUBSCRIPT .

In words, each point (σ(m),m)𝜎𝑚𝑚(\sigma(m),m)( italic_σ ( italic_m ) , italic_m ) gives rise to a 1N×1N1𝑁1𝑁\tfrac{1}{N}\times\tfrac{1}{N}divide start_ARG 1 end_ARG start_ARG italic_N end_ARG × divide start_ARG 1 end_ARG start_ARG italic_N end_ARG square of density N𝑁Nitalic_N for μσsubscript𝜇𝜎\mu_{\sigma}italic_μ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT.

A random permutation σ𝜎\sigmaitalic_σ restricted by ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT and sampled according to (2) gives rise to a random permuton μσsubscript𝜇𝜎\mu_{\sigma}italic_μ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT. Our goal is to understand the limits of these permutons as N𝑁N\to\inftyitalic_N → ∞. Due to [Sta09], the answer is known for unrestricted permutations (i.e. when I𝐼Iitalic_I is the array filled with 1111s), and we would like to understand how it changes when we introduce restrictions.

2.3. Between the six-vertex and permutation points of view

Let us explain how the settings of Sections 2.1 and 2.2 are interconnected.

Theorem 2.11.

Take the (X,Y,I)𝑋𝑌𝐼(X,Y,I)( italic_X , italic_Y , italic_I ) data of Definitions 2.4, 2.8 with convex array I𝐼Iitalic_I and assume that there exists at least one permutation restricted by ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT. Consider the correspondence between configurations σ6vsuperscript𝜎6𝑣\sigma^{6v}italic_σ start_POSTSUPERSCRIPT 6 italic_v end_POSTSUPERSCRIPT of the six-vertex model in domain ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT with degenerate weights of Figure 1 (i.e. without Type 6666 vertices) and permutations σpermsuperscript𝜎perm\sigma^{\text{perm}}italic_σ start_POSTSUPERSCRIPT perm end_POSTSUPERSCRIPT restricted by ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT, identifying the Type 5555 vertices in σ6vsuperscript𝜎6𝑣\sigma^{6v}italic_σ start_POSTSUPERSCRIPT 6 italic_v end_POSTSUPERSCRIPT with points {(σperm(m),m)}m=1Nsuperscriptsubscriptsuperscript𝜎perm𝑚𝑚𝑚1𝑁\{(\sigma^{\text{perm}}(m),m)\}_{m=1}^{N}{ ( italic_σ start_POSTSUPERSCRIPT perm end_POSTSUPERSCRIPT ( italic_m ) , italic_m ) } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT. This correspondence is a weight-preserving bijection between probability measures (1) and (2) with q=b2a2𝑞superscript𝑏2superscript𝑎2q=\frac{b^{2}}{a^{2}}italic_q = divide start_ARG italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG.

Remark 2.12.

For non-convex I𝐼Iitalic_I, the restricted permutations make perfect sense, but the corresponding object on the six-vertex side is less natural. We would need to keep track of the six-vertex configurations inside the holes in ΩΩ\Omegaroman_Ω (cf. the left panel in Figure 4) and restrict to have no c𝑐citalic_c-type vertices there.

Proof of Theorem 2.11.

Consider a configuration of the six-vertex model in domain ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT. Let us trace a row of the domain from left to right. Note that on the left border there is an incoming horizontal path, while on the right border there is none. Hence, there should be at least one vertex of Type 5555 in this row. By the same argument there should be at least one vertex of Type 5555 in each of N𝑁Nitalic_N columns. Therefore, the minimum number of vertices of Types 5555 and 6666 is N𝑁Nitalic_N and it is achieved by having exactly one Type 5555 vertex in each row and column (and no vertices of Type 6666). Hence, we arrive at the model with degenerate weights of Figure 1 and with configuration of the Type 5555 vertices forming a permutation matrix. It is straightforward to check that these N𝑁Nitalic_N Type 5555 vertices uniquely determine the types of all other vertices and that every permutation restricted by ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT can be obtained in this way. This is our bijection between degenerate six-vertex configurations and permutations; it remains to check the correspondence between the weights (1) and (2).

For that we extend a configuration of the six-vertex model in domain ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT to the configuration in N×N𝑁𝑁N\times Nitalic_N × italic_N square with Domain Wall Boundary conditions. We note that {1,,N}2ΩX,Y,Isuperscript1𝑁2superscriptΩ𝑋𝑌𝐼\{1,\dots,N\}^{2}\setminus\Omega^{X,Y,I}{ 1 , … , italic_N } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∖ roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT is a disjoint union of four connected domains adjacent to the four corners on the square (in Figure 3, the top–left and bottom-right domains are empty, while the top-right and bottom–left domains are 2×2222\times 22 × 2 squares), add Type 1111 vertices in all positions of the bottom–right domain, add Type 2222 vertices in the top–left domain, add Type 3333 vertices in the bottom–left domain, and Type 4444 vertices in the top–right domain. Since the additional vertices are exactly the same for each configuration in ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT, this extension does not change the probability measure (1).222This step would not have worked the same way for non-convex I𝐼Iitalic_I.

In order to compute the weight of the configuration in the full N×N𝑁𝑁N\times Nitalic_N × italic_N square, we repeat the argument of [GL23, Proposition 7.2]: In row k𝑘kitalic_k of the configuration corresponding to the permutation σ𝜎\sigmaitalic_σ there is exactly one Type 5555 vertex and:

  • Nσ(k)#{i<k:σ(i)>σ(k)}𝑁𝜎𝑘#conditional-set𝑖𝑘𝜎𝑖𝜎𝑘N-\sigma(k)-\#\{i<k:\sigma(i)>\sigma(k)\}italic_N - italic_σ ( italic_k ) - # { italic_i < italic_k : italic_σ ( italic_i ) > italic_σ ( italic_k ) } vertices of Type 1111,

  • #{i<k:σ(i)<σ(k)}#conditional-set𝑖𝑘𝜎𝑖𝜎𝑘\#\{i<k:\sigma(i)<\sigma(k)\}# { italic_i < italic_k : italic_σ ( italic_i ) < italic_σ ( italic_k ) } vertices of Type 2222,

  • σ(k)1#{i<k:σ(i)<σ(k)}𝜎𝑘1#conditional-set𝑖𝑘𝜎𝑖𝜎𝑘\sigma(k)-1-\#\{i<k:\sigma(i)<\sigma(k)\}italic_σ ( italic_k ) - 1 - # { italic_i < italic_k : italic_σ ( italic_i ) < italic_σ ( italic_k ) } vertices of Type 3333,

  • #{i<k:σ(i)>σ(k)}#conditional-set𝑖𝑘𝜎𝑖𝜎𝑘\#\{i<k:\sigma(i)>\sigma(k)\}# { italic_i < italic_k : italic_σ ( italic_i ) > italic_σ ( italic_k ) } vertices of Type 4444.

Note that #{i<k:σ(i)<σ(k)}=k1#{i<k:σ(i)>σ(k)}#conditional-set𝑖𝑘𝜎𝑖𝜎𝑘𝑘1#conditional-set𝑖𝑘𝜎𝑖𝜎𝑘\#\{i<k:\sigma(i)<\sigma(k)\}=k-1-\#\{i<k:\sigma(i)>\sigma(k)\}# { italic_i < italic_k : italic_σ ( italic_i ) < italic_σ ( italic_k ) } = italic_k - 1 - # { italic_i < italic_k : italic_σ ( italic_i ) > italic_σ ( italic_k ) }. Hence, the product of the weights (we use the degenerate weights of Figure 1) of the vertices in row k𝑘kitalic_k is

aN+k1σ(k)2#{i<k:σ(i)>σ(k)}bσ(k)k+2#{i<k:σ(i)>σ(k)}.superscript𝑎𝑁𝑘1𝜎𝑘2#conditional-set𝑖𝑘𝜎𝑖𝜎𝑘superscript𝑏𝜎𝑘𝑘2#conditional-set𝑖𝑘𝜎𝑖𝜎𝑘a^{N+k-1-\sigma(k)-2\cdot\#\{i<k:\,\sigma(i)>\sigma(k)\}}\,b^{\sigma(k)-k+2% \cdot\#\{i<k:\,\sigma(i)>\sigma(k)\}}.italic_a start_POSTSUPERSCRIPT italic_N + italic_k - 1 - italic_σ ( italic_k ) - 2 ⋅ # { italic_i < italic_k : italic_σ ( italic_i ) > italic_σ ( italic_k ) } end_POSTSUPERSCRIPT italic_b start_POSTSUPERSCRIPT italic_σ ( italic_k ) - italic_k + 2 ⋅ # { italic_i < italic_k : italic_σ ( italic_i ) > italic_σ ( italic_k ) } end_POSTSUPERSCRIPT .

Multiplying over all k=1,,N𝑘1𝑁k=1,\dots,Nitalic_k = 1 , … , italic_N, and omitting a prefactor which does not depend on σ𝜎\sigmaitalic_σ, we conclude that the probability of a configuration corresponding to σ𝜎\sigmaitalic_σ is proportional to

(b2a2)#{1i<kN:σ(i)>σ(k)}=(b2a2)inv(σ).superscriptsuperscript𝑏2superscript𝑎2#conditional-set1𝑖𝑘𝑁𝜎𝑖𝜎𝑘superscriptsuperscript𝑏2superscript𝑎2inv𝜎\left(\frac{b^{2}}{a^{2}}\right)^{\#\{1\leq i<k\leq N:\,\sigma(i)>\sigma(k)\}}% =\left(\frac{b^{2}}{a^{2}}\right)^{\mathrm{inv}(\sigma)}.\qed( divide start_ARG italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT # { 1 ≤ italic_i < italic_k ≤ italic_N : italic_σ ( italic_i ) > italic_σ ( italic_k ) } end_POSTSUPERSCRIPT = ( divide start_ARG italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT roman_inv ( italic_σ ) end_POSTSUPERSCRIPT . italic_∎
Proposition 2.13.

Under the correspondence of Theorem 2.11, the height function of Definition 2.5 is related to the permuton of (3) by:

(4) 1NH(Nx+12,Ny+12)=0yx1μσ(s,t)𝑑s𝑑t.1𝑁𝐻𝑁𝑥12𝑁𝑦12superscriptsubscript0𝑦superscriptsubscript𝑥1subscript𝜇𝜎𝑠𝑡differential-d𝑠differential-d𝑡\frac{1}{N}H(Nx+\frac{1}{2},Ny+\frac{1}{2})=\int_{0}^{y}\int_{x}^{1}\mu_{% \sigma}(s,t)\,ds\,dt.divide start_ARG 1 end_ARG start_ARG italic_N end_ARG italic_H ( italic_N italic_x + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , italic_N italic_y + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ( italic_s , italic_t ) italic_d italic_s italic_d italic_t .
Remark 2.14.

The identity (4) is valid for x,y{0N,1N,,NN}𝑥𝑦0𝑁1𝑁𝑁𝑁x,y\in\left\{\tfrac{0}{N},\tfrac{1}{N},\dots,\tfrac{N}{N}\right\}italic_x , italic_y ∈ { divide start_ARG 0 end_ARG start_ARG italic_N end_ARG , divide start_ARG 1 end_ARG start_ARG italic_N end_ARG , … , divide start_ARG italic_N end_ARG start_ARG italic_N end_ARG }, but also can be taken as a definition of (piecewise-linear) extension of H𝐻Hitalic_H to real arguments.

Proof of Proposition 2.13.

H(Nx+12,Ny+12)𝐻𝑁𝑥12𝑁𝑦12H(Nx+\frac{1}{2},Ny+\frac{1}{2})italic_H ( italic_N italic_x + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , italic_N italic_y + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) is the total number of paths one crosses to get from (Nx+12,Ny+12)𝑁𝑥12𝑁𝑦12\left(Nx+\frac{1}{2},Ny+\frac{1}{2}\right)( italic_N italic_x + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , italic_N italic_y + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) to the bottom-right corner of the domain, which is equal to the total number of c𝑐citalic_c–type vertices situated in the down-right direction from (Nx+12,Ny+12)𝑁𝑥12𝑁𝑦12\left(Nx+\frac{1}{2},Ny+\frac{1}{2}\right)( italic_N italic_x + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , italic_N italic_y + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ). The latter number is the same as N0yx1μσ(s,t)𝑑s𝑑t𝑁superscriptsubscript0𝑦superscriptsubscript𝑥1subscript𝜇𝜎𝑠𝑡differential-d𝑠differential-d𝑡N\int_{0}^{y}\int_{x}^{1}\mu_{\sigma}(s,t)\,ds\,dtitalic_N ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ( italic_s , italic_t ) italic_d italic_s italic_d italic_t. ∎

3. Asymptotic theorems

3.1. Asymptotic regime

We fix k𝑘kitalic_k, \ellroman_ℓ, and matrix I𝐼Iitalic_I. In addition, we fix rational parameters 0=x0<x1<<xk1<xk=10subscriptx0subscriptx1subscriptx𝑘1subscriptx𝑘10=\mathrm{x}_{0}<\mathrm{x}_{1}<\dots<\mathrm{x}_{k-1}<\mathrm{x}_{k}=10 = roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT < roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1 and 0=y0<y1<<y1<y=10subscripty0subscripty1subscripty1subscripty10=\mathrm{y}_{0}<\mathrm{y}_{1}<\dots<\mathrm{y}_{\ell-1}<\mathrm{y}_{\ell}=10 = roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_y start_POSTSUBSCRIPT roman_ℓ - 1 end_POSTSUBSCRIPT < roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 let M𝑀Mitalic_M be the common denominator of all these parameters. We define 0=X0<X1<<Xk=N0subscript𝑋0subscript𝑋1subscript𝑋𝑘𝑁0=X_{0}<X_{1}<\dots<X_{k}=N0 = italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_N and 0=Y0<Y1<<Y=N0subscript𝑌0subscript𝑌1subscript𝑌𝑁0=Y_{0}<Y_{1}<\dots<Y_{\ell}=N0 = italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < italic_Y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_N through

(5) Xu=Nxu0uk,Yv=Nyv,0v.formulae-sequenceformulae-sequencesubscript𝑋𝑢𝑁subscriptx𝑢0𝑢𝑘formulae-sequencesubscript𝑌𝑣𝑁subscripty𝑣0𝑣X_{u}=N\mathrm{x}_{u}\quad 0\leq u\leq k,\qquad Y_{v}=N\mathrm{y}_{v},\quad 0% \leq v\leq\ell.italic_X start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = italic_N roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT 0 ≤ italic_u ≤ italic_k , italic_Y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT = italic_N roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT , 0 ≤ italic_v ≤ roman_ℓ .

N𝑁Nitalic_N will be a large parameter of the form N=Mn𝑁𝑀𝑛N=Mnitalic_N = italic_M italic_n, n=1,2,3,𝑛123n=1,2,3,\dotsitalic_n = 1 , 2 , 3 , …. This guarantees that all the numbers in (5) are integers. When we write N𝑁N\to\inftyitalic_N → ∞, what we mean is N=Mn𝑁𝑀𝑛N=Mnitalic_N = italic_M italic_n and n𝑛n\to\inftyitalic_n → ∞. For the parameter q=b2a2𝑞superscript𝑏2superscript𝑎2q=\frac{b^{2}}{a^{2}}italic_q = divide start_ARG italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG we further assume

(6) q=exp(rN),r.formulae-sequence𝑞𝑟𝑁𝑟q=\exp\left(-\frac{r}{N}\right),\qquad r\in\mathbb{R}.italic_q = roman_exp ( - divide start_ARG italic_r end_ARG start_ARG italic_N end_ARG ) , italic_r ∈ blackboard_R .

3.2. Variational principle

The large N𝑁Nitalic_N limits for configurations in the interpretations of either Sections 2.1 or 2.2, can be described via a variational principle which we now present. Taking into account the correspondence of Proposition 2.13, we encode the configurations by the normalized height function:

(7) hσ(x,y)=0yx1μσ(s,t)dsdt,0x,y1.formulae-sequencesubscript𝜎𝑥𝑦superscriptsubscript0𝑦superscriptsubscript𝑥1subscript𝜇𝜎𝑠𝑡differential-d𝑠differential-d𝑡formulae-sequence0𝑥𝑦1h_{\sigma}(x,y)=\int_{0}^{y}\int_{x}^{1}\mu_{\sigma}(s,t)\mathrm{d}s\mathrm{d}% t,\qquad 0\leq x,y\leq 1.italic_h start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ( italic_x , italic_y ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ( italic_s , italic_t ) roman_d italic_s roman_d italic_t , 0 ≤ italic_x , italic_y ≤ 1 .

The function hσ(x,y)subscript𝜎𝑥𝑦h_{\sigma}(x,y)italic_h start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ( italic_x , italic_y ) belongs to the following class of functions:

Definition 3.1.

𝔉𝔉\mathfrak{F}fraktur_F is the class of all real functions h(x,y)𝑥𝑦h(x,y)italic_h ( italic_x , italic_y ), 0x,y,10\leq x,y,\leq 10 ≤ italic_x , italic_y , ≤ 1, which are 1111–Lipshitz in both x𝑥xitalic_x and y𝑦yitalic_y variables, satisfy for all 0x1<x210subscript𝑥1subscript𝑥210\leq x_{1}<x_{2}\leq 10 ≤ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ 1, 0y1<y210subscript𝑦1subscript𝑦210\leq y_{1}<y_{2}\leq 10 ≤ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ 1, the inequality

h(x1,y1)+h(x2,y2)h(x1,y2)h(x2,y1)0,subscript𝑥1subscript𝑦1subscript𝑥2subscript𝑦2subscript𝑥1subscript𝑦2subscript𝑥2subscript𝑦10h(x_{1},y_{1})+h(x_{2},y_{2})-h(x_{1},y_{2})-h(x_{2},y_{1})\leq 0,italic_h ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + italic_h ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_h ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_h ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ≤ 0 ,

and satisfy the boundary conditions:

h(x,0)=0,h(x,1)=1x,h(0,y)=y,h(1,y)=0.formulae-sequence𝑥00formulae-sequence𝑥11𝑥formulae-sequence0𝑦𝑦1𝑦0h(x,0)=0,\qquad h(x,1)=1-x,\qquad h(0,y)=y,\qquad h(1,y)=0.italic_h ( italic_x , 0 ) = 0 , italic_h ( italic_x , 1 ) = 1 - italic_x , italic_h ( 0 , italic_y ) = italic_y , italic_h ( 1 , italic_y ) = 0 .
Remark 3.2.

Taking y1=0subscript𝑦10y_{1}=0italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0, we conclude that each h𝔉𝔉h\in\mathfrak{F}italic_h ∈ fraktur_F is weakly decreasing in x𝑥xitalic_x. Taking x2=1subscript𝑥21x_{2}=1italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1, we conclude that each h𝔉𝔉h\in\mathfrak{F}italic_h ∈ fraktur_F is weakly increasing in y𝑦yitalic_y.

We equip 𝔉𝔉\mathfrak{F}fraktur_F with the uniform topology, i.e. the topology generated by the metric d(f,g)=sup0x,y1|f(x,y)g(x,y)|𝑑𝑓𝑔subscriptsupremumformulae-sequence0𝑥𝑦1𝑓𝑥𝑦𝑔𝑥𝑦d(f,g)=\sup_{0\leq x,y\leq 1}|f(x,y)-g(x,y)|italic_d ( italic_f , italic_g ) = roman_sup start_POSTSUBSCRIPT 0 ≤ italic_x , italic_y ≤ 1 end_POSTSUBSCRIPT | italic_f ( italic_x , italic_y ) - italic_g ( italic_x , italic_y ) |. It is straightforward to check that the correspondence h(x,y)μ([x,1]×[0,y])𝑥𝑦𝜇𝑥10𝑦h(x,y)\leftrightarrow\mu([x,1]\times[0,y])italic_h ( italic_x , italic_y ) ↔ italic_μ ( [ italic_x , 1 ] × [ 0 , italic_y ] ) is a bijection between functions in 𝔉𝔉\mathfrak{F}fraktur_F and permutons. Under this bijection, the uniform topology on 𝔉𝔉\mathfrak{F}fraktur_F becomes the weak topology on permutons treated as probability measures on [0,1]2superscript012[0,1]^{2}[ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT.

When the configurations are restricted by ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT, the normalized height functions belong to a closed (and thus compact) subset of 𝔉𝔉\mathfrak{F}fraktur_F.

Definition 3.3.

Given integers k𝑘kitalic_k, \ellroman_ℓ, a k×𝑘k\times\ellitalic_k × roman_ℓ array I𝐼Iitalic_I filled with 00s and 1s1𝑠1s1 italic_s, and real parameters 0=x0<x1<<xk1<xk=10subscriptx0subscriptx1subscriptx𝑘1subscriptx𝑘10=\mathrm{x}_{0}<\mathrm{x}_{1}<\dots<\mathrm{x}_{k-1}<\mathrm{x}_{k}=10 = roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT < roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1 and 0=y0<y1<<y1<y=10subscripty0subscripty1subscripty1subscripty10=\mathrm{y}_{0}<\mathrm{y}_{1}<\dots<\mathrm{y}_{\ell-1}<\mathrm{y}_{\ell}=10 = roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_y start_POSTSUBSCRIPT roman_ℓ - 1 end_POSTSUBSCRIPT < roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1, we define 𝔉x,y,Isuperscript𝔉xy𝐼\mathfrak{F}^{\mathrm{x},\mathrm{y},I}fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT to be a closed subset of 𝔉𝔉\mathfrak{F}fraktur_F consisting of all functions h𝔉𝔉h\in\mathfrak{F}italic_h ∈ fraktur_F, such that whenever Iuv=0subscript𝐼𝑢𝑣0I_{uv}=0italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 0,

(8) h(xu,yv)+h(xu1,yv1)h(xu,yv1)h(xu1,yv)=0.subscriptx𝑢subscripty𝑣subscriptx𝑢1subscripty𝑣1subscriptx𝑢subscripty𝑣1subscriptx𝑢1subscripty𝑣0h(\mathrm{x}_{u},\mathrm{y}_{v})+h(\mathrm{x}_{u-1},\mathrm{y}_{v-1})-h(% \mathrm{x}_{u},\mathrm{y}_{v-1})-h(\mathrm{x}_{u-1},\mathrm{y}_{v})=0.italic_h ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) + italic_h ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_h ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_h ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) = 0 .
Remark 3.4.

In terms of the corresponding permuton, (8) says that the mass of the square [xu1,xu]×[yv1,yv]subscriptx𝑢1subscriptx𝑢subscripty𝑣1subscripty𝑣[\mathrm{x}_{u-1},\mathrm{x}_{u}]\times[\mathrm{y}_{v-1},\mathrm{y}_{v}][ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ] × [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ] is zero.

Definition 3.5.

We say that the data x0<x1<<xksubscriptx0subscriptx1subscriptx𝑘\mathrm{x}_{0}<\mathrm{x}_{1}<\dots<\mathrm{x}_{k}roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, y0<y1<<ysubscripty0subscripty1subscripty\mathrm{y}_{0}<\mathrm{y}_{1}<\dots<\mathrm{y}_{\ell}roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT, and 0/1010/10 / 1 array I=[Iuv]𝐼delimited-[]subscript𝐼𝑢𝑣I=[I_{uv}]italic_I = [ italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT ] of size k×𝑘k\times\ellitalic_k × roman_ℓ is non-degenerate if there exists a non-negative real array [Buv]delimited-[]subscript𝐵𝑢𝑣[B_{uv}][ italic_B start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT ], such that Buv>0subscript𝐵𝑢𝑣0B_{uv}>0italic_B start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT > 0 if and only if Iuv=1subscript𝐼𝑢𝑣1I_{uv}=1italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1 and v=1Buv=xuxu1superscriptsubscript𝑣1subscript𝐵𝑢𝑣subscriptx𝑢subscriptx𝑢1\sum_{v=1}^{\ell}B_{uv}=\mathrm{x}_{u}-\mathrm{x}_{u-1}∑ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_B start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT, for all 1uk1𝑢𝑘1\leq u\leq k1 ≤ italic_u ≤ italic_k, and u=1kBuv=yvyv1superscriptsubscript𝑢1𝑘subscript𝐵𝑢𝑣subscripty𝑣subscripty𝑣1\sum_{u=1}^{k}B_{uv}=\mathrm{y}_{v}-\mathrm{y}_{v-1}∑ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_B start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT for all 1v1𝑣1\leq v\leq\ell1 ≤ italic_v ≤ roman_ℓ.

Note that (x,y,I)xy𝐼(\mathrm{x},\mathrm{y},I)( roman_x , roman_y , italic_I ) being non-degenerate implies that the set 𝔉x,y,Isuperscript𝔉xy𝐼\mathfrak{F}^{\mathrm{x},\mathrm{y},I}fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT is non-empty. In addition, this condition rules out the cases when the choice of xx\mathrm{x}roman_x and yy\mathrm{y}roman_y implies vanishing of the permuton on some rectangles; in this situation we can always change some elements of I𝐼Iitalic_I to zeros and achieve a non-degenerate situation.333For instance, take k==2𝑘2k=\ell=2italic_k = roman_ℓ = 2, x=y=(0,12,1)xy0121\mathrm{x}=\mathrm{y}=(0,\tfrac{1}{2},1)roman_x = roman_y = ( 0 , divide start_ARG 1 end_ARG start_ARG 2 end_ARG , 1 ) and I=(1011)𝐼matrix1011I=\begin{pmatrix}1&0\\ 1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ). This is a degenerate situation, because uniform marginals conditions implies that the permuton vanishes in the bottom-left square. Hence, we should replace I𝐼Iitalic_I with (1001)matrix1001\begin{pmatrix}1&0\\ 0&1\end{pmatrix}( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) to get a non-degenerate triplet corresponding to exactly the same set 𝔉x,y,Isuperscript𝔉xy𝐼\mathfrak{F}^{\mathrm{x},\mathrm{y},I}fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT. From now on we tacitly assume that (x,y,I)xy𝐼(\mathrm{x},\mathrm{y},I)( roman_x , roman_y , italic_I ) is non-degenerate.

We use the permuton energy of the height functions to compute limit shapes:

Definition 3.6.

Given a function h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT we define its permuton energy to be

(9) 𝔈(h)=0101[hxyln(hxy)rhxhy]dxdy,𝔈superscriptsubscript01superscriptsubscript01delimited-[]subscript𝑥𝑦subscript𝑥𝑦𝑟subscript𝑥subscript𝑦differential-d𝑥differential-d𝑦\mathfrak{E}(h)=\int_{0}^{1}\int_{0}^{1}\bigl{[}h_{xy}\ln(-h_{xy})-rh_{x}h_{y}% \bigr{]}\,\mathrm{d}x\,\mathrm{d}y,fraktur_E ( italic_h ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT [ italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT roman_ln ( - italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ) - italic_r italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ] roman_d italic_x roman_d italic_y ,

(where r𝑟ritalic_r is from (6)) if the integral exists and 𝔈(h)=𝔈\mathfrak{E}(h)=-\inftyfraktur_E ( italic_h ) = - ∞ otherwise. The product hxyln(hxy)subscript𝑥𝑦subscript𝑥𝑦h_{xy}\ln(-h_{xy})italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT roman_ln ( - italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ) is understood as 00 when hxy=0subscript𝑥𝑦0h_{xy}=0italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT = 0.

Using the correspondence, h(x,y)μ([x,1]×[0,y])𝑥𝑦𝜇𝑥10𝑦h(x,y)\leftrightarrow\mu([x,1]\times[0,y])italic_h ( italic_x , italic_y ) ↔ italic_μ ( [ italic_x , 1 ] × [ 0 , italic_y ] ), the first term in (9) is the entropy of the measure μ𝜇\muitalic_μ. A proper way to think about this term is that it is given by the double Lebesgue integral 0101μ(x,y)ln(μ(x,y))dxdysuperscriptsubscript01superscriptsubscript01𝜇𝑥𝑦𝜇𝑥𝑦differential-d𝑥differential-d𝑦-\int_{0}^{1}\int_{0}^{1}\mu(x,y)\ln(\mu(x,y))\,\mathrm{d}x\,\mathrm{d}y- ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_μ ( italic_x , italic_y ) roman_ln ( italic_μ ( italic_x , italic_y ) ) roman_d italic_x roman_d italic_y, whenever μ𝜇\muitalic_μ has a density μ(x,y)𝜇𝑥𝑦\mu(x,y)italic_μ ( italic_x , italic_y ) with respect to the Lebesgue measure on [0,1]2superscript012[0,1]^{2}[ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT; the latter integral is either finite or -\infty- ∞. If μ𝜇\muitalic_μ is not absolutely continuous, then we set the integral to be -\infty- ∞ by definition. As for the second term in (9), since 1hx01subscript𝑥0-1\leq h_{x}\leq 0- 1 ≤ italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ≤ 0 and 0hy10subscript𝑦10\leq h_{y}\leq 10 ≤ italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ≤ 1 by Lipshitz condition, the integral of this term always exists and is upper bounded by |r|𝑟|r|| italic_r | in magnitude.

Theorem 3.7.

In the asymptotic regime of Section 3.1, for any closed non-empty A𝔉x,y,I𝐴superscript𝔉xy𝐼A\subset\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_A ⊂ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT we have

(10) lim supN1Nln(1N!σ𝔖N:hσAqinv(σ))suphA𝔈(h)subscriptlimit-supremum𝑁1𝑁1𝑁subscript:𝜎subscript𝔖𝑁subscript𝜎𝐴superscript𝑞inv𝜎subscriptsupremum𝐴𝔈\limsup_{N\to\infty}\frac{1}{N}\ln\left(\frac{1}{N!}\sum_{\sigma\in\mathfrak{S% }_{N}:\leavevmode\nobreak\ h_{\sigma}\in A}q^{\mathrm{inv}(\sigma)}\right)\leq% \sup_{h\in A}\mathfrak{E}(h)lim sup start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_N end_ARG roman_ln ( divide start_ARG 1 end_ARG start_ARG italic_N ! end_ARG ∑ start_POSTSUBSCRIPT italic_σ ∈ fraktur_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT : italic_h start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ∈ italic_A end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT roman_inv ( italic_σ ) end_POSTSUPERSCRIPT ) ≤ roman_sup start_POSTSUBSCRIPT italic_h ∈ italic_A end_POSTSUBSCRIPT fraktur_E ( italic_h )

and for any non-empty open444We mean here open as a subset of 𝔉x,y,Isuperscript𝔉xy𝐼\mathfrak{F}^{\mathrm{x},\mathrm{y},I}fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT, rather than open as a subset of 𝔉𝔉\mathfrak{F}fraktur_F. A𝔉x,y,I𝐴superscript𝔉xy𝐼A\subset\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_A ⊂ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT, we have

(11) lim infN1Nln(1N!σ𝔖N:hσA)suphA𝔈(h).subscriptlimit-infimum𝑁1𝑁1𝑁subscript:𝜎subscript𝔖𝑁subscript𝜎𝐴subscriptsupremum𝐴𝔈\liminf_{N\to\infty}\frac{1}{N}\ln\left(\frac{1}{N!}\sum_{\sigma\in\mathfrak{S% }_{N}:\leavevmode\nobreak\ h_{\sigma}\in A}\right)\geq\sup_{h\in A}\mathfrak{E% }(h).lim inf start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_N end_ARG roman_ln ( divide start_ARG 1 end_ARG start_ARG italic_N ! end_ARG ∑ start_POSTSUBSCRIPT italic_σ ∈ fraktur_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT : italic_h start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ∈ italic_A end_POSTSUBSCRIPT ) ≥ roman_sup start_POSTSUBSCRIPT italic_h ∈ italic_A end_POSTSUBSCRIPT fraktur_E ( italic_h ) .
Remark 3.8.

Since A𝔉x,y,I𝐴superscript𝔉xy𝐼A\subset\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_A ⊂ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT, the condition hσAsubscript𝜎𝐴h_{\sigma}\in Aitalic_h start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ∈ italic_A guarantees that σ𝜎\sigmaitalic_σ is restricted by ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT. Hence, (10), (11) count asymptotic asymptotics for the probability measures (1) with degenerate weights or (2).

We omit the proof for Theorem 3.7 and only mention that the first term in (9) is the entropy appearing from counting the permutations, while the second term is the logarithm of the weight qinv(σ)superscript𝑞inv𝜎q^{\mathrm{inv}(\sigma)}italic_q start_POSTSUPERSCRIPT roman_inv ( italic_σ ) end_POSTSUPERSCRIPT reexpressed in terms of the height function hσsubscript𝜎h_{\sigma}italic_h start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT. Statements very similar to Theorem 3.7 can be found in [Sta09, Tra08, Muk16, KKRW20, SW18, BDMW24]. Note that in contrast to the variational principle for domino tilings of [CKP01] (domino tilings are equivalent to configurations of the six-vertex model with weights satisfying a2+b2=c2superscript𝑎2superscript𝑏2superscript𝑐2a^{2}+b^{2}=c^{2}italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_b start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_c start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, see [FS06]), the prefactor in front of ln\lnroman_ln in (10),(11) is 1N1𝑁\tfrac{1}{N}divide start_ARG 1 end_ARG start_ARG italic_N end_ARG rather than 1N21superscript𝑁2\tfrac{1}{N^{2}}divide start_ARG 1 end_ARG start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG.

Corollary 3.9 (Variational principle).

In the asymptotic regime of Section 3.1, suppose that the permuton energy 𝔈(h)𝔈\mathfrak{E}(h)fraktur_E ( italic_h ), h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT, has a unique maximizer denoted 𝔥𝔥\mathfrak{h}fraktur_h. Then, with σ𝜎\sigmaitalic_σ distributed according to (1) with degenerate weights or (2), for each ε>0𝜀0\varepsilon>0italic_ε > 0 we have

limNProb(sup0x,y1|hσ(x,y)𝔥(x,y)|<ε)=1.subscript𝑁Probsubscriptsupremumformulae-sequence0𝑥𝑦1subscript𝜎𝑥𝑦𝔥𝑥𝑦𝜀1\lim_{N\to\infty}\mathrm{Prob}\left(\sup_{0\leq x,y\leq 1}|h_{\sigma}(x,y)-% \mathfrak{h}(x,y)|<\varepsilon\right)=1.roman_lim start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT roman_Prob ( roman_sup start_POSTSUBSCRIPT 0 ≤ italic_x , italic_y ≤ 1 end_POSTSUBSCRIPT | italic_h start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ( italic_x , italic_y ) - fraktur_h ( italic_x , italic_y ) | < italic_ε ) = 1 .
Proof.

We apply Theorem 3.7 to the ε𝜀\varepsilonitalic_ε–neighborhood of 𝔥𝔥\mathfrak{h}fraktur_h and its complement. ∎

Remark 3.10.

If we use the interpretation of Section 2.2 in terms of random permutations, then the convexity of I𝐼Iitalic_I is not used in Corollary 3.9.

In the rest of the paper we concentrate on identifying the maximizers of 𝔈(h)𝔈\mathfrak{E}(h)fraktur_E ( italic_h ).

3.3. Maximizers via Euler-Lagrange equations

We start the analysis of the variational problem 𝔈(h)max𝔈\mathfrak{E}(h)\to\maxfraktur_E ( italic_h ) → roman_max, h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT. We don’t assume that I𝐼Iitalic_I is convex unless explicitly stated.

We say that a point (x,y)𝑥𝑦(x,y)( italic_x , italic_y ), 0<x,y<1formulae-sequence0𝑥𝑦10<x,y<10 < italic_x , italic_y < 1, is strictly inside the (rescaled) domain Ωx,y,IsuperscriptΩxy𝐼\Omega^{\mathrm{x},\mathrm{y},I}roman_Ω start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT, if xu1<x<xu,subscriptx𝑢1𝑥subscriptx𝑢\mathrm{x}_{u-1}<x<\mathrm{x}_{u},roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT < italic_x < roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , and yv1<y<yvsubscripty𝑣1𝑦subscripty𝑣\mathrm{y}_{v-1}<y<\mathrm{y}_{v}roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT < italic_y < roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT for some (u,v)𝑢𝑣(u,v)( italic_u , italic_v ) such that Iuv=1subscript𝐼𝑢𝑣1I_{uv}=1italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1. Note that we prohibit the end-points x=xu𝑥subscript𝑥𝑢x=x_{u}italic_x = italic_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, y=yv𝑦subscript𝑦𝑣y=y_{v}italic_y = italic_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT, etc.

Proposition 3.11 (Four-point relation).

Let h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT be a maximizer of 𝔈𝔈\mathfrak{E}fraktur_E and set g=hxy𝑔subscript𝑥𝑦g=-h_{xy}italic_g = - italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT to be the density of the corresponding permuton. Take 0<x1<x2<10subscript𝑥1subscript𝑥210<x_{1}<x_{2}<10 < italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < 1, 0<y1<y2<10subscript𝑦1subscript𝑦210<y_{1}<y_{2}<10 < italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < 1, such that the four points (x1,y1)subscript𝑥1subscript𝑦1(x_{1},y_{1})( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ), (x1,y2)subscript𝑥1subscript𝑦2(x_{1},y_{2})( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), (x2,y2)subscript𝑥2subscript𝑦2(x_{2},y_{2})( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), (x2,y1)subscript𝑥2subscript𝑦1(x_{2},y_{1})( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) are strictly inside Ωx,y,IsuperscriptΩxy𝐼\Omega^{\mathrm{x},\mathrm{y},I}roman_Ω start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT and g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) is continuous and positive at these points. Then

(12) ln(g(x1,y1)g(x2,y2)g(x1,y2)g(x2,y1))2ry1y2x1x2g(x,y)dxdy=0.𝑔subscript𝑥1subscript𝑦1𝑔subscript𝑥2subscript𝑦2𝑔subscript𝑥1subscript𝑦2𝑔subscript𝑥2subscript𝑦12𝑟superscriptsubscriptsubscript𝑦1subscript𝑦2superscriptsubscriptsubscript𝑥1subscript𝑥2𝑔𝑥𝑦differential-d𝑥differential-d𝑦0\ln\left(\frac{g(x_{1},y_{1})g(x_{2},y_{2})}{g(x_{1},y_{2})g(x_{2},y_{1})}% \right)-2r\int_{y_{1}}^{y_{2}}\int_{x_{1}}^{x_{2}}g(x,y)\,\mathrm{d}x\,\mathrm% {d}y=0.roman_ln ( divide start_ARG italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG ) - 2 italic_r ∫ start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_x roman_d italic_y = 0 .
Proof.

We choose two small constants ε𝜀\varepsilon\in\mathbb{R}italic_ε ∈ blackboard_R and δ>0𝛿0\delta>0italic_δ > 0, consider four squares of side-length 2δ2𝛿2\delta2 italic_δ centered at (x1,y1)subscript𝑥1subscript𝑦1(x_{1},y_{1})( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ), (x1,y2)subscript𝑥1subscript𝑦2(x_{1},y_{2})( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), (x2,y2)subscript𝑥2subscript𝑦2(x_{2},y_{2})( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), (x2,y1)subscript𝑥2subscript𝑦1(x_{2},y_{1})( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ), add ε𝜀\varepsilonitalic_ε to the values of g𝑔gitalic_g in the first and third one and subtract ε𝜀\varepsilonitalic_ε from the values in the second and fourth ones. Note that the boundary conditions on hhitalic_h are unchanged under this transformation and let us compute the change in the double integral expression (9) for the energy 𝔈(h)𝔈\mathfrak{E}(h)fraktur_E ( italic_h ).

Differentiating the first term in the double integral in g=hxy𝑔subscript𝑥𝑦g=-h_{xy}italic_g = - italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT, we conclude that it changes by

(13) 4δ2ε[lng(x1,y1)+lng(x2,y2)lng(x1,y2)lng(x2,y1)+o(1)],4superscript𝛿2𝜀delimited-[]𝑔subscript𝑥1subscript𝑦1𝑔subscript𝑥2subscript𝑦2𝑔subscript𝑥1subscript𝑦2𝑔subscript𝑥2subscript𝑦1𝑜1-4\delta^{2}\varepsilon\bigl{[}\ln g(x_{1},y_{1})+\ln g(x_{2},y_{2})-\ln g(x_{% 1},y_{2})-\ln g(x_{2},y_{1})+o(1)\bigr{]},- 4 italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ε [ roman_ln italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + roman_ln italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - roman_ln italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - roman_ln italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + italic_o ( 1 ) ] ,

as ε,δ0𝜀𝛿0\varepsilon,\delta\to 0italic_ε , italic_δ → 0. For the second term, we notice that

hx=0yg(x,t)dt,hy=x1g(s,y)ds.formulae-sequencesubscript𝑥superscriptsubscript0𝑦𝑔𝑥𝑡differential-d𝑡subscript𝑦superscriptsubscript𝑥1𝑔𝑠𝑦differential-d𝑠h_{x}=-\int_{0}^{y}g(x,t)\,\mathrm{d}t,\qquad h_{y}=\int_{x}^{1}g(s,y)\,% \mathrm{d}s.italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT = - ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT italic_g ( italic_x , italic_t ) roman_d italic_t , italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT = ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g ( italic_s , italic_y ) roman_d italic_s .

Hence, the integrand in the second term changes only in δ𝛿\deltaitalic_δ–neighborhood of the (x1,y1)(x1,y2)(x2,y2)(x2,y1)subscript𝑥1subscript𝑦1subscript𝑥1subscript𝑦2subscript𝑥2subscript𝑦2subscript𝑥2subscript𝑦1(x_{1},y_{1})-(x_{1},y_{2})-(x_{2},y_{2})-(x_{2},y_{1})( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) rectangle. Inside the corner four squares of side-length 2δ2𝛿2\delta2 italic_δ, hxsubscript𝑥h_{x}italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT and hysubscript𝑦h_{y}italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT change by O(εδ)𝑂𝜀𝛿O(\varepsilon\delta)italic_O ( italic_ε italic_δ ). Therefore, the total change in the part of the double integral where we integrated over these squares is O(εδ3)𝑂𝜀superscript𝛿3O(\varepsilon\delta^{3})italic_O ( italic_ε italic_δ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ). This term will be negligible eventually. More important is the change of hxsubscript𝑥h_{x}italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT and hysubscript𝑦h_{y}italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT in the δ𝛿\deltaitalic_δ–neighborhood of the sides. This change is:

2δx1x2(2rεδhx(x,y1)2rεδhx(x,y2))dx+2δy1y2(2rεδhy(x1,y)2rεδhy(x2,y))dy+o(εδ2),2𝛿superscriptsubscriptsubscript𝑥1subscript𝑥22𝑟𝜀𝛿subscript𝑥𝑥subscript𝑦12𝑟𝜀𝛿subscript𝑥𝑥subscript𝑦2differential-d𝑥2𝛿superscriptsubscriptsubscript𝑦1subscript𝑦22𝑟𝜀𝛿subscript𝑦subscript𝑥1𝑦2𝑟𝜀𝛿subscript𝑦subscript𝑥2𝑦differential-d𝑦𝑜𝜀superscript𝛿22\delta\int_{x_{1}}^{x_{2}}(2r\varepsilon\delta h_{x}(x,y_{1})-2r\varepsilon% \delta h_{x}(x,y_{2}))\,\mathrm{d}x\\ +2\delta\int_{y_{1}}^{y_{2}}(2r\varepsilon\delta h_{y}(x_{1},y)-2r\varepsilon% \delta h_{y}(x_{2},y))\,\mathrm{d}y+o(\varepsilon\delta^{2}),2 italic_δ ∫ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( 2 italic_r italic_ε italic_δ italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_x , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - 2 italic_r italic_ε italic_δ italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_x , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ) roman_d italic_x + 2 italic_δ ∫ start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( 2 italic_r italic_ε italic_δ italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y ) - 2 italic_r italic_ε italic_δ italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y ) ) roman_d italic_y + italic_o ( italic_ε italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,

which simplifies to

(14) 4rδ2ε(x1x2y1y2g(x,y)dydx+y1y2x1x2g(x,y)dxdy+o(1))=8rδ2ε(x1x2y1y2g(x,y)dydx+o(1)).4𝑟superscript𝛿2𝜀superscriptsubscriptsubscript𝑥1subscript𝑥2superscriptsubscriptsubscript𝑦1subscript𝑦2𝑔𝑥𝑦differential-d𝑦differential-d𝑥superscriptsubscriptsubscript𝑦1subscript𝑦2superscriptsubscriptsubscript𝑥1subscript𝑥2𝑔𝑥𝑦differential-d𝑥differential-d𝑦𝑜18𝑟superscript𝛿2𝜀superscriptsubscriptsubscript𝑥1subscript𝑥2superscriptsubscriptsubscript𝑦1subscript𝑦2𝑔𝑥𝑦differential-d𝑦differential-d𝑥𝑜14r\delta^{2}\varepsilon\left(\int_{x_{1}}^{x_{2}}\int_{y_{1}}^{y_{2}}g(x,y)\,% \mathrm{d}y\,\mathrm{d}x+\int_{y_{1}}^{y_{2}}\int_{x_{1}}^{x_{2}}g(x,y)\,% \mathrm{d}x\,\mathrm{d}y+o(1)\right)\\ =8r\delta^{2}\varepsilon\left(\int_{x_{1}}^{x_{2}}\int_{y_{1}}^{y_{2}}g(x,y)\,% \mathrm{d}y\,\mathrm{d}x+o(1)\right).start_ROW start_CELL 4 italic_r italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ε ( ∫ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_y roman_d italic_x + ∫ start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_x roman_d italic_y + italic_o ( 1 ) ) end_CELL end_ROW start_ROW start_CELL = 8 italic_r italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ε ( ∫ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_y roman_d italic_x + italic_o ( 1 ) ) . end_CELL end_ROW

The total change is the sum of (13) and (14). Since ε𝜀\varepsilonitalic_ε can be of both signs, the sum should be zero in the leading order, as otherwise hhitalic_h is not a maximizer. This gives the equation

4δ2ε[lng(x1,y1)+lng(x2,y2)lng(x1,y2)lng(x2,y1)]+8rδ2ε(x1x2y1y2g(x,y)dydx)=0.4superscript𝛿2𝜀delimited-[]𝑔subscript𝑥1subscript𝑦1𝑔subscript𝑥2subscript𝑦2𝑔subscript𝑥1subscript𝑦2𝑔subscript𝑥2subscript𝑦18𝑟superscript𝛿2𝜀superscriptsubscriptsubscript𝑥1subscript𝑥2superscriptsubscriptsubscript𝑦1subscript𝑦2𝑔𝑥𝑦differential-d𝑦differential-d𝑥0-4\delta^{2}\varepsilon\bigl{[}\ln g(x_{1},y_{1})+\ln g(x_{2},y_{2})-\ln g(x_{% 1},y_{2})-\ln g(x_{2},y_{1})\bigr{]}\\ +8r\delta^{2}\varepsilon\left(\int_{x_{1}}^{x_{2}}\int_{y_{1}}^{y_{2}}g(x,y)\,% \mathrm{d}y\,\mathrm{d}x\right)=0.\qedstart_ROW start_CELL - 4 italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ε [ roman_ln italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + roman_ln italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - roman_ln italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - roman_ln italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ] end_CELL end_ROW start_ROW start_CELL + 8 italic_r italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ε ( ∫ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_y roman_d italic_x ) = 0 . italic_∎ end_CELL end_ROW

Proposition 3.11 applies only at the points where g𝑔gitalic_g is positive. Yet, in all the examples we checked, g𝑔gitalic_g is actually positive everywhere inside the domain. We state this as a conjecture.

Conjecture 3.12.

Suppose that the data is non-degenerate in the sense of Definition 12. Let h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT be a maximizer of 𝔈𝔈\mathfrak{E}fraktur_E and set g=hxy𝑔subscript𝑥𝑦g=-h_{xy}italic_g = - italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT to be the density of the corresponding permuton. Then there exist constants 0<c<C0𝑐𝐶0<c<C0 < italic_c < italic_C (which depend on all the data) such that c<g(x,y)<C𝑐𝑔𝑥𝑦𝐶c<g(x,y)<Citalic_c < italic_g ( italic_x , italic_y ) < italic_C for almost all points (x,y)Ωx,y,I𝑥𝑦superscriptΩxy𝐼(x,y)\in\Omega^{\mathrm{x},\mathrm{y},I}( italic_x , italic_y ) ∈ roman_Ω start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT.

Remark 3.13.

In contrast to our c0𝑐0c\to 0italic_c → 0 situation, for the six-vertex model with generic parameters a,b,c𝑎𝑏𝑐a,b,citalic_a , italic_b , italic_c positivity of the density of the c𝑐citalic_c–type vertices does not hold everywhere. Instead, there are interesting arctic curves bounding the regions with no c𝑐citalic_c–type vertices, see e.g. [JPS98, CP10, Agg20a].

The four-point relation (12) is the Euler-Lagrange equation for the variational problem 𝔈(h)max𝔈\mathfrak{E}(h)\to\maxfraktur_E ( italic_h ) → roman_max, h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT. In general, this variational problem is not convex, so additional care is required to identify the solutions of the Euler-Lagrange equation with the maximizers. However, for small r𝑟ritalic_r we can still use convexity ideas. For that we define the concept of a smooth solution.

Definition 3.14.

We call a function h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT a smooth solution to the EL-equations of class [c,C]𝑐𝐶[c,C][ italic_c , italic_C ], if there exists another (measurable) function g:[0,1]20:𝑔superscript012subscriptabsent0g:[0,1]^{2}\to\mathbb{R}_{\geq 0}italic_g : [ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT → blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT, such that:

  1. (1)

    h(x,y)=0yx1g(s,t)𝑑s𝑑t,0x,y1.formulae-sequence𝑥𝑦superscriptsubscript0𝑦superscriptsubscript𝑥1𝑔𝑠𝑡differential-d𝑠differential-d𝑡formulae-sequence0𝑥𝑦1\displaystyle h(x,y)=\int_{0}^{y}\int_{x}^{1}g(s,t)\,ds\,dt,\qquad 0\leq x,y% \leq 1.italic_h ( italic_x , italic_y ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g ( italic_s , italic_t ) italic_d italic_s italic_d italic_t , 0 ≤ italic_x , italic_y ≤ 1 .

  2. (2)

    For each 1uk1𝑢𝑘1\leq u\leq k1 ≤ italic_u ≤ italic_k, 1vl1𝑣𝑙1\leq v\leq l1 ≤ italic_v ≤ italic_l, the restriction of g𝑔gitalic_g on the open rectangle (xu1,xu)×(yv1,yv)subscriptx𝑢1subscriptx𝑢subscripty𝑣1subscripty𝑣(\mathrm{x}_{u-1},\mathrm{x}_{u})\times(\mathrm{y}_{v-1},\mathrm{y}_{v})( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) × ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) has a continuous extension guvsuperscript𝑔𝑢𝑣g^{uv}italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT to the closed rectangle [xu1,xu]×[yv1,yv]subscriptx𝑢1subscriptx𝑢subscripty𝑣1subscripty𝑣[\mathrm{x}_{u-1},\mathrm{x}_{u}]\times[\mathrm{y}_{v-1},\mathrm{y}_{v}][ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ] × [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ].

  3. (3)

    If Iu,v=0subscript𝐼𝑢𝑣0I_{u,v}=0italic_I start_POSTSUBSCRIPT italic_u , italic_v end_POSTSUBSCRIPT = 0, then guv(x,y)=0superscript𝑔𝑢𝑣𝑥𝑦0g^{uv}(x,y)=0italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 0.
    If Iu,v=1subscript𝐼𝑢𝑣1I_{u,v}=1italic_I start_POSTSUBSCRIPT italic_u , italic_v end_POSTSUBSCRIPT = 1, then cguv(x,y)C𝑐superscript𝑔𝑢𝑣𝑥𝑦𝐶c\leq g^{uv}(x,y)\leq Citalic_c ≤ italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) ≤ italic_C for all xu1xxusubscriptx𝑢1𝑥subscriptx𝑢\mathrm{x}_{u-1}\leq x\leq\mathrm{x}_{u}roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ≤ italic_x ≤ roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, yv1yyvsubscripty𝑣1𝑦subscripty𝑣\mathrm{y}_{v-1}\leq y\leq\mathrm{y}_{v}roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ≤ italic_y ≤ roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT.

  4. (4)

    For each 0<x1x2<10subscript𝑥1subscript𝑥210<x_{1}\leq x_{2}<10 < italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < 1, 0<y1y2<10subscript𝑦1subscript𝑦210<y_{1}\leq y_{2}<10 < italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < 1, we have

    (15) ln(g(x1,y1)g(x2+,y2+)g(x1,y2+)g(x2+,y1))2ry1y2x1x2g(x,y)𝑑x𝑑y=0,𝑔limit-fromsubscript𝑥1limit-fromsubscript𝑦1𝑔limit-fromsubscript𝑥2limit-fromsubscript𝑦2𝑔limit-fromsubscript𝑥1limit-fromsubscript𝑦2𝑔limit-fromsubscript𝑥2limit-fromsubscript𝑦12𝑟superscriptsubscriptsubscript𝑦1subscript𝑦2superscriptsubscriptsubscript𝑥1subscript𝑥2𝑔𝑥𝑦differential-d𝑥differential-d𝑦0\ln\left(\frac{g(x_{1}-,y_{1}-)g(x_{2}+,y_{2}+)}{g(x_{1}-,y_{2}+)g(x_{2}+,y_{1% }-)}\right)-2r\int_{y_{1}}^{y_{2}}\int_{x_{1}}^{x_{2}}g(x,y)\,dx\,dy=0,roman_ln ( divide start_ARG italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - ) italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ) end_ARG start_ARG italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ) italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - ) end_ARG ) - 2 italic_r ∫ start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) italic_d italic_x italic_d italic_y = 0 ,

    where the ±plus-or-minus\pm± signs are the left and right limits, e.g. g(x1,y1)=limε0+g(x1ε,y1ε)𝑔limit-fromsubscript𝑥1limit-fromsubscript𝑦1subscript𝜀limit-from0𝑔subscript𝑥1𝜀subscript𝑦1𝜀g(x_{1}-,y_{1}-)=\lim\limits_{\varepsilon\to 0+}g(x_{1}-\varepsilon,y_{1}-\varepsilon)italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - ) = roman_lim start_POSTSUBSCRIPT italic_ε → 0 + end_POSTSUBSCRIPT italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_ε , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_ε ).

Note that we include the cases of degenerate rectangles x1=x2subscript𝑥1subscript𝑥2x_{1}=x_{2}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and/or y1=y2subscript𝑦1subscript𝑦2y_{1}=y_{2}italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in (15) above.

Theorem 3.15.

Fix two real constants 0<c<C0𝑐𝐶0<c<C0 < italic_c < italic_C, array I𝐼Iitalic_I, and real parameters 0=x0<x1<<xk1<xk=10subscriptx0subscriptx1subscriptx𝑘1subscriptx𝑘10=\mathrm{x}_{0}<\mathrm{x}_{1}<\dots<\mathrm{x}_{k-1}<\mathrm{x}_{k}=10 = roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT < roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1 and 0=y0<y1<<y1<y=10subscripty0subscripty1subscripty1subscripty10=\mathrm{y}_{0}<\mathrm{y}_{1}<\dots<\mathrm{y}_{\ell-1}<\mathrm{y}_{\ell}=10 = roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_y start_POSTSUBSCRIPT roman_ℓ - 1 end_POSTSUBSCRIPT < roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1. There exists a real constant r0>0subscript𝑟00r_{0}>0italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0 (depending on all the data from the previous sentence), such that for all r𝑟ritalic_r satisfying |r|<r0𝑟subscript𝑟0|r|<r_{0}| italic_r | < italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, if h^𝔉x,y,I^superscript𝔉xy𝐼\hat{h}\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}over^ start_ARG italic_h end_ARG ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT is a smooth solution to the EL equations of class [c,C]𝑐𝐶[c,C][ italic_c , italic_C ], then h^^\hat{h}over^ start_ARG italic_h end_ARG is the unique solution of the variational problem 𝔈(h)max𝔈\mathfrak{E}(h)\to\maxfraktur_E ( italic_h ) → roman_max, h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT.

Our computed examples of Sections 4 and 5 are all smooth, and hence unique solutions for small |r|𝑟|r|| italic_r | by this result.

Remark 3.16.

It is plausible that r0=+subscript𝑟0r_{0}=+\inftyitalic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = + ∞ in many situations. For instance, for the unrestricted Mallows measure (corresponding to k==1𝑘1k=\ell=1italic_k = roman_ℓ = 1, I=(1)𝐼1I=(1)italic_I = ( 1 ) case) this is known to be true, see [Sta09, SW18].

Proof of Theorem 3.15.

We fix r𝑟ritalic_r and a parameter ε>0𝜀0\varepsilon>0italic_ε > 0. We consider the function h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT given by

h(x,y)=h^(x,y)+εf(x,y).𝑥𝑦^𝑥𝑦𝜀𝑓𝑥𝑦h(x,y)=\hat{h}(x,y)+\varepsilon f(x,y).italic_h ( italic_x , italic_y ) = over^ start_ARG italic_h end_ARG ( italic_x , italic_y ) + italic_ε italic_f ( italic_x , italic_y ) .

Our task is to show that whenever f𝑓fitalic_f is not identical zero, we have

(16) 𝔈(h)<𝔈(h^).𝔈𝔈^\mathfrak{E}(h)<\mathfrak{E}(\hat{h}).fraktur_E ( italic_h ) < fraktur_E ( over^ start_ARG italic_h end_ARG ) .

We assume without loss of generality that hxysubscript𝑥𝑦-h_{xy}- italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT (the density of the permuton corresponding to hhitalic_h) is well-defined as a density of an absolutely continuous probability measure; otherwise 𝔈(h)=𝔈\mathfrak{E}(h)=-\inftyfraktur_E ( italic_h ) = - ∞ and (16) is clear.

We expand in ε𝜀\varepsilonitalic_ε the integrand in the permuton energy (9):

(17) (h^xy+εfxy)ln(h^xyεfxy)r(h^x+εfx)(h^y+εfy)=h^xyln(h^xy)rh^xh^y+ε[fxyln(h^xy)+h^xyfxyh^xyrh^xfyrfxh^y]+ε2[h^xyln(1+εfxyh^xy)εfxyh^xyε2+fxyln(1+εfxyh^xy)εrfxfy].subscript^𝑥𝑦𝜀subscript𝑓𝑥𝑦subscript^𝑥𝑦𝜀subscript𝑓𝑥𝑦𝑟subscript^𝑥𝜀subscript𝑓𝑥subscript^𝑦𝜀subscript𝑓𝑦subscript^𝑥𝑦subscript^𝑥𝑦𝑟subscript^𝑥subscript^𝑦𝜀delimited-[]subscript𝑓𝑥𝑦subscript^𝑥𝑦subscript^𝑥𝑦subscript𝑓𝑥𝑦subscript^𝑥𝑦𝑟subscript^𝑥subscript𝑓𝑦𝑟subscript𝑓𝑥subscript^𝑦superscript𝜀2delimited-[]subscript^𝑥𝑦1𝜀subscript𝑓𝑥𝑦subscript^𝑥𝑦𝜀subscript𝑓𝑥𝑦subscript^𝑥𝑦superscript𝜀2subscript𝑓𝑥𝑦1𝜀subscript𝑓𝑥𝑦subscript^𝑥𝑦𝜀𝑟subscript𝑓𝑥subscript𝑓𝑦\bigl{(}\hat{h}_{xy}+\varepsilon f_{xy}\bigr{)}\ln\bigl{(}-\hat{h}_{xy}-% \varepsilon f_{xy}\bigr{)}-r\bigl{(}\hat{h}_{x}+\varepsilon f_{x}\bigr{)}\bigl% {(}\hat{h}_{y}+\varepsilon f_{y}\bigr{)}\\ =\hat{h}_{xy}\ln\bigl{(}-\hat{h}_{xy}\bigr{)}-r\hat{h}_{x}\hat{h}_{y}+% \varepsilon\left[f_{xy}\ln\bigl{(}-\hat{h}_{xy}\bigr{)}+\hat{h}_{xy}\cdot\frac% {f_{xy}}{\hat{h}_{xy}}-r\hat{h}_{x}f_{y}-rf_{x}\hat{h}_{y}\right]\\ +\varepsilon^{2}\left[\hat{h}_{xy}\frac{\ln\left(1+\varepsilon\frac{f_{xy}}{% \hat{h}_{xy}}\right)-\varepsilon\frac{f_{xy}}{\hat{h}_{xy}}}{\varepsilon^{2}}+% f_{xy}\frac{\ln\left(1+\varepsilon\frac{f_{xy}}{\hat{h}_{xy}}\right)}{% \varepsilon}-rf_{x}f_{y}\right].start_ROW start_CELL ( over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT + italic_ε italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ) roman_ln ( - over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT - italic_ε italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ) - italic_r ( over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT + italic_ε italic_f start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) ( over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT + italic_ε italic_f start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL = over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT roman_ln ( - over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ) - italic_r over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT + italic_ε [ italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT roman_ln ( - over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ) + over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ⋅ divide start_ARG italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG - italic_r over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT - italic_r italic_f start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ] end_CELL end_ROW start_ROW start_CELL + italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT [ over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT divide start_ARG roman_ln ( 1 + italic_ε divide start_ARG italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG ) - italic_ε divide start_ARG italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT divide start_ARG roman_ln ( 1 + italic_ε divide start_ARG italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG ) end_ARG start_ARG italic_ε end_ARG - italic_r italic_f start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ] . end_CELL end_ROW

Note that the 0101superscriptsubscript01superscriptsubscript01\int_{0}^{1}\int_{0}^{1}∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT integral of the term ε[]𝜀delimited-[]\varepsilon\bigl{[}\cdot\bigr{]}italic_ε [ ⋅ ] vanishes, because h^^\hat{h}over^ start_ARG italic_h end_ARG is a solution to the Euler–Lagrange equations: we provide more detail in Lemma 3.18 below. Hence, we are interested in ε2[]superscript𝜀2delimited-[]\varepsilon^{2}\bigl{[}\cdot\bigr{]}italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT [ ⋅ ] term and we would like to show that its 0101superscriptsubscript01superscriptsubscript01\int_{0}^{1}\int_{0}^{1}∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT integral is always negative. The particular value of ε𝜀\varepsilonitalic_ε is not important (it can be absorbed into the definition of f𝑓fitalic_f) and we will set ε=1𝜀1\varepsilon=1italic_ε = 1 from now on. Integrating by parts in the rfxfy𝑟subscript𝑓𝑥subscript𝑓𝑦-rf_{x}f_{y}- italic_r italic_f start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT term and noticing that boundary terms vanish because hhitalic_h and h^^\hat{h}over^ start_ARG italic_h end_ARG satisfy the same boundary conditions, we want to prove that

(18) 0101𝒮[2]dxdy<?0,𝒮[2]:=h^xy(1+fxyh^xy)ln(1+fxyh^xy)fxy+rffxy.formulae-sequencesuperscript?superscriptsubscript01superscriptsubscript01superscript𝒮delimited-[]2differential-d𝑥differential-d𝑦0assignsuperscript𝒮delimited-[]2subscript^𝑥𝑦1subscript𝑓𝑥𝑦subscript^𝑥𝑦1subscript𝑓𝑥𝑦subscript^𝑥𝑦subscript𝑓𝑥𝑦𝑟𝑓subscript𝑓𝑥𝑦\int_{0}^{1}\int_{0}^{1}{\mathcal{S}}^{[2]}\mathrm{d}x\mathrm{d}y\stackrel{{% \scriptstyle?}}{{<}}0,\qquad{\mathcal{S}}^{[2]}:=\hat{h}_{xy}\left(1+\frac{f_{% xy}}{\hat{h}_{xy}}\right)\ln\left(1+\frac{f_{xy}}{\hat{h}_{xy}}\right)-f_{xy}+% rff_{xy}.∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT caligraphic_S start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT roman_d italic_x roman_d italic_y start_RELOP SUPERSCRIPTOP start_ARG < end_ARG start_ARG ? end_ARG end_RELOP 0 , caligraphic_S start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT := over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG ) roman_ln ( 1 + divide start_ARG italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG ) - italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT + italic_r italic_f italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT .

Let us denote

(19) ν=fxyh^xy,𝜈subscript𝑓𝑥𝑦subscript^𝑥𝑦\nu=\frac{f_{xy}}{\hat{h}_{xy}},italic_ν = divide start_ARG italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT end_ARG ,

so that

(20) f(x,y)=0yx1ν(s,t)h^xy(s,t)𝑑s𝑑t,0x,y1.formulae-sequence𝑓𝑥𝑦superscriptsubscript0𝑦superscriptsubscript𝑥1𝜈𝑠𝑡subscript^𝑥𝑦𝑠𝑡differential-d𝑠differential-d𝑡formulae-sequence0𝑥𝑦1f(x,y)=-\int_{0}^{y}\int_{x}^{1}\nu(s,t)\hat{h}_{xy}(s,t)\,ds\,dt,\qquad 0\leq x% ,y\leq 1.italic_f ( italic_x , italic_y ) = - ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_ν ( italic_s , italic_t ) over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ( italic_s , italic_t ) italic_d italic_s italic_d italic_t , 0 ≤ italic_x , italic_y ≤ 1 .

and

𝒮[2]=h^xy[(1+ν)ln(1+ν)ν+rfν].superscript𝒮delimited-[]2subscript^𝑥𝑦delimited-[]1𝜈1𝜈𝜈𝑟𝑓𝜈{\mathcal{S}}^{[2]}=\hat{h}_{xy}\bigl{[}\left(1+\nu\right)\ln\left(1+\nu\right% )-\nu+rf\nu\bigr{]}.caligraphic_S start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT = over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT [ ( 1 + italic_ν ) roman_ln ( 1 + italic_ν ) - italic_ν + italic_r italic_f italic_ν ] .

We will use the following statement whose proof we postpone:

Lemma 3.17.

We have

|0101f(x,y)ν(x,y)dxdy|Cν2𝟏|ν|1/2𝑑x𝑑y+4Cc|ν|𝟏|ν|>1/2𝑑x𝑑y.superscriptsubscript01superscriptsubscript01𝑓𝑥𝑦𝜈𝑥𝑦differential-d𝑥differential-d𝑦𝐶double-integralsuperscript𝜈2subscript1𝜈12differential-d𝑥differential-d𝑦4𝐶𝑐double-integral𝜈subscript1𝜈12differential-d𝑥differential-d𝑦\left|\int_{0}^{1}\int_{0}^{1}f(x,y)\nu(x,y)\,\mathrm{d}x\,\mathrm{d}y\right|% \leq C\iint\nu^{2}\mathbf{1}_{|\nu|\leq 1/2}\,dx\,dy+\frac{4C}{c}\iint|\nu|% \mathbf{1}_{|\nu|>1/2}\,dx\,dy.| ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_f ( italic_x , italic_y ) italic_ν ( italic_x , italic_y ) roman_d italic_x roman_d italic_y | ≤ italic_C ∬ italic_ν start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_1 start_POSTSUBSCRIPT | italic_ν | ≤ 1 / 2 end_POSTSUBSCRIPT italic_d italic_x italic_d italic_y + divide start_ARG 4 italic_C end_ARG start_ARG italic_c end_ARG ∬ | italic_ν | bold_1 start_POSTSUBSCRIPT | italic_ν | > 1 / 2 end_POSTSUBSCRIPT italic_d italic_x italic_d italic_y .

Simultaneously, we use of the following elementary bound, valid for all z>1𝑧1z>-1italic_z > - 1 (and in which we have not tried to make the constants optimal):

(1+z)ln(1+z)zz24𝟏|z|1/2+|z|10𝟏|z|>1/2.1𝑧1𝑧𝑧superscript𝑧24subscript1𝑧12𝑧10subscript1𝑧12(1+z)\ln(1+z)-z\geq\frac{z^{2}}{4}\mathbf{1}_{|z|\leq 1/2}+\frac{|z|}{10}% \mathbf{1}_{|z|>1/2}.( 1 + italic_z ) roman_ln ( 1 + italic_z ) - italic_z ≥ divide start_ARG italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 4 end_ARG bold_1 start_POSTSUBSCRIPT | italic_z | ≤ 1 / 2 end_POSTSUBSCRIPT + divide start_ARG | italic_z | end_ARG start_ARG 10 end_ARG bold_1 start_POSTSUBSCRIPT | italic_z | > 1 / 2 end_POSTSUBSCRIPT .

Therefore, noting that h^xysubscript^𝑥𝑦\hat{h}_{xy}over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT is negative and cabsent𝑐\leq-c≤ - italic_c, we get

(21) 0101S[2]𝑑x𝑑yc0101[ν(x,y)24𝟏|ν|1/2+|ν(x,y)|10𝟏|ν|>1/2]𝑑x𝑑y+|r|Cν(x,y)2𝟏|ν|1/2𝑑x𝑑y+|r|4Cc|ν(x,y)|𝟏|ν|>1/2𝑑x𝑑y.superscriptsubscript01superscriptsubscript01superscript𝑆delimited-[]2differential-d𝑥differential-d𝑦𝑐superscriptsubscript01superscriptsubscript01delimited-[]𝜈superscript𝑥𝑦24subscript1𝜈12𝜈𝑥𝑦10subscript1𝜈12differential-d𝑥differential-d𝑦𝑟𝐶double-integral𝜈superscript𝑥𝑦2subscript1𝜈12differential-d𝑥differential-d𝑦𝑟4𝐶𝑐double-integral𝜈𝑥𝑦subscript1𝜈12differential-d𝑥differential-d𝑦\int_{0}^{1}\int_{0}^{1}S^{[2]}\,dx\,dy\leq-c\int_{0}^{1}\int_{0}^{1}\left[% \frac{\nu(x,y)^{2}}{4}\mathbf{1}_{|\nu|\leq 1/2}+\frac{|\nu(x,y)|}{10}\mathbf{% 1}_{|\nu|>1/2}\right]\,dx\,dy\\ +|r|C\iint\nu(x,y)^{2}\mathbf{1}_{|\nu|\leq 1/2}\,dx\,dy+|r|\frac{4C}{c}\iint|% \nu(x,y)|\mathbf{1}_{|\nu|>1/2}\,dx\,dy.start_ROW start_CELL ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_S start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT italic_d italic_x italic_d italic_y ≤ - italic_c ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT [ divide start_ARG italic_ν ( italic_x , italic_y ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 4 end_ARG bold_1 start_POSTSUBSCRIPT | italic_ν | ≤ 1 / 2 end_POSTSUBSCRIPT + divide start_ARG | italic_ν ( italic_x , italic_y ) | end_ARG start_ARG 10 end_ARG bold_1 start_POSTSUBSCRIPT | italic_ν | > 1 / 2 end_POSTSUBSCRIPT ] italic_d italic_x italic_d italic_y end_CELL end_ROW start_ROW start_CELL + | italic_r | italic_C ∬ italic_ν ( italic_x , italic_y ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_1 start_POSTSUBSCRIPT | italic_ν | ≤ 1 / 2 end_POSTSUBSCRIPT italic_d italic_x italic_d italic_y + | italic_r | divide start_ARG 4 italic_C end_ARG start_ARG italic_c end_ARG ∬ | italic_ν ( italic_x , italic_y ) | bold_1 start_POSTSUBSCRIPT | italic_ν | > 1 / 2 end_POSTSUBSCRIPT italic_d italic_x italic_d italic_y . end_CELL end_ROW

Given the values of C𝐶Citalic_C and c𝑐citalic_c, we can choose r1=r1(C,c)subscript𝑟1subscript𝑟1𝐶𝑐r_{1}=r_{1}(C,c)italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_C , italic_c ), so that the last expression is negative whenever |r|<r1𝑟subscript𝑟1|r|<r_{1}| italic_r | < italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and ν𝜈\nuitalic_ν is different from identical zero. ∎

Proof of Lemma 3.17.

Using (20), we have

(22) |0101f(x,y)ν(x,y)𝑑x𝑑y|C01010yx1|ν(s,t)||ν(x,y)|𝑑s𝑑t𝑑x𝑑ysuperscriptsubscript01superscriptsubscript01𝑓𝑥𝑦𝜈𝑥𝑦differential-d𝑥differential-d𝑦𝐶superscriptsubscript01superscriptsubscript01superscriptsubscript0𝑦superscriptsubscript𝑥1𝜈𝑠𝑡𝜈𝑥𝑦differential-d𝑠differential-d𝑡differential-d𝑥differential-d𝑦\left|\int_{0}^{1}\int_{0}^{1}f(x,y)\nu(x,y)\,dx\,dy\right|\leq C\int_{0}^{1}% \int_{0}^{1}\int_{0}^{y}\int_{x}^{1}|\nu(s,t)||\nu(x,y)|\,ds\,dt\,dx\,dy| ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_f ( italic_x , italic_y ) italic_ν ( italic_x , italic_y ) italic_d italic_x italic_d italic_y | ≤ italic_C ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | italic_ν ( italic_s , italic_t ) | | italic_ν ( italic_x , italic_y ) | italic_d italic_s italic_d italic_t italic_d italic_x italic_d italic_y

We split the integral into three parts and then sum the estimates:

  • If both |ν(s,t)|1/2𝜈𝑠𝑡12|\nu(s,t)|\leq 1/2| italic_ν ( italic_s , italic_t ) | ≤ 1 / 2 and |ν(x,y)|1/2𝜈𝑥𝑦12|\nu(x,y)|\leq 1/2| italic_ν ( italic_x , italic_y ) | ≤ 1 / 2, then we write

    |ν(s,t)||ν(x,y)|12(ν2(s,t)+ν2(x,y)),𝜈𝑠𝑡𝜈𝑥𝑦12superscript𝜈2𝑠𝑡superscript𝜈2𝑥𝑦|\nu(s,t)||\nu(x,y)|\leq\frac{1}{2}\left(\nu^{2}(s,t)+\nu^{2}(x,y)\right),| italic_ν ( italic_s , italic_t ) | | italic_ν ( italic_x , italic_y ) | ≤ divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_ν start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_s , italic_t ) + italic_ν start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) ,

    plug into (22) and integrate over s,t,x,y𝑠𝑡𝑥𝑦s,t,x,yitalic_s , italic_t , italic_x , italic_y to get an upper bound

    C01010yx1𝟏|ν(s,t)|1/2𝟏|ν(x,y)|1/212(ν2(s,t)+ν2(x,y))𝑑s𝑑t𝑑x𝑑yC0101ν2(x,y)𝟏|ν(x,y)|1/2𝑑x𝑑y.𝐶superscriptsubscript01superscriptsubscript01superscriptsubscript0𝑦superscriptsubscript𝑥1subscript1𝜈𝑠𝑡12subscript1𝜈𝑥𝑦1212superscript𝜈2𝑠𝑡superscript𝜈2𝑥𝑦differential-d𝑠differential-d𝑡differential-d𝑥differential-d𝑦𝐶superscriptsubscript01superscriptsubscript01superscript𝜈2𝑥𝑦subscript1𝜈𝑥𝑦12differential-d𝑥differential-d𝑦C\int_{0}^{1}\int_{0}^{1}\int_{0}^{y}\int_{x}^{1}\mathbf{1}_{|\nu(s,t)|\leq 1/% 2}\mathbf{1}_{|\nu(x,y)|\leq 1/2}\frac{1}{2}\left(\nu^{2}(s,t)+\nu^{2}(x,y)% \right)\,ds\,dt\,dx\,dy\\ \leq C\int_{0}^{1}\int_{0}^{1}\nu^{2}(x,y)\mathbf{1}_{|\nu(x,y)|\leq 1/2}\,dx% \,dy.start_ROW start_CELL italic_C ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_1 start_POSTSUBSCRIPT | italic_ν ( italic_s , italic_t ) | ≤ 1 / 2 end_POSTSUBSCRIPT bold_1 start_POSTSUBSCRIPT | italic_ν ( italic_x , italic_y ) | ≤ 1 / 2 end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_ν start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_s , italic_t ) + italic_ν start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) italic_d italic_s italic_d italic_t italic_d italic_x italic_d italic_y end_CELL end_ROW start_ROW start_CELL ≤ italic_C ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_ν start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) bold_1 start_POSTSUBSCRIPT | italic_ν ( italic_x , italic_y ) | ≤ 1 / 2 end_POSTSUBSCRIPT italic_d italic_x italic_d italic_y . end_CELL end_ROW
  • If |ν(x,y)|>1/2𝜈𝑥𝑦12|\nu(x,y)|>1/2| italic_ν ( italic_x , italic_y ) | > 1 / 2, then we recall that g=hxy𝑔subscript𝑥𝑦g=-h_{xy}italic_g = - italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT is a non-negative function with total integral 1111 (by Definition 3.3 for the set 𝔉x,y,Isuperscript𝔉xy𝐼\mathfrak{F}^{\mathrm{x},\mathrm{y},I}fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT), so is h^xysubscript^𝑥𝑦-\hat{h}_{xy}- over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT, and we further have fxy=hxyh^xysubscript𝑓𝑥𝑦subscript𝑥𝑦subscript^𝑥𝑦f_{xy}=h_{xy}-\hat{h}_{xy}italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT = italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT - over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT. Therefore, using (19),

    0101|ν(s,t)|𝑑u𝑑v1c0101|fxy(s,t)|𝑑s𝑑t2c.superscriptsubscript01superscriptsubscript01𝜈𝑠𝑡differential-d𝑢differential-d𝑣1𝑐superscriptsubscript01superscriptsubscript01subscript𝑓𝑥𝑦𝑠𝑡differential-d𝑠differential-d𝑡2𝑐\int_{0}^{1}\int_{0}^{1}|\nu(s,t)|\,dudv\leq\frac{1}{c}\int_{0}^{1}\int_{0}^{1% }|f_{xy}(s,t)|\,ds\,dt\leq\frac{2}{c}.∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | italic_ν ( italic_s , italic_t ) | italic_d italic_u italic_d italic_v ≤ divide start_ARG 1 end_ARG start_ARG italic_c end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ( italic_s , italic_t ) | italic_d italic_s italic_d italic_t ≤ divide start_ARG 2 end_ARG start_ARG italic_c end_ARG .

    Hence, the part of the integral (22) with |ν(x,y)|>1/2𝜈𝑥𝑦12|\nu(x,y)|>1/2| italic_ν ( italic_x , italic_y ) | > 1 / 2 is upper-bounded by

    2Cc|ν|𝟏|ν|>1/2𝑑x𝑑y.2𝐶𝑐double-integral𝜈subscript1𝜈12differential-d𝑥differential-d𝑦\frac{2C}{c}\iint|\nu|\mathbf{1}_{|\nu|>1/2}\,dx\,dy.divide start_ARG 2 italic_C end_ARG start_ARG italic_c end_ARG ∬ | italic_ν | bold_1 start_POSTSUBSCRIPT | italic_ν | > 1 / 2 end_POSTSUBSCRIPT italic_d italic_x italic_d italic_y .
  • If |ν(s,t)|>1/2𝜈𝑠𝑡12|\nu(s,t)|>1/2| italic_ν ( italic_s , italic_t ) | > 1 / 2 and |ν(x,y)|1/2𝜈𝑥𝑦12|\nu(x,y)|\leq 1/2| italic_ν ( italic_x , italic_y ) | ≤ 1 / 2, then we upper bound by the part of the integral where only |ν(s,t)|>1/2𝜈𝑠𝑡12|\nu(s,t)|>1/2| italic_ν ( italic_s , italic_t ) | > 1 / 2 (dropping the second condition). Using exactly the same argument is in the previous case, we again get a bound 2Cc|ν|𝟏|ν|>1/2𝑑x𝑑y2𝐶𝑐double-integral𝜈subscript1𝜈12differential-d𝑥differential-d𝑦\frac{2C}{c}\iint|\nu|\mathbf{1}_{|\nu|>1/2}\,dx\,dydivide start_ARG 2 italic_C end_ARG start_ARG italic_c end_ARG ∬ | italic_ν | bold_1 start_POSTSUBSCRIPT | italic_ν | > 1 / 2 end_POSTSUBSCRIPT italic_d italic_x italic_d italic_y. ∎

Lemma 3.18.

Suppose that f(x,y)𝑓𝑥𝑦f(x,y)italic_f ( italic_x , italic_y ) is a difference of two functions from 𝔉x,y,Isuperscript𝔉xy𝐼\mathfrak{F}^{\mathrm{x},\mathrm{y},I}fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT, such that mixed partials of each of them are (minus) densities of absolutely continuous measures on [0,1]2superscript012[0,1]^{2}[ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Also suppose that h^𝔉x,y,I^superscript𝔉xy𝐼\hat{h}\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}over^ start_ARG italic_h end_ARG ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT is a smooth solution to EL equations of class [c,C]𝑐𝐶[c,C][ italic_c , italic_C ]. Then

(23) 0101[fxyln(h^xy)+fxyrh^xfyrfxh^y]𝑑x𝑑y=0.superscriptsubscript01superscriptsubscript01delimited-[]subscript𝑓𝑥𝑦subscript^𝑥𝑦subscript𝑓𝑥𝑦𝑟subscript^𝑥subscript𝑓𝑦𝑟subscript𝑓𝑥subscript^𝑦differential-d𝑥differential-d𝑦0\int_{0}^{1}\int_{0}^{1}\left[f_{xy}\ln\bigl{(}-\hat{h}_{xy}\bigr{)}+f_{xy}-r% \hat{h}_{x}f_{y}-rf_{x}\hat{h}_{y}\right]\,dx\,dy=0.∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT [ italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT roman_ln ( - over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ) + italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT - italic_r over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT - italic_r italic_f start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ] italic_d italic_x italic_d italic_y = 0 .
Proof.

In terms of g^=h^xy^𝑔subscript^𝑥𝑦\hat{g}=-\hat{h}_{xy}over^ start_ARG italic_g end_ARG = - over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT, the left-hand side of (23) is rewritten as

(24) 0101[fxylng^(x,y)g^(0,0)g^(x,0)g^(0,y)+rfy0yg^(x,v)dv+rfx0xg^(u,y)durfx01g^(u,y)du+fxy[lng^(x,0)+lng^(0,y)lng^(0,0)+1]]dxdysuperscriptsubscript01superscriptsubscript01delimited-[]subscript𝑓𝑥𝑦^𝑔𝑥𝑦^𝑔00^𝑔𝑥0^𝑔0𝑦𝑟subscript𝑓𝑦superscriptsubscript0𝑦^𝑔𝑥𝑣𝑑𝑣𝑟subscript𝑓𝑥superscriptsubscript0𝑥^𝑔𝑢𝑦𝑑𝑢𝑟subscript𝑓𝑥superscriptsubscript01^𝑔𝑢𝑦𝑑𝑢subscript𝑓𝑥𝑦delimited-[]^𝑔𝑥0^𝑔0𝑦^𝑔001𝑑𝑥𝑑𝑦\int_{0}^{1}\int_{0}^{1}\Biggl{[}f_{xy}\ln\frac{\hat{g}(x,y)\hat{g}(0,0)}{\hat% {g}(x,0)\hat{g}(0,y)}+rf_{y}\int_{0}^{y}\hat{g}(x,v)\,dv+rf_{x}\int_{0}^{x}% \hat{g}(u,y)\,du\\ -rf_{x}\int_{0}^{1}\hat{g}(u,y)\,du+f_{xy}\left[\ln\hat{g}(x,0)+\ln\hat{g}(0,y% )-\ln\hat{g}(0,0)+1\right]\Biggr{]}\,dx\,dystart_ROW start_CELL ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT [ italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT roman_ln divide start_ARG over^ start_ARG italic_g end_ARG ( italic_x , italic_y ) over^ start_ARG italic_g end_ARG ( 0 , 0 ) end_ARG start_ARG over^ start_ARG italic_g end_ARG ( italic_x , 0 ) over^ start_ARG italic_g end_ARG ( 0 , italic_y ) end_ARG + italic_r italic_f start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT over^ start_ARG italic_g end_ARG ( italic_x , italic_v ) italic_d italic_v + italic_r italic_f start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT over^ start_ARG italic_g end_ARG ( italic_u , italic_y ) italic_d italic_u end_CELL end_ROW start_ROW start_CELL - italic_r italic_f start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT over^ start_ARG italic_g end_ARG ( italic_u , italic_y ) italic_d italic_u + italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT [ roman_ln over^ start_ARG italic_g end_ARG ( italic_x , 0 ) + roman_ln over^ start_ARG italic_g end_ARG ( 0 , italic_y ) - roman_ln over^ start_ARG italic_g end_ARG ( 0 , 0 ) + 1 ] ] italic_d italic_x italic_d italic_y end_CELL end_ROW

The integral of the second line vanishes, because f𝑓fitalic_f has zero boundary conditions along all sides of [0,1]2superscript012[0,1]^{2}[ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. For the first line, we integrate by parts in x𝑥xitalic_x for the second term and in y𝑦yitalic_y for the third term, transforming (24) into

0101[fxylng^(x,y)g^(0,0)g^(x,0)g^(0,y)rfxy0x0yg^(s,t)𝑑t𝑑srfxy0y0xg^(s,t)𝑑s𝑑t]𝑑x𝑑y+01[rfy0x0yg^(s,t)𝑑t𝑑s]x=0x=1𝑑y+01[rfx0y0xg^(s,t)𝑑s𝑑t]y=0y=1𝑑x.superscriptsubscript01superscriptsubscript01delimited-[]subscript𝑓𝑥𝑦^𝑔𝑥𝑦^𝑔00^𝑔𝑥0^𝑔0𝑦𝑟subscript𝑓𝑥𝑦superscriptsubscript0𝑥superscriptsubscript0𝑦^𝑔𝑠𝑡differential-d𝑡differential-d𝑠𝑟subscript𝑓𝑥𝑦superscriptsubscript0𝑦superscriptsubscript0𝑥^𝑔𝑠𝑡differential-d𝑠differential-d𝑡differential-d𝑥differential-d𝑦superscriptsubscript01subscriptsuperscriptdelimited-[]𝑟subscript𝑓𝑦superscriptsubscript0𝑥superscriptsubscript0𝑦^𝑔𝑠𝑡differential-d𝑡differential-d𝑠𝑥1𝑥0differential-d𝑦superscriptsubscript01subscriptsuperscriptdelimited-[]𝑟subscript𝑓𝑥superscriptsubscript0𝑦superscriptsubscript0𝑥^𝑔𝑠𝑡differential-d𝑠differential-d𝑡𝑦1𝑦0differential-d𝑥\int_{0}^{1}\int_{0}^{1}\Biggl{[}f_{xy}\ln\frac{\hat{g}(x,y)\hat{g}(0,0)}{\hat% {g}(x,0)\hat{g}(0,y)}-rf_{xy}\int_{0}^{x}\int_{0}^{y}\hat{g}(s,t)\,dt\,ds-rf_{% xy}\int_{0}^{y}\int_{0}^{x}\hat{g}(s,t)\,ds\,dt\Biggr{]}\,dx\,dy\\ +\int_{0}^{1}\Biggl{[}rf_{y}\int_{0}^{x}\int_{0}^{y}\hat{g}(s,t)\,dt\,ds\Biggr% {]}^{x=1}_{x=0}dy+\int_{0}^{1}\Biggl{[}rf_{x}\int_{0}^{y}\int_{0}^{x}\hat{g}(s% ,t)\,ds\,dt\Biggr{]}^{y=1}_{y=0}\,dx.start_ROW start_CELL ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT [ italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT roman_ln divide start_ARG over^ start_ARG italic_g end_ARG ( italic_x , italic_y ) over^ start_ARG italic_g end_ARG ( 0 , 0 ) end_ARG start_ARG over^ start_ARG italic_g end_ARG ( italic_x , 0 ) over^ start_ARG italic_g end_ARG ( 0 , italic_y ) end_ARG - italic_r italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT over^ start_ARG italic_g end_ARG ( italic_s , italic_t ) italic_d italic_t italic_d italic_s - italic_r italic_f start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT over^ start_ARG italic_g end_ARG ( italic_s , italic_t ) italic_d italic_s italic_d italic_t ] italic_d italic_x italic_d italic_y end_CELL end_ROW start_ROW start_CELL + ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT [ italic_r italic_f start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT over^ start_ARG italic_g end_ARG ( italic_s , italic_t ) italic_d italic_t italic_d italic_s ] start_POSTSUPERSCRIPT italic_x = 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x = 0 end_POSTSUBSCRIPT italic_d italic_y + ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT [ italic_r italic_f start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT over^ start_ARG italic_g end_ARG ( italic_s , italic_t ) italic_d italic_s italic_d italic_t ] start_POSTSUPERSCRIPT italic_y = 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_y = 0 end_POSTSUBSCRIPT italic_d italic_x . end_CELL end_ROW

The first line of the last expression is zero because of (15), and the second one vanishes because of the zero boundary conditions for f𝑓fitalic_f. ∎

4. Smooth solutions to Euler-Lagrange equations

In this section we discuss how to solve the Euler-Lagrange equations for the maximizers of the permuton energy 𝔈(h)𝔈\mathfrak{E}(h)fraktur_E ( italic_h ).

4.1. Consequences of Definition 3.14

Proposition 4.1.

Take r0𝑟0r\neq 0italic_r ≠ 0 and let hhitalic_h and g𝑔gitalic_g be as in Definition 3.14. For each 1uk1𝑢𝑘1\leq u\leq k1 ≤ italic_u ≤ italic_k, 1v1𝑣1\leq v\leq\ell1 ≤ italic_v ≤ roman_ℓ, there exist continuous functions ϕuv(x):[xu1,xu]{}:superscriptitalic-ϕ𝑢𝑣𝑥subscriptx𝑢1subscriptx𝑢\phi^{uv}(x):[\mathrm{x}_{u-1},\mathrm{x}_{u}]\to\mathbb{R}\cup\{\infty\}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) : [ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ] → blackboard_R ∪ { ∞ } and ψuv(y):[yv1,yv]{}:superscript𝜓𝑢𝑣𝑦subscripty𝑣1subscripty𝑣\psi^{uv}(y):[\mathrm{y}_{v-1},\mathrm{y}_{v}]\to\mathbb{R}\cup\{\infty\}italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) : [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ] → blackboard_R ∪ { ∞ }, taking the value \infty at most once each, and such that

(25) guv(x,y)=1rxϕuv(x)yψuv(y)[ϕuv(x)ψuv(y)]2.superscript𝑔𝑢𝑣𝑥𝑦1𝑟𝑥superscriptitalic-ϕ𝑢𝑣𝑥𝑦superscript𝜓𝑢𝑣𝑦superscriptdelimited-[]superscriptitalic-ϕ𝑢𝑣𝑥superscript𝜓𝑢𝑣𝑦2g^{uv}(x,y)=-\frac{1}{r}\cdot\frac{\frac{\partial}{\partial x}\phi^{uv}(x)% \frac{\partial}{\partial y}\psi^{uv}(y)}{[\phi^{uv}(x)-\psi^{uv}(y)]^{2}}.italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ⋅ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) end_ARG start_ARG [ italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG .
Remark 4.2.

A way to arrive at (25) is by sending (x2x1)0subscript𝑥2subscript𝑥10(x_{2}-x_{1})\to 0( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) → 0, (y2y1)0subscript𝑦2subscript𝑦10(y_{2}-y_{1})\to 0( italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) → 0 in (15), leading to the (hyperbolic) Liouville PDE:

(26) (lng)xy=2rg,subscript𝑔𝑥𝑦2𝑟𝑔(\ln g)_{xy}=2rg,( roman_ln italic_g ) start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT = 2 italic_r italic_g ,

because (25) is one possible form of a general solution to this PDE.

Remark 4.3.

The choice of ϕitalic-ϕ\phiitalic_ϕ and ψ𝜓\psiitalic_ψ in (25) is not unique: the Moebius transformations

(27) ϕuv(x)αϕuv(x)+βγϕuv(x)+δ,ψuv(y)αψuv(y)+βγψuv(y)+δformulae-sequencemaps-tosuperscriptitalic-ϕ𝑢𝑣𝑥𝛼superscriptitalic-ϕ𝑢𝑣𝑥𝛽𝛾superscriptitalic-ϕ𝑢𝑣𝑥𝛿maps-tosuperscript𝜓𝑢𝑣𝑦𝛼superscript𝜓𝑢𝑣𝑦𝛽𝛾superscript𝜓𝑢𝑣𝑦𝛿\phi^{uv}(x)\mapsto\frac{\alpha\phi^{uv}(x)+\beta}{\gamma\phi^{uv}(x)+\delta},% \qquad\psi^{uv}(y)\mapsto\frac{\alpha\psi^{uv}(y)+\beta}{\gamma\psi^{uv}(y)+\delta}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) ↦ divide start_ARG italic_α italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) + italic_β end_ARG start_ARG italic_γ italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) + italic_δ end_ARG , italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) ↦ divide start_ARG italic_α italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) + italic_β end_ARG start_ARG italic_γ italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) + italic_δ end_ARG

keep the expression in the right-hand side of (25) unchanged. In particular, these transformations with real α,β,γ,δ𝛼𝛽𝛾𝛿\alpha,\beta,\gamma,\deltaitalic_α , italic_β , italic_γ , italic_δ can be used to move around the points (if any) where ϕuv(x)=superscriptitalic-ϕ𝑢𝑣𝑥\phi^{uv}(x)=\inftyitalic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) = ∞ or ψuv(y)=superscript𝜓𝑢𝑣𝑦\psi^{uv}(y)=\inftyitalic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) = ∞. One can also get rid of the infinities completely by using a Moebius transform mapping the real line to the unit circle in \mathbb{C}blackboard_C. Then we can rewrite the representation (25) in terms of two angles on the circle as:

(28) guv(x,y)superscript𝑔𝑢𝑣𝑥𝑦\displaystyle g^{uv}(x,y)italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) =xθ1uv(x)yθ2uv(y)r[sinθ1uv(x)+𝐢cosθ1uv(x)][sinθ2uv(y)+𝐢cosθ2uv(y)][cosθ1uv(x)+𝐢sinθ1uv(x)cosθ2uv(y)𝐢sinθ2uv(y)]2absent𝑥superscriptsubscript𝜃1𝑢𝑣𝑥𝑦superscriptsubscript𝜃2𝑢𝑣𝑦𝑟delimited-[]superscriptsubscript𝜃1𝑢𝑣𝑥𝐢superscriptsubscript𝜃1𝑢𝑣𝑥delimited-[]superscriptsubscript𝜃2𝑢𝑣𝑦𝐢superscriptsubscript𝜃2𝑢𝑣𝑦superscriptdelimited-[]superscriptsubscript𝜃1𝑢𝑣𝑥𝐢superscriptsubscript𝜃1𝑢𝑣𝑥superscriptsubscript𝜃2𝑢𝑣𝑦𝐢superscriptsubscript𝜃2𝑢𝑣𝑦2\displaystyle=\frac{\frac{\partial}{\partial x}\theta_{1}^{uv}(x)\frac{% \partial}{\partial y}\theta_{2}^{uv}(y)}{r}\cdot\frac{[-\sin\theta_{1}^{uv}(x)% +\mathbf{i}\cos\theta_{1}^{uv}(x)][-\sin\theta_{2}^{uv}(y)+\mathbf{i}\cos% \theta_{2}^{uv}(y)]}{[\cos\theta_{1}^{uv}(x)+\mathbf{i}\sin\theta_{1}^{uv}(x)-% \cos\theta_{2}^{uv}(y)-\mathbf{i}\sin\theta_{2}^{uv}(y)]^{2}}= divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) end_ARG start_ARG italic_r end_ARG ⋅ divide start_ARG [ - roman_sin italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) + bold_i roman_cos italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) ] [ - roman_sin italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) + bold_i roman_cos italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) ] end_ARG start_ARG [ roman_cos italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) + bold_i roman_sin italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) - roman_cos italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) - bold_i roman_sin italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG
=1rxθ1uv(x)yθ2uv(y)[sinθ1uv(x)sinθ2uv(y)]2+[cosθ1uv(x)cosθ2uv(y)]2.absent1𝑟𝑥superscriptsubscript𝜃1𝑢𝑣𝑥𝑦superscriptsubscript𝜃2𝑢𝑣𝑦superscriptdelimited-[]superscriptsubscript𝜃1𝑢𝑣𝑥superscriptsubscript𝜃2𝑢𝑣𝑦2superscriptdelimited-[]superscriptsubscript𝜃1𝑢𝑣𝑥superscriptsubscript𝜃2𝑢𝑣𝑦2\displaystyle=\frac{1}{r}\cdot\frac{\frac{\partial}{\partial x}\theta_{1}^{uv}% (x)\frac{\partial}{\partial y}\theta_{2}^{uv}(y)}{[\sin\theta_{1}^{uv}(x)-\sin% \theta_{2}^{uv}(y)]^{2}+[\cos\theta_{1}^{uv}(x)-\cos\theta_{2}^{uv}(y)]^{2}}.= divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ⋅ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) end_ARG start_ARG [ roman_sin italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) - roman_sin italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + [ roman_cos italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) - roman_cos italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG .
Proof of Proposition 4.1.

Our argument is somewhat similar to [Sta09, Section 5]. We fix 1uk1𝑢𝑘1\leq u\leq k1 ≤ italic_u ≤ italic_k and 1v1𝑣1\leq v\leq\ell1 ≤ italic_v ≤ roman_ℓ and omit them from the notations throughout the proof. We assume r<0𝑟0r<0italic_r < 0 throughout the proof: the case r>0𝑟0r>0italic_r > 0 can be obtained by applying g(x,y)g(1x,y)𝑔𝑥𝑦𝑔1𝑥𝑦g(x,y)\leftrightarrow g(1-x,y)italic_g ( italic_x , italic_y ) ↔ italic_g ( 1 - italic_x , italic_y ) involution. Using (15) inside the rectangle (xu1,xu)×(yv1,yv)subscriptx𝑢1subscriptx𝑢subscripty𝑣1subscripty𝑣(\mathrm{x}_{u-1},\mathrm{x}_{u})\times(\mathrm{y}_{v-1},\mathrm{y}_{v})( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) × ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) and extending it to the boundaries by continuity, we conclude that for all xu1xxusubscriptx𝑢1𝑥subscriptx𝑢\mathrm{x}_{u-1}\leq x\leq\mathrm{x}_{u}roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ≤ italic_x ≤ roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, yv1yyvsubscripty𝑣1𝑦subscripty𝑣\mathrm{y}_{v-1}\leq y\leq\mathrm{y}_{v}roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ≤ italic_y ≤ roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT

(29) g(x,y)g(xu1,yv1)g(xu1,y)g(x,yv1)=exp(2ryv1yxu1xg(s,t)dsdt).𝑔𝑥𝑦𝑔subscriptx𝑢1subscripty𝑣1𝑔subscriptx𝑢1𝑦𝑔𝑥subscripty𝑣12𝑟superscriptsubscriptsubscripty𝑣1𝑦superscriptsubscriptsubscriptx𝑢1𝑥𝑔𝑠𝑡differential-d𝑠differential-d𝑡g(x,y)\frac{g(\mathrm{x}_{u-1},\mathrm{y}_{v-1})}{g(\mathrm{x}_{u-1},y)g(x,% \mathrm{y}_{v-1})}=\exp\left(2r\int_{\mathrm{y}_{v-1}}^{y}\int_{\mathrm{x}_{u-% 1}}^{x}g(s,t)\,\mathrm{d}s\,\mathrm{d}t\right).italic_g ( italic_x , italic_y ) divide start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_y ) italic_g ( italic_x , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG = roman_exp ( 2 italic_r ∫ start_POSTSUBSCRIPT roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_g ( italic_s , italic_t ) roman_d italic_s roman_d italic_t ) .

We treat (29) as a functional equation on the unknown function g𝑔gitalic_g, supplied with boundary conditions g(x,yv1)𝑔𝑥subscripty𝑣1g(x,\mathrm{y}_{v-1})italic_g ( italic_x , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ), xu1xxusubscriptx𝑢1𝑥subscriptx𝑢\mathrm{x}_{u-1}\leq x\leq\mathrm{x}_{u}roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ≤ italic_x ≤ roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, and g(xu1,y)𝑔subscriptx𝑢1𝑦g(\mathrm{x}_{u-1},y)italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_y ), yv1yyvsubscripty𝑣1𝑦subscripty𝑣\mathrm{y}_{v-1}\leq y\leq\mathrm{y}_{v}roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ≤ italic_y ≤ roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT. We further argue in two steps. In Step 1, we show that there exists a solution to (29) of the form (25). In Step 2 we show that (29) uniquely determines g𝑔gitalic_g.

Step 1. Note that the right-hand side of (25) can be rewritten as

1r2xyln|ϕ(x)ψ(y)|.1𝑟superscript2𝑥𝑦italic-ϕ𝑥𝜓𝑦-\frac{1}{r}\frac{\partial^{2}}{\partial x\partial y}\ln\bigl{|}\phi(x)-\psi(y% )\bigr{|}.- divide start_ARG 1 end_ARG start_ARG italic_r end_ARG divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_x ∂ italic_y end_ARG roman_ln | italic_ϕ ( italic_x ) - italic_ψ ( italic_y ) | .

Plugging into (29), we need to check

[ϕ(x)ψ(yv1)]2[ϕ(xu1)ψ(y)]2[ϕ(xu1)ψ(yv1)]2[ϕ(x)ψ(y)]2=?exp(2ln|(ϕ(x)ψ(y))(ϕ(x)ψ(yv1))(ϕ(x)ψ(yv1))(ϕ(xu1)ψ(y))|),superscript?superscriptdelimited-[]italic-ϕ𝑥𝜓subscripty𝑣12superscriptdelimited-[]italic-ϕsubscriptx𝑢1𝜓𝑦2superscriptdelimited-[]italic-ϕsubscriptx𝑢1𝜓subscripty𝑣12superscriptdelimited-[]italic-ϕ𝑥𝜓𝑦22italic-ϕ𝑥𝜓𝑦italic-ϕ𝑥𝜓subscript𝑦𝑣1italic-ϕ𝑥𝜓subscript𝑦𝑣1italic-ϕsubscriptx𝑢1𝜓𝑦\frac{[\phi(x)-\psi(\mathrm{y}_{v-1})]^{2}[\phi(\mathrm{x}_{u-1})-\psi(y)]^{2}% }{[\phi(\mathrm{x}_{u-1})-\psi(\mathrm{y}_{v-1})]^{2}[\phi(x)-\psi(y)]^{2}}% \stackrel{{\scriptstyle?}}{{=}}\exp\left(-2\ln\left|\frac{\bigl{(}\phi(x)-\psi% (y)\bigr{)}\bigl{(}\phi(x)-\psi(y_{v-1})\bigr{)}}{\bigl{(}\phi(x)-\psi(y_{v-1}% )\bigr{)}\bigl{(}\phi(\mathrm{x}_{u-1})-\psi(y)\bigr{)}}\right|\right),divide start_ARG [ italic_ϕ ( italic_x ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT [ italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG [ italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT [ italic_ϕ ( italic_x ) - italic_ψ ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG ? end_ARG end_RELOP roman_exp ( - 2 roman_ln | divide start_ARG ( italic_ϕ ( italic_x ) - italic_ψ ( italic_y ) ) ( italic_ϕ ( italic_x ) - italic_ψ ( italic_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) end_ARG start_ARG ( italic_ϕ ( italic_x ) - italic_ψ ( italic_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( italic_y ) ) end_ARG | ) ,

which is trivially true for any choice of functions such that |ϕ(x)ψ(y)|0italic-ϕ𝑥𝜓𝑦0|\phi(x)-\psi(y)|\neq 0| italic_ϕ ( italic_x ) - italic_ψ ( italic_y ) | ≠ 0 for all relevant x𝑥xitalic_x and y𝑦yitalic_y. We also need to match the functions to the boundary conditions g(x,yv1)𝑔𝑥subscripty𝑣1g(x,\mathrm{y}_{v-1})italic_g ( italic_x , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) and g(xu1,y)𝑔subscriptx𝑢1𝑦g(\mathrm{x}_{u-1},y)italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_y ). For instance, we can choose

(30) ϕ(x)italic-ϕ𝑥\displaystyle\phi(x)italic_ϕ ( italic_x ) =12+[1|r|xu1xg(s,yv1)g(xu1,yv1)ds]1,absent12superscriptdelimited-[]1𝑟superscriptsubscriptsubscriptx𝑢1𝑥𝑔𝑠subscripty𝑣1𝑔subscriptx𝑢1subscripty𝑣1differential-d𝑠1\displaystyle=-\frac{1}{2}+\left[1-\sqrt{|r|}\int_{\mathrm{x}_{u-1}}^{x}\frac{% g(s,\mathrm{y}_{v-1})}{\sqrt{g(\mathrm{x}_{u-1},\mathrm{y}_{v-1})}}\mathrm{d}s% \right]^{-1},= - divide start_ARG 1 end_ARG start_ARG 2 end_ARG + [ 1 - square-root start_ARG | italic_r | end_ARG ∫ start_POSTSUBSCRIPT roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT divide start_ARG italic_g ( italic_s , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG square-root start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG end_ARG roman_d italic_s ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ,
(31) ψ(y)𝜓𝑦\displaystyle\psi(y)italic_ψ ( italic_y ) =12[1sgn(r)|r|yv1yg(xu1,t)g(xu1,yv1)dt]1,absent12superscriptdelimited-[]1sgn𝑟𝑟superscriptsubscriptsubscripty𝑣1𝑦𝑔subscriptx𝑢1𝑡𝑔subscriptx𝑢1subscripty𝑣1differential-d𝑡1\displaystyle=\frac{1}{2}-\left[1-\mathrm{sgn}(r)\sqrt{|r|}\int_{\mathrm{y}_{v% -1}}^{y}\frac{g(\mathrm{x}_{u-1},t)}{\sqrt{g(\mathrm{x}_{u-1},\mathrm{y}_{v-1}% )}}\mathrm{d}t\right]^{-1},= divide start_ARG 1 end_ARG start_ARG 2 end_ARG - [ 1 - roman_sgn ( italic_r ) square-root start_ARG | italic_r | end_ARG ∫ start_POSTSUBSCRIPT roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT divide start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_t ) end_ARG start_ARG square-root start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG end_ARG roman_d italic_t ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ,

leading through (25) to

(32) g(x,y)=g(x,yv1)g(xu1,y)g(xu1,yv1)[1rg(xu1,yv1)xu1xg(s,yv1)dsyv1yg(xu1,t)dt]2,𝑔𝑥𝑦𝑔𝑥subscripty𝑣1𝑔subscriptx𝑢1𝑦𝑔subscriptx𝑢1subscripty𝑣1superscriptdelimited-[]1𝑟𝑔subscriptx𝑢1subscripty𝑣1superscriptsubscriptsubscriptx𝑢1𝑥𝑔𝑠subscripty𝑣1differential-d𝑠superscriptsubscriptsubscripty𝑣1𝑦𝑔subscriptx𝑢1𝑡differential-d𝑡2g(x,y)=\frac{g(x,\mathrm{y}_{v-1})g(\mathrm{x}_{u-1},y)}{g(\mathrm{x}_{u-1},% \mathrm{y}_{v-1})}\left[1-\frac{r}{g(\mathrm{x}_{u-1},\mathrm{y}_{v-1})}\int_{% \mathrm{x}_{u-1}}^{x}g(s,\mathrm{y}_{v-1})\mathrm{d}s\int_{\mathrm{y}_{v-1}}^{% y}g(\mathrm{x}_{u-1},t)\mathrm{d}t\right]^{-2},italic_g ( italic_x , italic_y ) = divide start_ARG italic_g ( italic_x , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_y ) end_ARG start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG [ 1 - divide start_ARG italic_r end_ARG start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG ∫ start_POSTSUBSCRIPT roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_g ( italic_s , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) roman_d italic_s ∫ start_POSTSUBSCRIPT roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_t ) roman_d italic_t ] start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ,

which clearly satisfies the desired boundary conditions g(x,yv1)𝑔𝑥subscripty𝑣1g(x,\mathrm{y}_{v-1})italic_g ( italic_x , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) and g(xu1,y)𝑔subscriptx𝑢1𝑦g(\mathrm{x}_{u-1},y)italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_y ). Note that since r<0𝑟0r<0italic_r < 0, (32) is well-defined and uniformly bounded for all xu1xxusubscriptx𝑢1𝑥subscriptx𝑢\mathrm{x}_{u-1}\leq x\leq\mathrm{x}_{u}roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ≤ italic_x ≤ roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, yv1yyvsubscripty𝑣1𝑦subscripty𝑣\mathrm{y}_{v-1}\leq y\leq\mathrm{y}_{v}roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ≤ italic_y ≤ roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT. Further, |ϕ(x)|>1/2italic-ϕ𝑥12|\phi(x)|>1/2| italic_ϕ ( italic_x ) | > 1 / 2 for xu1<xxusubscriptx𝑢1𝑥subscriptx𝑢\mathrm{x}_{u-1}<x\leq\mathrm{x}_{u}roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT < italic_x ≤ roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT and |ψ(y)|<1/2𝜓𝑦12|\psi(y)|<1/2| italic_ψ ( italic_y ) | < 1 / 2 for yv1<yyvsubscripty𝑣1𝑦subscripty𝑣\mathrm{y}_{v-1}<y\leq\mathrm{y}_{v}roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT < italic_y ≤ roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT, hence ϕ(x)ψ(y)italic-ϕ𝑥𝜓𝑦\phi(x)-\psi(y)italic_ϕ ( italic_x ) - italic_ψ ( italic_y ) stays bounded away from zero, as desired. Also note that ϕ(x)italic-ϕ𝑥\phi(x)italic_ϕ ( italic_x ) might have a pole, but it cancels when plugging into (25). Clearly, (30), (31) imply that ϕitalic-ϕ\phiitalic_ϕ and ψ𝜓\psiitalic_ψ can not have more than one pole, as claimed in the statement.

Step 2. Suppose that g[1]superscript𝑔delimited-[]1g^{[1]}italic_g start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT and g[2]superscript𝑔delimited-[]2g^{[2]}italic_g start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT are two positive continuous functions on [xu1,xu]×[yv1,yv]subscriptx𝑢1subscriptx𝑢subscripty𝑣1subscripty𝑣[\mathrm{x}_{u-1},\mathrm{x}_{u}]\times[\mathrm{y}_{v-1},\mathrm{y}_{v}][ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ] × [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ], which both satisfy (29) and g[1](x,yv1)=g[2](x,yv1)superscript𝑔delimited-[]1𝑥subscripty𝑣1superscript𝑔delimited-[]2𝑥subscripty𝑣1g^{[1]}(x,\mathrm{y}_{v-1})=g^{[2]}(x,\mathrm{y}_{v-1})italic_g start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT ( italic_x , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) = italic_g start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT ( italic_x , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ), g[1](xu1,y)=g[2](xu1,y)superscript𝑔delimited-[]1subscriptx𝑢1𝑦superscript𝑔delimited-[]2subscriptx𝑢1𝑦g^{[1]}(\mathrm{x}_{u-1},y)=g^{[2]}(\mathrm{x}_{u-1},y)italic_g start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_y ) = italic_g start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_y ). We claim that then g[1]superscript𝑔delimited-[]1g^{[1]}italic_g start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT and g[2]superscript𝑔delimited-[]2g^{[2]}italic_g start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT coincide. Indeed, define

Δg(x,y)=maxs[xu1,x]t[yv1,y]|g[1](s,t)g[2](s,t)|.Δ𝑔𝑥𝑦subscript𝑠subscriptx𝑢1𝑥𝑡subscripty𝑣1𝑦superscript𝑔delimited-[]1𝑠𝑡superscript𝑔delimited-[]2𝑠𝑡\Delta g(x,y)=\max_{\begin{smallmatrix}s\in[\mathrm{x}_{u-1},x]\\ t\in[\mathrm{y}_{v-1},y]\end{smallmatrix}}|g^{[1]}(s,t)-g^{[2]}(s,t)|.roman_Δ italic_g ( italic_x , italic_y ) = roman_max start_POSTSUBSCRIPT start_ROW start_CELL italic_s ∈ [ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_x ] end_CELL end_ROW start_ROW start_CELL italic_t ∈ [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , italic_y ] end_CELL end_ROW end_POSTSUBSCRIPT | italic_g start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT ( italic_s , italic_t ) - italic_g start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT ( italic_s , italic_t ) | .

Subtracting (29) for g[1]superscript𝑔delimited-[]1g^{[1]}italic_g start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT and g[2]superscript𝑔delimited-[]2g^{[2]}italic_g start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT and noting that g(xu1,yv1)g(xu1,y)g(x,yv1)𝑔subscriptx𝑢1subscripty𝑣1𝑔subscriptx𝑢1𝑦𝑔𝑥subscripty𝑣1\frac{g(\mathrm{x}_{u-1},\mathrm{y}_{v-1})}{g(\mathrm{x}_{u-1},y)g(x,\mathrm{y% }_{v-1})}divide start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_g ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , italic_y ) italic_g ( italic_x , roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG is the same for both, we deduce existence of a constant C1>0subscript𝐶10C_{1}>0italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 0, such that

(33) Δg(x,y)C1xu1xyv1yΔg(s,t)dtds.Δ𝑔𝑥𝑦subscript𝐶1superscriptsubscriptsubscriptx𝑢1𝑥superscriptsubscriptsubscripty𝑣1𝑦Δ𝑔𝑠𝑡differential-d𝑡differential-d𝑠\Delta g(x,y)\leq C_{1}\int_{\mathrm{x}_{u-1}}^{x}\int_{\mathrm{y}_{v-1}}^{y}% \Delta g(s,t)\,\mathrm{d}t\mathrm{d}s.roman_Δ italic_g ( italic_x , italic_y ) ≤ italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_y end_POSTSUPERSCRIPT roman_Δ italic_g ( italic_s , italic_t ) roman_d italic_t roman_d italic_s .

Plugging (33) into itself (n1)𝑛1(n-1)( italic_n - 1 ) times, and after that bounding |Δg(s,t)|<C2Δ𝑔𝑠𝑡subscript𝐶2|\Delta g(s,t)|<C_{2}| roman_Δ italic_g ( italic_s , italic_t ) | < italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, we deduce

Δg(x,y)C2[C1(xxu1)(yyv1)]nn!.Δ𝑔𝑥𝑦subscript𝐶2superscriptdelimited-[]subscript𝐶1𝑥subscriptx𝑢1𝑦subscripty𝑣1𝑛𝑛\Delta g(x,y)\leq C_{2}\frac{\bigl{[}C_{1}(x-\mathrm{x}_{u-1})(y-\mathrm{y}_{v% -1})\bigr{]}^{n}}{n!}.roman_Δ italic_g ( italic_x , italic_y ) ≤ italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT divide start_ARG [ italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) ( italic_y - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ] start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG start_ARG italic_n ! end_ARG .

Since n=1,2,𝑛12n=1,2,\dotsitalic_n = 1 , 2 , … is arbitrary, Δg(x,y)Δ𝑔𝑥𝑦\Delta g(x,y)roman_Δ italic_g ( italic_x , italic_y ) has to be 00. ∎

The next step is to explain how functions guv(x,y)superscript𝑔𝑢𝑣𝑥𝑦g^{uv}(x,y)italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) are related to each other.

Proposition 4.4.

Let hhitalic_h and g𝑔gitalic_g be as in Definition 3.14. If 1uk1𝑢𝑘1\leq u\leq k1 ≤ italic_u ≤ italic_k and 1v1<v21subscript𝑣1subscript𝑣21\leq v_{1}<v_{2}\leq\ell1 ≤ italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ roman_ℓ are such that Iu,v1+1=Iu,v1+2==Iu,v21=0subscript𝐼𝑢subscript𝑣11subscript𝐼𝑢subscript𝑣12subscript𝐼𝑢subscript𝑣210I_{u,v_{1}+1}=I_{u,v_{1}+2}=\dots=I_{u,v_{2}-1}=0italic_I start_POSTSUBSCRIPT italic_u , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 end_POSTSUBSCRIPT = ⋯ = italic_I start_POSTSUBSCRIPT italic_u , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT = 0, but Iu,v1=Iu,v2=1subscript𝐼𝑢subscript𝑣1subscript𝐼𝑢subscript𝑣21I_{u,v_{1}}=I_{u,v_{2}}=1italic_I start_POSTSUBSCRIPT italic_u , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 1, then there exists a constant Cu,v1v2>0subscript𝐶𝑢subscript𝑣1subscript𝑣20C_{u,v_{1}\to v_{2}}>0italic_C start_POSTSUBSCRIPT italic_u , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT → italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT > 0, such that

(34) guv2(x,yv21)=Cu,v1v2guv1(x,yv1),for all x[xu1,xu].formulae-sequencesuperscript𝑔𝑢subscript𝑣2𝑥subscriptysubscript𝑣21subscript𝐶𝑢subscript𝑣1subscript𝑣2superscript𝑔𝑢subscript𝑣1𝑥subscriptysubscript𝑣1for all 𝑥subscriptx𝑢1subscriptx𝑢g^{uv_{2}}(x,\mathrm{y}_{v_{2}-1})=C_{u,v_{1}\to v_{2}}\cdot g^{uv_{1}}(x,% \mathrm{y}_{v_{1}}),\qquad\text{for all }x\in[\mathrm{x}_{u-1},\mathrm{x}_{u}].italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) = italic_C start_POSTSUBSCRIPT italic_u , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT → italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , for all italic_x ∈ [ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ] .

Similarly, if 1u1<u2k1subscript𝑢1subscript𝑢2𝑘1\leq u_{1}<u_{2}\leq k1 ≤ italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_k and 1v1𝑣1\leq v\leq\ell1 ≤ italic_v ≤ roman_ℓ are such that Iu1+1,v=Iu1+2,v==Iu21,v=0subscript𝐼subscript𝑢11𝑣subscript𝐼subscript𝑢12𝑣subscript𝐼subscript𝑢21𝑣0I_{u_{1}+1,v}=I_{u_{1}+2,v}=\dots=I_{u_{2}-1,v}=0italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 , italic_v end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 , italic_v end_POSTSUBSCRIPT = ⋯ = italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 , italic_v end_POSTSUBSCRIPT = 0, but Iu1,v=Iu2,v=1subscript𝐼subscript𝑢1𝑣subscript𝐼subscript𝑢2𝑣1I_{u_{1},v}=I_{u_{2},v}=1italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_v end_POSTSUBSCRIPT = 1, then there exists a constant Cu1u1,vsubscript𝐶subscript𝑢1subscript𝑢1𝑣C_{u_{1}\to u_{1},v}italic_C start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT → italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v end_POSTSUBSCRIPT, such that

(35) gu2v(xu21,y)=Cu1u2,vgu1v(xu1,y)for all y[yv1,yv].formulae-sequencesuperscript𝑔subscript𝑢2𝑣subscriptxsubscript𝑢21𝑦subscript𝐶subscript𝑢1subscript𝑢2𝑣superscript𝑔subscript𝑢1𝑣subscriptxsubscript𝑢1𝑦for all 𝑦subscripty𝑣1subscripty𝑣g^{u_{2}v}(\mathrm{x}_{u_{2}-1},y)=C_{u_{1}\to u_{2},v}\cdot g^{u_{1}v}(% \mathrm{x}_{u_{1}},y)\qquad\text{for all }y\in[\mathrm{y}_{v-1},\mathrm{y}_{v}].italic_g start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT , italic_y ) = italic_C start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT → italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_v end_POSTSUBSCRIPT ⋅ italic_g start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_y ) for all italic_y ∈ [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ] .

In words, Proposition 4.4 says that g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) makes a multiplicative jump by a constant when moving to an adjacent rectangle or over a hole in the domain.

Proof of Proposition 4.4.

For (34), we choose any x1<x2[xu1,xu]subscript𝑥1subscript𝑥2subscriptx𝑢1subscriptx𝑢x_{1}<x_{2}\in[\mathrm{x}_{u-1},\mathrm{x}_{u}]italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ [ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ] and note that (15) for y1=yv1subscript𝑦1subscript𝑦subscript𝑣1y_{1}=y_{v_{1}}italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, y2=yv21subscript𝑦2subscript𝑦subscript𝑣21y_{2}=y_{v_{2}-1}italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT has the right-hand side 1111 and can be rewritten as

guv1(x1,yv1)guv2(x1,yv21)=guv1(x2,yv1)guv2(x2,yv21).superscript𝑔𝑢subscript𝑣1subscript𝑥1subscriptysubscript𝑣1superscript𝑔𝑢subscript𝑣2subscript𝑥1subscriptysubscript𝑣21superscript𝑔𝑢subscript𝑣1subscript𝑥2subscriptysubscript𝑣1superscript𝑔𝑢subscript𝑣2subscript𝑥2subscriptysubscript𝑣21\frac{g^{uv_{1}}(x_{1},\mathrm{y}_{v_{1}})}{g^{uv_{2}}(x_{1},\mathrm{y}_{v_{2}% -1})}=\frac{g^{uv_{1}}(x_{2},\mathrm{y}_{v_{1}})}{g^{uv_{2}}(x_{2},\mathrm{y}_% {v_{2}-1})}.divide start_ARG italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) end_ARG = divide start_ARG italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) end_ARG .

Hence, the ratio guv1(x,yv1)guv2(x,yv21)superscript𝑔𝑢subscript𝑣1𝑥subscriptysubscript𝑣1superscript𝑔𝑢subscript𝑣2𝑥subscriptysubscript𝑣21\frac{g^{uv_{1}}(x,\mathrm{y}_{v_{1}})}{g^{uv_{2}}(x,\mathrm{y}_{v_{2}-1})}divide start_ARG italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) end_ARG does not depend on x𝑥xitalic_x. For (35) the argument is similar. ∎

Proposition 4.5.

Let hhitalic_h and g𝑔gitalic_g be as in Definition 3.14. If 1u1<u1k1subscript𝑢1subscript𝑢1𝑘1\leq u_{1}<u_{1}\leq k1 ≤ italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_k and 1v1<v21subscript𝑣1subscript𝑣21\leq v_{1}<v_{2}\leq\ell1 ≤ italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ roman_ℓ are such that Iu1v1=Iu1v2=Iu2v1=Iu2v2=1subscript𝐼subscript𝑢1subscript𝑣1subscript𝐼subscript𝑢1subscript𝑣2subscript𝐼subscript𝑢2subscript𝑣1subscript𝐼subscript𝑢2subscript𝑣21I_{u_{1}v_{1}}=I_{u_{1}v_{2}}=I_{u_{2}v_{1}}=I_{u_{2}v_{2}}=1italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 1, but Iuv=0subscript𝐼𝑢𝑣0I_{uv}=0italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 0 for all u1<u<u2subscript𝑢1𝑢subscript𝑢2u_{1}<u<u_{2}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_u < italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT v1<v<v2subscript𝑣1𝑣subscript𝑣2v_{1}<v<v_{2}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_v < italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, then

(36) gu1v1(xu1,yv1)gu2v2(xu21,yv21)gu1v2(xu1,yv21)gu2v1(xu21,yv1)=1.superscript𝑔subscript𝑢1subscript𝑣1subscriptxsubscript𝑢1subscriptysubscript𝑣1superscript𝑔subscript𝑢2subscript𝑣2subscriptxsubscript𝑢21subscriptysubscript𝑣21superscript𝑔subscript𝑢1subscript𝑣2subscriptxsubscript𝑢1subscriptysubscript𝑣21superscript𝑔subscript𝑢2subscript𝑣1subscriptxsubscript𝑢21subscriptysubscript𝑣11\frac{g^{u_{1}v_{1}}(\mathrm{x}_{u_{1}},\mathrm{y}_{v_{1}})g^{u_{2}v_{2}}(% \mathrm{x}_{u_{2}-1},\mathrm{y}_{v_{2}-1})}{g^{u_{1}v_{2}}(\mathrm{x}_{u_{1}},% \mathrm{y}_{v_{2}-1})g^{u_{2}v_{1}}(\mathrm{x}_{u_{2}-1},\mathrm{y}_{v_{1}})}=1.divide start_ARG italic_g start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_g start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_g start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) italic_g start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG = 1 .
Proof.

We apply (15) for x1=xu1subscript𝑥1subscriptxsubscript𝑢1x_{1}=\mathrm{x}_{u_{1}}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, x2=xu21subscript𝑥2subscriptxsubscript𝑢21x_{2}=\mathrm{x}_{u_{2}-1}italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT, y1=yv1subscript𝑦1subscriptysubscript𝑣1y_{1}=\mathrm{y}_{v_{1}}italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, y2=yv21subscript𝑦2subscriptysubscript𝑣21y_{2}=\mathrm{y}_{v_{2}-1}italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT. ∎

Proposition 4.6.

The functions ϕuvsuperscriptitalic-ϕ𝑢𝑣\phi^{uv}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT in Proposition 4.1 are Moebius transforms of each other and the functions ψuvsuperscript𝜓𝑢𝑣\psi^{uv}italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT are also Moebius transforms of each other. In other words there exist functions ϕ:[0,1]:italic-ϕ01\phi:[0,1]\to\mathbb{R}italic_ϕ : [ 0 , 1 ] → blackboard_R and ψ:[0,1]:𝜓01\psi:[0,1]\to\mathbb{R}italic_ψ : [ 0 , 1 ] → blackboard_R, such that for each u,v𝑢𝑣u,vitalic_u , italic_v, we have

ϕuv(x)=α1uvϕ(x)+β1uvγ1uvϕ(x)+δ1uv,x[xu1,xu];ψuv(y)=α2uvψ(y)+β2uvγ2uvψ(y)+δ2uv,y[yv1,yv].formulae-sequencesuperscriptitalic-ϕ𝑢𝑣𝑥superscriptsubscript𝛼1𝑢𝑣italic-ϕ𝑥superscriptsubscript𝛽1𝑢𝑣superscriptsubscript𝛾1𝑢𝑣italic-ϕ𝑥superscriptsubscript𝛿1𝑢𝑣formulae-sequence𝑥subscriptx𝑢1subscriptx𝑢formulae-sequencesuperscript𝜓𝑢𝑣𝑦superscriptsubscript𝛼2𝑢𝑣𝜓𝑦superscriptsubscript𝛽2𝑢𝑣superscriptsubscript𝛾2𝑢𝑣𝜓𝑦superscriptsubscript𝛿2𝑢𝑣𝑦subscripty𝑣1subscripty𝑣\phi^{uv}(x)=\frac{\alpha_{1}^{uv}\phi(x)+\beta_{1}^{uv}}{\gamma_{1}^{uv}\phi(% x)+\delta_{1}^{uv}},\quad x\in[\mathrm{x}_{u-1},\mathrm{x}_{u}];\qquad\psi^{uv% }(y)=\frac{\alpha_{2}^{uv}\psi(y)+\beta_{2}^{uv}}{\gamma_{2}^{uv}\psi(y)+% \delta_{2}^{uv}},\quad y\in[\mathrm{y}_{v-1},\mathrm{y}_{v}].italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x ) = divide start_ARG italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT italic_ϕ ( italic_x ) + italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT italic_ϕ ( italic_x ) + italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT end_ARG , italic_x ∈ [ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ] ; italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_y ) = divide start_ARG italic_α start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT italic_ψ ( italic_y ) + italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT italic_ψ ( italic_y ) + italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT end_ARG , italic_y ∈ [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ] .
Proof.

Note that for different values of u𝑢uitalic_u the functions ϕuvsuperscriptitalic-ϕ𝑢𝑣\phi^{uv}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT have different domains of definitions (which might only overlap by a point), hence, the statement of being a Moebius transform of each other is trivially true. Hence, we only fix u𝑢uitalic_u and study what happens as we vary v𝑣vitalic_v. Note that only those v𝑣vitalic_v where Iuv=1subscript𝐼𝑢𝑣1I_{uv}=1italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1 matter, and take two adjacent such v𝑣vitalic_vs, i.e. suppose v1<v2subscript𝑣1subscript𝑣2v_{1}<v_{2}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are as in Proposition 4.4. Using (34) and substituting (25) for guv1superscript𝑔𝑢subscript𝑣1g^{uv_{1}}italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, guv2superscript𝑔𝑢subscript𝑣2g^{uv_{2}}italic_g start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, we get:

(37) x[yψuv2(yv21)ϕuv2(x)ψuv2(yv21)]=x[Cu,v1v2yψuv1(yv1)ϕuv1(x)ψuv1(yv1)]𝑥delimited-[]𝑦superscript𝜓𝑢subscript𝑣2subscriptysubscript𝑣21superscriptitalic-ϕ𝑢subscript𝑣2𝑥superscript𝜓𝑢subscript𝑣2subscriptysubscript𝑣21𝑥delimited-[]subscript𝐶𝑢subscript𝑣1subscript𝑣2𝑦superscript𝜓𝑢subscript𝑣1subscriptysubscript𝑣1superscriptitalic-ϕ𝑢subscript𝑣1𝑥superscript𝜓𝑢subscript𝑣1subscriptysubscript𝑣1\frac{\partial}{\partial x}\left[\frac{\frac{\partial}{\partial y}\psi^{uv_{2}% }(\mathrm{y}_{v_{2}-1})}{\phi^{uv_{2}}(x)-\psi^{uv_{2}}(\mathrm{y}_{v_{2}-1})}% \right]=\frac{\partial}{\partial x}\left[\frac{C_{u,v_{1}\to v_{2}}\frac{% \partial}{\partial y}\psi^{uv_{1}}(\mathrm{y}_{v_{1}})}{\phi^{uv_{1}}(x)-\psi^% {uv_{1}}(\mathrm{y}_{v_{1}})}\right]divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG [ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) end_ARG ] = divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG [ divide start_ARG italic_C start_POSTSUBSCRIPT italic_u , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT → italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG ]

Hence, for a constants C1subscript𝐶1C_{1}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, C3subscript𝐶3C_{3}italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, C4subscript𝐶4C_{4}italic_C start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT, and C5subscript𝐶5C_{5}italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT, not depending on x𝑥xitalic_x, we have

C1ϕuv2(x)C2=C3ϕuv1(x)C4+C5.subscript𝐶1superscriptitalic-ϕ𝑢subscript𝑣2𝑥subscript𝐶2subscript𝐶3superscriptitalic-ϕ𝑢subscript𝑣1𝑥subscript𝐶4subscript𝐶5\frac{C_{1}}{\phi^{uv_{2}}(x)-C_{2}}=\frac{C_{3}}{\phi^{uv_{1}}(x)-C_{4}}+C_{5}.divide start_ARG italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x ) - italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG = divide start_ARG italic_C start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x ) - italic_C start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_ARG + italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT .

Therefore, ϕuv2(x)superscriptitalic-ϕ𝑢subscript𝑣2𝑥\phi^{uv_{2}}(x)italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x ) and ϕuv1(x)superscriptitalic-ϕ𝑢subscript𝑣1𝑥\phi^{uv_{1}}(x)italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x ) are Moebius transforms of each other. The argument for ψ𝜓\psiitalic_ψ is the same. ∎

For a general matrix I𝐼Iitalic_I, all the functions ϕuvsuperscriptitalic-ϕ𝑢𝑣\phi^{uv}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT and ψuvsuperscript𝜓𝑢𝑣\psi^{uv}italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT need to be related to each other using Propositions 4.4, 4.5, and 4.6. An example will be given in Section 5.2. For convex I𝐼Iitalic_I there is a simplification.

Proposition 4.7.

Suppose that I𝐼Iitalic_I is convex. Then all functions ϕuvsuperscriptitalic-ϕ𝑢𝑣\phi^{uv}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT and ψuvsuperscript𝜓𝑢𝑣\psi^{uv}italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT in Proposition 4.1 can be chosen to not depend on u𝑢uitalic_u or v𝑣vitalic_v. The resulting ϕitalic-ϕ\phiitalic_ϕ and ψ𝜓\psiitalic_ψ can be chosen to be continuous functions from [0,1]01[0,1][ 0 , 1 ] to the extended real line {}\mathbb{R}\cup\{\infty\}blackboard_R ∪ { ∞ } with finitely many \infty values.

Proof.

For each v=1,2,,𝑣12v=1,2,\dots,\ellitalic_v = 1 , 2 , … , roman_ℓ, we let {u(v),u(v)+1,,u+(v)}subscript𝑢𝑣subscript𝑢𝑣1subscript𝑢𝑣\{u_{-}(v),u_{-}(v)+1,\dots,u_{+}(v)\}{ italic_u start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ( italic_v ) , italic_u start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ( italic_v ) + 1 , … , italic_u start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ( italic_v ) } denote the maximal segment of indices such that Iuv=1subscript𝐼𝑢𝑣1I_{uv}=1italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1. Consider the rectangle [xu(v)1,xu+(v)]×[yv1,yv]subscriptxsubscript𝑢𝑣1subscriptxsubscript𝑢𝑣subscripty𝑣1subscripty𝑣[\mathrm{x}_{u_{-}(v)-1},\mathrm{x}_{u_{+}(v)}]\times[\mathrm{y}_{v-1},\mathrm% {y}_{v}][ roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ( italic_v ) - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ( italic_v ) end_POSTSUBSCRIPT ] × [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ]. Repeating the argument of Proposition 4.1, we can choose continuous functions ϕvsuperscriptitalic-ϕ𝑣\phi^{v}italic_ϕ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT and ψvsuperscript𝜓𝑣\psi^{v}italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT, such that

(38) guv(x,y)=1rxϕv(x)yψv(y)[ϕv(x)ψv(y)]2,u{u(v),u(v)+1,,u+(v)}.formulae-sequencesuperscript𝑔𝑢𝑣𝑥𝑦1𝑟𝑥superscriptitalic-ϕ𝑣𝑥𝑦superscript𝜓𝑣𝑦superscriptdelimited-[]superscriptitalic-ϕ𝑣𝑥superscript𝜓𝑣𝑦2𝑢subscript𝑢𝑣subscript𝑢𝑣1subscript𝑢𝑣g^{uv}(x,y)=-\frac{1}{r}\cdot\frac{\frac{\partial}{\partial x}\phi^{v}(x)\frac% {\partial}{\partial y}\psi^{v}(y)}{[\phi^{v}(x)-\psi^{v}(y)]^{2}},\qquad u\in% \{u_{-}(v),u_{-}(v)+1,\dots,u_{+}(v)\}.italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ⋅ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG italic_ϕ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x ) divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_y ) end_ARG start_ARG [ italic_ϕ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , italic_u ∈ { italic_u start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ( italic_v ) , italic_u start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ( italic_v ) + 1 , … , italic_u start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ( italic_v ) } .

Indeed, the argument of Proposition 4.1 did not use the continuity of g𝑔gitalic_g, and it works equally well for piecewise continuous g𝑔gitalic_g.

Combining Propositions 4.4 and 4.5, we conclude that as we cross the horizontal line y=yv𝑦subscripty𝑣y=\mathrm{y}_{v}italic_y = roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT, the function g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) is being multiplied by a single constant Cvv+1subscript𝐶𝑣𝑣1C_{v\to v+1}italic_C start_POSTSUBSCRIPT italic_v → italic_v + 1 end_POSTSUBSCRIPT. Hence, for each v=1,2,,1𝑣121v=1,2,\dots,\ell-1italic_v = 1 , 2 , … , roman_ℓ - 1 we have a version of (37), which now reads:

(39) x[yψv+1(yv)ϕv+1(x)ψv+1(yv)]=x[Cvv+1yψv(yv)ϕv(x)ψv(yv)], or 𝑥delimited-[]𝑦superscript𝜓𝑣1subscripty𝑣superscriptitalic-ϕ𝑣1𝑥superscript𝜓𝑣1subscripty𝑣𝑥delimited-[]subscript𝐶𝑣𝑣1𝑦superscript𝜓𝑣subscripty𝑣superscriptitalic-ϕ𝑣𝑥superscript𝜓𝑣subscripty𝑣 or \frac{\partial}{\partial x}\left[\frac{\frac{\partial}{\partial y}\psi^{v+1}(% \mathrm{y}_{v})}{\phi^{v+1}(x)-\psi^{v+1}(\mathrm{y}_{v})}\right]=\frac{% \partial}{\partial x}\left[\frac{C_{v\to v+1}\frac{\partial}{\partial y}\psi^{% v}(\mathrm{y}_{v})}{\phi^{v}(x)-\psi^{v}(\mathrm{y}_{v})}\right],\qquad\text{ % or }divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG [ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG ] = divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG [ divide start_ARG italic_C start_POSTSUBSCRIPT italic_v → italic_v + 1 end_POSTSUBSCRIPT divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG ] , or
(40) yψv+1(yv)ϕv+1(x)ψv+1(yv)+C~vv+1=Cvv+1yψv(yv)ϕv(x)ψv(yv), or 𝑦superscript𝜓𝑣1subscripty𝑣superscriptitalic-ϕ𝑣1𝑥superscript𝜓𝑣1subscripty𝑣subscript~𝐶𝑣𝑣1subscript𝐶𝑣𝑣1𝑦superscript𝜓𝑣subscripty𝑣superscriptitalic-ϕ𝑣𝑥superscript𝜓𝑣subscripty𝑣 or \frac{\frac{\partial}{\partial y}\psi^{v+1}(\mathrm{y}_{v})}{\phi^{v+1}(x)-% \psi^{v+1}(\mathrm{y}_{v})}+\tilde{C}_{v\to v+1}=\frac{C_{v\to v+1}\frac{% \partial}{\partial y}\psi^{v}(\mathrm{y}_{v})}{\phi^{v}(x)-\psi^{v}(\mathrm{y}% _{v})},\qquad\text{ or }divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG + over~ start_ARG italic_C end_ARG start_POSTSUBSCRIPT italic_v → italic_v + 1 end_POSTSUBSCRIPT = divide start_ARG italic_C start_POSTSUBSCRIPT italic_v → italic_v + 1 end_POSTSUBSCRIPT divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG , or
(41) Cvv+1yψv(yv)[yψv+1(yv)ϕv+1(x)ψv+1(yv)+C~vv+1]1+ψv(yv)=ϕv(x).subscript𝐶𝑣𝑣1𝑦superscript𝜓𝑣subscripty𝑣superscriptdelimited-[]𝑦superscript𝜓𝑣1subscripty𝑣superscriptitalic-ϕ𝑣1𝑥superscript𝜓𝑣1subscripty𝑣subscript~𝐶𝑣𝑣11superscript𝜓𝑣subscripty𝑣superscriptitalic-ϕ𝑣𝑥C_{v\to v+1}\frac{\partial}{\partial y}\psi^{v}(\mathrm{y}_{v})\left[\frac{% \frac{\partial}{\partial y}\psi^{v+1}(\mathrm{y}_{v})}{\phi^{v+1}(x)-\psi^{v+1% }(\mathrm{y}_{v})}+\tilde{C}_{v\to v+1}\right]^{-1}+\psi^{v}(\mathrm{y}_{v})=% \phi^{v}(x).italic_C start_POSTSUBSCRIPT italic_v → italic_v + 1 end_POSTSUBSCRIPT divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) [ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG + over~ start_ARG italic_C end_ARG start_POSTSUBSCRIPT italic_v → italic_v + 1 end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) = italic_ϕ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x ) .

We now use (41) to sequentially redefine the functions ϕvsuperscriptitalic-ϕ𝑣\phi^{v}italic_ϕ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT and ψvsuperscript𝜓𝑣\psi^{v}italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT for v=2,3,,𝑣23v=2,3,\dots,\ellitalic_v = 2 , 3 , … , roman_ℓ. Functions ϕ1superscriptitalic-ϕ1\phi^{1}italic_ϕ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT and ψ1superscript𝜓1\psi^{1}italic_ψ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT are unchanged. We replace (ϕ2,ψ2)superscriptitalic-ϕ2superscript𝜓2(\phi^{2},\psi^{2})( italic_ϕ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_ψ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) with (ϕ~2,ψ~2)superscript~italic-ϕ2superscript~𝜓2(\tilde{\phi}^{2},\tilde{\psi}^{2})( over~ start_ARG italic_ϕ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , over~ start_ARG italic_ψ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) defined by:

ϕ~2(x)superscript~italic-ϕ2𝑥\displaystyle\tilde{\phi}^{2}(x)over~ start_ARG italic_ϕ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x ) :=C12yψ1(y1)[yψ2(y1)ϕ2(x)ψ2(y1)+C~12]1+ψ1(y1),assignabsentsubscript𝐶12𝑦superscript𝜓1subscripty1superscriptdelimited-[]𝑦superscript𝜓2subscripty1superscriptitalic-ϕ2𝑥superscript𝜓2subscripty1subscript~𝐶121superscript𝜓1subscripty1\displaystyle:=C_{1\to 2}\frac{\partial}{\partial y}\psi^{1}(\mathrm{y}_{1})% \left[\frac{\frac{\partial}{\partial y}\psi^{2}(\mathrm{y}_{1})}{\phi^{2}(x)-% \psi^{2}(\mathrm{y}_{1})}+\tilde{C}_{1\to 2}\right]^{-1}+\psi^{1}(\mathrm{y}_{% 1}),:= italic_C start_POSTSUBSCRIPT 1 → 2 end_POSTSUBSCRIPT divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) [ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG + over~ start_ARG italic_C end_ARG start_POSTSUBSCRIPT 1 → 2 end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + italic_ψ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ,
ψ~2(y)superscript~𝜓2𝑦\displaystyle\tilde{\psi}^{2}(y)over~ start_ARG italic_ψ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_y ) :=C12yψ1(y1)[yψ2(y1)ψ2(y)ψ2(y1)+C~12]1+ψ1(y1).assignabsentsubscript𝐶12𝑦superscript𝜓1subscripty1superscriptdelimited-[]𝑦superscript𝜓2subscripty1superscript𝜓2𝑦superscript𝜓2subscripty1subscript~𝐶121superscript𝜓1subscripty1\displaystyle:=C_{1\to 2}\frac{\partial}{\partial y}\psi^{1}(\mathrm{y}_{1})% \left[\frac{\frac{\partial}{\partial y}\psi^{2}(\mathrm{y}_{1})}{\psi^{2}(y)-% \psi^{2}(\mathrm{y}_{1})}+\tilde{C}_{1\to 2}\right]^{-1}+\psi^{1}(\mathrm{y}_{% 1}).:= italic_C start_POSTSUBSCRIPT 1 → 2 end_POSTSUBSCRIPT divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) [ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_y ) - italic_ψ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG + over~ start_ARG italic_C end_ARG start_POSTSUBSCRIPT 1 → 2 end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + italic_ψ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) .

As we noticed in Remark 4.3, the simultaneous Moebius transformations of ϕitalic-ϕ\phiitalic_ϕ and ψ𝜓\psiitalic_ψ do not change g𝑔gitalic_g. Hence, the redefined (ϕ2,ψ2)superscriptitalic-ϕ2superscript𝜓2(\phi^{2},\psi^{2})( italic_ϕ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_ψ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) still satisfy (38). On the other hand, comparing with (41), we see that ϕ~2(x)=ϕ1(x)superscript~italic-ϕ2𝑥superscriptitalic-ϕ1𝑥\tilde{\phi}^{2}(x)=\phi^{1}(x)over~ start_ARG italic_ϕ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x ) = italic_ϕ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x ) for all x𝑥xitalic_x for which both functions are simultaneously defined. Repeating this procedure for all larger v=3,4,,𝑣34v=3,4,\dots,\ellitalic_v = 3 , 4 , … , roman_ℓ, we achieve that all ϕitalic-ϕ\phiitalic_ϕ functions are the same. All ψ𝜓\psiitalic_ψ functions have different domains of definitions, hence, they can also be thought as being independent of v𝑣vitalic_v.

We also need to check the continuity. Note that ϕvsuperscriptitalic-ϕ𝑣\phi^{v}italic_ϕ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT were continuous (taking values in extended real line {}\mathbb{R}\cup\{\infty\}blackboard_R ∪ { ∞ } by construction, because (30) is a continuous function of x𝑥xitalic_x; the continuity is preserved under Moebius transformations, hence, this the final ϕ(x)italic-ϕ𝑥\phi(x)italic_ϕ ( italic_x ) is continuous in x𝑥xitalic_x.

For the ψ𝜓\psiitalic_ψ function we need a separate argument, as it might have been discontinuous at y=yv𝑦subscripty𝑣y=\mathrm{y}_{v}italic_y = roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT for some v𝑣vitalic_v. In order to see that this does not happen, we go back to (40). Taking into account that after redefining ϕitalic-ϕ\phiitalic_ϕ, it no longer depends on v𝑣vitalic_v, we have

(42) yψv+1(yv)ϕ(x)ψv+1(yv)+C~vv+1=Cvv+1yψv(yv)ϕ(x)ψv(yv).𝑦superscript𝜓𝑣1subscripty𝑣italic-ϕ𝑥superscript𝜓𝑣1subscripty𝑣subscript~𝐶𝑣𝑣1subscript𝐶𝑣𝑣1𝑦superscript𝜓𝑣subscripty𝑣italic-ϕ𝑥superscript𝜓𝑣subscripty𝑣\frac{\frac{\partial}{\partial y}\psi^{v+1}(\mathrm{y}_{v})}{\phi(x)-\psi^{v+1% }(\mathrm{y}_{v})}+\tilde{C}_{v\to v+1}=\frac{C_{v\to v+1}\frac{\partial}{% \partial y}\psi^{v}(\mathrm{y}_{v})}{\phi(x)-\psi^{v}(\mathrm{y}_{v})}.divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG + over~ start_ARG italic_C end_ARG start_POSTSUBSCRIPT italic_v → italic_v + 1 end_POSTSUBSCRIPT = divide start_ARG italic_C start_POSTSUBSCRIPT italic_v → italic_v + 1 end_POSTSUBSCRIPT divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ϕ ( italic_x ) - italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG .

Treating everything except ϕ(x)italic-ϕ𝑥\phi(x)italic_ϕ ( italic_x ) as constants, (42) is an identity of two functions of x𝑥xitalic_x. Clearly, this identity can be true only if ψv+1(yv)=ψv(yv)superscript𝜓𝑣1subscripty𝑣superscript𝜓𝑣subscripty𝑣\psi^{v+1}(\mathrm{y}_{v})=\psi^{v}(\mathrm{y}_{v})italic_ψ start_POSTSUPERSCRIPT italic_v + 1 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) = italic_ψ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ), which proves the desired continuity. ∎

Remark 4.8.

In several examples we checked with non-convex I𝐼Iitalic_I, it was impossible to find a solution of the EL-equations of the form (25) with ϕuvsuperscriptitalic-ϕ𝑢𝑣\phi^{uv}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT and ψuvsuperscript𝜓𝑢𝑣\psi^{uv}italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT not depending on u𝑢uitalic_u or v𝑣vitalic_v.

4.2. Algebraic equations for convex I𝐼Iitalic_I

Summarizing the developments of the previous section, in order to solve the EL-equations for the permuton energy with convex I𝐼Iitalic_I we need to find two continuous functions

ϕ:[0,1]{},ψ:[0,1]{},:italic-ϕmaps-to01𝜓:maps-to01\phi:[0,1]\mapsto\mathbb{R}\cup\{\infty\},\qquad\psi:[0,1]\mapsto\mathbb{R}% \cup\{\infty\},italic_ϕ : [ 0 , 1 ] ↦ blackboard_R ∪ { ∞ } , italic_ψ : [ 0 , 1 ] ↦ blackboard_R ∪ { ∞ } ,

such that the function

g(x,y)=1rxϕ(x)yψ(y)[ϕ(x)ψ(y)]2𝑔𝑥𝑦1𝑟𝑥italic-ϕ𝑥𝑦𝜓𝑦superscriptdelimited-[]italic-ϕ𝑥𝜓𝑦2g(x,y)=-\frac{1}{r}\cdot\frac{\frac{\partial}{\partial x}\phi(x)\frac{\partial% }{\partial y}\psi(y)}{[\phi(x)-\psi(y)]^{2}}italic_g ( italic_x , italic_y ) = - divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ⋅ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG italic_ϕ ( italic_x ) divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ ( italic_y ) end_ARG start_ARG [ italic_ϕ ( italic_x ) - italic_ψ ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG

is non-negative whenever (x,y)(xu1,xu)×(yv1,yv)𝑥𝑦subscriptx𝑢1subscriptx𝑢subscripty𝑣1subscripty𝑣(x,y)\in(\mathrm{x}_{u-1},\mathrm{x}_{u})\times(\mathrm{y}_{v-1},\mathrm{y}_{v})( italic_x , italic_y ) ∈ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) × ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) with Iuv=1subscript𝐼𝑢𝑣1I_{uv}=1italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1 and satisfies the permuton conditions (equivalent to the boundry conditions on hhitalic_h):

(43) v=1Iuvyv1yvg(x,y)dy=1,x(xu1,xu),1uk,formulae-sequencesuperscriptsubscript𝑣1subscript𝐼𝑢𝑣superscriptsubscriptsubscripty𝑣1subscripty𝑣𝑔𝑥𝑦differential-d𝑦1formulae-sequence𝑥subscriptx𝑢1subscriptx𝑢1𝑢𝑘\sum_{v=1}^{\ell}I_{uv}\int_{\mathrm{y}_{v-1}}^{\mathrm{y}_{v}}g(x,y)\mathrm{d% }y=1,\qquad x\in(\mathrm{x}_{u-1},\mathrm{x}_{u}),\quad 1\leq u\leq k,∑ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_y = 1 , italic_x ∈ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) , 1 ≤ italic_u ≤ italic_k ,
(44) u=1kIuvxu1xug(x,y)dx=1,y(yv1,yv),1v.formulae-sequencesuperscriptsubscript𝑢1𝑘subscript𝐼𝑢𝑣superscriptsubscriptsubscriptx𝑢1subscriptx𝑢𝑔𝑥𝑦differential-d𝑥1formulae-sequence𝑦subscripty𝑣1subscripty𝑣1𝑣\sum_{u=1}^{k}I_{uv}\int_{\mathrm{x}_{u-1}}^{\mathrm{x}_{u}}g(x,y)\mathrm{d}x=% 1,\qquad y\in(\mathrm{y}_{v-1},\mathrm{y}_{v}),\quad 1\leq v\leq\ell.∑ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_x = 1 , italic_y ∈ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) , 1 ≤ italic_v ≤ roman_ℓ .

The algorithm for finding functions ϕitalic-ϕ\phiitalic_ϕ and ψ𝜓\psiitalic_ψ is given in the following theorem.

Theorem 4.9.

The k++2𝑘2k+\ell+2italic_k + roman_ℓ + 2 numbers ϕ(xu)italic-ϕsubscriptx𝑢\phi(\mathrm{x}_{u})italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ), 0uk0𝑢𝑘0\leq u\leq k0 ≤ italic_u ≤ italic_k, ψ(yv)𝜓subscripty𝑣\psi(\mathrm{y}_{v})italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ), 0v0𝑣0\leq v\leq\ell0 ≤ italic_v ≤ roman_ℓ, are found by solving a system of k+𝑘k+\ellitalic_k + roman_ℓ polynomial equations:

(45) er(xuxu1)superscript𝑒𝑟subscriptx𝑢subscriptx𝑢1\displaystyle e^{r(\mathrm{x}_{u}-\mathrm{x}_{u-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =v=1[ψ(yv)ϕ(xu1)ψ(yv1)ϕ(xu1)ψ(yv1)ϕ(xu)ψ(yv)ϕ(xu)]Iuv,1uk,formulae-sequenceabsentsuperscriptsubscriptproduct𝑣1superscriptdelimited-[]𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢subscript𝐼𝑢𝑣1𝑢𝑘\displaystyle=\prod_{v=1}^{\ell}\left[\frac{\psi(\mathrm{y}_{v})-\phi(\mathrm{% x}_{u-1})}{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u-1})}\cdot\frac{\psi(% \mathrm{y}_{v-1})-\phi(\mathrm{x}_{u})}{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{% u})}\right]^{I_{uv}},\quad 1\leq u\leq k,= ∏ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , 1 ≤ italic_u ≤ italic_k ,
(46) er(yvyv1)superscript𝑒𝑟subscripty𝑣subscripty𝑣1\displaystyle e^{r(\mathrm{y}_{v}-\mathrm{y}_{v-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =u=1k[ψ(yv)ϕ(xu1)ψ(yv1)ϕ(xu1)ψ(yv1)ϕ(xu)ψ(yv)ϕ(xu)]Iuv,1v.formulae-sequenceabsentsuperscriptsubscriptproduct𝑢1𝑘superscriptdelimited-[]𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢subscript𝐼𝑢𝑣1𝑣\displaystyle=\prod_{u=1}^{k}\left[\frac{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_% {u-1})}{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u-1})}\cdot\frac{\psi(\mathrm{% y}_{v-1})-\phi(\mathrm{x}_{u})}{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{u})}% \right]^{I_{uv}},\quad 1\leq v\leq\ell.= ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , 1 ≤ italic_v ≤ roman_ℓ .

The values at other points are found from polynomial equations in variables ϕ(x)italic-ϕ𝑥\phi(x)italic_ϕ ( italic_x ) and ψ(x)𝜓𝑥\psi(x)italic_ψ ( italic_x ):

(47) er(xxu1)superscript𝑒𝑟𝑥subscriptx𝑢1\displaystyle e^{r(x-\mathrm{x}_{u-1})}italic_e start_POSTSUPERSCRIPT italic_r ( italic_x - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =v=1[ψ(yv1)ϕ(x)ψ(yv)ϕ(x)ψ(yv)ϕ(xu1)ψ(yv1)ϕ(xu1)]Iuv,x[xu1,xu],1uk,formulae-sequenceabsentsuperscriptsubscriptproduct𝑣1superscriptdelimited-[]𝜓subscripty𝑣1italic-ϕ𝑥𝜓subscripty𝑣italic-ϕ𝑥𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1subscript𝐼𝑢𝑣formulae-sequence𝑥subscriptx𝑢1subscriptx𝑢1𝑢𝑘\displaystyle=\prod_{v=1}^{\ell}\left[\frac{\psi(\mathrm{y}_{v-1})-\phi(x)}{% \psi(\mathrm{y}_{v})-\phi(x)}\cdot\frac{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{% u-1})}{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u-1})}\right]^{I_{uv}},\qquad x% \in[\mathrm{x}_{u-1},\mathrm{x}_{u}],\quad 1\leq u\leq k,= ∏ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( italic_x ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( italic_x ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_x ∈ [ roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ] , 1 ≤ italic_u ≤ italic_k ,
er(yyv1)superscript𝑒𝑟𝑦subscripty𝑣1\displaystyle e^{r(y-\mathrm{y}_{v-1})}italic_e start_POSTSUPERSCRIPT italic_r ( italic_y - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =u=1k[ψ(y)ϕ(xu1)ψ(y)ϕ(xu)ψ(yv1)ϕ(xu)ψ(yv1)ϕ(xu1)]Iuv,y[yv1,yv],1v.formulae-sequenceabsentsuperscriptsubscriptproduct𝑢1𝑘superscriptdelimited-[]𝜓𝑦italic-ϕsubscriptx𝑢1𝜓𝑦italic-ϕsubscriptx𝑢𝜓subscripty𝑣1italic-ϕsubscriptx𝑢𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1subscript𝐼𝑢𝑣formulae-sequence𝑦subscripty𝑣1subscripty𝑣1𝑣\displaystyle=\prod_{u=1}^{k}\left[\frac{\psi(y)-\phi(\mathrm{x}_{u-1})}{\psi(% y)-\phi(\mathrm{x}_{u})}\cdot\frac{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u})% }{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u-1})}\right]^{I_{uv}},\qquad y\in[% \mathrm{y}_{v-1},\mathrm{y}_{v}],\quad 1\leq v\leq\ell.= ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( italic_y ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( italic_y ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_y ∈ [ roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ] , 1 ≤ italic_v ≤ roman_ℓ .
Remark 4.10.

Recall from Remark 4.3 that functions ϕitalic-ϕ\phiitalic_ϕ and ψ𝜓\psiitalic_ψ are defined up to 3333–dimensional group of Moebius transformations. Hence, we can fix three out of k++2𝑘2k+\ell+2italic_k + roman_ℓ + 2 variables ϕ(xu)italic-ϕsubscriptx𝑢\phi(\mathrm{x}_{u})italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ), ψ(yv)𝜓subscripty𝑣\psi(\mathrm{y}_{v})italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) in arbitrary ways, and find the remaining k+1𝑘1k+\ell-1italic_k + roman_ℓ - 1 variables from (45),(46). One equation is abundant: the product of all k𝑘kitalic_k equations (45) is the same as the produc of all \ellroman_ℓ equations (46). Therefore, eventually the number of variables matches the number of equations.

Proof of Theorem 4.9.

We rewrite g(x,y)=1r2xyln(ϕ(x)ψ(y))𝑔𝑥𝑦1𝑟superscript2𝑥𝑦italic-ϕ𝑥𝜓𝑦g(x,y)=-\frac{1}{r}\frac{\partial^{2}}{\partial x\partial y}\ln\bigl{(}\phi(x)% -\psi(y)\bigr{)}italic_g ( italic_x , italic_y ) = - divide start_ARG 1 end_ARG start_ARG italic_r end_ARG divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_x ∂ italic_y end_ARG roman_ln ( italic_ϕ ( italic_x ) - italic_ψ ( italic_y ) ) and compute

(48) yv1yvxu1xug(x,y)dxdy=1rln[(ϕ(xu)ψ(yv))(ϕ(xu1)ψ(yv1))(ϕ(xu1)ψ(yv))(ϕ(xu)ψ(yv1))].superscriptsubscriptsubscripty𝑣1subscripty𝑣superscriptsubscriptsubscriptx𝑢1subscriptx𝑢𝑔𝑥𝑦differential-d𝑥differential-d𝑦1𝑟italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣italic-ϕsubscriptx𝑢𝜓subscripty𝑣1\int_{\mathrm{y}_{v-1}}^{\mathrm{y}_{v}}\int_{\mathrm{x}_{u-1}}^{\mathrm{x}_{u% }}g(x,y)\,\mathrm{d}x\mathrm{d}y=-\frac{1}{r}\ln\left[\frac{(\phi(\mathrm{x}_{% u})-\psi(\mathrm{y}_{v}))(\phi(\mathrm{x}_{u-1})-\psi(\mathrm{y}_{v-1}))}{(% \phi(\mathrm{x}_{u-1})-\psi(\mathrm{y}_{v}))(\phi(\mathrm{x}_{u})-\psi(\mathrm% {y}_{v-1}))}\right].∫ start_POSTSUBSCRIPT roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_x roman_d italic_y = - divide start_ARG 1 end_ARG start_ARG italic_r end_ARG roman_ln [ divide start_ARG ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ) ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) end_ARG start_ARG ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ) ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) end_ARG ] .

Hence, integrating (43) from xu1subscriptx𝑢1\mathrm{x}_{u-1}roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT to xusubscriptx𝑢\mathrm{x}_{u}roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, we get

r(xuxu1)=v=1Iuvln[(ϕ(xu)ψ(yv))(ϕ(xu1)ψ(yv1))(ϕ(xu1)ψ(yv))(ϕ(xu)ψ(yv1))].𝑟subscriptx𝑢subscriptx𝑢1superscriptsubscript𝑣1subscript𝐼𝑢𝑣italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣italic-ϕsubscriptx𝑢𝜓subscripty𝑣1r(\mathrm{x}_{u}-\mathrm{x}_{u-1})=-\sum_{v=1}^{\ell}I_{uv}\ln\left[\frac{(% \phi(\mathrm{x}_{u})-\psi(\mathrm{y}_{v}))(\phi(\mathrm{x}_{u-1})-\psi(\mathrm% {y}_{v-1}))}{(\phi(\mathrm{x}_{u-1})-\psi(\mathrm{y}_{v}))(\phi(\mathrm{x}_{u}% )-\psi(\mathrm{y}_{v-1}))}\right].italic_r ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) = - ∑ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT roman_ln [ divide start_ARG ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ) ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) end_ARG start_ARG ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ) ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) end_ARG ] .

Exponentiating, we get (45). Similarly, integrating (44) from yv1subscripty𝑣1\mathrm{y}_{v-1}roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT to yvsubscripty𝑣\mathrm{y}_{v}roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT and exponentiating, we get (46).

By the same trick, the first equation of (47) is obtained by integrating (43) from xu1subscriptx𝑢1\mathrm{x}_{u-1}roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT and then exponentiating. The second equation is obtained by integrating (44) from yv1subscripty𝑣1\mathrm{y}_{v-1}roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT to y𝑦yitalic_y and then exponentiating. ∎

Theorem 4.9 has the following corollary.

Corollary 4.11.

For I𝐼Iitalic_I convex, the functions ϕ(x)italic-ϕ𝑥\phi(x)italic_ϕ ( italic_x ) and ψ(y)𝜓𝑦\psi(y)italic_ψ ( italic_y ) are Moebius transformations of erxsuperscript𝑒𝑟𝑥e^{rx}italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT and erysuperscript𝑒𝑟𝑦e^{ry}italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT, respectively, on each rectangle. Consequently g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) has the form

(49) guv(x,y)=(adbc)rer(x+y)(a+bery+cerx+der(x+y))2superscript𝑔𝑢𝑣𝑥𝑦𝑎𝑑𝑏𝑐𝑟superscript𝑒𝑟𝑥𝑦superscript𝑎𝑏superscript𝑒𝑟𝑦𝑐superscript𝑒𝑟𝑥𝑑superscript𝑒𝑟𝑥𝑦2g^{uv}(x,y)=-\frac{(ad-bc)re^{r(x+y)}}{(a+be^{ry}+ce^{rx}+de^{r(x+y)})^{2}}italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - divide start_ARG ( italic_a italic_d - italic_b italic_c ) italic_r italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_a + italic_b italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT + italic_c italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT + italic_d italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG

on each rectangle, where the constants a,b,c,d𝑎𝑏𝑐𝑑a,b,c,ditalic_a , italic_b , italic_c , italic_d depend on u,v𝑢𝑣u,vitalic_u , italic_v.

Proof.

Because I𝐼Iitalic_I is convex, the equations (47) telescope, leading to, for each u𝑢uitalic_u,

er(xxu1)=CuAuϕ(x)Buϕ(x)superscript𝑒𝑟𝑥subscript𝑥𝑢1subscript𝐶𝑢subscript𝐴𝑢italic-ϕ𝑥subscript𝐵𝑢italic-ϕ𝑥e^{r(x-x_{u-1})}=C_{u}\frac{A_{u}-\phi(x)}{B_{u}-\phi(x)}italic_e start_POSTSUPERSCRIPT italic_r ( italic_x - italic_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT = italic_C start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT divide start_ARG italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - italic_ϕ ( italic_x ) end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - italic_ϕ ( italic_x ) end_ARG

for some constants Au,Bu,Cusubscript𝐴𝑢subscript𝐵𝑢subscript𝐶𝑢A_{u},B_{u},C_{u}italic_A start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , italic_B start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, and similarly for ψ(y)𝜓𝑦\psi(y)italic_ψ ( italic_y ) for each v𝑣vitalic_v. This implies the first statement. Now applying (25) yields the general form (49) of g𝑔gitalic_g. ∎

Theorem 4.9 indicates that finding the limit shapes for restricted random permutations for convex I𝐼Iitalic_I crucially depends on being able to solve the system of equations (45), (46). In Sections 4.3, 4.4, 4.5 we explain various approaches for finding the solutions of (45), (46), and then show additional examples in Section 5.

When I𝐼Iitalic_I is non-convex, the algorithm is more complicated. According to Proposition 4.6, the functions ϕuvsuperscriptitalic-ϕ𝑢𝑣\phi^{uv}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT and ψuvsuperscript𝜓𝑢𝑣\psi^{uv}italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT now depend on u𝑢uitalic_u and v𝑣vitalic_v through Moebius transforms and all these transforms should be identified by using the relationships of Proposition 4.4 and 4.5; in principle this can be done (one can check that the number of degrees of freedom matches the number of constraints), yet computationally it is quite hard. After this is done, one can again use formula (45), (46), (47), but with ϕitalic-ϕ\phiitalic_ϕ and ψ𝜓\psiitalic_ψ replaced with ϕuvsuperscriptitalic-ϕ𝑢𝑣\phi^{uv}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT and ψuvsuperscript𝜓𝑢𝑣\psi^{uv}italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT, respectively, in the factor raised to the Iuvsubscript𝐼𝑢𝑣I_{uv}italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT power. Through this approach, we managed to compute the simplest non-convex example in Section 5.2. It would be interesting to find further simplifications, which would allow the investigation of other non-convex situations.

4.3. The case r=0𝑟0r=0italic_r = 0

In the case r=0𝑟0r=0italic_r = 0 the equations (45), (46), and (47) are dramatically simplified.

Theorem 4.12.

For r=0𝑟0r=0italic_r = 0, the maximizer in the variational problem 𝔈(h)max𝔈\mathfrak{E}(h)\to\maxfraktur_E ( italic_h ) → roman_max, h𝔉x,y,Isuperscript𝔉xy𝐼h\in\mathfrak{F}^{\mathrm{x},\mathrm{y},I}italic_h ∈ fraktur_F start_POSTSUPERSCRIPT roman_x , roman_y , italic_I end_POSTSUPERSCRIPT, has the following form in terms of density g=hxy𝑔subscript𝑥𝑦g=-h_{xy}italic_g = - italic_h start_POSTSUBSCRIPT italic_x italic_y end_POSTSUBSCRIPT:

  • On each rectangle (xu1,xu)×(yv1,yv)subscriptx𝑢1subscriptx𝑢subscripty𝑣1subscripty𝑣(\mathrm{x}_{u-1},\mathrm{x}_{u})\times(\mathrm{y}_{v-1},\mathrm{y}_{v})( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) × ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ), the density g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) is constant:

    (50) g(x,y)=guv(xuxu1)(yvyv1),x(xu1,xu),y(yv1,yv).formulae-sequence𝑔𝑥𝑦superscript𝑔𝑢𝑣subscriptx𝑢subscriptx𝑢1subscripty𝑣subscripty𝑣1formulae-sequence𝑥subscriptx𝑢1subscriptx𝑢𝑦subscripty𝑣1subscripty𝑣g(x,y)=\frac{g^{uv}}{(\mathrm{x}_{u}-\mathrm{x}_{u-1})(\mathrm{y}_{v}-\mathrm{% y}_{v-1})},\qquad x\in(\mathrm{x}_{u-1},\mathrm{x}_{u}),\quad y\in(\mathrm{y}_% {v-1},\mathrm{y}_{v}).italic_g ( italic_x , italic_y ) = divide start_ARG italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT end_ARG start_ARG ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG , italic_x ∈ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT , roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) , italic_y ∈ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT , roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) .
  • There exist two sequences of positive reals λ1,λ2,,λksubscript𝜆1subscript𝜆2subscript𝜆𝑘\lambda_{1},\lambda_{2},\dots,\lambda_{k}italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and μ1,μ2,,μsubscript𝜇1subscript𝜇2subscript𝜇\mu_{1},\mu_{2},\dots,\mu_{\ell}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_μ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT, such that guv=Iuvλuμvsuperscript𝑔𝑢𝑣subscript𝐼𝑢𝑣subscript𝜆𝑢subscript𝜇𝑣g^{uv}=I_{uv}\lambda_{u}\mu_{v}italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT = italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT. These sequences are found from the k+𝑘k+\ellitalic_k + roman_ℓ conditions:

    (51) xuxu1subscriptx𝑢subscriptx𝑢1\displaystyle\mathrm{x}_{u}-\mathrm{x}_{u-1}roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT =v=1Iuvλuμv,u=1,2,,k,formulae-sequenceabsentsuperscriptsubscript𝑣1subscript𝐼𝑢𝑣subscript𝜆𝑢subscript𝜇𝑣𝑢12𝑘\displaystyle=\sum_{v=1}^{\ell}I_{uv}\lambda_{u}\mu_{v},\qquad u=1,2,\dots,k,= ∑ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT , italic_u = 1 , 2 , … , italic_k ,
    (52) yvyv1subscripty𝑣subscripty𝑣1\displaystyle\mathrm{y}_{v}-\mathrm{y}_{v-1}roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT =u=1kIuvλuμv,v=1,2,,.formulae-sequenceabsentsuperscriptsubscript𝑢1𝑘subscript𝐼𝑢𝑣subscript𝜆𝑢subscript𝜇𝑣𝑣12\displaystyle=\sum_{u=1}^{k}I_{uv}\lambda_{u}\mu_{v},\qquad v=1,2,\dots,\ell.= ∑ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT , italic_v = 1 , 2 , … , roman_ℓ .
Proof.

We explain why two sequences λ𝜆\lambdaitalic_λ and μ𝜇\muitalic_μ satisfying (51),(52) exist immediately after the proof. Taking this as granted, note that, given (50), the conditions (51),(52) are equivalent to the boundary conditions on hhitalic_h or (43), (44). Simultaneously, the condition (15) with r=0𝑟0r=0italic_r = 0 holds, because g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) factors as a function of x𝑥xitalic_x times a function of y𝑦yitalic_y. Therefore, g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) of (50) satisfies the Euler-Lagrange equations and, hence, by Theorem 3.15 it is a maximizer. ∎

In matrix notation, (51),(52), ask to multiply Iuvsubscript𝐼𝑢𝑣I_{uv}italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT on the right and left by two diagonal matrices, so that its row and column sums become equal to the prescribed constants (e.g. if these constants were all 1111, then we would have been aiming for a bistochastic matrix). This problem is well-studied in the statistics, computer science, economics, and optimal transport literature; the algorithm for solving it was rediscovered many times: some of the names are iterative proportional fitting and Sinkhorn–Knopp algorithm, see [Ide16] for the detailed historic overview and many references.

This is an iterative algorithm, which starts by setting μ[0]=(1,,1)superscript𝜇delimited-[]011\mu^{[0]}=(1,\dots,1)italic_μ start_POSTSUPERSCRIPT [ 0 ] end_POSTSUPERSCRIPT = ( 1 , … , 1 ) and then sequentially updating λ[n]superscript𝜆delimited-[]𝑛\lambda^{[n]}italic_λ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT, μ[n]superscript𝜇delimited-[]𝑛\mu^{[n]}italic_μ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT, n=1,2,3,𝑛123n=1,2,3,\dotsitalic_n = 1 , 2 , 3 , … by the following two rules:

  1. (1)

    Using μ[n1]superscript𝜇delimited-[]𝑛1\mu^{[n-1]}italic_μ start_POSTSUPERSCRIPT [ italic_n - 1 ] end_POSTSUPERSCRIPT, we define λ[n]superscript𝜆delimited-[]𝑛\lambda^{[n]}italic_λ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT so that (51) holds, i.e.:

    λu[n]=xuxu1v=1Iuvμv[n1],1uk.formulae-sequencesubscriptsuperscript𝜆delimited-[]𝑛𝑢subscriptx𝑢subscriptx𝑢1superscriptsubscript𝑣1subscript𝐼𝑢𝑣subscriptsuperscript𝜇delimited-[]𝑛1𝑣1𝑢𝑘\lambda^{[n]}_{u}=\frac{\mathrm{x}_{u}-\mathrm{x}_{u-1}}{\sum_{v=1}^{\ell}I_{% uv}\mu^{[n-1]}_{v}},\qquad 1\leq u\leq k.italic_λ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = divide start_ARG roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_μ start_POSTSUPERSCRIPT [ italic_n - 1 ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT end_ARG , 1 ≤ italic_u ≤ italic_k .
  2. (2)

    Using λ[n]superscript𝜆delimited-[]𝑛\lambda^{[n]}italic_λ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT, we define μ[n]superscript𝜇delimited-[]𝑛\mu^{[n]}italic_μ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT so that (52) holds, i.e.:

    μv[n]=yvyv1u=1kIuvλu[n],1v.formulae-sequencesubscriptsuperscript𝜇delimited-[]𝑛𝑣subscripty𝑣subscripty𝑣1superscriptsubscript𝑢1𝑘subscript𝐼𝑢𝑣subscriptsuperscript𝜆delimited-[]𝑛𝑢1𝑣\mu^{[n]}_{v}=\frac{\mathrm{y}_{v}-\mathrm{y}_{v-1}}{\sum_{u=1}^{k}I_{uv}% \lambda^{[n]}_{u}},\qquad 1\leq v\leq\ell.italic_μ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT = divide start_ARG roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_λ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_ARG , 1 ≤ italic_v ≤ roman_ℓ .

Assume that I𝐼Iitalic_I and {xu}u=1ksuperscriptsubscriptsubscriptx𝑢𝑢1𝑘\{\mathrm{x}_{u}\}_{u=1}^{k}{ roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, {yv}v=1superscriptsubscriptsubscripty𝑣𝑣1\{\mathrm{y}_{v}\}_{v=1}^{\ell}{ roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT are non-degenerate, as in Definition 3.5. Then λ[n]superscript𝜆delimited-[]𝑛\lambda^{[n]}italic_λ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT, μ[n]superscript𝜇delimited-[]𝑛\mu^{[n]}italic_μ start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT of the above iterative algorithm converge as n𝑛n\to\inftyitalic_n → ∞ towards λ𝜆\lambdaitalic_λ and μ𝜇\muitalic_μ solving (51), (52), see [Ide16, Theorem 4.1] and references therein. In practice, the convergence is extremely fast, as many authors show.

4.4. Numeric algorithms for small r𝑟ritalic_r

We next explore how to leverage the existence of the iterative proportional fitting algorithm at r=0𝑟0r=0italic_r = 0, in order to obtain numeric solutions to the polynomial equations (45), (46) for small r𝑟ritalic_r.

We first connect two sets of equations: (45), (46) and (51), (52).

Lemma 4.13.

Make a change of variables ϕ(xu)=1rχ(xu)italic-ϕsubscriptx𝑢1𝑟𝜒subscriptx𝑢\phi(\mathrm{x}_{u})=\frac{1}{r\chi(\mathrm{x}_{u})}italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG italic_r italic_χ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG, take the logarithm of (45), (46), and divide by r𝑟ritalic_r. Sending r0𝑟0r\to 0italic_r → 0, one arrives at the equations (51), (52) under identification

(53) χ(xu)χ(xu1)=λu,1uk,ψ(yv)ψ(yv1)=μv,1v.formulae-sequenceformulae-sequence𝜒subscriptx𝑢𝜒subscriptx𝑢1subscript𝜆𝑢1𝑢𝑘formulae-sequence𝜓subscripty𝑣𝜓subscripty𝑣1subscript𝜇𝑣1𝑣\chi(\mathrm{x}_{u})-\chi(\mathrm{x}_{u-1})=\lambda_{u},\quad 1\leq u\leq k,% \qquad\psi(\mathrm{y}_{v})-\psi(\mathrm{y}_{v-1})=\mu_{v},\quad 1\leq v\leq\ell.italic_χ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) - italic_χ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) = italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , 1 ≤ italic_u ≤ italic_k , italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) = italic_μ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT , 1 ≤ italic_v ≤ roman_ℓ .
Proof.

Once we make the change of variables, we have as r0𝑟0r\to 0italic_r → 0

ln[ψ(yv)ϕ(xu1)ψ(yv1)ϕ(xu1)ψ(yv1)ϕ(xu)ψ(yv)ϕ(xu)]=ln[1rχ(xu1)ψ(yv)1rχ(xu1)ψ(yv1)1rχ(xu)ψ(yv1)1rχ(xu)ψ(yv)]=r[(χ(xu)χ(xu1))(ψ(yv)ψ(yv1))]+o(r).𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝑟𝜒subscriptx𝑢1𝜓subscripty𝑣1𝑟𝜒subscriptx𝑢1𝜓subscripty𝑣11𝑟𝜒subscriptx𝑢𝜓subscripty𝑣11𝑟𝜒subscriptx𝑢𝜓subscripty𝑣𝑟delimited-[]𝜒subscriptx𝑢𝜒subscriptx𝑢1𝜓subscripty𝑣𝜓subscripty𝑣1𝑜𝑟\ln\left[\frac{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{u-1})}{\psi(\mathrm{y}_{v% -1})-\phi(\mathrm{x}_{u-1})}\cdot\frac{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_% {u})}{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{u})}\right]=\ln\left[\frac{1-r\chi% (\mathrm{x}_{u-1})\psi(\mathrm{y}_{v})}{1-r\chi(\mathrm{x}_{u-1})\psi(\mathrm{% y}_{v-1})}\cdot\frac{1-r\chi(\mathrm{x}_{u})\psi(\mathrm{y}_{v-1})}{1-r\chi(% \mathrm{x}_{u})\psi(\mathrm{y}_{v})}\right]\\ =r\left[(\chi(\mathrm{x}_{u})-\chi(\mathrm{x}_{u-1}))(\psi(\mathrm{y}_{v})-% \psi(\mathrm{y}_{v-1}))\right]+o(r).\qedstart_ROW start_CELL roman_ln [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ] = roman_ln [ divide start_ARG 1 - italic_r italic_χ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG start_ARG 1 - italic_r italic_χ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG 1 - italic_r italic_χ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG 1 - italic_r italic_χ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) end_ARG ] end_CELL end_ROW start_ROW start_CELL = italic_r [ ( italic_χ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) - italic_χ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) ) ( italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) ] + italic_o ( italic_r ) . italic_∎ end_CELL end_ROW

Combining the r=0𝑟0r=0italic_r = 0 algorithm of the previous section with Lemma 4.13 we arrive at the following numerical method for solving (45), (46).

  • Encode the r𝑟ritalic_r–dependent solution through

    ϕ(xu)=1rχr(xu),ψ(yv)=ψr(yv).formulae-sequenceitalic-ϕsubscriptx𝑢1𝑟superscript𝜒𝑟subscriptx𝑢𝜓subscripty𝑣superscript𝜓𝑟subscripty𝑣\phi(\mathrm{x}_{u})=\frac{1}{r\chi^{r}(\mathrm{x}_{u})},\quad\psi(\mathrm{y}_% {v})=\psi^{r}(\mathrm{y}_{v}).italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG italic_r italic_χ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG , italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) = italic_ψ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) .
  • At r=0𝑟0r=0italic_r = 0, find χ0(xu)superscript𝜒0subscriptx𝑢\chi^{0}(\mathrm{x}_{u})italic_χ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ), 0uk0𝑢𝑘0\leq u\leq k0 ≤ italic_u ≤ italic_k, and ψ0(yv)superscript𝜓0subscripty𝑣\psi^{0}(\mathrm{y}_{v})italic_ψ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ), 0v0𝑣0\leq v\leq\ell0 ≤ italic_v ≤ roman_ℓ, through the iterative proportional fitting algorithm of the previous section.

  • Make steps rr+Δr𝑟𝑟Δ𝑟r\to r+\Delta ritalic_r → italic_r + roman_Δ italic_r with very small ΔrΔ𝑟\Delta rroman_Δ italic_r, by writing

    χr+Δr(xu)=χr(xu)+Δχr(xu),ψr+Δr(yv)=ψr(yv)+Δψr(yv),formulae-sequencesuperscript𝜒𝑟Δ𝑟subscriptx𝑢superscript𝜒𝑟subscriptx𝑢Δsuperscript𝜒𝑟subscriptx𝑢superscript𝜓𝑟Δ𝑟subscripty𝑣superscript𝜓𝑟subscripty𝑣Δsuperscript𝜓𝑟subscripty𝑣\chi^{r+\Delta r}(\mathrm{x}_{u})=\chi^{r}(\mathrm{x}_{u})+\Delta\chi^{r}(% \mathrm{x}_{u}),\qquad\psi^{r+\Delta r}(\mathrm{y}_{v})=\psi^{r}(\mathrm{y}_{v% })+\Delta\psi^{r}(\mathrm{y}_{v}),italic_χ start_POSTSUPERSCRIPT italic_r + roman_Δ italic_r end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) = italic_χ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) + roman_Δ italic_χ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) , italic_ψ start_POSTSUPERSCRIPT italic_r + roman_Δ italic_r end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) = italic_ψ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) + roman_Δ italic_ψ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ,

    assuming that the increments are small, linearizing the equations (45), (46) in terms of the increments, and solving the resulting system of linear equations.

One delicate point is that, as we mentioned in Remark 4.3, for each r𝑟ritalic_r, there is a 3333–dimensional group of symmetries of the solutions, and we need to choose one solution, by manually imposing three conditions (e.g. choosing the values of three variables). Similarly, at r=0𝑟0r=0italic_r = 0, the 3333–dimensional group is generated by shifting χ0superscript𝜒0\chi^{0}italic_χ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT by a constant, shifting ψ0superscript𝜓0\psi^{0}italic_ψ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT by a constant, and simultaneous multiplication of χ0superscript𝜒0\chi^{0}italic_χ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT by a constant together with division of χ0superscript𝜒0\chi^{0}italic_χ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT by the same constant. It is unclear which choice of the three conditions is the best from the numerics perspective. We experimented with some choices and implemented several versions of the above algorithm — this is how our Figure 5 is produced. In all our experiments, the algorithms work well for small r𝑟ritalic_r and start exploding at some point as r𝑟ritalic_r grows (by growing to \infty or hitting one of the zeros of denominators in the equations (45), (46)).

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 5. Numeric solution of the EL equations for k=4𝑘4k=4italic_k = 4, (xu)=(yv)=(0,15,35,45,1)subscriptx𝑢subscripty𝑣01535451(\mathrm{x}_{u})=(\mathrm{y}_{v})=(0,\tfrac{1}{5},\tfrac{3}{5},\tfrac{4}{5},1)( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) = ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) = ( 0 , divide start_ARG 1 end_ARG start_ARG 5 end_ARG , divide start_ARG 3 end_ARG start_ARG 5 end_ARG , divide start_ARG 4 end_ARG start_ARG 5 end_ARG , 1 ), I=(1100111011110111)𝐼matrix1100111011110111I=\begin{pmatrix}1&1&0&0\\ 1&1&1&0\\ 1&1&1&1\\ 0&1&1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ), r=3𝑟3r=3italic_r = 3 in the top panel and r=3𝑟3r=-3italic_r = - 3 in the bottom panel. Pictures obtained by refining to 100×100100100100\times 100100 × 100 grid, and solving trigonometric equations (54), (55) by increasing r𝑟ritalic_r by small steps from r=0𝑟0r=0italic_r = 0.

It would be interesting to implement a similar numeric algorithm for the case of a general, not necessarily convex matrix I𝐼Iitalic_I. For the r=0𝑟0r=0italic_r = 0 case, Theorem 4.12 did not require convexity of I𝐼Iitalic_I; hence, it can still serve as a starting point.

Remark 4.14.

Continuing Remark 4.3, one can transform the functions ϕitalic-ϕ\phiitalic_ϕ and ψ𝜓\psiitalic_ψ into the unit circle and reparameterize them by angles θ𝜃\thetaitalic_θ and ρ𝜌\rhoitalic_ρ. Equations (45), (46) then turn into an alternative trigonometric form:

(54) er(xuxu1)superscript𝑒𝑟subscriptx𝑢subscriptx𝑢1\displaystyle e^{r(\mathrm{x}_{u}-\mathrm{x}_{u-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =v=1[cos(r(θv+ρu1))cos(r(θv1+ρu))cos(r(θv+ρu))cos(r(θv1+ρu1))]Iuv,1uk,formulae-sequenceabsentsuperscriptsubscriptproduct𝑣1superscriptdelimited-[]𝑟subscript𝜃𝑣subscript𝜌𝑢1𝑟subscript𝜃𝑣1subscript𝜌𝑢𝑟subscript𝜃𝑣subscript𝜌𝑢𝑟subscript𝜃𝑣1subscript𝜌𝑢1subscript𝐼𝑢𝑣1𝑢𝑘\displaystyle=\prod_{v=1}^{\ell}\left[\frac{\cos\bigl{(}\sqrt{r}(\theta_{v}+% \rho_{u-1})\bigr{)}\cos\bigl{(}\sqrt{r}(\theta_{v-1}+\rho_{u})\bigr{)}}{\cos% \bigl{(}\sqrt{r}(\theta_{v}+\rho_{u})\bigr{)}\cos\bigl{(}\sqrt{r}(\theta_{v-1}% +\rho_{u-1})\bigr{)}}\right]^{I_{uv}},\quad 1\leq u\leq k,= ∏ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT [ divide start_ARG roman_cos ( square-root start_ARG italic_r end_ARG ( italic_θ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT + italic_ρ start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) ) roman_cos ( square-root start_ARG italic_r end_ARG ( italic_θ start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT + italic_ρ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ) end_ARG start_ARG roman_cos ( square-root start_ARG italic_r end_ARG ( italic_θ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT + italic_ρ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ) roman_cos ( square-root start_ARG italic_r end_ARG ( italic_θ start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT + italic_ρ start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , 1 ≤ italic_u ≤ italic_k ,
(55) er(yvyv1)superscript𝑒𝑟subscripty𝑣subscripty𝑣1\displaystyle e^{r(\mathrm{y}_{v}-\mathrm{y}_{v-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =u=1k[cos(r(θv+ρu1))cos(r(θv1+ρu))cos(r(θv+ρu))cos(r(θv1+ρu1))]Iuv,1v.formulae-sequenceabsentsuperscriptsubscriptproduct𝑢1𝑘superscriptdelimited-[]𝑟subscript𝜃𝑣subscript𝜌𝑢1𝑟subscript𝜃𝑣1subscript𝜌𝑢𝑟subscript𝜃𝑣subscript𝜌𝑢𝑟subscript𝜃𝑣1subscript𝜌𝑢1subscript𝐼𝑢𝑣1𝑣\displaystyle=\prod_{u=1}^{k}\left[\frac{\cos\bigl{(}\sqrt{r}(\theta_{v}+\rho_% {u-1})\bigr{)}\cos\bigl{(}\sqrt{r}(\theta_{v-1}+\rho_{u})\bigr{)}}{\cos\bigl{(% }\sqrt{r}(\theta_{v}+\rho_{u})\bigr{)}\cos\bigl{(}\sqrt{r}(\theta_{v-1}+\rho_{% u-1})\bigr{)}}\right]^{I_{uv}},\quad 1\leq v\leq\ell.= ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT [ divide start_ARG roman_cos ( square-root start_ARG italic_r end_ARG ( italic_θ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT + italic_ρ start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) ) roman_cos ( square-root start_ARG italic_r end_ARG ( italic_θ start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT + italic_ρ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ) end_ARG start_ARG roman_cos ( square-root start_ARG italic_r end_ARG ( italic_θ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT + italic_ρ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ) roman_cos ( square-root start_ARG italic_r end_ARG ( italic_θ start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT + italic_ρ start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , 1 ≤ italic_v ≤ roman_ℓ .

where we rescaled the variables, so that the r0𝑟0r\to 0italic_r → 0 limit (after taking logarithms) becomes transparent using cos(x)=1x22+o(x3)𝑥1superscript𝑥22𝑜superscript𝑥3\cos(x)=1-\frac{x^{2}}{2}+o(x^{3})roman_cos ( italic_x ) = 1 - divide start_ARG italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG + italic_o ( italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ), x0𝑥0x\to 0italic_x → 0.

4.5. Domains with unique solution to algebraic equations

In general, (45),(46) can be polynomial equations of a high degree and might have many solutions. One can hope that in many situations imposing the positivity conditions following from g(x,y)0𝑔𝑥𝑦0g(x,y)\geq 0italic_g ( italic_x , italic_y ) ≥ 0 leads to a unique choice for the solution, but we do not have precise theorems in this direction, see Section 5 for some examples. Instead, in this section we describe a class of arrays I𝐼Iitalic_I, for which (45), (46) have a unique solution modulo Moebius transformations, and this solution can be found by a straightforward exact algorithm.

The idea is to reduce arrays Iuvsubscript𝐼𝑢𝑣I_{uv}italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT to smaller arrays, until we reach 1×1111\times 11 × 1 array I11subscript𝐼11I_{11}italic_I start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT, for which solving (45), (46) is immediate. The reductions are based on two operations, which take a convex k×𝑘k\times\ellitalic_k × roman_ℓ array I𝐼Iitalic_I and output a convex array I~~𝐼\tilde{I}over~ start_ARG italic_I end_ARG of smaller size:

  1. (1)

    If I𝐼Iitalic_I has a unique 1111 in the first row or the last row, then remove this row and output the remaining k×(1)𝑘1k\times(\ell-1)italic_k × ( roman_ℓ - 1 ) array. If I𝐼Iitalic_I has a unique 1111 in the first column or the last column, then remove this column and output the remaining (k1)×𝑘1(k-1)\times\ell( italic_k - 1 ) × roman_ℓ array.

  2. (2)

    If two rows of I𝐼Iitalic_I, v𝑣vitalic_v and v+1𝑣1v+1italic_v + 1 are completely filled with 1111s: Iuv=Iu,v+1=1subscript𝐼𝑢𝑣subscript𝐼𝑢𝑣11I_{uv}=I_{u,v+1}=1italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u , italic_v + 1 end_POSTSUBSCRIPT = 1 for all 1uk1𝑢𝑘1\leq u\leq k1 ≤ italic_u ≤ italic_k, then remove the row v𝑣vitalic_v and output the remaining k×(1)𝑘1k\times(\ell-1)italic_k × ( roman_ℓ - 1 ) array. If two columns of I𝐼Iitalic_I, u𝑢uitalic_u and u+1𝑢1u+1italic_u + 1 are completely filled with 1111s: Iuv=Iu+1,v=1subscript𝐼𝑢𝑣subscript𝐼𝑢1𝑣1I_{uv}=I_{u+1,v}=1italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = italic_I start_POSTSUBSCRIPT italic_u + 1 , italic_v end_POSTSUBSCRIPT = 1 for all 1v1𝑣1\leq v\leq\ell1 ≤ italic_v ≤ roman_ℓ, then remove the column u+1𝑢1u+1italic_u + 1 and output the remaining (k1)×𝑘1(k-1)\times\ell( italic_k - 1 ) × roman_ℓ array.

Definition 4.15.

A convex (in the sense of Definition 2.1) k×𝑘k\times\ellitalic_k × roman_ℓ array I𝐼Iitalic_I is called simple, if it can be reduced by a sequence of the above operations to the 1×1111\times 11 × 1 array I11=1subscript𝐼111I_{11}=1italic_I start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT = 1.

For 2×2222\times 22 × 2 case, all possible arrays (with at least one 1111 in each row and column) are both convex and simple. For 2×3232\times 32 × 3 case, there are several non-convex arrays, but all convex arrays are simple. For 3×3333\times 33 × 3 case some convex arrays are simple and some are not. For example, (001111110)matrix001111110\begin{pmatrix}0&0&1\\ 1&1&1\\ 1&1&0\end{pmatrix}( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) and (111111110)matrix111111110\begin{pmatrix}1&1&1\\ 1&1&1\\ 1&1&0\end{pmatrix}( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) are simple, while (011111110)matrix011111110\begin{pmatrix}0&1&1\\ 1&1&1\\ 1&1&0\end{pmatrix}( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) is not.

Theorem 4.16.

Fix r0𝑟0r\neq 0italic_r ≠ 0, a simple convex k×𝑘k\times\ellitalic_k × roman_ℓ array I𝐼Iitalic_I, and any sequences of numbers x0<x1<<xksubscriptx0subscriptx1subscriptx𝑘\mathrm{x}_{0}<\mathrm{x}_{1}<\dots<\mathrm{x}_{k}roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, y0<y1<<ysubscripty0subscripty1subscripty\mathrm{y}_{0}<\mathrm{y}_{1}<\dots<\mathrm{y}_{\ell}roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT, such that xkx0=yy0subscriptx𝑘subscriptx0subscriptysubscripty0\mathrm{x}_{k}-\mathrm{x}_{0}=\mathrm{y}_{\ell}-\mathrm{y}_{0}roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and the data is non-degenerate in the sense of Definition 3.5. Then the solution ϕ(xu)italic-ϕsubscriptx𝑢\mathrm{\phi}(\mathrm{x}_{u})italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ), 0uk0𝑢𝑘0\leq u\leq k0 ≤ italic_u ≤ italic_k, ψ(yv))\mathrm{\psi}(\mathrm{y}_{v}))italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ), 0v0𝑣0\leq v\leq\ell0 ≤ italic_v ≤ roman_ℓ, to equations (45), (46) is unique up to Moebius transformations555I.e. any two solutions can be obtained from each other by ϕ(x)αϕ(x)+βγϕ(x)+δmaps-toitalic-ϕ𝑥𝛼italic-ϕ𝑥𝛽𝛾italic-ϕ𝑥𝛿\phi(x)\mapsto\frac{\alpha\phi(x)+\beta}{\gamma\phi(x)+\delta}italic_ϕ ( italic_x ) ↦ divide start_ARG italic_α italic_ϕ ( italic_x ) + italic_β end_ARG start_ARG italic_γ italic_ϕ ( italic_x ) + italic_δ end_ARG, ψ(y)αψ(y)+βγψ(y)+δmaps-to𝜓𝑦𝛼𝜓𝑦𝛽𝛾𝜓𝑦𝛿\psi(y)\mapsto\frac{\alpha\psi(y)+\beta}{\gamma\psi(y)+\delta}italic_ψ ( italic_y ) ↦ divide start_ARG italic_α italic_ψ ( italic_y ) + italic_β end_ARG start_ARG italic_γ italic_ψ ( italic_y ) + italic_δ end_ARG. and can be obtained by the recursive algorithm in the following proof.

Proof.

The argument is by induction in k+𝑘k+\ellitalic_k + roman_ℓ. For the base k==1𝑘1k=\ell=1italic_k = roman_ℓ = 1, both equations (45), (46) are the same and read

(56) er(x1x0)=ψ(y1)ϕ(x0)ψ(y0)ϕ(x0)ψ(y0)ϕ(x1)ψ(y1)ϕ(x1).superscript𝑒𝑟subscriptx1subscriptx0𝜓subscripty1italic-ϕsubscriptx0𝜓subscripty0italic-ϕsubscriptx0𝜓subscripty0italic-ϕsubscriptx1𝜓subscripty1italic-ϕsubscriptx1e^{r(\mathrm{x}_{1}-\mathrm{x}_{0})}=\frac{\psi(\mathrm{y}_{1})-\phi(\mathrm{x% }_{0})}{\psi(\mathrm{y}_{0})-\phi(\mathrm{x}_{0})}\cdot\frac{\psi(\mathrm{y}_{% 0})-\phi(\mathrm{x}_{1})}{\psi(\mathrm{y}_{1})-\phi(\mathrm{x}_{1})}.italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT = divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG .

Note that the four variables involved, ϕ(x0)italic-ϕsubscriptx0\phi(\mathrm{x}_{0})italic_ϕ ( roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), ϕ(x1)italic-ϕsubscriptx1\phi(\mathrm{x}_{1})italic_ϕ ( roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ), ψ(y0)𝜓subscripty0\psi(\mathrm{y}_{0})italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), ψ(y1)𝜓subscripty1\psi(\mathrm{y}_{1})italic_ψ ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ), should be all distinct in order for (56) to hold. We can apply a Moebius transformation to map three of them to any three numbers in {}\mathbb{R}\cup\{\infty\}blackboard_R ∪ { ∞ }; since the cross ratio is invariant under Moebius transformations, (56) is not going to change. Say, let us map (ϕ(x0),ψ(y0),ψ(y1))(0,,1)maps-toitalic-ϕsubscriptx0𝜓subscripty0𝜓subscripty101(\phi(\mathrm{x}_{0}),\psi(\mathrm{y}_{0}),\psi(\mathrm{y}_{1}))\mapsto(0,% \infty,1)( italic_ϕ ( roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , italic_ψ ( roman_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) ↦ ( 0 , ∞ , 1 ). Then the equation turns into

er(x1x0)=11ϕ(x1),superscript𝑒𝑟subscriptx1subscriptx011italic-ϕsubscriptx1e^{r(\mathrm{x}_{1}-\mathrm{x}_{0})}=\frac{1}{1-\phi(\mathrm{x}_{1})},italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG 1 - italic_ϕ ( roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG ,

which clearly has a unique solution ϕ(x1)italic-ϕsubscriptx1\phi(\mathrm{x}_{1})\in\mathbb{R}italic_ϕ ( roman_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ∈ blackboard_R.

For the induction step, we explore the two operations in the definition of the simple array. For the first operation, assume without loss of generality that I𝐼Iitalic_I has a unique non-zero element in the last column, at position Ikv^=1subscript𝐼𝑘^𝑣1I_{k\hat{v}}=1italic_I start_POSTSUBSCRIPT italic_k over^ start_ARG italic_v end_ARG end_POSTSUBSCRIPT = 1 and I~~𝐼\tilde{I}over~ start_ARG italic_I end_ARG is obtained from I𝐼Iitalic_I by removing that column.

The last, u=k𝑢𝑘u=kitalic_u = italic_k, equation (45) reads

(57) er(xkxk1)=ψ(yu^)ϕ(xk1)ψ(yu^1)ϕ(xk1)ψ(yu^1)ϕ(xk)ψ(yu^)ϕ(xk).superscript𝑒𝑟subscriptx𝑘subscriptx𝑘1𝜓subscripty^𝑢italic-ϕsubscriptx𝑘1𝜓subscripty^𝑢1italic-ϕsubscriptx𝑘1𝜓subscripty^𝑢1italic-ϕsubscriptx𝑘𝜓subscripty^𝑢italic-ϕsubscriptx𝑘e^{r(\mathrm{x}_{k}-\mathrm{x}_{k-1})}=\frac{\psi(\mathrm{y}_{\hat{u}})-\phi(% \mathrm{x}_{k-1})}{\psi(\mathrm{y}_{\hat{u}-1})-\phi(\mathrm{x}_{k-1})}\cdot% \frac{\psi(\mathrm{y}_{\hat{u}-1})-\phi(\mathrm{x}_{k})}{\psi(\mathrm{y}_{\hat% {u}})-\phi(\mathrm{x}_{k})}.italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT = divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG .

We see that once ϕ(xk1)italic-ϕsubscriptx𝑘1\phi(\mathrm{x}_{k-1})italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ), ψ(yu^)𝜓subscripty^𝑢\psi(\mathrm{y}_{\hat{u}})italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ), ψ(yu^1)𝜓subscripty^𝑢1\psi(\mathrm{y}_{\hat{u}-1})italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) are determined, there is a unique ϕ(xk)italic-ϕsubscriptx𝑘\phi(\mathrm{x}_{k})italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) satisfying (57). On the other hand, the k++1𝑘1k+\ell+1italic_k + roman_ℓ + 1 variables ϕ(xu)italic-ϕsubscriptx𝑢\mathrm{\phi}(\mathrm{x}_{u})italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ), 0uk10𝑢𝑘10\leq u\leq k-10 ≤ italic_u ≤ italic_k - 1, ψ(yv))\mathrm{\psi}(\mathrm{y}_{v}))italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ), 0v0𝑣0\leq v\leq\ell0 ≤ italic_v ≤ roman_ℓ, satisfy a system of k+1𝑘1k+\ell-1italic_k + roman_ℓ - 1 equations:

er(xuxu1)superscript𝑒𝑟subscriptx𝑢subscriptx𝑢1\displaystyle e^{r(\mathrm{x}_{u}-\mathrm{x}_{u-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =v=1[ψ(yv)ϕ(xu1)ψ(yv1)ϕ(xu1)ψ(yv1)ϕ(xu)ψ(yv)ϕ(xu)]Iuv,1uk1,formulae-sequenceabsentsuperscriptsubscriptproduct𝑣1superscriptdelimited-[]𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢subscript𝐼𝑢𝑣1𝑢𝑘1\displaystyle=\prod_{v=1}^{\ell}\left[\frac{\psi(\mathrm{y}_{v})-\phi(\mathrm{% x}_{u-1})}{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u-1})}\cdot\frac{\psi(% \mathrm{y}_{v-1})-\phi(\mathrm{x}_{u})}{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{% u})}\right]^{I_{uv}},\quad 1\leq u\leq k-1,= ∏ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , 1 ≤ italic_u ≤ italic_k - 1 ,
er(yvyv1)superscript𝑒𝑟subscripty𝑣subscripty𝑣1\displaystyle e^{r(\mathrm{y}_{v}-\mathrm{y}_{v-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =u=1k1[ψ(yv)ϕ(xu1)ψ(yv1)ϕ(xu1)ψ(yv1)ϕ(xu)ψ(yv)ϕ(xu)]Iuv,1v,vv^,formulae-sequenceformulae-sequenceabsentsuperscriptsubscriptproduct𝑢1𝑘1superscriptdelimited-[]𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢subscript𝐼𝑢𝑣1𝑣𝑣^𝑣\displaystyle=\prod_{u=1}^{k-1}\left[\frac{\psi(\mathrm{y}_{v})-\phi(\mathrm{x% }_{u-1})}{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u-1})}\cdot\frac{\psi(% \mathrm{y}_{v-1})-\phi(\mathrm{x}_{u})}{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{% u})}\right]^{I_{uv}},\quad 1\leq v\leq\ell,\quad v\neq\hat{v},= ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , 1 ≤ italic_v ≤ roman_ℓ , italic_v ≠ over^ start_ARG italic_v end_ARG ,
er(yv^yv^1)er(xkxk1)superscript𝑒𝑟subscripty^𝑣subscripty^𝑣1superscript𝑒𝑟subscriptx𝑘subscriptx𝑘1\displaystyle e^{r(\mathrm{y}_{\hat{v}}-\mathrm{y}_{\hat{v}-1})}e^{-r(\mathrm{% x}_{k}-\mathrm{x}_{k-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - italic_r ( roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =u=1k1[ψ(yv^)ϕ(xu1)ψ(yv^1)ϕ(xu1)ψ(yv^1)ϕ(xu)ψ(yv^)ϕ(xu)]Iuv^,absentsuperscriptsubscriptproduct𝑢1𝑘1superscriptdelimited-[]𝜓subscripty^𝑣italic-ϕsubscriptx𝑢1𝜓subscripty^𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty^𝑣1italic-ϕsubscriptx𝑢𝜓subscripty^𝑣italic-ϕsubscriptx𝑢subscript𝐼𝑢^𝑣\displaystyle=\prod_{u=1}^{k-1}\left[\frac{\psi(\mathrm{y}_{\hat{v}})-\phi(% \mathrm{x}_{u-1})}{\psi(\mathrm{y}_{\hat{v}-1})-\phi(\mathrm{x}_{u-1})}\cdot% \frac{\psi(\mathrm{y}_{\hat{v}-1})-\phi(\mathrm{x}_{u})}{\psi(\mathrm{y}_{\hat% {v}})-\phi(\mathrm{x}_{u})}\right]^{I_{u\hat{v}}},= ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u over^ start_ARG italic_v end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ,

where we used (57) to get the last equation. The last system of equations is again of the form (45),(46) for the (k1)×𝑘1(k-1)\times\ell( italic_k - 1 ) × roman_ℓ array I~~𝐼\tilde{I}over~ start_ARG italic_I end_ARG, and for the difference yv^yv^1subscript𝑦^𝑣subscripty^𝑣1y_{\hat{v}}-\mathrm{y}_{\hat{v}-1}italic_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG - 1 end_POSTSUBSCRIPT replaced with (yv^yv^1)(xkxk1)subscript𝑦^𝑣subscripty^𝑣1subscriptx𝑘subscriptx𝑘1(y_{\hat{v}}-\mathrm{y}_{\hat{v}-1})-(\mathrm{x}_{k}-\mathrm{x}_{k-1})( italic_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG - 1 end_POSTSUBSCRIPT ) - ( roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ). (Notice that (yv^yv^1)(xkxk1)>0subscript𝑦^𝑣subscripty^𝑣1subscriptx𝑘subscriptx𝑘10(y_{\hat{v}}-\mathrm{y}_{\hat{v}-1})-(\mathrm{x}_{k}-\mathrm{x}_{k-1})>0( italic_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG - 1 end_POSTSUBSCRIPT ) - ( roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ) > 0, because of the non-degeneracy of the data, as in Definition 3.5.) Hence, by the induction assumption, this system has a unique (up to Moebius transformations) solution, and then supplementing with (57) so does the original system (45), (46).

For the second operation, assume without loss of generality that the columns u^^𝑢\hat{u}over^ start_ARG italic_u end_ARG and u^+1^𝑢1\hat{u}+1over^ start_ARG italic_u end_ARG + 1 of I𝐼Iitalic_I are completely filled with 1s and let I~~𝐼\tilde{I}over~ start_ARG italic_I end_ARG be obtained from I𝐼Iitalic_I by removing the column u^+1^𝑢1\hat{u}+1over^ start_ARG italic_u end_ARG + 1. The equations (45) for u=u^𝑢^𝑢u=\hat{u}italic_u = over^ start_ARG italic_u end_ARG and u=u^+1𝑢^𝑢1u=\hat{u}+1italic_u = over^ start_ARG italic_u end_ARG + 1 read after cancellations:

er(xu^xu^1)superscript𝑒𝑟subscriptx^𝑢subscriptx^𝑢1\displaystyle e^{r(\mathrm{x}_{\hat{u}}-\mathrm{x}_{\hat{u}-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =ψ(y)ϕ(xu^1)ψ(y0)ϕ(xu^1)ψ(y0)ϕ(xu^)ψ(y)ϕ(xu^),absent𝜓subscriptyitalic-ϕsubscriptx^𝑢1𝜓subscripty0italic-ϕsubscriptx^𝑢1𝜓subscripty0italic-ϕsubscriptx^𝑢𝜓subscriptyitalic-ϕsubscriptx^𝑢\displaystyle=\frac{\psi(\mathrm{y}_{\ell})-\phi(\mathrm{x}_{\hat{u}-1})}{\psi% (\mathrm{y}_{0})-\phi(\mathrm{x}_{\hat{u}-1})}\cdot\frac{\psi(\mathrm{y}_{0})-% \phi(\mathrm{x}_{\hat{u}})}{\psi(\mathrm{y}_{\ell})-\phi(\mathrm{x}_{\hat{u}})},= divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) end_ARG ,
er(xu^+1xu^)superscript𝑒𝑟subscriptx^𝑢1subscriptx^𝑢\displaystyle e^{r(\mathrm{x}_{\hat{u}+1}-\mathrm{x}_{\hat{u}})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG + 1 end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =ψ(y)ϕ(xu^)ψ(y0)ϕ(xu^)ψ(y0)ϕ(xu^+1)ψ(y)ϕ(xu^+1).absent𝜓subscriptyitalic-ϕsubscriptx^𝑢𝜓subscripty0italic-ϕsubscriptx^𝑢𝜓subscripty0italic-ϕsubscriptx^𝑢1𝜓subscriptyitalic-ϕsubscriptx^𝑢1\displaystyle=\frac{\psi(\mathrm{y}_{\ell})-\phi(\mathrm{x}_{\hat{u}})}{\psi(% \mathrm{y}_{0})-\phi(\mathrm{x}_{\hat{u}})}\cdot\frac{\psi(\mathrm{y}_{0})-% \phi(\mathrm{x}_{\hat{u}+1})}{\psi(\mathrm{y}_{\ell})-\phi(\mathrm{x}_{\hat{u}% +1})}.= divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG + 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG + 1 end_POSTSUBSCRIPT ) end_ARG .

Let us replace the last two equations by the first one and their product, i.e. by

(58) er(xu^xu^1)superscript𝑒𝑟subscriptx^𝑢subscriptx^𝑢1\displaystyle e^{r(\mathrm{x}_{\hat{u}}-\mathrm{x}_{\hat{u}-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =ψ(y)ϕ(xu^1)ψ(y0)ϕ(xu^1)ψ(y0)ϕ(xu^)ψ(y)ϕ(xu^),absent𝜓subscriptyitalic-ϕsubscriptx^𝑢1𝜓subscripty0italic-ϕsubscriptx^𝑢1𝜓subscripty0italic-ϕsubscriptx^𝑢𝜓subscriptyitalic-ϕsubscriptx^𝑢\displaystyle=\frac{\psi(\mathrm{y}_{\ell})-\phi(\mathrm{x}_{\hat{u}-1})}{\psi% (\mathrm{y}_{0})-\phi(\mathrm{x}_{\hat{u}-1})}\cdot\frac{\psi(\mathrm{y}_{0})-% \phi(\mathrm{x}_{\hat{u}})}{\psi(\mathrm{y}_{\ell})-\phi(\mathrm{x}_{\hat{u}})},= divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) end_ARG ,
(59) er(xu^+1xu^1)superscript𝑒𝑟subscriptx^𝑢1subscriptx^𝑢1\displaystyle e^{r(\mathrm{x}_{\hat{u}+1}-\mathrm{x}_{\hat{u}-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG + 1 end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =ψ(y)ϕ(xu^1)ψ(y0)ϕ(xu^1)ψ(y0)ϕ(xu^+1)ψ(y)ϕ(xu^+1).absent𝜓subscriptyitalic-ϕsubscriptx^𝑢1𝜓subscripty0italic-ϕsubscriptx^𝑢1𝜓subscripty0italic-ϕsubscriptx^𝑢1𝜓subscriptyitalic-ϕsubscriptx^𝑢1\displaystyle=\frac{\psi(\mathrm{y}_{\ell})-\phi(\mathrm{x}_{\hat{u}-1})}{\psi% (\mathrm{y}_{0})-\phi(\mathrm{x}_{\hat{u}-1})}\cdot\frac{\psi(\mathrm{y}_{0})-% \phi(\mathrm{x}_{\hat{u}+1})}{\psi(\mathrm{y}_{\ell})-\phi(\mathrm{x}_{\hat{u}% +1})}.= divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG + 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG + 1 end_POSTSUBSCRIPT ) end_ARG .

Once ϕ(xu^1)italic-ϕsubscriptx^𝑢1\phi(\mathrm{x}_{\hat{u}-1})italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ), ψ(y0)𝜓subscripty0\psi(\mathrm{y}_{0})italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), ψ(y)𝜓subscripty\psi(\mathrm{y}_{\ell})italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) are determined, the equation (58) uniquely determines ϕ(xu^)italic-ϕsubscriptx^𝑢\phi(\mathrm{x}_{\hat{u}})italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ). On the other hand, k++1𝑘1k+\ell+1italic_k + roman_ℓ + 1 variables ϕ(x0),,ϕ(xu^1),ϕ(xu^+1),,ϕ(xk)italic-ϕsubscriptx0italic-ϕsubscriptx^𝑢1italic-ϕsubscriptx^𝑢1italic-ϕsubscriptx𝑘\phi(\mathrm{x}_{0}),\dots,\phi(\mathrm{x}_{\hat{u}-1}),\phi(\mathrm{x}_{\hat{% u}+1}),\dots,\phi(\mathrm{x}_{k})italic_ϕ ( roman_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , … , italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG - 1 end_POSTSUBSCRIPT ) , italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG + 1 end_POSTSUBSCRIPT ) , … , italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ); ψ(y0),,ψ(y)𝜓subscripty0𝜓subscripty\psi(\mathrm{y}_{0}),\dots,\psi(\mathrm{y}_{\ell})italic_ψ ( roman_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , … , italic_ψ ( roman_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) satisfy a system of k+1𝑘1k+\ell-1italic_k + roman_ℓ - 1 equations, given by (59) and

(60) er(xuxu1)superscript𝑒𝑟subscriptx𝑢subscriptx𝑢1\displaystyle e^{r(\mathrm{x}_{u}-\mathrm{x}_{u-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =v=1[ψ(yv)ϕ(xu1)ψ(yv1)ϕ(xu1)ψ(yv1)ϕ(xu)ψ(yv)ϕ(xu)]Iuv,1uu^1 or u^+2uk,formulae-sequenceabsentsuperscriptsubscriptproduct𝑣1superscriptdelimited-[]𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢subscript𝐼𝑢𝑣1𝑢^𝑢1 or ^𝑢2𝑢𝑘\displaystyle=\prod_{v=1}^{\ell}\left[\frac{\psi(\mathrm{y}_{v})-\phi(\mathrm{% x}_{u-1})}{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u-1})}\cdot\frac{\psi(% \mathrm{y}_{v-1})-\phi(\mathrm{x}_{u})}{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{% u})}\right]^{I_{uv}},\quad 1\leq u\leq\hat{u}-1\text{ or }\hat{u}+2\leq u\leq k,= ∏ start_POSTSUBSCRIPT italic_v = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , 1 ≤ italic_u ≤ over^ start_ARG italic_u end_ARG - 1 or over^ start_ARG italic_u end_ARG + 2 ≤ italic_u ≤ italic_k ,
(61) er(yvyv1)superscript𝑒𝑟subscripty𝑣subscripty𝑣1\displaystyle e^{r(\mathrm{y}_{v}-\mathrm{y}_{v-1})}italic_e start_POSTSUPERSCRIPT italic_r ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT =u=1k[ψ(yv)ϕ(xu1)ψ(yv1)ϕ(xu1)ψ(yv1)ϕ(xu)ψ(yv)ϕ(xu)]Iuv,1v.formulae-sequenceabsentsuperscriptsubscriptproduct𝑢1𝑘superscriptdelimited-[]𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢subscript𝐼𝑢𝑣1𝑣\displaystyle=\prod_{u=1}^{k}\left[\frac{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_% {u-1})}{\psi(\mathrm{y}_{v-1})-\phi(\mathrm{x}_{u-1})}\cdot\frac{\psi(\mathrm{% y}_{v-1})-\phi(\mathrm{x}_{u})}{\psi(\mathrm{y}_{v})-\phi(\mathrm{x}_{u})}% \right]^{I_{uv}},\quad 1\leq v\leq\ell.= ∏ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT [ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) end_ARG ] start_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , 1 ≤ italic_v ≤ roman_ℓ .

Note that there is no ϕ(xu^)italic-ϕsubscriptx^𝑢\phi(\mathrm{x}_{\hat{u}})italic_ϕ ( roman_x start_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG end_POSTSUBSCRIPT ) in the last equation, as it cancels between u=u^𝑢^𝑢u=\hat{u}italic_u = over^ start_ARG italic_u end_ARG and u=u^+1𝑢^𝑢1u=\hat{u}+1italic_u = over^ start_ARG italic_u end_ARG + 1 factors. We conclude that the system (59),(60),(61) is exactly of the same form as (45), (46) for (k1)×𝑘1(k-1)\times\ell( italic_k - 1 ) × roman_ℓ matrix I~~𝐼\tilde{I}over~ start_ARG italic_I end_ARG. Hence, the induction hypothesis applies and this system has a unique solution up to Moebius transformations. ∎

Let us demonstrate how the algorithm works for the case k==2𝑘2k=\ell=2italic_k = roman_ℓ = 2, x=(0,a,1)x0𝑎1\mathrm{x}=(0,a,1)roman_x = ( 0 , italic_a , 1 ), y=(0,b,1)y0𝑏1\mathrm{y}=(0,b,1)roman_y = ( 0 , italic_b , 1 ), I=(1011)𝐼matrix1011I=\begin{pmatrix}1&0\\ 1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ). The inductive reductions are I(11)(1)𝐼matrix11matrix1I\to\begin{pmatrix}1\\ 1\end{pmatrix}\to\begin{pmatrix}1\end{pmatrix}italic_I → ( start_ARG start_ROW start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW end_ARG ) → ( start_ARG start_ROW start_CELL 1 end_CELL end_ROW end_ARG ). On the first step we remove the last column of I𝐼Iitalic_I and record the equation for determining ϕ(1)italic-ϕ1\phi(1)italic_ϕ ( 1 ):

(62) er(1a)=ψ(b)ϕ(a)ψ(0)ϕ(a)ψ(0)ϕ(1)ψ(b)ϕ(1).superscript𝑒𝑟1𝑎𝜓𝑏italic-ϕ𝑎𝜓0italic-ϕ𝑎𝜓0italic-ϕ1𝜓𝑏italic-ϕ1e^{r(1-a)}=\frac{\psi(b)-\phi(a)}{\psi(0)-\phi(a)}\cdot\frac{\psi(0)-\phi(1)}{% \psi(b)-\phi(1)}.italic_e start_POSTSUPERSCRIPT italic_r ( 1 - italic_a ) end_POSTSUPERSCRIPT = divide start_ARG italic_ψ ( italic_b ) - italic_ϕ ( italic_a ) end_ARG start_ARG italic_ψ ( 0 ) - italic_ϕ ( italic_a ) end_ARG ⋅ divide start_ARG italic_ψ ( 0 ) - italic_ϕ ( 1 ) end_ARG start_ARG italic_ψ ( italic_b ) - italic_ϕ ( 1 ) end_ARG .

On the second step, we remove the top row of (11)matrix11\begin{pmatrix}1\\ 1\end{pmatrix}( start_ARG start_ROW start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW end_ARG ) array — we can use either of the two operations for that, say, we use the first one again. We get an equation for determining ψ(1)𝜓1\psi(1)italic_ψ ( 1 ):

(63) er(1b)=ψ(1)ϕ(0)ψ(b)ϕ(0)ψ(b)ϕ(a)ψ(1)ϕ(a).superscript𝑒𝑟1𝑏𝜓1italic-ϕ0𝜓𝑏italic-ϕ0𝜓𝑏italic-ϕ𝑎𝜓1italic-ϕ𝑎e^{r(1-b)}=\frac{\psi(1)-\phi(0)}{\psi(b)-\phi(0)}\cdot\frac{\psi(b)-\phi(a)}{% \psi(1)-\phi(a)}.italic_e start_POSTSUPERSCRIPT italic_r ( 1 - italic_b ) end_POSTSUPERSCRIPT = divide start_ARG italic_ψ ( 1 ) - italic_ϕ ( 0 ) end_ARG start_ARG italic_ψ ( italic_b ) - italic_ϕ ( 0 ) end_ARG ⋅ divide start_ARG italic_ψ ( italic_b ) - italic_ϕ ( italic_a ) end_ARG start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a ) end_ARG .

The final equation for the 1×1111\times 11 × 1 matrix I𝐼Iitalic_I is

(64) er(a+b1)=ψ(b)ϕ(0)ψ(0)ϕ(0)ψ(0)ϕ(a)ψ(b)ϕ(a).superscript𝑒𝑟𝑎𝑏1𝜓𝑏italic-ϕ0𝜓0italic-ϕ0𝜓0italic-ϕ𝑎𝜓𝑏italic-ϕ𝑎e^{r(a+b-1)}=\frac{\psi(b)-\phi(0)}{\psi(0)-\phi(0)}\cdot\frac{\psi(0)-\phi(a)% }{\psi(b)-\phi(a)}.italic_e start_POSTSUPERSCRIPT italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT = divide start_ARG italic_ψ ( italic_b ) - italic_ϕ ( 0 ) end_ARG start_ARG italic_ψ ( 0 ) - italic_ϕ ( 0 ) end_ARG ⋅ divide start_ARG italic_ψ ( 0 ) - italic_ϕ ( italic_a ) end_ARG start_ARG italic_ψ ( italic_b ) - italic_ϕ ( italic_a ) end_ARG .

Note that it is necessary to have a+b>1𝑎𝑏1a+b>1italic_a + italic_b > 1 for the data to be non-degenerate in the sense of Definition 3.5. Due to Moebius invariance, we can fix three parameters in an arbitrary way and we choose ϕ(0)=0italic-ϕ00\phi(0)=0italic_ϕ ( 0 ) = 0, ψ(0)=𝜓0\psi(0)=\inftyitalic_ψ ( 0 ) = ∞, ψ(b)=1𝜓𝑏1\psi(b)=1italic_ψ ( italic_b ) = 1. Then (64) turns into

(65) er(a+b1)=11ϕ(a)ϕ(a)=1er(a+b1).formulae-sequencesuperscript𝑒𝑟𝑎𝑏111italic-ϕ𝑎italic-ϕ𝑎1superscript𝑒𝑟𝑎𝑏1e^{r(a+b-1)}=\frac{1}{1-\phi(a)}\quad\Longleftrightarrow\quad\phi(a)=1-e^{-r(a% +b-1)}.italic_e start_POSTSUPERSCRIPT italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG 1 - italic_ϕ ( italic_a ) end_ARG ⟺ italic_ϕ ( italic_a ) = 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT .

Through (62), (63) we compute:

(66) ϕ(1)=1erb,ψ(1)=1er(a+b1)1era.formulae-sequenceitalic-ϕ11superscript𝑒𝑟𝑏𝜓11superscript𝑒𝑟𝑎𝑏11superscript𝑒𝑟𝑎\phi(1)=1-e^{-rb},\qquad\psi(1)=\frac{1-e^{-r(a+b-1)}}{1-e^{-ra}}.italic_ϕ ( 1 ) = 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_b end_POSTSUPERSCRIPT , italic_ψ ( 1 ) = divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_a end_POSTSUPERSCRIPT end_ARG .

Further, we reconstruct ϕ(x)italic-ϕ𝑥\phi(x)italic_ϕ ( italic_x ) and ψ(y)𝜓𝑦\psi(y)italic_ψ ( italic_y ) in all other points through equations (47):

erx=1er(a+b1)1era1er(a+b1)1eraϕ(x)superscript𝑒𝑟𝑥1superscript𝑒𝑟𝑎𝑏11superscript𝑒𝑟𝑎1superscript𝑒𝑟𝑎𝑏11superscript𝑒𝑟𝑎italic-ϕ𝑥\displaystyle e^{rx}=\frac{\frac{1-e^{-r(a+b-1)}}{1-e^{-ra}}}{\frac{1-e^{-r(a+% b-1)}}{1-e^{-ra}}-\phi(x)}\quaditalic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT = divide start_ARG divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_a end_POSTSUPERSCRIPT end_ARG end_ARG start_ARG divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_a end_POSTSUPERSCRIPT end_ARG - italic_ϕ ( italic_x ) end_ARG ϕ(x)=1er(a+b1)1era(1erx),x[0,a],formulae-sequenceitalic-ϕ𝑥1superscript𝑒𝑟𝑎𝑏11superscript𝑒𝑟𝑎1superscript𝑒𝑟𝑥𝑥0𝑎\displaystyle\Longleftrightarrow\quad\phi(x)=\frac{1-e^{-r(a+b-1)}}{1-e^{-ra}}% (1-e^{-rx}),\qquad x\in[0,a],⟺ italic_ϕ ( italic_x ) = divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_a end_POSTSUPERSCRIPT end_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_x end_POSTSUPERSCRIPT ) , italic_x ∈ [ 0 , italic_a ] ,
er(xa)=er(a+b1)1ϕ(x)superscript𝑒𝑟𝑥𝑎superscript𝑒𝑟𝑎𝑏11italic-ϕ𝑥\displaystyle e^{r(x-a)}=\frac{e^{-r(a+b-1)}}{1-\phi(x)}\quaditalic_e start_POSTSUPERSCRIPT italic_r ( italic_x - italic_a ) end_POSTSUPERSCRIPT = divide start_ARG italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_ϕ ( italic_x ) end_ARG ϕ(x)=1er(x+b1),x[a,1],formulae-sequenceitalic-ϕ𝑥1superscript𝑒𝑟𝑥𝑏1𝑥𝑎1\displaystyle\Longleftrightarrow\quad\phi(x)=1-e^{-r(x+b-1)},\qquad x\in[a,1],⟺ italic_ϕ ( italic_x ) = 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_x + italic_b - 1 ) end_POSTSUPERSCRIPT , italic_x ∈ [ italic_a , 1 ] ,
ery=ψ(y)ψ(y)1+erbsuperscript𝑒𝑟𝑦𝜓𝑦𝜓𝑦1superscript𝑒𝑟𝑏\displaystyle e^{ry}=\frac{\psi(y)}{\psi(y)-1+e^{-rb}}\quaditalic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT = divide start_ARG italic_ψ ( italic_y ) end_ARG start_ARG italic_ψ ( italic_y ) - 1 + italic_e start_POSTSUPERSCRIPT - italic_r italic_b end_POSTSUPERSCRIPT end_ARG ψ(y)=1erb1ery,y[0,b],formulae-sequence𝜓𝑦1superscript𝑒𝑟𝑏1superscript𝑒𝑟𝑦𝑦0𝑏\displaystyle\Longleftrightarrow\quad\psi(y)=\frac{1-e^{-rb}}{1-e^{-ry}},% \qquad y\in[0,b],⟺ italic_ψ ( italic_y ) = divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_b end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_y end_POSTSUPERSCRIPT end_ARG , italic_y ∈ [ 0 , italic_b ] ,
er(yb)=ψ(y)er(a+b1)ψ(y)1+er(a+b1)superscript𝑒𝑟𝑦𝑏𝜓𝑦superscript𝑒𝑟𝑎𝑏1𝜓𝑦1superscript𝑒𝑟𝑎𝑏1\displaystyle e^{r(y-b)}=\frac{\psi(y)e^{-r(a+b-1)}}{\psi(y)-1+e^{-r(a+b-1)}}\quaditalic_e start_POSTSUPERSCRIPT italic_r ( italic_y - italic_b ) end_POSTSUPERSCRIPT = divide start_ARG italic_ψ ( italic_y ) italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_ψ ( italic_y ) - 1 + italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT end_ARG ψ(y)=1er(a+b1)1er(a+y1),y[b,1].formulae-sequence𝜓𝑦1superscript𝑒𝑟𝑎𝑏11superscript𝑒𝑟𝑎𝑦1𝑦𝑏1\displaystyle\Longleftrightarrow\quad\psi(y)=\frac{1-e^{-r(a+b-1)}}{1-e^{-r(a+% y-1)}},\qquad y\in[b,1].⟺ italic_ψ ( italic_y ) = divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_b - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a + italic_y - 1 ) end_POSTSUPERSCRIPT end_ARG , italic_y ∈ [ italic_b , 1 ] .

It remains to plug the last formulas into g(x,y)=1rxϕ(x)yψ(y)[ϕ(x)ψ(y)]2𝑔𝑥𝑦1𝑟𝑥italic-ϕ𝑥𝑦𝜓𝑦superscriptdelimited-[]italic-ϕ𝑥𝜓𝑦2g(x,y)=-\frac{1}{r}\cdot\frac{\frac{\partial}{\partial x}\phi(x)\frac{\partial% }{\partial y}\psi(y)}{[\phi(x)-\psi(y)]^{2}}italic_g ( italic_x , italic_y ) = - divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ⋅ divide start_ARG divide start_ARG ∂ end_ARG start_ARG ∂ italic_x end_ARG italic_ϕ ( italic_x ) divide start_ARG ∂ end_ARG start_ARG ∂ italic_y end_ARG italic_ψ ( italic_y ) end_ARG start_ARG [ italic_ϕ ( italic_x ) - italic_ψ ( italic_y ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG for (x,y)(0,a)×(0,b)(a,1)×(0,b)(0,a)×(b,1)𝑥𝑦0𝑎0𝑏𝑎10𝑏0𝑎𝑏1(x,y)\in(0,a)\times(0,b)\cup(a,1)\times(0,b)\cup(0,a)\times(b,1)( italic_x , italic_y ) ∈ ( 0 , italic_a ) × ( 0 , italic_b ) ∪ ( italic_a , 1 ) × ( 0 , italic_b ) ∪ ( 0 , italic_a ) × ( italic_b , 1 ). Figure 6 plots the result.

Refer to caption
Refer to caption
Figure 6. Solution of EL equations for I=(1011)𝐼matrix1011I=\begin{pmatrix}1&0\\ 1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ), x=(0,12,1)x0121\mathrm{x}=(0,\tfrac{1}{2},1)roman_x = ( 0 , divide start_ARG 1 end_ARG start_ARG 2 end_ARG , 1 ), y=(0,34,1)y0341\mathrm{y}=(0,\tfrac{3}{4},1)roman_y = ( 0 , divide start_ARG 3 end_ARG start_ARG 4 end_ARG , 1 ). Left panel: numeric solution via 100×100100100100\times 100100 × 100 grid refinement. Right panel: exact formulas.

5. Further examples

5.1. Convex 3×3333\times 33 × 3 example

Listing all possible cases, one checks that for 2×1212\times 12 × 1, 1×2121\times 21 × 2, 3×1313\times 13 × 1, 2×2222\times 22 × 2, 1×3131\times 31 × 3, 3×2323\times 23 × 2, and 2×3232\times 32 × 3 arrays, I𝐼Iitalic_I being convex implies being simple. Hence, we can always use the algorithm of Section 4.5 to solve the equations (45), (46). For 3×3333\times 33 × 3 case, there are two non-simple convex arrays:

(011111110) and (110111011).matrix011111110 and matrix110111011\begin{pmatrix}0&1&1\\ 1&1&1\\ 1&1&0\end{pmatrix}\text{ and }\begin{pmatrix}1&1&0\\ 1&1&1\\ 0&1&1\end{pmatrix}.( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) and ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) .

In the analysis of (45),(46) these two arrays differ by a symmetry accompanied by rr𝑟𝑟r\to-ritalic_r → - italic_r change. Hence, it would be sufficient to only study the first one, which we do in this section.

Let us denote x=(0<a1<a2<1)x0subscript𝑎1subscript𝑎21\mathrm{x}=(0<a_{1}<a_{2}<1)roman_x = ( 0 < italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < 1 ) and y=(0<b1<b2<1)y0subscript𝑏1subscript𝑏21\mathrm{y}=(0<b_{1}<b_{2}<1)roman_y = ( 0 < italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < 1 ). We need to determine the values of 8888 variables ϕ(0),ϕ(a1),ϕ(a2),ϕ(1)italic-ϕ0italic-ϕsubscript𝑎1italic-ϕsubscript𝑎2italic-ϕ1\phi(0),\phi(a_{1}),\phi(a_{2}),\phi(1)italic_ϕ ( 0 ) , italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , italic_ϕ ( 1 ), ψ(0),ψ(b1),ψ(b2),ψ(1)𝜓0𝜓subscript𝑏1𝜓subscript𝑏2𝜓1\psi(0),\psi(b_{1}),\psi(b_{2}),\psi(1)italic_ψ ( 0 ) , italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , italic_ψ ( 1 ), solving 6666 equations (45),(46). The equations involve products of 7777 different cross ratios, corresponding to 7777 pairs (u,v)𝑢𝑣(u,v)( italic_u , italic_v ) with Iuv=1subscript𝐼𝑢𝑣1I_{uv}=1italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1. Let X𝑋Xitalic_X and Y𝑌Yitalic_Y denote  the two missing cross ratios, corresponding to Iuv=0subscript𝐼𝑢𝑣0I_{uv}=0italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 0:

(67) X=ψ(1)ϕ(0)ψ(b2)ϕ(0)ψ(b2)ϕ(a1)ψ(1)ϕ(a1),Y=ψ(b1)ϕ(a2)ψ(0)ϕ(a2)ψ(0)ϕ(1)ψ(b1)ϕ(1).formulae-sequence𝑋𝜓1italic-ϕ0𝜓subscript𝑏2italic-ϕ0𝜓subscript𝑏2italic-ϕsubscript𝑎1𝜓1italic-ϕsubscript𝑎1𝑌𝜓subscript𝑏1italic-ϕsubscript𝑎2𝜓0italic-ϕsubscript𝑎2𝜓0italic-ϕ1𝜓subscript𝑏1italic-ϕ1X=\frac{\psi(1)-\phi(0)}{\psi(b_{2})-\phi(0)}\cdot\frac{\psi(b_{2})-\phi(a_{1}% )}{\psi(1)-\phi(a_{1})},\qquad Y=\frac{\psi(b_{1})-\phi(a_{2})}{\psi(0)-\phi(a% _{2})}\cdot\frac{\psi(0)-\phi(1)}{\psi(b_{1})-\phi(1)}.italic_X = divide start_ARG italic_ψ ( 1 ) - italic_ϕ ( 0 ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_ϕ ( 0 ) end_ARG ⋅ divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG , italic_Y = divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( 0 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( 0 ) - italic_ϕ ( 1 ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_ϕ ( 1 ) end_ARG .

Noting telescoping cancellations in (45), (46), we can rewrite them as six equations, and further fix ϕ(0)=0italic-ϕ00\phi(0)=0italic_ϕ ( 0 ) = 0, ϕ(1)=1italic-ϕ11\phi(1)=1italic_ϕ ( 1 ) = 1, ψ(0)=𝜓0\psi(0)=\inftyitalic_ψ ( 0 ) = ∞ by using Moebius invariance, to get

{era1X=ψ(1)ϕ(0)ψ(0)ϕ(0)ψ(0)ϕ(a1)ψ(1)ϕ(a1),er(a2a1)=ψ(1)ϕ(a1)ψ(0)ϕ(a1)ψ(0)ϕ(a2)ψ(1)ϕ(a2),er(1a2)Y=ψ(1)ϕ(a2)ψ(0)ϕ(a2)ψ(0)ϕ(1)ψ(1)ϕ(1),erb1Y=ψ(b1)ϕ(0)ψ(0)ϕ(0)ψ(0)ϕ(1)ψ(b1)ϕ(1),er(b2b1)=ψ(b2)ϕ(0)ψ(b1)ϕ(0)ψ(b1)ϕ(1)ψ(b2)ϕ(1),er(1b2)X=ψ(1)ϕ(0)ψ(b2)ϕ(0)ψ(b2)ϕ(1)ψ(1)ϕ(1).{era1X=ψ(1)ψ(1)ϕ(a1),er(a2a1)=ψ(1)ϕ(a1)ψ(1)ϕ(a2),er(1a2)Y=ψ(1)ϕ(a2)ψ(1)1,erb1Y=ψ(b1)ψ(b1)1,er(b2b1)=ψ(b2)ψ(b1)ψ(b1)1ψ(b2)1,er(1b2)X=ψ(1)ψ(b2)ψ(b2)1ψ(1)1.casessuperscript𝑒𝑟subscript𝑎1𝑋absent𝜓1italic-ϕ0𝜓0italic-ϕ0𝜓0italic-ϕsubscript𝑎1𝜓1italic-ϕsubscript𝑎1superscript𝑒𝑟subscript𝑎2subscript𝑎1absent𝜓1italic-ϕsubscript𝑎1𝜓0italic-ϕsubscript𝑎1𝜓0italic-ϕsubscript𝑎2𝜓1italic-ϕsubscript𝑎2superscript𝑒𝑟1subscript𝑎2𝑌absent𝜓1italic-ϕsubscript𝑎2𝜓0italic-ϕsubscript𝑎2𝜓0italic-ϕ1𝜓1italic-ϕ1superscript𝑒𝑟subscript𝑏1𝑌absent𝜓subscript𝑏1italic-ϕ0𝜓0italic-ϕ0𝜓0italic-ϕ1𝜓subscript𝑏1italic-ϕ1superscript𝑒𝑟subscript𝑏2subscript𝑏1absent𝜓subscript𝑏2italic-ϕ0𝜓subscript𝑏1italic-ϕ0𝜓subscript𝑏1italic-ϕ1𝜓subscript𝑏2italic-ϕ1superscript𝑒𝑟1subscript𝑏2𝑋absent𝜓1italic-ϕ0𝜓subscript𝑏2italic-ϕ0𝜓subscript𝑏2italic-ϕ1𝜓1italic-ϕ1casessuperscript𝑒𝑟subscript𝑎1𝑋absent𝜓1𝜓1italic-ϕsubscript𝑎1superscript𝑒𝑟subscript𝑎2subscript𝑎1absent𝜓1italic-ϕsubscript𝑎1𝜓1italic-ϕsubscript𝑎2superscript𝑒𝑟1subscript𝑎2𝑌absent𝜓1italic-ϕsubscript𝑎2𝜓11superscript𝑒𝑟subscript𝑏1𝑌absent𝜓subscript𝑏1𝜓subscript𝑏11superscript𝑒𝑟subscript𝑏2subscript𝑏1absent𝜓subscript𝑏2𝜓subscript𝑏1𝜓subscript𝑏11𝜓subscript𝑏21superscript𝑒𝑟1subscript𝑏2𝑋absent𝜓1𝜓subscript𝑏2𝜓subscript𝑏21𝜓11\begin{cases}e^{ra_{1}}\cdot X&=\frac{\psi(1)-\phi(0)}{\psi(0)-\phi(0)}\cdot% \frac{\psi(0)-\phi(a_{1})}{\psi(1)-\phi(a_{1})},\\ e^{r(a_{2}-a_{1})}&=\frac{\psi(1)-\phi(a_{1})}{\psi(0)-\phi(a_{1})}\cdot\frac{% \psi(0)-\phi(a_{2})}{\psi(1)-\phi(a_{2})},\\ e^{r(1-a_{2})}\cdot Y&=\frac{\psi(1)-\phi(a_{2})}{\psi(0)-\phi(a_{2})}\cdot% \frac{\psi(0)-\phi(1)}{\psi(1)-\phi(1)},\\ e^{rb_{1}}\cdot Y&=\frac{\psi(b_{1})-\phi(0)}{\psi(0)-\phi(0)}\cdot\frac{\psi(% 0)-\phi(1)}{\psi(b_{1})-\phi(1)},\\ e^{r(b_{2}-b_{1})}&=\frac{\psi(b_{2})-\phi(0)}{\psi(b_{1})-\phi(0)}\cdot\frac{% \psi(b_{1})-\phi(1)}{\psi(b_{2})-\phi(1)},\\ e^{r(1-b_{2})}\cdot X&=\frac{\psi(1)-\phi(0)}{\psi(b_{2})-\phi(0)}\cdot\frac{% \psi(b_{2})-\phi(1)}{\psi(1)-\phi(1)}.\end{cases}\Longleftrightarrow\begin{% cases}e^{ra_{1}}\cdot X&=\frac{\psi(1)}{\psi(1)-\phi(a_{1})},\\ e^{r(a_{2}-a_{1})}&=\frac{\psi(1)-\phi(a_{1})}{\psi(1)-\phi(a_{2})},\\ e^{r(1-a_{2})}\cdot Y&=\frac{\psi(1)-\phi(a_{2})}{\psi(1)-1},\\ e^{rb_{1}}\cdot Y&=\frac{\psi(b_{1})}{\psi(b_{1})-1},\\ e^{r(b_{2}-b_{1})}&=\frac{\psi(b_{2})}{\psi(b_{1})}\cdot\frac{\psi(b_{1})-1}{% \psi(b_{2})-1},\\ e^{r(1-b_{2})}\cdot X&=\frac{\psi(1)}{\psi(b_{2})}\cdot\frac{\psi(b_{2})-1}{% \psi(1)-1}.\end{cases}{ start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋅ italic_X end_CELL start_CELL = divide start_ARG italic_ψ ( 1 ) - italic_ϕ ( 0 ) end_ARG start_ARG italic_ψ ( 0 ) - italic_ϕ ( 0 ) end_ARG ⋅ divide start_ARG italic_ψ ( 0 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_CELL start_CELL = divide start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( 0 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( 0 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r ( 1 - italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ⋅ italic_Y end_CELL start_CELL = divide start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( 0 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( 0 ) - italic_ϕ ( 1 ) end_ARG start_ARG italic_ψ ( 1 ) - italic_ϕ ( 1 ) end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋅ italic_Y end_CELL start_CELL = divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_ϕ ( 0 ) end_ARG start_ARG italic_ψ ( 0 ) - italic_ϕ ( 0 ) end_ARG ⋅ divide start_ARG italic_ψ ( 0 ) - italic_ϕ ( 1 ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_ϕ ( 1 ) end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_CELL start_CELL = divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_ϕ ( 0 ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_ϕ ( 0 ) end_ARG ⋅ divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_ϕ ( 1 ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_ϕ ( 1 ) end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r ( 1 - italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ⋅ italic_X end_CELL start_CELL = divide start_ARG italic_ψ ( 1 ) - italic_ϕ ( 0 ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_ϕ ( 0 ) end_ARG ⋅ divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_ϕ ( 1 ) end_ARG start_ARG italic_ψ ( 1 ) - italic_ϕ ( 1 ) end_ARG . end_CELL end_ROW ⟺ { start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋅ italic_X end_CELL start_CELL = divide start_ARG italic_ψ ( 1 ) end_ARG start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_CELL start_CELL = divide start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r ( 1 - italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ⋅ italic_Y end_CELL start_CELL = divide start_ARG italic_ψ ( 1 ) - italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( 1 ) - 1 end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋅ italic_Y end_CELL start_CELL = divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - 1 end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_CELL start_CELL = divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - 1 end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - 1 end_ARG , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_r ( 1 - italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ⋅ italic_X end_CELL start_CELL = divide start_ARG italic_ψ ( 1 ) end_ARG start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG ⋅ divide start_ARG italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - 1 end_ARG start_ARG italic_ψ ( 1 ) - 1 end_ARG . end_CELL end_ROW

Assuming X𝑋Xitalic_X and Y𝑌Yitalic_Y to be known, the system of equations is readily solved in terms of them, resulting in:

(68) ψ(b1)=11erb1/Y,ψ(b2)=11erb2/Y,ψ(1)=11er/(XY),formulae-sequence𝜓subscript𝑏111superscript𝑒𝑟subscript𝑏1𝑌formulae-sequence𝜓subscript𝑏211superscript𝑒𝑟subscript𝑏2𝑌𝜓111superscript𝑒𝑟𝑋𝑌\psi(b_{1})=\frac{1}{1-e^{-rb_{1}}/Y},\quad\psi(b_{2})=\frac{1}{1-e^{-rb_{2}}/% Y},\quad\psi(1)=\frac{1}{1-e^{-r}/(XY)},italic_ψ ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_Y end_ARG , italic_ψ ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_Y end_ARG , italic_ψ ( 1 ) = divide start_ARG 1 end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT / ( italic_X italic_Y ) end_ARG ,
(69) ϕ(a1)=1era1/X1er/(XY),ϕ(a2)=1era2/X1er/(XY).formulae-sequenceitalic-ϕsubscript𝑎11superscript𝑒𝑟subscript𝑎1𝑋1superscript𝑒𝑟𝑋𝑌italic-ϕsubscript𝑎21superscript𝑒𝑟subscript𝑎2𝑋1superscript𝑒𝑟𝑋𝑌\phi(a_{1})=\frac{1-e^{-ra_{1}}/X}{1-e^{-r}/(XY)},\quad\phi(a_{2})=\frac{1-e^{% -ra_{2}}/X}{1-e^{-r}/(XY)}.italic_ϕ ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_X end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT / ( italic_X italic_Y ) end_ARG , italic_ϕ ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_X end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT / ( italic_X italic_Y ) end_ARG .

In order to find X𝑋Xitalic_X and Y𝑌Yitalic_Y, we plug the values of ψ𝜓\psiitalic_ψ and ϕitalic-ϕ\phiitalic_ϕ back into (67) and get two equations:

(70) X=1(1erb2/Y)(1era1/X)1er/(XY)era1/X,Y=1(1era2/X)(1erb1/Y)1er/(XY)erb1/Y.formulae-sequence𝑋11superscript𝑒𝑟subscript𝑏2𝑌1superscript𝑒𝑟subscript𝑎1𝑋1superscript𝑒𝑟𝑋𝑌superscript𝑒𝑟subscript𝑎1𝑋𝑌11superscript𝑒𝑟subscript𝑎2𝑋1superscript𝑒𝑟subscript𝑏1𝑌1superscript𝑒𝑟𝑋𝑌superscript𝑒𝑟subscript𝑏1𝑌X=\frac{1-\frac{(1-e^{-rb_{2}}/Y)(1-e^{-ra_{1}}/X)}{1-e^{-r}/(XY)}}{e^{-ra_{1}% }/X},\qquad Y=\frac{1-\frac{(1-e^{-ra_{2}}/X)(1-e^{-rb_{1}}/Y)}{1-e^{-r}/(XY)}% }{e^{-rb_{1}}/Y}.italic_X = divide start_ARG 1 - divide start_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_Y ) ( 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_X ) end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT / ( italic_X italic_Y ) end_ARG end_ARG start_ARG italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_X end_ARG , italic_Y = divide start_ARG 1 - divide start_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_X ) ( 1 - italic_e start_POSTSUPERSCRIPT - italic_r italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_Y ) end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT / ( italic_X italic_Y ) end_ARG end_ARG start_ARG italic_e start_POSTSUPERSCRIPT - italic_r italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_Y end_ARG .

Simplifying using X0𝑋0X\neq 0italic_X ≠ 0, Y0𝑌0Y\neq 0italic_Y ≠ 0, we get

erXY=er(1a1)+erb2Xer(b2a1)Y,erXY=er(1b1)+era2XYer(a2b1),formulae-sequencesuperscript𝑒𝑟𝑋𝑌superscript𝑒𝑟1subscript𝑎1superscript𝑒𝑟subscript𝑏2𝑋superscript𝑒𝑟subscript𝑏2subscript𝑎1𝑌superscript𝑒𝑟𝑋𝑌superscript𝑒𝑟1subscript𝑏1superscript𝑒𝑟subscript𝑎2𝑋𝑌superscript𝑒𝑟subscript𝑎2subscript𝑏1e^{-r}-XY=e^{-r(1-a_{1})}+e^{-rb_{2}}-Xe^{-r(b_{2}-a_{1})}-Y,\qquad e^{-r}-XY=% e^{-r(1-b_{1})}+e^{-ra_{2}}-X-Ye^{-r(a_{2}-b_{1})},italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT - italic_X italic_Y = italic_e start_POSTSUPERSCRIPT - italic_r ( 1 - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - italic_X italic_e start_POSTSUPERSCRIPT - italic_r ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - italic_Y , italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT - italic_X italic_Y = italic_e start_POSTSUPERSCRIPT - italic_r ( 1 - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - italic_X - italic_Y italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ,

which is equivalent to a linear relation and a quadratic equation:

(71) Y=er(1a1)+erb2er(1b1)era2+X(1er(b2a1))1er(a2b1),𝑌superscript𝑒𝑟1subscript𝑎1superscript𝑒𝑟subscript𝑏2superscript𝑒𝑟1subscript𝑏1superscript𝑒𝑟subscript𝑎2𝑋1superscript𝑒𝑟subscript𝑏2subscript𝑎11superscript𝑒𝑟subscript𝑎2subscript𝑏1Y=\frac{e^{-r(1-a_{1})}+e^{-rb_{2}}-e^{-r(1-b_{1})}-e^{-ra_{2}}+X(1-e^{-r(b_{2% }-a_{1})})}{1-e^{-r(a_{2}-b_{1})}},italic_Y = divide start_ARG italic_e start_POSTSUPERSCRIPT - italic_r ( 1 - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_r ( 1 - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_X ( 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ,
(72) X2(1er(b2a1))+X(er(1a1)+erb2+er(a2a1+b2b1)er(1b1)era21)+er(1b1)+era2+er(1+a2b1)er(1a1+a2b1)er(b2+a2b1)er=0.superscript𝑋21superscript𝑒𝑟subscript𝑏2subscript𝑎1𝑋superscript𝑒𝑟1subscript𝑎1superscript𝑒𝑟subscript𝑏2superscript𝑒𝑟subscript𝑎2subscript𝑎1subscript𝑏2subscript𝑏1superscript𝑒𝑟1subscript𝑏1superscript𝑒𝑟subscript𝑎21superscript𝑒𝑟1subscript𝑏1superscript𝑒𝑟subscript𝑎2superscript𝑒𝑟1subscript𝑎2subscript𝑏1superscript𝑒𝑟1subscript𝑎1subscript𝑎2subscript𝑏1superscript𝑒𝑟subscript𝑏2subscript𝑎2subscript𝑏1superscript𝑒𝑟0X^{2}(1-e^{-r(b_{2}-a_{1})})+X(e^{-r(1-a_{1})}+e^{-rb_{2}}+e^{-r(a_{2}-a_{1}+b% _{2}-b_{1})}-e^{-r(1-b_{1})}-e^{-ra_{2}}-1)\\ +e^{-r(1-b_{1})}+e^{-ra_{2}}+e^{-r(1+a_{2}-b_{1})}-e^{-r(1-a_{1}+a_{2}-b_{1})}% -e^{-r(b_{2}+a_{2}-b_{1})}-e^{-r}=0.start_ROW start_CELL italic_X start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) + italic_X ( italic_e start_POSTSUPERSCRIPT - italic_r ( 1 - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_r ( 1 - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - 1 ) end_CELL end_ROW start_ROW start_CELL + italic_e start_POSTSUPERSCRIPT - italic_r ( 1 - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r ( 1 + italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_r ( 1 - italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_r ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT = 0 . end_CELL end_ROW

The two solutions of the last quadratic equation lead through (71) and then (68), (69) to two solutions of the system (45), (46), which then lead to the formula for g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) via (47). In order to pick the correct solution we need to additionally impose the positivity condition:

(73) 1rln[(ϕ(xu)ψ(yv))(ϕ(xu1)ψ(yv1))(ϕ(xu1)ψ(yv))(ϕ(xu)ψ(yv1))]>0,whenever Iuv=1.formulae-sequence1𝑟italic-ϕsubscriptx𝑢𝜓subscripty𝑣italic-ϕsubscriptx𝑢1𝜓subscripty𝑣1italic-ϕsubscriptx𝑢1𝜓subscripty𝑣italic-ϕsubscriptx𝑢𝜓subscripty𝑣10whenever subscript𝐼𝑢𝑣1-\frac{1}{r}\ln\left[\frac{(\phi(\mathrm{x}_{u})-\psi(\mathrm{y}_{v}))(\phi(% \mathrm{x}_{u-1})-\psi(\mathrm{y}_{v-1}))}{(\phi(\mathrm{x}_{u-1})-\psi(% \mathrm{y}_{v}))(\phi(\mathrm{x}_{u})-\psi(\mathrm{y}_{v-1}))}\right]>0,\qquad% \text{whenever }I_{uv}=1.- divide start_ARG 1 end_ARG start_ARG italic_r end_ARG roman_ln [ divide start_ARG ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ) ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) end_ARG start_ARG ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u - 1 end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ) ( italic_ϕ ( roman_x start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) - italic_ψ ( roman_y start_POSTSUBSCRIPT italic_v - 1 end_POSTSUBSCRIPT ) ) end_ARG ] > 0 , whenever italic_I start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT = 1 .

which is a corollary of positivity of the density g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) and (48).

The formula (72) is quite complicated in the general case and it is intructive to specialize to an equally-spaced grid a1=b1=13subscript𝑎1subscript𝑏113a_{1}=b_{1}=\tfrac{1}{3}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 3 end_ARG, a2=b2=23subscript𝑎2subscript𝑏223a_{2}=b_{2}=\tfrac{2}{3}italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = divide start_ARG 2 end_ARG start_ARG 3 end_ARG. The quadratic equation after dividing by (1er/3)1superscript𝑒𝑟3(1-e^{-r/3})( 1 - italic_e start_POSTSUPERSCRIPT - italic_r / 3 end_POSTSUPERSCRIPT ) becomes

(74) X2X(1+er/3)+2e2r/3er=0,superscript𝑋2𝑋1superscript𝑒𝑟32superscript𝑒2𝑟3superscript𝑒𝑟0X^{2}-X(1+e^{-r/3})+2e^{-2r/3}-e^{-r}=0,italic_X start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_X ( 1 + italic_e start_POSTSUPERSCRIPT - italic_r / 3 end_POSTSUPERSCRIPT ) + 2 italic_e start_POSTSUPERSCRIPT - 2 italic_r / 3 end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT = 0 ,

and Y=X𝑌𝑋Y=Xitalic_Y = italic_X. One directly checks that, when writing two solutions of (74) in terms of ±Discriminantplus-or-minusDiscriminant\pm\sqrt{\mathrm{Discriminant}}± square-root start_ARG roman_Discriminant end_ARG, the correct sign giving (73) is “+++” for r>0𝑟0r>0italic_r > 0 and “--” for r<0𝑟0r<0italic_r < 0.

As r0𝑟0r\to 0italic_r → 0, one should rescale X=1+rX[0]𝑋1𝑟subscript𝑋delimited-[]0X=1+rX_{[0]}italic_X = 1 + italic_r italic_X start_POSTSUBSCRIPT [ 0 ] end_POSTSUBSCRIPT, with (74) turning asymptotically into

(75) X[0]2+13X[0]19=0.superscriptsubscript𝑋delimited-[]0213subscript𝑋delimited-[]0190X_{[0]}^{2}+\frac{1}{3}X_{[0]}-\frac{1}{9}=0.italic_X start_POSTSUBSCRIPT [ 0 ] end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 3 end_ARG italic_X start_POSTSUBSCRIPT [ 0 ] end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG 9 end_ARG = 0 .

This time, chosing the correct solution is simple: Theorem 4.12 and Lemma 4.13 imply that we need the positive one.

See two plots of the density for different values of r𝑟ritalic_r in Figure 7.

Refer to caption
Refer to caption
Figure 7. Solution of EL equations for I=(011111110)𝐼matrix011111110I=\begin{pmatrix}0&1&1\\ 1&1&1\\ 1&1&0\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ), x=y=(0,13,23,1)xy013231\mathrm{x}=\mathrm{y}=(0,\tfrac{1}{3},\tfrac{2}{3},1)roman_x = roman_y = ( 0 , divide start_ARG 1 end_ARG start_ARG 3 end_ARG , divide start_ARG 2 end_ARG start_ARG 3 end_ARG , 1 ). Left panel: r=1𝑟1r=1italic_r = 1. Right panel: r=5𝑟5r=-5italic_r = - 5

5.2. Non-convex 3×2323\times 23 × 2 example

For our next example, we consider a non-convex I𝐼Iitalic_I. We take k=3𝑘3k=3italic_k = 3, =22\ell=2roman_ℓ = 2, I=(101111)𝐼matrix101111I=\begin{pmatrix}1&0&1\\ 1&1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) and x=(0,13,23,1)x013231\mathrm{x}=(0,\tfrac{1}{3},\tfrac{2}{3},1)roman_x = ( 0 , divide start_ARG 1 end_ARG start_ARG 3 end_ARG , divide start_ARG 2 end_ARG start_ARG 3 end_ARG , 1 ), y=(0,23,1)y0231\mathrm{y}=(0,\tfrac{2}{3},1)roman_y = ( 0 , divide start_ARG 2 end_ARG start_ARG 3 end_ARG , 1 ). The formulas of Theorem 4.9 do not apply in this setting, and we need to follow a more complicated path, as outlined at the end of Section 4.2. By (a small generalization of) Corollary 4.11, we have that ϕuvsuperscriptitalic-ϕ𝑢𝑣\phi^{uv}italic_ϕ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT and ψuvsuperscript𝜓𝑢𝑣\psi^{uv}italic_ψ start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT are Moebius transforms of erxsuperscript𝑒𝑟𝑥e^{rx}italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT and erysuperscript𝑒𝑟𝑦e^{ry}italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT, and

(76) guv(x,y)=(auvduvbuvcuv)rerx+ry(auv+buvery+cuverx+duverx+ry)2.superscript𝑔𝑢𝑣𝑥𝑦subscript𝑎𝑢𝑣subscript𝑑𝑢𝑣subscript𝑏𝑢𝑣subscript𝑐𝑢𝑣𝑟superscript𝑒𝑟𝑥𝑟𝑦superscriptsubscript𝑎𝑢𝑣subscript𝑏𝑢𝑣superscript𝑒𝑟𝑦subscript𝑐𝑢𝑣superscript𝑒𝑟𝑥subscript𝑑𝑢𝑣superscript𝑒𝑟𝑥𝑟𝑦2g^{uv}(x,y)=-\frac{(a_{uv}d_{uv}-b_{uv}c_{uv})re^{rx+ry}}{(a_{uv}+b_{uv}e^{ry}% +c_{uv}e^{rx}+d_{uv}e^{rx+ry})^{2}}.italic_g start_POSTSUPERSCRIPT italic_u italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - divide start_ARG ( italic_a start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT - italic_b start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT ) italic_r italic_e start_POSTSUPERSCRIPT italic_r italic_x + italic_r italic_y end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_a start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT + italic_b start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT + italic_c start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT + italic_d start_POSTSUBSCRIPT italic_u italic_v end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_r italic_x + italic_r italic_y end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG .

Each pair uv𝑢𝑣uvitalic_u italic_v has 3333 variables (a,b,c,d𝑎𝑏𝑐𝑑a,b,c,ditalic_a , italic_b , italic_c , italic_d up to scale), for a total of 15151515 constants to determine. They are found from equations given by Proposition 4.4, Proposition 4.5, and versions of (45), (46) expressing the uniform marginals for g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ). After putting all those conditions into mathematical software, we found a unique solution:

(77) g(x,y)={rer(x+y)(1+er2)(1e2r3)(1eryerx+[er2+e2r3+e7r6]er(x+y))2,0<x<13 and 0<y<23,rer(x+y)(1er3)(er6+e2r3)(1+[er6+er2e2r3]eryerx+erer(x+y))2,0<x<13 and 23<y<1,rer(x+y)(1er6+er3)(1e2r3)(1ery+[er61er3]erx+[e2r3e5r6+er]er(x+y))2,13<x<23 and 0<y<23,0,13<x<23 and 23<y<1,rer(x+y)(1e2r3)(er6+e2r3)(er6er6ery+[er2+e2r31]erx+e7r6er(x+y))2,23<x<1 and 0<y<23,rer(x+y)e4r3(1er3)(1+er2)([1er6+er2]e2r3erye2r3erx+e5r3er(x+y))2,23<x<1 and 23<y<1.𝑔𝑥𝑦cases𝑟superscript𝑒𝑟𝑥𝑦1superscript𝑒𝑟21superscript𝑒2𝑟3superscript1superscript𝑒𝑟𝑦superscript𝑒𝑟𝑥delimited-[]superscript𝑒𝑟2superscript𝑒2𝑟3superscript𝑒7𝑟6superscript𝑒𝑟𝑥𝑦20𝑥13 and 0𝑦23𝑟superscript𝑒𝑟𝑥𝑦1superscript𝑒𝑟3superscript𝑒𝑟6superscript𝑒2𝑟3superscript1delimited-[]superscript𝑒𝑟6superscript𝑒𝑟2superscript𝑒2𝑟3superscript𝑒𝑟𝑦superscript𝑒𝑟𝑥superscript𝑒𝑟superscript𝑒𝑟𝑥𝑦20𝑥13 and 23𝑦1𝑟superscript𝑒𝑟𝑥𝑦1superscript𝑒𝑟6superscript𝑒𝑟31superscript𝑒2𝑟3superscript1superscript𝑒𝑟𝑦delimited-[]superscript𝑒𝑟61superscript𝑒𝑟3superscript𝑒𝑟𝑥delimited-[]superscript𝑒2𝑟3superscript𝑒5𝑟6superscript𝑒𝑟superscript𝑒𝑟𝑥𝑦213𝑥23 and 0𝑦23013𝑥23 and 23𝑦1𝑟superscript𝑒𝑟𝑥𝑦1superscript𝑒2𝑟3superscript𝑒𝑟6superscript𝑒2𝑟3superscriptsuperscript𝑒𝑟6superscript𝑒𝑟6superscript𝑒𝑟𝑦delimited-[]superscript𝑒𝑟2superscript𝑒2𝑟31superscript𝑒𝑟𝑥superscript𝑒7𝑟6superscript𝑒𝑟𝑥𝑦223𝑥1 and 0𝑦23𝑟superscript𝑒𝑟𝑥𝑦superscript𝑒4𝑟31superscript𝑒𝑟31superscript𝑒𝑟2superscriptdelimited-[]1superscript𝑒𝑟6superscript𝑒𝑟2superscript𝑒2𝑟3superscript𝑒𝑟𝑦superscript𝑒2𝑟3superscript𝑒𝑟𝑥superscript𝑒5𝑟3superscript𝑒𝑟𝑥𝑦223𝑥1 and 23𝑦1g(x,y)=\begin{cases}\frac{re^{r(x+y)}\bigl{(}1+e^{-\frac{r}{2}}\bigr{)}\bigl{(% }1-e^{-\frac{2r}{3}}\bigr{)}}{\left(1-e^{ry}-e^{rx}+\bigl{[}-e^{-\frac{r}{2}}+% e^{-\frac{2r}{3}}+e^{-\frac{7r}{6}}\bigr{]}e^{r(x+y)}\right)^{2}},&0<x<\frac{1% }{3}\text{ and }0<y<\frac{2}{3},\\ \frac{re^{r(x+y)}\bigl{(}1-e^{-\frac{r}{3}}\bigr{)}\bigl{(}e^{-\frac{r}{6}}+e^% {-\frac{2r}{3}}\bigr{)}}{\left(1+\bigl{[}-e^{-\frac{r}{6}}+e^{-\frac{r}{2}}-e^% {-\frac{2r}{3}}\bigr{]}e^{ry}-e^{rx}+e^{-r}e^{r(x+y)}\right)^{2}},&0<x<\frac{1% }{3}\text{ and }\frac{2}{3}<y<1,\\ \frac{re^{r(x+y)}\bigl{(}1-e^{-\frac{r}{6}}+e^{-\frac{r}{3}}\bigr{)}\bigl{(}1-% e^{-\frac{2r}{3}}\bigr{)}}{\left(1-e^{ry}+\bigl{[}e^{-\frac{r}{6}}-1-e^{-\frac% {r}{3}}\bigr{]}e^{rx}+\bigl{[}e^{-\frac{2r}{3}}-e^{-\frac{5r}{6}}+e^{-r}\bigr{% ]}e^{r(x+y)}\right)^{2}},&\frac{1}{3}<x<\frac{2}{3}\text{ and }0<y<\frac{2}{3}% ,\\ 0,&\frac{1}{3}<x<\frac{2}{3}\text{ and }\frac{2}{3}<y<1,\\ \frac{re^{r(x+y)}\bigl{(}1-e^{-\frac{2r}{3}}\bigr{)}\bigl{(}e^{-\frac{r}{6}}+e% ^{-\frac{2r}{3}}\bigr{)}}{\left(e^{-\frac{r}{6}}-e^{-\frac{r}{6}}e^{ry}+\bigl{% [}-e^{-\frac{r}{2}}+e^{-\frac{2r}{3}}-1\bigr{]}e^{rx}+e^{-\frac{7r}{6}}e^{r(x+% y)}\right)^{2}},&\frac{2}{3}<x<1\text{ and }0<y<\frac{2}{3},\\ \frac{re^{r(x+y)}e^{-\frac{4r}{3}}\bigl{(}1-e^{-\frac{r}{3}}\bigr{)}\bigl{(}1+% e^{-\frac{r}{2}}\bigr{)}}{\left(\bigl{[}1-e^{-\frac{r}{6}}+e^{-\frac{r}{2}}% \bigr{]}-e^{-\frac{2r}{3}}e^{ry}-e^{-\frac{2r}{3}}e^{rx}+e^{-\frac{5r}{3}}e^{r% (x+y)}\right)^{2}},&\frac{2}{3}<x<1\text{ and }\frac{2}{3}<y<1.\end{cases}italic_g ( italic_x , italic_y ) = { start_ROW start_CELL divide start_ARG italic_r italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ( 1 + italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) ( 1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ) end_ARG start_ARG ( 1 - italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT + [ - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG 7 italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT ] italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , end_CELL start_CELL 0 < italic_x < divide start_ARG 1 end_ARG start_ARG 3 end_ARG and 0 < italic_y < divide start_ARG 2 end_ARG start_ARG 3 end_ARG , end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_r italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ( 1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ) ( italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ) end_ARG start_ARG ( 1 + [ - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ] italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , end_CELL start_CELL 0 < italic_x < divide start_ARG 1 end_ARG start_ARG 3 end_ARG and divide start_ARG 2 end_ARG start_ARG 3 end_ARG < italic_y < 1 , end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_r italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ( 1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ) ( 1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ) end_ARG start_ARG ( 1 - italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT + [ italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT - 1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ] italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT + [ italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - divide start_ARG 5 italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT ] italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 3 end_ARG < italic_x < divide start_ARG 2 end_ARG start_ARG 3 end_ARG and 0 < italic_y < divide start_ARG 2 end_ARG start_ARG 3 end_ARG , end_CELL end_ROW start_ROW start_CELL 0 , end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 3 end_ARG < italic_x < divide start_ARG 2 end_ARG start_ARG 3 end_ARG and divide start_ARG 2 end_ARG start_ARG 3 end_ARG < italic_y < 1 , end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_r italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ( 1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ) ( italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ) end_ARG start_ARG ( italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT + [ - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT - 1 ] italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG 7 italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , end_CELL start_CELL divide start_ARG 2 end_ARG start_ARG 3 end_ARG < italic_x < 1 and 0 < italic_y < divide start_ARG 2 end_ARG start_ARG 3 end_ARG , end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_r italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - divide start_ARG 4 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ( 1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT ) ( 1 + italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) end_ARG start_ARG ( [ 1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 6 end_ARG end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_r end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ] - italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_r italic_y end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - divide start_ARG 2 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_r italic_x end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - divide start_ARG 5 italic_r end_ARG start_ARG 3 end_ARG end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_r ( italic_x + italic_y ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , end_CELL start_CELL divide start_ARG 2 end_ARG start_ARG 3 end_ARG < italic_x < 1 and divide start_ARG 2 end_ARG start_ARG 3 end_ARG < italic_y < 1 . end_CELL end_ROW

A plot for two different values of r𝑟ritalic_r is in Figure 8.

Refer to caption
Refer to caption
Figure 8. Solution of EL equations for I=(101111)𝐼matrix101111I=\begin{pmatrix}1&0&1\\ 1&1&1\end{pmatrix}italic_I = ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ), x=(0,13,23,1)x013231\mathrm{x}=(0,\tfrac{1}{3},\tfrac{2}{3},1)roman_x = ( 0 , divide start_ARG 1 end_ARG start_ARG 3 end_ARG , divide start_ARG 2 end_ARG start_ARG 3 end_ARG , 1 ), y=(0,23,1)y0231\mathrm{y}=(0,\tfrac{2}{3},1)roman_y = ( 0 , divide start_ARG 2 end_ARG start_ARG 3 end_ARG , 1 ) via (77). Left panel: r=1𝑟1r=-1italic_r = - 1. Right panel: r=7𝑟7r=7italic_r = 7.

5.3. Non-rectangular example

Our setting, explained in Section 3.1, admits a natural generalization to a larger class of domains than just rectilinear polygonal domains. We can deal with domains ΩΩ\Omegaroman_Ω, whose complexity grows with N𝑁Nitalic_N by allowing k𝑘kitalic_k and \ellroman_ℓ to grow, and in the limit ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT can approximate essentially arbitrary subsets of [0,1]2superscript012[0,1]^{2}[ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Theorems 3.7 and 3.15 extend to such setting with no changes. Theorem 4.9 is no longer available, because the number and complexity of equations (45), (46), (47) grows with k𝑘kitalic_k and \ellroman_ℓ. However, in some situations at least in the r=0𝑟0r=0italic_r = 0 case we can still find an explicit formula for the maximizer of the permuton energy. Let us provide an example.

Consider the domain U𝑈Uitalic_U obtained from [0,1]2superscript012[0,1]^{2}[ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT by removing the region below the line ax+by=1𝑎𝑥𝑏𝑦1ax+by=1italic_a italic_x + italic_b italic_y = 1 for a,b>1𝑎𝑏1a,b>1italic_a , italic_b > 1, see Figure 9. Set r=0𝑟0r=0italic_r = 0 and assume that ΩX,Y,IsuperscriptΩ𝑋𝑌𝐼\Omega^{X,Y,I}roman_Ω start_POSTSUPERSCRIPT italic_X , italic_Y , italic_I end_POSTSUPERSCRIPT approximates U𝑈Uitalic_U as N𝑁N\to\inftyitalic_N → ∞. Then a generalization of Corollary 3.9, yields that the permuton g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) corresponding to the limit shape should be a maximizer of

(78) 0101g(x,y)lng(x,y)dxdy,superscriptsubscript01superscriptsubscript01𝑔𝑥𝑦𝑔𝑥𝑦differential-d𝑥differential-d𝑦-\int_{0}^{1}\int_{0}^{1}g(x,y)\ln g(x,y)\,\mathrm{d}x\,\mathrm{d}y,- ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_ln italic_g ( italic_x , italic_y ) roman_d italic_x roman_d italic_y ,

subject to the conditions

  • g(x,y)=0𝑔𝑥𝑦0g(x,y)=0italic_g ( italic_x , italic_y ) = 0 for ax+by<1𝑎𝑥𝑏𝑦1ax+by<1italic_a italic_x + italic_b italic_y < 1,

  • 01g(x,y)dx=1superscriptsubscript01𝑔𝑥𝑦differential-d𝑥1\int_{0}^{1}g(x,y)\mathrm{d}x=1∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_x = 1, 0y10𝑦10\leq y\leq 10 ≤ italic_y ≤ 1, and 01g(x,y)dy=1superscriptsubscript01𝑔𝑥𝑦differential-d𝑦1\int_{0}^{1}g(x,y)\mathrm{d}y=1∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g ( italic_x , italic_y ) roman_d italic_y = 1, 0x10𝑥10\leq x\leq 10 ≤ italic_x ≤ 1.

In the r=0𝑟0r=0italic_r = 0 case the Euler-Lagrange equations (12) mean a factorization

g(x,y)=λ(x)μ(y)𝟏ax+by1.𝑔𝑥𝑦𝜆𝑥𝜇𝑦subscript1𝑎𝑥𝑏𝑦1g(x,y)=\lambda(x)\mu(y)\mathbf{1}_{ax+by\geq 1}.italic_g ( italic_x , italic_y ) = italic_λ ( italic_x ) italic_μ ( italic_y ) bold_1 start_POSTSUBSCRIPT italic_a italic_x + italic_b italic_y ≥ 1 end_POSTSUBSCRIPT .

Let us find λ𝜆\lambdaitalic_λ and μ𝜇\muitalic_μ for which g𝑔gitalic_g has uniform marginals. We can assume that the function λ(x)𝜆𝑥\lambda(x)italic_λ ( italic_x ) has some constant value c1subscript𝑐1c_{1}italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT for x>1/a𝑥1𝑎x>1/aitalic_x > 1 / italic_a, and the function μ(y)𝜇𝑦\mu(y)italic_μ ( italic_y ) has some constant value c2subscript𝑐2c_{2}italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT for y>1/b𝑦1𝑏y>1/bitalic_y > 1 / italic_b. This assumption is based on the fact that the distribution of the random restricted permutations of interest is invariant under left and right multiplications with permutations of {N/a,,N}𝑁𝑎𝑁\{N/a,\dots,N\}{ italic_N / italic_a , … , italic_N } and {N/b,,N}𝑁𝑏𝑁\{N/b,\dots,N\}{ italic_N / italic_b , … , italic_N }, respectively. Hence, the limit shape should also have this invariance. The uniform marginals conditions become:

(79) λ(x)c2(11b)+λ(x)(1ax)/b1/bμ(y)𝑑y𝜆𝑥subscript𝑐211𝑏𝜆𝑥superscriptsubscript1𝑎𝑥𝑏1𝑏𝜇𝑦differential-d𝑦\displaystyle\lambda(x)c_{2}\,\left(1-\frac{1}{b}\right)+\lambda(x)\int_{(1-ax% )/b}^{1/b}\mu(y)\,dyitalic_λ ( italic_x ) italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 - divide start_ARG 1 end_ARG start_ARG italic_b end_ARG ) + italic_λ ( italic_x ) ∫ start_POSTSUBSCRIPT ( 1 - italic_a italic_x ) / italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / italic_b end_POSTSUPERSCRIPT italic_μ ( italic_y ) italic_d italic_y =1,absent1\displaystyle=1,= 1 , for 0<x<1a,for 0<x<1a\displaystyle\text{for $0<x<\frac{1}{a}$},for 0 < italic_x < divide start_ARG 1 end_ARG start_ARG italic_a end_ARG ,
(80) μ(y)c1(11a)+μ(y)(1by)/a1/aλ(x)𝑑x𝜇𝑦subscript𝑐111𝑎𝜇𝑦superscriptsubscript1𝑏𝑦𝑎1𝑎𝜆𝑥differential-d𝑥\displaystyle\mu(y)c_{1}\,\left(1-\frac{1}{a}\right)+\mu(y)\int_{(1-by)/a}^{1/% a}\lambda(x)\,dxitalic_μ ( italic_y ) italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( 1 - divide start_ARG 1 end_ARG start_ARG italic_a end_ARG ) + italic_μ ( italic_y ) ∫ start_POSTSUBSCRIPT ( 1 - italic_b italic_y ) / italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / italic_a end_POSTSUPERSCRIPT italic_λ ( italic_x ) italic_d italic_x =1,absent1\displaystyle=1,= 1 , for 0<y<1b,for 0<y<1b\displaystyle\text{for $0<y<\frac{1}{b}$},for 0 < italic_y < divide start_ARG 1 end_ARG start_ARG italic_b end_ARG ,
(81) c101/bμ(y)𝑑y+c1c2(11b)subscript𝑐1superscriptsubscript01𝑏𝜇𝑦differential-d𝑦subscript𝑐1subscript𝑐211𝑏\displaystyle c_{1}\int_{0}^{1/b}\mu(y)\,dy+c_{1}c_{2}\,\left(1-\frac{1}{b}\right)italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / italic_b end_POSTSUPERSCRIPT italic_μ ( italic_y ) italic_d italic_y + italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 - divide start_ARG 1 end_ARG start_ARG italic_b end_ARG ) =1,absent1\displaystyle=1,= 1 , for 1a<x<1,for 1a<x<1\displaystyle\text{for $\frac{1}{a}<x<1$},for divide start_ARG 1 end_ARG start_ARG italic_a end_ARG < italic_x < 1 ,
(82) c201/aλ(x)𝑑x+c1c2(11a)subscript𝑐2superscriptsubscript01𝑎𝜆𝑥differential-d𝑥subscript𝑐1subscript𝑐211𝑎\displaystyle c_{2}\int_{0}^{1/a}\lambda(x)\,dx+c_{1}c_{2}\,\left(1-\frac{1}{a% }\right)italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / italic_a end_POSTSUPERSCRIPT italic_λ ( italic_x ) italic_d italic_x + italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 - divide start_ARG 1 end_ARG start_ARG italic_a end_ARG ) =1,absent1\displaystyle=1,= 1 , for 1b<y<1.for 1b<y<1\displaystyle\text{for $\frac{1}{b}<y<1$}.for divide start_ARG 1 end_ARG start_ARG italic_b end_ARG < italic_y < 1 .

We can solve this system as follows. Divide (79) by λ(x)𝜆𝑥\lambda(x)italic_λ ( italic_x ) and differentiate to get

(83) λ(x)λ(x)2=abμ(1axb).superscript𝜆𝑥𝜆superscript𝑥2𝑎𝑏𝜇1𝑎𝑥𝑏\frac{\lambda^{\prime}(x)}{\lambda(x)^{2}}=-\frac{a}{b}\,\mu\left(\frac{1-ax}{% b}\right).divide start_ARG italic_λ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) end_ARG start_ARG italic_λ ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = - divide start_ARG italic_a end_ARG start_ARG italic_b end_ARG italic_μ ( divide start_ARG 1 - italic_a italic_x end_ARG start_ARG italic_b end_ARG ) .

Likewise from (80) we get

(84) μ(y)μ(y)2=baλ(1bya).superscript𝜇𝑦𝜇superscript𝑦2𝑏𝑎𝜆1𝑏𝑦𝑎\frac{\mu^{\prime}(y)}{\mu(y)^{2}}=-\frac{b}{a}\,\lambda\left(\frac{1-by}{a}% \right).divide start_ARG italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_y ) end_ARG start_ARG italic_μ ( italic_y ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = - divide start_ARG italic_b end_ARG start_ARG italic_a end_ARG italic_λ ( divide start_ARG 1 - italic_b italic_y end_ARG start_ARG italic_a end_ARG ) .

Differentiating (83) and plugging into (84) yields a differential equation for λ𝜆\lambdaitalic_λ:

(λ(x)λ(x)2)=baλ(x)2λ(x)3.superscriptsuperscript𝜆𝑥𝜆superscript𝑥2𝑏𝑎superscript𝜆superscript𝑥2𝜆superscript𝑥3\left(\frac{\lambda^{\prime}(x)}{\lambda(x)^{2}}\right)^{\prime}=-\frac{b}{a}% \,\frac{\lambda^{\prime}(x)^{2}}{\lambda(x)^{3}}.( divide start_ARG italic_λ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) end_ARG start_ARG italic_λ ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = - divide start_ARG italic_b end_ARG start_ARG italic_a end_ARG divide start_ARG italic_λ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_λ ( italic_x ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG .

Under the substitution λ(x)=eF(x)𝜆𝑥superscript𝑒𝐹𝑥\lambda(x)=e^{F(x)}italic_λ ( italic_x ) = italic_e start_POSTSUPERSCRIPT italic_F ( italic_x ) end_POSTSUPERSCRIPT this becomes

F′′(x)=(1ba)F(x)2,superscript𝐹′′𝑥1𝑏𝑎superscript𝐹superscript𝑥2F^{\prime\prime}(x)=\left(1-\frac{b}{a}\right)F^{\prime}(x)^{2},italic_F start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x ) = ( 1 - divide start_ARG italic_b end_ARG start_ARG italic_a end_ARG ) italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

giving

F(x)=c4aablog((ab)x+c3)𝐹𝑥subscript𝑐4𝑎𝑎𝑏𝑎𝑏𝑥subscript𝑐3F(x)=c_{4}-\frac{a}{a-b}\log((a-b)x+c_{3})italic_F ( italic_x ) = italic_c start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT - divide start_ARG italic_a end_ARG start_ARG italic_a - italic_b end_ARG roman_log ( ( italic_a - italic_b ) italic_x + italic_c start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT )

for some constants c3,c4subscript𝑐3subscript𝑐4c_{3},c_{4}italic_c start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT. A similar expression can be obtained for μ(y)𝜇𝑦\mu(y)italic_μ ( italic_y ) with a,b𝑎𝑏a,bitalic_a , italic_b reversed. Plugging back into (81) and (82) to compute the constants yields the final answer:

g(x,y)={b(b1)bab((ab)x+b1)aba,x<1a and y>1b,a(a1)aba((ba)y+a1)bab,x>1a and y<1b,baababba((ab)x+b1)aba((ba)y+a1)bab,ax+by>1 and x<1a and y<1b,(bb1)bba(aa1)aab,x>1a and y>1b,0,ax+by<1.𝑔𝑥𝑦cases𝑏superscript𝑏1𝑏𝑎𝑏superscript𝑎𝑏𝑥𝑏1𝑎𝑏𝑎𝑥expectation1𝑎 and 𝑦1𝑏𝑎superscript𝑎1𝑎𝑏𝑎superscript𝑏𝑎𝑦𝑎1𝑏𝑎𝑏𝑥1𝑎 and 𝑦1𝑏superscript𝑏𝑎𝑎𝑏superscript𝑎𝑏𝑏𝑎superscript𝑎𝑏𝑥𝑏1𝑎𝑏𝑎superscript𝑏𝑎𝑦𝑎1𝑏𝑎𝑏𝑎𝑥𝑏𝑦1 and 𝑥1𝑎 and 𝑦1𝑏superscript𝑏𝑏1𝑏𝑏𝑎superscript𝑎𝑎1𝑎𝑎𝑏𝑥1𝑎 and 𝑦1𝑏0𝑎𝑥𝑏𝑦1g(x,y)=\begin{cases}b(b-1)^{\frac{b}{a-b}}((a-b)x+b-1)^{\frac{a}{b-a}},&x<% \frac{1}{a}\text{ and }y>\frac{1}{b},\\ a(a-1)^{\frac{a}{b-a}}((b-a)y+a-1)^{\frac{b}{a-b}},&x>\frac{1}{a}\text{ and }y% <\frac{1}{b},\\ b^{\frac{a}{a-b}}a^{\frac{b}{b-a}}((a-b)x+b-1)^{\frac{a}{b-a}}((b-a)y+a-1)^{% \frac{b}{a-b}},&ax+by>1\text{ and }x<\frac{1}{a}\text{ and }y<\frac{1}{b},\\ (\frac{b}{b-1})^{\frac{b}{b-a}}(\frac{a}{a-1})^{\frac{a}{a-b}},&x>\frac{1}{a}% \text{ and }y>\frac{1}{b},\\ 0,&ax+by<1.\end{cases}italic_g ( italic_x , italic_y ) = { start_ROW start_CELL italic_b ( italic_b - 1 ) start_POSTSUPERSCRIPT divide start_ARG italic_b end_ARG start_ARG italic_a - italic_b end_ARG end_POSTSUPERSCRIPT ( ( italic_a - italic_b ) italic_x + italic_b - 1 ) start_POSTSUPERSCRIPT divide start_ARG italic_a end_ARG start_ARG italic_b - italic_a end_ARG end_POSTSUPERSCRIPT , end_CELL start_CELL italic_x < divide start_ARG 1 end_ARG start_ARG italic_a end_ARG and italic_y > divide start_ARG 1 end_ARG start_ARG italic_b end_ARG , end_CELL end_ROW start_ROW start_CELL italic_a ( italic_a - 1 ) start_POSTSUPERSCRIPT divide start_ARG italic_a end_ARG start_ARG italic_b - italic_a end_ARG end_POSTSUPERSCRIPT ( ( italic_b - italic_a ) italic_y + italic_a - 1 ) start_POSTSUPERSCRIPT divide start_ARG italic_b end_ARG start_ARG italic_a - italic_b end_ARG end_POSTSUPERSCRIPT , end_CELL start_CELL italic_x > divide start_ARG 1 end_ARG start_ARG italic_a end_ARG and italic_y < divide start_ARG 1 end_ARG start_ARG italic_b end_ARG , end_CELL end_ROW start_ROW start_CELL italic_b start_POSTSUPERSCRIPT divide start_ARG italic_a end_ARG start_ARG italic_a - italic_b end_ARG end_POSTSUPERSCRIPT italic_a start_POSTSUPERSCRIPT divide start_ARG italic_b end_ARG start_ARG italic_b - italic_a end_ARG end_POSTSUPERSCRIPT ( ( italic_a - italic_b ) italic_x + italic_b - 1 ) start_POSTSUPERSCRIPT divide start_ARG italic_a end_ARG start_ARG italic_b - italic_a end_ARG end_POSTSUPERSCRIPT ( ( italic_b - italic_a ) italic_y + italic_a - 1 ) start_POSTSUPERSCRIPT divide start_ARG italic_b end_ARG start_ARG italic_a - italic_b end_ARG end_POSTSUPERSCRIPT , end_CELL start_CELL italic_a italic_x + italic_b italic_y > 1 and italic_x < divide start_ARG 1 end_ARG start_ARG italic_a end_ARG and italic_y < divide start_ARG 1 end_ARG start_ARG italic_b end_ARG , end_CELL end_ROW start_ROW start_CELL ( divide start_ARG italic_b end_ARG start_ARG italic_b - 1 end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_b end_ARG start_ARG italic_b - italic_a end_ARG end_POSTSUPERSCRIPT ( divide start_ARG italic_a end_ARG start_ARG italic_a - 1 end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_a end_ARG start_ARG italic_a - italic_b end_ARG end_POSTSUPERSCRIPT , end_CELL start_CELL italic_x > divide start_ARG 1 end_ARG start_ARG italic_a end_ARG and italic_y > divide start_ARG 1 end_ARG start_ARG italic_b end_ARG , end_CELL end_ROW start_ROW start_CELL 0 , end_CELL start_CELL italic_a italic_x + italic_b italic_y < 1 . end_CELL end_ROW

Figure 9 gives an example. The reader might enjoy taking the limit ba𝑏𝑎b\to aitalic_b → italic_a of the expression for g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ).

Refer to caption
Figure 9. Maximizing permuton g(x,y)𝑔𝑥𝑦g(x,y)italic_g ( italic_x , italic_y ) for r=0𝑟0r=0italic_r = 0 and the region 32x+2y>1.32𝑥2𝑦1\frac{3}{2}x+2y>1.divide start_ARG 3 end_ARG start_ARG 2 end_ARG italic_x + 2 italic_y > 1 .

References

  • [Agg20a] Amol Aggarwal. Arctic boundaries of the ice model on three-bundle domains. Inventiones mathematicae, 220(2):611–671, 2020.
  • [Agg20b] Amol Aggarwal. Limit shapes and local statistics for the stochastic six-vertex model. Communications in Mathematical Physics, 376:681–746, 2020.
  • [AK24] Radosław Adamczak and Michał Kotowski. The global and local limit of the continuous-time mallows process. arXiv preprint arXiv:2404.08554, 2024.
  • [Bax07] R. Baxter. Exactly solved models in statistical mechanics. Courier Dover Publications, 2007.
  • [BCG16] Alexei Borodin, Ivan Corwin, and Vadim Gorin. Stochastic six-vertex model. Duke Math. J., 165(3):563–624, 2016.
  • [BDMW24] Jacopo Borga, Sayan Das, Sumit Mukherjee, and Peter Winkler. Large deviation principle for random permutations. International Mathematics Research Notices, 2024(3):2138–2191, 2024.
  • [BG19] Alexei Borodin and Vadim Gorin. A stochastic telegraph equation from the six-vertex model. The Annals of Probability, 47(6):4137–4194, 2019.
  • [BK16] Alexey Bufetov and Alisa Knizel. Asymptotics of random domino tilings of rectangular Aztec diamonds. Annales de l’institut Henri Poincare (B) Probability and Statistics, 54, 04 2016.
  • [BL14] Pavel Bleher and Karl Liechty. Random matrices and the six-vertex model, volume 32 of CRM Monograph Series. American Mathematical Society, Providence, RI, 2014.
  • [Bor11] Olena Bormashenko. Permutations with Interval Restrictions. Thesis at Stanford University, 2011.
  • [BP99] David Bressoud and James Propp. How the alternating sign matrix conjecture was solved. Notices of the AMS, 46(6):637–646, 1999.
  • [BR22] Pavel Belov and Nicolai Reshetikhin. The two-point correlation function in the six-vertex model. Journal of Physics A: Mathematical and Theoretical, 55(15):155001, 2022.
  • [CKP01] H. Cohn, R. Kenyon, and J. Propp. A variational principle for domino tilings. Journal of American Mathematical Society, 14(2):297–346, 2001. arXiv:math/0008220.
  • [CP10] Filippo Colomo and Andrei G Pronko. The arctic curve of the domain-wall six-vertex model. Journal of Statistical Physics, 138:662–700, 2010.
  • [DF18] Philippe Di Francesco. Integrable combinatorics. In Proceedings of the International Congress of Mathematicians (ICM 2018) (In 4 Volumes) Proceedings of the International Congress of Mathematicians 2018, pages 2581–2596. World Scientific, 2018.
  • [DGH01] Persi Diaconis, Ronald Graham, and Susan P Holmes. Statistical problems involving permutations with restricted positions. Lecture Notes-Monograph Series, pages 195–222, 2001.
  • [dGKW21] J. de Gier, R. Kenyon, and S. Watson. Limit shapes for the asymmetric five vertex model. Commun. Math. Phys., pages 1–44, 2021. arXiv:1812.11934 [math.PR].
  • [EKLP92] Noam Elkies, Greg Kuperberg, Michael Larsen, and James Propp. Alternating-sign matrices and domino tilings (part i). Journal of Algebraic Combinatorics, 1:111–132, 1992.
  • [FR19] Matthew Fahrbach and Dana Randall. Slow mixing of Glauber dynamics for the six-vertex model in the ordered phases. In Proceedings of the 23rd International Conference on Randomization and Computation, volume 37, pages 1–37, 2019.
  • [FS06] Patrik L Ferrari and Herbert Spohn. Domino tilings and the six-vertex model at its free-fermion point. Journal of Physics A: Mathematical and General, 39(33):10297, 2006.
  • [Gie09] Jan de Gier. Fully packed loop models on finite geometries. In Polygons, Polyominoes and Polycubes, pages 317–346. Springer, 2009.
  • [GL23] V. Gorin and K. Liechty. Boundary statistics for the six-vertex model with DWBC. arXiv preprint arXiv:2310.12735, 2023.
  • [GN23] Vadim Gorin and Matthew Nicoletti. Six-vertex model and random matrix distributions. 2023. arXiv:2309.12495, to appear in Bulletin of the American Mathematical Society.
  • [Gor21] Vadim Gorin. Lectures on random lozenge tilings. Cambridge Studies in Advanced Mathematics. Cambridge University Press, 2021.
  • [GS92] Leh-Hun Gwa and Herbert Spohn. Six-vertex model, roughened surfaces, and an asymmetric spin Hamiltonian. Physical review letters, 68(6):725, 1992.
  • [HMV22] Jimmy He, Tobias Müller, and Teun Verstraaten. Cycles in Mallows random permutations. Random Structures and Algorithms, 2022. arXiv:2201.11610.
  • [Ide16] Martin Idel. A review of matrix scaling and Sinkhorn’s normal form for matrices and positive maps. arXiv preprint arXiv:1609.06349, 2016.
  • [JPS98] William Jockusch, James Propp, and Peter Shor. Random domino tilings and the arctic circle theorem. arXiv preprint math/9801068, 1998.
  • [KKRW20] Richard Kenyon, Daniel Král’, Charles Radin, and Peter Winkler. Permutations with fixed pattern densities. Random Structures & Algorithms, 56(1):220–250, 2020.
  • [KO07] Richard Kenyon and Andrei Okounkov. Limit shapes and the complex Burgers equation. Acta Math, 199:263–302, 2007.
  • [KP22] Richard Kenyon and István Prause. The genus-zero five-vertex model. Probability and Mathematical Physics, 3:707–729, 12 2022.
  • [KP24] Richard Kenyon and István Prause. Limit shapes from harmonicity: dominos and the five vertex model. Journal of Physics A: Mathematical and Theoretical, 57(3):035001, 2024.
  • [KS18] David Keating and Ananth Sridhar. Random tilings with the GPU. Journal of Mathematical Physics, 59(9), 2018.
  • [Lie67] Elliott H. Lieb. Exact solution of the problem of the entropy of two-dimensional ice. Physical Review Letters, 18(17):692, 1967.
  • [LKV23] Ivar Lyberg, Vladimir Korepin, and Jacopo Viti. Fluctuation of the phase boundary in the six-vertex model with domain wall boundary conditions: a Monte Carlo study. Journal of Physics A: Mathematical and Theoretical, 56(49):495002, 2023.
  • [LW72] E. H. Lieb and F. Y. Wu. Two dimensional ferroelectric models. In C. Domb and M. Green, editors, Phase Transitions and Critical Phenomena, volume 1, pages 331–490. Academic Press, 1972.
  • [Mal57] Colin L. Mallows. Non-null ranking models. i. Biometrika, 44(1/2):114–130, 1957.
  • [Muk16] Sumit Mukherjee. Estimation in exponential families on permutations. The Annals of Statistics, 44(2):853, 2016.
  • [NK96] Jae Dong Noh and Doochul Kim. Finite-size scaling and the toroidal partition function of the critical asymmetric six-vertex model. Phys. Rev. E, 53:3225–3239, Apr 1996.
  • [Nol92] I. M. Nolden. The asymmetric six-vertex model. Journal of Statistical Physics, 67(1):155–201, 1992.
  • [PR10] Konstantin Palamarchuk and Nicolai Reshetikhin. The 6-vertex model with fixed boundary conditions. arXiv preprint arXiv:1010.5011, 2010.
  • [PS23] Michael Prähofer and Herbert Spohn. Domain wall fluctuations of the six-vertex model at the ice point. Journal of Physics A: Mathematical and Theoretical, 57(2):025001, 2023.
  • [Res10] Nicolai Reshetikhin. Lectures on the integrability of the 6-vertex model. In Exact Methods in Low-dimensional Statistical Physics and Quantum Computing, pages 197–266. Oxford Univ. Press, 2010. arXiv:1010.5031.
  • [Sta09] Shannon Starr. Thermodynamic limit for the Mallows model on Snsubscript𝑆𝑛S_{n}italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Journal of mathematical physics, 50(9), 2009.
  • [SW18] Shannon Starr and Meg Walters. Phase uniqueness for the mallows measure on permutations. Journal of Mathematical Physics, 59(6), 2018.
  • [SYY67] B Sutherland, CN Yang, and CP Yang. Exact solution of a model of two-dimensional ferroelectrics in an arbitrary external electric field. Physical Review Letters, 19(10):588, 1967.
  • [Tra08] José Trashorras. Large deviations for symmetrised empirical measures. Journal of Theoretical Probability, 21:397–412, 2008.
  • [ZJ02] P Zinn-Justin. The influence of boundary conditions in the six-vertex model. arXiv preprint cond-mat/0205192, 2002.
  • [ZJ09] Paul Zinn-Justin. Six-vertex, loop and tiling models: integrability and combinatorics. arXiv preprint arXiv:0901.0665, 2009.
  • [ZJ24] Paul Zinn-Justin. Integrability and combinatorics. arXiv preprint arXiv:2404.13221, 2024.