]> An IDEAL Group, CLC, Project

ON COMPARABILITY OF RANDOM PERMUTATIONS

DISSERTATIO N

Presented in Partial Fulfillment of the Requirements for

the Degree Doctor of Philosophy in the Graduate

School of the Ohio State University

By

Adam Hammett, B.S.

**** *

The Ohio State University

2007

Dissertation Committee:

133C1bClbl11111111lbbCC. Approved by

Dr. Boris Pittel, Advisor

Dr. Gerald Edgar

1 ċ.

Advisor

Dr. Akos Seress Graduate Program in

Graduate Program in Mathematics

ABSTRACT

Two permutations of [n] :={1, 2, ..., n} are comparable in the Bruhat order if one

can be obtained from the other by a sequence of transpositions decreasing the number

of inversions. We show that the total number of pairs of permutations (π, σ) with

π<σ is of order (n!)2/n2 at most. Equivalently, if π, a are chosen uniformly at

random and independently of each other, then P(π<σ) is of order n-2 at most. By

a direct probabilistic argument we prove P(π<σ) is of order (0.708)n at least, so

that there is currently a wide qualitative gap between the upper and lower bounds.

Next, emboldened by a connection with Ferrers diagrams and plane partitions

implicit in Bressoud's book [13], we return to the Bruhat order upper bound and show

that for n permutations π1, ..., πr selected independently and uniformly at random,

P(π1<...<πr)=O(n-r(r-1)),

thus providing an extension of our result for pairs of permutations to chains of length

r>2.

Turning to the related weak order "-"- when only adjacent transpositions are

admissible - we use a non-inversion set criterion to prove that Pn*:=P(πσ) is

submultiplicative, thus showing existence of ρ=limPn*n. We demonstrate that ρ

is 0.362 at most. Moreover, we prove the lower bound i=1n(H(i)/i) for Pn*, where

ii

H(i):=j=1i1/j. In light of numerical experiments, we conjecture that for each

order the upper bounds for permutation-pairs are qualitatively close to the actual

behavior. We believe that extensions to r-chains similar to that for the Bruhat order

upper bound can be made for our other bounds in each order, and are presently

working in this direction.

Finally, the weak order poset happens to be a lattice, and we study some properties

of its infimums and supremums. Namely, we prove that the number of r-tuples

(π1 , . . . ' πr) of n-permutations with minimal infimum, 12 ... n, asymptotically equals

-(n!)rhr(z*)(z*)n+1, r>2, n . (1)

Here, z*=z*(r)(1, 2) is the unique (positive) root of the equation

hr(z):=j0(-1)j(j!)rzj=0

within the disk |z|<2. Moreover, (1) is also the asymptotic number of r-tuples with

maximal supremum, n(n -1) ...1.

iii

To my wife, Rachael, for her unending suppport.

iv

ACKNOWLEDGMENTS

First and foremost, I wish to thank my advisor, Boris Pittel. Our collaboration was

quite intensive on this research project, and without his countless suggestions and

endless encouragement, none of these results would have materialized. He is truly an

outstanding example of what a mentor should be.

This work was inspired in large part by a thought-provoking talk Mark Skandera

gave at the MIT Combinatorics Conference honoring Richard Stanley's 60th birthday.

We are grateful to Mark for an enlightening follow-up discussion of comparability

criteria for the Bruhat order. We thank Sergey Fomin for encouragement and for

introducing us to an instrumental notion of the permutation-induced poset, and also

for many insightful suggestions regarding the history of Bruhat order. Without Ed

Overman's invaluable guidance we would not have been able to obtain our numerical

results. Craig Lennon gave us an idea for proving an exponential lower bound in

the case of Bruhat order. We thank Mikl6s B6na for his interest in this work and

for drawing our attention to the lower bound for the number of linear extensions in

Richard Stanley's book.

v

VITA

April 28, 1979. . . . . . . . . . . . . . . . . . . . . . . . . Born - Santa Maria, CA

1997-2001 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Undergraduate,

Westmont College - Santa Barbara, CA

2001 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . B.S. in Mathematics,

Westmont College

2001-Present . . . . . . . . . . . . . . . . . . . . . . . . . . Graduate Teaching Associate,

The Ohio State University

FIELDS OF STUDY

Major Field: Mathematics

Specialization: Combinatorial Probability

vi

TABLE OF CONTENTS

Abstract ii

Dedication . 1v

Acknowledgments . v

Vita vi

List of Figures x

List of Tables xi

CHAPTER PAGE

1 Introduction 1

1.1 Bruhat Order, Preliminaries 1

1.2 Main Results Related to the Bruhat Order 5

1.3 Weak Order, Preliminaries 7

1.4 Main Results Related to the Weak Order 8

2The Proof of the Bruhat Order Upper Bound 14

2.1 A Necessary Condition for Bruhat Comparability 14

2.2 A Probabilistic Simplification 21

2.3 Asymptotics 23

3The Proof of the Bruhat Order Lower Bound . 26

3.1 A Sufficient Condition for Bruhat Comparability 26

3.2 A Reduction to Uniforms 29

3.3 An Algorithm to Maximize the Bound 34

vii

4

An Extension to Chains in Bruhat Order

39

4.1 The General Setup . 39

4.2 A Tractable Necessary Condition 41

4.3 The Core Counting Problem 42

4.4 A Probabilistic Simplification 56

4.5 Asymptotics 57

5Some Properties of the Weak Ordering 62

5.1 A Criterion for Weak Comparability 62

5.2 Submultiplicativity of Pn* 65

6 The Proof of the Weak Order Upper Bound 67

6.1 A Necessary Condition for Weak Comparability 67

6.2 A Reduction to Uniforms 69

6.3 An Algorithm to Minimize the Bound 71

7The Proof of the Weak Order Lower Bound 75

7.1 A Formula for P(Ei(π)Ei(σ)) . . 75

7.2 Positive Correlation of the Events {Ei(π)Ei(σ)} 77

7.3 Linear Extensions of Arbitrary Finite Posets 81

8Numerics . 83

8.1 Bruhat Order Numerics 83

8.2 Weak Order Numerics 85

9On Infs and Sups in the Weak Order Lattice 89

9.1 A Connection with Complete, Directed, Acyclic Graphs 89

9.2 Computing Infs and Sups in the Weak Order Lattice 93

9.3 Some Equivalent Criteria for inf{π1 , . . . ' πr}=12 ... n 96

9.4 Submultiplicativity Again 99

9.5 Sharp Asymptotics of Pn(r) . 1Ot

9.5.1 Step 1: An Exact Formula for Pn(r) lai

9.5.2 Step 2: A Generating Function for Pn(r).. 105

9.5.3 Step 3: Asymptotics 108

Vlll

10

Open Problems

114

10.1 The Problems 114

Bibliography 115

IX

LIST OF FIGURES

FIGURE PAGE

1.1 The Bruhat order on S3 and S4. 2

1.2 Superimposing M(π) and M(σ) to form M(π, σ). 5

1.3 The permutation-induced poset 7(2143). 10

2.1 Finding a necessary condition for π<σ. 16

2.2 Selection of first m=m1 ( n/2-m resp.) rows (columns resp.) in

corners to support M(π). 21

3.1 G5 and an emboldened subgrid C. 27

3.2 Deletion of 2 largest elements of π, σ, and its affect on C. . 28

4.1 Finding a necessary condition for π1<... <πr. 43

4.2 A non-intersecting nest of 7 lattice paths. 46

4.3 An intersecting nest of 7 lattice paths. 47

4.4 "Swapping" the tails in Figure 4.3. 49

8.1 Experimental determination of the exponent -a in the asymptotic

equation Pncn-a. 85

8.2 Experimental determination of the ratio ρ in the asymptotic equation

Pn*cρn. 86

x

LIST OF TABLES

TABLE PAGE

3.1 Exact computation of Nk for smallish k. 37

6.1 Exact computation of Nk* for smallish k. 74

8.1 Computer simulation data for Pn. 84

8.2 Exact computation of Pn for smallish n. 84

8.3 Computer simulation data for Pn*. 87

8.4 Exact computation of Pn* for smallish n. 87

8.5 Our theoretical lower bound for Pn* applied for smallish n. 88

Xi

(JIAPTER 1

INTRODUCTION

In this chapter, we present the fundamental ideas necessary to understand the ma-

terial in subsequent chapters. We also state the main results to be proved in this

dissertation. It is assumed that the reader has some familiarity with probability the-

ory and combinatorics. A good graduate-level introduction to these subjects can be

found, for instance, in Billingsley [4] and Stanley's volumes on enumerative combina-

torics, [46] and [47].

1.1 Bruhat Order, Preliminaries

Let n>1 be an integer. Two permutations of [n] :={1, ..., n} are comparable in the

Bruhat order if one can be obtained from the other by a sequence of transpositions of

pairs of elements forming an inversion. Here is a precise definition of the Bruhat order

on the set of permutations Sn (see [46, p. 172, ex. 75. a.], Humphreys [30, p. 119]).

If ω =ω(1) ... ω(n) Sn, then a reduction of ω is a permutation obtained from ω

by interchanging some ω (i) with some ω (j) provided i<j and ω (i)>ω (j). We say

that π<σ in the Bruhat order if there is a chain σ=ω1ω2...ωs=π, where

each ωt is a reduction of ωt-1. The number of inversions in ωt strictly decreases with t.

1

4321

321

312

132

123

1234

Figure 1.1: The Bruhat order on S3 and S4.

Indeed, one can show that if ω2 is a reduction of ω1 via the interchange ω1(i)ω1(j),

i<j, then

inv(ω1)=inv(ω2)+2N(ω1)+1,

N(ω1):=|{k : i<k<j, ω1(i)>ω1(k)>ω1(j)}|;

here inv(ω1), say, is the number of inversions in ω1 (see Björner and Brenti [6]). Figure

1.1 illustrates this poset on S3 and S4. The Bruhat order notion can be extended

to other Coxeter groups (see Björner [5], Deodhar [20], and [6, p. 63] for historical

background), but we will be dealing with the symmetric group Sn only.

The definition of the Bruhat order is very transparent, and yet deciding for given π, a

2

whether π<σ from the definition is computationally difficult, even for smallish n.

Fortunately, there exist efficient algorithms for checking Bruhat comparability, which

can all be traced back to an algorithmic comparability criterion due to Ehresmann

(1934) [22] (see also Knuth [34], Björner and Brenti [6]). The Ehresmann "tableau

criterion" states that π<σ if and only if πi,j<σi,j for all 1<i<j<n -1, where

πi,j and σi,j are the i-th entry in the increasing rearrangement of π(1) , . . . , π(j)

and of a(1) , . . . ' σ(j). These arrangements form two staircase tableaux, hence the

term tableau criterion" For example, 41523 > 21534 is verified by element-wise

comparisons of the two tableaux

12451235

14 5 1 2 5

1412

4 2

Also, it is well-known that Ehresmann's criterion is equivalent to the (0, 1)-matrix

criterion. It involves comparing the number of Fs contained in certain submatrices

of the (0, 1)-permutation matrices representing π and a (see B6na[10], [6]). Later,

Björner and Brenti [7] were able to improve on the result of [22], giving a tableau

criterion that requires fewer operations. Very recently, Drake, Gerrish and Skandera

[21] have found two new comparability criteria, involving totally nonnegative poly-

nomials and the Schur functions respectively. We are aware of other criteria (see [5],

Fulton [25, pp. 173-177], Lascoux and Schiitzenberger [36], [20] ), but we found the

(0, 1)-matrix and Ehresmann criteria most amenable to probabilistic study.

3

The (0, 1)-matrix criterion for Bruhat order on Sn says that for π, σSn, π<σ if

and only if for all i, j<n, the number of π(1) , . . . , π(i) that are at most j exceeds

(or equals) the number of a(1), . . . ' a(i) that are at most j (see [11] for this version).

It is referred to as the (0, 1)-matrix criterion because of the following recasting of this

condition: let M (π), M(σ) be the permutation matrices corresponding to π, σ, so

that for instance the (i, j)-entry of M(π) is 1 if π(j)=i and 0 otherwise. Here, we

are labeling columns 1, 2, . . . ' n when reading from left to right, and rows are labeled

1, 2, . . . ' n when reading from bottom to top so that this interpretation is like placing

ones at points (i, π(i)) of the n ×n integer lattice and zeroes elsewhere. Denoting

submatrices of M (ċ) corresponding to rows I and columns J by M (ċ)I,J, this criterion

says that π<σ if and only if for all i, j<n, the number of ones in M (π)[i],[j] is at

least the number of ones in M (σ)[i],[j] (see [21] for this version).

An effective way of visualizing this criterion is to imagine the matrices M (π) and

M (σ) as being superimposed on one another into a single matrix, M (π, σ), with the

ones for M (π) represented by ×'s("crosses"), the ones for M (σ) by 's ("balls")

and the zeroes for both by empty entries. Note that some entries of a (π, σ) may

be occupied by both a cross and a ball. Then the (0, 1)-matrix criterion says that

π<σ if and only if every southwest submatrix of M (π, σ) contains at least as many

crosses as balls. Here, in the notation above, a southwest submatrix is a submatrix

M (π, σ)[i],[j] of M (π, σ) for some i, j<n. It is clear that we could also check

π<σ by checking that crosses are at least as numerous as balls in every northeast

submatrix of M (π, σ). Likewise, π<σ if and only if balls are at least as numerous

as crosses in every northwest submatrix of M (π, σ), or similarly balls are at least

4

M(0)

Figure 1.2: Superimposing M(π) and M(σ) to form M(π, σ).

as numerous as crosses in every southeast submatrix of M (π, σ). Parts of all four

of these equivalent conditions will be used in our proofs. As a quick example, with

π= 21534 and σ=41523, π<σ is checked by examining southwest submatrices of

M (π, σ) in Figure 1.2. Also, the superimposing of M (π) with M (σ) to form M (π, σ)

is illustrated in this figure.

1.2 Main Results Related to the Bruhat Order

In this dissertation, we use the (0, 1)-matrix and the Ehresmann criteria to obtain

upper and lower bounds for the number of pairs (π, σ) with π<σ.

5

Theorem 1.2.1. Let n>1 be an integer, and let π, σSn be selected independently

and uniformly at random. Then there exist universal constants c1, c2>0 such that

c1(0.708)n<P(π<σ)<c2/n2

Equivalently, the number of pairs (π, σ) with π<σ is sandwiched between the

counts c1(0.708)n(n!)2 and c2n-2(n!)2. The lower bound follows from a sufficient

condition derived from the (0, 1)-matrix criterion, and a computer-aided tabulation

of an attendant function of a smallish integer argument. Empirical estimates based on

generating pairs of random permutations suggest that P(π<σ) is of order n-(2+δ),

for δ close to 0.5 from above. So apparently it is the upper bound which comes close

to the true proportion P(π<σ). It is certain that the constant 0.708 can be further

improved, but we do not know if our method could be extended to deliver a lower

bound (1 -o(1))n. A lower bound n-a, a qualitative match of the upper bound, seems

out of sight presently.

A deeper insight reveals a more general result, related to chains of length r in Bruhat

order, once we realize some connections with MacMahon's formula [13] for counting

plane partitions contained in an r×s×t box. Without going into much unnecessary

detail here, one can visualize a plane partition as stacks of unit cubes pushed into a

corner. The k-th Ehresmann condition contains a clear connection between Bruhat

order on permutations and counting combinatorial objects related to plane partitions,

namely non-intersecting lattice paths, a notion we will make precise later on. A closer

look at our methods for permutation-pairs in the spirit of Gessel and Viennot's work

6

[26] implies an extension of Theorem 1.2.1, upper bound, from pairs of permutations

to r-tuples:

Theorem 1.2.2. Let π1, ..., πrSn be selected independently and uniformly at

random. Then there exists a uniform constant c >0 such that

P(π1<...<πr)<c/nr(r-1).

Note that this result implies that there are at most cn-r(r-1) (n!)r length r chains in

the Bruhat order poset.

1.3 Weak Order, Preliminaries

Then we turn to the modified order on Sn, the rveak order "-" Here πσ if there

is a chain σ=ω1ω2...ωs=π, where each ωt is a simple reduction of ωt-1,

i.e. obtained from ωt-1 by transposing two adjacent elements ωt-1(i), ωt-1(i+1) with

ωt-1(i)>ωt-1(i+1). Since at each step the number of inversions decreases by 1,

all chains connecting a and π have the same length. Alternatively, there is a simple

non-inversion (resp. inversion) set criterion, contained in Berge [3], we can use to

check πσ. Indeed, given ωSn introduce the set of non-inversions of ω:

E(ω)={(i, j) : i<j, ω-1(i)<ω-1(j)}.

Similarly, for ωSn we introduce the set of inversions of ω:

7

E*(ω)={(i, j) : i>j, ω-1(i)<ω-1(j)}.

Then, for given π, σSn, we have πσ if and only if E(π)E(σ) (equivalently

E*(π)E*(σ)). Note that ωSn is uniquely determined by its E(ω) (resp. its

E*(ω)).

It turns out that the poset (Sn,) is a lattice (see [3]). Indeed, given π1, ..., πrSn,

there is an efficient way to compute E(inf{π1, ..., πr}) (resp. E*(sup{π1 , . . . , πr}))

from the set i=1rE(πi) (resp. i=1rE " ( πi )). We will see precisely how to do this later.

1.4 Main Results Related to the Weak Order

We prove the following probabilistic result for weak order comparability:

Theorem 1.4.1. Let π, σSn be selected independently and uniformly at random,

and let Pn*:=P(πσ). Then Pn* is submultiplicative, i.e. Pn1+n2*<Pn1*Pn2*. Con-

sequently there exists ρ=limPn*n. Furthermore, there exists an absolute constant

c>0 such that

i=1n(H(i)/i)<Pn*<c(0.362)n

where H(i):=j=1i1/j. Consequently, ρ< 0.362.

The proof of the upper bound is parallel to that of Theorem 1.2.1, lower bound,

while the lower bound follows from the non-inversion (resp. inversion) set criterion

8

described last section. Empirical estimates indicate that ρ is close to 0.3. So here too,

as in Theorem 1.2.1, the upper bound seems to be qualitatively close to the actual

probability Pn*. And our lower bound, though superior to the trivial bound 1/n!, is

decreasing superexponentially fast with n, which makes us believe that there ought

to be a way to vastly improve it.

Paradoxically, it is the lower bound that required a deeper combinatorial insight.

Clearly the number of π's below (or equal to) a equals e(P), the total number of

linear extensions of 7=P(σ), the poset induced by σ. (The important notion of

7(σ) was brought to our attention by Sergey Fomin [24].) We prove that for any

poset 7 of cardinality n,

e(P)>n!/iPd(i), (1.1)

where d(i):=| { jP : j<i in 7} |. (This bound is an exact value of e(P) if the

Hasse diagram is a (directed) rooted tree, Knuth [34, sect. 5.1.4, ex. 20], or a forest

of such trees, Björner and Wachs [8].) The bound (1.1) for e(P(σ)) together with the

independence of sequential ranks in the uniform permutation were the key ingredients

in the proof of Theorem 1.4.1, lower bound.

Mikl6s B6na[12] has informed us that this general lower bound for e(P) had been

stated by Richard Stanley as a level 2 exercise in [46, p. 312, ex. 1] without a

solution. We have decided to keep the proof in the dissertation, since we could

not find a published proof anywhere either. The classic hook formula provides an

example of a poset 7 for which (1.1) is markedly below e(P). It remains to be seen

whether (1.1) can be strengthened in general, or at least for 7(σ). As an illustration,

9

1 2

Figure 1.3: The permutation-induced poset 7(2143).

7=P(2143) has the Hasse diagram appearing in Figure 1.3. Then e(P) =4, but

(1.1) delivers only

e(P)>24/9 e(P)>3.

Regarding the lattice properties of (Sn,), note that the identity permutation 12 ... n

is the unique minimum, and n(n-1) ...1 is the unique maximum. Let π1, ..., πrSn

be selected independently and uniformly at random. It is natural to ask: "How

likely is it that the infimum (resp. supremum) of {π1 , . . . ' πr} is the unique mini-

mal (resp. maximal) element in the weak order lattice?" Equivalently, what is the

asymptotic number of r-tuples (π1 , . . . ' πr) such that inf{π1, . . . , πr}=12 ... n (resp.

sup{π1, ..., πr}=n(n -1) ...1), n ? It turns out that the answer is the same

whether we consider infs or sups, which allows us to focus only on infimums. We

prove the following:

Theorem 1.4.2. Let Pn(r)=P (inf{π1, ..., πr}=12 ...n). Then

1. As a function of n, Pn(r) is submultiplicative. Hence, there exists

to

p(r)=limPn(r)n= inf Pn(r)n, r>1.

2. For each r>1, put hr(z)=j0(-1)j(j!)rzj and Hr(z)=(hr(z))-1 Then, letting

P0(r)=1, we have

Hr(z)=n0Pn(r)zn,

from which we obtain (Darboux theorem [2])

Pn(r)-1z*hr(z*)1(z*)n, n .

Here, z*=z*(r)(1, 2) is the unique simple root of hr(z)=0 in the disc

|z|<2. Consequently, p(r)=1/z*

Unlike our results about comparability in the Bruhat and weak orderings, here we

have a sharp asymptotic formula. The key to the proof of this theorem is establishing

the exact formula

Pn(r)=k=0n-1(-1)k b1+...+bn-k=nb1,...,bn-k1,(b1!)r...(bn-k!)r1,

which follows from the principle of inclusion-exclusion. This formula for Pn(r) is, in

some sense, an r-analog of that for the Eulerian numbers ( B6na[11], Knuth [34]).

Indeed, it turns out that Pn(r) is the probability that the uniform, independent permu-

tations π1-1, ..., πr-1 have no common descents. Introduce the random variable Sn(r),

11

the number of these common descents, so that Pn(r)=P(Sn(r)=0). Another natural

question here is:

Problem. What is the limiting distribution of Sn(r)l.?

We believe that the answer here is "Gaussian", as it is in the case of the number

of descents in a single uniformly random permutation (Sachkov [44]). Our feeling

is that the proof will involve use of the bivariate generating function Fr(x, y)=

n1xnE[(1+y)Sn(r)], which we prove has the simple form

Fr(x, y)=xfr(xy)1-xfr(xy), fr(z).=j0zj(j+1)!r .

Interestingly, this generating function is a special case of a more general result proved

by Richard Stanley [45], although he was probably unaware of the connections his

work had with the weak ordering.

In conclusion we mention several papers that are in the same spirit of this dissertation.

First, [39] and [40] (both by Boris Pittel), where the "probability-of-comparability"

problems were solved for the poset of integer partitions of n under dominance order,

and for the poset of set partitions of [n] ordered by refinement. Also, [41] (again

by B. Pittel), where the "infimum/supremum" problem was solved for the lattice

of set partitions of [n] ordered by refinement. In [16], E. R. Canfield presents an

enlightening extension of the inf/sup work done in [41]. Very recently, in [1], R. M.

Adin and Y. Roichman explore the valency properties of a typical permutation in the

Bruhat order Hasse diagram.

12

This work is in large part the result of an intensive collaborative effort with my

doctoral advisor, Boris Pittel. Portions of this dissertation have been accepted for

publication (2006) in the journal Transactions of the American Mathematical Society

(see [28] for availability).

13

(JIAPTER 2

THE PROOF OF THE BRUHAT ORDER UPPER BOUND

In this chapter, we focus on the proof of Theorem 1.2.1, upper bound. The proof

divides naturally into three steps, hence the divisions of the sections that follow.

We need to show that

P(π<σ)=O(n-2).

The argument is based on the (0, 1)-matrix criterion. We assume that n is even. Only

minor modifications are necessary for n odd.

2.1 A Necessary Condition for Bruhat Comparability

The (0, 1)-matrix criterion requires that a set of n2 conditions are met. The challenge

is to select a subset of those conditions which meets two conflicting demands. It has

to be sufficiently simple so that we can compute (estimate) the probability that the

random pair (π, σ) satisfies all the chosen conditions. On the other hand, collectively

these conditions need to be quite stringent for this probability to be o(1). In our first

advance we were able (via Ehresmann's criterion) to get a bound O(n-1/2) by using

about 2 n1/2 conditions. We are about to describe a set of 2n conditions that does the

job.

14

Let us split the matrices M (π, σ), M (π) and M (σ) into 4 submatrices of equal size

n/2×n/2- the southwest, northeast, northwest and southeast corners, denoting them

Msw(ċ), Mne(ċ), Mnw(ċ) and Mse(ċ) respectively. In the southwest corner Msw(π, σ),

we restrict our attention to southwest submatrices of the form i×n/2, i=1, ..., n/2.

If π<σ, then as we read off rows of Msw(π, σ) from bottom to top keeping track

of the total number of balls and crosses encountered thus far, at any intermediate

point we must have at least as many crosses as balls. Let us denote the set of pairs

(π, σ) such that this occurs by Esw. We draw analogous conclusions for the northeast

corner, reading rows from top to bottom, and we denote by Ene the set of pairs (π, σ)

satisfying this condition.

Similarly, we can read columns from left to right in the northwest corner, and here we

must always have at least as many balls as crosses. Denote the set of these pairs (π, σ)

by Enw. The same condition holds for the southeast corner when we read columns

from right to left. Denote the set of these pairs (π, σ) by Ese. Letting E denote the

set of pairs (π, σ) satisfying all four of the conditions above, we get

{π<σ}E=EswEneEnwEse.

Pairs of permutations in E satisfy 2n of the n2 conditions required by the (0, 1)-matrix

criterion. And unlike the set {π<σ}, we are able to compute |E|, and to show that

P(E)=(n!)-2|E|=O(n-2). Figure 2.1 is a graphical visualization of the reading-off

process that generates the restrictions defining the set E.

If a row (column) of a submatrix M(π)I,J ( M(σ)I,J resp.) contains a marked entry, we

say that it supports the submatrix. Clearly the number of supporting rows (columns)

15

M(π,0)

-

|

-

Figure 2.1: Finding a necessary condition for π<σ.

equals the number of marked entries in M(π)I,J ( M(σ)I,J resp.). Now, given π, σ,

let M1=M1(π), M2=M2(σ) denote the total number of rows that support Msw(π)

and Msw(σ) respectively. Then Mnw(π) , Mnw(σ) are supported by M3=n/2-M1

columns and by M4=n/2-M2 columns respectively. The same holds for the

southeastern corners of M(π) and M(σ). Obviously the northeastern submatrices of

M(π) and M(σ) are supported by M1 rows and M2 rows respectively. Then we have

16

P(E)=m1,m2P(EA (m1, m2)),

(2.1)

A (m1, m2):={(π, σ) : M1=m1, M2=m2} .

Clearly EA (m1, m2)= if m1<m2. We claim that, for m1>m2,

P(EA(m1, m2))=[(m1-m2+1)(n/2+1)(n/2-m2+1)(m1+1)]4ċi=14(n/2mi)(n/2n)2. (T)

Here and below m3:=n/2-m1 and m4:=n/2-m2 stand for generic values of M3

and M4 in the event A (m1, m2).

To prove (T), let us count the number of pairs (π, σ) in EA(m1, m2). First consider

the southwest corner, Msw(π, σ). Introduce L1=L1(π, σ), the number of rows

supporting both Msw(π) and Msw(σ). So L1 is the number of rows in the southwest

corner Msw (π, σ) containing both a cross and a ball. Suppose that we are on the

event {L1=P1}. We choose p1 rows to support both Msw(π) and Msw(σ) from the

n/2 first rows. Then, we choose (m1-P1+m2-P1) more rows from the remaining

(n/2-P1) rows. Each of these secondary rows is to support either Msw(π) or Msw(σ) ,

but not both. This step can be done in

ways. Next, we partition the set of (m1-P1+m2-P1) secondary rows into two row

subsets of cardinality (m1-l1) (rows to contain crosses) and (m2-l1) (rows to contain

balls) that will support Msw(π) and Msw(σ) , accompanying the p1 primary rows

17

supporting both submatrices. We can visualize each of the resulting row selections as

a subsequence of (1, . . . , n/2) which is a disjoint union of two subsequences, one with

p1 elements labeled by a ball and a cross, and another with (m1-P1+m2-P1) elements,

(m1-l1) labeled by crosses and the remaining (m2-l1) elements labeled by balls.

The condition Esw is equivalent to the restriction: moving along the subsequence from

left to right, at each point the number of crosses is not to fall below the number of

balls. Obviously, no double-marked element can cause violation of this condition.

Thus, our task is reduced to determination of the number of (m1-P1+m2-P1)-long

sequences of m1-l1 crosses and m2-l1 balls such that at no point the number of

crosses is strictly less than the number of balls. By the classic ballot theorem (see

Takacs [49, pp. 2-7] ), the total number of such sequences equals

(m1-l1+1)-(m2-l1)(m1-l1+1)+(m2-l1)

=m1-m2+1m1-l1+1 .

The second binomial coefficient is the total number of (m1-p1+m2-P1)-long

sequences of (m1-l1) crosses and (m2-l1) balls. So the second fraction is the

probability that the sequence chosen uniformly at random among all such sequences

meets the ballot theorem condition. The total number of ways to designate the rows

supporting Msw(π) and Msw(σ) , subject to the condition Esw, is the product of two

counts, namely

18

(l1n/2)(m1-l1+m2-l1n/2-l1)(m1-l1+m2-l1m1-l1)m1-m2+1m1-l1+1

=m1-m2+1n/2-m2+1(m2n/2)(p1m2)(m1-l1+1n/2-m2+1)

Suummmmirng this last expression over all p1<m2, we obtain

m1-m2+1n/2-m2+1 m2

=m1-m2+1n/2-m2+1 (2.2)

=(m1-m2+1)(n/2+1)(n/2-m2+1)(m1+1) .

Here, in the first equality, we have used the binomial theorem. The product of the

two binomial coefficients in the final count (2.2) is the total number of row selections

from the first n/2 rows, m1 to contain crosses and m2 to contain balls. So the

fraction preceding these two binomial factors is the probability that a particular row

selection chosen uniformly at random from all such row selections satisfies our ballot

condition "crosses never fall below balls" Equivalently, by the very derivation, the

expression (2.2) is the total number of paths (X(t), Y(t))0tn/2 on the square lattice

connecting (0, 0) and (m1, m2) such that X(t+1) -X(t), Y(t+1) -Y(t){0,1}, and

X(t)>Y(t) for every t. (To be sure, if X(t+1) -X(t)=1 and Y(t+1) -Y(t)=1,

the corresponding move is a combination of horizontal and vertical unit moves.)

Likewise, we consider the northeast corner, Mne(π, σ). We introduce L2=L2(π, σ),

the number of rows in Mne(π, σ) containing both a cross and a ball. By initially

restricting to the event {L2=P2}, then later summing over all p2<m2, we obtain

19

another factor (2.2). Analogously, a third and fourth factor (2.2) comes from con-

sidering columns in the northwest and southeast corners, Mnw(π, σ) and Mse(π, σ).

Importantly, the row selections for the southwest and the northeast submatrices do

not interfere with the column selections for the northwest and the southeast corners.

So by multiplying these four factors (2.2) we obtain the total number of row and

column selections on the event A(m1, m2) subject to all four restrictions defining E!

Once such a row-column selection has been made, we have determined which rows

and columns support the four submatrices of M(π) and M(σ). Consider, for instance,

the southwest corner of M(π). We have selected m1 rows (from the first n/2 rows)

supporting Msw(π) , and we have selected m3 columns (from the first n/2 columns)

supporting Mnw(π). Then it is the remaining n/2-m3=m1 columns that support

Msw(π). The number of ways to match these m1 rows and m1 columns, thus to

determine Msw(π) completely, is m1!. The northeast corner contributes another m1!,

while each of the two other corners contributes m3!, whence the overall matching

factor is (m1!m3!)2. The matching factor for a is (m2!m4!)2. Multiplying the number

of admissible row-column selections by the resulting i=14(mi!)2 and dividing by (n!)2,

we obtain

P(EA(m1, m2))=[(m1-m2+1)(n/2+1)(n/2-m2+1)(m1+1) ]4. i=14(mi!)2(n!)2,

which is equivalent to (T). Figure 2.2 is a graphical explanation of this matching

factor. In it, we show the matrix M(π) in a case when in the southwest and the

20

n/2"-m

])m

m(

-v

n/2-m

Figure 2.2: Selection of first m=m1 ( n/2-m resp.) rows (columns resp.) in corners

to support M(π).

northeast squares π is supported by the bottom m(=m1) and the top m rows re-

spectively; likewise, in the northwest and the southeast squares π is supported by the

n/2-m leftmost and the n/2-m rightmost columns respectively.

2.2 A Probabilistic Simplification

Let us show that (2.1) and (T) imply

21

P (E)<E[(M1-M2+1)4(n/2+1)4(n/2-M2+1)4(M1+1)4

()

First, M1 and M2 are independent with

P(Mi=mi)=(n/2mi)(n/2n), i=1,2.

Indeed, Mi obviously equals the cardinality of the intersection with [n/2] of a uni-

formly random subset of size n/2 from [n], which directly implies these formulas.

Thus, each Mi has the hypergeometric distribution with parameters n/2, n/2, n/2; in

other words, Mi has the same distribution as the number of red balls in a uniformly

random sample of n/2 balls from an urn containing n/2 red balls and n/2 white balls.

By the independence of M1 and M2 , we obtain

P(M1=m1, M2=m2)=(n/2m1)(n/2m2)(n/2n)2 2.

It remains to observe that (2.1) and (T) imply

P(E)=m1m2(m1-m2+1)4(n/2+1)4(n/2-m2+1)4(m1+1)4ċP(M1=m1, M2=m2)

<m1,m2(m1-m2+1)4(n/2+1)4(n/2-m2+1)4(m1+1)4ċP(M1=m1, M2=m2)

=E[(M1-M2+1)4(n/2+1)4(n/2-M2+1)4(M1+1)4],

and (‡) is proved.

22

2.3 Asymptotics

The advantage of (‡) is that it allows us to use probabilistic tools exploiting the

independence of the random variables M1 and M2. Typically the Mi's are close to

n/4, while |M1-M2| is of order n1/2 at most. So, in view of (‡) we expect that

P(E)=O(n-2).

We now make this argument rigorous. First of all, by the "sample-from-urn" inter-

pretation of Mi,

E[Mi]=n2(n-1n/2-1)(n/2n) =n/4. (2.3)

Then (see Janson et al. [31, p. 29]) the probability generating function of Mi is

dominated by that of Bin(n, 1/4), and consequently for each t>0 we have

P(|Mi-n/4|>t)=O(exp(-4t2/n)) .

Hence, setting t=n2/3 we see that

P(n/4-n2/3<Mi<n/4+n2/3)>1 -e-cn1/3,

for some absolute constant c>0. Introduce the event

An=2i=1{n/4-n2/3<Mi<n/4+n2/3} .

Combining the estimates for Mi, we see that for some absolute constant c1>0,

P(An)>1 -e-c1n1/3

23

Now the random variable in (‡), call it Xn, is bounded by 1, and on the event An,

within a factor of 1+O(n-1/3),

Xn=(4n)8(M1-M2+1)4(n/2+1)4

Therefore

P(E)<(5n)8(n/2+1)4E[(M1-M2+1)4]+O(e-c1n1/3).

It remains to prove that this expected value is O(n2). Introduce M¯i=Mi-E[Mi],

i=1,2. Then

(M1-M2+1)4=(M¯1-M¯2+1)4<27(M¯14+M¯24+1),

as

(a+b+c)2<3(a2+b2+c2).

We now demonstrate that E[M¯i4]=O(n2). To this end, notice first that E[M¯i2] is of

order n exactly. Indeed, extending the computation in (2.3),

E[Mi(Mi-1)]=n2(n2-1) (n-2n/2-2)(n/2n)

=n(n-2)216(n-1).

Therefore

24

E [M¯i2]=Var[Mi]

=E[Mi(Mi-1)]+E[Mi]-E2[Mi]

=n(n-2)216(n-1)+n4-n216 (2.4)

=n16+O(1).

Furthermore, as a special instance of the hypergeometrically distributed random vari-

able, Mi has the same distribution as the sum of n/2 independent Bernoulli variables

Yj{0,1} (see Vatutin and Mikhailov [38], alternatively [31, p. 30]). Therefore,

(2.4) and the Lindeberg-Feller Central Limit Theorem imply

M¯in/16N(0,1), (2.5)

where N(0,1) is the standard normal random variable. In fact, since

Yj-E[Yj]n/160, n ,

we can say more. Indeed, we have (2.5) together with convergence of all the moments

(see Billingsley [4, p. 391]). Therefore, in particular

E[M¯i4](n/16)4E[N(0,1)4], n ,

i.e. E[M¯i4]=O(n2). This completes the proof of Theorem 1.2.1 (upper bound).

25

(JIAPTER 3

THE PROOF OF THE BRUHAT ORDER LOWER

BOUND

In this chapter, we prove Theorem 1.2.1, lower bound. We will actually prove some-

thing better than what was stated there, showing that for each ε >0

P(π<σ)=Ω((α-ε)n),

where

(y =112549793885132421333522!=0.70879 . . . .

First, some preliminaries.

3.1 A Sufficient Condition for Bruhat Comparability

Introduce π* ( σ* resp.), the permutation π ( σ resp.) with the element n deleted.

More generally, for k<n, πk* ( σk* resp.) is the permutation of [n -k] obtained from

π ( σ resp.) by deletion of the k largest elements, n, n -1, ..., n -k+1. The key to

the proof is the following:

26

Figure 3.1: G5 and an emboldened subgrid C.

Lemma 3.1.1. Let k [n]. If every northeastern submatrix of M(π, σ) with at most

k rows contains at least as many crosses as balls, and πk*<σk*, then π<σ.

Before proceeding with the proof, we introduce one more bit of notation. Let Gn be

the empty n ×n grid depicted in the M(ċ)'s of Figure 1.2. Figure 3.1 is a depiction

of G5 and an emboldened northeastern-corner 3 ×4 subgrid of it, denoted by C. If

C is any subgrid of Gn, then M(ċ|C) denotes the submatrix of M(ċ) that "sits" on

C. To repeat, the (0, 1)-matrix criterion says that π<σ if and only if for each

northeastern-corner subgrid C of Gn, we have at least as many crosses as balls in

M(π, σ|C).

Proof. By the assumption, it suffices to show that the balls do not outnumber crosses

in M ( π, a |C) for every such subgrid C with strictly more than k rows. Consider any

such C. Let C(k) denote the subgrid formed by the top k rows of C. Given a submatrix

A of M(π) (of M(σ) resp.), let |A| denote the number of columns in A with a cross

(a ball resp.). We need to show

27

|-

-

-

-

-|

-

-

-

|

|

o

|

|

o

|

|

o

-

-

*

-

-|

-

-

-

*t

|

×

|

×

|

|

×

|

|

Figure 3.2: Deletion of 2 largest elements of π, σ, and its affect on C.

|M(π|C)|>|M(σ|C)|.

By the assumption, we have |M(π|C(k))|>|M(σ|C(k))|. Write |M(π|C(k))|=

|M(σ|C(k))|+λ, A >0. We now delete the top k rows from M(π), M(σ) together

with the k columns that contain the top k crosses in the case of M(π) and the k

columns that contain the top k balls in the case of M(σ). This produces the matrices

M(πk*) and M(σk*). In either case, we obtain the grid Gn-k together with a new

northeastern subgrid: C(πk*) in the case of M(π) and C(σk*) in the case of M(σ).

Figure 3.2 is a graphical visualization of this deletion process in the special case

π=12534, σ=45132, k=2 and C the 3 ×4 northeastern subgrid of G5. We have

emboldened C in M(π) , M(σ) , and the resulting C(π2*), C(σ2*) in M(π2*), M(σ2*)

respectively.

Since we delete more columns in the case of π than σ, note that C(πk*)C(σk*) as

28

northeastern subgrids of Gn-k. In fact, these grids have the same number of rows,

but C(πk*) has A fewer columns. Hence, as πk*<σk*, we have

|M(πk*|C(πk*))|>|M(σk*|C(πk*))|>|M(σk*|C(σk*))|-λ.

So

|M(π|C)|=|M(π|C(k))|+|M(πk*|C(πk*))|

=|M (a |C(k)) |+ A +|M(πk*|C(πk*))|

>|M(σ|C(k))|+|M(σk*|C(σk*))|

=|M(σ|C)|,

which proves the lemma.

3.2 A Reduction to Uniforms

For each k<n, let En , k denote the event "every northeast submatrix of the top k

rows has at least as many crosses as balls" Then by Lemma 3.1.1,

{π<σ}En,k{πk*<σk*}.

Now the events En,k and {πk*<σk*} are independent! So we get

P(π<σ)>P(En,k)P(πk*<σk*) . (3.1)

For the permutation π ( σ resp.) introduce li(π)=π-1(i) ( li(σ)=σ-1(i) resp.),

the index of a column that contains a cross (a ball resp.) at the intersection with

29

row i. In terms of the li(ċ)'s, En,k is the event: for each integer j<k and m<n,

the number of ln(π), pn-1(π), ..., ln-j+1(π) that are m at least is more than or equal

to the number of ln(σ), pn-1(σ), ..., ln-j+1(σ) that are m at least. We could have

replaced an integer m<n with a real number which means that

En,k={(π, σ) : (l(π), l(σ))Ck},

for some cone-shaped (Borel) set CkR2k ; here l(π)={ln-i+1(π)}1ik, l(σ)=

{ln-i+1(σ)}1ik.

Our task is to estimate sharply P(En,k) for a fixed k, and n . Observe first that

l(π) and l(σ) are independent, and each uniformly distributed. For instance

P(ln(π)=j1, ..., ln-k+1(π)=jk)=1(n)k, 1<j1...jk<n,

where (n)k=n(n -1) ...(n -k+1). Since (n)knk as n , ln(π), ...,

pn-k+1(π) are almost independent [n]-uniforms for large n, and fixed k. Let us make

this asymptotic reduction rigorous. Let U be a uniform-[0, 1] random variable, and

let U1, ..., Un be independent copies of U. Then each nUi is uniform on [n], and it

is easy to show that

P (nU1=i1, ..., nUk=ik|nU1...nUk)=1(n)k.

In other words, {ln-i+1(π)}1ik has the same distribution as the random vector

nU :={nUi}1ik conditioned on the event An,k={nU1...nUk}.

Analogously {ln-i+1(σ)}1ik is distributed as nV :={nVi}1ik conditioned on

30

Bn,k={nV1...nVk}, where V1, ..., Vk are independent [0, 1]-uniforms,

independent of U1, ..., Uk. We will need yet another event Dn,k on which

min{minij|Ui-Uj|, minij|Vi-Vj|, m,jn|UReject,-Vj|}>1/n.

Clearly on Dn,k

(nU, nV )Ck(U, V)Ck ;

here U:={Ui}1ik, V:={Vi}1ik. In addition Dn,kAn,kBn,k, and

P(Dn,kc)<2k2P(|U1-U2|<1/n)<4k2/n.

Therefore

P(En,k)=P((l(π), l(σ))Ck)

=P({(nU,nV)Ck}{An,kBn,k})P(An,kBn,k)

=P({(nU,nV)Ck}Dn,k)+O(P(Dn,kc))1-O(P(Dn,kc))

=P((U,V)Ck)+O(k2/n)1-O(k2/n)

=Qk+O(k2/n),

where Qk=P((U, V)Ck). Let us write Pn=P(π<σ). Using (3.1) and the last

estimate we obtain then

Pn>QkPn-k (1 +O(k2/n))=QkPn-kexp(O(k2/n)) , n >k.

31

Iterating this inequality n/k times gives

Pn>Qkn/kPn-n/kkexp[j=0n/k-1O(k2n-jk)]

Since the sum in the exponent is of order O(k2logn), we get

lim inf Pnn>Qkk, k>1.

Thus

lim inf Pnn>supkQkk.

Therefore, for each k and ε (0, Qkk) , we have

Pn=Ω((Qkk-ε)n) . (3.2)

Next

Lemma 3.2.1. As a function of k, Qk is supermultiplicative, i.e. Qk1+k2>Qk1Qk2

for all k1, k2>1. Consequently there exists limkQkk, and moreover

limkQkk=supk1Qkk.

Thus we expect that our lower bound would probably improve as k increases.

Proof. Qk is the probability of the event Ek={(U(k), V(k))Ck}; here U(k) :=

{Ui}1ik, V(k):={Vi}1ik. Explicitly, for each j<k and each c[0,1], the

32

number of U1, ..., Uj not exceeding c is at most the number of V1, ..., Vj not exceeding

c. So Qk1+k2=P(Ek1+k2), Qk1=P(Ek1), while Qk2=P(Ek2)=P(Ek2*). Here the

event Ek2* means that for each j*<k2 and each c[0,1], the number of Ui, i=

k1+1, ..., k1+j*, not exceeding c is at most the number of Vi, i=k1+1, ..., k1+j*,

not exceeding c. The events Ek1 and Ek2* are independent. Consider the intersection

of Ek1 and Ek2* . There are two cases:

1) j< A4. Then the number of Ui, i<j not exceeding c is at most the number of

Vi, i<j not exceeding c, as Ek1 holds.

2) A4 <j< A4 +k2. Then the number of Ui, i<j, not exceeding c is at most

the number of Vi, i< A4 not exceeding c (as Ek1 holds), plus the number of Vi,

A4 <i<j, not exceeding c (as Ek2* holds). The total number of these Vi is the

number of all Vi, i<j, that are at most c, c[0,1].

So Ek1+k2Ek1Ek2*, and we get Qk1+k2>Qk1Qk2. The rest of the statement

follows from a well-known result about super(sub)multiplicative sequences (see P61ya

and Szegö [43, p. 23, ex. 98] ).

El

Given 1<j<i<k, let Ui,j ( Vi,j resp.) denote the j-th element in the increasing

rearrangement of U1, ..., Ui ( V1, ..., Vi resp.). Then, to put it another way, Qk is

the probability that the k Ehresmann conditions are met by the independent k-

dimensional random vectors U and V, both of which have independent entries. That

is, we check Ui,j>Vi,j for each 1<j<i<k by performing element-wise comparisons

in the following tableaux:

33

Uk,1 Uk,2 Uk,3

Uk,k

Vk,1 Vk,2 Vk,3

Vk,k

.ċ. .ċ. .ċ. .ċ. .ċ. .ċ. .ċ. .ċ.

U3,1 U3,2 U3,3 V3,1 V3,2 V3,3

U2,1 U2,2 V2,1 V2,2

U1,1 V1,1

3.3 An Algorithm to Maximize the Bound

What's left is to explain how we determined a = 0.70879....

It should be clear that whether or not (U(k), V(k)) is in Ck depends only on the

size ordering of U1, ..., Uk, V1, ..., Vk. There are (2k)! possible orderings, all being

equally likely. Thus Qk=Nk/(2k)!, where Nk is the number of these linear orderings

satisfying this modified Ehresmann criterion. Since the best constant in the lower

exponential bound is probably limkQkk, our task was to compute Nk for k as

large as our computer could handle. ( "Probably", because we do not know for certain

that Qkk increases with k.)

Here is how Nk was tabulated. Recursively, suppose we have determined all Nk-1

orderings of x1, ..., xk-1, y1, ..., yk-1 such that (x(k-1), y(k-1))Ck-1. Each such

ordering can be assigned a2 (k-1)-long sequence of O's and l's, O's for xi 's and Fs

for yj's, 1<i, j<k-1. Each such sequence meets the ballot-theorem condition:

as we read it from left to right the number of Fs never falls below the number of

O's. We also record the multiplicity of each sequence, which is the number of times

it is encountered in the list of all Nk-1 orderings. The knowledge of all 2(k-1)-long

34

ballot-sequences together with their multiplicities is all we need to compile the list of

all 2k-long ballot-sequences with their respective multiplicities.

For k=1, there is only one ballot-sequence to consider, namely 10, and its multiplicity

is 1. So N1=1, and

Q1=1/2!.

Passing to k=2, we must count the number of ways to insert 1 and 0 into 10 so that

we get a 4-long ballot-sequence of two O's and two Fs. Inserting 1 at the beginning,

giving 110, we can insert 0 into positions 2, 3 or 4, producing three ballot-sequences

1010, 1100, 1100,

respectively. (Inserting 0 into position 1 would have resulted in 0110 which is not a

ballot-sequence.) Similarly, inserting 1 into position 2, we get 110, and inserting 0

under the ballot condition gives th ree ballot-sequences

1010, 1100, 1 tOO.

Finally, inserting 1 at the end, giving 101, we can only insert 0 at the end, obtaining

one ballot-sequence

toto.

Hence, starting from the ballot-sequence 10 of multiplicity 1, we have obtained two

35

4-long ballot-sequences, 1010 of multiplicity 3 and 1100 of multiplicity 4. Therefore

N2=3+4=7, and

Q2=7/4!.

Pass to k=3. Sequentially we insert 1 in each of 5 positions in the ballot-sequence

1010, and then determine all positions for the new 0 which would result in a 6-long

ballot-sequence. While doing this we keep track of how many times each 6-long

ballot-sequence is encountered. Multiplying these numbers by 3, the multiplicity of

1010, we obtain a list of 6-long ballot-sequences spawned by 1010 with the number

of their occurrences. We do the same with the second sequence 1100. Adding the

numbers of occurrences of each 6-long ballot-sequence for 1010 and 1tOO we arrive at

the following list of five 6-long ballot-sequences with their respective multiplicities:

tttooo : 36,

ttotoo : 32,

ttooto : 24,

tottoo : 24,

tototo : 19.

Therefore N3=36+32+24+24+19 =135, and

Q3=135/6!.

36

Table 3.1: Exact computation of Nk for smallish k.

We wrote a computer program for this algorithm. Pushed to its limit, the computer

delivered table 3.1.

Combining (3.2) and the value of 1Q111 in this table, we see that for each ε>0,

Pn=Ω((1Q111-ε)n)=Ω((0.708...-ε)n) .

The numbers Qkk increase steadily for k<12, so at this moment we would not rule

out the tantalizing possibility that Qkk1 as k. Determination of the actual

37

limi t is a challenging open problem. The proof just given only involves mention of

the (0, 1)-matrix criterion, but it was the Ehresmann criterion that actually inspired

our initial insights.

38

(JIAPTER 4

AN EXTENSION TO CHAINS IN BRUHAT ORDER

Our goal in this chapter is to prove Theorem 1.2.2, which extends our upper bound

result on Bruhat-comparability of permutation-pairs. Namely, we will show that for

π1, ..., πrSn selected independently and uniformly at random, we have

P(π1<...<πr)=O(n-r(r-1)) .

So the number of length r chains in the Bruhat poset is of order at most n-r(r-1) (n!)r.

Our basic approach will be at its core, the same as it was for permutation-pairs, but

our enumerative techniques will mimic those established by Gessel and Viennot to

count various classes of non-intersecting lattice paths (see [26]). These same tech-

niques are also highlighted in Bressoud's book [13], which recounts the proof of the

Alternating-Sign Matrix Conjecture.

4.1 The General Setup

First, some preliminaries. Recall the "superimposed" matrix M(π, σ) of ×'s and 's

we introduced earlier ( ×'s for π and 's for σ). Let's introduce the analogous more

general matrix M(π1 , . . . , πr), where we place ×j's at positions (i, πj(i)), 1<i<n,

39

1<j<r. Here, we read rows bottom to top and columns left to right. For instance,

if π1=21543, π2=25143 and π3= 54213, we have

5

4

3

M(π1, π2, π3)=

2

1

1234 5

Given a set of rows I[n] and columns J[n], let M(ċ)I,J denote the submatrix

of M(ċ) corresponding to rows I and columns J. Again, rows are labeled 1, 2, . . . ' n

from bottom to top, and columns are labeled 1, 2, . . . ' n from left to right. The (0, 1)-

matrix criterion says that π1<...<πr if and only if for each southwest submatrix

M(π1 , . . . , πr)[μ],[ν] , μ, U [n], we have

×1's> . . . > ×r 's.

Note that this is the case in M(π1, π2, π3) above, so that we have π1<π2<π3. One

more bit of notation: let Mj(μ, u) denote the number of ×j's in M(πj)[μ] , [ν] . In this

notation,

π1< . . . <πr M1(μ, ν)> . . . >Mr(μ, ν) μ, U [n].

40

4.2 A Tractable Necessary Condition

Now, as in the case of permutation-pairs, we need to find an event which contains

{π1<...<πr}

that is more amenable to enumerative techniques. For simplicity, let's assume n is

even and fix μ=u =n/2 (in what follows, only minor modifications are necessary for

n odd). In our computations, we will primarily concentrate on the single southwest

submatrix

Msw(π1, ..., πr):=M(π1, ..., πr)[n/2],[n/2].

We similarly denote by

Mnw(π1, ..., πr), Mne(π1, ..., πr), Mse(π1, ..., πr)

the northwest, northeast and southeast n/2×n/2 subsquares of M(π1 , . . . , πr), re-

spectively. If π1<...<πr, it is necessary that

M1(i, n/2)>...>Mr(i, n/2), i=1, ..., n/2. (4.1)

This is analogous to the necessary condition we considered for permutation-pairs: we

read off rows of Msw (π1 , . . . ' πr) one at a time, keeping track of the total number of

×j's encountered thus far, 1<j<r. If π1<...<πr, at any intermediate point in

this "reading-off" process, we must never have encountered more ×j's than ×j-1 's,

1<j<r.

41

Let Esw denote the event described in (4.1). Recalling our work with permutation-

pairs, we extract similar events necessary for {π1<...<πr} by considering columns

in the northwest n/2×n/2 square (denote this event Enw), rows in the northeast

square (Ene) and columns in the southeast square (Ese). Then

{π1<...<πr}E:=EswEnwEneEse,

and unlike the set {π1<...<πr}, we can compute |E|! Namely, our task is reduced

to showing

P(E)=|E|(n!)r=O(n-r(r-1)) .

We have seen Figure 4.1 before, but we show it again here to aid in visualizing the

"reading-off" process used to generate the restrictions defining the event E.

4.3 The Core Counting Problem

If a row (column) of a submatrix M(πj)I,J contains a marked entry, we say that it

supports the submatrix. Clearly the number of supporting rows (columns) equals

the number of marked entries in M(πj)I,J. Now, given πj, 1<j<r, let Mj:=

Mj(n/2, n/2), the total number of rows that support Msw(πj). Then Mnw(πj) is

supported by n/2-M1 columns. The same holds for the southeastern corner, Mse(πj).

Obviously the northeastern submatrix Mne(πj) is supported by Mj rows. Then we

have

42

M(π,0)

-

|

-

Figure 4.1: Finding a necessary condition for π1<...<πr.

P(E)= P(E(m1, ..., mr)), (4.2)

m1, . . . , mr

E (m1, ..., mr):=E{(π1, ..., πr) : M1=m1, ..., Mr=mr}.

Clearly, by the (0, 1)-matrix criterion, if there is 1<i<j<r such that mi<mj,

then E(m1 , . . . , mr)=. Otherwise, we claim:

Theorem 4.3.1. For m1>...>mr,

43

P(E ( m1, . . . , mr) )=[1IIr(mi-mj+j-i)(n/2+j-i)(mi+j-i)(n/2-mj+j-i)]4 . i=1r(min/2)(n/2-min/2)(n/2n).

This result is really the crux of our argument. The proof, however, will take a little

work. We present it in three steps.

Proof. Let us assume we are on the event E(m1 , . . . , mr) and that m1>...>mr.

STEP 1. As promised, we first concentrate on the southwest n/2×n/2 subsquare

Msw (π1 , . . . , πr). For each 1<j<r, we need to choose mj rows to support Msw(πj)

in such a way that Esw is satisfied. We claim that the number of ways to choose these

supporting rows, which we denote Nn/2 (m1 , . . . , mr), is given by the determinant-type

formula

(4.3)

Nn/2(m1, ..., mr)=det[ ]i,j=1r ;

here i is the row index, and j the column index. To prove (4.3), we will exploit a

connection with non-intersecting lattice paths implicit in the restrictions defining Esw.

As we have already mentioned, these enumerative techniques were first introduced

by Gessel and Viennot [26], and were highlighted by Bressoud [13].

For each j[r], we construct a lattice path associated with Msw(πj) as follows: start

at the point (j, -j) in the plane. If the first row of Msw(πj) contains a marked entry,

execute the move (0, 1). Otherwise, execute the move (1, 0). In general, when looking

at the i-th row of Msw(πj), 1<i<n/2, we move from our current position up 1 unit

44

if the i-th row contains a marked entry, and move right 1 unit otherwise. This way,

Msw(πj) generates a lattice path consisting of unit moves (0, 1) or (1, 0), connecting

the point (j, -j) with (n/2-mj+j, mj-j).

Now, by considering all r of these paths together in the plane, we get what is known

as a nest of lattice paths. The restrictions defining Esw imply that the nest of r

lattice paths we have just constructed is non-intersecting , i.e. no two paths touch

each other. So to prove the formula (4.3) we need only show that this determinant

counts the total number of these nests of r non-intersecting lattice paths, with moves

(0, 1) and (1, 0), joining the points S :={(j, -j) :j[r]} to the points F:=

{(n/2-mi+i, mi-i) : i[r]}. Figure 4.2 is an illustration of one such nest of

r=7 paths.

To count the number of these non-intersecting nests, we instead consider the collection

of all nests of r lattice paths, with moves (0, 1) and (1, 0), joining the points of S to

the points of F. We require only that no two of the r paths in this nest begin at the

same point or end at the same point, with no further restrictions. In particular, such

a nest of r lattice paths uses every point from both S and F. This allows for some

very tangled nests, like the one shown in Figure 4.3.

To weed out the intersecting nests from the non-intersecting ones, we will employ a

special inclusion-exclusion type argument which gives rise to a sum over permutations

of [r]. Each nest gives rise to a permutation of [r] as follows: define the i-th path

to be the one that ends at (n/2-mi+i, mi-i), i[r]. If the i-th path starts at

(j, -j), j[r], then we define σ(i)=j. For instance, the tangled nest in Figure 4.3

corresponds to the permutation σ= 3512476.

45

(n/2-m1+1, m1-1)

(1,-1)

(n/2-m7+7, m7-7

(7,-7)

Figure 4.2: A non-intersecting nest of 7 lattice paths.

46

Figure 4.3: An intersecting nest of 7 lattice paths.

On the other hand, given σSr, in order that a nest give rise to a in this cor-

respondence the i-th lattice path must end at (n/2-mi+i, mi-i) and begin at

(σ(i), -σ(i)), and so takes a total of mi-i+σ(i) steps northward and n/2-mi+i-σ(i)

steps eastward, i[r]. Hence, the total number of nests corresponding to the per-

mutation a equals

47

i=1r .

Introduce

I(σ):=|{(i, j) : 1<i<j<r, σ-1(i)>σ-1(j)}|,

the inversion number of σ. We claim that the number of nests of r non-intersecting

lattice paths joining S to F equals

(4.4)

σSr(-1)1(σ)i=1r(mi-i+n/2 a (i))=det[ ]i,j=1r,

which proves the formula (4.3).

To prove (4.4), notice that this determinant sums over all possible nests, both in-

tersecting and non-intersecting, where each nest is counted as +1 if the inversion

number of the corresponding a is even, and as -1 otherwise. If a nest happens to be

non-intersecting, then the corresponding permutation is the identity, 12 ...n, which

has inversion number 0, and so these nests are counted as +1. We need to show that

everything else in this sum cancels. To do this, we will pair intersecting nests up,

one corresponding to a permutation with an even inversion number the other an odd

inversion number.

Let a nest N with at least one intersection point be given, and let σSr be its

corresponding permutation. Consider the intersection point (x, y) furthest to the

right in N. If there is more than one intersection point in this column, let (x, y) be

the one that is highest. In Figure 4.3, this is the point (13, 2). We now "swap tails"

48

Figure 4.4: "Swapping" the tails in Figure 4.3.

at (x, y). Specifically, if the paths cross each other at (x, y), we swap the tails so

that they just meet, and vice versa in the other situation. Figure 4.4 is a graphical

visualization of this "swapping" process in the case of our running example Figure

4.3.

Doing this, we get a new intersecting nest N that differs from N only at this "swap-

ping point, (x, y). Let σSr denote the permutation corresponding to N. By

our choice of intersection point (x, y), it is clear that σ only differs from a by a

single adjacent swap of entries in σ. For instance, in our Figure 4.3 example, we have

σ= 3512476 and σ= 3512746. In general we will have I(σ)=I(σ)±1, and so this

pair of intersecting nests cancel each other out in the sum (4.4). Therefore, what we

claimed in (4.4) (and hence (4.3) also) is proved.

STEP 2. Next, we claim that formula (4.3) implies

P(E(m1, ..., mr))=(det[ ]i,j=1r)4ċi=1r[(mi!)2(n/2-mi)!2](n!)r. (4.5)

As a first step to the proof of (4.5), we notice that we have shown something more

49

general regarding the count Nn/2 (m1 , . . . , mr). Consider the following ballot-counting

problem: suppose we have r canditates, C1, ..., Cr, running for election, receiving a

total of μ1>...>μr votes respectively. Suppose we count the votes in a rather

peculiar way: we have a total of lJ ballot boxes arranged in a row. Each box is

allowed to have at most one vote for each candidate with no further restrictions. In

particular, a given box could possibly be empty, and may have at most r ballots in it,

one cast for each candidate. We open the ballot boxes one at a time, keeping track

of the cumulative total votes cast for each candidate at every intermediate point. We

wish to know the total number of allocations of ballots in boxes so that at each of

these intermediate points, we have C1 with at least as many votes as C2, who in turn

has at least as many votes as C3, and so on. By our very derivation above this count

is given by NU(μ1, ..., μr). Namely, we have proved:

Lemma 4.3.2. For the ballot-counting problem above, we have

NU(μ1, ..., μr)=det[ ]i,j=1r

What's more we claim that

NU(ν -μr, ..., u -μ1)=NU(μ1, ..., μr). (4.6)

Indeed, by Lemma 4.3.2, the left-hand side is given by

50

NU(ν -μr, ..., u -μ1)=det[ ]i,j=1r

=det[ ]i,j=1r

We now switch row i with row r-i+1, and column j with column r-j+1, i, j[r].

This has no effect on the determinant. Hence

NU(ν -μr, ..., u -μ1)=det[ ]i,j=1r=NU(μ1, ..., μr),

and formula (4.6) is proved.

We now prove (4.5). First of all, we have already seen that the number of allow-

able supporting-row selections in the southwest subsquare, subject to the restrictions

defining Esw is given by the count in (4.3). A second factor (4.3) comes from choosing

supporting-rows subject to the restrictions defining Ene in the northeast subsquare.

By considering supporting- column selections in the northwest subsquare, subject to

Enw, Lemma 4.3.2 together with equation (4.6) tell us that the total number of al-

lowable supporting-column selections equals

Nn/2 (n/2-mr, ..., n/2-m1)=Nn/2(m1, ..., mr)=det[ ]i,j=1r

also, thus giving a third factor. Analogously, a fourth factor comes from considering

supporting-column selections in the southeast subsquare, subject to the restrictions

defining Ese. So by multiplying these four factors (4.3) we obtain the total number of

51

row and column selections on the event E (m1, . . . , mr) subject to all four restrictions

defining E!

Once such a row-column selection has been made, we have determined which rows

and columns support the four submatrices of M(πi), i[r]. Consider, for instance,

the southwest corner of M(π1). We have selected m1 rows (from the first n/2 rows)

supporting Msw (π1), and we have selected n/2-m1 columns (from the first n/2

columns) supporting Mnw(π1). Then it is the remaining n/2-(n/2-m1)=m1

columns that support Msw (π1). The number of ways to match these m1 rows and

m1 columns, thus to determine Msw (π1) completely, is m1!. The northeast corner

contributes another m1!, while each of the two other corners contributes (n/2-m1)!,

whence the overall matching factor is (m1!)2(n/2-m1)!2. In general, the matching

factor for πi is (mi!)2(n/2-mi)!2, i[r]. Multiplying the number of admissible row-

column selections by the resulting total matching factor i=1r[(mi!)2(n/2-mi)!2] and

dividing by (n!)r, we obtain the formula (4.5).

STEP 3. As a final step in the proof of Theorem 4.3.1, we show that

det[ ]i,j=1r= ...1i<jr(mi-mj+j-i)(n/2+j-i)(mi+j-i)(n/2-mj+j-i).

(4.7)

By putting (4.7) into equation (4.5), we leave it to the interested reader to verify that

we get the formula stated in the theorem.

First of all, we note that, for j>i,

52

=(n/2)!(mi+j-i)...(mi+1)mi!(n/2-mi+i-j)!

=(n/2-mi)(n/2-mi-1)...(n/2-.mi+i+1-j)(mi+j-i)(mi+j-i-1)ċċ(mi+1)

=(n/2mi)(mi+r-i)...(mi+1)(n/2-mi+i-1) ...(n/2-mi+1)

×(n/2-mi+i-1)(n/2-mi+i-2)...(n/2-mi+i+1 -j)

×(mi+j-i+1)(mi+j-i+2)...(mi+r-i)

=(n/2mi)(mi+r-i)...(mi+1)(n/2-mi+i-1) ...(n/2-mi+1)

×[-(xi+b2)][-(xi+b3)]...[-(xi+bj)]

×(xi+aj+1)(xi+aj+2)...(xi+ar), (4.8)

where xs:=ms- (s-1), 1<s<r, at:=t-1 and bt:=-n/2+t-2,2<t<r.

Similarly, for j<i,

53

=(n/2).!(mi+j-i)!(n/2-mi+i-j)ċċ(n/2-mi+1)(n/2-mi)!

=mi(mi-1)...(mi+j-i+.1)(n/2-mi+i-j)(n/2-mi+i-j-1)ċċ(n/2-mi+1)

=(n/2mi)(mi+r-i)...(mi+1)(n/2-mi+i-1) ...(n/2-mi+1)

×(n/2-mi+i-1)(n/2-mi+i-2)...(n/2-mi+i+1 -j)

×(mi+j-i+1)(mi+j-i+2)...(mi+r-i)

=(n/2mi)(mi+r-i)...(mi+1)(n/2-mi+i-1) ...(n/2-mi+1)

×[-(xi+b2)][-(xi+b3)]...[-(xi+bj)]

×(xi+aj+1)(xi+aj+2)...(xi+ar), (4.9)

Obviously, for j=i the identities (4.8) and (4.9) hold also. Hence, we obtain

det[ ]i,j=1r

=II (n/2mi)(mi+r-i)...(mi+1)(n/2-mi+i-1) ...(n/2-mi+1)

×det[[-(xi+b2)]...[-(xi+bj)](xi+aj+1)...(xi+ar)]i,j=1r

= ...1i<jr1(mi+j-i)(n/2-mj+j-i)

×det[[-(xi+b2)]...[-(xi+bj)](xi+aj+1)...(xi+ar)]i,j=1r, (4.10)

so our task is reduced to computing the last determinant in (4.10). For this, we

54

apply the following result of Krattenthaler [35], which extends the Vandermonde

determinant :

Theorem 4.3.3. (Krattenthaler's formula) Given arbitrary values for x1, . . . , xr,

a2, . . . ' ar, and b2, . . . ' br, we have

det[(xi+b2)...(xi+bj)(xi+aj+1)...(xi+ar)]i,j=1r

=II(xi-xj)II(bi-aj)1r2r.

In order to use this result, we must factor (-l) out of column j, 1<j<r, in the

last determinant in (4.10). Doing this, we obtain

det[[-(xi+b2)]...[-(xi+bj)](xi+aj+1)...(xi+ar)]i,j=1r

=(-1)(2r)det[(xi+b2)...(xi+bj)(xi