]> An IDEAL Group, CLC, Project

SOME RESULTS ON RECURRENCE AND ENTROPY

DISSERTATIO N

Presented in Partial Fulfillment of the Requirements for

the Degree Doctor of Philosophy in the Graduate

School of the Ohio State University

By

Ronald Lee Pavlov Jr., B.S., M.S.

**** *

The Ohio State University

2007

Dissertation Committee:

133C1bClbl11111111lbbCC. Approved by

Professor Vitaly Bergelson, Advisor

Professor Alexander Leibman

Advisor

Professor Manfred Einsiedler Graduate Program in Mξ

Graduate Program in Mathematics

ABSTRACT

This thesis is comprised primarily of two separate portions. In the first portion, we

exhibit, for any sparse enough increasing sequence {pn} of integers, a totally minimal,

totally uniquely ergodic, and topologically mixing system (X, T) and fC(X) for

which the averages 1Nn=0N-1f(Tpnx) fail to converge on a residual set in X, answering

negatively an open question of Bergelson. We also construct here a totally minimal,

totally uniquely ergodic, and topologically mixing system (X, T) and xX for

which x{Tpnx}.

In the second portion, we study perturbations of multidimensional shifts of finite

type. Given any Zd shift of finite type X for d>1 and any word w in the language of

X, denote by Xw the set of elements of X in which w does not appear. If X satisfies

a uniform mixing condition called strong irreducibility, we obtain exponential upper

and lower bounds on htop(X)-htop(Xw) dependent only on the size of w. This result

generalizes a result of Lind about Z shifts of finite type.

ii

To Dilip

ill

ACKNOWLEDGMENTS

First and foremost, I would like to thank my advisor Vitaly Bergelson. This the-

sis could never have been completed without his help. His infectious enthusiasm

for mathematics, along with a willingness to share his knowledge, were a constant

inspiration.

I would also like to thank my other committee members, Professors Alexander Leib-

man and Manfred Einsiedler, for agreeing to serve on my committee and for many

fruitful mathematical discussions.

Finally, I thank my friends and loved ones, without whom I would never have been

able to get as far as I have. In no particular order:

Thank you to Mom and Dad, who have always been willing to do anything to

support me and have always believed in me.

Thank you to Greg, without whose friendship and sense of humor graduate school

would have been much less bearable.

Thank you to Jasmine, who was always there for me. I can never thank you

enough for your constant love and support.

IV

VITA

2000-Present . . . . . . . . . . . . . . . . . . . . . . . . . . Graduate Teaching Associate,

The Ohio State University

1998-1999 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Student Instructional Associate,

The Ohio State University

2003 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . M.S. in Mathematics,

The Ohio State University

2000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . B.S. in Mathematics,

The Ohio State University

v

PUBLICATIONS

Research Papers

Some Counterexamples In Topological Dynamics (Ergodic Theory and Dynam-

ical Systems, to appear)

Perturbations of Multidimensional Shifts of Finite Type (submitted)

FIELDS OF STUDY

Major Field: Mathematics

Specialization: Ergodic Theory and Dynamical Systems

vi

TABLE OF CONTENTS

Abstract ii

Dedication . iii

Acknowledgments . 1V

Vita v

List of Figures ix

CHAPTER PAGE

1Introduction 1

2Recurrence and convergence of ergodic averages along sparse sequences 8

2.1 Introduction 8

2.2 Some general symbolic constructions 16

2.3 Some symbolic counterexamples 31

2.4 Some general constructions on connected manifolds . 42

2.5 Some counterexamples on connected manifolds 55

2.6 A counterexample about simultaneous recurrence 61

2.7 Questions 65

3Perturbations of multidimensional shifts of finite type 67

3.1 Introduction 67

3.2 Some measure-theoretic preliminaries 84

3.3 A replacement theorem 93

3.4 The proof of the main result 114

3.5 A closer look at the main result 146

3.6 An application to an undecidability question 155

3.7 Questions 159

Vll

Bibliography

161

Vlll

LIST OF FIGURES

FIGURE PAGE

3.1 f2 's action on a sample element of Y 73

3.2 A portion of a sample element of Z 76

3.3 a4 77

3.4 b4 77

3.5 Rk(j1+R),k(j2+R),...,k(jd+R) 85

3.6 A standard replacement of u. 95

3.7 A sample element S of Rj 97

3.8 An element S of Rj associated to two standard replacements 98

3.9 The suboctants of Bj 99

3.10 Elements S, S of Rj whose difference is a multiple of ei tOO

3.11 An element S of Rj which contains O 106

3.12 107

3.13 Bj 108

3.14 Intersecting occurrences of w 116

3.15 SiSi(R) 120

3.16 Disallowed and allowed pairs of overlapping wj,d 127

3.17 A point xXy 138

ix

3.18 The correspondence between copies of ΓW and points in Γj(2)

139

3.19 How a copy of ΓW is filled if bj,d(p)=1 140

3.20 f

141

x

(JIAPTER 1

INTRODUCTION

This introduction will serve as a brief mathematical and historical overview of both

of the main problems that we will examine in this thesis. Due to their somewhat

disparate natures, the two portions of this thesis will each contain a more in-depth

introduction as well. For this reason, we will for the most part relegate formal defi-

nitions to their pertinent introduction.

Ergodic theory is the study of average or long-term behavior of systems which evolve

with time. For example, one can consider a particle bouncing in a box at fixed speed.

Its position and velocity at any time can be represented by a vector in R6. One can

then model this behavior by taking X to be the space of all possible positions and

velocities for the particle, and T a self-map of X which, given a position and velocity,

gives the position and velocity one second later. This pairing of a space X with a

self-map T is called a dynamical system.

The more general setup is that of a group G acting on a space X with some sort

of structure by maps {Tg}gG preserving this structure. In this thesis, G is always

Zd for some d N. Measure-theoretic ergodic theory occurs when X is a measure

space with a probability measure μ invariant under each Tg, and so in this case we call

1

(X, B, μ, {Tg}gG) a measure-preserving dynamical system. Topological dynam-

ics occurs when X is a compact topological space and each Tg is a homeomorphism,

and so in this case we call (X, {Tg}gG) a topological dynamical system. In ei-

ther type of system, if G=Z, then Tn=(T1)n for any integer n, and so we shorten

(X, B, μ, {Tn}nZ) to (X, B, μ, T) and (X, {Tn}nZ) to (X, T). (Here T=T1.)

These cases are not as disjoint as they may first appear: the famed Bogoliouboff-

Krylov theorem states that any topological dynamical system (X, {Tg}gG) has at

least one invariant probability measure μ as long as G is an amenable group. (An

examination of amenability is beyond the scope of this thesis, but we mention that

amenable groups are a class which contain all abelian groups. We only consider

G=Zd in this thesis, so all topological dynamical systems examined here will possess

invariant measures.) This sometimes allows one to examine topological properties of

a system via properties of invariant measures. For instance, in Chapter 3, we will

prove some purely combinatorial properties of a Zd-action by homeomorphisms of a

Cantor set by means of studying the invariant measures of this action.

Chapter 2 deals primarily with ergodic averaging. One of the fundamental results of

ergodic theory is Birkhoff's ergodic theorem:

Theorem 1.0.1. ([Bi]) For any measure-preserving dynamical system (X, B, μ, T)

and any fL1(X), limn1ni=0n-1f(Tix) exists for μ-almost every xX.

Several results have been proven about the convergence of such averages when one

averages not along all powers of T, but only along some distinguished subset of the

2

integers. ([Bou], [Bou2], [Wi]) In particular, when one averages along {p(n)} for a

polynomial p(n) with integer coefficients, there is the following result of Bourgain:

Theorem 2.1.12. ([Bou], p. 7, Theorem t) For any measure-preserving dynamical

system (X, B, μ, T), for any polynomial q(t)Z[t], and for any fLp(X, B, μ) with

p>1, limN1Nn=1Nf(Tq(n)x) exists for μ-almost every xX.

Theorem 2.1.12 can be interpreted as follows: for any polynomial q(t)Z[t], any

measure-preserving system (X, B, μ, T), and any measure-theoretically "nice" func-

tion f, the set of points x where limN1Nn=1Nf(Tq(n)x) fails to converge is of mea-

sure zero, or negligible measure-theoretically. It is then natural to wonder whether or

not there is a topological parallel to this result using topological notions of "niceness"

(continuity) and negligibility (first category), and in fact such a question was posed

by Bergelson:

Question 2.1.13. ([Be], p. 51, Question 5) Assume that a topological dynamical

system (X, T) is uniquely ergodic1, and let pZ[t] and fC(X). Is it true that for

all but a first category set of points limn1ni=0n-1f(Tp(i)x) exists?

One of our results is a (quite negative) answer to Question 2.1.13:

Theorem 2.1.15. For any increasing sequence {pn} of integers with upper Banach

density zero2, there exists a totally minimal, totally uniquely ergodic, and topologically

1A topological dynamical system (X, B, μ, T) is uniquely ergodic if there exists exactly one T-

invariant measure μ.

2A set AN has upper Banach density zero if limsupnsupmN|{m,m+1,...,m+n-1}A|n=0.

3

mixing topological dynamical system (X, T) and a continuous function f on X with

the property that 1Nn=0N-1f(Tpnx) fails to converge for a residual set of x.

We also examine recurrence along distinguished subsets of the integers, motivated

primarily by the following result:

Theorem 2.1.17. ( [BeL], p. 14, Corollary 1.8) For any dN, any minimal3 topolog-

ical dynamical system (X, {Tv}vZd) and any polynomials q1(t), q2(t), ..., qd(t)Z[t]

with qi(0)=0 for 1<i<d, for a residual set of xX there exists a sequence

{ni} of positive integers such that Tq(ni)eixx for 1<i<d, where {ei}i=1d is the

standard orthonormal basis of Rd.

In particular, this implies that for any minimal topological dynamical system (X, T)

and any polynomial q(t)Z[t] with q(0)=0, the set of xX with x{Tq(n)x}nN

is residual. The following result shows that for q with degree at least two, this residual

set is not necessarily all of X.

Theorem 2.1.18. For any increasing sequence {pn} of integers with upper Banach

density zero, there exists a totally minimal, totally uniquely ergodic, and topologically

mixing topological dynamical system (X, T) and an uncountable set A X such that

for every x A, the sequence {Tpnx} does not have x as a limit point, i.e. there is

no sequence of positive integers {ni} such that Tpnix converges to x.

Another consequence of Theorem 2.1.17 is that for any commuting minimal homeo-

morphisms T and S of a compact space X, there exists a residual set of x for which

sA topological dynamical system (X, B, μ, T) is minimal if there exist no nonempty proper closed

T-invariant subsets of X.

4

there exists a sequence of positive integers {ni} such that Tnixx and Snixx.

The following theorem shows that for some systems, it is the case that this residual

set of x is not all of X.

Theorem 2.1.21. There exists a totally minimal topological dynamical system (X, T)

and a point x X such that for any positive integers r s, any sequence {ni} of

integers satisfying Trnixx and Tsnixx is eventually zero.

The systems which we construct to prove Theorems 2.1.15 and 2.1.18 are symbolic

dynamical systems. A symbolic dynamical system is defined by first choosing a

finite set A, called the alphabet. Ω=AG endowed with the product topology is a

compact space, and for any gG, we may define σg the shift homeomorphism by

(σgω)(h)=ω(hg) for ω Q. Any closed set XΩ has a topology induced by Ω, and

if X is invariant under each Tg, then (X, {σg}gG) is a topological dynamical system

which we call a symbolic dynamical system.

In Chapter 3, we examine symbolic dynamical systems exclusively. There, we consider

a specific type of symbolic dynamical system called a shift of finite type. A Zd-shift

of finite type is a symbolic dynamical system (X, {σv}vZd) where X is defined by

specifying a finite set F of finite words (a word is a function from a finite subset of

Zd to A) and taking X to be the set of all elements of Ω in which none of the words

in F appear. The shift of finite type X specified by a finite set F of words in this

way is denoted by ΩF. For example, the set of all biinfinite sequences of zeroes and

ones in which no two ones occur consecutively is a shift of finite type, as is the set of

5

all Z2 arrays of zeroes and ones in which no three-by-four blocks of all zeroes or all

ones appear.

We are particularly interested in the effects of forbidding a particular word from a

shift of finite type X. Given any word w, define Xw to be the set of elements of X in

which w does not appear. Then clearly Xw is a subset of X, and so the topological

entropy htop(Xw) of Xw is not greater than the topological entropy htop(X) of X. It

is natural to wonder how much the topological entropy drops by when w is removed,

as it is a sort of measure of how important w is to the information-retaining capacity

of X. In [L], Lind proved the following: (the condition wLΓn(X) means that

wA[1,...,n]d and that w appears in some element of X.)

Theorem 3.1.16. ([L], p. 360, Theorem 3) For any topologically transitive Z-shift

of finite type X=ΩF with positive topological entropy htop(X), there exist constants

CX, DX, and NX such that for any n> NX and any word wLΓn(X), if we denote

by Xw the shift of finite type ΩFu{w}, then

CXehtop(X)n<htop(X)-htop(Xw)<DXehtop(X)n.

Our main result is a generalization of Theorem 3.1.16 for Zd-shifts of finite type

which satisfy a mixing condition called strong irreducibility. (We formally define

strong irreducibility in Definition 3.1.19.)

Theorem 3.1.22. For any d >1 and any strongly irreducible Zd-shift X =ΩF

of finite type with uniform filling length R and positive topological entropy htop(X),

6

there exist constants NXN and DXR such that for any n> NX and any word

wLΓn(X), if we denote by Xw the shift of finite type ΩFu{w}, then

1ehtop(X)(n+44R+70)d<htop(X)-htop(Xw)<DXehtop(X)(n-2R)d.

One way in which Zd-shifts of finite type are more difficult to deal with for d>1 is

that given a finite collection F of patterns, it is undecidable whether or not the shift

of finite type X induced by F is even nonempty. Our methods yield a situation in

which this question can be answered:

Theorem 3.6.1. For any alphabet A, there exist F, G N such that for any m >0

and any finite set of words Fm={wkLΓnk(X) : 1<k<m} satisfying n1>G

and nk>F(nk-1)4d2 for 1<k<m, ΩFm.

7

(JIAPTER 2

RECURRENCE AND CONVERGENCE OF ERGODIC

AVERAGES ALONG SPARSE SEQUENCES

2.1 Introduction

In this chapter, we are concerned with the convergence of averages of the form

1Nn=0N-1f(Tpnx) for an increasing sequence of integers {pn}. We begin with some

definitions.

Definition 2.1.1. A measure-preserving dynamical system (X, B, μ, {Tg}gG)

consists of a measure space X, a probability measure μ with σ-algebra 8 of measurable

sets, and a group action {Tg}gG of transformations Tg : XX with μ(Tg-1A)=

μ(A) for all AB.

Definition 2.1.2. A measure-preserving dynamical system (X, B, μ, {Tg}gG) is er-

godic if any set A satisfying TgA A for all g G has μ(A){0,1}.

Definition 2.1.3. A topological dynamical system (X, {Tg}gG) consists of a

compact topological space X and a group action {Tg}gG of homeomorphisms Tg :

X X.

8

Definition 2.1.4. Given a topological dynamical system (X, {Tg}gG), a Borel prob-

ability measure μ on X is called ergodic if (X, B(X), μ, {Tg}gG) is an ergodic

measure-preserving dynamical system, where B(X) is the Borel σ-algebra of X.

In this chapter, all dynamical systems have G=Z, so as already described we

will use the notations (X, B, μ, T) and (X, T) for measure-preserving and topological

dynamical systems respectively.

Definition 2.1.5. A topological dynamical system (X, T) is minimal if for any

closed set K with T-1KK, K = or K =X. (X, T) is totally minimal if

(X, Tn) is minimal for every n O.

Definition 2.1.6. A topological dynamical system (X, T) is uniquely ergodic if

there is only one Borel measure μ on X such that μ(A)=μ(T-1A) for every Borel

set A X. (X, T) is totally uniquely ergodic if (X, Tn) is uniquely ergodic for

every n N.

Definition 2.1.7. A topological dynamical system (X, T) is topologically mixing

if for any open sets U, VX, there exists N N such that for any n >N,

UTnV.

Definition 2.1.8. For any set AN, the upper Banach density of A is defined

by

d*(A)=limsupsup|{m,m+1,...,m+n-1}A|n.

nmN

9

Definition 2.1.9. For any set AN, the upper density of A is defined by

d¯(A)=limnsup|{1,...,n}A|n.

Definition 2.1.10. For a set AN, the density of A is defined by

d(A)=limn|{1,...,n}A|n

if this limit exists.

Definition 2.1.11. Given a topological dynamical system (X, T) and a T-invariant

Borel probability measure μ, a point x X is (T, μ)-generic if for every f C(X),

limn1ni=0n-1f(Tix)=fdμ.

In the measure-preserving setup, there are several positive results about convergence

of averages of the form 1Nn=0N-1f(Tpnx), including this theorem of Bourgain:

Theorem 2.1.12. ([Bou], p. 7, Theorem t) For any measure-preserving dynamical

system (X, B, μ, T), for any polynomial q(t)Z[t], and for any fLp(X, B, μ) with

p>1, limN1Nn=1Nf(Tq(n)x) exists for μ-almost every xX.

The following question regarding a possible topological version of Theorem 2.1.12 was

posed by Bergelson:

Question 2.1.13. ([Be], p. 51, Question 5) Assume that a topological dynamical

system (X, T) is uniquely ergodic, and let pZ[t] and fC(X). Is it true that for

all but a first category set of points limn1ni=0n-1f(Tp(i)x) exists?

to

Bergelson added the hypothesis of unique ergodicity because it is a classical result

that a system (X, T) is uniquely ergodic with unique T-invariant measure μ if and

only if for every xX and fC(X), limn1ni=0n-1f(Tix)=Xfdμ, and so in

the topological setup this is a natural assumption to make about (X, T) to achieve

the desired result.

However, Bergelson was particularly interested in the convergence of these averages

to the "correct limit," i.e. Xfdμ where μ is the unique T-invariant measure on

X. To have any hope for such a result, it also becomes necessary to assume ergod-

icity for all powers of T in order to avoid some natural counterexamples related to

distribution (mod k) of p(n) for positive integers k. For example, if p(n)=n2, T is

the permutation on X={0,1, 2} defined by Tx=x+1 (mod 3), μ is normalized

counting measure on X, and f=χ{0}, then (X, T) is obviously uniquely ergodic with

unique invariant measure μ=δ0+δ1+δ23, but

limn1ni=0n-1f(Tp(i)x)= ifx=1ifx=0iffx=2', .and

To avoid such examples, we would need T to be totally ergodic as well as uniquely

ergodic, and so it makes sense to assume total unique ergodicity to encompass both

properties. Bergelson's revised question then looks like this:

Question 2.1.14. Assume that a topological dynamical system (X, T) is totally uniquely

11

ergodic with unique T-invariant measure μ, and let pZ[t] and fC(X). Is it true

that for all but a first category set of points limn1ni=0n-1f(Tp(i)x)=Xfdμl.?

We answer Questions 2.1.13 and 2.1.14 negatively in the case where the degree of

p is at least two, and in fact prove some slightly more general results. The level of

generality depends on what hypotheses we place on the space X. In particular, we

can exhibit more counterexamples in the case where X is a totally disconnected space

than we can in the case where X is a connected space such as Tk.

Theorem 2.1.15. For any increasing sequence {pn} of integers with upper Banach

density zero, there exists a totally minimal, totally uniquely ergodic, and topologically

mixing topological dynamical system (X, T) and a continuous function f on X with

the property that 1Nn=0N-1f(Tpnx) fails to converge for a residual set of x.

Theorem 2.1.16. For any increasing sequence {pn} of integers with the property

that for some integer d, pn+1<(pn+1-pn)d for all suffiffifficiently large n, there ex-

ists a totally minimal, totally uniquely ergodic, and topologically mixing topological

dynamical system (X, T) and a continuous function f on X with the property that

1Nn=0N-1f(Tpnx) fails to converge for a residual set of x. In addition, the space X is

a connected 2d+ 9-manifold.

We note that Theorems 2.1.15 and 2.1.16 answer Question 2.1.13 negatively for p

with degree at least two, since the sequence pn=p(n) for any nonlinear p(t)Z[t]

satisfies the hypotheses of both theorems.

Theorems 2.1.15 and 2.1.16 are about nonconvergence of ergodic averages along cer-

tain sequences of powers of x. We also prove two similar results about nonrecurrence

12

of points. As motivation, we note that a minimal system has the property that every

point is recurrent. In other words, if (X, T) is minimal, then for all xX, it is the

case that x{Tnx}nN. If (X, T) is totally minimal, then all points are recurrent

even along infinite arithmetic progressions: for any nonnegative integers a, b, and for

all xX, x{Tan+bx}nN. It is then natural to wonder if the same is true for other

sequences of powers of T, and in this vein there is the following result of Bergelson

and Leibman, which is a corollary to their Polynomial van der Waerden theorem:

Theorem 2.1.17. ( [BeL], p. 14, Corollary 1.8) For any dN, any minimal topolog-

ical dynamical system (X, {Tv}vZd) and any polynomials q1(t), q2(t), ..., qd(t)Z[t]

with qi(0)=0 for 1<i<d, for a residual set of xX there exists a sequence

{ni} of positive integers such that Tq(ni)eixx for 1<i<d, where {ei}i=1d is the

standard orthonormal basis of Rd.

In particular, Theorem 2.1.17 implies that for a minimal system (X, T) and any

polynomial p(t) with p(0)=0, the set of x such that x{Tp(n)x}nN is residual. If

(X, T) is assumed to be totally minimal, then by definition if the degree of p is one,

then every point xX is in {Tp(n)x}nN. The following two results imply that if the

degree of p is greater than one, total minimality of (X, T) does not necessarily imply

that every point xX is in {Tp(n)x}nN.

Theorem 2.1.18. For any increasing sequence {pn} of integers with upper Banach

density zero, there exists a totally minimal, totally uniquely ergodic, and topologically

mixing topological dynamical system (X, T) and an uncountable set A X such that

13

for every xA, the sequence {Tpnx} does not have x as a limit point, i.e. there is

no sequence of positive integers {ni} such that Tpnix converges to x.

Theorem 2.1.19. For any increasing sequence {pn} of integers with the property

that for some integer d, pn+1<(pn+1-pn)d for all suffiffifficiently large n, there exists a

totally minimal, totally uniquely ergodic, and topologically mixing topological dynam-

ical system (X, T) and a point xX such that the sequence {Tpnx} does not have

x as a limit point, i.e. there is no sequence of positive integers {ni} such that Tpnix

converges to x. In addition, the space X is a connected 2d+ 7-manifold.

The following simple lemma shows that if (X, T) is topologically mixing, then The-

orems 2.1.18 and 2.1.19 cannot be improved too much, i.e. there is no increasing

sequence {pn} for which we can exhibit topologically mixing examples with a second

category set of such nonrecurrent points.

Lemma 2.1.20. If a topological dynamical system (X, T) is topologically mixing, then

for any increasing sequence {pn}, the set of x X for which x {Tpnx} is of first

category.

Proof: Define Cε={x : d(x, Tpnx)>nZ}. It is clear that Cε is closed. We

claim that Cε contains no nonempty open set, which shows that it is nowhere dense,

implying that C=n=1C1n the set of points x for which x is not a limit point of

{Tpnx} is of first category. Suppose, for a contradiction, that there is a nonempty

open set U with UCε for some ε. Then, there exists V with diam(V <ε such that

VUCε. By topological mixing, there exists n such that VT-pnV. This

14

implies that there exists xV so that TpnxV. Since diam(V <ε, d(x, Tpnx)<ε.

However, xVCε, so we have a contradiction.

Theorem 2.1.18 shows that it is possible for this set of points nonrecurrent along pn

to be uncountable though.

We mention that some mixing condition is necessary for a statement like Lemma 2.1.20;

as a simple example, consider an irrational circle rotation T : xx+α on the cir-

cle T. There is clearly some increasing sequence of integers {pn} such that pn(y

(mod 1) 12. Then, for any xT, Tpnxx+12, and so for every xX, {Tpnx}

does not have x as a limit point.

We also prove a result about simultaneous recurrence. Theorem 2.1.17 implies that

for any commuting minimal homeomorphisms T and S of a compact space X, there

exists a residual set of x for which there exists a sequence of positive integers {ni}

such that Tnixx and Snixx. The following theorem shows that for some

systems, it is the case that this residual set of x is not all of X. (In this particular

example, T and S are powers of the same homeomorphism.)

Theorem 2.1.21. There exists a totally minimal topological dynamical system (X, T)

and a point x X such that for any positive integers r s, any sequence {ni} of

integers satisfying Trnixx and Tsnixx is eventually zero.

Before proceeding with the proofs, we now give a brief description of the content of

this chapter. In Section 2.2, we will describe some general symbolic constructions of

15

topological dynamical systems with particular mixing properties. At the end of this

section, we will arrive at a construction of a system which is totally minimal, totally

uniquely ergodic, and topologically mixing, and which has as a parameter a sequence

of integers {nk}.

In Section 2.3, by taking this sequence {nk} to grow very quickly, we will show

that the examples constructed in Section 2.2 are sufficient to prove Theorems 2.1.15

and 2.1.18. Some interesting questions also arise and are answered in Section 2.3

pertaining to the upper Banach density of countable unions of sets of upper Banach

density zero.

In Section 2.4, we create a flow under a function with base transformation a skew

product, which acts on a connected manifold, and which is totally minimal, totally

uniquely ergodic, and topologically mixing. This transformation has as a parameter

a function fC(T). We use conditions of Fayad ([Fa]) on flows under functions

to achieve topological mixing, and some conditions of Furstenberg ([Fu]) on skew

products to prove total unique ergodicity.

In Section 2.5, by a judicious choice of f, we use the examples of Section 2.4 to prove

Theorems 2.1.16 and 2.1.19.

In Section 2.6, we prove Theorem 2.1.21.

Finally, in Section 2.7 we give some open questions about strengthening our results.

2.2 Some general symbolic constructions

Definition 2.2.1. An alphabet is any finite set, whose elements are called letters.

16

Definition 2.2.2. A symbolic dynamical system with alphabet A is a topological

dynamical system (X, {σg}gG) where for any gG, σg is the shift homeomorphism

of Ω=AG defined by (σgω)(h)=ω(hg) for ωΩ, and X is any closed subset of Ω

invariant under each σg.

Definition 2.2.3. A word on the alphabet A is any element of AF for some finite

set F G. F is called the shape of w.

For any words v of length m and w of length n, we denote by vw their concatenation,

i.e. the word v[1]v[2]... v[m]w[1]w[2]... w[n] of length m+n. We denote by wk the

word torn . . . w given by the concatenation of k copies of w.

Definition 2.2.4. Given any symbolic dynamical system (X, {σg}gG), the language

of X, denoted by L(X), is the set of all words which appear as subwords of X.

All symbolic dynamical systems appearing in this chapter have alphabet {0, 1}, G=

Z. We also restrict ourselves in this chapter to words with shape {1, ..., n} for some

n. In other words, a word can be thought of for now as a finite string of letters

w=w(1)w(2)... w(n). The number of letters in a word w is called its length.

Definition 2.2.5. A word w of length n is a subword of a sequence u Ω if there

exists k Z such that u(i+k)=w(i) for 1<i<n. Analogously, w is a subword

of a word v of length m if there exists 0<k<m-n such that v(i+k)=w(i) for

1<i<n.

We will outline three constructions which algorithmically create X such that (X, σ)

have certain properties. (Here the "certain properties" in question depend on which

17

construction is used.) It should also be mentioned that many ideas from these con-

structions are taken from work of Hahn and Katznelson ([HaK]), where they also

algorithmically constructed symbolic topological dynamical systems with certain er-

godicity and mixing properties.

Construction 1: (Minimal) We define inductively nk and Ak, which are, respec-

tively, sequences of positive integers and sets of words on the alphabet {0, 1}. Each

word in Ak is of length nk. (We will use the term " Ak-word" to refer to a member of

Ak from now on.) We define these as follows: always define n1=1 and A1={0,1}.

Then, for any k>1, nk+1 is defined to be any integer greater than or equal to nk|Ak|

which is also a multiple of nk, and then Ak+1 is chosen to be the set of words of

length nk+1 which are concatenations of Ak-words, containing each Ak word in the

concatenation at least once.

We then define X to be the set of all xΩ which are limits of shifted Ak words In

other words, xX if there exist wkAk and mkZ such that for all large enough

i, x(i)=wk(i-mk). Since Ω is compact, such x exist by a standard diagonalization

argument. It is easily checkable that X is closed and a-invariant. The claim is that

regardless of the choice of the integers nk, as long as nk divides nk+1, and nk+1>

nk|Ak|, (X, σ) is minimal. It suffices to show that for any yX, {σny}nZ=X.

Choose any yX and wL(X). By the definition of X, there exists k and some

Ak word wk such that w is a subword of wk. wk is an Ak-word, so by definition,

every Ak+1-word contains wk, and therefore w, as a subword. Finally, note that again

by the definition of Construction 1, y is an biinfinite concatenation of Ak+1 words

18

This implies that y(1). . . y(2nk+1) contains some Ak+1 word, and therefore w, as a

subword, and so there exists nZ so that σny begins with w. Since w was an

arbitrary word in L(X), this implies that {σny}nZ=X, and so (X, σ) is minimal.

So, we have now demonstrated a way of constructing minimal (X, σ). We will now

make this construction a bit more complex in order to make (X, σ) totally minimal.

Construction 2: (Totally minimal) We define inductively nk and Ak, which are,

respectively, sequences of positive integers and sets of words on the alphabet {0, 1}.

Each word in Ak is of length nk. We define these as follows: always define n1=1

and A1={0,1}. Then, for any k>1, nk+1 is defined to be any integer greater than

or equal to (k!)2nk|Ak|+k!+nk2 , and then Ak+1 is chosen to be the set of words w of

length nk+1 which are concatenations of Ak-words and the word 1 with the following

properties: the word 1 does not appear at the beginning or end of w, only a single

1 can be concatenated between two Ak-words, and for every wAk, and for every

0<i<k!, w appears in w at an i (mod k!)-indexed place. That is, there exists

mi (mod k!) with w(m)w(m+1) . . . w(m+nk-1) =w. From now on, to refer

to this second condition, we say that every wAk occurs in w at places indexed by

all residue classes modulo k!.

Since this construction is a bit complicated, a few quick examples may be in or-

der. Suppose that n2=6, |A2|=4, and we choose n3=134. Say that A2=

{a, b, c, d}.Thenw=abcd1abcd1dabcdabcabcdab is an A3-word: each A2 word ap-

pears at least once beginning with a letter of w with an odd index, and at least

19

once beginning with a letter of w with an even index. Examples of words which

would not be A3-words include abcdllabcddabcdababcabcd (1 is concatenated twice

between d and a), abcdo lbcdldbcdaabcbbbbdc (occurrences of the word a begin only

with even-indexed letters), or abcdldcbabcdabcdabcdadb (wrong number of letters.)

We can then define X to again be the set of all xΩ which are limits of shifted

Ak-words. For this definition to make sense, it suffices to show that Ak is nonempty

for all k, since if this is true then X by compactness of Ω and a diagonalization

argument just as in Construction 1. A1 is nonempty, so it suffices to show that Ak

implies Ak+1 for all k. For any k, assume that wkAk. Then, enumerate the

elements of Ak by wk=a1,a2,...,a|Ak|,ankddnk+1-k1(k1nk|Ak|+1)-i(n+1)nk efine the words uk+1=a1k!a2k! . . . a|Ak|k!

and w=(uk+11)k!(a11)ia1 , where ink+1-k! (mod nk). Since

nk+1>(k!)2nk|Ak|+k!+nk2, w exists, and is a concatenation of Ak-words and the

word 1 with length nk+1. In w, at most a single 1 is concatenated between any two

Ak-words and 1 does not appear at the beginning or end of w. Also, since the length

of uk+1 is divisible by k!, and since all Ak-words are subwords of uk+1, all Ak words

appear in (uk+11)k! at places indexed by all residue classes modulo k!, and so all

Ak words appear in w at places indexed by all residue classes modulo k! as well.

Therefore, wAk+1, and so Ak+1 is nonempty.

We will show that (X, σ) is totally minimal. Fix any m>0. We wish to show that

for any yX, {σmny}nZ=X. Choose any such y, and fix any word wL(X).

By definition of X, there exists k and an Ak word wk such that w is a subword of

wk. Without loss of generality, we assume that k>m. By the construction, wk

20

occurs in every Ak+1-word, and it occurs at places indexed by every residue class

modulo k!. Since k>m, in particular this implies that wk, and therefore w, occurs

in every Ak+1-word at places indexed by every residue class modulo m. By definition

of Construction 2, y is a biinfinite concatenation of Ak+1-words and the word 1.

Therefore, y(1). . . y(2nk+1+2) contains some Ak+1-word as a subword, and so w

occurs in y(1). . . y(2nk+1+2) at places indexed by every residue class modulo m.

There then exists n so that σmny begins with w. Since w was an arbitrary word of

L(X), this shows that {σmny}nZ=X, and since m was arbitrary, that (X, σ) is

totally minimal.

We now define one more general type of construction, again more complex than the

last, so that the system created is always totally uniquely ergodic and topologically

mixing, in addition to being totally minimal. For this last construction, we first need

a couple of definitions.

Definition 2.2.6. For any integers 0<i<m and k, and wAk-1 and w

Ak, we define fri,m* (w, w) to be the ratio of the number of occurrences of w as a

concatenated Ak-1-rvord at i (mod m)-indexed places in w to the total number of

Ak-1-rvords concatenated in w.

We consider any positive integer to be equal to 0 (mod t) for the purposes of this

definition. An example is clearly in order: if A1={01, 10}, w=01, (in Construc-

tions 2 and 3, A1 is always taken to be {0, 1}, but here we deviate from this for

21

illustrative purposes) and w is the A2 word Ot |10|1|01 |10 (here vertical bars show

where breaks in the concatenation occur), then w occurs twice out of four A1-words,

so fr0,1* (w, w) =12. Since one of these occurrences begins at w(1) and one begins at

w(6), fr0,2*(w, w)=fr1,2*(w, w)=14. We make a quick note here that there could

be some ambiguity here if an Ak+1-word could be decomposed as a concatenation of

Ak words and ones in more than one way. For this reason, we just assume that when

computing fri,j* (w, w) , the definition of the Ak+1 word w includes its representation

as a concatenation of Ak-words and ones. (i.e. in the example given, w is defined

as the concatenation Of |10|1|01 |10 of A1-words and ones, rather than the nine-letter

word 011010110.)

Definition 2.2.7. Given any words w of length n and w of length n<n, and any

integers 0<i<m, define fri,m (w, w) to be the number of occurrences of w at i

(mod m)-indexed places in w, divided by n-n+1.

Taking the previous example again, fr0,1(w, w)=38, since 01 occurs three times as

a subword of 011010110. Since two of these occurrences begin at letters of w with

even indices and one begins at a letter of w with odd index, fr0,2(w, w)=28 and

fr1,2(w, w)=18.

Construction 3: (Totally minimal, totally uniquely ergodic, and topologi-

cally mixing) We define inductively nk and Ak, which are, respectively, sequences

of positive integers and sets of words on the alphabet {0, 1}. Each word in Ak is of

length nk. We define these as follows: always define n1=1 and A1={0,1}. Then,

we fix any sequence {dk} of positive reals such that k=1dk< and define, for each

22

k>1, some nk+1=Ck(k+1)!|Ak|nk+p for any integer Ck>nk>1dk and prime

nk<p<2nk (We may choose such a p by Bertrand's postulate. [HaW]) Note that

this implies that (nk, k!)=1 for all kN. We then define Ak+1 to be the set of words

w of length nk+1 with all of the same properties as in Construction 2, along with the

property that, for any wAk, and for any 0<i<k!, fri,k!*(w, w)[1-dkk!|Ak|, 1+dkk!|Ak|].

X is again taken to be the set of xΩ which are limits of shifted Ak words For

this definition to make sense, it suffices to show that Ak is nonempty for all k, since

if this is true then X by compactness of Ω and a diagonalization argument

just as in Construction 1. A1 is nonempty, so it suffices to show that Ak

implies Ak+1 for all k. For any k, assume that wkAk. Then, enumerate the

elements of Ak by wk=a1, a2, ..., a|Ak|, and define the words uk+1=a1k!a2k! . . . a|Ak|k!

and w= (uk+1)Ck(k+1)-p(uk+11)p. Clearly for large k, w exists, and is a concatenation

of Ak-words and the word 1 with length nk+1. In w, at most a single 1 is concatenated

between any two Ak-words and 1 does not appear at the beginning or end of w. Since

(nk, k!)=1, for every 0<i<k! and xAk, x appears in uk+1 exactly once as a

concatenated Ak-word at an i (mod k!)-indexed place. Therefore, x appears in w

exactly Ck(k+1) times as a concatenated Ak-word at i (mod k!)-indexed places, and

so fri,k!*(x, w)=1|Ak|nkk! . Since i and x were arbitrary, wAk+1, and so Ak+1.

(X, σ) is totally minimal by the same proof used for Construction 2. We claim that

(X, σ) is also totally uniquely ergodic. Take any word wL(X), and any fixed integer

j. We define two sequences {mk(j)} and {Mk(j)} as follows: mk(j) is the minimum value

23

of fri,j(w, w) , where 0<i<j and w ranges over all Ak word and Mk(j) is the

maximum value of fri,j (w, w) , where 0<i<j and w ranges over all Ak-words.

Suppose that mk(j) and Mk(j) are known, and that k>j. We wish to show that amk+1(j)

and Mk+1(j) are very close to each other. Let us consider any element w of Ak+1 and,

for any fixed 0<i<j, see how few occurrences of w there could possibly be at i

(mod j)-indexed places in w. By the definition of Construction 3, for every wAk,

and 0<i<k!, the ratio of the number of times w occurs as a concatenated Ak word

in w whose first letter is a letter of w whose index is equal to i (mod k!) to the total

number of Ak-words concatenated in w is at least 1-dkk!|Ak|. Since j divides k!, then for

any 0<i<j, the ratio of the number of times that w occurs as a concatenated Ak-

word at i (mod j)-indexed places in w to the total number of Ak-words concatenated

in w is at least 1-dkj|Ak|. Since the total number of Ak-words concatenated in w is at

least nk+1nk+1, this implies that the number of such occurrences of w in w is at least

1-dkj|Ak|nk+1nk+1 for any i and w. For any w and i, the number of times that w occurs at i

(mod j)-indexed places in w as a subword of an occurrence of w that occurs at an i

(mod j)-indexed place in w is then at least 1-dkj|Ak|nk+1nk+1(nk-|w|+1)fri-i(mod j),j(w, w).

Summing over all wAk and 0<i<j, the number of occurrences of w in w at i

(mod j)-indexed places is at least

(1 -dk)nk+1nk-|w|+1nk+1wAkm=0j-1frm,j(w,w)j|Ak|.

Since w was arbitrary in Ak+1 and 0<i<j was arbitrary,

24

mk+1(j)> (1 -dk)nk+1nk+1-|w|+1nk-|w|+1nk+1wAkm=0j-1frm,j(w,w)j|Ak|.

Let us now bound from above the number of occurrences of w in w at i (mod j)-

indexed places. By precisely the same reasons as above, for any 0<i<j, the

number of occurrences of w at i (mod j)-indexed places which lie entirely within a

concatenated Ak-word in w is not more than

(1 +dk)nk+1nk-|w|+1nkwAkm=0j-1frm,j(w,w)j|Ak|.

(The denominator of the first fraction changed because there are at most nk+1nkAk-

words concatenated in w.) However, it is possible that there are occurrences of w

in w which do not lie entirely within a concatenated Ak-word in w. The number

of such occurrences of w is not more than |w|+1 times the number of concatenated

Ak words in w, which in turn is less than or equal to (|w|+1) nk+1nk. This means that

the number of occurrences of w at i (mod j)-indexed places in w is bounded from

above by

(1 +dk)nk+1nk-|w|+1nkwAkm=0j-1frm,j(w,w)j|Ak|+(|w|+1)nk+1nk,

and since 0<i<j was arbitrary, this implies that

Mk+1(j)< (1 +dk)nk-|w|+1nkwAkm=0j-1frm,j(w,w)j|Ak|nk+1

nk+1-|w|+1

+nk+1|w|+1.

nk+1-|w|+1 nk

This implies that

25

Mk+1(j)-mk+1(j)<2dknk+1nk+1-|w|+1nk-|w|+1nkwAkm=0j-1frm,j(w,w)j|Ak|

+nk+1|w|+1.

nk+1-|w|+1 nk

Since frm,j(w, w)<1 for every 0<m<j and wAk, for large k this shows that

Mk+1(j)-mk+1(j)<2dk+2(|w|+1)nk, which clearly approaches zero as k.

We now note that since

mk+1(j)> (1 -dk)nk+1nk+1-|w|+1nk-|w|+1nk+1wAkm=0j-1frm,j(w,w)j|Ak|,

and since by definition frm,j(w, w)>mk(j) for all wAk, we see that mk+1(j)>

(1 -dk)nk+1nk+1-|w|+1nk-|w|+1nk+1mk(j), and so

mk+1(j)-mk(j)>mk(j)[(1-dk)(1+|w|nk+1-|w|+1)(1-|w|-1nk+1)-1]

>-mk(j)(dk+|w|nk+1)>-(dk+|w|nk).

By almost completely analogous reasoning, for large k

Mk+1(j)-Mk(j)<Mk(j)[(1+dk)(1+|w|-1nk+1-|w|+1)(1-|w|-1nk)-1]

+nk+1nk+1-|w|+1|w|+1nk<Mk(j)dk+2(|w|+1)nk<dk+2|w|+1nk.

Therefore,

mk+1(j)<Mk+1(j)<Mk(j)+dk+2|w|+1nk<mk(j) a 2dk-1+2(|w|+1)nk-1+dk+2|w|+1nk

26

<mk(j)+2dk-1+dk+4|w|+3nk-1,

so |mk+1(j)-mk(j)|<dk+2dk-1+4|w|+3nk-1. In a completely analogous fashion, |Mk+1(j)-

Mk(j)|<dk+2dk-1+3|w|+2nk-1. We know that k=1dk converges, and since nk>2k for

all k, k=11nk converges as well. Therefore, we see that the sequences {mk(j)} and

{Mk(j)} are Cauchy, and converge. Since we also showed that Mk(j)-mk(j)0, we

know that they have the same limit, call it (y.

This implies that for very large k, |fri,j(w, w)-α| is very small for every 0<i<j and

wAk. We claim that this, in turn, implies that for very large N, |fri,j(w, w)-α| is

very small for every word wL(X) of length N: fix any ε>0, and take k such that

|fri,j(w, w)-α|<: for every 0<i<j and wAk, and such that 1+|w|nk<ε4. Then

for any word wL(X) of length at least 8nkε, w is a subword of a concatenation

of Ak-words and copies of the word 1. The number of full Ak-words appearing in

the concatenation formi ng w iiss aatt lleeaasstt |w|nk+1-2, and aatt mmoosstt |w|nk. So, the number

of occurrences of w at i (mod j)-indexed places in w which are contained entirely

within a concatenated Ak-word is at least ((nk-|w|+1)|w|nk+1-2(nk-|w|+1))(α-ε2) >

|w|((1-ε4)-ε4)(α-ε2)>|w|(α-ε), and at most |w|nk-|w|+1nk(α+ε2)<|w|(α+ε2).

Since there are at most (|w|+1)|w|nk<|w|ε4 occurrences of w not contained entirely

within a concatenated Ak-word, this implies that fri,j(w, w) is at least (y -ε, and

at most a +ε. Since for any ε>0, this statement is true for any long enough word

wL(X) and 0<i<j, we see that 1ni=0n-1χ[w](σijy) a uniformly for yX.

Since w was arbitrary, and since characteristic functions of cylinder sets are dense in

C(X), 1ni=0n-1f(σijy) approaches a uniform limit for all fC(X), and so (X, σj)

27

is uniquely ergodic for every j N. Since an invariant measure for (X, σ) would be

invariant for any (X, σj) as well, the unique invariant measure is the same for every

j.

Finally, we claim that (X, σ) is also topologically mixing. Consider any words w, w

L(X). By construction, there exists k so that there are Ak words y, y with w a

subword of y and w a subword of y. We also claim that for any nk+16<i<5nk+16, there

exists an Ak+1 word bi where bi(i+1)bi(i+2) . . . bi(i+nk)=y, and similarly biAk+1

with bi(i+1)bi(i+2) . . . bi(i+nk)=y. We show only the existence of bi, as the proof for

bi is trivially similar. Consider any nk+16<i<5nk+16, and take j=i (mod nk). Then,

if we enumerate the elements of Ak by a1, a2, ..., a|Ak|, first define the word uk+1=

a1k!a2k!... a|Ak|k!, and then define the word yi=(uk+11)j(uk+1)Ck(k+1)-p(uk+11)p-j. yi

has the property that yi(i+1)yi(i+2) . . . yi(i+nk) is a concatenated Ak-word in yi as

long as yi(i+1)yi(i+2) . . . yi(i+nk) lies in the subword (uk+1)Ck(k+1)-p of yi, which is

true for large k and i(nk+16, 5nk+16). This means that if we reorder a1, ..., a|Ak| in the

definition of uk+1, we may create a word bi where bi(i+1)bi(i+2) . . . bi(i+nk)=y.

biAk+1, since for every 0<i<k! and xAk, x appears exactly C(k+1)

times in bi as a concatenated Ak-word at i (mod k!)-indexed places, implying that

fri,k!*(x, bi)=1|Ak|nkk! (This uses the fact that ( nk, k!)=1, which was already shown.)

We create bi in the same way for each i. Since w is a subword of y and w is a subword

of y, for every nk+16+nk<i<5nk+16-nk, it is easy to choose a word zi to be bj

for properly chosen j so that zi(i+1) . . . zi(i+|w|)=w, and similarly zi so that

zi(i+1) . . . zi(i+|w|)=w. For large k, this means that we can construct such zi

and zi for any i[nk+15, 4nk+15].

28

We will now use these zi and zi to prove that for any n>|w|+nk+1, there exists a

word xL(X) of length n such that rvxrv' L(X). We do this by proving a lemma:

Lemma 2.2.8. For any t >k+1, and for any 0<i, j <nt such that there exists an

At word x where x(i+1)x(i+2) . . . x(i+nk+1) and x(j+1)x(j+2) . . . x(j+nk+1) are

concatenated Ak+1-rvords in x, and for any trvo Ak+1 words z and z, there exists an

At word x where x(i+1)x(i+2)... x(i+nk+1)=z and x(j+1)x(j+2) . . . x(j+

nk+1)=z.

Proof: We prove this by induction. First we prove the base case t=k+2; take an

Ak+2 word x where x(i+1)x(i+2) . . . x(i+nk+1) and x(j+1)x(j+2) . . . x(j+nk+1)

are concatenated Ak+1-words in x, call them a and b respectively. Since x is an Ak+2-

word, there exists an occurrence of z at an ( i (mod (k+1)!)) (mod (k+1) !)-indexed

place, i.e. there exists ii (mod (A +1)!) such that x(i+1)x(i+2) . . . x(i+

nk+1)=z. Similarly, there exists jj (mod (A +1)!) such that x(j+1)x(j+

2) . . . x(j+nk+1)=z. We now create x by leaving almost all of x alone, but defining

x(i+1) . . . x(i+nk+1)=z, x(j+1) . . . x(j+nk+1)=z, x(i+1) . . . x(i+nk+1)=a,

and x(j+1) . . . x(j+nk+1)=b. This new word x is still a concatenation of Ak+1-

words and ones, and since we switched two pairs of Ak+1-words which occurred at

indices with the same residue class modulo (k+1)!, fri,(k+1)!*(w, x)=fri,(k+1)!*(w, x)

for all 0<i<(k+1)! and wAk+1. Therefore, x is an Ak+2-word, with z and z

occurring at the proper places, completing our proof of the base case.

Now, let us assume that the inductive hypothesis is true for a certain value of t,

and prove it for t+1. Consider an At+1 word x where x(i+1) . . . x(i+nk+1) and

29

x(j+1) . . . x(j+nk+1) are concatenated Ak+1-words in x, call them a and b. Call

the concatenated At-word that x(i+1) ... x(i+nk+1) is a subword of a, and denote

by b the corresponding At-word for x(j+1) . . . x(j+nk+1) b. From now on, when

we speak of these words a, b, a, b, we are talking about the pertinent occurrences at

the places within x already described. There are two cases; either a and b are the

same; i.e. the same At-word in x, occurring at the same place, or they are not. If a

and b do occur at the same place, then by the inductive hypothesis, there exists an

At-word c with an occurrence of z at the same place as a occurs in a=b, and an

occurrence of z at the same place as b occurs in a=b. If we can replace a=b

by c in x, then we will be done. If a and b do not occur at the same place, then

since a is a concatenated Ak+1-word in a, by the inductive hypothesis there exists

an At-word a such that a has z occurring at the same place where a occurs in a.

Similarly, there exists an At-word b such that b has an occurrence of z at the same

place where b occurs in b. If we replace a by a and b by b in x, then we will be

done. So regardless of which case we are in, our goal is to replace one or two chosen

At-words within x with one or two other At-words. We will show how to replace

two, which clearly implies that replacing one is possible. We wish to replace a by

a and b by b. We do this in exactly the same way as in the base case; say that

a=x(i+1) . . . x(i+nt) and b=x(j+1) . . . x(j+nt). Since aAt, there

exists i=i (mod t!) and j=j (mod t!) such that x(i+1) . . . x(i+nt)=a

and x(j+1) . . . x(j+nt)=b. As in the base case, we create x by making

x(i+1) . . . x(i+nt)=a and x(i+1) . . . x(i+nt)=a, x(j+1) . . . x(j+nt)=b,

30

and x(j+1) . . . x(j+nt)=b. Then x is an At+1- word, and by construction

x(i+1) . . . x(i+nk+1)=z and x(j+1) . . . x(j+nk+1)=z.

Choose any sequence {vm} of Am-words for all m>k+1. For any such m, take

Pm={ n : vm(n+1) ... vm(n+nk+1) is a concatenated Ak+1 word in vm }. Since

vm is a concatenation of Ak+1-words and ones, if we write the elements of Pm as

p1(m)<p2(m)<...<pt(m), then for any 1<l <t, pl+1(m)-pl(m)< a k+1+1. For any

1<l <t, and i, j[nk+15, 4nk+15], by Lemma 2.2.8, there exists an Am word v with the

property that v(p1(m)+1) ... v(p1(m)+nk+1)=zi and v(pl(m)+1) ... v(pl(m)+nk+1)=zj.

This implies that there is a subword of v of the form rvxrv' where the length of x is

pl(m)-p1(m)+(j-i)-|w|. We note that j-i can take any integer value between -3nk+15

and 3nk+15 inclusive. Therefore, the set of possible lengths of x for which rvxrv' L(X)

contains

l=2t{(|w|+pl(m)-p1(m)+[-3nk+15, 3nk+15])}.

When l is increased by one, pl(m) is increased by at most nk+1+1. This, along with

the fact that the intervals [-3nk+15, 3nk+15] have length 6nk+15, which for large k exceeds

nk+1+1, implies that this set of possible lengths of v contains [ |w|+p2(m)-p1(m) -

3nk+15, |w|+pt(m)-p1(m)+3nk+15][|w|+nk+1, |w|+nm-2nk+1]. Since this entire

argument could be made for any m, we see that for any n>|w|+nk+1, there exists

xL(X) of length n so that rvxrv' L(X). Then, for any nonempty open sets

31

U, VX, there exist w and w such that [w]U and [w]V. By the above

arguments, there exists N so that for any n>N, [w]σn[w], implying that

UσnV. This shows that (X, σ) is topologically mixing.

2.3 Some symbolic counterexamples

Proof of Theorem 2.1.15: We take the continuous function f(y)=y(0) for all

yX, and first note that

{yX : 1Nn=0N-1f(σpny) does not converge}

(n>0k>n{yX : fr0,1 (0, y(p1)y(p2). . . y(pk))<14})

(n>0k>n{yX : fr0,1(0, y(p1)y(p2)...y(pk))>34}),

and that the latter set, call it B, is clearly a Gδ. We will choose nk so that B is dense

in X. This will imply that B is a dense Gδ, and since X is a complete metric space,

by the Baire category theorem, that B is residual, which will prove Theorem 2.1.15.

Now let us describe the construction of {nk}.

Recall that we have assumed that the sequence {pn} has upper Banach density zero.

We define A:={pn :nN}. We also define the intervals of integers Bj=

[2j!, (j+1)!]N for every jN, and take any partition of N into infinitely many

disjoint infinite sets in N, call them C1, C2, . . .. Define the set D1=jC1Bj, and

32

then define the set A1={pn : nD1}+1. Next, choose some r2 large enough so that

(minCr2)!>2ċ2, and define D2=jCr2Bj, and then define A2={pn : nD2}+2.

Continuing in this way, we may inductively define Ak, Dk for all kN so that for

1ll k, Dk=jCrkBj for some rk with the property that (minCrk)! >2k, and

Ak={pn :nDk}+k. We will verify some properties of these sets. Most

importantly, we denote by H the union n=1An, and claim that d*(H)=0. We

show this by noting that H has a certain structure; H consists of shifted subintervals

of A, separated by gaps which approach infinity. More rigorously:

Lemma 2.3.1. There exist intervals Ik=[ak, bk]N and integers jk such that

H=k=1((AIk)+jk) and such that limk(min((AIk+1)+jk+1)-max((A

Ik)+jk))=.

Proof: Take the set Q=k=1Crk, and denote its members by q1<q2< . . ..

Then, for any k, Bqk is a subset of some Ds. The interval Ik is then defined to be

[p2(qk)!, p(qk+1)!)], (which means ak=p2(qk)! and bk=pqk(qk)!) and jk is defined to be s.

It is just a rewriting of the definition of the Ak that H=k=1((AIk)+jk) with

these notations. All that must be checked is that limk(min((AIk+1)+jk+1)-

max((AIk)+jk))=. We will show that ak+1+jk+1-bk-jk, which

implies the desired result. Since qk+1>qk, ak+1-bk=(qk+1)!>(qk)!. So, we must

simply show that (qk)!-jk. Suppose that Bqk is a subset of Ds. Then jk=s.

We also note that by construction, (minCrs)!>2s. But, since BqkDs, qkCrs,

and so minCrs<qk. Therefore, (qk)!>2s, and so (qk)!-jk=(qk)!-s>(qk)!2 , which

33

clearly shows that this quantity approaches , since {qk} is an increasing sequence

of integers.

We now prove a general lemma that implies, in particular, that d*(H)=0.

Lemma 2.3.2. If d*(A)=0, and if there exist intervals Ik=[ak, bk]N and integers

jk such that limk ( min ((AIk+1)+jk+1)-max((AIk)+jk))=, then the

set B=k=1(AIk)+jk has upper Banach density zero.

Proof: Fix ε>0. By the fact that d*(A)=0, there exists N such that for any interval

J of integers of length at least N, |AJ||J|<ε. Take J to be any interval of integers of

length exactly N. Since limk(min((AIk+1)+jk+1)-max((AIk)+jk))=,

there is some K such that if J has nonempty intersection with (AIk)+jk for some

k>K, it is disjoint from (AIk)+jk for every kk. Therefore, for intervals

J of integers of length N with large enough minimum element, JB consists of a

subset of a shifted copy of JA, and so |BJ||J|<|AJ||J|, for some interval J of integers

whose length is also N. This means that in this case, |BJ||J|<ε. We have then shown

that for every ε, there exist N, M such that for any interval of integers J of length

N with minJ>M, |BJ||J|<ε. We will show that this slightly modified definition

still implies that d*(B)=0. Again fix ε>0, and define M and N as was just done.

Now consider any interval of integers I with length at least N+Mε. Then, partition I

into subintervals: define I0=I {1, ..., M}, and then break II0 into consecutive

subintervals of length N, called I1, I2, ..., Ik. There may be one last subinterval left

34

over of length less than N; call it Ik+1 (which may be empty.) Note that |I|>Nk,

or N|I|<1k. We see that

|BI||I|=|BI0||I|+i=1k|BIi||I|+|BIk+1||I|<M|I|+N|I|(i=1k|BIi||Ii|)+N|I|

<M+N|I|+1k(kε)<2ε.

Since ε was arbitrary, d*(B)=0.

By combining Lemmas 2.3.1 and 2.3.2, d*(H)=0. We will now create X. We note

that this part of our construction uses only the fact that d*(H)=0, and no other

properties. We take n1=1 and A1={0,1}. We recall that {nk} must be a sequence

of integers with the following properties: for all k>1, nk+1=Ck(k+1)!|Ak|nk+p

for some positive integer Ck>nk>1dk and prime nk<p<2nk. We also require nk

to grow quickly enough so that for all k, and for any interval of integers I of length

at least nk+1, |IH||I|<dk2k!|Ak|nk. That we may choose such nk is a consequence of the

fact that d*(H)=0. Using these nk, we define Ak as in Construction 3. We now

prove a lemma:

Lemma 2.3.3. For any k N, m Z, and for any u {0, 1}N, there exists an

Ak-rvord vu,k,m such that vu,k,m(i-m) =u(i) for all i H[m+1, m+nk].

Proof: This is proved by induction on k. Clearly the hypothesis is true for k=1

and for any u, m. Now suppose it to be true for a particular k. We will show

35

that it is true for k+1 and every u, m. We again construct an auxiliary word

uk+1 : enumerate the elements of Ak by a1, a2, ..., a|Ak|. Then, we again define the

word uk+1=a1k!a2k!... a|Ak|k!. Define vk+1= (uk+11)p(uk+1)Ck(k+1)-p, where nk+1=

Ck(k+1)!|Ak|nk+p as above. We note that for any 0<i<k! and wAk,

w occurs exactly Ck(k+1) times as a concatenated Ak-word at i (mod k!)-indexed

places in vk+1. (This uses the fact that (nk, k!)=1 for all k, which has already

been shown.) Now, fix mZ. We wish to construct an Ak+1 word vu,k+1,m such

that vu,k+1,m(i-m)=u(i) for all iH[ m+1, m+ a k+1]. We begin with the

Ak+1 word vk+1. Clearly it is not necessarily true that vk+1(i-m) =u(i) for all

iH[m+1, m+nk+1]. We force this condition to be true by changing some of the

Ak-words concatenated in vk+1. We show that this is possible; for any concatenated

Ak- word in vk+1, say vk+1(j)vk+1(j+1) ... vk+1(j+nk-1), the necessary condition is

that ones or zeroes (depending on u) be introduced at digits whose indices are of the

form i-m for all iH[ m+j+1, ..., a +j+nk-1]. To do this, we replace this Ak-

word by vu,k,m+j, which by the inductive hypothesis has the correct digits of u at the

desired places. So, we may change vk+1 into a concatenation of Ak-words and ones,

call it vu,k+1,m, which has the proper digits of u in all desired places. This may be done

by changing at most |H[m+1, ..., m+nk+1]|<dknk+12k!|Ak|nk<Ck(k+1)dkAk-words.

Therefore, since for any 0<i<k! and wAk, w occurred in vk+1 as a concatenated

Ak-word at i (mod k!)-indexed places exactly Ck(k+1) times, w occurs in vu,k+1,m as

a concatenated Ak-word at i (mod k!)-indexed places between Ck(k+1)(1-dk) and

Ck(k+1)(1+dk) times. This implies that fri,k!*(w, vk+1,m)[1-dkk!|Ak|, 1+dkk!|Ak|], and since

36

i and w were arbitrary, that vk+1,m is an Ak+1-word. By induction, Lemma 2.3.3 is

proved.

This implies in particular that for every u, k there exists an Ak-word vu,k,-nk-1 with

vu,k,nk-1(i+nk-1)=u(i) for all iH(-nk-1, ..., nk-nk-1]. By a standard

diagonalization argument, there exists a sequence {kj} and xΩ such that for

all iZ, x(i)=vu,kj,-nkj-1(i+nkj-1) for all large enough j. Since for every j,

vu,kj,-nkj-1(i+nkj-1)=u(i) for all iH(-nkj-1, nkj-nkj-1], clearly x(i)=u(i)

for all iH. Since x is a limit of shifted Ak-words, xX. As mentioned above,

this entire construction could be done with any set of zero upper Banach density in

place of H, which lets us state the following corollary:

Corollary 2.3.4. For any C N with d*(C)=0, there exists (X, σ) totally mini-

mal, totally uniquely ergodic, topologically mixing, and with the property that for any

sequence u {0, 1}N, there exists xuX such that xu(i)=u(i)