]>
SOME RESULTS ON RECURRENCE AND ENTROPY
DISSERTATIO
Presented in Partial Fulfillment of the Requirements for
the Degree Doctor of Philosophy in the Graduate
School of the Ohio State University
By
Ronald Lee Pavlov Jr., B.S., .
The Ohio State University
2007
Dissertation Committee:
. Approved by
Professor Vitaly Bergelson, Advisor
Professor Alexander Leibman
Advisor
Professor Manfred Einsiedler Graduate Program in
Graduate Program in Mathematics
ABSTRACT
This thesis is comprised primarily of two separate portions. In the first portion, we
exhibit, for any sparse enough increasing sequence of integers, a totally minimal,
totally uniquely ergodic, and topologically mixing system and for
which the averages fail to converge on a residual set in , answering
negatively an open question of Bergelson. We also construct here a totally minimal,
totally uniquely ergodic, and topologically mixing system and for
which .
In the second portion, we study perturbations of multidimensional shifts of finite
type. Given any shift of finite type for and any word in the language of
, denote by the set of elements of in which does not appear. If satisfies
a uniform mixing condition called strong irreducibility, we obtain exponential upper
and lower bounds on dependent only on the size of . This result
generalizes a result of Lind about shifts of finite type.
ii
To Dilip
ill
ACKNOWLEDGMENTS
First and foremost, I would like to thank my advisor Vitaly Bergelson. This the-
sis could never have been completed without his help. His infectious enthusiasm
for mathematics, along with a willingness to share his knowledge, were a constant
inspiration.
I would also like to thank my other committee members, Professors Alexander Leib-
man and Manfred Einsiedler, for agreeing to serve on my committee and for many
fruitful mathematical discussions.
Finally, I thank my friends and loved ones, without whom I would never have been
able to get as far as I have. In no particular order:
Thank you to Mom and Dad, who have always been willing to do anything to
support me and have always believed in me.
Thank you to Greg, without whose friendship and sense of humor graduate school
would have been much less bearable.
Thank you to Jasmine, who was always there for me. I can never thank you
enough for your constant love and support.
IV
VITA
2000-Present . . . . . . . . . . . . . . . . . . . . . . . . . . Graduate Teaching Associate,
The Ohio State University
1998-1999 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Student Instructional Associate,
The Ohio State University
2003 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . in Mathematics,
The Ohio State University
2000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . in Mathematics,
The Ohio State University
PUBLICATIONS
Research Papers
Some Counterexamples In Topological Dynamics (Ergodic Theory and Dynam-
ical Systems, to appear)
Perturbations of Multidimensional Shifts of Finite Type (submitted)
FIELDS OF STUDY
Major Field: Mathematics
Specialization: Ergodic Theory and Dynamical Systems
vi
TABLE OF CONTENTS
Abstract ii
Dedication .
Acknowledgments . 1V
Vita
List of Figures ix
CHAPTER PAGE
1Introduction 1
2Recurrence and convergence of ergodic averages along sparse sequences 8
2.1 Introduction 8
2.2 Some general symbolic constructions 16
2.3 Some symbolic counterexamples 31
2.4 Some general constructions on connected manifolds . 42
2.5 Some counterexamples on connected manifolds 55
2.6 A counterexample about simultaneous recurrence 61
2.7 Questions 65
3Perturbations of multidimensional shifts of finite type 67
3.1 Introduction 67
3.2 Some measure-theoretic preliminaries 84
3.3 A replacement theorem 93
3.4 The proof of the main result 114
3.5 A closer look at the main result 146
3.6 An application to an undecidability question 155
3.7 Questions 159
Vll
Bibliography
161
Vlll
LIST OF FIGURES
FIGURE PAGE
3.1 's action on a sample element of 73
3.2 A portion of a sample element of 76
3.3 77
3.4 77
3.5 85
3.6 A standard replacement of . 95
3.7 A sample element of 97
3.8 An element of associated to two standard replacements 98
3.9 The suboctants of 99
3.10 Elements , of whose difference is a multiple of tOO
3.11 An element of which contains 106
3.12 107
3.13 108
3.14 Intersecting occurrences of 116
3.15 120
3.16 Disallowed and allowed pairs of overlapping 127
3.17 A point 138
ix
3.18 The correspondence between copies of and points in
139
3.19 How a copy of is filled if 140
3.20
141
(JIAPTER 1
INTRODUCTION
This introduction will serve as a brief mathematical and historical overview of both
of the main problems that we will examine in this thesis. Due to their somewhat
disparate natures, the two portions of this thesis will each contain a more in-depth
introduction as well. For this reason, we will for the most part relegate formal defi-
nitions to their pertinent introduction.
Ergodic theory is the study of average or long-term behavior of systems which evolve
with time. For example, one can consider a particle bouncing in a box at fixed speed.
Its position and velocity at any time can be represented by a vector in . One can
then model this behavior by taking to be the space of all possible positions and
velocities for the particle, and a self-map of which, given a position and velocity,
gives the position and velocity one second later. This pairing of a space with a
self-map is called a dynamical system.
The more general setup is that of a group acting on a space with some sort
of structure by maps preserving this structure. In this thesis, is always
for some N. Measure-theoretic ergodic theory occurs when is a measure
space with a probability measure invariant under each , and so in this case we call
1
a measure-preserving dynamical system. Topological dynam-
ics occurs when is a compact topological space and each is a homeomorphism,
and so in this case we call a topological dynamical system. In ei-
ther type of system, if , then for any integer , and so we shorten
to and to . (Here )
These cases are not as disjoint as they may first appear: the famed Bogoliouboff-
Krylov theorem states that any topological dynamical system has at
least one invariant probability measure as long as is an amenable group. (An
examination of amenability is beyond the scope of this thesis, but we mention that
amenable groups are a class which contain all abelian groups. We only consider
in this thesis, so all topological dynamical systems examined here will possess
invariant measures.) This sometimes allows one to examine topological properties of
a system via properties of invariant measures. For instance, in Chapter 3, we will
prove some purely combinatorial properties of a -action by homeomorphisms of a
Cantor set by means of studying the invariant measures of this action.
Chapter 2 deals primarily with ergodic averaging. One of the fundamental results of
ergodic theory is Birkhoff's ergodic theorem:
Theorem 1.0.1. ([Bi]) For any measure-preserving dynamical system
and any , exists for -almost every .
Several results have been proven about the convergence of such averages when one
averages not along all powers of , but only along some distinguished subset of the
2
integers. ([Bou], [Bou2], [Wi]) In particular, when one averages along for a
polynomial with integer coefficients, there is the following result of Bourgain:
Theorem 2.1.12. ([Bou], p. 7, Theorem t) For any measure-preserving dynamical
system , for any polynomial , and for any with
, exists for -almost every .
Theorem 2.1.12 can be interpreted as follows: for any polynomial , any
measure-preserving system , and any measure-theoretically "nice" func-
tion , the set of points where fails to converge is of mea-
sure zero, or negligible measure-theoretically. It is then natural to wonder whether or
not there is a topological parallel to this result using topological notions of "niceness"
(continuity) and negligibility (first category), and in fact such a question was posed
by Bergelson:
Question 2.1.13. ([Be], p. 51, Question 5) Assume that a topological dynamical
system is uniquely , and let and . Is it true that for
all but a first category set of points exists?
One of our results is a (quite negative) answer to Question 2.1.13:
Theorem 2.1.15. For any increasing sequence of integers with upper Banach
density , there exists a totally minimal, totally uniquely ergodic, and topologically
1A topological dynamical system is uniquely ergodic if there exists exactly one T-
invariant measure .
set has upper Banach density zero if .
3
mixing topological dynamical system and a continuous function on with
the property that fails to converge for a residual set of .
We also examine recurrence along distinguished subsets of the integers, motivated
primarily by the following result:
Theorem 2.1.17. ( , p. 14, Corollary 1.8) For any , any topolog-
ical dynamical system and any polynomials , , ,
with for , for a residual set of there exists a sequence
of positive integers such that for , where is the
standard orthonormal basis of .
In particular, this implies that for any minimal topological dynamical system
and any polynomial with , the set of with
is residual. The following result shows that for with degree at least two, this residual
set is not necessarily all of .
Theorem 2.1.18. For any increasing sequence of integers with upper Banach
density zero, there exists a totally minimal, totally uniquely ergodic, and topologically
mixing topological dynamical system (X,T) and an uncountable set A such that
for every x , the sequence does not have x as a limit point, i.e. there is
no sequence of positive integers such that x converges to x.
Another consequence of Theorem 2.1.17 is that for any commuting minimal homeo-
morphisms and of a compact space , there exists a residual set of for which
sA topological dynamical system is minimal if there exist no nonempty proper closed
-invariant subsets of .
4
there exists a sequence of positive integers such that and .
The following theorem shows that for some systems, it is the case that this residual
set of is not all of .
Theorem 2.1.21. There exists a totally minimal topological dynamical system (X,T)
and a point x such that for any positive integers r , any sequence of
integers satisfying and is eventually zero.
The systems which we construct to prove Theorems 2.1.15 and 2.1.18 are symbolic
dynamical systems. A symbolic dynamical system is defined by first choosing a
finite set , called the alphabet. endowed with the product topology is a
compact space, and for any , we may define the shift homeomorphism by
for Q. Any closed set has a topology induced by , and
if is invariant under each , then is a topological dynamical system
which we call a symbolic dynamical system.
In Chapter 3, we examine symbolic dynamical systems exclusively. There, we consider
a specific type of symbolic dynamical system called a shift of finite type. A -shift
of finite type is a symbolic dynamical system where is defined by
specifying a finite set of finite words (a word is a function from a finite subset of
to ) and taking to be the set of all elements of in which none of the words
in appear. The shift of finite type specified by a finite set of words in this
way is denoted by . For example, the set of all biinfinite sequences of zeroes and
ones in which no two ones occur consecutively is a shift of finite type, as is the set of
5
all arrays of zeroes and ones in which no three-by-four blocks of all zeroes or all
ones appear.
We are particularly interested in the effects of forbidding a particular word from a
shift of finite type . Given any word , define to be the set of elements of in
which does not appear. Then clearly is a subset of , and so the topological
entropy of is not greater than the topological entropy of . It
is natural to wonder how much the topological entropy drops by when is removed,
as it is a sort of measure of how important is to the information-retaining capacity
of . In [L], Lind proved the following: (the condition means that
and that appears in some element of )
Theorem 3.1.16. ([L], p. 360, Theorem 3) For any topologically transitive Z-shift
of finite type with positive topological entropy , there exist constants
, , and such that for any and any word , if we denote
by the shift of finite type , then
.
Our main result is a generalization of Theorem 3.1.16 for -shifts of finite type
which satisfy a mixing condition called strong irreducibility. (We formally define
strong irreducibility in Definition 3.1.19.)
Theorem 3.1.22. For any d 1 and any strongly irreducible -shift X
of finite type with uniform filling length R and positive topological entropy ,
6
there exist constants and such that for any and any word
, if we denote by the shift of finite type , then
.
One way in which -shifts of finite type are more difficult to deal with for is
that given a finite collection of patterns, it is undecidable whether or not the shift
of finite type induced by is even nonempty. Our methods yield a situation in
which this question can be answered:
Theorem 3.6.1. For any alphabet A, there exist F, G such that for any m
and any finite set of words : satisfying
and for , .
7
(JIAPTER 2
RECURRENCE AND CONVERGENCE OF ERGODIC
AVERAGES ALONG SPARSE SEQUENCES
2.1 Introduction
In this chapter, we are concerned with the convergence of averages of the form
for an increasing sequence of integers . We begin with some
definitions.
Definition 2.1.1. A measure-preserving dynamical system
consists of a measure space , a probability measure with -algebra 8 of measurable
sets, and a group action of transformations : with
for all .
Definition 2.1.2. A measure-preserving dynamical system (X,B, is er-
godic if any set A satisfying A for all g has {0,1}.
Definition 2.1.3. A topological dynamical system (X, consists of
compact topological space X and a group action of homeomorphisms :
X .
8
Definition 2.1.4. Given a topological dynamical system (X,, a Borel prob-
ability measure on X is called ergodic if (X,, , is an ergodic
measure-preserving dynamical system, where is the Borel -algebra of X.
In this chapter, all dynamical systems have , so as already described we
will use the notations and for measure-preserving and topological
dynamical systems respectively.
Definition 2.1.5. A topological dynamical system (X,T) is minimal if for any
closed set K with , K or K . (X,T) is totally minimal if
(X, is minimal for every n O.
Definition 2.1.6. A topological dynamical system (X,T) is uniquely ergodic if
there is only one Borel measure on X such that for every Borel
set X. (X,T) is totally uniquely ergodic if (X, is uniquely ergodic for
every n .
Definition 2.1.7. A topological dynamical system (X,T) is topologically mixing
if for any open sets U, , there exists N such that for any n N,
.
Definition 2.1.8. For any set , the upper Banach density of A is defined
by
.
9
Definition 2.1.9. For any set , the upper density of A is defined by
.
Definition 2.1.10. For a set , the density of A is defined by
if this limit exists.
Definition 2.1.11. Given a topological dynamical system (X,T) and a T-invariant
Borel probability measure , a point x is (T,-generic if for every f ,
.
In the measure-preserving setup, there are several positive results about convergence
of averages of the form , including this theorem of Bourgain:
Theorem 2.1.12. ([Bou], p. 7, Theorem t) For any measure-preserving dynamical
system , for any polynomial , and for any with
, exists for -almost every .
The following question regarding a possible topological version of Theorem 2.1.12 was
posed by Bergelson:
Question 2.1.13. ([Be], p. 51, Question 5) Assume that a topological dynamical
system is uniquely ergodic, and let and . Is it true that for
all but a first category set of points exists?
to
Bergelson added the hypothesis of unique ergodicity because it is a classical result
that a system is uniquely ergodic with unique -invariant measure if and
only if for every and , , and so in
the topological setup this is a natural assumption to make about to achieve
the desired result.
However, Bergelson was particularly interested in the convergence of these averages
to the "correct limit," i.e. where is the unique -invariant measure on
. To have any hope for such a result, it also becomes necessary to assume ergod-
icity for all powers of in order to avoid some natural counterexamples related to
distribution (mod ) of for positive integers . For example, if , is
the permutation on defined by (mod 3), is normalized
counting measure on , and , then is obviously uniquely ergodic with
unique invariant measure , but
, .and
To avoid such examples, we would need to be totally ergodic as well as uniquely
ergodic, and so it makes sense to assume total unique ergodicity to encompass both
properties. Bergelson's revised question then looks like this:
Question 2.1.14. Assume that a topological dynamical system (X,T) is totally uniquely
11
ergodic with unique -invariant measure , and let and . Is it true
that for all but a first category set of points ?
We answer Questions 2.1.13 and 2.1.14 negatively in the case where the degree of
is at least two, and in fact prove some slightly more general results. The level of
generality depends on what hypotheses we place on the space . In particular, we
can exhibit more counterexamples in the case where is a totally disconnected space
than we can in the case where is a connected space such as .
Theorem 2.1.15. For any increasing sequence of integers with upper Banach
density zero, there exists a totally minimal, totally uniquely ergodic, and topologically
mixing topological dynamical system and a continuous function on with
the property that fails to converge for a residual set of .
Theorem 2.1.16. For any increasing sequence of integers with the property
that for some integer , for all suffiffifficiently large , there ex-
ists a totally minimal, totally uniquely ergodic, and topologically mixing topological
dynamical system and a continuous function on with the property that
fails to converge for a residual set of . In addition, the space is
a connected 9-manifold.
We note that Theorems 2.1.15 and 2.1.16 answer Question 2.1.13 negatively for
with degree at least two, since the sequence for any nonlinear
satisfies the hypotheses of both theorems.
Theorems 2.1.15 and 2.1.16 are about nonconvergence of ergodic averages along cer-
tain sequences of powers of . We also prove two similar results about nonrecurrence
12
of points. As motivation, we note that a minimal system has the property that every
point is recurrent. In other words, if is minimal, then for all , it is the
case that . If is totally minimal, then all points are recurrent
even along infinite arithmetic progressions: for any nonnegative integers , , and for
all , . It is then natural to wonder if the same is true for other
sequences of powers of , and in this vein there is the following result of Bergelson
and Leibman, which is a corollary to their Polynomial van der Waerden theorem:
Theorem 2.1.17. ( , p. 14, Corollary 1.8) For any , any minimal topolog-
ical dynamical system and any polynomials , , ,
with for , for a residual set of there exists a sequence
of positive integers such that for , where is the
standard orthonormal basis of .
In particular, Theorem 2.1.17 implies that for a minimal system and any
polynomial with , the set of such that is residual. If
is assumed to be totally minimal, then by definition if the degree of is one,
then every point is in . The following two results imply that if the
degree of is greater than one, total minimality of does not necessarily imply
that every point is in .
Theorem 2.1.18. For any increasing sequence of integers with upper Banach
density zero, there exists a totally minimal, totally uniquely ergodic, and topologically
mixing topological dynamical system (X,T) and an uncountable set A such that
13
for every , the sequence does not have as a limit point, . there is
no sequence of positive integers such that converges to .
Theorem 2.1.19. For any increasing sequence of integers with the property
that for some integer , for all suffiffifficiently large , there exists
totally minimal, totally uniquely ergodic, and topologically mixing topological dynam-
ical system and a point such that the sequence does not have
as a limit point, . there is no sequence of positive integers such that
converges to . In addition, the space is a connected 7-manifold.
The following simple lemma shows that if is topologically mixing, then The-
orems 2.1.18 and 2.1.19 cannot be improved too much, i.e. there is no increasing
sequence for which we can exhibit topologically mixing examples with a second
category set of such nonrecurrent points.
Lemma 2.1.20. If a topological dynamical system (X,T) is topologically mixing, then
for any increasing sequence , the set of x for which x is of first
category.
Proof: Define . It is clear that is closed. We
claim that contains no nonempty open set, which shows that it is nowhere dense,
implying that the set of points for which is not a limit point of
is of first category. Suppose, for a contradiction, that there is a nonempty
open set with for some . Then, there exists with diam(V such that
. By topological mixing, there exists such that . This
14
implies that there exists so that . Since diam(V , .
However, , so we have a contradiction.
Theorem 2.1.18 shows that it is possible for this set of points nonrecurrent along
to be uncountable though.
We mention that some mixing condition is necessary for a statement like Lemma 2.1.20;
as a simple example, consider an irrational circle rotation : on the cir-
cle T. There is clearly some increasing sequence of integers such that
(mod ) . Then, for any , , and so for every ,
does not have as a limit point.
We also prove a result about simultaneous recurrence. Theorem 2.1.17 implies that
for any commuting minimal homeomorphisms and of a compact space , there
exists a residual set of for which there exists a sequence of positive integers
such that and . The following theorem shows that for some
systems, it is the case that this residual set of is not all of X. (In this particular
example, and are powers of the same homeomorphism.)
Theorem 2.1.21. There exists a totally minimal topological dynamical system (X,T)
and a point x such that for any positive integers r , any sequence of
integers satisfying and is eventually zero.
Before proceeding with the proofs, we now give a brief description of the content of
this chapter. In Section 2.2, we will describe some general symbolic constructions of
15
topological dynamical systems with particular mixing properties. At the end of this
section, we will arrive at a construction of a system which is totally minimal, totally
uniquely ergodic, and topologically mixing, and which has as a parameter a sequence
of integers .
In Section 2.3, by taking this sequence to grow very quickly, we will show
that the examples constructed in Section 2.2 are sufficient to prove Theorems 2.1.15
and 2.1.18. Some interesting questions also arise and are answered in Section 2.3
pertaining to the upper Banach density of countable unions of sets of upper Banach
density zero.
In Section 2.4, we create a flow under a function with base transformation a skew
product, which acts on a connected manifold, and which is totally minimal, totally
uniquely ergodic, and topologically mixing. This transformation has as a parameter
a function . We use conditions of Fayad ([Fa]) on flows under functions
to achieve topological mixing, and some conditions of Furstenberg ([Fu]) on skew
products to prove total unique ergodicity.
In Section 2.5, by a judicious choice of , we use the examples of Section 2.4 to prove
Theorems 2.1.16 and 2.1.19.
In Section 2.6, we prove Theorem 2.1.21.
Finally, in Section 2.7 we give some open questions about strengthening our results.
2.2 Some general symbolic constructions
Definition 2.2.1. An alphabet is any finite set, whose elements are called letters.
16
Definition 2.2.2. A symbolic dynamical system with alphabet is a topological
dynamical system where for any , is the shift homeomorphism
of defined by for , and is any closed subset of
invariant under each .
Definition 2.2.3. A word on the alphabet A is any element of for some finite
set F . F is called the shape of w.
For any words of length and of length , we denote by their concatenation,
i.e. the word of length . We denote by the
word torn . . . given by the concatenation of copies of .
Definition 2.2.4. Given any symbolic dynamical system (X,, the language
of X, denoted by , is the set of all words which appear as subwords of X.
All symbolic dynamical systems appearing in this chapter have alphabet {0, 1},
Z. We also restrict ourselves in this chapter to words with shape 1, , for some
. In other words, a word can be thought of for now as a finite string of letters
. The number of letters in a word is called its length.
Definition 2.2.5. A word w of length n is a subword of a sequence u if there
exists k such that for . Analogously, w is a subword
of a word v of length m if there exists -n such that for
.
We will outline three constructions which algorithmically create such that
have certain properties. (Here the "certain properties" in question depend on which
17
construction is used.) It should also be mentioned that many ideas from these con-
structions are taken from work of Hahn and Katznelson , where they also
algorithmically constructed symbolic topological dynamical systems with certain er-
godicity and mixing properties.
Construction 1: (Minimal) We define inductively and , which are, respec-
tively, sequences of positive integers and sets of words on the alphabet {0, 1}. Each
word in is of length . (We will use the term " -word" to refer to a member of
from now on.) We define these as follows: always define and .
Then, for any , is defined to be any integer greater than or equal to
which is also a multiple of , and then is chosen to be the set of words of
length which are concatenations of -words, containing each word in the
concatenation at least once.
We then define to be the set of all which are limits of shifted words In
other words, if there exist and such that for all large enough
, . Since is compact, such exist by a standard diagonalization
argument. It is easily checkable that is closed and a-invariant. The claim is that
regardless of the choice of the integers , as long as divides , and
, is minimal. It suffices to show that for any , .
Choose any and . By the definition of , there exists and some
word such that is a subword of . is an -word, so by definition,
every -word contains , and therefore , as a subword. Finally, note that again
by the definition of Construction 1, is an biinfinite concatenation of words
18
This implies that . . . contains some word, and therefore , as a
subword, and so there exists so that begins with . Since was an
arbitrary word in , this implies that , and so is minimal.
So, we have now demonstrated a way of constructing minimal . We will now
make this construction a bit more complex in order to make totally minimal.
Construction 2: (Totally minimal) We define inductively and , which are,
respectively, sequences of positive integers and sets of words on the alphabet {0, 1}.
Each word in is of length . We define these as follows: always define
and . Then, for any , is defined to be any integer greater than
or equal to , and then is chosen to be the set of words of
length which are concatenations of -words and the word 1 with the following
properties: the word 1 does not appear at the beginning or end of , only a single
1 can be concatenated between two -words, and for every , and for every
, appears in at an (mod )-indexed place. That is, there exists
(mod ) with . . . . From now on, to refer
to this second condition, we say that every occurs in at places indexed by
all residue classes modulo .
Since this construction is a bit complicated, a few quick examples may be in or-
der. Suppose that , , and we choose . Say that
is an : each word ap-
pears at least once beginning with a letter of with an odd index, and at least
19
once beginning with a letter of with an even index. Examples of words which
would not be -words include abcdllabcddabcdababcabcd (1 is concatenated twice
between and ), abcdo lbcdldbcdaabcbbbbdc (occurrences of the word begin only
with even-indexed letters), or abcdldcbabcdabcdabcdadb (wrong number of letters.)
We can then define to again be the set of all which are limits of shifted
-words. For this definition to make sense, it suffices to show that is nonempty
for all , since if this is true then by compactness of and a diagonalization
argument just as in Construction 1. is nonempty, so it suffices to show that
implies for all . For any , assume that . Then, enumerate the
elements of by efine the words . . .
and , where (mod ). Since
, exists, and is a concatenation of -words and the
word 1 with length . In , at most a single 1 is concatenated between any two
-words and 1 does not appear at the beginning or end of . Also, since the length
of is divisible by , and since all -words are subwords of , all words
appear in at places indexed by all residue classes modulo , and so all
words appear in at places indexed by all residue classes modulo as well.
Therefore, , and so is nonempty.
We will show that is totally minimal. Fix any . We wish to show that
for any , . Choose any such , and fix any word .
By definition of , there exists and an word such that is a subword of
. Without loss of generality, we assume that . By the construction,
20
occurs in every -word, and it occurs at places indexed by every residue class
modulo . Since , in particular this implies that , and therefore , occurs
in every -word at places indexed by every residue class modulo . By definition
of Construction 2, is a biinfinite concatenation of -words and the word 1.
Therefore, . . . contains some -word as a subword, and so
occurs in . . . at places indexed by every residue class modulo .
There then exists so that begins with . Since was an arbitrary word of
, this shows that , and since was arbitrary, that is
totally minimal.
We now define one more general type of construction, again more complex than the
last, so that the system created is always totally uniquely ergodic and topologically
mixing, in addition to being totally minimal. For this last construction, we first need
a couple of definitions.
Definition 2.2.6. For any integers and , and and
, we define to be the ratio of the number of occurrences of as
concatenated -rvord at (mod )-indexed places in to the total number of
-rvords concatenated in .
We consider any positive integer to be equal to 0 (mod t) for the purposes of this
definition. An example is clearly in order: if , , (in Construc-
tions 2 and 3, is always taken to be {0, 1}, but here we deviate from this for
21
illustrative purposes) and is the word Ot (here vertical bars show
where breaks in the concatenation occur), then occurs twice out of four -words,
so . Since one of these occurrences begins at and one begins at
, . We make a quick note here that there could
be some ambiguity here if an -word could be decomposed as a concatenation of
words and ones in more than one way. For this reason, we just assume that when
computing , the definition of the word includes its representation
as a concatenation of -words and ones. (i.e. in the example given, is defined
as the concatenation Of of -words and ones, rather than the nine-letter
word 011010110.)
Definition 2.2.7. Given any words of length and w of length , and any
integers , define (w, to be the number of occurrences of w at
(mod m)-indexed places in , divided by .
Taking the previous example again, , since 01 occurs three times as
a subword of 011010110. Since two of these occurrences begin at letters of with
even indices and one begins at a letter of with odd index, and
.
Construction 3: (Totally minimal, totally uniquely ergodic, and topologi-
cally mixing) We define inductively and , which are, respectively, sequences
of positive integers and sets of words on the alphabet {0, 1}. Each word in is of
length . We define these as follows: always define and . Then,
we fix any sequence of positive reals such that and define, for each
22
, some for any integer and prime
(We may choose such a by Bertrand's postulate. ) Note that
this implies that for all . We then define to be the set of words
of length with all of the same properties as in Construction 2, along with the
property that, for any , and for any , .
is again taken to be the set of which are limits of shifted words For
this definition to make sense, it suffices to show that is nonempty for all , since
if this is true then by compactness of and a diagonalization argument
just as in Construction 1. is nonempty, so it suffices to show that
implies for all . For any , assume that . Then, enumerate the
elements of by , , , , and define the words . . .
and . Clearly for large , exists, and is a concatenation
of -words and the word 1 with length . In , at most a single 1 is concatenated
between any two -words and 1 does not appear at the beginning or end of . Since
, for every and , appears in exactly once as a
concatenated -word at an (mod )-indexed place. Therefore, appears in
exactly times as a concatenated -word at (mod )-indexed places, and
so . Since and were arbitrary, , and so .
is totally minimal by the same proof used for Construction 2. We claim that
is also totally uniquely ergodic. Take any word , and any fixed integer
. We define two sequences and as follows: is the minimum value
23
of , where and ranges over all word and is the
maximum value of , where and ranges over all -words.
Suppose that and are known, and that . We wish to show that
and are very close to each other. Let us consider any element of and,
for any fixed , see how few occurrences of there could possibly be at
(mod )-indexed places in . By the definition of Construction 3, for every ,
and , the ratio of the number of times occurs as a concatenated word
in whose first letter is a letter of whose index is equal to (mod ) to the total
number of -words concatenated in is at least . Since divides , then for
any , the ratio of the number of times that occurs as a concatenated
word at (mod )-indexed places in to the total number of -words concatenated
in is at least . Since the total number of -words concatenated in is at
least , this implies that the number of such occurrences of in is at least
for any and . For any and , the number of times that occurs at
(mod )-indexed places in as a subword of an occurrence of that occurs at an
(mod )-indexed place in is then at least .
Summing over all and , the number of occurrences of in at
(mod )-indexed places is at least
.
Since was arbitrary in and was arbitrary,
24
.
Let us now bound from above the number of occurrences of in at (mod )-
indexed places. By precisely the same reasons as above, for any , the
number of occurrences of at (mod )-indexed places which lie entirely within a
concatenated -word in is not more than
.
(The denominator of the first fraction changed because there are at most
words concatenated in ) However, it is possible that there are occurrences of
in which do not lie entirely within a concatenated -word in . The number
of such occurrences of is not more than times the number of concatenated
words in , which in turn is less than or equal to . This means that
the number of occurrences of at (mod )-indexed places in is bounded from
above by
,
and since was arbitrary, this implies that
.
This implies that
25
.
Since for every and , for large this shows that
, which clearly approaches zero as .
We now note that since
,
and since by definition for all , we see that
, and so
.
By almost completely analogous reasoning, for large
.
Therefore,
a
26
,
so . In a completely analogous fashion,
. We know that converges, and since for
all , converges as well. Therefore, we see that the sequences and
are Cauchy, and converge. Since we also showed that , we
know that they have the same limit, call it .
This implies that for very large , is very small for every and
. We claim that this, in turn, implies that for very large , is
very small for every word of length : fix any , and take such that
: for every and , and such that . Then
for any word of length at least , is a subword of a concatenation
of -words and copies of the word 1. The number of full -words appearing in
the concatenation formi iiss aatt lleeaasstt , and aatt mmoosstt . So, the number
of occurrences of at (mod )-indexed places in which are contained entirely
within a concatenated -word is at least
, and at most .
Since there are at most occurrences of not contained entirely
within a concatenated -word, this implies that is at least ( , and
at most a . Since for any , this statement is true for any long enough word
and , we see that a uniformly for .
Since was arbitrary, and since characteristic functions of cylinder sets are dense in
, approaches a uniform limit for all , and so
27
is uniquely ergodic for every N. Since an invariant measure for would be
invariant for any as well, the unique invariant measure is the same for every
.
Finally, we claim that is also topologically mixing. Consider any words ,
. By construction, there exists so that there are words , with a
subword of and a subword of . We also claim that for any , there
exists an word where . . . , and similarly
with . . . . We show only the existence of , as the proof for
is trivially similar. Consider any , and take (mod ). Then,
if we enumerate the elements of by , , , , first define the word
, and then define the word .
has the property that . . . is a concatenated -word in as
long as . . . lies in the subword of , which is
true for large and . This means that if we reorder , , in the
definition of , we may create a word where . . . .
, since for every and , appears exactly
times in as a concatenated -word at (mod )-indexed places, implying that
(This uses the fact that ( , , which was already shown.)
We create in the same way for each . Since is a subword of and is a subword
of , for every , it is easy to choose a word to be
for properly chosen so that . . . , and similarly so that
. . . . For large , this means that we can construct such
and for any .
28
We will now use these and to prove that for any , there exists a
word of length such that rvxrv' . We do this by proving a lemma:
Lemma 2.2.8. For any t , and for any , j such that there exists an
word x where . . . and . . . are
concatenated -rvords in x, and for any trvo words z and , there exists an
word where and . . .
.
Proof: We prove this by induction. First we prove the base case ; take an
word where . . . and . . .
are concatenated -words in , call them and respectively. Since is an
word, there exists an occurrence of at an ( (mod !)) (mod )-indexed
place, i.e. there exists (mod (A +1)!) such that . . .
. Similarly, there exists (mod (A +1)!) such that
2) . . . . We now create by leaving almost all of alone, but defining
. . . , . . . , . . . ,
and . . . . This new word is still a concatenation of
words and ones, and since we switched two pairs of -words which occurred at
indices with the same residue class modulo ,
for all and . Therefore, is an -word, with and
occurring at the proper places, completing our proof of the base case.
Now, let us assume that the inductive hypothesis is true for a certain value of ,
and prove it for . Consider an word where . . . and
29
. . . are concatenated -words in x, call them a and b. Call
the concatenated -word that is a subword of , and denote
by the corresponding -word for . . . . From now on, when
we speak of these words , , , , we are talking about the pertinent occurrences at
the places within already described. There are two cases; either and are the
same; i.e. the same -word in , occurring at the same place, or they are not. If
and do occur at the same place, then by the inductive hypothesis, there exists an
-word with an occurrence of at the same place as occurs in , and an
occurrence of at the same place as occurs in . If we can replace
by in , then we will be done. If and do not occur at the same place, then
since is a concatenated -word in , by the inductive hypothesis there exists
an -word such that has occurring at the same place where occurs in .
Similarly, there exists an -word such that has an occurrence of at the same
place where occurs in . If we replace by and by in , then we will be
done. So regardless of which case we are in, our goal is to replace one or two chosen
-words within with one or two other -words. We will show how to replace
two, which clearly implies that replacing one is possible. We wish to replace by
and by . We do this in exactly the same way as in the base case; say that
. . . and . . . . Since , there
exists (mod ) and (mod ) such that . . .
and . . . . As in the base case, we create by making
. . . and . . . , . . . ,
30
and . . . . Then is an word, and by construction
. . . and . . . .
Choose any sequence of -words for all . For any such , take
{ : is a concatenated word in }. Since
is a concatenation of -words and ones, if we write the elements of as
, then for any , a . For any
, and , , by Lemma 2.2.8, there exists an word with the
property that and .
This implies that there is a subword of of the form rvxrv' where the length of is
. We note that can take any integer value between
and inclusive. Therefore, the set of possible lengths of for which rvxrv'
contains
.
When is increased by one, is increased by at most . This, along with
the fact that the intervals have length , which for large exceeds
, implies that this set of possible lengths of contains [ -
, . Since this entire
argument could be made for any , we see that for any , there exists
of length so that rvxrv' . Then, for any nonempty open sets
31
, , there exist and such that and . By the above
arguments, there exists so that for any , , implying that
. This shows that is topologically mixing.
2.3 Some symbolic counterexamples
Proof of Theorem 2.1.15: We take the continuous function for all
, and first note that
: does not
: . . .
: ),
and that the latter set, call it , is clearly a . We will choose so that is dense
in . This will imply that is a dense , and since is a complete metric space,
by the Baire category theorem, that is residual, which will prove Theorem 2.1.15.
Now let us describe the construction of .
Recall that we have assumed that the sequence has upper Banach density zero.
We define . We also define the intervals of integers
for every , and take any partition of into infinitely many
disjoint infinite sets in , call them , , . . .. Define the set , and
32
then define the set . Next, choose some large enough so that
, and define , and then define .
Continuing in this way, we may inductively define , for all so that for
1ll , for some with the property that , and
. We will verify some properties of these sets. Most
importantly, we denote by the union , and claim that . We
show this by noting that has a certain structure; consists of shifted subintervals
of , separated by gaps which approach infinity. More rigorously:
Lemma 2.3.1. There exist intervals and integers such that
and such that
.
Proof: Take the set , and denote its members by . . ..
Then, for any , is a subset of some . The interval is then defined to be
, (which means and ) and is defined to be .
It is just a rewriting of the definition of the that with
these notations. All that must be checked is that
. We will show that , which
implies the desired result. Since , . So, we must
simply show that . Suppose that is a subset of . Then .
We also note that by construction, . But, since , ,
and so . Therefore, , and so , which
33
clearly shows that this quantity approaches , since is an increasing sequence
of integers.
We now prove a general lemma that implies, in particular, that .
Lemma 2.3.2. If , and if there exist intervals and integers
such that ( , then the
set has upper Banach density zero.
Proof: Fix . By the fact that , there exists such that for any interval
of integers of length at least , . Take to be any interval of integers of
length exactly . Since ,
there is some such that if has nonempty intersection with for some
, it is disjoint from for every . Therefore, for intervals
of integers of length with large enough minimum element, consists of a
subset of a shifted copy of , and so , for some interval of integers
whose length is also . This means that in this case, . We have then shown
that for every , there exist , such that for any interval of integers of length
with , . We will show that this slightly modified definition
still implies that . Again fix , and define and as was just done.
Now consider any interval of integers I with length at least . Then, partition
into subintervals: define , and then break into consecutive
subintervals of length , called , , , . There may be one last subinterval left
34
over of length less than ; call it (which may be empty.) Note that ,
or . We see that
.
Since was arbitrary, .
By combining Lemmas 2.3.1 and 2.3.2, . We will now create . We note
that this part of our construction uses only the fact that , and no other
properties. We take and . We recall that must be a sequence
of integers with the following properties: for all ,
for some positive integer and prime . We also require
to grow quickly enough so that for all , and for any interval of integers I of length
at least , . That we may choose such is a consequence of the
fact that . Using these , we define as in Construction 3. We now
prove a lemma:
Lemma 2.3.3. For any k , m , and for any u {0, , there exists an
-rvord such that -m) for all i .
Proof: This is proved by induction on . Clearly the hypothesis is true for
and for any , . Now suppose it to be true for a particular . We will show
35
that it is true for and every , . We again construct an auxiliary word
: enumerate the elements of by , , , . Then, we again define the
word . Define , where
as above. We note that for any and ,
occurs exactly times as a concatenated -word at (mod )-indexed
places in . (This uses the fact that for all , which has already
been shown.) Now, fix . We wish to construct an word such
that for all [ , a ]. We begin with the
word . Clearly it is not necessarily true that for all
. We force this condition to be true by changing some of the
-words concatenated in . We show that this is possible; for any concatenated
word in , say , the necessary condition is
that ones or zeroes (depending on ) be introduced at digits whose indices are of the
form for all [ , , a ]. To do this, we replace this
word by , which by the inductive hypothesis has the correct digits of at the
desired places. So, we may change into a concatenation of -words and ones,
call it , which has the proper digits of in all desired places. This may be done
by changing at most -words.
Therefore, since for any and , occurred in as a concatenated
-word at (mod )-indexed places exactly times, occurs in as
a concatenated -word at (mod )-indexed places between and
times. This implies that , and since
36
and were arbitrary, that is an -word. By induction, Lemma 2.3.3 is
proved.
This implies in particular that for every , there exists an -word with
for all , , ]. By a standard
diagonalization argument, there exists a sequence and such that for
all , for all large enough . Since for every ,
for all , clearly
for all . Since is a limit of shifted -words, . As mentioned above,
this entire construction could be done with any set of zero upper Banach density in
place of , which lets us state the following corollary:
Corollary 2.3.4. For any C with , there exists (X, totally mini-
mal, totally uniquely ergodic, topologically mixing, and with the property that for any
sequence u {0, , there exists such that