Math 104: Introduction to Analysis

Contents

1 Lecture 1
1.1 The natural numbers
1.2 Equivalence relations
1.3 The integers
1.4 The rational numbers

2 Lecture 2
2.1 The real numbers by axioms
2.2 The real numbers by Dedekind cuts
2.3 Properties of R

3 Lecture 3
3.1 Metric spaces
3.2 Topological definitions
3.3 Some topological fundamentals

4 Lecture 4
4.1 Sequences and convergence
4.2 Sequences in R
4.3 Extended real numbers

5 Lecture 5
5.1 Compactness

6 Lecture 6
6.1 Compactness in R^k

7 Lecture 7
7.1 Subsequences
7.2 Cauchy sequences, complete metric spaces
7.3 Aside: Construction of the real numbers by completion

8 Lecture 8
8.1 Taking powers in the real numbers
8.2 “Toolbox” sequences

9 Lecture 9
9.1 Series
9.2 Adding, regrouping series
9.3 “Toolbox” series

10 Lecture 10
10.1 Root and ratio tests
10.2 Summation by parts, alternating series
10.3 Absolute convergence, multiplying and rearranging series

11 Lecture 11
11.1 Limits of functions
11.2 Continuity

12 Lecture 12
12.1 Properties of continuity

13 Lecture 13
13.1 Uniform continuity
13.2 The derivative

14 Lecture 14
14.1 Mean value theorem

15 Lecture 15
15.1 L’Hospital’s Rule
15.2 Power series
15.3 Taylor series

16 Lecture 16
16.1 The Riemann-Stieltjes integral
16.2 Some Riemann-integrable functions

17 Lecture 17
17.1 Properties of the integral

18 Lecture 18
18.1 The fundamental theorem of calculus

19 Lecture 19
19.1 Things that aren’t true
19.2 Uniform convergence

20 Lecture 20
20.1 Basic criteria for uniform convergence
20.2 Uniform convergence and continuity

21 Lecture 21
21.1 Uniform convergence and differentiation
21.2 An everywhere continuous but nowhere differentiable function
21.3 Differentiation and integration of power series

22 Lecture 22
22.1 The Stone-Weierstrass theorem

Though it may seem (or not!) as though I put care into these notes, they are actually very sloppily written. So

I guarantee you that there will be errors, typos, flat out lies, and other varieties of mistakes. You may alert me if

you find one, but the best thing to do is probably to read it with a skeptical and also not-so-literal eye. You should

be especially skeptical about anything I say that seems like it has to do with mathematical logic, since I literally

know nothing about that and am just making stuff up. Much of the content of these notes is taken from Walter

Rudin’s Principles of Mathematical Analysis and to a lesser extent Kenneth Ross’ Elementary Analysis, though of

course the errors are all mine.

1 Lecture 1

Remark 1.1 (Proof by contradiction). This is something you should become acquainted with, if you have not

already.

1.1 The natural numbers

We will take the following as our axiomatic definition of the natural numbers. There are constructions of the natural

numbers from more basic principles but this is our starting point. As Bertrand Russell (maybe) said: it’s turtles

all the way down.

Definition 1.2 (The Peano axioms for the natural numbers). The set of natural numbers, denoted $\mathbb{N}$, is defined axiomatically by:

1. There is a distinguished element, which we denote $1 \in \mathbb{N}$. It is the “first element” of the natural numbers.

2. There is a function $S : \mathbb{N} \to \mathbb{N}$ called the successor function. This should be thought of as the function $S(x) = x + 1$. In other words, every element has a (unique) well-defined successor.

3. The first element $1$ is not a successor of any element, i.e. $1$ is not in the image of $S$.

4. $S$ is injective, i.e. an element can succeed at most one element.

5. Let $P$ be a property. If $1$ has $P$, and if $x$ has $P$ implies $S(x)$ has $P$, then every $x \in \mathbb{N}$ has $P$. This is the induction principle.

Remark 1.3. The inductive axiom is important. Otherwise, we may be allowed things like two disjoint copies of the natural numbers: $\mathbb{N} \sqcup \mathbb{N}$. In some ways it specifies that there is a unique first element. It does more than that though: it

tells us that “arguments by induction are allowed.” If your next question is: what other kinds of arguments are

allowed or not allowed, my answer is I don’t know, and that you should consult a logician. I do not worry much

about issues like this.

This is a good starting point, but we are not only interested in N as a set but also in the addition and

multiplication operations on it, which give it some additional structure. We should define these in terms of the

axioms above.

Definition 1.4 (Addition, multiplication). One can define addition on $\mathbb{N}$ recursively as follows.

\[ a + 1 := S(a), \qquad a + S(b) := S(a + b). \]

One can also define multiplication on $\mathbb{N}$ recursively as follows.

\[ a \cdot 1 := a, \qquad a \cdot S(b) := a + a \cdot b. \]

One can then prove that addition and multiplication as defined above are commutative, associative, and distributive,

i.e. that the following are true

\[
\begin{aligned}
a + b &= b + a, \\
(a + b) + c &= a + (b + c), \\
ab &= ba, \\
(ab)c &= a(bc), \\
a(b + c) &= ab + ac.
\end{aligned}
\]

Since I am taking the natural numbers as we know them “intuitively” as a starting point, I won’t actually prove

these things. If you are interested you can try it.
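The recursive definitions of addition and multiplication can be tried out in code. The following is only a toy sketch, not part of the formal development: it models a natural number as a nested tuple built from a distinguished element and a successor function. The names `ONE`, `S`, `add`, `mul`, `from_int` and `to_int` are hypothetical and chosen for this illustration.

```python
# A toy model of the Peano axioms (illustrative only; all names are hypothetical).
# A natural number is either ONE or S(n) for a natural number n.

ONE = ("one",)


def S(n):
    """Successor function: wraps n in one more layer."""
    return ("S", n)


def add(a, b):
    """a + 1 := S(a);  a + S(b) := S(a + b)."""
    if b == ONE:
        return S(a)
    return S(add(a, b[1]))


def mul(a, b):
    """a * 1 := a;  a * S(b) := a + a * b."""
    if b == ONE:
        return a
    return add(a, mul(a, b[1]))


def from_int(k):
    """Build the Peano numeral for an ordinary integer k >= 1."""
    n = ONE
    for _ in range(k - 1):
        n = S(n)
    return n


def to_int(n):
    """Convert a Peano numeral back to an ordinary integer."""
    count = 1
    while n != ONE:
        n = n[1]
        count += 1
    return count


two, three = from_int(2), from_int(3)
print(to_int(add(two, three)))  # computes 2 + 3
print(to_int(mul(two, three)))  # computes 2 * 3
```

Checking commutativity on a handful of small inputs with this model is of course not a proof; the actual proofs go by induction.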

Remark 1.5. Sometimes people define the natural numbers to include zero. I am making an arbitrary decision

here not to invite zero to the party. (It wasn’t a very positive number to be around... ha ha ha...)

1.2 Equivalence relations

I will introduce a notion which is very useful in making constructions.

Definition 1.6. Given a set $S$, an equivalence relation on $S$ is a subset $E \subset S \times S$ satisfying the properties below. If $(a, b) \in E$, we often write $a \sim b$.

(reflexivity) for all $s \in S$, we have $(s, s) \in E$ (i.e. $s \sim s$),

(symmetry) for all $(s, t) \in E$, we have $(t, s) \in E$ (i.e. if $s \sim t$ then $t \sim s$),

(transitivity) for all $(s, t) \in E$ and $(t, u) \in E$, we have $(s, u) \in E$ (i.e. if $s \sim t$ and $t \sim u$ then $s \sim u$).

This notion is supposed to generalize equality, as equality is very rigid and literal, but there are situations in

which we might want to consider two non-equal things to be “the same.”

Example 1.7. Examples of equivalence relations:

(1) Equality is an equivalence relation.

(2) Similarity and congruence in Euclidean geometry.

(3) Similarity of matrices in linear algebra.

(4) Congruence modulo n.

Definition 1.8 (Equivalence classes). We first introduce some notation. For $s \in S$, let $[s]$ denote the subset of $S$ containing all elements equivalent to $s$, i.e.

\[ [s] = \{ t \in S \mid s \sim t \}. \]

An equivalence class is a subset of the form $[s]$ for some $s \in S$.

Remark 1.9 (More intrinsic characterization of equivalence classes). Another characterization of equivalence classes is the following: a nonempty subset $T \subset S$ is an equivalence class if and only if (1) every $t, t' \in T$ has $t \sim t'$; and (2) if $s \notin T$ then for all $t \in T$ we have $t \not\sim s$.

Remark 1.10 (Equivalence classes partition $S$). Let $S$ be a set with an equivalence relation $\sim$. Then the equivalence classes partition $S$. Check this as an exercise.

Definition 1.11 (Quotients). Let $\sim$ be an equivalence relation on $S$. Then the quotient $S/{\sim}$ is the set of equivalence classes.

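Example 1.12. Examples of quotients, numbered to match Example 1.7:

(1) Under equality, for any set $S$, $S/{\sim} = S$ (each class is a single point).

(2) Left as exercise.

(3) Let $M_n$ be the set of all $n \times n$ matrices (over $\mathbb{C}$, so that Jordan normal forms exist). Then $M_n/{\sim}$ is the set of equivalence classes with representatives all possible Jordan normal forms, up to reordering of the blocks.

(4) $\mathbb{Z}/{\sim} = \{[0], [1], \ldots, [n-1]\}$.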
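To make the quotient in (4) concrete, here is a small sketch that partitions a finite range of integers into equivalence classes under congruence modulo $n$. The helper name `equivalence_classes` is hypothetical, and the code assumes the relation passed in really is an equivalence relation (so comparing against one representative per class is enough).

```python
def equivalence_classes(elements, related):
    """Partition `elements` into classes under the relation `related`.

    Assumes `related` is an equivalence relation, so it suffices to
    compare each element with one representative per class.
    """
    classes = []
    for x in elements:
        for cls in classes:
            if related(cls[0], x):
                cls.append(x)
                break
        else:
            # x is not related to any existing representative: new class
            classes.append([x])
    return classes


n = 3
classes = equivalence_classes(range(-6, 7), lambda s, t: (s - t) % n == 0)
for cls in classes:
    print(cls)
```

Each printed list is the part of one class $[s]$ that falls in the sampled range, and together the lists partition that range, as Remark 1.10 predicts.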

1.3 The integers

We will define the integers from the natural numbers as follows. The idea is to let a pair $(a, b)$ represent the quantity $a - b$. However, sometimes different pairs will be “equal” under this assignment, so we want to capture this using an equivalence relation.

Definition 1.13. The set of integers $\mathbb{Z}$ is defined as the quotient $(\mathbb{N} \times \mathbb{N})/{\sim}$ under the equivalence relation $(a, b) \sim (c, d)$ if $a + d = c + b$. The set of integers has an operation, called addition, which is defined by $[(a, b)] + [(c, d)] = [(a + c, b + d)]$. This operation is well-defined in the quotient (check as exercise). The set of integers also has an additive identity (i.e. zero), which is $[(1, 1)]$. The set of integers also has an operation called multiplication, defined by $[(a, b)] \cdot [(c, d)] = [(ac + bd, ad + bc)]$. It also has a multiplicative identity, given by $[(2, 1)]$. Finally, there is an operation called additive inversion, where $-[(a, b)] = [(b, a)]$.

Remark 1.14 (Motivating the equivalence relation). We want to think of the pair $(a, b)$ of natural numbers as “representing” the integer $a - b$. However, doing this means that every integer, e.g. $0$, has many representatives, e.g. $(1, 1), (2, 2), (3, 3), \ldots$. Thus we must make all these representatives “equal” using an equivalence relation. The condition that captures when two pairs should satisfy $(a, b) \sim (c, d)$ is that $a - b = c - d$. However, since subtraction is not well-defined in the natural numbers, we must rewrite this as $a + d = c + b$.

Remark 1.15 (Checking operations are well defined). We defined the addition operation by

\[ [(a, b)] + [(c, d)] = [(a + c, b + d)]. \]

What we’ve done is, for each equivalence class, choose a particular representative; our operation then returns an element of $\mathbb{N} \times \mathbb{N}$, and we take the equivalence class of that element. To show this is well-defined, one must show that if we choose different representatives, then what we get by applying the operation “rule” is an equivalent element.
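The well-definedness check can be spot-tested by brute force on small representatives. The sketch below is an illustration under stated assumptions, not a proof: it only tries pairs with entries in a small range, and the function names (`equivalent`, `add`, `mul`, `well_defined`) are hypothetical.

```python
from itertools import product


def equivalent(p, q):
    """(a, b) ~ (c, d)  iff  a + d == c + b."""
    (a, b), (c, d) = p, q
    return a + d == c + b


def add(p, q):
    """Addition rule applied to representatives."""
    (a, b), (c, d) = p, q
    return (a + c, b + d)


def mul(p, q):
    """Multiplication rule applied to representatives."""
    (a, b), (c, d) = p, q
    return (a * c + b * d, a * d + b * c)


def well_defined(op, bound=4):
    """Check, for pairs with entries in 1..bound, that equivalent inputs
    always give equivalent outputs under `op`."""
    pairs = list(product(range(1, bound + 1), repeat=2))
    for p, p2, q, q2 in product(pairs, repeat=4):
        if equivalent(p, p2) and equivalent(q, q2):
            if not equivalent(op(p, q), op(p2, q2)):
                return False
    return True


print(well_defined(add), well_defined(mul))
# (1, 3) represents -2 and (3, 1) represents 2; their sum should be zero.
print(equivalent(add((1, 3), (3, 1)), (1, 1)))
```

A rule that is not well-defined, such as "take the first coordinate of the first pair", would fail this kind of check, which is a quick way to see why the verification is not vacuous.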

The set of integers forms what is called a commutative ring.

Definition 1.16. A commutative ring is a set $R$ with addition and multiplication operations, denoted $+$ and $\cdot$; additive and multiplicative identities, denoted $0$ and $1$; and additive inverses, denoted $-$. The following properties are satisfied:

1. (associativity of addition) $(a + b) + c = a + (b + c)$

2. (commutativity of addition) $a + b = b + a$

3. (additive identity) $0 + a = a$

4. (additive inverses) $a + (-a) = 0$

5. (associativity of multiplication) $(ab)c = a(bc)$

6. (commutativity of multiplication) $ab = ba$

7. (multiplicative identity) $1 \cdot a = a$

8. (distributivity) $a(b + c) = ab + ac$

Remark 1.17. This is not a very important definition for this course; it appears frequently in algebra. It will not appear beyond this lecture. You can think of this definition as shorthand for saying: a set which has addition and multiplication and their respective identity elements, and which satisfies commutativity, associativity and distributivity properties. Or, a set that has all the operations that the integers do.

Proposition 1.18. The integers form a commutative ring.

Proof. We will prove one of the properties above: additive inverses. The rest are left as exercises. Write an integer as $[(a_+, a_-)]$. What we want to show is that

\[ (a_+, a_-) + (a_-, a_+) \sim (1, 1). \]

Using the above formula for addition, this says

\[ (a_+ + a_-, \; a_- + a_+) \sim (1, 1). \]

This is true, since using our definition of $\sim$, we check that indeed

\[ (a_+ + a_-) + 1 = 1 + (a_- + a_+). \]

Example 1.19 (Examples of other commutative rings). The (multivariate) polynomials with integer or rational or real or complex coefficients, under the usual addition and multiplication of polynomials, form a ring, denoted $k[x]$ for one variable, $k[x, y]$ for two variables $x$ and $y$, et cetera, where $k = \mathbb{Z}, \mathbb{Q}, \mathbb{R}, \mathbb{C}$.

1.4 The rational numbers

The construction of the rational numbers is left as a homework exercise. The rational numbers form a field, which is to say that every number except $0$ has a multiplicative inverse.

Definition 1.20. A field $k$ is a commutative ring with multiplicative inverses for all nonzero elements, denoted $a^{-1}$, i.e. such that $a a^{-1} = 1$.

We will make the following definition as well. We could have made similar definitions before, but I am lazy.

Definition 1.21. A (total) order on a set $S$ is a subset $L \subset S \times S$ satisfying the following properties. If $(a, b) \in L$ then we will write for shorthand $a \le b$.

(antisymmetry) if $a \le b$ and $b \le a$ then $a = b$

(transitivity) if $a \le b$ and $b \le c$ then $a \le c$

(totality) at least one of the following must be true: $a \le b$ or $b \le a$.

An ordered field $k$ is a field with a total order such that:

(a) if $a \le b$ then $a + c \le b + c$ for any $c \in k$

(b) if $a \ge 0$ and $b \ge 0$ then $ab \ge 0$

Remark 1.22. There are other notions of orders: for example, there are orders on sets where two elements might

be incomparable. You can check Wikipedia if you’re interested. We are interested in total orders, but since there

are no other orders we will consider, I might be lazy about writing “total.”

Example 1.23. The rationals and reals, $\mathbb{Q}$ and $\mathbb{R}$, are both ordered fields. The complex numbers $\mathbb{C}$ form a field, but not an ordered field.

2 Lecture 2

2.1 The real numbers by axioms

There are three approaches we will take to the real numbers in this class. The first is axiomatic.

Definition 2.1 (Axiomatic definition of the real numbers). The real numbers are an ordered field with the least upper bound (LUB) property. The LUB property is as follows: every nonempty subset which has an upper bound has a supremum (least upper bound). (We will define these terms now.)

Definition 2.2. Let $X$ be an ordered set, and $S \subset X$. The maximum of $S$ is an element $m \in S$ such that for all $s \in S$, $s \le m$ (exercise: prove maximums must be unique). An upper bound on $S$ is an element $x \in X$ such that for all $s \in S$, $s \le x$.

Remark 2.3. Notice that maximums must be in the set S, whereas upper bounds do not have to be. Further,

notice that upper bounds are usually not unique.

Example 2.4. Let $X = \mathbb{R}$ and $S = [0, 1]$. Then $1$ is the maximum of $S$, and any number greater than or equal to $1$ is an upper bound.

Remark 2.5. All nonempty finite sets have maximums. However, not all sets in general have a maximum. For a trivial example, consider $(0, \infty) \subset \mathbb{R}$. This fails to have a maximum, and fails to have an upper bound. I can live with this. What’s kind of bad is that not all bounded sets have maximums. For example, the interval $(-1, 0)$ is bounded above by $0$ but does not have a maximum. Let $m \in (-1, 0)$ be a possible maximum. But $\frac{1}{2} m$ is also in the set and is even bigger (since $m$ is negative). So there can’t be a maximum. In this way asking for maximums is somehow the wrong thing to do for sets which might be infinite. This will motivate the next definition.

Definition 2.6. A least upper bound on a subset $S$ of an ordered set $X$ is an element $x \in X$ which is an upper bound on $S$, and such that there is no smaller upper bound. More mathematically, it is an upper bound such that if $x'$ is another upper bound, then necessarily $x \le x'$. The least upper bound is also called the supremum and is denoted $\sup(S)$. Likewise, the greatest lower bound is called the infimum and is denoted $\inf(S)$.

Remark 2.7. In our previous example, the supremum is $0$. However, it is still not true that every bounded-above subset of an ordered set has a supremum. For example, $(0, \sqrt{2})$ considered as a subset of $\mathbb{Q}$ (not $\mathbb{R}$!), i.e. the set of positive rationals $q$ with $q^2 < 2$, does not have a supremum in $\mathbb{Q}$. Showing this is actually somewhat involved and uses some elementary number-theoretic arguments. I will just appeal to intuition here. In a way, though, this problem is less of a defect of our definition of least upper bound than it is a defect of the rational numbers: the subset $(0, \sqrt{2})$ of the rational numbers doesn’t have anything that even resembles a “generalized maximum” if we stay in $\mathbb{Q}$. This motivates the next definition.
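One half of the claim in Remark 2.7 can be illustrated with exact rational arithmetic: any positive rational $u$ with $u^2 > 2$ can be replaced by a strictly smaller positive rational whose square still exceeds $2$, so no such $u$ can be a least upper bound. The sketch below uses the averaging step $v = \frac{1}{2}(u + 2/u)$; this particular formula and the name `smaller_upper_bound` are my choices for illustration, and the code says nothing about the other half of the argument (that no rational squares to exactly $2$).

```python
from fractions import Fraction


def smaller_upper_bound(u):
    """Given a rational u > 0 with u*u > 2, return a rational v with
    0 < v < u and v*v > 2 (one averaging step: v = (u + 2/u) / 2)."""
    return (u + 2 / u) / 2


u = Fraction(2)
for _ in range(4):
    v = smaller_upper_bound(u)
    # v is smaller than u, yet its square is still above 2
    print(u, "->", v, "| v < u:", v < u, "| v*v > 2:", v * v > 2)
    u = v
```

The reason this works is the identity $v^2 - 2 = \left(\frac{u - 2/u}{2}\right)^2 > 0$, together with $2/u < u$ whenever $u^2 > 2$.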

Definition 2.8. An ordered set has the least upper bound property if every nonempty subset $S$ which is bounded above has a supremum (least upper bound).

Remark 2.9. The least upper bound property is equivalent to the greatest lower bound property for an ordered field, since for any set $S$ we can always take its negation $-S = \{ -s \mid s \in S \}$, and the supremum of $S$ is the negative of the infimum of $-S$ and vice versa.

Remark 2.10. Thus the real numbers are defined as an ordered field which has the LUB property. Vaguely, this

means “all its holes are filled in.” However, it’s still not clear such an object exists, since we haven’t constructed it.

In the next section we will do that. In a somewhat later section, we will show that there is a general construction

one can make to “fill in the holes” on certain kinds of spaces called metric spaces, and that the real numbers are

what we get when we do this to Q.

2.2 The real numbers by Dedekind cuts

In this section we will give a construction of the real numbers by Dedekind cuts. The way to “think about” Dedekind cuts is that to a real number $\alpha$, we associate the two subsets $(-\infty, \alpha)$ and $(\alpha, \infty)$, as subsets of $\mathbb{Q}$ (i.e. intersect with $\mathbb{Q}$). Though this is how we “think” of these cuts, in making arguments we cannot actually refer to things like $(-\infty, \alpha)$ for $\alpha$ irrational, since this would make our construction circular!

Definition 2.11. A Dedekind cut of the rational numbers $\mathbb{Q}$ is a partition of $\mathbb{Q}$ into subsets $A$ and $B$ that satisfies the following properties:

(1) (definition of partition) $A \cup B = \mathbb{Q}$ and $A \cap B = \emptyset$

(2) ($A$ is closed downward) if $a \in A$, then $(-\infty, a) \subset A$

(3) ($B$ is closed upward) if $b \in B$, then $(b, \infty) \subset B$

(4) (infinity is not a number) neither $A$ nor $B$ is empty or all of $\mathbb{Q}$

(5) (a number is cut out) $A$ does not have a maximum, and $B$ does not have a minimum

We can give an ordering on the Dedekind cuts as follows. We say $(A, B) \le (A', B')$ if $A \subset A'$ and $B' \subset B$.

There is also a way to make Dedekind cuts into a field, but this is left as a homework exercise.

Remark 2.12. Note that the data of $B$ is extraneous. Given $A$ that satisfies the above properties, it is automatic that $B = \mathbb{Q} \setminus A$ does. If this $B$ has a minimum, then remove it from $B$. (Exercise: show that this new $B$ cannot have a minimum. Hint: show that between any two rational numbers, there is a rational number.) I made the above definition to illustrate why this is a “cut,” but from now on we will only be using the data of $A$.

Remark 2.13. Property (5) is imposed because otherwise the cuts $(-\infty, q)$ and $(-\infty, q]$ for $q \in \mathbb{Q}$ would look like

they should be the same. Try to prove the field axioms without this property and see why it’s necessary. For

example, we can’t have two candidates for zero in a field!

Remark 2.14. The Dedekind cuts need to be made into an ordered field. Unfortunately this can get kind of hairy

for multiplication, and I won’t do it here. Interested readers should refer to the constructions in Rudin.

Proposition 2.15. The ordered set of Dedekind cuts has the LUB property. In particular, let $\{A_i\}_{i \in I}$ be a nonempty collection of Dedekind cuts which is bounded above. Then its supremum is $\bigcup_{i \in I} A_i$.

Proof. We need to check that (1) the claimed supremum is also a Dedekind cut (in particular, it is not all of $\mathbb{Q}$) and that (2) it really is the supremum.

For (1), it is easy to see that $\bigcup_i A_i$ is nonempty and still closed downwards. The fact that the Dedekind cuts are bounded above means there is some cut $A$ such that $A_i \subset A$ for all $i \in I$. This means that $\bigcup_i A_i \subset A$, and $A \neq \mathbb{Q}$, so $\bigcup_i A_i \neq \mathbb{Q}$. Finally, suppose that $\bigcup_i A_i$ had a maximum, say $m$. Then $m \in A_p$ for some fixed $p \in I$, but since it is a maximum of the union it would have to be a maximum for $A_p$ as well, contradicting that $A_p$ is a Dedekind cut.

For (2), the union contains every $A_i$, so it is an upper bound. Suppose there is some cut $Y$ such that $A_i \subset Y$ for all $i \in I$ (so $Y$ is an upper bound) but $Y \subsetneq \bigcup_{i \in I} A_i$ ($Y$ is strictly lower than the claimed supremum). Choose $x \in \bigcup_{i \in I} A_i$ with $x \notin Y$; then $x \in A_p$ for some $p \in I$. But this contradicts $A_p \subset Y$, so this cannot be. One can sum this argument up in a sentence like, “the union of a collection of subsets is the minimal subset containing all the subsets in the collection.”

Remark 2.16 (Why we like axioms). This construction is nice in reassuring us that the real numbers exist (yay!).

I think the logic-y thing to say would be that Dedekind cuts are a model for the real numbers. The nice thing about

axioms is that we can build theories without thinking too hard about specific constructions, as long as our axioms

are well-designed. The specific model doesn’t really matter as long as we stay within the bounds of the axioms. One

might complain that if we want to call something the real numbers, there should be a question about uniqueness,

which I know nothing about. So this question is also going under the rug. In practice this is not important.

2.3 Properties of R

Proposition 2.17 (Archimedean property). Let $a, b \in \mathbb{R}$. If $a > 0$ and $b > 0$, then there is a positive integer $n$ such that $na > b$.

Proof. Suppose otherwise, i.e. that there are some $a, b > 0$ in $\mathbb{R}$ such that $na \le b$ for all $n \in \mathbb{N}$. In particular, this means that $b$ is an upper bound for $S := \{ na \mid n \in \mathbb{Z} \}$ (the elements with $n \le 0$ are at most $0 < b$). Thus, by the LUB property, $\sup S$ exists.

In a homework exercise, you will show that $\sup(S) + \sup(T) = \sup(S + T)$. Let $S$ be as above and $T = \{a\}$. Then notice that $S + T = S$ and $\sup(T) = a$, so we should have $\sup(S) + a = \sup(S)$, which contradicts that $a > 0$. Thus, the set must be unbounded and there is some $n$ such that $na > b$.

Proposition 2.18 ($\mathbb{Q}$ is dense in $\mathbb{R}$). For any $a, b \in \mathbb{R}$ with $a < b$, there is an $x \in \mathbb{Q}$ such that $x \in (a, b)$.

Proof. Write $x = \frac{p}{q}$ for $p, q \in \mathbb{Z}$ with $q > 0$. Then we want to find $p, q$ such that $aq < p < bq$. We can use the Archimedean property to see that there is a positive integer $q$ such that $q(b - a) = bq - aq > 1$. We claim that if $\beta - \alpha > 1$, then there is an integer in $(\alpha, \beta)$. This would complete the proof, letting $\beta = bq$ and $\alpha = aq$, and $p$ the integer.

Lemma 2.19. Let $\alpha, \beta \in \mathbb{R}$. If $\beta - \alpha > 1$, then there is an integer $p \in (\alpha, \beta)$.

Proof. If $\alpha < 0 < \beta$, then $0$ is an integer in $(\alpha, \beta)$. If $\alpha < \beta \le 0$, we can prove the result for the interval $(-\beta, -\alpha)$ and negate the integer we find. So, without loss of generality, we can assume that $\alpha \ge 0$.

The strategy is as follows. We want to take the set

\[ \{ n \in \mathbb{Z} \mid \alpha < n \} \]

and find its minimum value, say $p$. By construction we have $\alpha < p$, and since $p$ is the minimum, we know that $p - 1 \le \alpha$ and so $p \le \alpha + 1 < \beta$ as well, completing the proof. The issue is that this set may not have a minimum and it may also be empty.

First, let’s address the existence of a minimum. We can force a set to have a minimum by making it finite, i.e. by adding some conditions bounding the set:

\[ \{ n \in \mathbb{Z} \mid \alpha < n \text{ and } n \in [0, N] \} \]

for some integer $N$. This set is nonempty so long as $\alpha < N$, and we can use the Archimedean property to find an integer $N$ such that $N \cdot 1 > \alpha$. The resulting set is finite and nonempty, so it has a minimum $p$; since $\alpha \ge 0$, this $p$ is also the minimum of the original set.
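The proofs of Proposition 2.18 and Lemma 2.19 are constructive enough to follow step by step in code. The sketch below is only an illustration: the inputs are exact rationals (a Python float is converted to the exact rational it stores), so nothing new is being computed; the point is to mirror the two steps of the argument. The function name `rational_between` is hypothetical.

```python
from fractions import Fraction
import math


def rational_between(a, b):
    """Return a Fraction p/q with a < p/q < b, following the proof:
    first pick q with q*(b - a) > 1, then pick an integer p in (a*q, b*q)."""
    if not a < b:
        raise ValueError("need a < b")
    a, b = Fraction(a), Fraction(b)      # exact arithmetic from here on
    q = math.floor(1 / (b - a)) + 1      # Archimedean step: q * (b - a) > 1
    p = math.floor(a * q) + 1            # least integer strictly above a*q
    return Fraction(p, q)


x = rational_between(1.41, 1.42)
print(x, Fraction(1.41) < x < Fraction(1.42))
```

Here `p` is the minimum of the set of integers above `a*q`, so `p - 1 <= a*q` and hence `p <= a*q + 1 < b*q`, which is exactly the inequality used in the lemma.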

3 Lecture 3

3.1 Metric spaces

Definition 3.1. Let $k$ be a totally ordered field. A $k$-metric space is a set $X$ along with a distance function $d : X \times X \to k$ satisfying the following properties:

(non-negativity) $d(x, y) \ge 0$ for all $x, y \in X$,

(identity of indiscernibles) $d(x, y) = 0$ if and only if $x = y$,

(symmetry) $d(x, y) = d(y, x)$ for all $x, y \in X$,

(triangle inequality) $d(x, y) + d(y, z) \ge d(x, z)$ for all $x, y, z \in X$.

Remark 3.2. Most texts fix $k = \mathbb{R}$. In general, we will assume $k = \mathbb{R}$, but later when we give a construction of $\mathbb{R}$ we will require $k = \mathbb{Q}$ to avoid a circular argument.

Remark 3.3. I should be more careful about this definition than I am. I am not sure whether any completeness

properties of R are used in results in the literature on metric spaces. I am also not sure how much in the literature

applies to loosening the restriction on k, i.e. if we consider ordered abelian groups or ordered rings instead. I hope

that these notes at least are internally consistent.

Example 3.4 (The Euclidean metric). $\mathbb{R}^n$ is a metric space under the Euclidean metric. Let $x = (x_1, \ldots, x_n)$ and $y = (y_1, \ldots, y_n)$, and define:

\[ d(x, y) = \sqrt{(x_1 - y_1)^2 + \cdots + (x_n - y_n)^2}. \]

One can think of this as a “generalized Pythagorean theorem” way of finding distance. For $n = 1$ it is not hard to see this is a metric. For larger $n$, the proofs I know use Cauchy-Schwarz. Note that this metric doesn’t quite make sense when $k = \mathbb{Q}$ for $\mathbb{Q}^n$ when $n \neq 1$, since square roots might not exist in $\mathbb{Q}$.

Example 3.5 (The discrete metric). Let $X$ be any set. Then the discrete metric is defined by:

\[ d(x, y) = \begin{cases} 1 & \text{if } x \neq y, \\ 0 & \text{if } x = y. \end{cases} \]

One can easily verify this is a metric.
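Both examples can be written down directly and tested against the four axioms of Definition 3.1 on a finite sample of points. This is a sanity check, not a proof, and it works in floating point rather than in an exact ordered field; the names `euclidean`, `discrete` and `satisfies_metric_axioms`, and the tolerance `tol`, are assumptions made for this sketch.

```python
import itertools
import math


def euclidean(x, y):
    """Euclidean distance between two points of R^n given as tuples."""
    return math.sqrt(sum((xi - yi) ** 2 for xi, yi in zip(x, y)))


def discrete(x, y):
    """Discrete metric: 0 if the points agree, 1 otherwise."""
    return 0 if x == y else 1


def satisfies_metric_axioms(d, points, tol=1e-12):
    """Check the four metric axioms on every triple from `points`.

    `tol` absorbs floating-point rounding in the triangle inequality.
    """
    for x, y, z in itertools.product(points, repeat=3):
        if d(x, y) < 0:                        # non-negativity
            return False
        if (d(x, y) == 0) != (x == y):         # identity of indiscernibles
            return False
        if d(x, y) != d(y, x):                 # symmetry
            return False
        if d(x, y) + d(y, z) < d(x, z) - tol:  # triangle inequality
            return False
    return True


sample = [(0, 0), (1, 0), (0, 1), (3, 4), (-2, 5)]
print(satisfies_metric_axioms(euclidean, sample))
print(satisfies_metric_axioms(discrete, sample))
```

A function such as $d(x, y) = |x_1 - y_1|$ on $\mathbb{R}^2$ fails the check, because distinct points with the same first coordinate are at distance $0$.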

3.2 Topological definitions

Definition 3.6. Let $(X, d)$ be a metric space. The open ball of radius $\varepsilon > 0$ at $x$ is defined by:

\[ B_\varepsilon(x) := \{ p \in X \mid d(x, p) < \varepsilon \} \]

and the closed ball is defined by:

\[ \overline{B}_\varepsilon(x) := \{ p \in X \mid d(x, p) \le \varepsilon \}. \]

We will sometimes use the word neighborhood of $x$ to refer to any open ball of any radius at $x$.

Example 3.7. Let $X = \mathbb{R}$ with the Euclidean metric. Then $B_\varepsilon(x) = (x - \varepsilon, x + \varepsilon)$.

Example 3.8. Let $X$ have the discrete metric. Then $B_\varepsilon(x) = \{x\}$ if $\varepsilon \le 1$ and $B_\varepsilon(x) = X$ otherwise.

Definition 3.9. Let $(X, d)$ be a metric space, and $E \subset X$. A point $p \in E$ is called an interior point of $E$ if there is a neighborhood $U$ of $p$ such that $U \subset E$. A subset $E$ is called open if every point of $E$ is interior.

Example 3.10 (Open balls are open). Every open ball is open. To show that every $p \in B_\varepsilon(x)$ is an interior point, take $a = \varepsilon - d(p, x) > 0$ and note that by the triangle inequality, $B_a(p) \subset B_\varepsilon(x)$.

Definition 3.11. Let $(X, d)$ be a metric space and $E \subset X$. A point $p \in X$ is a limit point of $E$ if every neighborhood $U$ of $p$ contains a point $q \neq p$ such that $q \in E$. A subset $E$ which contains all its limit points is called closed. The closure of $E$, denoted $\overline{E}$, is the union of $E$ with all of its limit points.

Example 3.12 (Closed balls are closed). Every closed ball is closed. Consider $\overline{B}_\varepsilon(x)$ and let $p$ be a possible limit point. If $d(p, x) \le \varepsilon$ then we are happy. Otherwise, let $0 < a < d(p, x) - \varepsilon$. Then $B_a(p)$ and $\overline{B}_\varepsilon(x)$ are disjoint by the triangle inequality, and thus $p$ cannot be a limit point.

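Proposition 3.13 (The closure of any subset is closed). Let $(X, d)$ be a metric space and $E \subset X$. The closure $\overline{E}$ is closed.

Proof. This might seem obvious, but there is something to prove. One has to show that one does not introduce more limit points by including the limit points of $E$. Suppose $p$ is a limit point of $\overline{E}$; then every neighborhood $N$ of $p$ contains a point of $\overline{E}$ other than $p$. If $N$ contains a point of $E$ other than $p$ we are done. Otherwise $N$ contains a limit point of $E$, say $q \neq p$. Since $N$ is open, $q$ is an interior point of $N$, so there is a neighborhood of $q$ contained in $N$, which we may shrink so that it does not contain $p$. This neighborhood contains a point of $E$, which lies in $N$ and is different from $p$. Hence $p$ is a limit point of $E$ and so $p \in \overline{E}$, concluding the proof.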

Remark 3.14. The closed ball isn’t really used that much as a topological notion, as far as I know. I include it

here for fun.

Definition 3.15. Let $(X, d)$ be a metric space. A subset $E \subset X$ is dense if $\overline{E} = X$.

Example 3.16. As we’ve shown before, $\mathbb{Q}$ is dense in $\mathbb{R}$. To see why, we claim that every real number is a limit point of the rational numbers: let $a \in \mathbb{R}$; every ball $B_\varepsilon(a) = (a - \varepsilon, a + \varepsilon)$ contains a rational number other than $a$.

3.3 Some topological fundamentals

Proposition 3.17 (Complements). A set is open if and only if its complement is closed.

Proof. This proof is mostly just juggling the logic in the definitions. Suppose that $E$ is open. Let $p$ be a limit point of $E^c$. Then every neighborhood of $p$ intersects $E^c$ nontrivially, i.e. no neighborhood of $p$ is contained in $E$, so $p$ is not an interior point of $E$. Since $E$ is open, $p \notin E$, so $p \in E^c$, so $E^c$ is closed. Suppose now that $E$ is closed. Let $p \in E^c$, so $p$ is not a limit point of $E$, so there is a neighborhood of $p$ disjoint from $E$ (note that $p$ itself is not in $E$), so $p$ is an interior point of $E^c$, so $E^c$ is open. Applying this to the complement gives the remaining direction.

Proposition 3.18 (Unions and intersections). Any (possibly infinite) union of open sets is open. Any (possibly infinite) intersection of closed sets is closed. Any finite union or intersection of open or closed sets is open or closed, respectively.

Proof. By the previous proposition, we only need to prove the statements for open sets. I will leave the statement that an infinite union of open sets is open as an exercise. For a finite intersection of open sets, take open sets $G_1, \ldots, G_n$ and $x \in \bigcap_{i=1}^n G_i$. Since $x \in G_i$ is interior for all $i$, we have $B_{\varepsilon_i}(x) \subset G_i$ for all $i$ and for some $\varepsilon_i > 0$. Then $B_{\min(\varepsilon_i)}(x) \subset \bigcap_{i=1}^n G_i$ and the result follows.

Proposition 3.19 (Closure). The closure $\overline{E}$ is the intersection of all closed sets that contain $E$. Equivalently, it is the smallest closed set containing $E$.

Proof. Let $F = \bigcap_{E \subset V \text{ closed}} V$ be the intersection of all closed sets containing $E$. Clearly, $F \subset \overline{E}$ since $\overline{E}$ is closed and contains $E$. For the reverse inclusion, we want to show that every closed set containing $E$ also contains $\overline{E}$. Let $V$ be closed and $E \subset V$. Taking the closure of both sides, we find that $\overline{E} \subset \overline{V} = V$.

4 Lecture 4

4.1 Sequences and convergence

Remark 4.1. Warning: the sequence notion of limit is not quite the same as the notion of limit you learned

in calculus – the context is different.

Definition 4.2. Let $(X, d)$ be a metric space. A sequence $\{p_n\}$ in $X$ converges to $p \in X$ if for every $\varepsilon > 0$, there is some $N$ such that $d(p_n, p) < \varepsilon$ when $n > N$. Equivalently, the tail $\{p_n\}_{n > N}$ is contained in the ball $B_\varepsilon(p)$.

In this case, we write:

$$\lim_{n \to \infty} p_n = p$$

If a sequence does not converge to any point, it diverges.

Proposition 4.3. Limits of sequences are unique.

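Proof. Let $p, q$ be two limits of a sequence $\{p_n\}$. We will show that $d(p, q) < \varepsilon$ for any $\varepsilon > 0$, which forces $d(p, q) = 0$ and hence $p = q$. Choose any $\varepsilon > 0$; then there is an $N$ such that $d(p_n, p) < \frac{\varepsilon}{2}$ for $n > N$ and an $M$ such that $d(p_n, q) < \frac{\varepsilon}{2}$ for $n > M$. Then, for $n > \max(N, M)$, we have that $d(p, q) \leq d(p, p_n) + d(p_n, q) < \varepsilon$ as desired.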

Proposition 4.4 (Connections to topology). If $E \subset X$ and $p$ is a limit point of $E$, then there is a sequence in $E$ converging to $p$.

Proof. For every $n$, choose some $s_n \in B_{1/n}(p) \cap E$, which we know is nonempty since $p$ is a limit point of $E$. We claim this sequence converges to $p$. For $\varepsilon > 0$, choose $N$ such that $\frac{1}{N} < \varepsilon$. Then, one sees that for $n > N$, we have $d(s_n, p) < \frac{1}{n} < \frac{1}{N} < \varepsilon$.

Definition 4.5. Let $(X, d)$ be a metric space. A subset $E \subset X$ is bounded if there is a number $L$ such that $\{d(p, q) \mid p, q \in E\}$ is bounded above (in the order-theoretic sense) by $L$.

Proposition 4.6 (Convergent sequences are bounded). Convergent sequences are bounded (considered as sets).

Proof. Let $\{p_n\} \to p$ be the convergent sequence. We will break up the sequence into two “groups” for which we can find a bound on $d(p_n, p_m)$: one for $m, n \geq N$ and one for $m, n \leq N$. Fix any $\varepsilon$ (for example, $\varepsilon = 1$) and take an $N$ such that $n \geq N$ implies $d(p_n, p) < \varepsilon$. By the triangle inequality, $d(p_n, p_m) < 2\varepsilon$ for $n, m \geq N$. This is the first “group.” For the second, there are only finitely many pairs with $n, m \leq N$, so $d(p_n, p_m) \leq M := \max\{d(p_n, p_m) \mid n, m \leq N\}$. Then, using the triangle inequality, we know that

$$d(p_n, p_m) \leq d(p_n, p_N) + d(p_N, p_m) \leq 2\max(M, 2\varepsilon)$$

for any $m, n$, establishing a bound.


4.2 Sequences in R

Proposition 4.7. Let $\{s_n\}$ and $\{t_n\}$ be sequences in $\mathbb{R}$, with limits $s$ and $t$ respectively. Let $c \in \mathbb{R}$ be a constant. Then:

(a) $\lim_{n \to \infty}(c s_n) = cs$ and $\lim_{n \to \infty}(c + s_n) = c + s$.

(b) $\lim_{n \to \infty}(s_n + t_n) = s + t$

(c) $\lim_{n \to \infty}(s_n t_n) = st$

(d) $\lim_{n \to \infty}(s_n / t_n) = s/t$ if $t_n \neq 0$ for all $n$ and $t \neq 0$

Proof. The proof of (a) is easy and left as an exercise. For (b), fix an $\varepsilon > 0$. We want to find an $N$ such that if $n > N$ then $|(s + t) - (s_n + t_n)| < \varepsilon$. By the triangle inequality, we have that

$$|(s + t) - (s_n + t_n)| \leq |s - s_n| + |t - t_n|.$$

By the limits we already know, we can find an $N_s$ such that $n > N_s$ implies $|s - s_n| < \frac{1}{2}\varepsilon$ and likewise an $N_t$. Then, let $N = \max(N_s, N_t)$ and we have the result.

For (c), rewrite the expression:

$$st - s_n t_n = (t - t_n)(s + s_n) + t_n s - s_n t = (t - t_n)(s + s_n) + (t_n - t)s - (s_n - s)t$$

The strategy is as follows: in each of the three additive terms, we have one factor that should “go to zero.” We also have bounds on the other factors, which guarantees that their products “go to zero.” More precisely, let $L$ be a bound on the sequence $\{s + s_n\}$. Fix $\varepsilon > 0$; we want to find an $N$ such that if $n > N$ then $|st - s_n t_n| < \varepsilon$. Applying the triangle inequality to our rewriting, we find that

$$|st - s_n t_n| \leq |L||t - t_n| + |s||t - t_n| + |t||s - s_n|$$

There is an $N$ such that for $n > N$, we have $|s - s_n| < \frac{\varepsilon}{|L| + |s| + |t|}$ and $|t - t_n| < \frac{\varepsilon}{|L| + |s| + |t|}$. Then, one finds that $|st - s_n t_n| < \varepsilon$ as desired.

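For (d), by (c) we only need to show that for a sequence $s_n \to s$ with $s_n \neq 0$ and $s \neq 0$ (we then apply this to the denominators),

$$\lim_{n \to \infty} \frac{1}{s_n} = \frac{1}{s}.$$

Fix $\varepsilon > 0$. We want to find $N$ such that for $n > N$, we have:

$$\left| \frac{1}{s_n} - \frac{1}{s} \right| = \left| \frac{s_n - s}{s_n s} \right| < \varepsilon$$

The idea is as follows: the denominator $s_n s$ tends toward $s^2$, which is a constant we can account for. We want to bound $|s_n s|$ from below in terms of $s^2$, since it is in the denominator. So, we can find $N_1$ such that $n > N_1$ implies that $2|s_n| > |s|$ (check this), and so $|s_n s| > \frac{1}{2}s^2$. Thus our expression becomes:

$$\left| \frac{s_n - s}{s_n s} \right| < \frac{2}{s^2} |s_n - s|$$

Now, we can find $N_2$ such that $n > N_2$ implies $|s_n - s| < \frac{s^2 \varepsilon}{2}$, and the result follows for $n > \max(N_1, N_2)$.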

4.3 Extended real numbers

Definition 4.8. We write

$$\lim_{n \to \infty} s_n = \infty$$

to mean that for each $M$ there is some $N$ such that $n > N$ implies that $s_n > M$. Likewise, we write

$$\lim_{n \to \infty} s_n = -\infty$$

to mean that for each $M$ there is some $N$ such that $n > N$ implies that $s_n < M$.


Remark 4.9. The “extended reals” are not a metric space, nor a field, though they do have a total order. However, the following arithmetical sentences still “make sense” for any $x \in \mathbb{R}$:

$$x + \infty = \infty, \quad x - \infty = -\infty, \quad \frac{x}{\infty} = \frac{x}{-\infty} = 0$$

if $x > 0$ then $x \cdot \infty = \infty$ and $x \cdot (-\infty) = -\infty$

if $x < 0$ then $x \cdot (-\infty) = \infty$ and $x \cdot \infty = -\infty$

The statements of Proposition 4.7 are still true in these cases.

5 Lecture 5

5.1 Compactness

Remark 5.1 (Induced metric). Let $(X, d)$ be a metric space, and let $Y \subset X$. Then, $(Y, d|_Y)$ is a metric space, where $d|_Y$ is the distance function restricted to $Y$.

Proposition 5.2. Let $(X, d)$ be a metric space, and $E \subset X$. Let $U$ be open in $X$. Then $U \cap E$ is open in $E$.

Proof. This is left as an exercise. Just check the definition, and note that open balls in $E$ are open balls in $X$ intersected with $E$.

Remark 5.3 (Openness and closedness as properties of subsets). Asking whether a set is open or closed only makes sense once we identify it as a subset of a metric space, and the answer depends on its embedding as a subset. For example, $[0, 1]$ is an open subset of the metric space $[0, 1]$, but it is not an open subset of the metric space $\mathbb{R}$. The notion we will now introduce, compactness, is intrinsic: it does not depend on any embedding.

Definition 5.4. Let $(X, d)$ be a metric space. Then, $X$ is compact if every open cover has a finite subcover. An open cover of a space $X$ is a (possibly infinite) collection of open subsets of $X$, say $\{U_i\}$, such that they cover $X$, i.e. $\bigcup U_i = X$. A finite subcover is a finite subcollection which is still a cover.

Example 5.5 (Open covers of R). There are many examples of open covers of $\mathbb{R}$. Here are some.

(1) $U_n = (-n, n)$ for $n \in \mathbb{Z}$,

(2) $U_n = (-\infty, n)$ for $n \in \mathbb{N}$,

(3) $U_n = (-\infty, \infty)$ for $n \in \mathbb{Z}$,

(4) $U_a = (a - 1, a + 1)$ for $a \in \mathbb{R}$,

(5) $U_{a,\varepsilon} = (a - \varepsilon, a + \varepsilon)$ for $a, \varepsilon \in \mathbb{R}$ with $\varepsilon > 0$.

I claim that none of these except (3) has a finite subcover. I’ll prove this for (1) and leave the rest as an exercise.

Proof. Suppose there was a finite subcover; then it would be given by a finite subset of the index set of integers, call it $I \subset \mathbb{Z}$. Check that $\bigcup_{n \in I}(-n, n) \subset (-M, M)$ where $M = \max_{n \in I}|n|$, which is not $\mathbb{R}$.

Example 5.6 (Not open covers). Let $X = (0, 1] \subset \mathbb{R}$. Then the collection $(0, \frac{1}{n})$ for $n \in \mathbb{N}$ is not an open cover, since $1 \in X$ is not in any of those open sets.

Corollary 5.7. R is not compact.

Proof. To be compact, every open cover must have a finite subcover. Since we’ve given an open cover which does

not have a finite subcover, we are done.

Remark 5.8 (R has an open cover with a finite subcover). Note that example (3) is an open cover which has a finite subcover. This does not contradict that $\mathbb{R}$ is not compact, since a single open cover without a finite subcover is enough to break compactness.

Example 5.9 (Compact intervals in R). Some more examples which we can do explicitly.

(1) $(-1, 1)$ is not compact.

(2) $[0, 1]$ is compact.

(3) $(0, \infty)$ is not compact.

(4) $[0, \infty)$ is not compact.

(5) $\mathbb{Z}$ is not compact.

(6) $\mathbb{Q} \cap [0, 1]$ is not compact.


Proof. (1) Use the open cover given by $U_n = (-1 + \frac{1}{n}, 1 - \frac{1}{n})$ for $n \in \mathbb{N}$.

(2) We will prove a general theorem later.

(3) Use the open cover $(0, n)$ for $n \in \mathbb{N}$.

(4) Use the open cover $[0, n)$ for $n \in \mathbb{N}$. Note that while these are not open in $\mathbb{R}$, they are open in $[0, \infty)$.

(5) Use the open cover $\{n\}$ for $n \in \mathbb{Z}$.

(6) This is a little trickier. Choose an irrational number $\alpha \in [0, 1]$. Let $s_n$ be a rational number in $(\alpha - \frac{1}{n}, \alpha)$. Then, take the open cover $\mathbb{Q} \cap ([0, s_n) \cup (\alpha, 1])$ for $n \in \mathbb{N}$.

Proposition 5.10. Let $(X, d)$ be a metric space and $E \subset X$. If $E$ is compact then $E$ is a closed and bounded subset of $X$.

Proof. We can do the following for any metric space. To show that $E$ is closed, let $p \in X$ with $p \notin E$; we show that $p$ is not a limit point of $E$. For every $x \in E$ (so $x \neq p$), take a neighborhood $U_x$ of $x$ which is disjoint from some neighborhood of $p$. For example, one could take $U_x = B_{d(x,p)/2}(x) \cap E$. These sets form an open cover of $E$, since together they contain every point in $E$. By compactness, take a finite subcover, say indexed by the finitely many points $x_1, \ldots, x_n \in E$. Then, notice that $\bigcup_{i=1}^n U_{x_i} = E$ is disjoint from $B_r(p)$ where $r = \min_i d(x_i, p)/2$, so $p$ is not a limit point. Thus $E$ contains all of its limit points.

For boundedness, let $x \in E$ be any point, and consider the open cover $U_n = B_n(x) \cap E$ for $n \in \mathbb{N}$. By compactness this has a finite subcover; let $N$ be the maximum $n$ in this subcover. Then $E \subset B_N(x)$, so $E$ is bounded.

Remark 5.11. It is a little remarkable that an intrinsic property of a space tells us a property about every

embedding of that space.

Proposition 5.12. Let X be a compact metric space. A closed subset of X is compact.

Proof. Let $E \subset X$ be closed. Take an open cover $\{U_i\}$ of $E$. First, we claim (and will prove below) that an open set of $E$ can be written as the intersection of $E$ with an open set of $X$. Given this claim, for each $U_i$ take $V_i \subset X$ open such that $U_i = V_i \cap E$. Now take the open cover $\{V_i\} \cup \{X - E\}$ of $X$. Since $X$ is compact, this has a finite subcover; if $X - E$ is in the finite subcover, remove it, and intersect the remaining $V_i$ with $E$ to obtain a finite subcover of $E$.

Lemma 5.13. Let $(X, d)$ be a metric space and let $E \subset X$ be a subset considered as a metric space. Let $U$ be an open set of $E$. Then there is an open set $V$ of $X$ such that $U = E \cap V$.

Proof. Recall that every open set can be written as the union of open balls. Take the same open balls (same centers and radii) but in $X$ instead, and let $V$ be their union.

Proposition 5.14. Any finite union of compact subsets is compact.

Proof. Left as an exercise.

6 Lecture 6

6.1 Compactness in Rk

We will make the following definition, which will only be used in this section.

Definition 6.1. A $k$-cell is a subset of $\mathbb{R}^k$ of the form

$$[a_1, b_1] \times [a_2, b_2] \times \cdots \times [a_k, b_k]$$

i.e. a “box” with given vertices.

Lemma 6.2 (Cantor intersection lemma for $\mathbb{R}^k$). Let $E_0 \supset E_1 \supset E_2 \supset \cdots$ be a descending chain of nested nonempty $k$-cells of $\mathbb{R}^k$. Then, $\bigcap_i E_i$ is nonempty.


Proof. First, we prove this for $\mathbb{R}^1$. Consider a sequence where $E_i = [a_i, b_i]$. To be nested means $a_1 \leq a_2 \leq \cdots$ and $b_1 \geq b_2 \geq \cdots$. Let $a = \sup(a_1, a_2, \ldots)$ and $b = \inf(b_1, b_2, \ldots)$. We claim that $a \leq b$. Then any $x \in [a, b]$ would suffice. To prove the claim, suppose that $a > b$; let $x \in (b, a)$. Then there is some $n$ for which $x < a_n$ and also some $m$ for which $x > b_m$. Then, $b_{\max(m,n)} < x < a_{\max(m,n)}$, which cannot happen since each interval is nonempty. To prove this for $\mathbb{R}^k$, one makes a similar argument for each component of the $k$-tuple; this is left as an exercise.

Theorem 6.3 (Heine-Borel theorem). A subset of $\mathbb{R}^n$ is compact if and only if it is closed and bounded.

Proof. We’ve already proven one direction. Now let $X \subset \mathbb{R}^n$ be closed and bounded. Since $X$ is bounded, it is contained in a closed box, $T_0 := [-N, N]^n$ for large $N$. Since a closed subset of a compact set is compact, it suffices to show that this box is compact.

Fix an open cover of the box $T_0$. If one cuts this box in half in all possible directions, one obtains $2^n$ smaller boxes with half the side length of the original. If every one of the $2^n$ half-boxes had a finite subcover, then $T_0$ would have a finite subcover (by taking the finite union of the finite subcovers). So, suppose otherwise, and choose a box $T_1$ which does not have a finite subcover. We can repeat this process to obtain a descending chain of nested boxes:

$$T_0 \supset T_1 \supset T_2 \supset \cdots.$$

By the previous lemma, the intersection $\bigcap_i T_i$ is nonempty and contains a point $p$. This point $p$ must be in one of the open sets of the cover, say $U$, and thus there is some ball $B_\varepsilon(p) \subset U$. But, since the side lengths are halved at each step, this ball contains all $T_i$ for $i > M$ for some large $M$, so each such $T_i$ is covered by the single set $U$, contradicting that each of the $T_i$ does not have a finite subcover. This proves the theorem.

Example 6.4 (Not true for all metric spaces). The Heine-Borel theorem is not true for all metric spaces. For

example, if X is an infinite metric space with the discrete topology, then a subset is compact if and only if it is

finite. However, every subset is closed and bounded.

Remark 6.5. This theorem might convince you that compactness is a redundant notion. In the following sections

we will see that this is very untrue, if we want to work in the general context of metric spaces or topological spaces.

7 Lecture 7

7.1 Subsequences

I will not give a definition of subsequences, and convergence of a subsequence. It is exactly what you think it is.

However, we will make a few definitions anyway.

Definition 7.1. A sequence $s$ in $\mathbb{R}$ is monotonically increasing if $s_i \leq s_{i+1}$ for all $i$. It is monotonically decreasing if $s_i \geq s_{i+1}$ for all $i$.

Proposition 7.2. All bounded monotone sequences converge.

Proof. Suppose $s_n$ is monotonically increasing. We claim $s_n$ converges to $s := \sup(\{s_n\})$. For $\varepsilon > 0$, there is some $s_N \in (s - \varepsilon, s]$ by definition of supremum. Since the sequence is monotonic and bounded above by $s$, the same is true of $s_n$ for all $n > N$.

Proposition 7.3. Every sequence has a monotone subsequence.

Proof. Let $s_n$ be a sequence. We say the $k$th term, $s_k$, is dominant if $s_k > s_i$ for all $i > k$, i.e. bigger than all

its subsequent terms. There are two cases to consider. In the first case, there are infinitely many dominant terms;

then, the subsequence consisting of dominant terms is a monotonically decreasing sequence. In the second case,

there are only finitely many. Then, in choosing a subsequence, we follow the rule: never choose a dominant term.

In doing so, we can always choose some term in the sequence following any choice which is bigger, obtaining a

monotonically increasing sequence.
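The notion of a dominant term can be tried out on a finite list. The following is a minimal sketch, assuming we truncate the sequence to finitely many terms (so the last term is always dominant, vacuously); the function name `dominant_indices` is my own and not part of these notes.

```python
def dominant_indices(s):
    """Indices k such that s[k] > s[i] for every later index i in the list."""
    result = []
    running_max = float("-inf")
    # Scan from the right: a term is dominant exactly when it beats
    # the maximum of everything that comes after it.
    for k in range(len(s) - 1, -1, -1):
        if s[k] > running_max:
            result.append(k)
            running_max = s[k]
    result.reverse()
    return result


s = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3]
idx = dominant_indices(s)
peaks = [s[k] for k in idx]  # the dominant terms, in order of appearance
```

By construction the dominant terms form a strictly decreasing list, which is the first case of the proof in miniature.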

Theorem 7.4 (Bolzano-Weierstrass for R). Every bounded sequence has a convergent subsequence.

Proposition 7.5 (Bolzano-Weierstrass, generalization). Let X be a compact metric space. Every sequence has a

convergent subsequence.


Proof. Let $\{x_n\}$ be the sequence, and let $E = \{x_1, x_2, \ldots\}$, i.e. the set of points in the sequence. If $E$ is finite then there must be a point that recurs infinitely many times, and the corresponding constant subsequence converges. If $E$ is infinite, then we claim it has a limit point, say $p$. Given this claim, we can choose successive elements in smaller and smaller balls $B_{1/n}(p)$ converging to $p$. More precisely, since $p$ is a limit point of $E$, every ball $B_{1/n}(p)$ contains infinitely many points of $E$, and hence $x_i \in B_{1/n}(p)$ for infinitely many indices $i$. Choose successively $i_n$ such that $i_n > i_{n-1}$ and $x_{i_n} \in B_{1/n}(p)$ to obtain the subsequence.

Lemma 7.6. If E is an infinite subset of a compact metric space X, then E has a limit point in X.

Proof. Suppose to the contrary; if no point in $X$ is a limit point of $E$, every point $x \in X$ has a neighborhood which contains at most one point of $E$ (namely, the point $x$ itself if $x \in E$). This gives an open cover of $X$, but it cannot have a finite subcover: each open set contains at most one point of $E$, so finitely many of them cannot contain the infinite set $E$.

Remark 7.7. Note that this proposition is not true for just closed and bounded subsets. For example, take $X = \mathbb{Z}$ under the discrete topology, so that it is closed and bounded but not compact. The sequence $s_n = n$ has no

convergent subsequence. So this is another piece of evidence in favor of compactness as a “good notion.”

Definition 7.8. The limit supremum of a sequence $s_n$ is defined as

$$\limsup_{n \to \infty} s_n = \lim_{n \to \infty} \sup(s_n, s_{n+1}, s_{n+2}, \ldots)$$

The limit infimum is defined similarly

$$\liminf_{n \to \infty} s_n = \lim_{n \to \infty} \inf(s_n, s_{n+1}, s_{n+2}, \ldots)$$
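To get a feel for the definition, one can compute the suprema of the tails for a finite truncation of a sequence. This is only a sketch: a finite list cannot capture a limit, and the helper name `tail_sups` and the example sequence $s_n = (-1)^n(1 + 1/n)$ are my own choices for illustration.

```python
def tail_sups(s):
    """Return the list whose n-th entry is max(s[n:]), via one backward pass."""
    sups = []
    running = float("-inf")
    for x in reversed(s):
        running = max(running, x)
        sups.append(running)
    sups.reverse()
    return sups


# Hypothetical example: s_n = (-1)^n (1 + 1/n) for n = 1, ..., 1000.
s = [(-1) ** n * (1 + 1 / n) for n in range(1, 1001)]
sups = tail_sups(s)  # nonincreasing, and creeping down toward 1
```

The tail suprema are nonincreasing, which is why the limit in the definition exists (possibly as an extended real number).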

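Proposition 7.9 (Characterization of limit supremum). Let $s_n$ be a bounded sequence. Let $E$ be the set of subsequential limits. Then $\limsup s_n$ exists and is equal to $\sup(E)$, and $\liminf s_n$ exists and is equal to $\inf(E)$.

Proof. I will prove the statement for $\limsup$. Since $s_n$ is bounded, $E$ is nonempty (by Bolzano-Weierstrass) and bounded above, so $\sup(E)$ exists. Let $a_n = \sup(s_n, s_{n+1}, \ldots)$; this is a bounded, monotonically decreasing sequence, so it converges to some $a$, which is $\limsup s_n$. Any subsequential limit is at most $a_n$ for every $n$, hence at most $a$, so $\sup(E) \leq a$. Conversely, choose indices $n_1 < n_2 < \cdots$ with $n_k > n_{k-1}$ and $s_{n_k} > a_{n_{k-1}+1} - \frac{1}{k}$, which is possible by definition of supremum. Then $a_{n_{k-1}+1} - \frac{1}{k} < s_{n_k} \leq a_{n_{k-1}+1}$, so $s_{n_k} \to a$. Thus $a \in E$ and $a \leq \sup(E)$.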

Proposition 7.10. For any sequence $s_n$ we have that $\liminf s_n \leq \limsup s_n$. If they are equal, then the limit exists and $\lim_{n \to \infty} s_n = \liminf s_n = \limsup s_n$. Conversely, if the limit exists, then $\lim_{n \to \infty} s_n = \liminf s_n = \limsup s_n$.

Proof. The first statement is left as an exercise. For the second, let $s = \limsup s_n = \liminf s_n$. Fix $\varepsilon > 0$. There is an $N$ such that for $n > N$, we have that $\sup(s_n, s_{n+1}, \ldots) \in (s - \varepsilon, s + \varepsilon)$. Thus, $s_n < s + \varepsilon$ for all $n > N$. A similar argument with the infimum shows that $s - \varepsilon < s_n$ for all $n > N$ (after enlarging $N$ if necessary). Thus, $|s_n - s| < \varepsilon$ for $n > N$. For the third statement, let $s$ be the limit; for every $\varepsilon > 0$, there is some $N$ such that for $n > N$ we have $s_n \in (s - \varepsilon, s + \varepsilon)$. Thus, $\limsup s_n \leq s + \varepsilon$ and $\liminf s_n \geq s - \varepsilon$ for every $\varepsilon$, and the result follows.

Corollary 7.11 (Application to sequences: “squeezing”). Let $s_n, t_n$ be sequences in $\mathbb{R}$ that converge to $s, t$, such that $s_n \leq t_n$ for $n > N$ for some $N$. Then, $\lim s_n \leq \lim t_n$.

Proof. One can easily check that $\limsup s_n \leq \limsup t_n$, since the same inequality holds for the suprema of the tails. The rest follows by the previous proposition.

7.2 Cauchy sequences, complete metric spaces

Example 7.12. Let $X = \mathbb{R} - \{0\}$ be a metric space. The sequence $\{\frac{1}{n}\}$ does not converge, because $0 \notin X$. But it “looks like” a convergent sequence. We want to make a definition that captures this phenomenon.

Definition 7.13. A sequence $\{p_n\}$ is Cauchy if for every $\varepsilon > 0$ there is an integer $N$ such that $d(p_n, p_m) < \varepsilon$ for $m, n > N$. A metric space for which every Cauchy sequence is convergent is called complete.

Remark 7.14. As we saw above, not every Cauchy sequence converges; in our example this was because there was a “hole.”

Proposition 7.15. Cauchy sequences are bounded

Proof. Left as an exercise.


Proposition 7.16. Every convergent sequence is Cauchy.

Proof. Suppose $s_n$ converges to $s$. Then, for every $\varepsilon > 0$ there is an $N$ such that $d(s_n, s) < \frac{\varepsilon}{2}$ for $n > N$. Then, for $m, n > N$, $d(s_n, s_m) \leq d(s_n, s) + d(s_m, s) < \varepsilon$.

Proposition 7.17 (Cantor intersection lemma). Let $E_0 \supset E_1 \supset E_2 \supset \cdots$ be a descending chain of nonempty compact subsets of a metric space $X$. Then, $\bigcap_i E_i$ is nonempty. Further, define the diameter of a set $E$ by

$$\operatorname{diam}(E) := \sup(\{d(x, y) \mid x, y \in E\}).$$

If $\lim_{i \to \infty} \operatorname{diam}(E_i) = 0$, then $\bigcap_i E_i$ consists of exactly one point.

Proof. For the first statement, if the intersection is empty, then $\{E_0 - E_i\}_{i \in \mathbb{N}}$ is an open cover of $E_0$ (open in $E_0$ because $E_i$ is compact and thus closed). It has a finite subcover, since $E_0$ is compact. Since the sets $E_0 - E_i$ increase with $i$, the union of this finite subcover is $E_0 - E_k$ for the largest index $k$ that appears, so $E_0 - E_k = E_0$, i.e. $E_k$ is empty. This contradicts that the $E_i$ are nonempty, so the intersection must be nonempty. For the second statement, if the intersection contained two distinct points, say $x, y$, then $\operatorname{diam}(E_i) \geq d(x, y) > 0$ for all $i$, contradicting that the diameters tend to zero.

Proposition 7.18. Every compact space is complete.

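Proof. Let $s_i$ be a Cauchy sequence. Define $E_i = \overline{\{s_i, s_{i+1}, \ldots\}}$, in the notation of the previous proposition; each $E_i$ is a closed subset of a compact space and hence compact, and taking the closure does not change the diameter. Since the sequence is Cauchy, $\lim_{n \to \infty} \operatorname{diam}(E_n) = 0$. Thus there is a single point in the intersection, say $s$. I claim this is the limit (the “hard part” is over, since the failure for a Cauchy sequence to converge is somehow due to the point it should converge to “not being there”). To see this, fix $\varepsilon > 0$; if $\operatorname{diam}(E_N) < \varepsilon$ then, since both $s_n$ and $s$ lie in $E_N$ for $n \geq N$, this implies that $d(s_n, s) < \varepsilon$ for all $n \geq N$.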

Corollary 7.19. $\mathbb{R}^k$ is complete.

Proof. Consider a Cauchy sequence; it is bounded (check as exercise) so we may enclose it in a bounded and closed box. This box is compact by Heine-Borel, hence complete by the previous proposition, and thus the Cauchy sequence has a limit.

Definition 7.20. Let $(X, d)$ be a metric space. We will define the completion of $X$, denoted $\overline{X}$. As a set, the elements of $\overline{X}$ are Cauchy sequences (indexed by $\mathbb{N}$) in $X$, modulo an equivalence relation which we will define below. For now, define the distance function between two Cauchy sequences by:

$$d(x, y) = \lim_{n \to \infty} d(x_n, y_n)$$

Check as an exercise that this is well-defined since the sequences are Cauchy. Then, two sequences are equivalent, $x \sim y$, if $d(x, y) = 0$. This distance function is also well-defined on equivalence classes and satisfies the axioms of a distance function.

7.3 Aside: Construction of the real numbers by completion

In this section we will backtrack and pretend we don’t know what R is. We will treat Q as a Q-metric space. This

will be the only time in these notes that we think of distances as taking values in any field other than R. We will

use Cauchy sequences in Q to construct R.

Definition 7.21. The set of real numbers is defined to be the set of Cauchy sequences in $\mathbb{Q}$ (using the absolute value metric) modulo the equivalence relation where $x \sim y$ if $\lim_{n \to \infty} (x_n - y_n) = 0$. Addition and multiplication are defined on Cauchy sequences by $(x + y)_n = x_n + y_n$ and likewise. The additive identity is the constant zero sequence, and the multiplicative identity is the constant one sequence. The order is defined by $x \leq y$ if $x \sim y$ or if there is some $N$ such that $x_n < y_n$ for all $n > N$. Finally, the rational numbers are considered as a subset of $\mathbb{R}$ by taking constant sequences.


Proof. Check as an exercise that all operations and the order are well-defined and that the field axioms hold. One of these is assigned as homework.

The upper bound property is a little more work. Let $S$ be a subset of real numbers, and let $u_0$ be an upper bound of $S$. Choose $\ell_0$ such that there is some $s \in (\ell_0, u_0) \cap S$. One should quickly check that this can be done: take some $s \in S$ and find the $N$ such that $m, n > N$ implies $|s_n - s_m| < 1$, and we can take $\ell_0 = s_n - 2$ for any fixed $n > N$. Now, repeat the following process to construct a sequence: consider $m_n = \frac{\ell_{n-1} + u_{n-1}}{2}$. If this is an upper bound for $S$, then set $u_n = m_n$ and $\ell_n = \ell_{n-1}$. Otherwise, set $u_n = u_{n-1}$ and $\ell_n = m_n$. This defines two sequences, $u_n$ and $\ell_n$. These are both Cauchy sequences; I will leave this as an exercise.

First, notice that every $u_n$ is an upper bound on $S$ by construction. Thus the sequence $u$ thought of as an element of $\mathbb{R}$ is an upper bound on $S$ by definition. I claim that $u$ is the supremum. To see this, suppose there is a smaller upper bound, $u'$. This means that $\lim_{n \to \infty} (u_n - u'_n) = a$ for some positive number $a$. Notice that the quantity $u_n - \ell_n$ is cut in half each time we increase $n$ by one. Also notice that by construction, there is always an element in $S$ greater than $\ell_n$. Thus, when $u_n - \ell_n < a$, we find that this contradicts that $u'$ is an upper bound.
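The halving construction in this proof is the bisection method. Here is a minimal sketch using exact rational arithmetic, under the assumption that we are handed a test `is_upper_bound` for the set $S$ (a hypothetical oracle, not something the proof provides); the example set is $S = \{x \in \mathbb{Q} \mid x^2 < 2\}$, whose supremum is not rational.

```python
from fractions import Fraction


def bisect_sup(is_upper_bound, lower, upper, steps):
    """Halve [lower, upper] repeatedly, keeping `upper` an upper bound of S
    and `lower` not an upper bound of S."""
    for _ in range(steps):
        mid = (lower + upper) / 2
        if is_upper_bound(mid):
            upper = mid
        else:
            lower = mid
    return lower, upper


# S = {x in Q : x^2 < 2}; a positive rational m bounds S iff m^2 >= 2.
lo, hi = bisect_sup(lambda m: m > 0 and m * m >= 2,
                    Fraction(1), Fraction(2), 20)
```

After 20 steps the gap `hi - lo` is exactly $2^{-20}$, matching the remark that $u_n - \ell_n$ is cut in half at each step.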

8 Lecture 8

8.1 Taking powers in the real numbers

We haven’t actually defined what it means to take powers in the positive real numbers to numbers other than integers. We will first define fractional powers, which we could have done earlier as soon as we defined Dedekind cuts.

Definition 8.1. Let $\alpha \in \mathbb{R}$ with $\alpha > 1$ and $p \in \mathbb{N}$. We define $\alpha^{1/p}$ to be the unique positive real number $x$ such that $x^p = \alpha$. Constructively, this is the Dedekind cut:

$$\{x \in \mathbb{Q} \mid x \leq 1 \text{ or } x^p < \alpha\}.$$

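Now, let $p \in \mathbb{R}$ be positive, and $\alpha > 1$. We define $\alpha^p$ to be the Dedekind cut:

$$\{x \in \mathbb{Q} \mid x \leq 1 \text{ or } x < \alpha^q \text{ for some rational } 0 < q < p\},$$

where for rational $q = \frac{m}{n}$ we set $\alpha^q = (\alpha^{1/n})^m$. For the case $0 < \alpha < 1$, one uses that $\alpha^{-1} > 1$. One does have to prove that the cut for $\alpha^{1/p}$ does have the given property, and that it is unique in this sense. The first is left as an exercise, and the second follows from the order axioms.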

We won’t prove the following.

Proposition 8.2. For $a, b > 0$ and $p$ real numbers, using the above definitions, we have that $(ab)^p = a^p b^p$.

8.2 “Toolbox” sequences

Recall:

Theorem 8.3 (Binomial theorem). Let $n \in \mathbb{N}$. Then,

$$(1 + x)^n = 1 + nx + \binom{n}{2} x^2 + \cdots + n x^{n-1} + x^n$$

Proposition 8.4. The following are useful sequences to know.

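(a) If $p > 0$, then $\lim_{n \to \infty} \frac{1}{n^p} = 0$.

(b) If $p > 0$, then $\lim_{n \to \infty} \sqrt[n]{p} = 1$.

(c) $\lim_{n \to \infty} \sqrt[n]{n} = 1$.

(d) If $p > 0$ and $\alpha$ is real, then $\lim_{n \to \infty} \frac{n^\alpha}{(1 + p)^n} = 0$.

(e) If $|x| < 1$, then $\lim_{n \to \infty} x^n = 0$.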

Proof. For (a), we want to verify that we can make $\frac{1}{n^p} < \varepsilon$ for large $n$. Doing some rearranging, we want $n > (\frac{1}{\varepsilon})^{1/p}$. Here we use two facts: that $0 < x < y$ implies $0 < \frac{1}{y} < \frac{1}{x}$ and that the function $f(x) = x^\alpha$ is monotonically increasing for $\alpha > 0$ (i.e. $x \leq y$ implies $x^\alpha \leq y^\alpha$).


For (b), we look at three cases: $p > 1$, $p = 1$, and $0 < p < 1$. If $p > 1$, then $\sqrt[n]{p} > 1$ and take $x_n = \sqrt[n]{p} - 1 > 0$. By the binomial theorem:

$$1 + n x_n \leq 1 + n x_n + \binom{n}{2} x_n^2 + \cdots = (1 + x_n)^n = p$$

So:

$$0 < x_n \leq \frac{p - 1}{n}$$

Thus, $x_n \to 0$ so $\sqrt[n]{p} \to 1$. If $p = 1$ the statement is trivial. If $0 < p < 1$ then we take reciprocals and proceed.

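For (c), proceed similarly as (b) but let $x_n = \sqrt[n]{n} - 1$. We take a different term in the binomial theorem:

$$\binom{n}{2} x_n^2 \leq (1 + x_n)^n = n$$

and find that for $n \geq 2$

$$0 \leq x_n \leq \sqrt{\frac{2}{n - 1}}$$

so

$$0 \leq x_{n+1} \leq \sqrt{2} \left( \frac{1}{n} \right)^{1/2}$$

and use part (a).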

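For (d), let $k$ be a positive integer such that $k > \alpha$ (we will need to take binomial terms up to $k$ to dominate $\alpha$). For $n > 2k$,

$$\frac{n^k p^k}{2^k k!} < \frac{n(n - 1) \cdots (n - k + 1)}{k!} p^k = \binom{n}{k} p^k \leq (1 + p)^n$$

and so

$$0 < \frac{n^\alpha}{(1 + p)^n} < \frac{2^k k!}{p^k} \left( \frac{1}{n} \right)^{k - \alpha}$$

and use part (a).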

For (e), the case $x = 0$ is trivial; otherwise take $\alpha = 0$ and $1 + p = \frac{1}{|x|}$ in (d).
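As a sanity check on part (c), one can test numerically the bound $0 \leq \sqrt[n]{n} - 1 \leq \sqrt{2/(n-1)}$ that the binomial theorem argument produces. This is a floating-point sketch only, not a proof, and the helper name `nth_root_excess` is my own.

```python
import math


def nth_root_excess(n):
    """x_n = n**(1/n) - 1, the quantity bounded in part (c)."""
    return n ** (1.0 / n) - 1.0


# Check 0 <= x_n <= sqrt(2 / (n - 1)) for n = 2, ..., 1000.
bound_holds = all(
    0.0 <= nth_root_excess(n) <= math.sqrt(2.0 / (n - 1))
    for n in range(2, 1001)
)
```

The bound is far from sharp, but it is enough to squeeze $x_n$ to zero.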

9 Lecture 9

9.1 Series

Definition 9.1. Let $\{s_n\}$ be a sequence in $\mathbb{R}$. The $n$th partial sum is

$$S_n := \sum_{i=1}^{n} s_i$$

and we write

$$\sum_{n=1}^{\infty} s_n := \lim_{n \to \infty} S_n$$

to be the series of $s_n$.

Remark 9.2. Much of our study of series will be to find necessary and sufficient conditions for convergence in

different situations, based on our knowledge of sequences. This is analogous to studying integrals in terms of

functions.

The following is an easy but useful criterion for the convergence of series. It is essentially the Cauchy criterion.


Proposition 9.3. The series for $s_n$ converges if and only if for every $\varepsilon > 0$ there is an integer $N$ such that $m \geq n > N$ implies that

$$\left| \sum_{i=n}^{m} s_i \right| < \varepsilon$$

Remark 9.4. Note that while this seems like an easy result, we had to do a lot of work to prove that Cauchy

sequences converge. In general it’s harder for us to identify the exact number that a series converges to versus a

sequence (maybe in the same way it’s hard to take integrals), so the Cauchy criterion is really useful for series.

Corollary 9.5. If $\sum s_n$ converges, then $s_n \to 0$.

Proof. Take $n = m$ above.

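Example 9.6 (Counterexample to converse). Consider $s_n = \frac{1}{n}$, which has $\lim_{n \to \infty} s_n = 0$. The corresponding series does not converge. To see why, consider “grouping” the sums as follows:

$$1 + \frac{1}{2} + \left( \frac{1}{3} + \frac{1}{4} \right) + \left( \frac{1}{5} + \frac{1}{6} + \frac{1}{7} + \frac{1}{8} \right) + \cdots > 1 + \frac{1}{2} + \frac{1}{2} + \frac{1}{2} + \cdots$$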

Note that in this argument I am not rearranging the series or changing it any way. Rearranging the terms in series

must be done delicately, as we will show later.
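The grouping argument says that the partial sums satisfy $S_{2^k} \geq 1 + \frac{k}{2}$. A short sketch with exact fractions confirms this for small $k$; the function name `harmonic_partial_sum` is hypothetical and only for illustration.

```python
from fractions import Fraction


def harmonic_partial_sum(n):
    """S_n = 1 + 1/2 + ... + 1/n, computed exactly."""
    return sum(Fraction(1, i) for i in range(1, n + 1))


# Compare S_{2^k} with the lower bound 1 + k/2 from the grouping argument.
checks = [
    (k, harmonic_partial_sum(2 ** k), 1 + Fraction(k, 2))
    for k in range(8)
]
grouping_bound_holds = all(S >= bound for _, S, bound in checks)
```

Since the lower bound $1 + \frac{k}{2}$ is unbounded, so are the partial sums.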

Proposition 9.7 (Comparison test). If $|a_n| \leq c_n$ for $n > N_0$ (some $N_0$), and $\sum c_n$ converges, then $\sum a_n$ converges. If $a_n \geq d_n \geq 0$ for $n > N_0$, and $\sum d_n$ diverges, then $\sum a_n$ diverges.

Proof. Note that this is a little deeper than a simple application of the comparison of sequences result we proved earlier, since there is no mention of a lower bound converging to the same number. Fix $\varepsilon > 0$ and let $N \geq N_0$ be such that $m \geq n > N$ implies $|\sum_{i=n}^{m} c_i| < \varepsilon$. Then we have

$$\left| \sum_{i=n}^{m} a_i \right| \leq \sum_{i=n}^{m} |a_i| \leq \sum_{i=n}^{m} c_i < \varepsilon.$$

The second statement follows from comparison of sequences. Note that since all the $d_n$ terms are nonnegative, the partial sums form a monotonically increasing sequence, and since it diverges it must in fact diverge to infinity. Since $a_n \geq d_n$, the series for $a_n$ must also diverge to infinity.

Example 9.8. Note that it is important for the above $c_n$ to have only positive terms, since it must serve as an “overestimate” of a convergent series. For example, if we take $c_n = (-1)^n \frac{1}{n}$, we find that $\sum c_n$ converges, whereas $\sum \frac{1}{n}$ does not. Likewise, it is important for $d_n$ and $a_n$ to be entirely positive or entirely negative.

9.2 Adding, regrouping series

Proposition 9.9. Let $\sum a_n = A$ and $\sum b_n = B$. Then, $\sum (a_n + b_n) = A + B$ and $\sum c a_n = cA$ for any $c$.

Proof. Straightforward, left as exercise.

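Definition 9.10. A regrouping of a series $\sum a_n$ is defined as a subsequence of the sequence of partial sums $S_n = \sum_{i=1}^{n} a_i$. This is best exhibited by example. Recall our regrouping of $a_n = \frac{1}{n}$:

$$1 + \frac{1}{2} + \left( \frac{1}{3} + \frac{1}{4} \right) + \left( \frac{1}{5} + \frac{1}{6} + \frac{1}{7} + \frac{1}{8} \right) + \cdots > 1 + \frac{1}{2} + \frac{1}{2} + \frac{1}{2} + \cdots$$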

This corresponds to taking the partial sums:

S1, S2, S4, S8, S16, . . .

The way to think of this is that we want to treat the parentheses as one “group” and ignore any intermediate terms

one might obtain by only summing some part. Note that this only allows for finitely many terms in each group.


Proposition 9.11. Let $\sum a_i$ be a series of non-negative terms. Then any regrouping of the series has the same behavior as the series itself.

Proof. Since the terms are nonnegative, the sequence of partial sums is monotonically increasing, and thus its set of subsequential limits (in the extended reals) has exactly one point. Every regrouping is a subsequence of the partial sums, so it has the same limit.

Example 9.12. Let $a_n = (-1)^n$ (indexed from $n = 0$). Note that the following regrouping:

$$(1 + (-1)) + (1 + (-1)) + (1 + (-1)) + \cdots$$

would change the behavior of this series, since it makes this divergent series converge to zero.
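Viewing a regrouping as a subsequence of partial sums makes this example easy to see concretely. The sketch below assumes the indexing $n = 0, 1, 2, \ldots$ and truncates to ten terms; the variable names are my own.

```python
from itertools import accumulate

# Terms a_n = (-1)^n for n = 0, ..., 9: 1, -1, 1, -1, ...
a = [(-1) ** n for n in range(10)]

# Partial sums oscillate between 1 and 0, so the series diverges.
partial = list(accumulate(a))

# The regrouping (1 + -1) + (1 + -1) + ... keeps every second partial sum.
regrouped = partial[1::2]
```

The full list of partial sums oscillates, while the subsequence picked out by the regrouping is constantly zero.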

Definition 9.13. Let $\sigma : \mathbb{N} \to \mathbb{N}$ be a bijection (an injective and surjective function). Then the rearrangement of the sequence $\{a_n\}$ corresponding to $\sigma$ is given by:

$$(a_\sigma)_k = a_{\sigma(k)}$$

Note that I do not think this notation is standard.

Proposition 9.14. Suppose $\sigma$ fixes all but finitely many numbers, i.e. $\sigma(n) = n$ for all but finitely many $n$. Then, the convergence behavior and also the limits of the two series $\sum (a_\sigma)_k$ and $\sum a_k$ are the same.

Proof. This is left as an exercise.

9.3 “Toolbox” series

Of course, we need some series to compare to.

Proposition 9.15 (Geometric series). If $x \in [0, 1)$ then

$$\sum_{n=0}^{\infty} x^n = \frac{1}{1 - x}$$

and if $x \geq 1$ then the series diverges.

Proof. We can compute by induction the partial sums:

$$\sum_{n=0}^{k} x^n = \frac{1 - x^{k+1}}{1 - x}$$

and use previous results on sequences. For $x = 1$ we see immediately it diverges, and we can use the comparison test for other cases.
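The closed form for the partial sums can be checked exactly with rational arithmetic. This is a small sketch for one choice of $x$ and $k$ (both picked arbitrarily for illustration), not a substitute for the induction.

```python
from fractions import Fraction


def geometric_partial_sum(x, k):
    """Sum of x**n for n = 0, ..., k, computed term by term."""
    return sum(x ** n for n in range(k + 1))


x = Fraction(1, 3)
k = 10
direct = geometric_partial_sum(x, k)
closed_form = (1 - x ** (k + 1)) / (1 - x)

# Distance from the partial sum to the limit 1 / (1 - x).
gap_to_limit = 1 / (1 - x) - direct
```

The gap to the limit is $\frac{x^{k+1}}{1 - x}$, which tends to zero by the toolbox sequences.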

Proposition 9.16 (Negative power series). $\sum_n \frac{1}{n^p}$ converges if $p > 1$ and diverges if $p \leq 1$.

Proof. We have shown divergence for $p = 1$ and the comparison test takes care of $p < 1$. For $p > 1$ we will use a similar technique as for $p = 1$, i.e. we will group:

$$1 + \left( \frac{1}{2^p} + \frac{1}{3^p} \right) + \left( \frac{1}{4^p} + \frac{1}{5^p} + \frac{1}{6^p} + \frac{1}{7^p} \right) + \left( \frac{1}{8^p} + \cdots \right) + \cdots \leq 1 + \frac{2}{2^p} + \frac{4}{4^p} + \frac{8}{8^p} + \cdots$$

The right-hand side is a geometric series with ratio $\frac{1}{2^{p-1}}$, which is less than 1 if $p - 1 > 0$, and the result follows.

Proposition 9.17 ($n \log(n)$). If $p > 1$ then

$$\sum_{n=2}^{\infty} \frac{1}{n (\log(n))^p}$$

converges. If $p \leq 1$ the series diverges.

Proof. This is left as an exercise; use the grouping technique.


10 Lecture 10

10.1 Root and ratio tests

Theorem 10.1 (Root test). Consider a sequence $a_n$ and let $\alpha = \limsup \sqrt[n]{|a_n|}$. Then,

(a) if $\alpha < 1$ then $\sum a_n$ converges,

(b) if $\alpha > 1$ then $\sum a_n$ diverges.

Remark 10.2. Note that if $\alpha = 1$ we cannot conclude anything. For example, $\sum \frac{1}{n}$ diverges, but $\sum \frac{1}{n^2}$ converges, but they both have $\alpha = 1$.

Proof. If $\alpha < 1$, then there is some $\varepsilon > 0$ such that $\alpha + \varepsilon < 1$. By definition of $\limsup$, there is some $N$ such that $\sup\{\sqrt[N]{|a_N|}, \sqrt[N+1]{|a_{N+1}|}, \ldots\} < \alpha + \varepsilon < 1$. In other words, $|a_n| < (\alpha + \varepsilon)^n$ for every $n \geq N$. Thus, $\sum_{n=N}^{\infty} |a_n|$ converges by the comparison test with a geometric series, and hence so does $\sum a_n$.

If $\alpha > 1$, then take $\varepsilon = \frac{\alpha - 1}{2}$ and let $\gamma = \alpha - \varepsilon$. For every $N$ we have $\sup\{\sqrt[N]{|a_N|}, \sqrt[N+1]{|a_{N+1}|}, \ldots\} \geq \alpha > \gamma > 1$. Thus we have a subsequence of $|a_n|$ whose terms are all greater than 1, so the sequence $a_n$ cannot converge to zero.

Note that this argument fails when $\alpha = 1$; in this case we have $\varepsilon = 0$. Also note that we cannot use the comparison test and geometric series in analogy to the $\alpha < 1$ case, due to subtle differences in the comparison test for divergence requiring all terms to be positive or all terms to be negative. If $\alpha = 1$, note that $\sum \frac{1}{n}$ diverges but $\sum \frac{1}{n^2}$ converges. Thus the test cannot be conclusive in this case.

Theorem 10.3 (Ratio test). The series $\sum a_n$:

(a) converges if $\limsup \left| \frac{a_{n+1}}{a_n} \right| < 1$

(b) diverges if $\left| \frac{a_{n+1}}{a_n} \right| \geq 1$ for all $n > N_0$ for some fixed $N_0$

Proof. For (a), fix some $\beta \in (\limsup |\frac{a_{n+1}}{a_n}|, 1)$; there is some $N$ such that if $n \geq N$ we have that $|\frac{a_{n+1}}{a_n}| < \beta$. Thus, we find that:

$$|a_{N+1}| < \beta |a_N|$$

and inductively we find

$$|a_{N+k}| < \beta^k |a_N|$$

i.e.

$$|a_n| < \beta^{n-N} |a_N|$$

for $n > N$. Since $\sum \beta^n$ converges, by the comparison test, $\sum a_n$ converges. For (b), check that $a_n$ cannot tend to zero.

Remark 10.4 (Similarities and differences). The techniques for both tests are relatively similar: for convergence we use comparison with a geometric series, and for divergence we show that the sequence does not tend to zero. However, there is a difference in divergence for the ratio test, which insists that the ratio must be greater than or equal to one for all terms past a point, whereas the root test only asks for the limit supremum to be greater than one. If we try to apply the ratio test to the series $\sum \frac{1}{n}$, we find that it falls under neither (a) nor (b), since the limit supremum of $|\frac{n}{n+1}|$ is 1 but the ratio itself is never at least 1.

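Example 10.5. This example will show that the ratio test is extremely blunt compared with the root test. Take the series:

\[ \frac{1}{2} + \frac{1}{3} + \frac{1}{2^2} + \frac{1}{3^2} + \frac{1}{2^3} + \frac{1}{3^3} + \cdots \]

and note that (indexing so that $a_2 = \frac{1}{2}$, i.e. $a_{2n} = \frac{1}{2^n}$ and $a_{2n+1} = \frac{1}{3^n}$):

\[ \limsup \sqrt[n]{a_n} = \lim \sqrt[2n]{\frac{1}{2^n}} = \frac{1}{\sqrt{2}} \]

whereas

\[ \limsup \left|\frac{a_{n+1}}{a_n}\right| = \infty \]

since we always have consecutive terms of the form $\frac{1}{3^k}, \frac{1}{2^{k+1}}$, whose ratio $\frac{1}{2}\left(\frac{3}{2}\right)^k$ is unbounded. So the root test gives convergence, while the ratio test is inconclusive.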
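The contrast in this example can be checked numerically. The following is a minimal sketch, assuming the indexing $a_{2k} = 1/2^k$, $a_{2k+1} = 1/3^k$ starting at $n = 2$; the helper names `term`, `nth_root` and `ratio` are made up for illustration.

```python
import math

def term(n):
    """n-th term of 1/2 + 1/3 + 1/2^2 + 1/3^2 + ..., indexed from n = 2
    so that a_{2k} = 1/2^k and a_{2k+1} = 1/3^k (an indexing assumption)."""
    k = n // 2
    return 0.5 ** k if n % 2 == 0 else (1.0 / 3.0) ** k

def nth_root(n):
    # The quantity appearing in the root test.
    return term(n) ** (1.0 / n)

def ratio(n):
    # The quantity appearing in the ratio test.
    return term(n + 1) / term(n)

roots = [nth_root(n) for n in range(2, 200)]
ratios = [ratio(n) for n in range(2, 200)]

# The roots stay at or below 1/sqrt(2), while the ratios are both tiny and huge.
tail_root_sup = max(roots[-50:])
largest_ratio = max(ratios)
smallest_ratio = min(ratios)
```

A finite sample can only suggest a lim sup, but here the even-indexed roots equal $1/\sqrt{2}$ exactly, and the ratios along odd indices grow like $\frac{1}{2}(3/2)^k$.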


Example 10.6. This example is a "rearrangement" of a convergent series

\[ \frac{1}{2} + 1 + \frac{1}{8} + \frac{1}{4} + \frac{1}{32} + \frac{1}{16} + \cdots \]

obtained by swapping pairs of terms. Notice that:

\[ \limsup \sqrt[n]{a_n} = \frac{1}{2} \]

but

\[ \limsup \left|\frac{a_{n+1}}{a_n}\right| = 2. \]

Notice that:

\[ \liminf \left|\frac{a_{n+1}}{a_n}\right| = \frac{1}{8} \]

and that the geometric mean of $2$ and $\frac{1}{8}$ is $\frac{1}{2}$. In some really vague sense, the ratio test is too sensitive to the ordering of the sequence.

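Proposition 10.7 (The root test is stronger). For any sequence $a_n$ of positive numbers,

\[ \limsup \sqrt[n]{a_n} \le \limsup \frac{a_{n+1}}{a_n}. \]

Proof. Choose some $\beta > \limsup \frac{a_{n+1}}{a_n}$ (if the right hand side is infinite there is nothing to prove). There is an $N$ such that $n \ge N$ implies that $\frac{a_{n+1}}{a_n} < \beta$. Thus one finds that $a_n < \beta^{n-N} a_N$ for $n > N$. Taking $n$th roots, one obtains that $\sqrt[n]{a_n} < \beta \sqrt[n]{\beta^{-N} a_N}$. Note that on the right hand side $\beta^{-N} a_N$ is a positive constant, so its $n$th root tends to 1, and taking lim sup we find that $\limsup \sqrt[n]{a_n} \le \beta$. Since this is true for every $\beta > \limsup \frac{a_{n+1}}{a_n}$, one obtains the result.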

10.2 Summation by parts, alternating series

Theorem 10.8 (Summation by parts). Let $a_n$ and $b_n$ be two sequences, and write $A_n = \sum_{i=0}^{n} a_i$ for $n \ge 0$ and $A_{-1} = 0$. Then for $0 \le m \le n$ one has:

\[ \sum_{i=m}^{n} a_i b_i = \sum_{i=m}^{n-1} A_i (b_i - b_{i+1}) + A_n b_n - A_{m-1} b_m \]

Proof. Write $a_i = A_i - A_{i-1}$ and reindex the sum.
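The identity is easy to sanity-check on concrete sequences. Below is a small sketch using exact rational arithmetic; the function name `summation_by_parts_sides` and the sample sequences are illustrative choices, not part of the notes.

```python
from fractions import Fraction

def summation_by_parts_sides(a, b, m, n):
    """Return (lhs, rhs) of the summation-by-parts identity for 0 <= m <= n."""
    # Partial sums A_k = a_0 + ... + a_k, with the convention A_{-1} = 0.
    A = []
    running = Fraction(0)
    for x in a:
        running += x
        A.append(running)
    A_prev = A[m - 1] if m >= 1 else Fraction(0)

    lhs = sum(a[i] * b[i] for i in range(m, n + 1))
    rhs = (sum(A[i] * (b[i] - b[i + 1]) for i in range(m, n))
           + A[n] * b[n] - A_prev * b[m])
    return lhs, rhs

# Sample data: a_i = (-1)^i (bounded partial sums), b_i = 1/(i+1) (decreasing to 0).
a = [Fraction((-1) ** i) for i in range(10)]
b = [Fraction(1, i + 1) for i in range(10)]
lhs, rhs = summation_by_parts_sides(a, b, 2, 8)
```

Using `Fraction` avoids floating-point rounding, so the two sides can be compared with exact equality.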

Proposition 10.9. If,

(a) the partial sums $A_n$ of $a_n$ are bounded,

(b) $b_n$ is monotonically decreasing,

(c) $b_n$ converges to zero,

then $\sum a_n b_n$ converges.

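Proof. Suppose $|A_n| < M$ for all $n$, i.e. $M$ is a bound. We want to use the Cauchy criterion, i.e. we want to make the following expression $< \varepsilon$:

\[ \left|\sum_{i=m}^{n} a_i b_i\right| = \left|\sum_{i=m}^{n-1} A_i (b_i - b_{i+1}) + A_n b_n - A_{m-1} b_m\right| \le M \left(\sum_{i=m}^{n-1} |b_i - b_{i+1}| + |b_n| + |b_m|\right) \]

Since $b_i - b_{i+1} \ge 0$ and $b_i \ge 0$, this is:

\[ M \left(\sum_{i=m}^{n-1} (b_i - b_{i+1}) + b_n + b_m\right) = 2 M b_m \]

Now, by convergence of $b_n$ to zero, for any $\varepsilon > 0$, we can find an $N$ such that $m > N$ implies that $b_m < \frac{\varepsilon}{2M}$, completing the proof.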

Proposition 10.10 (Alternating series test). A sequence is alternating if its signs alternate, i.e. if $a_i \ge 0$ then $a_{i+1} \le 0$ and $a_{i+2} \ge 0$, et cetera. Let $c_n$ be an alternating sequence such that $|c_n|$ is monotonically decreasing and $c_n$ converges to 0. Then, $\sum c_n$ converges.

Proof. Apply the previous proposition with $a_n = (-1)^n$ and $b_n = |c_n|$; up to an overall sign, $c_n = a_n b_n$, and the partial sums of $(-1)^n$ are bounded.

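Example 10.11 (Why monotonically decreasing). Otherwise, one could make the negative terms converge much faster than the positive terms. For example, take $a_{2n} = \frac{1}{n}$ and $a_{2n+1} = -\frac{1}{2^n}$. One can check by regrouping:

\[ 1 - \frac{1}{2} + \frac{1}{2} - \frac{1}{4} + \left(\frac{1}{3} - \frac{1}{8} + \frac{1}{4} - \frac{1}{16}\right) + \cdots \]

into blocks of doubling length (as in the usual proof that the harmonic series diverges) that the groups are all $\ge \frac{1}{4}$: in the block with $2^k < n \le 2^{k+1}$ the positive terms sum to at least $\frac{1}{2}$, while the negative terms sum to less than $\frac{1}{4}$ in absolute value. Hence the series diverges, even though its terms alternate in sign and tend to zero.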

10.3 Absolute convergence, multiplying and rearranging series

Definition 10.12. A series $\sum a_n$ converges absolutely if $\sum |a_n|$ converges. It is easy to see (by the Cauchy criterion and the triangle inequality) that an absolutely convergent series is convergent. A series that converges but not absolutely is called conditionally convergent.

Remark 10.13. The root test and the ratio test both use $|a_n|$ rather than $a_n$, and thus are really testing for absolute convergence. Summation by parts does handle some non-absolutely convergent series.

Example 10.14 (A conditionally convergent series). By the alternating series test, $\sum (-1)^n \frac{1}{n}$ converges, but $\sum \frac{1}{n}$ does not.

Definition 10.15. For series $\sum a_n$ and $\sum b_n$, define their product, which we will write by informal convention as $\sum c_n$, by

\[ c_n = \sum_{i+j=n} a_i b_j = \sum_{k=0}^{n} a_k b_{n-k} \]

This definition is motivated by imagining products of power series in the formal variable $x$:

\[ \left(\sum a_n x^n\right)\left(\sum b_n x^n\right) = \sum_{n=0}^{\infty} \sum_{i+j=n} a_i b_j x^n \]

Remark 10.16. Notice that we can't apply sequence techniques to conclude that $AB = C$ (the infinite sums), since $A_n B_n \ne C_n$ (the partial sums).

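Example 10.17. Take $a_n = b_n = \frac{(-1)^n}{\sqrt{n}}$ for $n \ge 1$. These both converge by the alternating series test. However, the product has:

\[ c_n = (-1)^n \sum_{i+j=n} \frac{1}{\sqrt{ij}}, \]

where the sum runs over $i, j \ge 1$. Now, notice that $ij$ is maximized when $i = j = \frac{n}{2}$, so every summand is at least $\frac{1}{\sqrt{n^2/4}}$. Thus we have that

\[ |c_n| \ge \frac{n-2}{\sqrt{n^2/4}} = 2 - \frac{4}{n} \]

and thus $c_n$ does not converge to 0, so the series cannot converge.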
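The failure of the product series can also be observed numerically. This is a rough sketch, assuming the indexing $i, j \ge 1$; `cauchy_term` is a name invented here.

```python
import math

def cauchy_term(n):
    """c_n = (-1)^n * sum over i + j = n with i, j >= 1 of 1/sqrt(i*j)."""
    return (-1) ** n * sum(1.0 / math.sqrt(i * (n - i)) for i in range(1, n))

# The terms alternate in sign but their absolute values do not shrink to 0.
values = [cauchy_term(n) for n in range(2, 400)]
smallest_abs_tail = min(abs(c) for c in values[-100:])
```

Each $|c_n|$ respects the lower bound $2 - 4/n$ from the example, so the terms of the product series stay bounded away from zero.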

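Proposition 10.18. Using the previous notation, if $\sum a_n$ and $\sum b_n$ both converge and one of them converges absolutely, then $AB = C$.

Proof. Without loss of generality, suppose $\sum a_n$ converges absolutely. For shorthand, write $\beta_n = B_n - B$. We can write

\[ C_n = a_0 B_n + a_1 B_{n-1} + \cdots + a_n B_0 = a_0 (B + \beta_n) + a_1 (B + \beta_{n-1}) + \cdots + a_n (B + \beta_0) \]

\[ = A_n B + a_0 \beta_n + a_1 \beta_{n-1} + \cdots + a_n \beta_0 \]

To prove the result, we need to show that the "tail:"

\[ \gamma_n = \sum_{i+j=n} a_i \beta_j = a_0 \beta_n + a_1 \beta_{n-1} + \cdots + a_n \beta_0 \]

converges to zero.

Since $\sum a_n$ converges absolutely, let $\alpha = \sum |a_n|$. Since $\beta_n$ converges to 0, let $N$ be such that $n \ge N$ implies that $|\beta_n| < \varepsilon$. Then, we can break up the sum into two groups using the triangle inequality:

\[ |\gamma_n| \le |\beta_0 a_n + \cdots + \beta_N a_{n-N}| + |\beta_{N+1} a_{n-N-1} + \cdots + \beta_n a_0| \]

\[ \le |\beta_0 a_n + \cdots + \beta_N a_{n-N}| + \alpha \varepsilon \]

For fixed $N$ the first term tends to 0 as $n \to \infty$, since $a_n \to 0$. Taking lim sup of both sides, we find that $\limsup |\gamma_n| \le \varepsilon \alpha$ (we can't take the limit because we don't know it exists). Since $\varepsilon$ was chosen arbitrarily, it must follow that $\lim \gamma_n = 0$, and the result follows.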

Definition 10.19. Let $\sigma : \mathbb{N} \to \mathbb{N}$ be an injective and surjective function. Then the rearrangement of the sequence $\{a_n\}$ corresponding to $\sigma$ is given by:

\[ (a^\sigma)_k = a_{\sigma(k)} \]

Note that I do not think this notation is standard.

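Example 10.20. Take $a_n = \frac{(-1)^{n+1}}{n}$, so that $\sum a_n = 1 - \frac{1}{2} + \frac{1}{3} - \cdots$. The series converges by the alternating series test, to a number $A < 1 - \frac{1}{2} + \frac{1}{3} = \frac{5}{6}$.

Now, we can rearrange this series so that it has two positive numbers followed by one negative:

\[ \left[1 + \frac{1}{3} - \frac{1}{2}\right] + \left[\frac{1}{5} + \frac{1}{7} - \frac{1}{4}\right] + \left[\frac{1}{9} + \frac{1}{11} - \frac{1}{6}\right] + \cdots \]

(the brackets are only for visual ease). Each three term group is:

\[ \frac{1}{4k-3} + \frac{1}{4k-1} - \frac{1}{2k} \]

and one can verify that this is positive when $k > \frac{3}{8}$, i.e. for every $k \ge 1$. Thus the partial sums taken group by group are monotonically increasing, and since the first group already equals $\frac{5}{6}$, the sum is greater than $\frac{5}{6}$ from the second group on. Thus the two cannot converge to the same number.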
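A quick numerical comparison makes the discrepancy visible. The sketch below sums the original series and the "two positive, one negative" rearrangement; the function names and cutoffs are arbitrary choices for illustration. The reference values $\ln 2$ and $\frac{3}{2}\ln 2$ are the standard limits of these two series and are not derived in these notes.

```python
import math

def alternating_harmonic(num_terms):
    # Partial sum of 1 - 1/2 + 1/3 - ... in the original order.
    return sum((-1) ** (n + 1) / n for n in range(1, num_terms + 1))

def rearranged(num_groups):
    # Partial sum of the rearrangement, taken one three-term group at a time.
    total = 0.0
    for k in range(1, num_groups + 1):
        total += 1.0 / (4 * k - 3) + 1.0 / (4 * k - 1) - 1.0 / (2 * k)
    return total

original = alternating_harmonic(60000)   # close to ln 2
shuffled = rearranged(20000)             # close to (3/2) ln 2, above 5/6
```

Both partial sums use 60000 terms of the same underlying sequence, yet they settle near different values.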

Proposition 10.21. Suppose $\sigma$ fixes all but finitely many numbers, i.e. $\sigma(n) = n$ for all but finitely many $n$. Then the series $\sum a^\sigma_n$ and $\sum a_n$ have the same convergence behavior, and the same sum when they converge.

Proof. This is left as an exercise.

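Proposition 10.22. If $\sum a_n$ converges absolutely, then every rearrangement converges to the same number.

Proof. Let $\sum a'_n$ be the rearrangement, with partial sums $A'_n$. For $\varepsilon > 0$, choose $N$ such that for $n \ge m > N$, one has that $\sum_{i=m}^{n} |a_i| < \varepsilon$. So, we need to "take care" of the terms $a_1, \ldots, a_N$.

Now, choose $M \ge N$ such that $a'_1, \ldots, a'_M$ contains all the $a_1, \ldots, a_N$ terms. Then, for $n > M$, in the quantity $|A_n - A'_n|$, the first $N$ terms of the original series appear in both $A_n$ and $A'_n$ and cancel. What remains are finitely many terms $a_i$ with $i > N$, each appearing at most once from $A_n$ and at most once from $A'_n$. Using the triangle inequality, this quantity is $< 2\varepsilon$.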

Proposition 10.23. Suppose $\sum a_n$ converges conditionally. Let $\alpha \le \beta$. Then there is a rearrangement of $a_n$ whose partial sums have $\liminf = \alpha$ and $\limsup = \beta$. One can take $\alpha, \beta$ to be infinite.

Proof. I will skip this. Refer to Rudin.


11 Lecture 11

11.1 Limits of functions

Definition 11.1. Let $X, Y$ be metric spaces, $E \subset X$, $f : E \to Y$ a function, and $p$ a limit point of $E$. We write the limit

\[ \lim_{x \to p} f(x) = q \]

if for every $\varepsilon > 0$ there exists a $\delta > 0$ such that $x \in E \cap (B_\delta(p) - \{p\})$ implies that $f(x) \in B_\varepsilon(q)$. Note that it is not relevant whether or not $p \in E$. Also, I will use the same symbols for this set-up throughout this section.

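Example 11.2 (Sequences). We can realize $\overline{\mathbb{N}} = \mathbb{N} \cup \{\infty\}$ as the subset $\{\frac{1}{n} \mid n \in \mathbb{N}\} \cup \{0\} \subset \mathbb{R}$. Here, $\infty$ is the only limit point, and the definitions agree.

Example 11.3 (One-sided limits in $\mathbb{R}$). In calculus sometimes we take one-sided limits, i.e. $\lim_{x \to 0^+} \lfloor x \rfloor = 0$ but $\lim_{x \to 0^-} \lfloor x \rfloor = -1$. Here, the subset $E$ is $(0, \infty)$ and $(-\infty, 0)$ respectively.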

Proposition 11.4. One has $\lim_{x \to p} f(x) = q$ if and only if for every sequence $\{p_n\}$ in $E$ such that $p_n \ne p$ (for all $n$) and $p_n \to p$, one has $\lim_{n \to \infty} f(p_n) = q$.

Proof. Suppose that $\lim_{x \to p} f(x) = q$. Then, for every $\varepsilon > 0$ one can find a $\delta > 0$ such that $x \in B_\delta(p) - \{p\}$ implies that $f(x) \in B_\varepsilon(q)$. Any sequence $p_n$ converging to $p$ and such that $p_n \ne p$ has an $N$ such that $p_n \in B_\delta(p) - \{p\}$ for $n > N$, and so $f(p_n) \in B_\varepsilon(q)$. Thus we have shown that for every $\varepsilon$ there is an $N$ such that $f(p_n) \in B_\varepsilon(q)$ for $n > N$.

If one did not have that $\lim_{x \to p} f(x) = q$, then there is some $\varepsilon > 0$ for which one could find points arbitrarily close to $p$ (i.e. for every $\delta$) whose images lie outside $B_\varepsilon(q)$. Fix this $\varepsilon$; choose $p_n \in B_{1/n}(p) - \{p\}$ such that $f(p_n) \notin B_\varepsilon(q)$. This is a sequence converging to $p$ such that $f(p_n)$ does not converge to $q$.

Corollary 11.5. Limits of functions are unique, if they exist.

Proposition 11.6. Let $f, g$ be real-valued with $\lim_{x \to p} f(x) = A$ and $\lim_{x \to p} g(x) = B$. Then,

(a) $\lim_{x \to p} (f + g)(x) = A + B$,

(b) $\lim_{x \to p} (fg)(x) = AB$,

(c) $\lim_{x \to p} \frac{f}{g}(x) = A/B$ if $B \ne 0$.

Proof. Follows from the result in sequences and the previous proposition.

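Example 11.7. We can check explicitly: $\lim_{x \to a} x = a$, i.e. take $f(x) = x$. Fix an $\varepsilon > 0$; we are concerned with the interval $(a - \varepsilon, a + \varepsilon)$. We want to show there is $\delta$ such that $x \in (a - \delta, a + \delta)$ implies that $f(x) \in (a - \varepsilon, a + \varepsilon)$. But we can just take $\delta := \varepsilon$!

Example 11.8. Define $f(x)$ to be 0 if $x \in \mathbb{Q}$ and 1 if $x \notin \mathbb{Q}$. The limit $\lim_{x \to 0} f(x)$ does not exist, since in every interval $(-\varepsilon, \varepsilon)$ there are points $x \ne 0$ such that $|f(x) - f(0)| = 1$ and points $x \ne 0$ such that $|f(x) - f(0)| = 0$.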

11.2 Continuity

Definition 11.9. Let $f : X \to Y$ be a function, and $U \subset Y$ a subset. Then define the inverse image:

\[ f^{-1}(U) := \{x \in X \mid f(x) \in U\} \]

Let $V \subset X$ be a subset, and define the image

\[ f(V) := \{f(x) \in Y \mid x \in V\} \]

Definition 11.10. Let $X$ and $Y$ be metric spaces. A function $f : X \to Y$ is continuous at $p \in X$ if the following equivalent conditions hold:

(a) $\lim_{x \to p} f(x) = f(p)$ (when $p$ is a limit point of $X$),

(b) for every $\varepsilon > 0$, there is a $\delta > 0$ such that $f(x) \in B_\varepsilon(f(p))$ for all $x \in B_\delta(p)$,

(c) for every $\varepsilon > 0$, there is a $\delta > 0$ such that $f(B_\delta(p)) \subset B_\varepsilon(f(p))$.


Definition 11.11. Let $X$ and $Y$ be metric spaces. A function $f : X \to Y$ is continuous if the following equivalent conditions hold:

(a) for every $p \in X$, $\lim_{x \to p} f(x) = f(p)$,

(b) for every $p \in X$ and every $\varepsilon > 0$, there is a $\delta > 0$ such that $f(x) \in B_\varepsilon(f(p))$ for all $x \in B_\delta(p)$,

(c) for every open subset $U \subset Y$, $f^{-1}(U)$ is open,

(d) for every closed subset $Z \subset Y$, $f^{-1}(Z)$ is closed.

Proof. The equivalence of (c) and (d) is a set-theoretic exercise, knowing that $U$ is open if and only if $U^c$ is closed. The equivalence of (a) and (b) is simply untangling the definition. I will show that (b) and (c) are equivalent.

Assume (b). Let $U \subset Y$ be open, and let $p \in f^{-1}(U)$. We want to show that $p$ is an interior point. Since $f(p) \in U$ it is an interior point, so choose $\varepsilon$ such that $B_\varepsilon(f(p)) \subset U$. Then there is a $\delta$ such that $f(B_\delta(p)) \subset B_\varepsilon(f(p)) \subset U$ and thus $B_\delta(p) \subset f^{-1}(U)$. Now assume (c). Then, $f^{-1}(B_\varepsilon(f(p)))$ is open and contains $p$, so $p$ is an interior point, so there is some $B_\delta(p) \subset f^{-1}(B_\varepsilon(f(p)))$.

Example 11.12. If $X$ carries the discrete metric, then every function $f : X \to Y$ is continuous.

Remark 11.13. Note that it is not true that if $f$ is continuous at $x$, then it is continuous in a neighborhood of $x$. As a counterexample, consider the function $f : \mathbb{R} \to \mathbb{R}$ such that $f(x) = 0$ if $x \notin \mathbb{Q}$ and $f(x) = x$ if $x \in \mathbb{Q}$. Check that $f$ is continuous only at 0.

Proposition 11.14 (Composition of continuous functions). Let $f : X \to Y$ and $g : Y \to Z$. If $f$ is continuous at $p \in X$ and $g$ is continuous at $f(p) \in Y$, then $g \circ f$ is continuous at $p$.

Proof. This proof is just unwinding a lot of definitions. For any $\varepsilon > 0$ we want to show that there is a $\delta > 0$ such that $g(f(B_\delta(p))) \subset B_\varepsilon(g(f(p)))$. We know that we can find a $\delta' > 0$ such that $g(B_{\delta'}(f(p))) \subset B_\varepsilon(g(f(p)))$. Using $\delta'$ as the "$\varepsilon$" for $f$, we then find a $\delta$ such that $f(B_\delta(p)) \subset B_{\delta'}(f(p))$.

Corollary 11.15. Let $f : X \to Y$ and $E \subset X$. Suppose $L = \lim_{x \to a} f(x)$ exists. If $g : f(X) \cup \{L\} \to Z$ is continuous at $L$, then

\[ \lim_{x \to a} g(f(x)) = g(L). \]

Proposition 11.16 (Algebraic operations on continuous functions in $\mathbb{R}$). If $f$ and $g$ are continuous then $f + g$, $fg$, $f/g$ are continuous (assume $g(x) \ne 0$ for the last one).

12 Lecture 12

12.1 Properties of continuity

Proposition 12.1. Let $X$ be compact, and $f : X \to Y$ a continuous function. Then the image, $f(X)$, is compact.

Proof. Take an open cover of $f(X)$ and pull it back using $f^{-1}$. Since $f$ is continuous, this is an open cover of $X$. It has a finite subcover since $X$ is compact. The corresponding subcollection in the original cover of $f(X)$ is thus a finite subcover.

Remark 12.2. Note that, for continuous $f$, it is not true that $f(U)$ is open for $U$ open. For example, let $f : \mathbb{R} \to \mathbb{R}^2$ send $x \mapsto (x, 0)$. Let $U$ be any open interval; the image is not open in $\mathbb{R}^2$. It is also not true that $f(Z)$ is closed for $Z$ closed. For example, let $f : \mathbb{R} \to \mathbb{R}$ by $f(x) = e^x$, and consider the image $f((-\infty, a]) = (0, e^a]$ for any $a$. It is also not true that $f(E)$ is bounded for $E$ bounded. Let $f : \mathbb{R} - \{0\} \to \mathbb{R}$ be defined by $f(x) = \frac{1}{x}$, and consider $f((0, 1)) = (1, \infty)$. Nor is it true that $f^{-1}(E)$ is bounded for $E$ bounded. Let $f : \mathbb{R} \to \mathbb{R}$ be a constant function, say $f(x) = 0$. Then $f^{-1}(\{0\}) = \mathbb{R}$ is not bounded.

Theorem 12.3 (Maximum value theorem). Let $[a, b]$ be any nonempty closed interval. Suppose that $f : [a, b] \to \mathbb{R}$ is a continuous function of real numbers. Then, $f$ attains maximum and minimum values on $[a, b]$.

Proof. This follows immediately; $f([a, b])$ is compact by Proposition 12.1, hence closed and bounded, so it contains its supremum and infimum.

In the homework, the student explored connectedness. Recall the definition.


Definition 12.4. Two subsets $A, B$ of $X$ are separated if $\overline{A} \cap B = A \cap \overline{B} = \emptyset$. A subset $E \subset X$ is connected if it is not the union of two nonempty separated sets.

Proposition 12.5. A subset $E$ of $\mathbb{R}$ is connected if and only if for all pairs $x, y \in E$ with $x < y$, one has $(x, y) \subset E$.

Proof. If the condition is false then one can take $z \in (x, y)$ with $z \notin E$, and one has $E \cap (-\infty, z)$ and $E \cap (z, \infty)$ separating $E$. Conversely, if $E$ is not connected, take nonempty separated sets $A$ and $B$ covering $E$. Choose $x \in A$ and $y \in B$, say with $x < y$, and take $z = \sup(A \cap [x, y])$. Since $z \in \overline{A}$, $z \notin B$. If $z \notin A$ then $z \in (x, y)$ and $z \notin E$, which breaks the condition. If $z \in A$ then $z \notin \overline{B}$, so one can find $z' \in (z, y)$ such that $z' \notin B$, but also $z' \notin A$, so $z' \notin E$, breaking the condition.

Proposition 12.6. Let $E \subset X$ be connected, and $f : X \to Y$ continuous. Then $f(E)$ is connected.

Proof. Suppose $f(E)$ is not connected; then it has separating sets $A$ and $B$. We claim that $E \cap f^{-1}(A)$ and $E \cap f^{-1}(B)$ are separating sets for $E$. They are clearly nonempty and cover $E$. Using two facts: (1) $\overline{f^{-1}(A)} \subset f^{-1}(\overline{A})$ and (2) $f^{-1}(A \cap B) = f^{-1}(A) \cap f^{-1}(B)$, one has that $\overline{f^{-1}(A)} \cap f^{-1}(B) \subset f^{-1}(\overline{A}) \cap f^{-1}(B) = f^{-1}(\overline{A} \cap B) = \emptyset$, and symmetrically with $A$ and $B$ exchanged. Check the two facts as an exercise.

Theorem 12.7 (Intermediate value theorem). Let $f : [a, b] \to \mathbb{R}$ be continuous. If $f(a) < f(b)$, and $c \in (f(a), f(b))$, then there is some $x \in (a, b)$ such that $f(x) = c$.

Proof. Apply the previous two propositions.

Corollary 12.8 (Fixed point theorem for $\mathbb{R}$). Let $f : [0, 1] \to [0, 1]$ be a continuous map. Then $f$ has a fixed point, i.e. some $p$ such that $f(p) = p$.

Proof. Apply the IVT to $g(x) = f(x) - x$. Since $g(0) = f(0) \ge 0$ and $g(1) = f(1) - 1 \le 0$, one has some $p$ such that $g(p) = 0$.
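The sign change of $g(x) = f(x) - x$ on $[0, 1]$ also gives a practical way to locate a fixed point. The following is a sketch of bisection applied to $g$; bisection is a standard numerical technique brought in here for illustration, not something the notes use, and `fixed_point_by_bisection` is a made-up name.

```python
import math

def fixed_point_by_bisection(f, tol=1e-12):
    """Approximate some p in [0, 1] with f(p) = p, for continuous f: [0, 1] -> [0, 1]."""
    def g(x):
        return f(x) - x

    lo, hi = 0.0, 1.0
    # Invariant maintained below: g(lo) >= 0 >= g(hi), as in the proof.
    if g(lo) == 0:
        return lo
    if g(hi) == 0:
        return hi
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if g(mid) >= 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# cos maps [0, 1] into [cos 1, 1], which lies inside [0, 1].
p = fixed_point_by_bisection(math.cos)
```

The IVT guarantees a zero of $g$ in every bracket `[lo, hi]` the loop produces, which is why the midpoint of the final bracket is close to a fixed point.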

Proposition 12.9. Let $f$ be a one-to-one continuous function on an interval $I$, i.e. $f : I \to \mathbb{R}$. Then $f$ is either strictly increasing or strictly decreasing. Recall that $f : I \to \mathbb{R}$ is strictly increasing if for all $a < b$ in $I$, one has that $f(a) < f(b)$.

Proof. First, we will show the result for a closed interval $I = [a, b]$. Suppose that $f(a) < f(b)$ (for $f(b) < f(a)$ the proof is the same). We want to show that every $c \in (a, b)$ has $f(c) \in (f(a), f(b))$. If not, then if $f(c) < f(a)$, then $f(a) \in (f(c), f(b))$, so there is some $p \in (c, b)$ such that $f(p) = f(a)$ by the IVT, contradicting injectivity. If $f(c) > f(b)$ a similar argument applies. Thus, $f(c) \in (f(a), f(b))$.

Now, if the interval were open, say $I = (a, b)$, then one could take a closed subinterval $[a', b'] \subset (a, b)$ and extend the argument as follows. Let $c \in (a, a']$. We wish to show that $f(c) < f(a')$. If not, i.e. $f(c) > f(a')$, then choose $p$ such that $f(p) \in (f(a'), f(c))$. Then apply the IVT to contradict injectivity, and this completes the proof.

13 Lecture 13

13.1 Uniform continuity

Definition 13.1. Let $f : X \to Y$ be a function on metric spaces. We say that $f$ is uniformly continuous on $X$ if for every $\varepsilon > 0$ there exists $\delta > 0$ such that

\[ d(f(x), f(x')) < \varepsilon \]

for all $x, x'$ such that $d(x, x') < \delta$.

Remark 13.2. Unlike the notion of continuity, which has a purely topological characterization (i.e. in terms of open sets only), uniform continuity requires the use of a metric. There are ways around this (e.g. uniform spaces) but we won't get into that here.


Example 13.3. Consider $f(x) = \frac{1}{x}$. We claim that $f(x)$ is uniformly continuous on $[a, \infty)$ for any $a > 0$. Fix $\varepsilon > 0$; one can show, because $f(x)$ is monotonically decreasing, that for fixed $x$, $d(f(x), f(x'))$ is maximized when $x' = x + \delta$ or $x' = x - \delta$. Then,

\[ |f(x) - f(x + \delta)| = \left|\frac{1}{x} - \frac{1}{x + \delta}\right| = \frac{\delta}{x(x + \delta)}. \]

This is maximized when $x$ is minimal, i.e. at $x = a$, and takes value $\frac{\delta}{a^2 + \delta a}$. Thus, for $\varepsilon > 0$ we want to find a $\delta$ such that

\[ \frac{\delta}{a^2 + \delta a} < \varepsilon \]

Check that

\[ \delta < \frac{\varepsilon a^2}{1 - a\varepsilon} \]

are all good choices. One has to check that the right hand side is positive, which is true as long as $\varepsilon < \frac{1}{a}$. However for larger $\varepsilon$ check that in fact any $\delta$ will do.

However, $f(x) = \frac{1}{x}$ is not uniformly continuous on the interval $(0, \infty)$. Fix $\varepsilon > 0$. I claim that we can find two $x, x'$ arbitrarily close to each other such that $d(f(x), f(x')) > \varepsilon$. Note that for $x = \frac{1}{n}$ and $x' = \frac{1}{2n}$ one has that $|f(x) - f(x')| = n$, while $|x - x'| = \frac{1}{2n}$. For $n$ large enough, one can make $|x - x'|$ arbitrarily small, and at the same time make $n$ arbitrarily large.
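Both halves of this example can be tried out numerically. The sketch below picks a $\delta$ for given $\varepsilon$ and $a$ on $[a, \infty)$, and exhibits close points near 0 with far-apart images. The function names are invented for illustration, and the pair $x = 1/n$, $x' = 1/(2n)$ is a choice made for this sketch.

```python
def delta_for(eps, a):
    """A delta that works for f(x) = 1/x on [a, infinity), with a > 0."""
    if eps >= 1.0 / a:
        return 1.0  # in this range any positive delta works; 1.0 is arbitrary
    # Half of the threshold eps*a^2 / (1 - a*eps), to stay strictly below it.
    return 0.5 * eps * a * a / (1.0 - a * eps)

def worst_gap(delta, a):
    # Largest value of |f(x) - f(x + delta)| over x >= a, attained at x = a.
    return delta / (a * (a + delta))

def spread_near_zero(n):
    # On (0, infinity): the points 1/n and 1/(2n) are close, their images are not.
    x, x_prime = 1.0 / n, 1.0 / (2 * n)
    return abs(x - x_prime), abs(1.0 / x - 1.0 / x_prime)

distance, image_gap = spread_near_zero(1000)
```

For every tested $\varepsilon$ the chosen $\delta$ keeps the worst-case gap below $\varepsilon$ on $[a, \infty)$, whereas near 0 the distance shrinks like $\frac{1}{2n}$ while the image gap grows like $n$.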

Proposition 13.4. Let $X$ be compact and $f : X \to Y$ be continuous. Then $f$ is uniformly continuous.

Proof. Fix $\varepsilon > 0$. By continuity of $f$, for each $x \in X$ choose $\delta_x > 0$ such that $f(B_{\delta_x}(x)) \subset B_{\varepsilon/2}(f(x))$. Consider the open cover:

\[ \{J_x := B_{\delta_x/2}(x)\}_{x \in X} \]

This has a finite subcover by compactness. Let that finite subcover be indexed by a finite set $I \subset X$; then take $\delta = \frac{1}{2} \min(\delta_x \mid x \in I)$.

Now, we claim that for $p, q \in X$, $d(p, q) < \delta$ implies that $d(f(p), f(q)) < \varepsilon$. Suppose $p$ lies in the ball $J_x$ for some $x \in I$, so that $d(p, x) < \frac{\delta_x}{2}$. Since $d(p, q) < \delta \le \frac{\delta_x}{2}$, the triangle inequality gives $d(q, x) < \delta_x$, so both $p$ and $q$ lie in $B_{\delta_x}(x)$. Then,

\[ d(f(p), f(q)) \le d(f(p), f(x)) + d(f(x), f(q)) < \varepsilon \]

as desired.

Proposition 13.5. Let $f : X \to Y$ be uniformly continuous. Let $s_n$ be a Cauchy sequence in $X$. Then $f(s_n)$ is a Cauchy sequence in $Y$.

Remark 13.6. Note that this is untrue for continuous functions. For continuous functions, we must insist that $s_n$ is convergent in $X$.

Proof. Fix $\varepsilon > 0$. We want to find an $N$ such that $m, n > N$ implies that $d(f(s_n), f(s_m)) < \varepsilon$. By uniform continuity there is a $\delta$ such that $d(s_n, s_m) < \delta$ implies this, and by Cauchy-ness of $s_n$ there is an $N$ that guarantees $d(s_n, s_m) < \delta$ for $m, n > N$.

13.2 The derivative

Definition 13.7. Let $f : [a, b] \to \mathbb{R}$ be a real valued function. Define the derivative:

\[ f'(x) = \lim_{t \to x} \frac{f(t) - f(x)}{t - x} \]

Note that this limit may not exist at every point. If $f'(x)$ exists at $x$ we say $f$ is differentiable at $x$.

Proposition 13.8. If f is differentiable at x, then f is continuous at x.


Proof. Using limit laws,

\[ \lim_{t \to x} \big(f(x) - f(t)\big) = \lim_{t \to x} \frac{f(x) - f(t)}{x - t} (x - t) = f'(x) \cdot 0 = 0 \]

Remark 13.9. Note that it is not true that if $f$ is differentiable at $x$, then $f$ is continuous in a neighborhood of $x$. For example, take

\[ f(x) = \begin{cases} x^2 & x \in \mathbb{Q} \\ -x^2 & x \notin \mathbb{Q} \end{cases} \]

This function is continuous only at zero, and differentiable only at zero.

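Example 13.10. Take

\[ f(x) = \begin{cases} x \sin\left(\frac{1}{x}\right) & x \ne 0 \\ 0 & x = 0 \end{cases} \]

We claim that the derivative does not exist at 0. To see this, note that

\[ \frac{f(x) - f(0)}{x} = \sin\left(\frac{1}{x}\right) \]

But this has no limit as $x \to 0$.

Example 13.11. Now take

\[ f(x) = \begin{cases} x^2 \sin\left(\frac{1}{x}\right) & x \ne 0 \\ 0 & x = 0 \end{cases} \]

We claim that $f'(0) = 0$. Indeed, the difference quotient at 0 is $x \sin\left(\frac{1}{x}\right)$, which is bounded in absolute value by $|x|$.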
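The two difference quotients at 0 behave very differently, which can be seen by sampling them along a sequence tending to 0. This is a small numerical sketch; the sample points $h_k = 1/(\pi/2 + k\pi)$ are chosen here so that $\sin(1/h_k)$ alternates between values near $+1$ and $-1$.

```python
import math

def quotient_x_sin(h):
    # (f(h) - f(0)) / h for f(x) = x sin(1/x), f(0) = 0.
    return math.sin(1.0 / h)

def quotient_x2_sin(h):
    # (f(h) - f(0)) / h for f(x) = x^2 sin(1/x), f(0) = 0.
    return h * math.sin(1.0 / h)

# Sample points h_k -> 0 at which sin(1/h) is alternately close to +1 and -1.
samples = [1.0 / (math.pi / 2 + k * math.pi) for k in range(1, 200)]
first = [quotient_x_sin(h) for h in samples]
second = [quotient_x2_sin(h) for h in samples]
```

The first list keeps oscillating between values near $\pm 1$, so it has no limit, while the second is squeezed to 0 by the bound $|h \sin(1/h)| \le |h|$.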

Proposition 13.12 (Arithmetic operations and the derivative). Let $f, g : [a, b] \to \mathbb{R}$ and suppose $f, g$ are differentiable at $x \in [a, b]$. Then the following are differentiable at $x$ and are given by:

(a) $(f + g)'(x) = f'(x) + g'(x)$

(b) $(fg)'(x) = f'(x) g(x) + f(x) g'(x)$

(c) $(f/g)'(x) = \frac{f'(x) g(x) - f(x) g'(x)}{g(x)^2}$, provided $g(x) \ne 0$

Proof. It is easy to prove (a) from the definition. We will prove (b).

\[ \frac{f(t) g(t) - f(x) g(x)}{t - x} = \frac{f(t)[g(t) - g(x)] + g(x)[f(t) - f(x)]}{t - x} = f(t) \frac{g(t) - g(x)}{t - x} + g(x) \frac{f(t) - f(x)}{t - x} \]

Taking the limit as $t \to x$, and using that $f(t) \to f(x)$ since $f$ is continuous at $x$, one has the result. See the textbook for the proof of (c).

Example 13.13. This allows us to differentiate all polynomials. It is easy to show directly that if $f(x) = 1$ then $f'(x) = 0$. It is also easy to show that if $g(x) = x$ then $g'(x) = 1$. Using the product rule, one has that $(x^n)' = x^{n-1} (x)' + (x^{n-1})' x = x^{n-1} + (x^{n-1})' x$, and one can use an argument by induction to show that $(x^n)' = n x^{n-1}$. Then one uses scaling and addition to get all polynomials.

Proposition 13.14 (Chain rule). Let $f : [a, b] \to \mathbb{R}$ be differentiable at $x \in [a, b]$, and $g : I \to \mathbb{R}$ where $f([a, b]) \subset I$, and $g$ is differentiable at $f(x)$. Then

\[ (g \circ f)'(x) = g'(f(x)) f'(x) \]

Remark 13.15. What one wants to do is write

\[ \frac{g(f(t)) - g(f(x))}{t - x} = \frac{g(f(t)) - g(f(x))}{f(t) - f(x)} \cdot \frac{f(t) - f(x)}{t - x} \]

and take the limit. The problem here is we cannot guarantee that $f(t) - f(x) \ne 0$ near $t = x$. For example, considering $f(x) = x^2 \sin\left(\frac{1}{x}\right)$ for $x \ne 0$ and $f(0) = 0$, one sees that $f$ has a zero other than 0 in $B_\delta(0)$ for every $\delta > 0$. So there are functions for which one cannot avoid this problem by taking a sufficiently small $\delta$.


Proof. Put $y = f(x)$. We can find functions $u$ and $v$ such that:

\[ f(t) - f(x) = (t - x)(f'(x) + u(t)) \]

\[ g(s) - g(y) = (s - y)(g'(y) + v(s)) \]

and $u(x) = 0$ and $v(y) = 0$. Note that $u$ and $v$ are continuous at $x$ and $y$ respectively, as

\[ \lim_{t \to x} \left(\frac{f(t) - f(x)}{t - x} - f'(x)\right) = 0 \]

and likewise for $v$.

Then,

\[ g(f(t)) - g(f(x)) = (f(t) - f(x))(g'(f(x)) + v(f(t))) = (t - x)(f'(x) + u(t))(g'(f(x)) + v(f(t))) \]

and for $t \ne x$, one has

\[ \frac{g(f(t)) - g(f(x))}{t - x} = (f'(x) + u(t))(g'(f(x)) + v(f(t))) \]

Taking the limit $t \to x$, and using that $f(t) \to f(x)$, one has the result.

14 Lecture 14

14.1 Mean value theorem

Definition 14.1. A point $p$ is a local maximum of $f$ if there is a $\delta > 0$ such that $f(p) = \max f(B_\delta(p))$. Likewise for local minimum.

Proposition 14.2. Let $f : [a, b] \to \mathbb{R}$. If $f$ has a local maximum or minimum at $p \in (a, b)$, and if $f'(p)$ exists, then $f'(p) = 0$.

Proof. Suppose $f$ has a local maximum at $p$. Then we have some $\delta > 0$ such that $f(p) \ge f(x)$ for $x \in (p - \delta, p + \delta)$. The derivative is

\[ \lim_{t \to p} \frac{f(t) - f(p)}{t - p} \]

Note that this expression is $\ge 0$ when $t < p$ and $\le 0$ when $t > p$. Thus if this limit exists it must be zero.

Remark 14.3. One consequence of this proposition is that to find the local minima or maxima of a function, one only needs to look at the endpoints and at points where $f'(x) = 0$ or does not exist.

Theorem 14.4 (Rolle's theorem). Let $f : [a, b] \to \mathbb{R}$ be continuous, and suppose $f$ is differentiable on $(a, b)$, and that $f(a) = f(b)$. Then there is some $c \in (a, b)$ such that $f'(c) = 0$.

Proof. By the maximum value theorem, $f$ attains a global maximum and a global minimum on this interval. If both are attained only at the endpoints, then $f$ is constant since $f(a) = f(b)$, and any $c$ works. Otherwise one of them is attained in $(a, b)$, where it is a local extremum as well, and the result follows from the previous proposition.

Theorem 14.5 (Generalized mean value theorem). Let $f, g : [a, b] \to \mathbb{R}$ be continuous and differentiable in $(a, b)$. Then there is $c \in (a, b)$ such that

\[ [f(b) - f(a)] g'(c) = [g(b) - g(a)] f'(c) \]

Remark 14.6. One way to think of this is to rearrange terms:

\[ \frac{f(b) - f(a)}{g(b) - g(a)} = \frac{f'(c)}{g'(c)} \]

Proof. Take $h(x) = [f(b) - f(a)] g(x) - [g(b) - g(a)] f(x)$. We want to show that $h'(c) = 0$ for some $c \in (a, b)$. Note that $h(a) = h(b)$ and apply the previous theorem.


Theorem 14.7 (Mean value theorem). If $f : [a, b] \to \mathbb{R}$ is continuous and differentiable on $(a, b)$, then there is some $c \in (a, b)$ such that

\[ f(b) - f(a) = (b - a) f'(c) \]

Proof. Take $g(x) = x$ above.

Here is an application.

Proposition 14.8. Let $f$ be differentiable on $(a, b)$. Then,

(a) if $f'(x) > 0$ for $x \in (a, b)$ then $f$ is strictly increasing,

(b) if $f'(x) < 0$ for $x \in (a, b)$ then $f$ is strictly decreasing,

(c) if $f'(x) \ge 0$ for $x \in (a, b)$ then $f$ is increasing,

(d) if $f'(x) \le 0$ for $x \in (a, b)$ then $f$ is decreasing.

Proof. We prove (a); the other cases are the same. Let $a < x < y < b$. Then by the mean value theorem,

\[ \frac{f(y) - f(x)}{y - x} = f'(c) > 0 \]

for some $c \in (x, y)$. Thus $f(y) > f(x)$.

Proposition 14.9 (Intermediate value theorem for derivatives). Let $f : [a, b] \to \mathbb{R}$ be differentiable, such that $f'(a) < f'(b)$. Let $\lambda \in (f'(a), f'(b))$. Then there is some $x \in (a, b)$ such that $f'(x) = \lambda$.

Remark 14.10. Note that this isn't an immediate application of the intermediate value theorem, since derivatives of differentiable functions need not be continuous.

Proof. Take $g(x) = f(x) - \lambda x$. We want to show that $g'$ has a zero in $(a, b)$. Note that $g'(a) < 0$ and $g'(b) > 0$. Using the definition of derivative, this means there is some point $t_1 \in (a, b)$ such that $g(t_1) < g(a)$ and a point $t_2 \in (a, b)$ such that $g(t_2) < g(b)$. Thus neither $a$ nor $b$ can be a global minimum of $g$. Since $g$ is continuous on $[a, b]$, it attains a global minimum in this interval not at the endpoints, and at this point the derivative of $g$ vanishes.

15 Lecture 15

15.1 L’Hospital’s Rule

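Theorem 15.1 (L'Hospital's Rule). Suppose $f, g : (a, b) \to \mathbb{R}$ are differentiable in $(a, b)$ and $g'(x) \ne 0$ there. Allow $\pm\infty$ for the symbols $a, b$. Suppose:

\[ \lim_{x \to a^+} \frac{f'(x)}{g'(x)} = A \]

where $A$ may be finite or $\pm\infty$. If one of the following is true:

(a) $\lim_{x \to a^+} f(x) = \lim_{x \to a^+} g(x) = 0$

(b) $\lim_{x \to a^+} g(x) = \infty$

Then one has:

\[ \lim_{x \to a^+} \frac{f(x)}{g(x)} = A \]

The symmetric statements for $b^-$ are also true.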

Proof. In this proof we will have to choose a lot of symbols. One should think of $c, c', c''$ as being increasingly close to $a$ as the number of $'$ symbols increases.

Case 1: $A$ is finite or $A = -\infty$. The assumptions of this case allow us to choose some $q \in (A, \infty)$. The goal is to show that there is some interval $(a, a + \delta)$ such that if $x \in (a, a + \delta)$ then $\frac{f(x)}{g(x)} < q$. For technical reasons we will see later, we want to again choose an $r \in (A, q)$, and we will show that $\frac{f(x)}{g(x)} \le r < q$ instead.

Since the limit of the expression is $A$, there is some $c$ such that $x \in (a, c)$ implies that

\[ \frac{f'(x)}{g'(x)} < r \]

Now choose $\alpha < \beta$ in $(a, c)$. Using the generalized mean value theorem, there is some $t \in (\alpha, \beta)$ such that

\[ \frac{f(\alpha) - f(\beta)}{g(\alpha) - g(\beta)} = \frac{f'(t)}{g'(t)} < r \]

If (a) holds, we can take the limit $\alpha \to a$ to find that for $\beta \in (a, c)$, one has

\[ \frac{f(\beta)}{g(\beta)} \le r < q \]

This proves the goal in this case.

If (b) holds, fix $\beta$; we can find some $c' \in (a, \beta)$ such that $g(\alpha) > \max(0, g(\beta))$ for $\alpha \in (a, c')$, i.e. such that

\[ \frac{g(\alpha) - g(\beta)}{g(\alpha)} > 0 \]

for $\alpha \in (a, c')$. Multiplying by this factor, one has:

\[ \frac{f(\alpha) - f(\beta)}{g(\alpha)} < r \, \frac{g(\alpha) - g(\beta)}{g(\alpha)} \]

so

\[ \frac{f(\alpha)}{g(\alpha)} < r - r \frac{g(\beta)}{g(\alpha)} + \frac{f(\beta)}{g(\alpha)} = r - \frac{M}{g(\alpha)} \]

for the constant $M = r g(\beta) - f(\beta)$ (recall that $\beta$ and $r$ are fixed). Let $\varepsilon = \frac{1}{2}(q - r)$. Since $g(\alpha) \to \infty$ as $\alpha \to a$, there is some $c'' \in (a, c')$ such that for $\alpha \in (a, c'')$ one has that $\left|\frac{M}{g(\alpha)}\right| < \varepsilon$ so that

\[ \frac{f(\alpha)}{g(\alpha)} < r + \varepsilon < q \]

This proves the goal in this case.

Case 2: $A$ is finite or $A = \infty$. One can repeat the argument for $q < A$. Together, this completes the argument.

Remark 15.2. Of course, one could loosen condition (b) to allow $\lim_{x \to a^+} g(x) = -\infty$ by taking $-g(x)$.

15.2 Power series

Definition 15.3. A power series is defined given a sequence $\{a_n\}_{n \ge 0}$ by:

\[ \sum_{n=0}^{\infty} a_n z^n \]

It is immaterial whether this series converges or diverges for a particular $z$ in this definition, though this is a question we will ask later.

A quick application of the root test gives:

Proposition 15.4. Given a power series

\[ f(z) = \sum_{n=0}^{\infty} a_n z^n \]

put

\[ \alpha = \limsup_{n \to \infty} \sqrt[n]{|a_n|} \]

and

\[ R = \frac{1}{\alpha} \]

(with the conventions $R = \infty$ if $\alpha = 0$ and $R = 0$ if $\alpha = \infty$). Then, $f(z)$ converges for $|z| < R$ and diverges for $|z| > R$. This $R$ is called the radius of convergence.


Proposition 15.5. Given a power series

\[ f(z) = \sum_{n=0}^{\infty} a_n z^n \]

put

\[ \alpha = \limsup_{n \to \infty} \frac{|a_{n+1}|}{|a_n|} \]

Then the radius of convergence $R$ has $R \ge \frac{1}{\alpha}$.

Example 15.6. We compute the radius of convergence:

(a) $\sum n^n z^n$ has $R = 0$

(b) $\sum \frac{z^n}{n!}$ has $R = \infty$ (use ratio test instead)

(c) $\sum z^n$ has $R = 1$. If $|z| = 1$ then the series diverges.

(d) $\sum \frac{z^n}{n}$ has $R = 1$. It diverges for $z = 1$ and converges for $z = -1$.

(e) $\sum \frac{z^n}{n^2}$ has $R = 1$ and converges for $|z| = 1$.
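The values of $\sqrt[n]{|a_n|}$ for these five examples can be estimated numerically. The sketch below evaluates the $n$th root at a single large $n$ through logarithms (to avoid overflow); this only suggests the lim sup and proves nothing, and the dictionary of coefficient formulas is written here purely for illustration.

```python
import math

def root_estimate(log_abs_coeff, n):
    """exp(log|a_n| / n), i.e. the n-th root of |a_n|, computed via logarithms."""
    return math.exp(log_abs_coeff(n) / n)

# log|a_n| for the coefficient sequences in (a) through (e).
examples = {
    "n^n": lambda n: n * math.log(n),
    "1/n!": lambda n: -math.lgamma(n + 1),   # lgamma(n + 1) = log(n!)
    "1": lambda n: 0.0,
    "1/n": lambda n: -math.log(n),
    "1/n^2": lambda n: -2 * math.log(n),
}

estimates = {name: root_estimate(f, 10**6) for name, f in examples.items()}
```

The estimate for $n^n$ is huge (consistent with $R = 0$), the one for $1/n!$ is tiny (consistent with $R = \infty$), and the other three are close to 1 (consistent with $R = 1$).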

15.3 Taylor series

Definition 15.7. Let $f$ be a function for which all higher derivatives exist at $a$. Then, define the Taylor series of $f$ at $a$ to be the formal power series:

\[ T_f(x) = \sum_{n=0}^{\infty} \frac{f^{(n)}(a)}{n!} (x - a)^n \]

Theorem 15.8 (Taylor's theorem). Let $f : [a, b] \to \mathbb{R}$, suppose $f^{(n-1)}$ is continuous on $[a, b]$ (and therefore all lower derivatives are continuous), and $f^{(n)}$ exists on $(a, b)$. Let $\alpha, \beta \in [a, b]$ be distinct points and define

\[ P(t) = \sum_{k=0}^{n-1} \frac{f^{(k)}(\alpha)}{k!} (t - \alpha)^k \]

Then there exists a point $c$ between $\alpha$ and $\beta$ such that

\[ f(\beta) = P(\beta) + \frac{f^{(n)}(c)}{n!} (\beta - \alpha)^n. \]

Remark 15.9. Let $n = 1$. Then $P(t) = f(\alpha)$ and the statement is that there is $c \in (\alpha, \beta)$ such that

\[ f(\beta) - f(\alpha) = f'(c)(\beta - \alpha) \]

i.e. the mean value theorem. Thus one can think of this as a "higher order mean value theorem."

Proof. Without loss of generality let $\alpha < \beta$. Choose $M$ such that

\[ f(\beta) = P(\beta) + M (\beta - \alpha)^n \]

and let

\[ g(x) = f(x) - P(x) - M (x - \alpha)^n \]

so that $g(\beta) = 0$. We want to show that $n! M = f^{(n)}(x)$ for some $x \in (\alpha, \beta)$. Since $P$ has degree at most $n - 1$, one finds that

\[ g^{(n)}(x) = f^{(n)}(x) - n! M \]

so we've reduced our claim to showing that $g^{(n)}(x) = 0$ for some $x \in (\alpha, \beta)$.

Note that $g(\alpha) = 0$. By Rolle's theorem (here we need continuity of the derivatives at the endpoints) there is some $c_1 \in (\alpha, \beta)$ such that $g'(c_1) = 0$. Notice that also $g'(\alpha) = 0$, and in fact all higher derivatives up to $g^{(n-1)}(\alpha)$ vanish. Thus we can repeat the argument for $g'$ and $(\alpha, c_1)$, and then for $g''$ and $(\alpha, c_2)$, and at the end we find $c_n = c$.
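As a numerical illustration of the theorem, one can take $f = \exp$, for which every derivative is again $\exp$. The sketch below checks that the actual remainder $f(\beta) - P(\beta)$ lies within the range of values that $\frac{f^{(n)}(c)}{n!}(\beta - \alpha)^n$ can take for $c$ between $\alpha$ and $\beta$. The choice of $f$ and the function names are assumptions made for this example.

```python
import math

def taylor_polynomial_exp(alpha, t, n):
    """P(t) = sum over k < n of f^(k)(alpha)/k! * (t - alpha)^k, for f = exp."""
    return sum(math.exp(alpha) * (t - alpha) ** k / math.factorial(k)
               for k in range(n))

def remainder_range_exp(alpha, beta, n):
    """Range of f^(n)(c)/n! * (beta - alpha)^n as c runs between alpha and beta.

    For f = exp, f^(n)(c) = e^c is monotone in c, so the extreme values
    occur at the two endpoints.
    """
    lo, hi = sorted((alpha, beta))
    scale = (beta - alpha) ** n / math.factorial(n)
    candidates = (math.exp(lo) * scale, math.exp(hi) * scale)
    return min(candidates), max(candidates)

alpha, beta, n = 0.0, 1.0, 5
remainder = math.exp(beta) - taylor_polynomial_exp(alpha, beta, n)
low, high = remainder_range_exp(alpha, beta, n)
```

With these values the remainder is roughly 0.0099, which sits between $\frac{1}{120}$ and $\frac{e}{120}$ as the theorem requires.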


Remark 15.10. Note that Taylor's theorem doesn't say anything about convergence of the Taylor series. Consider the function:

\[ f(x) = \begin{cases} e^{-1/x} & x > 0 \\ 0 & x \le 0 \end{cases} \]

All higher derivatives of this function exist and are continuous everywhere. However, its Taylor series at 0 is $T_f(x) = 0$, which does not agree with $f(x)$ for any $x > 0$.

16 Lecture 16

16.1 The Riemann-Stieltjes integral

We will have to introduce a lot of notation in this section. For the entirety of this section $f : [a, b] \to \mathbb{R}$ will refer to a bounded function, and $\alpha : [a, b] \to \mathbb{R}$ a monotonically increasing weight function. We will treat other cases later.

Definition 16.1. Let $[a, b]$ be an interval. Let $\alpha : [a, b] \to \mathbb{R}$ be a monotonically increasing weight function. If one would prefer to simplify the exposition on a first reading, one can just fix $\alpha(x) = x$.

A partition $P$ of $[a, b]$ can be given by a finite set of points $a = x_0 \le x_1 \le \cdots \le x_n = b$ which we think of as endpoints of subintervals of $[a, b]$, i.e.

\[ [a, b] = [x_0, x_1] \cup [x_1, x_2] \cup \cdots \cup [x_{n-1}, x_n] \]

Note that this is not a partition in the sense of equivalence relations since the intervals are not disjoint. Write:

\[ \Delta\alpha_i = \alpha(x_i) - \alpha(x_{i-1}) \]

i.e. the length of $[x_{i-1}, x_i]$ as measured by $\alpha$. Unfortunately one does have to remember the convention that $\Delta\alpha_i$ refers to the interval where $x_i$ is the upper endpoint.

Let

\[ M_i = \sup\{f(x) \mid x \in [x_{i-1}, x_i]\} \]

\[ m_i = \inf\{f(x) \mid x \in [x_{i-1}, x_i]\} \]

Note that both these exist since $f$ is bounded.

Define the upper and lower Darboux sums:

\[ U(P, f, \alpha) = \sum_{i=1}^{n} M_i \Delta\alpha_i \]

\[ L(P, f, \alpha) = \sum_{i=1}^{n} m_i \Delta\alpha_i \]

Define the upper and lower Darboux integrals:

\[ U(f, \alpha) = \inf\{U(P, f, \alpha) \mid \text{partitions } P\} \]

\[ L(f, \alpha) = \sup\{L(P, f, \alpha) \mid \text{partitions } P\} \]

where we range over all partitions $P$.

If $U(f, \alpha) = L(f, \alpha)$ then we say $f$ is Riemann-integrable (with respect to $\alpha$) and define the Riemann-Stieltjes integral to be this common value:

\[ \int_a^b f \, d\alpha \]

Definition 16.2. Let $P, Q$ be two partitions, given by points $x_0, \ldots, x_n$ and $y_0, \ldots, y_m$. We say that $P$ is a refinement of $Q$ if $\{y_0, \ldots, y_m\} \subset \{x_0, \ldots, x_n\}$. Note that we forget about the "multiplicities" here, i.e. $P = (0, 1, 2, 3)$ is a refinement of $(0, 1, 1, 3)$. Further, any two partitions $P$ and $Q$ have a common refinement $P \cup Q$. The common refinement is unique only up to repetition of one of the points $x_i$.

Proposition 16.3. If $P$ is a refinement of $Q$, then

\[ U(P, f, \alpha) \le U(Q, f, \alpha) \]

\[ L(Q, f, \alpha) \le L(P, f, \alpha) \]

Proof. We will only prove the first statement. It suffices to prove the case where $P$ has only one more point than $Q$. If that point is already a point of $Q$ there is nothing to prove, since $\Delta\alpha_i = 0$ for that term, so suppose that the extra point of $P$ is $x_i \in (x_{i-1}, x_{i+1})$. Writing $\alpha_j = \alpha(x_j)$,

\[ U(Q, f, \alpha) - U(P, f, \alpha) = \sup f([x_{i-1}, x_{i+1}]) (\alpha_{i+1} - \alpha_{i-1}) - \sup f([x_{i-1}, x_i]) (\alpha_i - \alpha_{i-1}) - \sup f([x_i, x_{i+1}]) (\alpha_{i+1} - \alpha_i) \]

But notice that $L := \sup f([x_{i-1}, x_{i+1}]) \ge \sup f(I)$ for any subinterval $I$, and that each increment of $\alpha$ is nonnegative, so

\[ U(Q, f, \alpha) - U(P, f, \alpha) \ge L \big[(\alpha_{i+1} - \alpha_{i-1}) - (\alpha_i - \alpha_{i-1}) - (\alpha_{i+1} - \alpha_i)\big] = 0 \]

Proposition 16.4. $U(f,\alpha) \ge L(f,\alpha)$

Proof. It is clear that for any particular $P$, one has $U(P,f,\alpha) \ge L(P,f,\alpha)$. Further, for any $P_1$ and $P_2$ with common refinement $Q$, one has:

$$L(P_1,f,\alpha) \le L(Q,f,\alpha) \le U(Q,f,\alpha) \le U(P_2,f,\alpha)$$

Taking the sup over $P_1$ and the inf over $P_2$ gives:

$$L(f,\alpha) \le U(f,\alpha)$$

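The following is a necessary and sufficient condition for integrability.

Proposition 16.5. $f$ is Riemann integrable (with respect to $\alpha$) if and only if for every $\varepsilon > 0$ there is a partition $P$ such that

$$U(P,f,\alpha) - L(P,f,\alpha) < \varepsilon$$

Proof. Suppose the condition holds. Note that

$$0 \le U(f,\alpha) - L(f,\alpha) \le U(P,f,\alpha) - L(P,f,\alpha)$$

for any $P$, so $U(f,\alpha) - L(f,\alpha) < \varepsilon$ for every $\varepsilon > 0$ and the result follows. Conversely, if $f$ is Riemann-integrable, then there exists a $P$ such that

$$U(P,f,\alpha) - U(f,\alpha) < \frac{\varepsilon}{2}$$

and a $Q$ such that

$$L(f,\alpha) - L(Q,f,\alpha) < \frac{\varepsilon}{2}$$

and thus, since $U(f,\alpha) = L(f,\alpha)$, their common refinement $R$ has

$$U(R,f,\alpha) - L(R,f,\alpha) \le U(P,f,\alpha) - L(Q,f,\alpha) < \varepsilon$$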

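Example 16.6 ($\alpha(x) \ne x$). We can consider the case where $\alpha(x)$ is a "step" function, i.e. take the floor function:

$$\alpha(x) = \lfloor x \rfloor$$

Now, what is $\int_a^b f\,d\alpha$, if it exists? Note that the only intervals which contribute to the sums are those on which $\alpha$ jumps, i.e. those $[x_{i-1}, x_i]$ with an integer in $(x_{i-1}, x_i]$, since otherwise one would have $\Delta\alpha_i = 0$. Further, we can refine any partition so that each interval contains at most one integer. Then as long as $f$ is continuous at each integer, one has that (say $a, b \in \mathbb{Z}$):

$$\int_a^b f\,d\alpha = \sum_{i=a+1}^{b} f(i).$$

The sum starts at $a+1$ because $\lfloor x \rfloor$ is constant on $[a, a+1)$, so $\alpha$ has no jump at $a$ inside $[a,b]$; for instance $f \equiv 1$ gives $\alpha(b) - \alpha(a) = b - a$.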
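As a numerical sanity check of this example, the following sketch evaluates a Riemann-Stieltjes sum against $\alpha = $ floor on a fine uniform partition. The helper name `stieltjes_sum`, the right-endpoint tags, and the test function $f(x) = x^2$ are assumptions chosen for illustration and do not come from the notes.

```python
import math

def stieltjes_sum(f, alpha, a, b, n):
    """Right-endpoint Riemann-Stieltjes sum of f against alpha on n equal subintervals."""
    total = 0.0
    for i in range(1, n + 1):
        x_prev = a + (b - a) * (i - 1) / n
        x_i = a + (b - a) * i / n
        # only subintervals on which alpha jumps contribute
        total += f(x_i) * (alpha(x_i) - alpha(x_prev))
    return total

f = lambda x: x * x
a, b = 0, 4
approx = stieltjes_sum(f, math.floor, a, b, 4000)
# the jumps of floor inside (a, b] sit at a+1, ..., b
jump_sum = sum(f(i) for i in range(a + 1, b + 1))
print(approx, jump_sum)
```

Here the partition points include every integer, so the sum picks up exactly one term $f(i)$ per jump of the floor function.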

Example 16.7. As for another example, let's take $\alpha(x) = 2x$. Then one can check that:

$$\int_a^b f\,d\alpha = 2\int_a^b f\,dx$$

if the integral exists at all. We will see more examples later.

16.2 Some Riemann-integrable functions

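Proposition 16.8. If $f$ is monotonic and $\alpha$ is continuous, then $f$ is Riemann-integrable.

Proof. Suppose that $f$ is monotonically increasing. Then, in the Darboux sums, for the subinterval $[x_{i-1},x_i]$, one has that $M_i = f(x_i)$ and $m_i = f(x_{i-1})$. Take $P$ to be a partition where $\Delta\alpha_i = \frac{\alpha(b)-\alpha(a)}{n}$ for every $i$; such a partition exists by the intermediate value theorem, since $\alpha$ is continuous. Then,

$$U(P,f,\alpha) - L(P,f,\alpha) = \frac{\alpha(b)-\alpha(a)}{n}\sum_{i=1}^{n}\big(f(x_i) - f(x_{i-1})\big) = \frac{(\alpha(b)-\alpha(a))(f(b)-f(a))}{n}$$

For large $n$ this becomes arbitrarily small.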
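The shrinking gap between upper and lower sums can be observed numerically. The sketch below is only an illustration under stated assumptions: it uses a uniform partition in $x$ (not the equal-$\Delta\alpha_i$ partition of the proof), and the choices $f(x) = x^2$, $\alpha(x) = x^3$ on $[0,1]$ and the name `darboux_sums` are hypothetical.

```python
def darboux_sums(f, alpha, a, b, n):
    """Lower and upper Darboux sums for increasing f and alpha on n equal subintervals.

    For increasing f the sup on [x_{i-1}, x_i] is f(x_i) and the inf is f(x_{i-1}).
    """
    lower = upper = 0.0
    for i in range(1, n + 1):
        x_prev = a + (b - a) * (i - 1) / n
        x_i = a + (b - a) * i / n
        d_alpha = alpha(x_i) - alpha(x_prev)
        lower += f(x_prev) * d_alpha
        upper += f(x_i) * d_alpha
    return lower, upper

f = lambda x: x * x
alpha = lambda x: x ** 3
for n in (10, 100, 1000):
    lo, up = darboux_sums(f, alpha, 0.0, 1.0, n)
    print(n, lo, up, up - lo)
```

Both sums squeeze toward $3/5$, which is the value of $\int_0^1 x^2 \cdot 3x^2\,dx$, consistent with the formula $\int f\,d\alpha = \int f\alpha'\,dx$ proved later in these notes.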

Proposition 16.9. If $f$ is continuous then $f$ is Riemann-integrable for any $\alpha$.

Proof. Given $\varepsilon > 0$, choose some $\eta > 0$ such that

$$(\alpha(b) - \alpha(a))\,\eta < \varepsilon$$

Since $f$ is continuous on $[a,b]$ and $[a,b]$ is compact, it is uniformly continuous, so there exists some $\delta > 0$ such that

$$|f(x) - f(y)| < \eta$$

for $|x - y| < \delta$. Now, take $P$ to be a partition where $x_i - x_{i-1} < \delta$ for every $i$, so that

$$M_i - m_i \le \eta$$

and so

$$U(P,f,\alpha) - L(P,f,\alpha) \le \eta\sum_i \Delta\alpha_i = \eta\,(\alpha(b) - \alpha(a)) < \varepsilon$$

Proposition 16.10. Suppose $f$ has finitely many points of discontinuity, and $\alpha$ is continuous at every point where $f$ is discontinuous. Then $f$ is Riemann-integrable.

Proof. The idea is as follows: where $f$ is continuous we can bound $M_i - m_i$. Where $\alpha$ is continuous we can bound $\alpha(x_i) - \alpha(x_{i-1})$. We formalize this as follows. Let $p_1, \dots, p_k$ be the points of discontinuity for $f$, and find disjoint intervals $[c_i, d_i]$ such that $p_i$ is the only point of discontinuity in $[c_i, d_i]$ and such that $\sum_i (\alpha(d_i) - \alpha(c_i)) < \varepsilon$. Let $M$ be an upper bound on $|f|$ on $[a,b]$. Then the contribution of these intervals to the difference of the Darboux sums is at most

$$\sum_i (M_i - m_i)(\alpha(d_i) - \alpha(c_i)) \le 2M\sum_i (\alpha(d_i) - \alpha(c_i)) < 2M\varepsilon$$

Let $K = [a,b] - \bigcup_i (c_i, d_i)$. This is compact, so $f$ is uniformly continuous on this set, so find a $\delta$ such that $|f(x) - f(y)| < \varepsilon$ if $|x - y| < \delta$. Now, there is a partition $P$ such that

(a) $c_i$ and $d_i$ are in $P$

(b) no point in $(c_i, d_i)$ is in $P$

(c) every interval other than the $[c_i, d_i]$ has length less than $\delta$

Then, following the proof of the previous proposition on the intervals of type (c) and using the bound above on the intervals $[c_i, d_i]$, one has

$$U(P,f,\alpha) - L(P,f,\alpha) \le (\alpha(b) - \alpha(a))\,\varepsilon + 2M\varepsilon$$

which is arbitrarily small.

Proposition 16.11. Let $f$ be Riemann-integrable with respect to $\alpha$, and $f([a,b]) \subset I$ for some closed interval $I$. Suppose that $g : I \to \mathbb{R}$ is continuous. Then $h := g \circ f$ is Riemann-integrable with respect to $\alpha$.

Proof. Fix an $\varepsilon > 0$. Now, consider the sum we want to make small for some partition $P$:

$$U(P,h,\alpha) - L(P,h,\alpha) = \sum_i (M_i^h - m_i^h)(\alpha(x_i) - \alpha(x_{i-1}))$$

where the $M_i^h, m_i^h$ correspond to $h = g \circ f$. Let $M_i^f$ and $m_i^f$ be the corresponding numbers for $f$.

First, we use uniform continuity of $g$. There is $\delta > 0$ such that $|g(x) - g(y)| < \varepsilon$ if $|x - y| \le \delta$. Then, if $M_i^f - m_i^f < \delta$, i.e. $|f(x) - f(y)| < \delta$ for $x, y \in [x_{i-1}, x_i]$, one has that $M_i^h - m_i^h \le \varepsilon$.

However, if $M_i^f - m_i^f \ge \delta$ we have no such control. Since $f$ is Riemann-integrable, the intervals on which $M_i^f - m_i^f \ge \delta$ have to be "short enough." We can choose a partition $P = \{x_0, \dots, x_n\}$ such that $U(P,f,\alpha) - L(P,f,\alpha) < \delta^2$. Then we have

$$\delta\sum \Delta\alpha_i \le \sum (M_i^f - m_i^f)\,\Delta\alpha_i < \delta^2, \qquad\text{so}\qquad \sum \Delta\alpha_i < \delta$$

where the sum is only taken over intervals where $M_i^f - m_i^f \ge \delta$. Note that this says that the intervals on which $M_i^f - m_i^f \ge \delta$ must be of total weighted length less than $\delta$. Let $K$ be an upper bound on $|g|$, so that $M_i^h - m_i^h \le 2K$. Then, over these intervals,

$$\sum (M_i^h - m_i^h)\,\Delta\alpha_i < 2K\delta.$$

Putting these two cases together, and taking $\delta$ such that $\delta < \varepsilon$, one has that

$$U(P,h,\alpha) - L(P,h,\alpha) \le \varepsilon\,(\alpha(b) - \alpha(a)) + 2K\delta < \varepsilon\,(\alpha(b) - \alpha(a) + 2K)$$

Since $\varepsilon$ was arbitrary, $h$ is Riemann-integrable by Proposition 16.5.

17 Lecture 17

17.1 Properties of the integral

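Proposition 17.1 (Linearity properties). Let $f, g$ be Riemann integrable with respect to $\alpha$ and let $c \in \mathbb{R}$. Then $cf$ and $f + g$ are also Riemann-integrable and have values:

$$\int_a^b cf\,d\alpha = c\int_a^b f\,d\alpha$$

$$\int_a^b (f + g)\,d\alpha = \int_a^b f\,d\alpha + \int_a^b g\,d\alpha$$

Proof. These follow directly from the definition. Let $M_i$ and $m_i$ correspond to $f$ and $M_i'$ and $m_i'$ correspond to $g$. Then one has that for any $P$,

$$U(P,f+g,\alpha) \le \sum_i (M_i + M_i')\,\Delta\alpha_i = U(P,f,\alpha) + U(P,g,\alpha)$$

The inequality is because the suprema of $f$ and $g$ may not be approached at the same point. Similarly,

$$L(P,f+g,\alpha) \ge L(P,f,\alpha) + L(P,g,\alpha)$$

Passing to common refinements and taking the inf and the sup gives $L(f,\alpha) + L(g,\alpha) \le L(f+g,\alpha) \le U(f+g,\alpha) \le U(f,\alpha) + U(g,\alpha)$, and since $U(f,\alpha) = L(f,\alpha)$ and $U(g,\alpha) = L(g,\alpha)$, one has that $U(f+g,\alpha) = L(f+g,\alpha)$, with the stated value.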

Proposition 17.2 (Bounds of integration). If $f$ is Riemann-integrable on $[a,b]$ and $c \in [a,b]$, then $f$ is also Riemann-integrable on $[a,c]$ and on $[c,b]$, and one has:

$$\int_a^c f\,d\alpha + \int_c^b f\,d\alpha = \int_a^b f\,d\alpha$$

Proof. Any partition of $[a,b]$ can be refined to include the point $c$. The rest is left as an exercise.

Proposition 17.3 (Comparison). Let $f, g$ be Riemann integrable such that $f(x) \le g(x)$ on $[a,b]$. Then

$$\int_a^b f\,d\alpha \le \int_a^b g\,d\alpha$$

Further, if $|f(x)| \le M$, then

$$\left|\int_a^b f\,d\alpha\right| \le M(\alpha(b) - \alpha(a))$$

Proof. Since $U(P,f,\alpha) \le U(P,g,\alpha)$ for every partition $P$, taking the inf over all $P$ yields the result. For the second statement, one has $U(P,f,\alpha) \le M(\alpha(b) - \alpha(a))$ and similarly $L(P,f,\alpha) \ge -M(\alpha(b) - \alpha(a))$, for any $P$.

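Proposition 17.4 (Linearity in the weights). If $f$ is Riemann integrable with respect to $\alpha_1$ and also $\alpha_2$, then $f$ is Riemann-integrable with respect to $\alpha_1 + \alpha_2$, and, for a constant $c > 0$, with respect to $c\alpha$ whenever it is with respect to $\alpha$:

$$\int_a^b f\,d(\alpha_1 + \alpha_2) = \int_a^b f\,d\alpha_1 + \int_a^b f\,d\alpha_2$$

$$\int_a^b f\,d(c\alpha) = c\int_a^b f\,d\alpha$$

Proof. This is immediate.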

Proposition 17.5 (Closure under products). If $f$ is Riemann integrable, and $g$ is Riemann integrable, then $fg$ is Riemann integrable.

Proof. By a previous proposition (Proposition 16.11 with $g(x) = x^2$), if $f$ is Riemann-integrable, then $f^2$ is. Then note that $(f+g)^2 = f^2 + 2fg + g^2$ and so $fg = \frac{1}{2}\big((f+g)^2 - f^2 - g^2\big)$, completing the proof.

Proposition 17.6 (Triangle inequality). If $f$ is Riemann integrable, then $|f|$ is, and

$$\left|\int_a^b f\,d\alpha\right| \le \int_a^b |f|\,d\alpha$$

Proof. For the first statement, note that since $g(x) = |x|$ is continuous, if $f$ is Riemann-integrable, then so is $|f| = g \circ f$ by Proposition 16.11. For the second statement, suppose $\int_a^b f\,d\alpha \ge 0$. Then since $f \le |f|$ the statement follows by comparison. If $\int_a^b f\,d\alpha \le 0$, then since $-f \le |f|$, the statement follows.

Proposition 17.7 (Intermediate value theorem for integrals or "average value" theorem). Let $f : [a,b] \to \mathbb{R}$ be continuous. Then there is an $x \in [a,b]$ such that

$$f(x) = \frac{1}{b-a}\int_a^b f\,dx$$

Proof. Let $M$ and $m$ be the maximum and minimum values of $f$ on $[a,b]$. Then, by comparison,

$$m \le \frac{1}{b-a}\int_a^b f\,dx \le M$$

Now apply the intermediate value theorem.

Proposition 17.8. Suppose $\alpha$ increases monotonically, is everywhere differentiable, and $\alpha'$ is Riemann integrable. Then $f$ is Riemann integrable with respect to $\alpha$ if and only if $f\alpha'$ is Riemann integrable with respect to $x$. In this case,

$$\int_a^b f\,d\alpha = \int_a^b f\alpha'\,dx$$

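Proof. It suffices to show the equality of the upper and lower integrals of $f\,d\alpha$ and $f\alpha'\,dx$. We will do the upper; the lower follows in the same way. Since $\alpha'$ is Riemann-integrable, for every $\varepsilon > 0$ there is a $P$ such that

$$U(P,\alpha',x) - L(P,\alpha',x) < \varepsilon$$

By the mean value theorem, we can find points $c_i \in [x_{i-1},x_i]$ such that

$$\Delta\alpha_i = \alpha(x_i) - \alpha(x_{i-1}) = \alpha'(c_i)(x_i - x_{i-1}) = \alpha'(c_i)\,\Delta x_i$$

For any $t_i \in [x_{i-1},x_i]$ one has:

$$\sum_i |\alpha'(t_i) - \alpha'(c_i)|\,\Delta x_i \le \sum_i \big(M_i^{\alpha'} - m_i^{\alpha'}\big)\,\Delta x_i < \varepsilon$$

Let $M$ be a bound for $|f|$. Then one has:

$$\Big|\sum_i f(t_i)\,\Delta\alpha_i - \sum_i f(t_i)\alpha'(t_i)\,\Delta x_i\Big| \le \sum_i |f(t_i)|\,|\Delta\alpha_i - \alpha'(t_i)\,\Delta x_i| \le M\sum_i |\Delta\alpha_i - \alpha'(t_i)\,\Delta x_i|$$

$$\le M\sum_i \big(|\Delta\alpha_i - \alpha'(c_i)\,\Delta x_i| + |\alpha'(c_i)\,\Delta x_i - \alpha'(t_i)\,\Delta x_i|\big) < M(0 + \varepsilon) = M\varepsilon$$

Lemma 17.9. Let $c > 0$. Then $|x - y| < c$ if and only if $x < c + y$ and $y < c + x$.

So,

$$\sum_i f(t_i)\,\Delta\alpha_i \le \sum_i f(t_i)\alpha'(t_i)\,\Delta x_i + M\varepsilon \le U(P,f\alpha',x) + M\varepsilon$$

$$\sum_i f(t_i)\alpha'(t_i)\,\Delta x_i \le \sum_i f(t_i)\,\Delta\alpha_i + M\varepsilon \le U(P,f,\alpha) + M\varepsilon$$

Since this is true for any choice of $t_i$, taking the supremum over the $t_i$ in each subinterval gives:

$$U(P,f,\alpha) \le U(P,f\alpha',x) + M\varepsilon$$

$$U(P,f\alpha',x) \le U(P,f,\alpha) + M\varepsilon$$

So,

$$|U(P,f\alpha',x) - U(P,f,\alpha)| \le M\varepsilon$$

Since everything above remains true for any refinement of $P$,

$$|U(f\alpha',x) - U(f,\alpha)| \le M\varepsilon$$

and so, since $\varepsilon$ was arbitrary,

$$U(f\alpha',x) = U(f,\alpha)$$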

Proposition 17.10. Suppose $\varphi : [A,B] \to [a,b]$ is strictly increasing, continuous, and onto. Suppose $\alpha$ is monotonically increasing on $[a,b]$ and $f : [a,b] \to \mathbb{R}$ is Riemann integrable with respect to $\alpha$. Define by change of variables:

$$\beta(x) = \alpha(\varphi(x))$$

$$g(x) = f(\varphi(x))$$

Then $g$ is Riemann integrable with respect to $\beta$ and

$$\int_A^B g\,d\beta = \int_a^b f\,d\alpha$$

Proof. Since $\varphi$ is strictly increasing, a partition of $[A,B]$ gives a partition of $[a,b]$ and vice versa. Let them be $P$ and $Q$ respectively. Then one finds that

$$U(P,g,\beta) = U(Q,f,\alpha)$$

and likewise for the lower sums, and the result follows.

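Corollary 17.11. Suppose $\varphi'$ is Riemann integrable. Then,

$$\int_a^b f(x)\,dx = \int_A^B f(\varphi(y))\,\varphi'(y)\,dy$$

Proof. Take $\alpha(x) = x$ above, so that $\beta = \varphi$, and apply Proposition 17.8 to $\int_A^B g\,d\beta$.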

18 Lecture 18

18.1 The fundamental theorem of calculus

In this section we will take $\alpha(x) = x$.

Theorem 18.1 (The fundamental theorem of calculus I). Let $f$ be Riemann integrable on $[a,b]$. Define a function $F$ on $[a,b]$ by:

$$F(x) = \int_a^x f(t)\,dt$$

Then $F$ is continuous on $[a,b]$. Furthermore, if $f$ is continuous at a point $x_0$, then $F$ is differentiable at $x_0$ and:

$$F'(x_0) = f(x_0)$$

Proof. First we will show that $F$ is uniformly continuous, and thus continuous. Let $M > 0$ be a bound on $|f|$. Then,

$$|F(y) - F(x)| = \left|\int_x^y f(t)\,dt\right| \le M|x - y|$$

Thus, if $|x - y| < \frac{\varepsilon}{M}$, then $|F(x) - F(y)| < \varepsilon$, finishing the claim.

Now, suppose $f$ is continuous at $x_0$. For any $\varepsilon > 0$ there is $\delta > 0$ such that $|f(u) - f(x_0)| < \varepsilon$ for $|u - x_0| < \delta$. Then for any $t \in (x_0 - \delta, x_0 + \delta)$ with $t \ne x_0$,

$$\frac{F(t) - F(x_0)}{t - x_0} - f(x_0) = \frac{1}{t - x_0}\int_{x_0}^{t} \big(f(u) - f(x_0)\big)\,du$$

so, by the comparison property,

$$\left|\frac{F(t) - F(x_0)}{t - x_0} - f(x_0)\right| \le \varepsilon$$

Thus one has that

$$F'(x_0) = \lim_{t \to x_0}\frac{F(t) - F(x_0)}{t - x_0} = f(x_0)$$

as desired.

Theorem 18.2 (The fundamental theorem of calculus II). If $f$ is Riemann integrable on $[a,b]$ and has an antiderivative, i.e. a function $F$ such that $F' = f$, then

$$\int_a^b f(x)\,dx = F(b) - F(a)$$

for any antiderivative $F$.

Proof. Fix $\varepsilon > 0$. Let $P$ be a partition such that $U(P,f) - L(P,f) < \varepsilon$. Applying the mean value theorem to each interval, there is some $t_i \in [x_{i-1},x_i]$ such that

$$F(x_i) - F(x_{i-1}) = F'(t_i)\,\Delta x_i = f(t_i)\,\Delta x_i$$

And so

$$\sum_i f(t_i)\,\Delta x_i = F(b) - F(a)$$

Since both the above sum and $\int_a^b f(x)\,dx$ lie between $L(P,f)$ and $U(P,f)$, one has that

$$\left|F(b) - F(a) - \int_a^b f(x)\,dx\right| < \varepsilon$$

Since $\varepsilon$ was arbitrary, the two quantities are equal.
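A quick numerical check of the second fundamental theorem can be sketched as follows. The midpoint rule, the helper name `midpoint_sum`, and the choice $f = \cos$ with antiderivative $F = \sin$ on $[0,2]$ are assumptions made only for this illustration.

```python
import math

def midpoint_sum(f, a, b, n):
    """Riemann sum of f on [a, b] with n equal subintervals, tagged at midpoints."""
    h = (b - a) / n
    return sum(f(a + (i + 0.5) * h) for i in range(n)) * h

approx = midpoint_sum(math.cos, 0.0, 2.0, 1000)
exact = math.sin(2.0) - math.sin(0.0)   # F(b) - F(a) with F = sin
print(approx, exact)
```

Any tagged Riemann sum lies between the lower and upper Darboux sums of its partition, which is why it approaches $F(b) - F(a)$ as the partition is refined.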

Corollary 18.3 (Substitution). Let $u$ be a differentiable function on $[a,b]$ such that $u'$ is continuous, and $I$ be an open interval such that $u([a,b]) \subset I$. If $f$ is continuous on $I$, then $f \circ u$ is continuous on $[a,b]$ and

$$\int_a^b f(u(x))\,u'(x)\,dx = \int_{u(a)}^{u(b)} f(u)\,du$$

Proof. Let $F$ be an antiderivative of $f$, as in the fundamental theorem. Then if $G = F \circ u$, by the chain rule one has that $G'(x) = F'(u(x))\,u'(x) = f(u(x))\,u'(x)$. Now one has

$$\int_a^b f(u(x))\,u'(x)\,dx = \int_a^b G'(x)\,dx = G(b) - G(a) = F(u(b)) - F(u(a)) = \int_{u(a)}^{u(b)} f(u)\,du$$

Corollary 18.4 (Integration by parts). Suppose $F$ and $G$ are differentiable functions on $[a,b]$ and their derivatives $f, g$ are Riemann integrable. Then

$$\int_a^b F(x)g(x)\,dx = F(b)G(b) - F(a)G(a) - \int_a^b f(x)G(x)\,dx$$

Proof. Put $H(x) = F(x)G(x)$. Then $H'(x) = F(x)g(x) + f(x)G(x)$, and apply the second fundamental theorem.

19 Lecture 19

19.1 Things that aren’t true

First, I want to introduce the most immediate version of convergence one can ask for of a sequence of functions.

Definition 19.1. Let $f_n : I \to \mathbb{R}$ be a sequence of functions. Then one says they converge pointwise to a function $f : J \to \mathbb{R}$ (for $J \subset I$) if

$$\lim_{n \to \infty} f_n(x) = f(x)$$

for every fixed $x \in J$.

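Example 19.2 (Interchanging limits for sequences). Define

$$s_{m,n} = \frac{m}{m+n}$$

Then,

$$\lim_{m \to \infty}\lim_{n \to \infty} s_{m,n} = \lim_{m \to \infty} 0 = 0$$

but

$$\lim_{n \to \infty}\lim_{m \to \infty} s_{m,n} = \lim_{n \to \infty} 1 = 1$$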
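The two iterated limits can be approximated by letting one index be much larger than the other. This is only a sketch: the function name `s` and the stand-in value `huge` for "the inner index tends to infinity" are assumptions for illustration.

```python
def s(m, n):
    return m / (m + n)

huge = 10**9  # stands in for the index whose limit is taken first

n_first = [s(m, huge) for m in (1, 10, 100)]   # n -> infinity first: values near 0
m_first = [s(huge, n) for n in (1, 10, 100)]   # m -> infinity first: values near 1
print(n_first)
print(m_first)
```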

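Example 19.3 (Limit of continuous functions is not continuous). Define

$$f_k(x) = \frac{x^2}{(1+x^2)^k}, \qquad k = 0, 1, 2, \dots$$

and the partial sums

$$F_n(x) = \sum_{k=0}^{n-1} f_k(x)$$

Then,

$$\lim_{n \to \infty} F_n(x) = \begin{cases} 0 & x = 0 \\ 1 + x^2 & x \ne 0 \end{cases}$$

since for fixed $x \ne 0$, the $f_k$ give a geometric sequence with first term $x^2$ and ratio $\frac{1}{1+x^2}$, and so the sum is

$$x^2 \cdot \frac{1}{1 - \frac{1}{1+x^2}} = 1 + x^2$$

This limit function is not continuous at $0$, although every $F_n$ is continuous.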
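The jump of the limit at $0$ shows up numerically. In this sketch the partial sum is taken over the terms $x^2/(1+x^2)^k$ for $k = 0, \dots, n-1$ (so the first term is $x^2$); the function name `partial_sum` and the sample points are assumptions for illustration.

```python
def partial_sum(n, x):
    """Sum of the n terms x^2 / (1 + x^2)^k for k = 0, ..., n-1."""
    ratio = 1.0 / (1.0 + x * x)
    term, total = x * x, 0.0
    for _ in range(n):
        total += term
        term *= ratio   # next term of the geometric sequence
    return total

for x in (0.0, 0.1, 0.5, 1.0):
    print(x, partial_sum(5000, x), 1 + x * x)
```

At $x = 0$ every partial sum is $0$, while for small nonzero $x$ the sums approach $1 + x^2 \approx 1$.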

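Example 19.4 (Limit of derivatives is not derivative of limit). Define

$$f_n(x) = \frac{\sin(nx)}{\sqrt{n}}$$

and define

$$f(x) = \lim_{n \to \infty} f_n(x) = 0$$

Then we have that

$$\frac{d}{dx}\lim_{n \to \infty} f_n(x) = 0$$

but

$$\frac{d}{dx} f_n(x) = \sqrt{n}\cos(nx)$$

does not converge to $0$ as $n \to \infty$; for instance at $x = 0$ it equals $\sqrt{n}$, which tends to $\infty$.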

Example 19.5 (Limit of integrals is not integral of limits). Define

$$f_n(x) = n^2 x (1 - x^2)^n$$

Then one has that

$$\int_0^1 f_n(x)\,dx = n^2\int_0^1 x(1-x^2)^n\,dx = n^2\left[-\frac{(1-x^2)^{n+1}}{2(n+1)}\right]_0^1 = \frac{n^2}{2n+2}$$

So,

$$\lim_{n \to \infty}\int_0^1 f_n(x)\,dx = \infty$$

However,

$$\lim_{n \to \infty} n^2 x (1-x^2)^n = 0$$

for $x \in [0,1]$, and so

$$\int_0^1 \lim_{n \to \infty} f_n(x)\,dx = 0$$
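The value of $\int_0^1 n^2 x(1-x^2)^n\,dx$ can be checked exactly by expanding with the binomial theorem and integrating term by term. The function name `integral_fn` is a hypothetical helper; the closed form it is compared against, $n^2/(2n+2)$, is the one obtained from the antiderivative $-(1-x^2)^{n+1}/(2(n+1))$.

```python
from fractions import Fraction
from math import comb

def integral_fn(n):
    """Exact integral over [0, 1] of n^2 * x * (1 - x^2)^n."""
    # x (1 - x^2)^n = sum_k C(n, k) (-1)^k x^(2k+1), and x^(2k+1) integrates to 1/(2k+2)
    inner = sum(Fraction((-1) ** k * comb(n, k), 2 * k + 2) for k in range(n + 1))
    return n * n * inner

for n in (1, 2, 5, 10):
    print(n, integral_fn(n), Fraction(n * n, 2 * n + 2))
```

The integrals grow roughly like $n/2$, even though the integrands tend to $0$ at every point of $[0,1]$.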

19.2 Uniform convergence

Definition 19.6. Let $f_n$ be a sequence of functions which converges pointwise to $f$. Then we say $f_n$ converges uniformly to $f$ on $E$ if for every $\varepsilon > 0$ there is an $N$ such that for $n > N$, one has

$$|f_n(x) - f(x)| < \varepsilon$$

for every $x \in E$.

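Example 19.7. Take $E = (-1, 1)$ and define

$$f_n(x) = (1 - |x|)^n$$

One can check that this converges pointwise to

$$f(x) = \begin{cases} 1 & x = 0 \\ 0 & x \ne 0 \end{cases}$$

However, it does not converge uniformly. When $x \ne 0$ one has that

$$|f(x) - f_n(x)| = (1 - |x|)^n$$

However, for $\varepsilon = \frac{1}{2}$ and any $n$, one can find $x \ne 0$ such that

$$|x| < 1 - \sqrt[n]{\tfrac{1}{2}}$$

and for such $x$ one has $|f(x) - f_n(x)| > \frac{1}{2}$.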

Example 19.8. Define

$$f_n(x) = \frac{1}{n}\sin(nx)$$

One can check that this converges to $0$ pointwise. It converges uniformly, since $|f_n(x)| \le \frac{1}{n}$ for every $x$.

Example 19.9. Take $E = [0,1]$ and

$$f_n(x) = x^n$$

This converges to

$$f(x) = \begin{cases} 0 & x \ne 1 \\ 1 & x = 1 \end{cases}$$

This does not converge uniformly, since $\sup_{x < 1} x^n = 1$ for every $n$.

None of the examples in the previous section converges uniformly. I encourage you to do the following as exercises and check against these notes.

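Example 19.10. Take $E = \mathbb{N}$ and define

$$s_m(n) = \frac{m}{m+n}$$

viewed as a sequence (indexed by $m$) of functions of $n \in E$. This does not converge uniformly in $E$. For fixed $n \in E$, the limit of the sequence is $1$. However,

$$\left|\frac{m}{m+n} - 1\right| = \left|\frac{n}{m+n}\right|$$

For, say, $\varepsilon = \frac{1}{2}$ and any $m$, there is some $n$ such that

$$\left|\frac{n}{m+n}\right| > \varepsilon$$

for example, any $n > m$.

Further, one can do this the other way, i.e. take

$$s_n(m) = \frac{m}{m+n}$$

viewed as a sequence (indexed by $n$) of functions of $m \in E$. This also does not converge uniformly in $E$. For any fixed $m \in E$ the limit of the sequence is $0$. However, for $\varepsilon = \frac{1}{2}$ and any $n$, there is an $m$ such that

$$\frac{m}{m+n} > \frac{1}{2}$$

for example, choose $m > n$.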

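Example 19.11. Define

$$f_k(x) = \frac{x^2}{(1+x^2)^k}, \qquad k = 0, 1, 2, \dots$$

and the partial sums

$$F_n(x) = \sum_{k=0}^{n-1} f_k(x)$$

so that

$$F(x) = \lim_{n \to \infty} F_n(x) = \begin{cases} 0 & x = 0 \\ 1 + x^2 & x \ne 0 \end{cases}$$

Then one has that

$$|F(x) - F_n(x)| = \begin{cases} 0 & x = 0 \\ (1+x^2)^{1-n} & x \ne 0 \end{cases}$$

The sequence cannot converge uniformly, since for say $\varepsilon = \frac{1}{2}$ and any $n > 1$,

$$(1+x^2)^{1-n} > \frac{1}{2}$$

is equivalent to

$$2 > (1+x^2)^{n-1}$$

which can be achieved by taking $x \ne 0$ such that

$$x^2 < \sqrt[n-1]{2} - 1$$

which is possible since $\sqrt[n-1]{2} - 1 > 0$.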

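Example 19.12. Take $E = [0,1]$ and define

$$f_n(x) = n^2 x (1-x^2)^n$$

so that

$$\lim_{n \to \infty} n^2 x (1-x^2)^n = 0$$

for every $x \in E$. However, the convergence is not uniform. The inequality

$$n^2 x (1-x^2)^n > 1$$

requires

$$(1-x^2)^n > x(1-x^2)^n > \frac{1}{n^2}, \qquad\text{i.e.}\qquad x^2 < 1 - \sqrt[n]{\frac{1}{n^2}}$$

so the points where $f_n$ is large lie close to $0$. Such points do exist: for instance, at $x = \frac{1}{n}$ Bernoulli's inequality gives $f_n(\frac{1}{n}) = n\,(1 - \frac{1}{n^2})^n \ge n\,(1 - \frac{1}{n}) = n - 1$, which is unbounded.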

20 Lecture 20

20.1 Basic criteria for uniform convergence

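Proposition 20.1 (Cauchy criterion). Let $E \subset \mathbb{R}$, and $f_n : E \to \mathbb{R}$. This sequence converges uniformly on $E$ if and only if for every $\varepsilon > 0$ there exists an integer $N$ such that for $m, n \ge N$ and all $x \in E$, one has

$$|f_n(x) - f_m(x)| < \varepsilon$$

Proof. Suppose $f_n$ converges uniformly to $f$. Then for every $\varepsilon$ there is an $N$ such that for $n > N$ one has $|f_n(x) - f(x)| < \frac{\varepsilon}{2}$ for all $x \in E$. Then, by the triangle inequality,

$$|f_n(x) - f_m(x)| \le |f_n(x) - f(x)| + |f(x) - f_m(x)| < \varepsilon$$

for $n, m > N$.

For the converse, using the usual Cauchy criterion at each point $x$, we know that $f_n$ converges pointwise to some function $f$. Let $\varepsilon > 0$ be arbitrary, and suppose the condition above holds. Then,

$$|f_n(x) - f_m(x)| < \varepsilon$$

for $n, m > N$ and all $x \in E$. Taking $m \to \infty$, one has $|f_n(x) - f(x)| \le \varepsilon$ for $n > N$ and all $x \in E$, which is the required condition for uniform convergence.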

Proposition 20.2. Suppose $f_n$ converges pointwise to $f$ on $E$. Then $f_n$ converges uniformly on $E$ if and only if

$$\lim_{n \to \infty}\sup_{x \in E} |f_n(x) - f(x)| = 0.$$

Proof. This is a rewording of the definition.
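The sup criterion can be explored numerically by sampling. The sketch below is an illustration only: a finite grid gives a lower bound for the true supremum, not the supremum itself, and the helper name `sup_distance`, the grid on $[0,1)$, and the two test sequences ($x^n$ and $\sin(nx)/n$, both with pointwise limit $0$ on the grid) are assumptions.

```python
import math

def sup_distance(f_n, f, grid):
    """Largest sampled value of |f_n - f|; a lower bound for the true supremum."""
    return max(abs(f_n(x) - f(x)) for x in grid)

grid = [i / 10000 for i in range(10000)]   # sample points in [0, 1)
zero = lambda x: 0.0

for n in (1, 10, 50):
    power = sup_distance(lambda x: x ** n, zero, grid)
    sine = sup_distance(lambda x: math.sin(n * x) / n, zero, grid)
    print(n, power, sine)
```

The sampled sup for $x^n$ stays close to $1$ (no uniform convergence on $[0,1)$), while the one for $\sin(nx)/n$ is at most $1/n$.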

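Proposition 20.3 (M-test). Suppose $f_n$ is a sequence of functions on $E$ and suppose that we have bounds (for $x \in E$)

$$|f_n(x)| \le M_n$$

for all $n$. If $\sum M_n$ converges, then $\sum f_n$ converges uniformly.

Proof. If $\sum M_n$ converges, then for any $\varepsilon > 0$ there is $N$ such that if $m \ge n > N$ then

$$\sum_{i=n}^{m} M_i < \varepsilon$$

Then, for all $x \in E$, one has

$$\left|\sum_{i=n}^{m} f_i(x)\right| \le \sum_{i=n}^{m} M_i < \varepsilon$$

so the partial sums satisfy the Cauchy criterion for uniform convergence.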

20.2 Uniform convergence and continuity

Proposition 20.4. Let $X$ be a metric space, and let $E \subset X$. Let $f_n, f : E \to \mathbb{R}$, and $f_n \to f$ uniformly. Let $p$ be a limit point of $E$, and suppose that

$$\lim_{x \to p} f_n(x) = A_n$$

Then $(A_n)$ converges and

$$\lim_{x \to p}\lim_{n \to \infty} f_n(x) = \lim_{n \to \infty}\lim_{x \to p} f_n(x)$$

i.e.

$$\lim_{x \to p} f(x) = \lim_{n \to \infty} A_n$$

Proof. Fix $\varepsilon > 0$. By uniform convergence and the Cauchy criterion, there is $N$ such that for $n, m > N$ one has

$$|f_n(x) - f_m(x)| \le \varepsilon$$

for all $x \in E$. Taking $x \to p$ one has

$$|A_n - A_m| \le \varepsilon$$

and so the sequence $A_n$ is Cauchy and converges. Let its limit be $A$.

Now, we want to show that

$$\lim_{x \to p} f(x) = A$$

Note that

$$|f(x) - A| \le |f(x) - f_n(x)| + |f_n(x) - A_n| + |A_n - A|$$

We can make $|f(x) - f_n(x)| < \varepsilon$ for all $x \in E$ by uniform convergence, taking $n$ large. We can also make $|A_n - A| < \varepsilon$ by convergence, again for $n$ large. Finally, fixing such an $n$, we can make $|f_n(x) - A_n| < \varepsilon$ for $x$ in a neighborhood of $p$. Together, one has that

$$|f(x) - A| \le 3\varepsilon$$

for $x$ in a neighborhood of $p$ (note $n$ plays no role now), proving the result.

Corollary 20.5. Let $f_n \to f$ uniformly, with each $f_n$ continuous. Then $f$ is continuous.

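Proposition 20.6. Let $E = [a,b]$. Let $\alpha$ be monotonically increasing. Suppose $f_n : E \to \mathbb{R}$ are integrable and $f_n \to f$ uniformly. Then $f$ is integrable and

$$\int_a^b f\,d\alpha = \lim_{n \to \infty}\int_a^b f_n\,d\alpha$$

Proof. Let

$$\varepsilon_n = \sup_{x \in E} |f_n(x) - f(x)|$$

By uniform convergence $\varepsilon_n \to 0$. Further one has that

$$f_n - \varepsilon_n \le f \le f_n + \varepsilon_n$$

and taking lower and upper integrals

$$\int_a^b (f_n - \varepsilon_n)\,d\alpha \le L(f,\alpha) \le U(f,\alpha) \le \int_a^b (f_n + \varepsilon_n)\,d\alpha$$

and thus

$$U(f,\alpha) - L(f,\alpha) \le 2\int_a^b \varepsilon_n\,d\alpha = 2(\alpha(b) - \alpha(a))\,\varepsilon_n$$

Taking the limit $n \to \infty$ one has that $f$ is integrable. Further we have that

$$\left|\int_a^b f\,d\alpha - \int_a^b f_n\,d\alpha\right| \le \varepsilon_n(\alpha(b) - \alpha(a))$$

which tends to $0$.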

Corollary 20.7 (Integration of series). Let $f_n$ be integrable and let the series $f(x) = \sum f_n(x)$ converge uniformly. Then, $\int_a^b f\,d\alpha = \sum \int_a^b f_n\,d\alpha$.

21 Lecture 21

21.1 Uniform convergence and differentiation

Proposition 21.1. Let $E = [a,b]$ and let $f_n$ be differentiable on $E$. Suppose $f_n(x_0)$ converges for some point $x_0 \in E$. If $f_n'$ converges uniformly on $E$, then $f_n$ converges uniformly on $E$ to a function $f$, and

$$f'(x) = \lim_{n \to \infty} f_n'(x)$$

Proof. First, we will show that $f_n$ converges uniformly on $E$. Fix $\varepsilon > 0$ and choose $N$ such that $m, n > N$ implies (by convergence at $x_0$)

$$|f_n(x_0) - f_m(x_0)| < \frac{\varepsilon}{2}$$

and also (by uniform convergence of the derivatives)

$$|f_n'(x) - f_m'(x)| < \frac{\varepsilon}{2(b-a)}$$

for all $x \in E$. Now, apply the mean value theorem to $f_n - f_m$ at any two points $x, y$:

$$|f_n(x) - f_m(x) - f_n(y) + f_m(y)| \le \frac{|x - y|\,\varepsilon}{2(b-a)} \le \frac{\varepsilon}{2}$$

Then take $y = x_0$:

$$|f_n(x) - f_m(x)| \le |f_n(x) - f_m(x) - f_n(x_0) + f_m(x_0)| + |f_n(x_0) - f_m(x_0)| \le \varepsilon$$

and so $f_n$ converges uniformly.

Let $f$ be its limit, and take $p \in E$. Define, for $x \ne p$,

$$\varphi_n(x) = \frac{f_n(x) - f_n(p)}{x - p}$$

$$\varphi(x) = \frac{f(x) - f(p)}{x - p}$$

By definition of derivative, $\lim_{x \to p}\varphi_n(x) = f_n'(p)$. Using the previous mean value inequality with $y = p$ and dividing by $|x - p|$, one has that

$$|\varphi_n(x) - \varphi_m(x)| \le \frac{\varepsilon}{2(b-a)}$$

for large $n, m$, so that $\varphi_n$ converges uniformly for $x \ne p$, and since $f_n \to f$ it must converge pointwise (and thus uniformly) to $\varphi$.

Thus, by interchanging limits (Proposition 20.4), one has that

$$\lim_{x \to p}\varphi(x) = \lim_{n \to \infty} f_n'(p)$$

and the result follows.

21.2 An everywhere continuous but nowhere differentiable function

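Example 21.2. Define $\varphi(x) = |x|$ for $x \in [-1,1]$, and define it for the rest of $\mathbb{R}$ periodically, i.e. so that $\varphi(x+2) = \varphi(x)$. Define

$$f(x) = \sum_{n=0}^{\infty}\left(\frac{3}{4}\right)^n \varphi(4^n x)$$

This might help in thinking about this function: each term $(\frac{3}{4})^n\varphi(4^n x)$ "squeezes" the function $\varphi$ horizontally by a factor of $\frac{1}{4^n}$ and also squeezes it vertically by a factor of $(\frac{3}{4})^n$. One can draw $\varphi$ as a spiky function, and in each summand in $f(x)$, the spikes (points of non-differentiability) get shorter and also closer together. They have to get shorter in order for the series to converge, and they have to get closer together in order for the derivative to fail to exist everywhere.

More formally, since $0 \le \varphi \le 1$, the M-test with $M_n = (\frac{3}{4})^n$ shows that this series converges uniformly, and so $f$ is continuous. To see that it is not differentiable anywhere, take $x \in \mathbb{R}$. Notice that the non-differentiable points of $\varphi$ occur at the integers, so for any $m$ we want to choose a $\delta_m$ such that no integer lies strictly between $4^m x$ and $4^m(x + \delta_m)$. Explicitly, we choose

$$\delta_m = \pm\frac{1}{2}\,4^{-m}$$

with the sign chosen accordingly; this is possible since $4^m|\delta_m| = \frac{1}{2}$. Define

$$\gamma_n = \frac{\varphi(4^n(x + \delta_m)) - \varphi(4^n x)}{\delta_m}$$

When $n > m$, $4^n\delta_m$ is an even integer, so by periodicity, $\gamma_n = 0$. Further, one has that for any $s, t$:

$$|\varphi(s) - \varphi(t)| \le |s - t|$$

so for $n < m$ one has $|\gamma_n| \le 4^n$, and for $n = m$ one has $|\gamma_m| = 4^m$, since $\varphi$ is linear with slope $\pm 1$ between $4^m x$ and $4^m(x + \delta_m)$.

Then, take

$$\left|\frac{f(x + \delta_m) - f(x)}{\delta_m}\right| = \left|\sum_{n=0}^{m}\left(\frac{3}{4}\right)^n\gamma_n\right| \ge 3^m - \sum_{n=0}^{m-1} 3^n = \frac{1}{2}(3^m + 1)$$

As $m \to \infty$ one has $\delta_m \to 0$, while the difference quotients satisfy

$$\left|\frac{f(x + \delta_m) - f(x)}{\delta_m}\right| \ge \frac{1}{2}(3^m + 1)$$

for every $m$ and are therefore unbounded, so $f$ is not differentiable at $x$.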
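The lower bound on the difference quotients can be verified with exact rational arithmetic. This is a sketch under stated assumptions: the helper names `phi`, `delta`, and `difference_quotient` are hypothetical, the point $x = 1/3$ is an arbitrary choice, and only the terms $n \le m$ are summed because the terms with $n > m$ cancel by periodicity.

```python
import math
from fractions import Fraction

def phi(t):
    """Period-2 extension of |t| on [-1, 1]."""
    return abs((t + 1) % 2 - 1)

def delta(x, m):
    """Return +-(1/2) 4^(-m), signed so that no integer lies strictly
    between 4^m x and 4^m (x + delta)."""
    a = 4 ** m * x
    next_int = math.floor(a) + 1            # smallest integer strictly above a
    sign = -1 if next_int < a + Fraction(1, 2) else 1
    return sign * Fraction(1, 2 * 4 ** m)

def difference_quotient(x, m):
    d = delta(x, m)
    # terms with n > m vanish: 4^n d is an even integer and phi has period 2
    return sum(Fraction(3, 4) ** n * (phi(4 ** n * (x + d)) - phi(4 ** n * x)) / d
               for n in range(m + 1))

x = Fraction(1, 3)
for m in range(6):
    print(m, float(abs(difference_quotient(x, m))), (3 ** m + 1) / 2)
```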

21.3 Differentiation and integration of power series

Proposition 21.3. Suppose the series

$$f(x) = \sum_{n=0}^{\infty} a_n x^n$$

has radius of convergence $R$. Then it converges uniformly on $[-R + \varepsilon, R - \varepsilon]$ for any $\varepsilon > 0$. Further, $f$ is continuous and differentiable in $(-R, R)$, and one can differentiate term-by-term:

$$f'(x) = \sum_{n=1}^{\infty} n a_n x^{n-1}$$

and this series has radius of convergence $R$. Further, one can integrate term-by-term:

$$\int_0^x f(t)\,dt = \sum_{n=0}^{\infty}\frac{a_n}{n+1}\,x^{n+1}$$

and this series has radius of convergence $R$.

Proof. For $|x| \le R - \varepsilon$ one has that

$$|a_n x^n| \le M_n := |a_n|(R - \varepsilon)^n$$

and since $\sum M_n$ converges by the root test, the M-test tells us that the series converges uniformly on $[-R + \varepsilon, R - \varepsilon]$. The rest of the facts follow from previous propositions.

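Proposition 21.4 (Abel's theorem). Take

$$f(x) = \sum_{n=0}^{\infty} a_n x^n$$

Let $R$ be the radius of convergence, and suppose the series converges at $R$. Then, $f$ is continuous at $R$, i.e.

$$\lim_{x \to R} f(x) = \sum_{n=0}^{\infty} a_n R^n.$$

The same is true for $-R$.

Proof. Without loss of generality assume $R = 1$ (otherwise, we can replace $x$ with $x/R$). Define

$$S_n = a_0 + \dots + a_n$$

We can rewrite, for $|x| < 1$,

$$f(x) = (1 - x)\sum_{n=0}^{\infty} S_n x^n$$

Let $S = \lim S_n$. For $\varepsilon > 0$, there is $N$ such that for $n > N$ one has $|S - S_n| < \frac{\varepsilon}{2}$. Then, using $(1 - x)\sum_n x^n = 1$, for $0 < x < 1$,

$$|f(x) - S| = \Big|(1 - x)\sum_{n=0}^{\infty}(S_n - S)x^n\Big| \le (1 - x)\sum_{n=0}^{N}|S_n - S|\,|x|^n + \frac{\varepsilon}{2}$$

As $x \to 1$ from the left, one has that $(1 - x) \to 0$, so the right-hand side is at most $\varepsilon$ for $x$ near $1$, and since $\varepsilon$ was arbitrary the result follows.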

22 Lecture 22

22.1 The Stone-Weierstrass theorem

Definition 22.1. A family of real-valued functions on a set $E$ is a subset of the set of all functions from $E$ to $\mathbb{R}$. A family $A$ is said to be an algebra if it is closed under addition, multiplication, and scalar multiplication in $\mathbb{R}$. A family $A$ is said to be uniformly closed if for any sequence $f_n$ with $f_n \in A$, such that $f_n \to f$ uniformly, one has that $f \in A$. Further, given any family $A$ one can take its uniform closure, i.e. the smallest family containing $A$ that is uniformly closed.

Example 22.2. For example, the family of polynomial functions on E “ R is an algebra, since the addition and

multiplication of polynomials are also polynomials. Also, the family of functions of the form P pxqex form an algebra,

where P is any polynomial. Neither of these are uniformly closed, as we will see later.

Proposition 22.3. Let B be the uniform closure of A. Then B consists of all functions which arise as a uniformly

convergent limit of functions in A.

Definition 22.4. Let $A$ be a family of functions on a set $E$. Then $A$ separates points if for any distinct $x_1, x_2 \in E$, there is a function $f \in A$ such that $f(x_1) \ne f(x_2)$; and it vanishes at no point if for every $x \in E$ there is some $f \in A$ such that $f(x) \ne 0$.

Example 22.5. The family of polynomials on $\mathbb{R}$ separates points: the function $f(x) = x$ separates all points. Further, it vanishes at no point, since the function $f(x) = 1$ is nowhere zero.

Example 22.6. The family of even polynomials on $\mathbb{R}$ does not separate points, since $f(x) = f(-x)$ for every such $f$.

Theorem 22.7. Let A be an algebra of real continuous functions on a compact set K. If A separates points and

vanishes at no point, then the uniform closure B consists of all real continuous functions on K.

Proof. This is a long proof. We will divide it into five lemmas.

Lemma 22.8 (Weierstrass approximation theorem). Let $A$ be an algebra of continuous functions $[a,b] \to \mathbb{R}$ which contains $f(x) = x$. Then its uniform closure $B$ contains every continuous function on $[a,b]$.

Proof. First, note that we may as well assume that $[a,b] = [0,1]$, since one can take $\tilde{f}(x) = f((b-a)x + a)$ which has domain $[0,1]$, find a sequence of polynomials $\tilde{P}_n$ converging uniformly to it, and take $P_n(x) = \tilde{P}_n\big(\frac{x-a}{b-a}\big)$.

The idea is as follows: the Dirac delta "function" is a function $\delta_a(x)$ which is "defined" by:

$$\delta_a(x) = \begin{cases} \infty & x = a \\ 0 & x \ne a \end{cases}$$

It has the "property" that:

$$\int_{-\infty}^{\infty}\delta_a(x)f(x)\,dx = f(a)$$

I will digress for a bit to justify this property. Since $\delta_a$ is not really a function, we want to express it as a limit (which does not actually converge) of functions of the form:

$$Q_n(x) = \begin{cases} \frac{n}{2} & x \in [a - \frac{1}{n}, a + \frac{1}{n}] \\ 0 & \text{else} \end{cases}$$

The constant is chosen so that

$$\int_{-\infty}^{\infty} Q_n\,dx = 1$$

for all $n$. Then, one can verify that, if $F$ is an antiderivative of $f$, then

$$\int_{-\infty}^{\infty} Q_n(x)f(x)\,dx = \frac{n}{2}\int_{a - \frac{1}{n}}^{a + \frac{1}{n}} f\,dx = \frac{F(a + \frac{1}{n}) - F(a - \frac{1}{n})}{\frac{1}{n} - (-\frac{1}{n})}$$

Taking the limit as $n \to \infty$, one obtains $F'(a) = f(a)$ as desired.

Further, convolution gives:

$$(f * \delta_a)(x) = \int_{-\infty}^{\infty} f(t)\,\delta_a(x - t)\,dt = f(x - a)$$

In particular, if $a = 0$:

$$(f * \delta_0)(x) = \int_{-\infty}^{\infty} f(t)\,\delta_0(x - t)\,dt = f(x)$$

Our strategy is as follows: the Dirac delta function is not really a function, so we will take a sequence of polynomial functions that "limit" to it. That sequence of functions is:

$$Q_n(x) = \begin{cases} c_n(1 - x^2)^n & x \in [-1,1] \\ 0 & x \notin [-1,1] \end{cases}$$

where $c_n$ is chosen such that

$$\int_{-\infty}^{\infty} Q_n(x)\,dx = 1$$

And we will claim that

$$P_n(x) := \int_{-\infty}^{\infty} Q_n(x - t)f(t)\,dt$$

converge uniformly to $f$ (where we extend $f$ to $\mathbb{R}$ by zero).

First we need to show that $P_n(x)$ is actually a polynomial. To this end, notice that (since $f$ is defined in $[0,1]$)

$$P_n(x) = \int_0^1 Q_n(x - t)f(t)\,dt$$

Using integration by parts, and letting $F$ be an antiderivative of $f$, one has that

$$P_n(x) = Q_n(x - t)F(t)\Big|_{t=0}^{t=1} - \int_0^1 \frac{d}{dt}Q_n(x - t)\,F(t)\,dt$$

The term $Q_n(x - t)F(t)\big|_0^1$ is a polynomial in $x$, and in the integral term the polynomial $\frac{d}{dt}Q_n(x - t)$ is of smaller degree than $Q_n(x - t)$, so by induction one has that $P_n(x)$ is a polynomial. Note that this relies on the fact that the truncated polynomial $Q_n(x - t)$ is a polynomial for $t \in [0,1]$ and $x \in [0,1]$, because then $x - t \in [-1,1]$. Equivalently, one can expand $Q_n(x - t) = \sum_k q_k(t)x^k$ in powers of $x$ and integrate each coefficient against $f$.

Next, we want to show that $P_n$ converges uniformly to $f$. Let us try to work out the expression first. All integrals below are over all of $\mathbb{R}$:

$$|P_n(x) - f(x)| = \left|\int \big(Q_n(x - t)f(t) - Q_n(x - t)f(x)\big)\,dt\right|$$

since $\int Q_n(t)\,dt = 1$. Let us do a change of variables $s = x - t$:

$$\left|\int Q_n(s)\big(f(x - s) - f(x)\big)\,ds\right|$$

In this expression we are mostly worried about the integral near $0$, where $Q_n$ becomes very large (recall that the Dirac delta function "takes on" the value $\infty$ at $0$), and it won't be enough to just replace the expression $f(x - s) - f(x)$ with a constant upper bound.

Our first step is to estimate $c_n$. We compute

$$\int_{-1}^{1}(1 - x^2)^n\,dx = 2\int_0^1 (1 - x^2)^n\,dx$$

At this point one wants to replace $(1 - x^2)^n$ with $1 - nx^2$. One could try to work out the binomial theorem explicitly, but a short cut is to take the function

$$(1 - x^2)^n - (1 - nx^2)$$

whose derivative is

$$2nx\big(1 - (1 - x^2)^{n-1}\big)$$

which is nonnegative on $(0,1)$, and since the function vanishes at $0$, one has

$$(1 - x^2)^n \ge 1 - nx^2$$

on $[0,1]$. Thus,

$$\int_{-1}^{1}(1 - x^2)^n\,dx \ge 2\int_0^{1/\sqrt{n}}(1 - x^2)^n\,dx \ge 2\int_0^{1/\sqrt{n}}(1 - nx^2)\,dx = \frac{4}{3\sqrt{n}} > \frac{1}{\sqrt{n}}$$

so, $c_n < \sqrt{n}$. The choice of taking the upper limit of $\frac{1}{\sqrt{n}}$ is a bit arbitrary, but it makes the expression work out nicely.
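The bound $c_n < \sqrt{n}$ can be confirmed for small $n$ by computing the normalising integral exactly from the binomial expansion. The helper name `kernel_mass` and the sampled values of $n$ are assumptions for this sketch.

```python
from fractions import Fraction
from math import comb, sqrt

def kernel_mass(n):
    """Exact integral of (1 - x^2)^n over [-1, 1], term by term from the binomial theorem."""
    # (1 - x^2)^n = sum_k C(n, k) (-1)^k x^(2k), and x^(2k) integrates to 2/(2k+1) on [-1, 1]
    return 2 * sum(Fraction((-1) ** k * comb(n, k), 2 * k + 1) for k in range(n + 1))

for n in (1, 4, 16, 64):
    c_n = 1 / kernel_mass(n)
    print(n, float(c_n), sqrt(n))
```

In these samples $c_n$ stays well below $\sqrt{n}$, so the estimate in the text is not sharp, but it is all the proof needs.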

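Next, we will analyze the behavior of $Q_n$ away from $[-\delta, \delta]$. Since the function $Q_n(x)$ is decreasing in $[0,1]$ and $c_n < \sqrt{n}$, one has that for any $\delta > 0$ and any $x \in [\delta, 1]$:

$$Q_n(x) \le \sqrt{n}\,(1 - \delta^2)^n$$

and thus $Q_n$ converges to zero uniformly in $[\delta, 1]$ (and, by symmetry, in $[-1, -\delta]$). Further, let $M$ be a bound on $|f|$.

Now, by uniform continuity of $f$ ($[a,b]$ is compact), for every $\varepsilon > 0$ there is a $\delta > 0$ such that $|x - y| < 2\delta$ implies that $|f(x) - f(y)| < \varepsilon$.

Given these, we can estimate:

$$\left|\int Q_n(s)\big(f(x - s) - f(x)\big)\,ds\right| \le 2M\left(\int_{-1}^{-\delta} Q_n(t)\,dt + \int_{\delta}^{1} Q_n(t)\,dt\right) + \varepsilon\int_{-\delta}^{\delta} Q_n(t)\,dt$$

$$\le 4M\sqrt{n}\,(1 - \delta^2)^n + \varepsilon < 2\varepsilon$$

for large $n$, since $\int_{-\delta}^{\delta} Q_n(t)\,dt \le 1$ and $\sqrt{n}\,(1 - \delta^2)^n \to 0$, and we are done.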
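To see the approximation at work, the convolution $P_n(x) = \int_0^1 Q_n(x - t)f(t)\,dt$ can be evaluated numerically. Everything specific in this sketch is an assumption made for illustration: the midpoint rule with 2000 nodes, the helper names `landau_approx` and `sup_error`, the test function $f(t) = t(1-t)$ (which vanishes at $0$ and $1$, so extending it by zero keeps it continuous), and the 21 sample points.

```python
def landau_approx(f, n, xs, steps=2000):
    """Midpoint-rule values of P_n(x) = integral over [0, 1] of Q_n(x - t) f(t) dt."""
    h = 1.0 / steps
    ts = [(j + 0.5) * h for j in range(steps)]
    # c_n = 1 / (integral of (1 - s^2)^n over [-1, 1]), with nodes s = 2t - 1
    mass = sum((1 - (2 * t - 1) ** 2) ** n for t in ts) * 2 * h
    c_n = 1.0 / mass
    f_values = [f(t) for t in ts]
    return [c_n * h * sum((1 - (x - t) ** 2) ** n * ft for t, ft in zip(ts, f_values))
            for x in xs]

f = lambda t: t * (1 - t)
xs = [i / 20 for i in range(21)]

def sup_error(n):
    """Largest sampled value of |P_n - f| on the points xs."""
    return max(abs(p - f(x)) for p, x in zip(landau_approx(f, n, xs), xs))

for n in (5, 50, 200):
    print(n, sup_error(n))
```

The sampled error decreases slowly with $n$, which is consistent with the proof: it only shows that the error is eventually below $2\varepsilon$, without a rate.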

Lemma 22.9. If $f \in B$, then $|f| \in B$ (where $B$ is any uniformly closed algebra).

Proof. Since $f$ is bounded, $f(K) \subset [a,b] \subset \mathbb{R}$ for some interval $[a,b]$. For any $\varepsilon > 0$, by Lemma 22.8 there is a polynomial

$$P(x) = a_0 + a_1 x + \dots + a_n x^n$$

such that:

$$\big|P(x) - |x|\big| < \varepsilon$$

for any $x \in [a,b]$. Then, the function

$$P(f) = a_0 + a_1 f + \dots + a_n f^n$$

is in $B$ and further one has that

$$\big|P(f(x)) - |f(x)|\big| < \varepsilon$$

so that $|f| \in B$ by closedness.

Lemma 22.10. If $f \in B$ and $g \in B$, then $\max(f,g) \in B$ and $\min(f,g) \in B$ (where $B$ is any uniformly closed algebra).

Proof. This follows from the previous lemma and the fact that:

$$\max(f,g) = \frac{f + g}{2} + \frac{|f - g|}{2}$$

$$\min(f,g) = \frac{f + g}{2} - \frac{|f - g|}{2}$$
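The two identities can be checked pointwise with exact arithmetic. The helper names `pointwise_max` and `pointwise_min` and the sample functions $f(x) = x$, $g(x) = 1 - x$ are assumptions used only for this check.

```python
from fractions import Fraction

def pointwise_max(f, g):
    """max(f, g) written using only sums, scalar multiples, and absolute values."""
    return lambda x: (f(x) + g(x)) / 2 + abs(f(x) - g(x)) / 2

def pointwise_min(f, g):
    """min(f, g) written using only sums, scalar multiples, and absolute values."""
    return lambda x: (f(x) + g(x)) / 2 - abs(f(x) - g(x)) / 2

f = lambda x: x
g = lambda x: 1 - x
points = [Fraction(i, 10) for i in range(11)]
agrees = all(pointwise_max(f, g)(x) == max(f(x), g(x)) and
             pointwise_min(f, g)(x) == min(f(x), g(x)) for x in points)
print(agrees)
```

The point of the identities is that they use only operations under which $B$ is closed, together with $|\cdot|$ from the previous lemma.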

Lemma 22.11. Suppose that $A$ is an algebra of functions on any set $E$ which separates points and vanishes at no point. Let $x_1 \ne x_2 \in E$ and $c_1, c_2 \in \mathbb{R}$. Then $A$ has a function $f$ such that $f(x_1) = c_1$ and $f(x_2) = c_2$.

Proof. This is best done as an exercise, but choose functions $g_i$ which do not vanish at $x_i$ and let $h$ be a function such that $h(x_1) \ne h(x_2)$. Then take

$$f(x) = \frac{c_1\,g_1(x)\,(h(x) - h(x_2))}{g_1(x_1)\,(h(x_1) - h(x_2))} + \frac{c_2\,g_2(x)\,(h(x) - h(x_1))}{g_2(x_2)\,(h(x_2) - h(x_1))}$$

the key observation being that $g_1(x)(h(x) - h(x_2))$ is nonzero at $x_1$ and zero at $x_2$ (we need $h$ to separate $x_1$ and $x_2$). Also, notice that constant functions are not automatically in $A$, so we need to introduce the functions $g_i$ to multiply the constants by, i.e. we need $g_i$ to not vanish at $x_i$.
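The interpolation formula can be tried on a concrete case. The sketch below assumes, purely for illustration, the points $x_1 = 1$, $x_2 = 2$, the functions $g_1 = g_2 = x$ (nonzero at both points) and $h = x^2$ (which separates them), and target values $5$ and $-3$; the helper name `two_point_interpolant` is hypothetical.

```python
from fractions import Fraction

def two_point_interpolant(g1, g2, h, x1, x2, c1, c2):
    """Build f with f(x1) = c1 and f(x2) = c2 from g1, g2 (nonvanishing at x1, x2)
    and h (separating x1 and x2), following the formula of the lemma."""
    def f(x):
        first = c1 * g1(x) * (h(x) - h(x2)) / (g1(x1) * (h(x1) - h(x2)))
        second = c2 * g2(x) * (h(x) - h(x1)) / (g2(x2) * (h(x2) - h(x1)))
        return first + second
    return f

g = lambda x: x
h = lambda x: x * x
x1, x2 = Fraction(1), Fraction(2)
f = two_point_interpolant(g, g, h, x1, x2, Fraction(5), Fraction(-3))
print(f(x1), f(x2))
```

Each summand is a scalar multiple of $g_i h - h(x_j)\,g_i$, so it stays inside the algebra even when constants are not in $A$.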

The strategy is as follows. Let f be the continuous function we want to approximate. We will find for each

point p P K a function that is within ε of f on one side, i.e. no less than f ´ ε. Using compactness we will choose

finitely many of these and take the maximum of the functions. We will do the same again on the other side.

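Lemma 22.12. Let $A$, $B$ be as in the theorem. Let $f$ be continuous on $K$ (where $K$ is compact). Let $p \in K$ and $\varepsilon > 0$. There exists a function $g_p \in B$ such that $g_p(p) = f(p)$ and $g_p(x) > f(x) - \varepsilon$ for all $x \in K$.

Proof. For every $q \in K$, by Lemma 22.11 we can find $h_q \in B$ such that

$$h_q(p) = f(p), \qquad h_q(q) = f(q)$$

Define the open set

$$J_q := (f - h_q)^{-1}\big((-\infty, \varepsilon)\big)$$

i.e. the set of points $x$ where $h_q(x) > f(x) - \varepsilon$. The $J_q$ for $q \in K$ form an open cover of $K$, since $q \in J_q$. Since $K$ is compact there is a finite subcover $J_{q_1}, \dots, J_{q_r}$, and we define the function

$$g_p = \max(h_{q_1}, \dots, h_{q_r})$$

which lies in $B$ by Lemma 22.10 and has the required properties.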

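We will now show that given $f : K \to \mathbb{R}$ continuous, and $\varepsilon > 0$, there is a function $h \in B$ such that for $x \in K$:

$$|h(x) - f(x)| < \varepsilon$$

From the previous lemma, for any $p \in K$, we have $g_p \in B$ such that $g_p(p) = f(p)$ and

$$g_p(x) > f(x) - \varepsilon$$

Define

$$V_p := (g_p - f)^{-1}\big((-\infty, \varepsilon)\big)$$

i.e. the points $x$ such that

$$g_p(x) < f(x) + \varepsilon$$

Each $V_p$ is open and contains $p$, so the $V_p$ cover $K$. By compactness, this open cover has a finite subcover $V_{p_1}, \dots, V_{p_r}$, and take

$$h = \min(g_{p_1}, \dots, g_{p_r})$$

This function $h$ lies in $B$ by Lemma 22.10 and has the property that

$$h(x) > f(x) - \varepsilon$$

$$h(x) < f(x) + \varepsilon$$

for $x \in K$.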
