3 - Construction of R
Where we left off last time
We discussed our construction of the rational numbers
where is the class of ordered pairs under the usual equivalence relation for fractions. Of course, we think of rational numbers as fractions, and fractions are, in fact, ordered pairs of integers. You have a number on top and a number on the bottom. And the way we thought about this was these ordered pairs satisfied some notion of equivalence, and it's just the usual notion of equivalence that you are used to, that you were taught in grade school for how to tell whether or not two fractions actually represented the same thing.
What we've seen is, in fact, this is an equivalence relation. We've seen that under this definition we can actually construct an addition and a multiplication that's well-defined. So it gives this collection of ordered pairs a structure, an arithmetic structure. (That is, has well-defined operations , , an order, and "contains" .) And it has an order, which, if you recall, an order relation satisfies two properties:
-
Trichotomy: Whatever the relation is, say, "less than" or "", then if you take two elements, say and , then you have exactly one of the following things:
- or
- or
-
Transitivity: If , and , then .
If trichotomy and transitivity hold for a set , then we say has an order relation. From last time, we know that has a nice order relation that we defined. Despite the sophistication, the essential message is that it is all what you learned in grade school. From now on out, since we've carefully constructed the rationals, we can think of rationals as fractions. And contains in a very natural way.
Recall: why is Q nice?
-
We also saw that , in fact, is nice, because not only does it have these arithmetic operations, but it has the ability to solve certain equations that we couldn't solve in the integers before. For instance, can't be solved in for . But it can be solved in because is a field and it satisfies certain addition properties A1--A5, multiplication properties M1--M5, and distribution (which basically says addition and multiplication have to play nicely with each other) of multiplication over addition.
-
We saw that not only do the rationals have an order and are a field but that they are an ordered field. This means that , in addition to being ordered and being a field, that the order and the field properties play nice with each other. In particular, if I look at two rational numbers and I add the same thing to them, then their order relation should stay the same. Or if I multiply them by a positive rational number, then they should still preserve the same order.
-
We finished last time by noting that is not large enough to solve all equations. In particular, is not large enough to contain solutions to the equation , an equation whose solution has a very natural geometric interpretation:
As we can see from the above, by the Pythagorean Theorem, the hypotenuse has a length that solves the equation , but this solution is not contained in (last time we showed that does not have a solution in ).
-
Our plan is to rectify that problem above by looking for a construction that would allow us to solve equations like . We will do so by "Dedekind cuts" which were introduced by Richard Dedekind in 1872. What do we mean by "construct" the real numbers? The situation will be very much like what we did for the rationals. The idea you should have in your head is that if you have a number line, because we have an order, we can place the rationals along such a number line. We can mark off the integers and rationals. The idea is that there are some holes, in some sense, on the number line. If the Pythagorean length (i.e., what we are calling the square root of 2 even though we haven't actually defined what we mean by square root), if we look at the hypotenuse of a right triangle with sides 1 and 1, as above, then the length of that hypotenuse is somewhere on the number line, and yet there is not a rational that spans that length. In this sense, there are some holes on the real line.
-
We want a construction of the reals, but what do we mean, again, by "construct" the reals? What we mean is we want a construction that only depends on things we've already defined. Just like the rationals, we defined those in terms of integers, as ordered pairs of integers. But we did this in a fashion that only depended on things we did before.
-
If I want to define the real numbers, I can't say "let's fill this in" because that would not have any meaning. What does it mean to fill in the number line? I don't even know where the holes are. So how will I get at the fact that I have some gaps here without talking about the gaps?
-
Since does not have a solution in , it might be worthwhile, then, to look at the other possibilities, namely that or (by trichotomy, these are the only other possibilities). Consider the set , which is the set of all rationals whose square is less than 2. What does that set look like on the number line? It's a set consisting of all the rationals roughly between and . If we look at the set in more detail, then it has an interesting problem. What's the problem? The problem is there is nothing at the very edge. We wish that there was, but there isn't. This points to a property that we want to get at. We'll give it a name, and we'll explore what it means for our construction. It points to the idea of what is called a least upper bound. What do we mean by that? If you take a look at the number line, the fact that there is not a number right at the very end is related to the idea that there are a bunch of points to the right that are greater than any of the numbers in the set . We would call, for instance, 2.2 an upper bound of . So 3.1 is also an upper bound of . We know 1.5 is also an upper bound even though we will concretely show it in a moment. The idea we are trying to get at is whether or not there is a least upper bound, and the problem in is that there may not be a least upper bound.
What is the deal with least upper bounds?
-
Upper bound (definition): Suppose is an ordered set, and . If there exists a such that for every , we say that is bounded above, and call an upper bound of . (Lower bounds are defined in the same way with in place of .)
-
Examples: It may help to look at a few examples where upper bounds are involved.
- We mentioned before that 2 was an upper bound for . This seemed pretty obvious. Less obvious, perhaps, was whether or not was an upper bound for . So let's look at that. If is not an upper bound, then there exists where and . But this implies , a contradiction.
-
Least upper bound (definition): Suppose is an ordered set, , and is bounded above. Suppose there exists an with the following properties:
-
is an upper bound of
-
If then is not an upper bound of .
Then is called the least upper bound of (that there is at most one such is clear from (ii)) or the supremum of , and we write
-
-
The word supremum kind of suggests that "to be supreme" means to be bigger than everything, but it's the least biggest thing. It's sort of what might be at the endpoint discussed earlier if there were anything there.
-
Properties of suprema: Which sets have suprema? Here are some examples (throughout the examples, let ).
-
Let . What is the supremum? We see that . To see this more clearly, suppose there were an upper bound less than 2, call it , then would not be bigger than 2. And therefore would not be an upper bound for . This is just one example, but we could throw some more numbers into the mix. If we added the number 1.5 to , then yet again. If we added 15 to , then . We could keep throwing in numbers, but can we generalize a statement about a finite collection of objects? Yes, it's the last number on the number line for the collection. Ultimately, we can say that a finite set has a supremum.
-
Let or the negative rationals. Does this have a supremum? Yes, . How do we know this? How do we know that there isn't some incredibly small number that's just to the left of 0 that's greater than all the negative rationals? Because given any negative rational , we can find a negative rational where . In particular, we could take to illustrate our point (i.e., would be closer to 0 from the left than would be). So would not be an upper bound. We can apply this thought process to any given negative rational , definitely making not an upper bound. This is an example of an infinite set, namely , that does have a supremum.
-
Let . Does exist? Not in the conventional sense at least. The reason does not have a supremum is not that it failed property (ii) above but because it failed property (i). That is, there is not an upper bound at all for . The set is unbounded above. In the extended real number system, we communicate this by writing . This is very different from the situation concerning .
-
Let , where . What is ? Does have a least upper bound in (which is all we have right now)? We see that does not exist, but how do we prove that? It is not because is unbounded, as was the case with above. It does not exist even though is bounded. We noted earlier that is bounded above by , for instance. Thus, this is a more essential kind of failure to not have a supremum than was the case with .
-
To prove that does not exist, see [17]. Nonetheless, let's suppose we have an upper bound for in . One way to show that doesn't have a least upper bound for is to show that, no matter what upper bound you give me, I can always find a smaller one. Let's suppose we are given an upper bound, say . Can we devise a way to come up with a number that is between the rightmost edge of and our choice of ? In the example involving , we considered a negative rational , but we showed could not be an upper bound by considering . There's a temptation in this case involving the set to average with or the square root of 2, but we don't have a square root of 2 (yet). Because we can't do that, we have to be a bit more clever. Instead of associating with each number the number , which would be a cheat since we don't have a notion of just yet, Rudin suggests associating with each number a number slightly less than (which thankfully only involves ), namely,
and it's enough to get you what you want; that is, it produces a that is somewhere between the rightmost edge of and our originally chosen .
-
-
Motivation for : We have this notion now of a least upper bound, the supremum, and whatever we do to construct the reals, the reals (unlike the rationals) will not have bounded sets that do not have suprema. That's the surprising thing. We'll construct only involving rationals in such a way that it'll be obvious that has what is called "the least upper bound property." We will construct and prove the main theorem in Chapter 1 from Rudin.
How do we construct R?
-
Theorem: There exists an ordered field which has the least-upper-bound property. Moreover, contains as a subfield. [This is the core theorem of Chapter 1 in Rudin.]
-
Structure of our proof: There are three things we are going to show.
- Ordered field: The set is an ordered field.
- Least-upper-bound property: The set has the least-upper-bound property.
- Containment: The set contains as a subfield.
-
Least upper bound property (definition): It basically says if the supremum of a set does not exist, then it must be the case that the set is unbounded. It will not be the case that bounded sets lack suprema. The more formal statement of the definition is given below.
Definition. An ordered set is said to have the least-upper-bound property if the following is true: If , is not empty, and is bounded above, then exists in .
Such a set , as mentioned in the definition above, is often said to have the "lub" property or that it satisfies the so-called completeness axiom.
-
The empty set: What happens if ? Is 2 an upper bound for ? Does 2 have the property that for element in the empty set that ? Yes. So everything is an upper bound for the empty set. Why is this true? It's vacuously true! There's nothing in the empty set.
How do we construct R from Q?
-
The task ahead: Remember our task is to construct the real numbers in a way that only involves things we've defined before, namely the rationals. How is it that we are going to get at the idea that there ought to be something right on the far-right edge of , right on the cutting edge. How are we going to get at that idea? The idea of Dedekind was to think about the real numbers as just being some collection of rationals. Just like we thought of the rationals as an ordered pair of integers, we'll think of the real numbers as collections of rational numbers that are specified, in some sense, by a cut. We are going to make a cut at the rightmost edge of and say that everything to the left of this cutting location is a collection of rationals and that this collection is a cut.
-
Cut (definition): A cut is a subset of such that the following conditions hold:
- is not empty, and \hfill[the cut is said to be nontrivial in this case]
Note A cut is nontrivial. That is, a cut is neither empty nor everything — we want to be some thing but not all things.
- If , , and , then
Note A cut is closed downward. A cut will be a collection that is, in some sense, closed downwards. What is meant by that is if you take some cut member, say , and then you have a rational (i.e., ) that is less than (i.e., ), then this rational must also be a member of the same cut as (i.e., ). In this sense, the condition above means a cut is "closed downward" or "closed to the left."
- If , then for some .
Note A cut is unbounded above. In some sense, a cut is like the picture of the set in the sense that it does not have a biggest member. No largest member. In other words, there's something to the right of , namely , that is also a member of the cut .
A set that fulfills criteria (i)-(iii) listed above is called a Dedekind cut.
-
Informal thoughts about cuts: Essentially, a cut should be thought of as the set of all rational numbers to the left of some point in the real line. Since we have not constructed the real line yet, we cannot define a cut in that simple way; we have to make the construction more indirect. Nonetheless, given the properties of a cut above, these properties essentially describe a set that looks like a "rational halfline" (in the sense that our cut represents the half of the rationals to the left of wherever the cut is made). If is a cut and , then any rational is also not in . Also, if and , then it must be the case that . (See [13] for more on this paraphrasing.)
-
Examples of cuts: Let's consider some examples involving the idea of cuts.
-
Is the set , which we defined before as , a cut? No. Which property does it not satisfy? Let's see.
-
Property (i): Yes. The set is clearly nontrivial since and .
-
Property (ii): No. This is where we run into an issue. Choose any rational lower bound for . Then for some and our , we have , but . For example, if , then , but , and hence .
-
Property (iii): Yes. The set has no largest member. Recall this was one of the goals in establishing our choice of from before. In [17], we consider some rational in and associate with it some rational slightly smaller than , and end up showing that is in . In the language of cuts, specifically the statement of condition (iii), we have the following:
In Rudin's exposition, this is .
-
-
Is the set , the set of all negative rationals, a cut? We must check the three given properties.
- Property (i): Yes. The set is clearly nontrivial since and .
- Property (ii): Yes. The set is clearly closed downward because a rational number that is more negative than another negative rational number is still a negative rational number.
- Property (iii): Yes. Given a negative rational number , there certainly exists another negative rational number that is greater than our given rational number. Using our work from before, we can consider . Then for some , namely .
Since satisfies conditions (i)-(iii) for cuts, we know that is, indeed, a cut.
-
Is the set a cut?
- Property (i): Yes. The set is clearly neither nonempty nor equivalent to .
- Property (ii): Yes. Given some rational less than or equal to 2 and another rational less than or equal to 2 and also less than , then it will clearly be the case that is also a rational less than or equal to 2.
- Property (iii): No. For example, if , then , but it is not the case that for some since in this case is an upper bound for .
We are starting to get some idea as to what cuts ought to do just by their properties, but it's still unclear exactly what kinds of things or sets could be cuts. Instead of looking at more particular examples of cuts, we will simply look at the set of all cuts, and we will declare such a collection to be the real numbers.
-
How do we construct R by looking at it as the set of all cuts?
-
Defining in terms of cuts: Let . There, that is our definition of the real numbers. What!? That is a little strange. A little strange. But the point is once we prove properties about real numbers that we are comfortable with, then we'll just think about the reals as we normally do. But for now, what do you think the cut will be that corresponds to the square root of 2 or ? Maybe we start with something a littler easier. What cut would correspond with the number 1? How about cutting the number line at 1 and looking at all of the rationals less than 1? What cut would correspond with the number 3? All the rationals to the left of 3. Good. What about the square root of 2 now? All the rationals that are, in some sense, to the left of the rightmost edge of so it might look like somewhat but it had better include all of the rationals to the left of the rightmost edge of . That's kind of the idea. So let's see that this actually behaves the way we expect it does, and that's maybe the surprise.
-
Overview: The set , defined in terms of Dedekind cuts, will satisfy all of the properties that we expect the real numbers to satisfy. In particular, will be an ordered field with the least-upper-bound property, and it will contain . So the first thing we should maybe do is define what the order is.
-
Defining an order: The set is a structure, but what kind of structure is it? The set , we will show, has structure; in particular, it will have an order, an arithmetic, and the field operations. We will define an order as follows (note that and below are collections of rationals):
Definition. Define " to mean the following: is a proper subset of . (Note that we do not mean if but iff since we are making a definition.) That is, but .
An aside: In terms of how we would make sense of the square root of 2 as is currently defined, the point is that we will not have to refer to the square root of 2 in making our definition (because doing so would be meaningless). To hint at the idea, recall the set that did not have a least upper bound in ; well, in , the set actually will have a least upper bound once we actually discuss how to take supremums involving where we've defined . For right now, you should check that "" is an order. See [17] for such a demonstration. You should fill in this demonstration a bit. But very briefly:
Trichotomy: Is it possible for to be less than and for to be less than ? No, given proper containment. That's clear. And the only way that they are equal is if there is equal containment. Does it satisfy transitivity? Clear. But you should check. It is an order.
-
Operations: We want to know whether or not has operations that we can deal with. This is the amazing thing — we can actually define an operation. How might we define addition for instance? Recall that and are both collections of rational numbers. How might we define , a new collection of rational numbers?
-
Addition (definition): If and , then define to be the set of all sums , where and . That is, . It helps to note here, perhaps, that all of the elements of are sets, namely a bunch (i.e., a collection) of cuts where each cut represents a set of rational numbers. The set is a collection of rational numbers (the elements of and are both rational numbers, and the set of rational numbers is closed with respect to addition), but is the set a cut? We must check that is, in fact, a cut.
-
Property (i): Yes. Is nontrivial? If there is something in and something in , then is there not something in ? Clearly there must be. And if is not everything and is not everything, then is not everything? We'd have to check. If is not everything, then has an upper bound, say . If is not everything, has an upper bound, say . Is there an upper bound for ? Yes, we claim the upper bound is . So does not contain everything. For Rudin's description below, replace with and with to get Su's description.
Rudin: It is clear that is a nonempty subset of . Take , . Then for all choices of , . Thus , whereby we see that . Thus, has property (i).
-
Property (ii): Yes. Is closed downward? Something to check. If , say , where and . We want to know whether or not . Is ? What would it mean to show that is in ? We would need to show that is the sum of two rational numbers, a rational number in and a rational number in . Now, we know is the sum of two rational numbers, namely and , where and . Note that since, by supposition, we have . Since and , it must necessarily be the case that (since is a cut). Then , where and , as desired.
Rudin: Pick . Then , with , . If , then , so , and . Thus, property (ii) holds.
-
Property (iii): Yes. Is there no largest member? There is not---check this. My comment: Recall we most show that
"If , then for some ."
in order for property (iii) to hold. Let and choose such that . (We can make such a choice since is a cut and thus has no largest element.) With , where and , and , we will certainly have or simply , where since and . We may set in what follows to slightly simplify the conclusion.
Recalling that (since , where and ), and (since , where and ), we see that we have shown the following:
If , then for some .
The above is what was required to show that property (iii) holds, and we have shown just that.
Rudin: Choose so that . Then and . Thus, property (iii) holds.
Since satisfies properties (i)--(iii) for Dedekind cuts, we know that is a cut. So we have an order, we have an addition. We would want to show that the axioms for addition A1--A5 all hold.
-
Do the addition axioms A1-A5 hold for cuts?
-
Remark: We won't show all of these — we will just think through why they should be reasonable, and you can check them in your own privacy. Note that we showed A1 holds above by showing that is a cut. We also showed that is a cut, thereby showing , where is defined below, is also a cut by definition. For instance what about commutativity?
-
A2 (Commutativity): If we define , then could we necessarily say that ? If so, why? Addition itself is commutative for rationals. So commutativity for reals follows from commutativity for rationals. This is nice.
Rudin: is the set of all , with , . By the same definition, is the set of all . Since for all , , we have .
-
A3 (Associativity): This is the same idea as that for commutativity.
Rudin: As above, this follows from the associative law in .
-
A4 (Additive identity): What would the additive identity be? Let's give it a name. Let's call it zero but denote it as to remind ourselves that the additive identity in this case is not the rational number 0 but a collection. What should play the role of zero in this case? The set of all negative rationals makes sense (i.e., ) because if we take any rational number in , then adding a negative rational number to it will only make smaller than , thus leaving unchanged in the sense that is a Dedekind cut and we "get all the rational numbers left of a cut location for free" in a sense. This gives us an intuitive understanding, but we should more formally check that adding any negative rational to a given rational in leaves unchanged; that is, we need to check that as cuts; that is and are both sets of rational numbers and to show any two sets are equivalent we show that one contains the other or what is often called mutual subset inclusion. Symbolically, we should show that and . If we can show this, then we will have shown that . To start, we note that if we take anything in and add something negative to it (i.e., something from ), then we will get back something in because is is closed downwards. This shows that . Now let's go the other direction. Why is it that if I pick something in , then it's the sum of something in plus something negative? Because there's no largest member of , let , and then choose some such that and . Since , we have , whereby it becomes clear that is an element of since and . Since we have just shown that implies for arbitrary , it is clear that . We thus conclude that .
Rudin: If and , then , hence . Thus . To obtain the opposite inclusion, pick , and pick , . Then , and . Thus . We conclude that .
-
A5 (Additive inverse): This is a little more complicated. On the surface, it's not quite clear what the inverse of should be. Here is a definition that is rather unmotivated at the moment, but it will make sense upon some reflection. Our set will consist of all rationals such that there exists a number such that . In symbols:
Our job, then, is to show that .
Rudin: Rudin's exposition is rather involved. First he shows that actually is a cut. And then he appeals to the archimedean property of later on. See [5] for an analysis as well as Bergman's errata. This is definitely the most involved proof for A1-A5. Note that Su gives somewhat of a hand-wavy explanation at the end of A5, but it may be worth revisiting once Bergman has been read and fleshing out the illustration in more detail.
Do the multiplication axioms M1-M5 hold for cuts?
-
Comments: We'll just make a few comments since it is rather much so what you would expect. Addition is the sum of two things, one from one cut and one from another cut. We would like to define multiplication as the product of two things, where we multiply one thing from one cut by another thing from another cut. The only problem with that is if we have some negatives, then we run into a few problems, since products of negative rationals are positive.
-
Definition: If (i.e., ), then
We will then let . For negative reals, see Rudin. (Basically, you make them positive, you multiply, then you negate as needed.) The claim is that our definition of multiplication satisfies the axioms of multiplication M1-M5.
Do cuts have suprema?
-
Supremums of cuts: How would you take the supremum of a bunch of cuts? A hint is to look at a collection of a whole bunch of cuts and ask ourselves what the cut is that will be the cut that contains all of the cuts in our collection because it is bigger but it's the smallest cut that contains them all? We essentially want the supremum to be the union of the bunch or collection of cuts — this is a beautiful description of what the supremum ought to be. Given a bunch of cuts, , let be the union of all the where ; that is, given a collection of cuts , let
Our claim is that will be a cut and that . It should be clear that is an upper bound (because it contains everything), but why it is a least upper bound we shall see next time. Once we do that, we'll have very easily shown that has this property that does not have. The set is "complete." It has all its holes filled in.
Thoughts and questions
- Look more into the construction of the real numbers, particularly the construction via Cauchy sequences at some point. It looks like Dedekind's approach is quite popular due to the fact that he only uses tools available to us from an early time period. That is, for example, the Cauchy sequence approach requires that we use (shock) Cauchy sequences, a tool that is not available until later.
- How might one construct the integers? See [13].
- How might one construct the natural numbers? See [12].