Skip to main content

1 - Construction of the rational numbers

Video Lecture

The following questions/notes correspond to the video lecture titled Constructing the rational numbers by Francis Su.

Kronecker and God's Integers

Leopold Kronecker, responsible for the Kronecker delta that bears his name, uttered the following statement in 1886:

God created the integers. All else is the work of man.

Let's explore some of the implications or consequences of this statement.

Kronecker was what one might call a "finite-ist" or a proponent of the point of view that mathematics should only deal with finite objects, finite numbers, or things that can be constructed from the numbers in a finite number of steps; that is, mathematics should only deal with objects obtained from finite numbers in a finite number of operations. (This philosophical view of mathematics is appropriately known as "finitism.") This viewpoint had a few consequences:

  • Irrational numbers: Opposed to their use.
  • Non-constructive existence proofs: Doubted their significance.
  • Transcendental numbers: Doubted their existence; in fact, the quote by Kronecker was a response to Ferdinand von Lindemann's proof of the transcendence of π\pi, which Kronecker said was beautiful but proved nothing, since transcendental numbers didn't exist. (Recall that a transcendental number is a real or complex number that is not algebraic; that is, it is not a root of a nonzero polynomial equation with integer or rational coefficients.)

The reason for highlighting the consequences of Kronecker's viewpoint is to illustrate that there are many things that we take for granted that were not always so obvious. This is especially true in light of how the Greeks thought about mathematics:

  • Constructions of rational lengths: The Greeks understood rational lengths could be constructed with straightedge and compass and ordered on a line. Thus, for instance, if you wanted a line of length 4/54/5 units, then they could show you how to construct such a line given a line of length 1 unit.
  • Constructions of irrational lengths: They also knew that there were other lengths on a line that were not rational but constructible (e.g., 2\sqrt{2}).
  • Impossible construction: They knew about other lengths, such as π\pi, but they couldn't find a construction (e.g., the problem of squaring the circle). The number π\pi cannot be constructed by using straightedge and compass because it is transcendental and constructible numbers are always algebraic and therefore not transcendental.
  • Impossible construction made possible by an infinite process: But π\pi can be constructed through an infinite process such as the sum of an infinite series (e.g., Newton in 1666). But what is an infinite series? So this begins to beg a question. Already here, 200 years before Kronecker, Newton and Leibniz, in developing the calculus, began to encounter the infinite. And they did not have a real rigorous way to deal with the infinite. In fact, if you look at a lot of the history of calculus, it was a toolbox at first. It gave good answers. But there was not a precise notion of what it meant for a series of numbers to converge.

The infinite

Much of calculus was seen as a toolbox in that it often gave nice answers, but it was grossly lacking in precision due to the slippery nature of the infinite. Some hard questions needed to be asked. Let's see what some of these hard questions were.

  • Convergence: What does it mean for a series of numbers to converge?
  • Notion of a limit: What exactly is a limit? Newton and Leibniz both had a vague notion of this. In the early 1800s, Fourier series, which are made up of infinite sums of sines and cosines, made Laplace and Lagrange uneasy. In fact, many people outright rejected Fourier's work because his series resulted in some strange behavior. But no one could actually deny that Fourier's method gave answers! And it seemed to give right answers to boot. Why was that? If you have a series of numbers, an infinite sum, then that is, in some sense, a limit of a bunch of finite sums.
  • Robustness: Are there "enough" numbers to capture all limits?

Adding precision

There was a revolution in the 1800's in order to make many of the ideas in calculus precise. Let's see some of what characterized this revolution.

  • Main actors: Cauchy (1820's), Weierstrass (1850's), and Riemann (1860's).
  • Renewed skepticism: Many of the things that we take for granted that were not so obvious at the time resulted due to a prolonged struggle with the infinite.
  • From the ground up: Constructing the real numbers will be the major goal, but we will start by constructing Q\Q, the rational numbers, to illustrate what is meant by "construct."

Sets: notation and terminology

First we'll establish some common notation and terminology concerning sets:

  • Set: A set is a collection of objects. (As easy as that sounds, the notion of a set is something that mathematicians had to wrestle with very carefully in the 1900s.) We will often write a set as follows (notating it by a letter and notating what's inside by setting off the contents with braces):
S={1,,,{2,}}S=\{1,\bigcirc,\Box,\{2,\bigcirc\}\}

or

S={x:P(x) is true},S=\{x : P(x)\ \text{is true}\},

where P(x)P(x) is a statement about xx.

  • Membership: The notation xSx\in S means "xx is in SS" while x∉Sx\not\in S means "xx is not in SS."
  • Empty set: The set with nothing in it is denoted by \emptyset.
  • Subset: The notation ABA\subset B means "AA is a subset of BB," which means "if xAx\in A, then xBx\in B."
  • Proper subset: If ABA\subset B and B⊄AB\not\subset A, then we call AA a proper subset of BB.
  • Set equivalence: If ABA\subset B and BAB\subset A, then we write A=BA=B. If this is not the case, then we write ABA\neq B.
  • Union: The union of two sets AA and BB is communicated by writing
AB={x:xA or xB}.A\cup B=\{x : x\in A\ \text{or}\ x\in B\}.
  • Intersection: The intersection of two sets AA and BB is communicated by writing
AB={x:xA and xB}.A\cap B=\{x : x\in A\ \text{and}\ x\in B\}.
  • Complement: The complement of a set AA is communicated by writing
Ac={x:x∉A}.A^c=\{x : x\not\in A\}.
  • Set difference: The set difference of two sets AA and BB is communicated by writing
AB={x:xA and x∉B}.A\setminus B=\{x : x\in A\ \text{and}\ x\not\in B\}.
  • Product: The product of two sets AA and BB is communicated by writing
A×B={(a,b):aA and bB},A\times B=\{(a,b) : a\in A\ \text{and}\ b\in B\},

where (a,b)(a,b) is an ordered pair.

Relations: notation and terminology

We will first establish some common notation and terminology concerning relations.

  • Relation: A (binary) relation R\Rel{R} is a subset of A×BA\times B. If (a,b)R(a,b)\in\Rel{R}, then we write aRba\Rel{R} b.

  • Examples:

    • Let RA\rel{A} be the relation "is an ancestor of." Then RA\rel{A} is a relation on P×PP\times P, where PP is the set of all people.

      • Let RL\rel{L} be the relation "likes." Then RL\rel{L} is a relation on P×PP\times P as well.

      Note: If you look at the set of all people, and you look at ordered pairs of people, you might ask whether or not the ordered pair (Bonnie,Jenny)(\text{Bonnie},\text{Jenny}) is in RA\rel{A} or (Bonnie,Jenny)RA(\text{Bonnie},\text{Jenny})\in\rel{A}. Linguistically, "Does Bonnie RA\rel{A} Jenny?" or "Is Bonnie an ancestor of Jenny?" Probably not. But does Bonnie RL\rel{L} Jenny? We certainly hope so. In terms of notation, then, we would have (Bonnie,Jenny)∉RA(\text{Bonnie},\text{Jenny})\not\in\rel{A} and (Bonnie,Jenny)RL(\text{Bonnie},\text{Jenny})\in\rel{L}. Also worth noting is that the relation RL\rel{L} need not necessarily be defined on P×PP\times P. It would be reasonable to ask whether or not a person in the set PP liked some sort of hobby in the set HH (in which case we would consider RL\rel{L} to be defined on the set P×HP\times H, where PP is the set of all people and HH is the set of all hobbies) and the like.

      • Let RS\rel{S} be the relation "is a sibling of." Then RS\rel{S} is, again, a relation on P×PP\times P as well.
      • The symbol <<, which most people are quite familiar with, stands for the relation "less than," and we may define this relation on Z×Z\Z\times\Z.
  • Equivalence relation: An equivalence relation R\Rel{R} on a set SS is a relation on S×SS\times S such that three things hold:

    1. Reflexive: It must be the case that aRaa\Rel{R} a.
    2. Symmetric: It must be the case that aRb    bRaa\Rel{R} b\implies b\Rel{R} a.
    3. Transitive: It must be the case that (aRb and bRc)    aRc(a\Rel{R} b\ \text{and}\ b\Rel{R} c)\implies a\Rel{R} c.

For the examples given above, we note the following:

  • Relation: RA\rel{A} ("is an ancestor of")

    • Reflexive: No. One cannot be an ancestor of him or herself.
    • Symmetric: No. You are not an ancestor of your ancestor.
    • Transitive: Yes. If Jack is an ancestor of Jill who is an ancestor of Glum, then Jack is certainly an ancestor of Glum.
  • Relation: RL\rel{L} ("likes")

    • Reflexive: Yes. Or at least one should hope this would be the case.
    • Symmetric: No. This is too often the case, but it could certainly be affirmative in many cases.
    • Transitive: No. Certainly not!
  • Relation: RS\rel{S} ("is a sibling of")

    • Reflexive: No. You are not a sibling of yourself. Note that this strongly depends on how sibling is defined, where nearly all definitions simply mean "brother or sister."
    • Symmetric: Yes. Certainly!
    • Transitive: Yes. Certainly!
  • Relation: << ("less than")

    • Reflexive: No. A number cannot be less than itself. That would be most problematic and rather concerning.
    • Symmetric: No. This would be rather problematic as well.
    • Transitive: Yes. Certainly!

    Note that an equivalence relation is just a way of specifying how two objects are meant to be interpreted as equivalent (whether or not such an interpretation is mathematically useful is another matter). The equivalence relation "equals" or "==" is a natural equivalence relation, but there may be other kinds of equivalence relations. For example, all of the sophomores at a college may be deemed equivalent in terms of class year and the like.

  • Function: A function F\Rel{F} from AA to BB is a relation such that if aFba\Rel{F} b and aFba\Rel{F} b', then b=bb=b'.

Constructing the rational numbers

We will now set to work on constructing the set of rational numbers.

  • Assumptions: We will assume properties about the arithmetic and order of Z\Z, the set of integers. The word construction often implies that there is some goal. What is the goal of the construction of Q\Q?

  • Definition: First, how is Q\Q defined? We might define it to be the set

    Q={mn:m,nZ,n0}.\Q=\{\tfrac{m}{n} : m,n\in\Z, n\neq0\}.

    At this point, however, this is rather useless because we have no idea what "mm over nn" or mn\frac{m}{n} really means. What could it possibly mean if all we have to work with is Z\Z and its algebraic and order properties (which is what we are using to build or construct Q\Q)? Note that there's no division defined on Z\Z because such an operation would often lead to a result outside of the number set; for example, "3 divided by 2" or "3 over 2" involves the operation of division on two integers, but the result itself is not an integer.

  • Motivation: When we think about fractions, we often think about trying to teach children about, for instance, how to divide a cake into three equal pieces and giving somebody one of those three pieces. We call such a fraction "one third," which really means one part of three. That's one way to think about it, but there are other ways to describe the same quantity. I could divide the cake into six pieces and pick two of those pieces. Such a fraction would be called "two sixths." For both cases, I could mathematically write 13\frac{1}{3} and 26\frac{2}{6}, respectively, to communicate this. Or one could draw the following two cakes (which look remarkably like intervals):

In the left-hand figure, we have divided the whole cake into 3 portions. A child may take 1 part out of 3 of the cake for his or her portion. We have a name for such a division and\slash or portion of this sort: "one third." What about the right-hand figure? What if the cake was divided into 6 pieces and a child took 2 parts out of 6? Just from looking at the figures, this looks like somehow the quantities "1 part out of 3" and "2 parts out of 6," which we may normally denote as 13\frac{1}{3} and 26\frac{2}{6}, respectively, are "the same" or "equivalent" in some way.

This presents a slightly subtle problem since the fractions 13\frac{1}{3} and 26\frac{2}{6} are certainly different, but they seem to be representing the same thing! So the "issue" is fairly apparent — we have two different fractions that represent the same thing or what we would like to interpret as the same (i.e., equivalent) thing. So maybe we want to set up a construction where we define fractions in terms of equivalence relations. (It should be noted here how sometimes the real numbers themselves are defined in terms of equivalence relations where equivalence class representatives are Cauchy sequences.) How are we going to do that? What should the equivalence relation be?

  • Fractions as Equivalence Relations: We can represent the fractions 13\frac{1}{3} and 26\frac{2}{6} as ordered pairs and write (1,3)(2,6)(1,3)\sim(2,6) to communicate that these ordered pairs are equivalent much in the sense that we would normally write 13=26\frac{1}{3}=\frac{2}{6}. The idea is that these ordered pairs belong to some equivalence class that may have many other elements such as (10,30)(10,30), (20,60)(20,60), and so forth. In this particular example, we shall say that (1,3)(1,3), (2,6)(2,6), (10,30)(10,30), (20,60)(20,60), and many more belong to the equivalence class that we will call "13\frac{1}{3}." We will let Q\Q be the set of all such equivalence classes of ordered pairs in Z×(Z{0})\Z\times(\Z\setminus\{0\}).

  • A more thorough note about fractions as equivalence classes: We noted that we would like to think of 13\frac{1}{3} and 26\frac{2}{6} as being equivalent as well as many other quantities like 1030\frac{10}{30} or 2060\frac{20}{60}, and we will now think about all of this more formally by thinking of (1,3)(1,3), (2,6)(2,6), (10,30)(10,30), (20,60)(20,60), etc., as all being equivalent in some way. The idea is that all of these ordered pairs (and perhaps many many more) belong to some equivalence class, and we'll give that equivalence class a name — we will call it "13\frac{1}{3}."

    Of course, once we have "13\frac{1}{3}" we can simply talk about fractions, and everyone knows how to work with fractions, which are really disguised ways of dealing with the equivalence relation that's embedded here. What actually is the equivalence relation that everyone learns in grade school but not in such formal terms? Whatever the case, the set of all such equivalence classes (i.e., ordered pairs from Z×(Z{0})\Z\times(\Z\setminus\{0\})) will be called Q\Q. Of course, we now need to actually put some thought into defining the equivalence relation.

  • Number system extensions: It may be useful to consider how Q\Q is related to the set we began with, namely Z\Z. The way Q\Q is often thought of is that it somehow "extends" Z\Z. That is, we have a bunch of points or whole numbers on the number line, and now we've filled them in with a bunch of other points in between. But are the original points on the number line (i.e., the whole numbers) still there? Yes. They're embedded in Q\Q. So we might want to say how Z\Z is embedded in Q\Q. We want our ordered pairs to extend Z\Z so that, for example, n1Q\frac{n}{1}\in\Q corresponds to nZn\in\Z. (Algebraically, we are looking for an isomorphism of Z\Z into Q\Q.)

  • More on relationship between Z\bm{\Z} and Q\bm{\Q}: How is Q\Q related to Z\Z in the construction that we are undertaking? We might like to think of Q\Q as some sort of extension of Z\Z where Z\Z is just a bunch of equally spaced big points and Q\Q somehow fills in the space between such points. The fact that Q\Q is dense in itself (i.e., between every two rationals is another rational) may give one the erroneous notion that a number line is "completed" by extending Z\Z to Q\Q, but we will see that is not the case by noting that 2\sqrt{2} is irrational and yet on the number line we typically envision.

    Are the original integers still in the new construction of Q\Q that we have made? Yes, they are embedded in Q\Q. It may be nice to say how Z\Z is embedded in Q\Q. We may note that n1Q\frac{n}{1}\in\Q corresponds to nZn\in\Z. From an algebraic point of view: we are looking for an isomorphism of Z\Z into Q\Q.

  • The Equivalence Relation: After enough examples, we see that Q={mn:m,nZ,n0}\Q=\{\frac{m}{n} : m,n\in\Z, n\neq0\}, where mn\frac{m}{n} is an equivalence class containing (m,n)(m,n) with relation (p,q)(m,n)(p,q)\sim(m,n) if pn=qmpn=qm and q,n0q,n\neq0. If pn=qmpn=qm and q,n0q,n\neq0, then we say that the ordered pairs (p,q)(p,q) and (m,n)(m,n) are equivalent or (p,q)(m,n)(p,q)\sim(m,n)

    We may express this more formally by recalling that the equivalence class of an element aa is often denoted [a][a] and is defined as the set [a]={xX:ax}[a]=\{x\in X : a\sim x\}; hence, the equivalence class of the element (m,n)(m,n) may be denoted [(m,n)][(m,n)] or mn\frac{m}{n} and defined as

    mn=[(m,n)]={(p,q)Z×(Z{0}):(m,n)(p,q)},\tfrac{m}{n}=[(m,n)]=\{(p,q)\in\Z\times(\Z\setminus\{0\}) : (m,n)\sim(p,q)\},

    where the binary relation \sim is defined by writing (a,b)(c,d)(a,b)\sim(c,d) if and only if ad=bcad=bc with b,d0b,d\neq0.

    Is the relation "\sim" we have just defined an equivalence relation though? Let's check. (In the transitive case, note that division is not an option since the integers are not closed with respect to division; however, we have the next best thing, namely the cancellation law for integers which states that ab=acab=ac implies b=cb=c when a0a\neq0.)

    • Reflexive: We must confirm that (p,q)(p,q)(p,q)\sim(p,q). By our definition, this means we must have pq=qppq=qp and q0q\neq0. It is clearly true that pq=qppq=qp and q0q\neq0 by definition and the fact that multiplication of integers is commutative. (We say "by definition" because the ordered pairs we are considering are from the set Z×(Z{0})\Z\times(\Z\setminus\{0\}), and the definition of this set stipulates that the second coordinate be nonzero, which necessarily prohibits the possibility of having a zero "denominator.") Thus, the definition of \sim, along with multiplication being commutative in Z\Z, implies that (p,q)(p,q)(p,q)\sim(p,q) as desired.

    • Symmetric: We must confirm that, given (p,q)(m,n)(p,q)\sim(m,n), it necessarily follows that (m,n)(p,q)(m,n)\sim(p,q); that is, (p,q)(m,n)    (m,n)(p,q)(p,q)\sim(m,n)\implies(m,n)\sim(p,q). By definition, (p,q)(m,n)(p,q)\sim(m,n) tells us that pn=qmpn=qm, where q,n0q,n\neq0. Since equality (i.e., "==") is itself an equivalence relation, knowing that pn=qmpn=qm tells us that qm=pnqm=pn by symmetry and mq=npmq=np by commutativity of multiplication of integers. Since mq=npmq=np and q,n0q,n\neq0, we have (m,n)(p,q)(m,n)\sim(p,q) by definition of \sim. Hence, (p,q)(m,n)(p,q)\sim(m,n) implies (m,n)(p,q)(m,n)\sim(p,q), as desired.

    • Transitive: We must confirm that, given (p,q)(m,n)(p,q)\sim(m,n) and (m,n)(a,b)(m,n)\sim(a,b), it necessarily follows that (p,q)(a,b)(p,q)\sim(a,b). We can use a few observations, along with the definition of \sim, to piece together this slightly trickier proof than the proofs for the reflexive and symmetric cases:

      1. Given (p,q)(m,n)(p,q)\sim(m,n), we have pn=qmpn=qm, where q,n0q,n\neq0. 2. Given (m,n)(a,b)(m,n)\sim(a,b), we have mb=namb=na, where n,b0n,b\neq0. 3. Consider the following:
      pn=qm    pnb=qmb(cancellation property of Zb0)    pnb=qna(by substitution, where mb=na)    pbn=qan(comm. and assoc. of mult. on Z)    pb=qa.(cancellation property of Zn0)\begin{align*} pn=qm &\iff pnb=qmb & \text{(cancellation property of $\Z$, $b\neq0$)}\\[0.5em] &\iff pnb=qna & \text{(by substitution, where $mb=na$)}\\[0.5em] &\iff pbn=qan & \text{(comm. and assoc. of mult. on $\Z$)}\\[0.5em] &\iff pb=qa. & \text{(cancellation property of $\Z$, $n\neq0$)} \end{align*}
      1. Since pb=qapb=qa by the above observation, and q,b0q,b\neq0 by the first and second observations, respectively, we have (p,q)(a,b)(p,q)\sim(a,b) by definition of \sim. Thus, given (p,q)(m,n)(p,q)\sim(m,n) and (m,n)(a,b)(m,n)\sim(a,b), it necessarily follows that (p,q)(a,b)(p,q)\sim(a,b), as desired.

      Since the relation \sim is reflexive, symmetric, and transitive, we see that \sim is an equivalence relation.

  • Arithmetic on Q\bm{\Q}: Great. So we've constructed Q\Q insofar as knowing what the elements of Q\Q look like, but we haven't yet talked about arithmetic on Q\Q. We'll have to check a few things there as well. That will be the next lecture.