6 - The principle of induction
What induction actually is
So what is induction and how do we set the stage for it?
- The natural numbers: So what is induction? Well, induction actually takes a couple different forms. You think can think of it as a proof technique but really you should think of it as an axiomatic property of the natural numbers. So let's begin with the natural numbers, where we take the natural numbers to be
but we could also include 0 without an issue. That is, the fact that the particular set begins with 1 is inconsequential — we are going to let start with 1 because we are going to think of its elements as indexing statements that we want to prove.
-
The well-ordering principle: The well-ordering property is a property of the natural numbers that should hopefully seem self-evident. This property for says first of all that is well-ordered. What does that mean? It means that every nonempty subset of the natural numbers has a least element. So the natural numbers are endowed with an order which is the one you usually think of. For example,
If you're worried about that axiomatically, then you could begin with constructing the natural numbers using Peano's axioms. There are five axioms for the natural numbers, and it usually begins with 0. You postulate the existence of a zero. Then you postulate the existence of a successor function, which basically tells you what the next thing is. So 0 has a successor. We'll call it 1. And 1 has a successor. We'll call it 2. And so on. And there are a few more axioms, but the point is that by starting with Peano's axioms you could develop an order. So we would say, for example, that 4 is bigger than 1 if it is the case that 4 is an eventual successor of 1, if you want to go back that far. We're going to assume that we have a particular order on , and the claim is that this particular ordering is a well-ordering; that is, we claim that has the well-ordering property that states any nonempty subset of has a least element. This seems self-evident, and we are going to take it to be an axiom for the natural numbers. What may not seem self-evident is the relationship between the well-ordering property of and the principle of induction. In summary, you can take the well-ordering principle to be an axiom that is something we assume and do not prove about the natural numbers. It is related to a central topic of this course, namely the principle of induction.
-
Principle of induction: Let be a subset of such that the following two properties holds:
- ,
- if , then ,
then .
Based on how the well-ordering property of is communicated as opposed to the principle of induction, one may likely assume one is more "self-evident" than the other. For many people, the well-ordering property of seems more evident than the principle of induction. The good news is that these two properties are equivalent. So you could take either property to be axiomatic, but we need to take at least one of them to be axiomatic. Since the well-ordering property (or \wop from now on) and the principle of mathematical induction (or \pmi from now on) are equivalent properties, we may write to communicate this. In order to prove these properties are equivalent, we must show both direction for the biconditional. We will prove that and circle back to the other direction if time permits.
-
Well-ordering implies principle of induction: Let be a nonempty subset of such that and if then . Now, either , which is what we want to show, or which would be a problem. So let's assume, for a hopefully eventual contradiction, that these properties of hold but that . Where is the contradiction going to come from? If we list out some numbers, say the numbers 1, 2, 3, 4, then we know by our hypotheses for that 5 will come next. Then 6. And so on. But this "and so on" is where we run into some issues. Because we cannot just keep appealing to the properties of that gives us successive numbers and claim that is obtained in some murky fashion. That is rather unsatisfactory. Hand-wavy and grossly incomplete at best. So how might we start to think of a potentially useful way to get a contradiction?
What about thinking about something not in ? This idea is nice because it allows us to get a handle on something very specific whereby we should hopefully obtain a contradiction but where we aren't appealing to a property over and over again. So consider the set of all natural numbers not in . More specifically, consider the set . Then , since it is a nonempty set of natural numbers, must have a least element. We deduce that has a least element by appealing to the \wop — this will be why we do not have to bootstrap the property "if then " over and over again. Let denote the least element of . Then , but is it necessarily true that ; put another way, do we know that actually has a predecessor? Yes, because we assumed at the beginning so we know it's not possible that (otherwise ). Note that because . Thus, since and consists of all those members in not in . Consider . It must be the case that ; hence, . But the assumed property that "if then " tells us that , and this is a contradiction.
How induction can be useful in the context of proofs
How might induction be useful in the context of proofs?
-
Pragmatism of induction in proofs: What you usually want to use induction for is to prove a whole series of statements (infinitely many statements) all at once, and those statements are indexed by the natural numbers. (There's a first statement, a second statement, etc.) More precisely, let be statements indexed by . The idea then is to show that is true for all . Now, what would that mean? What I'm really looking at is a set. I'm looking at the set of all statements where is true. By induction, we have . That is, the statement will hold for all natural numbers. In terms of an actual proof write up, the structure of the proof is often divided into two steps, where the first step is often called the "base case" and the second step the "inductive step":
- Show that is true. (BASE CASE)
- Show that if is true, then must also be true. (INDUCTIVE STEP)
What assumption do you normally make when you are doing the inductive step? It's actually built into the statement of the step itself, namely "If is true, then ... ." Note that is fixed here and you are not assuming what you are trying to prove. The proof amounts to first showing that holds. Then . Or, in set notation, you first show that . Then . (Using set notation may also more clearly indicate why the is fixed for the implication because the set notation equivalent is simply , where it's clear that is a number and not a whole set of numbers. See the note below about "fixed " for an explanation of this and how comparing "regular" and "strong" induction makes this difference clear.) Perhaps more concretely, by showing that holds and also that , you essentially take care of infinitely many implications all at once. Knowing holds and that means we first have . So now we know holds. If we have , then we have since . Essentially, if we have shown that holds, and that , then we have the chain of implications
For the implication , when we assume the antecedent holds, this assumption is called the inductive hypothesis. So we have shown that is true for all natural numbers . More explicitly our in this case is simply .
In summary, with proofs by induction, we define some statement , and we consider the set . By showing that and that , we effectively show that , and this amounts to saying that holds for all natural numbers.
-
Strong induction: This is essentially the same as ordinary induction in the sense that strong induction and ordinary induction are logically equivalent, but the inductive step for strong induction looks slightly different. The structure for proofs by strong induction will look as follows:
- Show that is true. (BASE CASE)
- Show that if , , \ldots, are true, then must also be true. (INDUCTIVE STEP)
The principle of strong induction allows you to assume that everything up to the th statement is true. You can see why that is equivalent because if you show that 1 is in the set, and that each successor of anything in the set is in the set, then you have everything from 1 through in the set already.
-
Why is fixed for ordinary induction and "not really" for strong induction: It is helpful throughout to think of the natural numbers as simply indexing statements. (We could use integers just as well, but it might be a little odd to talk about the negative seventh statement.) We state below the forms for ordinary and strong induction, where we communicate them both by using a statement as well as a set of indices .
-
Ordinary induction: We can communicate this as follows:
- Show and that . Then .
- Show is true and that . Then is true for all natural numbers.
-
Strong induction: We can communicate this as follows:
- Show and that . Then .
- Show is true and that , where ; that is, you are assuming the statement in question holds for indexed statements , and you are showing the th indexed statement follows from this assumption or set of assumptions. Then is true for all natural numbers.
Note: For the part above, sometimes the form of the inductive hypothesis " where " is actually communicated by writing
Regardless of how the different forms are communicated, it must be stressed that you are never engaging in circular logic and ultimately assuming what you are trying to prove — you are showing that the th indexed statement (i.e. ) follows in some form or fashion from your hypotheses. (The "form or fashion" here concern ordinary or strong induction.)
-
-
Style in inductive proofs: Aside from telling the reader that you will be doing a poof by induction, what else should you possibly make explicit? What your indexes are; that is, you should say what you will be inducting on or, in other words, what index you are looking to make this induction argument hinge upon. In most cases you are inducing on , where this variable represents the index. So you should always specify what variables you are inducting on where these variables represent indices in the case of more than one variable.
Examples of proofs using induction
What are some examples using proofs by induction?
-
Tiling a chessboard with L-shaped pieces: The goal will be to prove that any chessboard with one square removed can be tiled by L-shaped pieces. Now, the thing about induction that is really amazing is the actual statement or statements you are trying to show ... you might have no idea how to do it. But what's great about induction is often you don't need insight to do the problem. You're sort of replacing the fact that I need to have some insight about some board by the fact that I know that if I can do a board then I can do an board. And so on. So we'll induct on for our statement about boards.
It is worth noting that often in the process of induction you are looking for the smaller cases embedded within the big one. So in this case we start off with a big square and one small square removed. We cut the big square into four pieces and apply the inductive hypothesis to the square with a removed square. (The beauty is we can apply the inductive hypothesis without any real insight.) We cannot yet apply the inductive hypothesis to the other three squares though because none of them have a square removed. But I could remove a square from each of the three squares in such a way that the squares removed form an L-shaped tile. I could then apply the inductive hypothesis to the three squares, and what I'm left with is with three small, adjoined squares that can be covered by an L-shaped tile based on how I removed the little squares.
-
Finite sets and supremums: Every finite set of real numbers contains its supremum. How might we try to prove this? We have a finite set of numbers — it's a statement about a bunch of numbers. A finite amount of numbers in fact! That's a bunch of statements you're trying to prove. What should we induct on? How about the number of things in the set ? Do the base case. For the inductive step, what should you start with? A set with how many elements? A set with elements. And then find the -element set within that. The mistake committed every single year is that one will start with a -element set and then add one element to it instead of starting with the generalized set with elements. (This is similar to the chessboard example where one might be tempted to start with a board and extend to a board, but after applying the inductive hypothesis to the board, it's not at all clear whether or not you have a general enough board to handle the case when you add three more boards to adjoin to the original board to form a board.)
-
All natural numbers are even (fake proof): Every natural number is even. We'll use strong induction to prove this statement. Let's assume all numbers less than or equal to are even. Note that . By the inductive hypothesis, is even. So is the sum of two even numbers, as desired. End of proof. The problem is there was no base case so we could never really get the proof off the ground. The interesting thing is this argument will work to show that all even numbers are even.
-
All horses are the same color: We will induct on the number of horses in a given set. We will do the base case because now we are so concerned about the base case. For the base case, a set with 1 horse, all horses are the same color. So the statement holds. For the inductive step, let . And now I want to find the inductive hypothesis in here which concerns a set with how many horses? A set with horses. Then and have horses, and by the inductive hypothesis have the same color in each set. But clearly is in both sets so all the horses in have the same color as and all the horses in have the same color as as well. And we are done. The problem: Without realizing it, I have assumed that I have at least 3 horses because when I go down to the two sets and I assumed they overlapped with . But there is a case where they don't overlap at all! That's the case when you have 2 horses. And then you have disjoint sets. So the problem here is that the inductive step fails in a very bizarre way. In particular, it fails when .
-
Induction involving sets other than the natural numbers: Is it possible to have a "principle of mathematical induction" where the set is not a subset of the natural numbers but of another set? The issue has to do with countability (which makes sense because sets that are countable can be put into one-to-one correspondence to the natural numbers). You could do this for subsets of sets other than , but you need something stronger that is not logically equivalent to induction. It's called the principle of transfinite induction.