Skip to main content

6 - The principle of induction

What induction actually is

So what is induction and how do we set the stage for it?

  • The natural numbers: So what is induction? Well, induction actually takes a couple different forms. You think can think of it as a proof technique but really you should think of it as an axiomatic property of the natural numbers. So let's begin with the natural numbers, where we take the natural numbers to be
N={1,2,3,4,},\N=\{1,2,3,4,\ldots\},

but we could also include 0 without an issue. That is, the fact that the particular set N\N begins with 1 is inconsequential — we are going to let N\N start with 1 because we are going to think of its elements as indexing statements that we want to prove.

  • The well-ordering principle: The well-ordering property is a property of the natural numbers that should hopefully seem self-evident. This property for N\N says first of all that N\N is well-ordered. What does that mean? It means that every nonempty subset of the natural numbers has a least element. So the natural numbers are endowed with an order which is the one you usually think of. For example,

    1<2<3<4,1<2<3<4,\ldots

    If you're worried about that axiomatically, then you could begin with constructing the natural numbers using Peano's axioms. There are five axioms for the natural numbers, and it usually begins with 0. You postulate the existence of a zero. Then you postulate the existence of a successor function, which basically tells you what the next thing is. So 0 has a successor. We'll call it 1. And 1 has a successor. We'll call it 2. And so on. And there are a few more axioms, but the point is that by starting with Peano's axioms you could develop an order. So we would say, for example, that 4 is bigger than 1 if it is the case that 4 is an eventual successor of 1, if you want to go back that far. We're going to assume that we have a particular order on N\N, and the claim is that this particular ordering is a well-ordering; that is, we claim that N\N has the well-ordering property that states any nonempty subset of N\N has a least element. This seems self-evident, and we are going to take it to be an axiom for the natural numbers. What may not seem self-evident is the relationship between the well-ordering property of N\N and the principle of induction. In summary, you can take the well-ordering principle to be an axiom that is something we assume and do not prove about the natural numbers. It is related to a central topic of this course, namely the principle of induction.

  • Principle of induction: Let SS be a subset of N\N such that the following two properties holds:

    1. 1S1\in S,
    2. if kSk\in S, then k+1Sk+1\in S,

    then S=NS=\N.

    Based on how the well-ordering property of N\N is communicated as opposed to the principle of induction, one may likely assume one is more "self-evident" than the other. For many people, the well-ordering property of N\N seems more evident than the principle of induction. The good news is that these two properties are equivalent. So you could take either property to be axiomatic, but we need to take at least one of them to be axiomatic. Since the well-ordering property (or \wop from now on) and the principle of mathematical induction (or \pmi from now on) are equivalent properties, we may write WOP    PMI\wop\iff\pmi to communicate this. In order to prove these properties are equivalent, we must show both direction for the biconditional. We will prove that WOP    PMI\wop\implies\pmi and circle back to the other direction if time permits.

  • Well-ordering implies principle of induction: Let SS be a nonempty subset of N\N such that 1S1\in S and if kSk\in S then k+1Sk+1\in S. Now, either S=NS=\N, which is what we want to show, or SNS\neq\N which would be a problem. So let's assume, for a hopefully eventual contradiction, that these properties of SS hold but that SNS\neq\N. Where is the contradiction going to come from? If we list out some numbers, say the numbers 1, 2, 3, 4, then we know by our hypotheses for SS that 5 will come next. Then 6. And so on. But this "and so on" is where we run into some issues. Because we cannot just keep appealing to the properties of SS that gives us successive numbers and claim that N\N is obtained in some murky fashion. That is rather unsatisfactory. Hand-wavy and grossly incomplete at best. So how might we start to think of a potentially useful way to get a contradiction?

    What about thinking about something not in SS? This idea is nice because it allows us to get a handle on something very specific whereby we should hopefully obtain a contradiction but where we aren't appealing to a property over and over again. So consider the set of all natural numbers not in SS. More specifically, consider the set A=NSA=\N\setminus S. Then AA, since it is a nonempty set of natural numbers, must have a least element. We deduce that AA has a least element by appealing to the \wop — this will be why we do not have to bootstrap the property "if kSk\in S then k+1Sk+1\in S" over and over again. Let \ell denote the least element of AA. Then 1∉A\ell-1\not\in A, but is it necessarily true that 1S\ell-1\in S; put another way, do we know that 1\ell-1 actually has a predecessor? Yes, because we assumed 1S1\in S at the beginning so we know it's not possible that =1\ell=1 (otherwise S\ell\in S). Note that >1\ell>1 because 1S1\in S. Thus, 1∉A1\not\in A since 1S1\in S and AA consists of all those members in N\N not in SS. Consider 1\ell-1. It must be the case that 1∉A\ell-1\not\in A; hence, 1S\ell-1\in S. But the assumed property that "if kSk\in S then k+1Sk+1\in S" tells us that (1)+1=S(\ell-1)+1=\ell\in S, and this is a contradiction.

How induction can be useful in the context of proofs

How might induction be useful in the context of proofs?

  • Pragmatism of induction in proofs: What you usually want to use induction for is to prove a whole series of statements (infinitely many statements) all at once, and those statements are indexed by the natural numbers. (There's a first statement, a second statement, etc.) More precisely, let P(n)P(n) be statements indexed by nNn\in\N. The idea then is to show that P(n)P(n) is true for all nn. Now, what would that mean? What I'm really looking at is a set. I'm looking at the set of all statements SS where P(n)P(n) is true. By induction, we have S=NS=\N. That is, the statement P(n)P(n) will hold for all natural numbers. In terms of an actual proof write up, the structure of the proof is often divided into two steps, where the first step is often called the "base case" and the second step the "inductive step":

    1. Show that P(1)P(1) is true. (BASE CASE)
    2. Show that if P(k)P(k) is true, then P(k+1)P(k+1) must also be true. (INDUCTIVE STEP)

    What assumption do you normally make when you are doing the inductive step? It's actually built into the statement of the step itself, namely "If P(k)P(k) is true, then ... ." Note that kk is fixed here and you are not assuming what you are trying to prove. The proof amounts to first showing that P(1)P(1) holds. Then P(k)    P(k+1)P(k)\implies P(k+1). Or, in set notation, you first show that 1S1\in S. Then kS    (k+1)Sk\in S\implies (k+1)\in S. (Using set notation may also more clearly indicate why the kk is fixed for the implication P(k)    P(k+1)P(k)\implies P(k+1) because the set notation equivalent is simply kS    (k+1)Sk\in S\implies (k+1)\in S, where it's clear that kk is a number and not a whole set of numbers. See the note below about "fixed kk" for an explanation of this and how comparing "regular" and "strong" induction makes this difference clear.) Perhaps more concretely, by showing that P(1)P(1) holds and also that P(k)    P(k+1)P(k)\implies P(k+1), you essentially take care of infinitely many implications all at once. Knowing P(1)P(1) holds and that P(k)    P(k+1)P(k)\implies P(k+1) means we first have P(1)    P(2)P(1)\implies P(2). So now we know P(2)P(2) holds. If we have P(2)P(2), then we have P(2)    P(3)P(2)\implies P(3) since P(k)    P(k+1)P(k)\implies P(k+1). Essentially, if we have shown that P(1)P(1) holds, and that P(k)    P(k+1)P(k)\implies P(k+1), then we have the chain of implications

    P(1)    P(2),P(2)    P(3),P(3)    P(4),P(1)\implies P(2),\quad P(2)\implies P(3),\quad P(3)\implies P(4),\quad\ldots

    For the implication P(k)    P(k+1)P(k)\implies P(k+1), when we assume the antecedent P(k)P(k) holds, this assumption is called the inductive hypothesis. So we have shown that P(n)P(n) is true for all natural numbers nn. More explicitly our SS in this case is simply S={nP(n)}=NS=\{n \mid P(n)\}=\N.

    In summary, with proofs by induction, we define some statement P(n)P(n), and we consider the set S={nP(n)}S=\{n\mid P(n)\}. By showing that 1S1\in S and that kS    (k+1)Sk\in S\implies(k+1)\in S, we effectively show that S=NS=\N, and this amounts to saying that P(n)P(n) holds for all natural numbers.

  • Strong induction: This is essentially the same as ordinary induction in the sense that strong induction and ordinary induction are logically equivalent, but the inductive step for strong induction looks slightly different. The structure for proofs by strong induction will look as follows:

    1. Show that P(1)P(1) is true. (BASE CASE)
    2. Show that if P(1)P(1), P(2)P(2), \ldots, P(k)P(k) are true, then P(k+1)P(k+1) must also be true. (INDUCTIVE STEP)

    The principle of strong induction allows you to assume that everything up to the kkth statement is true. You can see why that is equivalent because if you show that 1 is in the set, and that each successor of anything in the set is in the set, then you have everything from 1 through kk in the set already.

  • Why kk is fixed for ordinary induction and "not really" for strong induction: It is helpful throughout to think of the natural numbers as simply indexing statements. (We could use integers just as well, but it might be a little odd to talk about the negative seventh statement.) We state below the forms for ordinary and strong induction, where we communicate them both by using a statement P(n)P(n) as well as a set of indices SS.

    • Ordinary induction: We can communicate this as follows:

      1. Show 1S1\in S and that kS    (k+1)Sk\in S\implies (k+1)\in S. Then S=NS=\N.
      2. Show P(1)P(1) is true and that P(k)    P(k+1)P(k)\implies P(k+1). Then P(n)P(n) is true for all natural numbers.
    • Strong induction: We can communicate this as follows:

      1. Show 1S1\in S and that {1,2,,k}S    (k+1)S\{1,2,\ldots,k\}\subseteq S\implies (k+1)\in S. Then S=NS=\N.
      2. Show P(1)P(1) is true and that P(j)    P(k+1)P(j)\implies P(k+1), where 1jk1\leq j\leq k; that is, you are assuming the statement in question holds for indexed statements 1,2,,k1,2,\ldots,k, and you are showing the (k+1)(k+1)th indexed statement follows from this assumption or set of assumptions. Then P(n)P(n) is true for all natural numbers.

      Note: For the part above, sometimes the form of the inductive hypothesis "P(j)    P(k+1)P(j)\implies P(k+1) where 1jk1\leq j\leq k" is actually communicated by writing

      [P(1)P(2)P(3)P(k)]    P(k+1).[P(1)\land P(2)\land P(3)\land\cdots\land P(k)]\implies P(k+1).

      Regardless of how the different forms are communicated, it must be stressed that you are never engaging in circular logic and ultimately assuming what you are trying to prove — you are showing that the (k+1)(k+1)th indexed statement (i.e. P(k+1)P(k+1)) follows in some form or fashion from your hypotheses. (The "form or fashion" here concern ordinary or strong induction.)

  • Style in inductive proofs: Aside from telling the reader that you will be doing a poof by induction, what else should you possibly make explicit? What your indexes are; that is, you should say what you will be inducting on or, in other words, what index you are looking to make this induction argument hinge upon. In most cases you are inducing on nn, where this variable nn represents the index. So you should always specify what variables you are inducting on where these variables represent indices in the case of more than one variable.

Examples of proofs using induction

What are some examples using proofs by induction?

  • Tiling a chessboard with L-shaped pieces: The goal will be to prove that any 2n×2n2^n\times2^n chessboard with one square removed can be tiled by L-shaped pieces. Now, the thing about induction that is really amazing is the actual statement or statements you are trying to show ... you might have no idea how to do it. But what's great about induction is often you don't need insight to do the problem. You're sort of replacing the fact that I need to have some insight about some 8×88\times8 board by the fact that I know that if I can do a 4×44\times 4 board then I can do an 8×88\times 8 board. And so on. So we'll induct on nn for our statement about 2n×2n2^n\times2^n boards.

    It is worth noting that often in the process of induction you are looking for the smaller cases embedded within the big one. So in this case we start off with a big square and one small square removed. We cut the big square into four pieces and apply the inductive hypothesis to the square with a removed square. (The beauty is we can apply the inductive hypothesis without any real insight.) We cannot yet apply the inductive hypothesis to the other three squares though because none of them have a square removed. But I could remove a square from each of the three squares in such a way that the squares removed form an L-shaped tile. I could then apply the inductive hypothesis to the three squares, and what I'm left with is with three small, adjoined squares that can be covered by an L-shaped tile based on how I removed the little squares.

  • Finite sets and supremums: Every finite set of real numbers contains its supremum. How might we try to prove this? We have a finite set of numbers — it's a statement about a bunch of numbers. A finite amount of numbers in fact! That's a bunch of statements you're trying to prove. What should we induct on? How about the number of things in the set nn? Do the base case. For the inductive step, what should you start with? A set with how many elements? A set with k+1k+1 elements. And then find the kk-element set within that. The mistake committed every single year is that one will start with a kk-element set and then add one element to it instead of starting with the generalized set with k+1k+1 elements. (This is similar to the chessboard example where one might be tempted to start with a 2k×2k2^k\times2^k board and extend to a 2k+1×2k+12^{k+1}\times2^{k+1} board, but after applying the inductive hypothesis to the 2k×2k2^k\times2^k board, it's not at all clear whether or not you have a general enough board to handle the case when you add three more 2k×2k2^k\times2^k boards to adjoin to the original 2k×2k2^k\times2^k board to form a 2k+1×2k+12^{k+1}\times2^{k+1} board.)

  • All natural numbers are even (fake proof): Every natural number nn is even. We'll use strong induction to prove this statement. Let's assume all numbers less than or equal to nn are even. Note that n+1=(n1)+2n+1=(n-1)+2. By the inductive hypothesis, n1n-1 is even. So n+1n+1 is the sum of two even numbers, as desired. End of proof. The problem is there was no base case so we could never really get the proof off the ground. The interesting thing is this argument will work to show that all even numbers are even.

  • All horses are the same color: We will induct on the number of horses in a given set. We will do the base case because now we are so concerned about the base case. For the base case, a set with 1 horse, all horses are the same color. So the statement holds. For the inductive step, let S={h1,,hn+1}S=\{h_1,\ldots,h_{n+1}\}. And now I want to find the inductive hypothesis in here which concerns a set with how many horses? A set with nn horses. Then S={h1,,hn}S'=\{h_1,\ldots,h_n\} and S"={h2,,hn+1}S"=\{h_2,\ldots,h_{n+1}\} have nn horses, and by the inductive hypothesis have the same color in each set. But clearly h2h_2 is in both sets so all the horses in SS' have the same color as h2h_2 and all the horses in S"S" have the same color as h2h_2 as well. And we are done. The problem: Without realizing it, I have assumed that I have at least 3 horses because when I go down to the two sets SS' and S"S" I assumed they overlapped with h2h_2. But there is a case where they don't overlap at all! That's the case when you have 2 horses. And then you have disjoint sets. So the problem here is that the inductive step fails in a very bizarre way. In particular, it fails when n=2n=2.

  • Induction involving sets other than the natural numbers: Is it possible to have a "principle of mathematical induction" where the set SS is not a subset of the natural numbers but of another set? The issue has to do with countability (which makes sense because sets that are countable can be put into one-to-one correspondence to the natural numbers). You could do this for subsets of sets other than N\N, but you need something stronger that is not logically equivalent to induction. It's called the principle of transfinite induction.