A Handwaving Introduction to Infinitesimals

In the early 1960s, Abraham Robinson established nonstandard analysis rigorously. Not digging into the controversies around hyperreal numbers and infinitesimals, here I give a handwaving introduction to infinitesimals, following the construction in Lectures on the Hyperreals, an introduction to nonstandard analysis, GTM 188, by Robert Goldblatt, Springer 1998.

Assuming we know what natural numbers are, we create integers by doing addition and subtraction between two natural numbers. Then we define rational numbers as fractions, i.e., the ratios between an integer and a nonzero integer. Then we can do addition, subtraction, multiplication between two rational numbers, and can also do division by a nonzero rational number.

Then what are real numbers? The theory is very complicated but one way is to define real numbers by Dedekind cut, which essentially says that whenever you split the set of rational numbers into two disjoint parts by a single cut, then the “cut” is a real number. For example, you may split rational numbers into A={rational numbers x such that x3<3} and {rational numbers x such that x3>3}. Then the cut is the cube root of 3, which is not a rational number but a real number.

Another way is to define real numbers as equivalence classes of Cauchy sequences of rational numbers. If you don’t know what it means, that is OK, move on and read the following paragraphs.

Assuming that (we think) we know what real numbers are, consider all sequences of real numbers. We can do addition, subtraction, and multiplication between two real sequences in the following way: for {an}={a1 , a2, a3, a4, a5, …}, {bn}={b1 , b2, b3, b4, b5, …}, we define

{an} + {bn} ={a1 + b1, a2 + b2, …}, {an} – {bn} ={a1 – b1, a2 – b2, …}, {an} x {bn} ={a1 x b1, a2 x b2, …}.

But we can not define division in this way because some of the terms of {bn} may be zero. Then we realize that when studying sequences, what we care are the “limiting behavior” of the sequence, not the exact values of the first several terms. So it is tempting to consider sequences whose terms are nonzero except for finitely many terms. For instance, we consider a sequence {bn} such that bn>0 for all n>10,000. Then we can do “division” between {an} and {bn} by defining

{an} / {bn} = {whatever you want for the first 10,000 terms, and a10001/b10001, a10002/b10002, a10003/b10003, a10004/b10004, …}.

This turns out to be a good definition for division between sequences, except that we cannot always require {bn} to have at most finitely many zero terms. The layman’s words are that as long as almost all terms of {bn} are nonzero, we can divide any sequence by {bn}, by doing standard division on the corresponding nonzero terms of {bn}, and defining other terms to be whatever you want, say just define them to be zero. Correspondingly, if almost all terms of a sequence {bn} are zero, we will treat that sequence as a zero sequence.

(The rigorous theory for the concept of “almost all terms” is to use a nonprincipal ultra filter, and use equivalence classes. We will not discuss them here.)

Since we can do addition, subtraction, and multiplication between any two sequences, and division between any sequence and a nonzero sequence, the collection of all sequences can do the same arithmetic operations as “real numbers”, and we call them hyperreal numbers. Each real number r can be written as a sequence whose terms are all r, i.e., {r} = {r, r, r, r, r, …}. In this way real numbers are in fact hyperreal numbers.

In addition, hyperreal numbers are ordered in the sense that for any two sequences {an} and {bn}, if almost all terms of {an} are smaller than the corresponding terms of {bn}, then we say {an}<{bn}, and if almost all terms of {an} are equal to the corresponding terms of {bn}, then we say {an}={bn}. By design, one and only one of the situations is true: {an}<{bn}, {an}={bn}, {an}>{bn} (which means {bn}<{an}).

Then what is an infinitesimal? An infinitesimal is a sequence {an} that is smaller than any positive real number and bigger than any negative real number, in the sense that for any real numbers r>0 and s<0, almost all terms of {an} satisfy s<an<r.

The only real infinitesimal is 0. There are many hyperreal infinitesimals. In fact, any real sequence that converges to 0 is a hyperreal infinitesimal under the above definition. Now the question is, what does it mean for a real sequence to converge to 0?

Reference: Lectures on the Hyperreals, an introduction to nonstandard analysis, GTM 188, by Robert Goldblatt, Springer 1998.

Leave a comment