The Real Number pi – or to use Greek typography, which I will do henceforth – is one of the most misunderstood in all of Mathematics. Mystery swirls around it, unlikely properties are claimed for it, false equalities are propounded for it and – most perniciously – characteristics are uniquely ascribed to it, which are actually properties of the vast majority of other Real Numbers as well. In the box below are presented just a selection of the mangled menagerie of misconceptions ^{[1]}:
Ten common misconceptions about :

Here is the beginning of in decimal notation (a notation that lies behind much of the confusion covered in the box above):
The ellipsis at the righthand end indicates that the numbers keep on going forever; more on this later. At the time of writing, the record for the number of digits calculated stands at ^{[2]}, that’s twentytwo trillion digits. However, this feat – impressive as it is – has more to do with computer science than Mathematics and it is the Mathematics of that we will be focussing on here.
In particular, I will hope to address all of the misconceptions listed in the box above and – more positively – to provide a clearer insight into the true meaning of . As in previous pieces, I will need to develop some Mathematical apparatus along the way. Here this will include, the use of polygons to estimate , the properties of Rational and Irrational Numbers and how these relate to decimal fractions, different ways to represent and, finally, the distribution of digits in ‘s expansion. In a sense, the journey may be as interesting – and as illuminating – as the destination. Let’s start with a historical perspective.
The modern day definition of relates to its role in the period of the Exponential Function, which is . This is something that I covered in length across both Euler’s Number and The Equation. As explained in these articles, the Exponential Function is a much more fundamental part of Mathematics and, to an extent, we can think of as a byproduct of this.
However, no schoolchild is introduced to this way and it was not how the ancients stumbled across the concept either. Instead first came to prominence in considering circles. Buried in the mists of time, someone (or more likely some several people independently) noticed that – as in the image appearing at the top of this article ^{[3]} – if you fixed a point on a wheel and rolled it along until the point returned to the same place, laying out the “edge” of the circle on the ground as it were, then the distance travelled by the wheel was in fixed proportion to how “wide” the circle was. Wider wheels traveled further with one rotation, smaller ones travelled less far, but in each case the distance traveled was just a bit more than three times the “width” of the circle. Some sort of fundamental constant of nature seemed to emerge from the properties of circles.
Using more precise terminology, the “edge” of a circle is its circumference and its “width” is its diameter. The distance between the centre of a circle and its circumference is the same in all directions – that’s the definition of a circle – and this quantity, half of the diameter, gives us another important dimension called the radius. As ever, a picture paints a thousand words:
If the various illustrated elements of our circle above have the magnitudes shown (so the diameter is , the radius is and the circumference is ), then we have:
What empirical evidence suggested to our forebears was that, for any circle, no matter what the size of its diameter () and circumference (), we have:
So at some point in antiquity, the existence of a constant relating the dimensions of a circle was discovered. It is estimated that this occurred somewhere of the order of 4,000 years ago. The ancients also knew that this ratio was not precisely three, but a bit more. A Babylonian tablet of around this age shows the state of the art in estimation back then ^{[4]}:
What the “three and a bit” was actually equal to, took some time to establish. This value was not initially referred to as either “pi” or “&’. The terminology was first coined by Mathematician William Jones as late as 1706. Jones’s idea was later popularised by our old friend Leonhard Euler. Taking their lead, we can now write:
For the sake of clarity, I will use this notation going forward, regardless of what people may have written historically.
Turning the clock back several centuries from Jones and Euler’s work in the 1700s, the 4,000 year old Babylonian estimate for was improved upon 2,000 years later by another giant of Mathematics, Archimedes of Syracuse. Archimedes developed lower and upper bounds for the value of as follows ^{[5]}:
It is probably via Archimedes showing that was less than that this fraction began to be mistaken for the actual value of ; we will return to and fractions later. How did Archimedes come up with these inequalities? The answer is with polygons. In the next section, albeit with some more modern elements, I will try to explain Archimedes’s approach.
Let’s consider a circle with unit radius (and hence, by our definition above, with a circumference of ) and to make what we are doing a little more clear colour it red. We will inscribe a regular octagon inside it (i.e. draw an octagon inside of the circle with each of its vertices touching the circle) and circumscribe a regular octagon outside it (i.e. draw an octagon outside of the circle with the midpoints of each side touching the circle). In either case, if we draw lines from two adjacent vertices to the shared centre of both figures, these form an angle which is obviously one eighth of the whole circle, i.e. one eighth of ^{[6]}. This is shown in the above diagram.
The idea is to calculate the perimeter of each of the two octagons. The inscribed octagon’s perimeter will be an underestimate of the circle’s circumference (an underestimate for ) and the circumscribed octagon’s perimeter will be an overestimate for it. Before proceeding further, let’s cover the same brief refresher on trigonometry that I have shared in other articles:
Aside:
Consider a generic rightangled triangle as in the figure below: Here the bottom lefthand angle has a value of , the hypotenuse has length , the adjacent side has length and the opposite side has a length of . We then have the following definitions: Here we will use the definitions of and 
With these equations lodged in our minds, let’s proceed. For both of our octagons, let’s draw a line from the midpoint of the topmost octagon side to the centre of the figures. This line is obviously perpendicular to the side in both cases. The line also bisects the angle we were discussing above and thus the angle subtended by this bisector is one sixteenth of . The preceding paragraph ably demonstrates why diagrams are so useful. Tale a look at the the following, it’s a lot easier to grasp:
In both cases we have a rightangled triangle with the bottom angle being equal to . In the inscribed case, the hypotenuse of this triangle is of length one. In the circumscribed case the adjacent side of the triangle is of length one. Our aim is to find the length of the opposite side, for the inscribed octagon and to find the length of the opposite side for the circumscribed octagon. By our trigonometric relations, we have:
Now is one of those angles whose trigonometric values end up being rather nice. We have:
If we recall that the circumference of the circle in question is , then we need to consider half of the perimeters of each octagon in order to get our estimates for . Both and are one eighth of these half perimeters. So if we want the length of the half perimeters for each octagon, we need to multiply each of and by eight. Doing this we get:
Evaluating these approximately we get:
As one is an upper bound for and one a lower bound, we can even take their average and assert:
This isn’t a very accurate approximation, but is at least in the right ballpark. It can be seen that, as we increase the number of sides of our polygons, we increase our accuracy. It was via such an approach that Archimedes was able to derive his estimates; he used a 96gon.
Following in Archimedes’s footsteps, if we take just the formula we established above for the half perimeter of the inscribed octagon, we can generalise it to say that the perimeter of an inscribed ngon would be:
Acknowledging the limits of Excel as a tool for doing anything bar addition, we can form the following table, which starts with an octagon and then doubles the number of sides as we go down to the next entry:
So we can see that for an inscribed 1,048,576gon ^{[7]} our estimate for matches the first 12 figures of its decimal expansion. That would be an approximation that is accurate enough for virtually any real world usage.
Borrowing the concept of a limit that we introduced in Euler’s Number, we can say that ^{[8]}:
Number 8 on our list of misconceptions about was that it cannot be represented exactly. Well – at least to Mathematicians – the above formula is just such a precise representation of . In a later section, we will meet some more of these. For now we will move on to consider whether or not the ratio we are discussing is a ratio or not. Confused? Hopefully the next section will make things clearer.
Elements of the beginning on this section have been adapted from the part of A Brief Taxonomy of Numbers relating to Rational Nunbers.
Misconceptions 1 and 2 at the start of this article related to being equal to (as opposed to approximately equal to) some Rational Number, with both and being offered as candidates. Misconceptions 4 and 5 relate to the decimal expansion of . There is a connection between these two areas and we will explore it in this section and the next. Let’s start with some definitions, just what is a Rational Number?
The set of Rational Numbers, denoted by , consists of fractions both positive and negative, so numbers like:
and so on.
However, a loose definition of fractions would include that abomination, , which as every schoolchild learns is the work of the devil and to be avoided at all costs. It would also include, for example:
which is probably not what any of us had in mind when using the term.
A way to avoid such complications is to define in terms of the numerators and denominators of its elements as follows:
Where is the Natural Numbers, and is the Integers, .
Aside:
Here we introduce some more notation. What we have above is a sort of recipe for creating the members of a set, namely the Rational Numbers. The bit directly after the first curly bracket and before the vertical bar shows the general pattern of set members. The bit after the vertical bar and before the second curly bracket provides restrictions on the general pattern. The vertical bar itself can be read as “such that”. For example, consider the set . This is the even Natural Numbers; the set of numbers of format such that is a Natural Number. So our definition of the Rationals can similarly be read as “numbers consisting of divided by , such that is an Integer and is a Natural Number”. 
With this definition under our belt, let’s also think a bit about decimal representations. We use decimals so frequently that maybe their meaning slips under the surface of consciousness. What does a decimal integer, say , actually mean? Well a moment’s thought yields the following:
So the position of each number indicates to which power of ten the number relates. Moving from right to left, position one (units) is the number times ten to the power zero (which equals one), position two (tens) is the number times ten to the power one, position three (hundreds) is the number times ten to the power two, and so on.
How about decimal fractions? Well, by extension from the above, means:
where we of course recall that: , i.e. the reciprocal of .
We have the same idea, but this time we are looking at the number of tenths, the number of hundredths, the number of thousandths and so on.
My aim here is going to be to tie Rational Numbers and decimal representations together. Let’s start with some examples.
Most will be entirely au fait with equalities like:
Some may also be wellaware of more complexlooking relationships like:
But how do we go about establishing these and what – if anything – do attributes of Rational Numbers tell us about their decimal representations? The answer lies in the precalculator disciple of long division. Let’s consider dividing by and see how we get on. We start of course with:
Clearly, does not go into , so we instead consider divided by (keeping track of the shift by a multiple of ten via using a decimal point), getting an answer of with a remainder of as follows:
Once more, does not go into , so we instead consider divided by , getting an answer of with a remainder of .
We continue this way until we have a remainder of . does not divide this, so we look at dividing by instead. This goes times remainder . See below:
At this point, we are back precisely where we started and – if we continue – we will simply repeat the sequence we have just established. It is pretty easy to see that:
which is the result we stated above.
Thinking about this more generically, if we perform long division of an Integer by a Natural Number , then at any point the remainder must be a number where . If we continue the long division for enough time, we must either hit a remainder of , in which case the decimal expansion halts like:
or we hit a remainder that we have already seen and the sequence repeats itself indefinitely as in the example above.
We can thus say that the decimal expansion of any Rational Number either halts, or goes on forever repeating a sequence ^{[9]}.
What about the other way round, is a number whose decimal expansion either halts or goes on forever repeating a sequence a Rational Number?
Well the halting case is easy. Consider, for example, , we can of course write this as:
This is clearly a Rational Number and we are done.
Now let’s think about a decimal expansion that repeats a sequence forever. Let’s take the example of:
Here the repeating section, , is of length 9. Given this, we now multiply by to get:
Subtract the larger number from the smaller and we have:
The forever repeating elements to the right of the decimal point just cancel. Rearranging we get:
This is clearly a Rational Number and so we have demonstrated that Rational Numbers and decimal numbers which either halt, or repeat a sequence forever are exactly the same thing.
So we have tied together Rational Numbers and at least certain types of decimals. An obvious question is “are all decimals Rational Numbers?”, we will explore this issue in the next section.
They appear to be Illogical Numbers Captain
We are going to start this section, by finding two numbers (with decimal expansions) that are not Rationals. The first we will construct explicitly to not be Rational, the second is a pretty simple number that frequently appears in everything from algebra to geometry, to trigonometry, and which we will show cannot be Rational. Either of these approaches demonstrates the existence of a new set of numbers, the Irrationals ^{[10]}. Let’s start be being constructive.
We have determined that any decimal fraction that either halts or repeats itself must be a Rational Number. So if we want to find a nonRational Number, then we need to consider a decimal expansion that neither terminates, nor repeats itself. The combination of these two properties means that such a number must have a decimal expansion that goes on forever and in which no continuously repeating set of digits ever occurs. This may sound like a daunting prospect, but it’s not really so bad. Let’s think about a number, , that starts like this (with spacing to emphasise the construction):
This clearly has a pattern, the decimal expansion is made by tagging on an ever increasing number of zeros, followed by a 1. However, equally clearly, no sequence of digits ever repeats. It is also evident that the decimal goes on forever. These two criteria taken together tell us that cannot be a Rational Number. Instead it is our first example of an Irrational Number.
This construction may seem a little artificial to some readers (it is nonetheless entirely valid), so let’s spend some time on something that is hopefully more concrete, a very simple rightangled triangle where the two shorter sides are both of length one. Here is this rather nonesoteric object:
We can calculate the value of the mysterious questionmark, the length of the hypotenuse of the triangle using Pythagoras as follows:
My claim is that in we have found another number which is not Rational.
This claim has some history. It was supposedly made by a member of the Pythagorean Sect, a certain Hippasus of Metapontum. It is rumoured that the Pythagorean Sect was less than happy with this result, feeling that it pointed to disharmony in their otherwise neat and tidy view of numbers. It has even been suggested that Hippasus may have paid with his life for advancing such a heretical idea. This story, apocryphal as it may be, is further proof – if it was ever needed – that being right is no guarantee that others will appreciate your position.
However posterity has taken the side of Hippasus and is indeed our second example of an Irrational Number. A sketched proof of this appeas in the box below:
Aside:
The content of this box has been adapted from a footnote to Chapter 4 of my book on Group Theory and Particle Physics, Glimpses of Symmetry. There are many proofs that is irrational. The one with the longest history (dating back into antiquity) proceeds by a tried and tested Mathematical technique called reductio ad absurdum (Latin for reducing [an argument] to absurdity). If we want to prove a statement X is true, we start by assuming that X is instead false. We then proceed to show that this assumption leads to something that is patently false, or a contradiction of some sort. As this conclusion cannot be true we are forced to accept that X must indeed be true. Let’s apply this approach to the Rationality of . If – contrary to our assertion – is a Rational Number, we can express it as the ratio between two Natural Numbers and ^{[11]} Further we can assume that at least one of and is odd. If both are even then: For some and Then: We can keep going on this way until one of our numbers is no longer divisible by two (this is just the process of reducing fractions that many will recall from school). Now if we rearrange our initial equation we see that: This means that must be even (it equals some number multiplied by two, which is the definition of even). What does this tell us about ? Well if we multiply an odd number by itself, we get another odd number: Which we can write as: Which is clearly an odd number as the first term is even and we add one to it. Because has been shown to be even, this means that there is some Natural Number such that: Substituting this in we get: Or: So: Which means that is also even by the same argument we have just employed above. So both and are even, which is a contradiction of our first step where we showed that we can assume that at least one of these is odd. The contradiction means that our assertion that is a Rational Number is false. Which means that is not a Rational Number. 
Based on our hardwon equivalence between Rational Numbers and decimal fractions that either repeat or terminate, we have now shown that the decimal expansion of goes on forever and never repeats itself. In fact it begins like this:
As per misconception 7 in our initial box, it seems that humble has a property often thought of as applying only to members of an exclusive club containing , ^{[12]} and sometimes ^{[13]}.
In fact, if we consider Rational Numbers and Irrational Numbers, while both sets are infinite (you will never get to the end of them), it can be shown ^{[14]} that the size of the Irrationals in some sense exceeds that of the Rationals; the Irrationals have a size which is at a different level of infinity to the Rationals. Collectively, the Rationals and Irrationals are known as the Real Numbers. One way of thinking about the Real Numbers is anything you can write as a decimal, either terminating or nonterminating, repeating or nonrepeating. Perhaps a better analogy is to think of a continuous line stretching both ways from a central , negative numbers to the left, positives to the right. If you pick any point at random on this line, its distance from may be an Integer, it may be a Rational Number, but it is guaranteed to be a Real Number. That is the Reals can be thought of as every single number on an infinite line with no gaps whatsoever.
The result we mention above, that the Irrational subset of the Reals is infinite to a higher degree than the Rational subset, means that – at least in one sense – the vast majority of Real Numbers have decimal expansions that are nonterminating and never repeat. Such numbers are more the rule than the exception.
The observant reader may at this point have made a reasonable inference; they may have drawn the conclusion that, like our two numbers above, is probably also Irrational. Congratulations to such people, you are right in your guess. In fact this result was proved in the 18th Century by Johann Heinrich Lambert. Neither Lambert’s proof, nor any of the more modern alternatives is entirely straightforward and the details are accordingly not repeated here ^{[15]}.
The reason that the decimal expansion of is thought to hold so many mysteries is precisely that is an Irrational Number. Just like , its decimal expansion is nonterminating and nonrepeating. However, as we have seen in this section, this is a wholly unremarkable property, one shared by the vast majority of Real Numbers. We will consider later if having a nonterminating and nonrepeating decimal expansion is sufficient to guarantee a number includes the entire works of Shakespeare as per misconception number 9 (HINT: It’s not).
Perhaps here is a good point to tick off another two misconceptions, number 3, is infinite, and number 6, is Rational in some other number base. The first of these is probably as much down to an issue with terminology as anything else. The decimal expansion of is indeed infinitely long by virtue of the fact that it is an Irrational number. However, itself is anything but infinite. It’s decimal expansion starts this means that we must have:
and are clearly finite numbers. If sits between them, it too must be finite, indeed we could have just used its Integer part to state:
There are no infinite numbers between and of course.
Moving on to different number bases, above we elucidated what the number meant in base (aka decimals), namely:
We could however choose a number other than as our base. For example hexadecimal numbers, or base numbers, are frequently used in computing. These are related to powers of and use the numerals:
We have to add more “numbers” like as in hexadecimal is equivalent to the decimal number . Then we need something to represent, for example, the hexadecimal number , which is in decimal. is the, essentially arbitrary, symbol that has been chosen to play this role.
So what is the meaning of the hexadecimal number ^{[16]}? Well it is as follows, noting that in hexadecimal is in decimal:
Another wellknown number base, also associated with computing, is of course binary, you can read more about binary in the Data & Analytics Dictionary.
Typically number bases are Natural Numbers. We could – if we were masochistic enough – define numbers base , in which case we would have in decimal equals base . However, this way lies insanity. In any Natural Number base, is still Irrational. In general number bases are different ways of representing the same number. So base is precisely the same number as base . Things like Irrationality are properties of a number, not how it is represented. While we have not presented Lambert’s proof that is Irrational, it does not depend on how is represented, the argument would be just as valid in hexadecimal. To get a sense of this, take a look at the proof that is Irrational that is provided above. What elements of this would be invalidated in a different number base?
Before moving on to the next section, it is worth pointing out one way that is actually different to , this is that is a Transcendental Number, whereas is not. Mathematicians seem to love applying misleading terms to concepts, Transcendental Numbers have hardly anything at all to do with meditation. Instead they are numbers which do not satisfy any finite expression such as:
where the are all Rational Numbers
Such an expression is known as a polynomial of degree . Polynomials are important in many areas of Mathematics, but we will not have anything else to say about them here.
The proof that is Transcendental was provided by German Mathematician Ferdinand von Lindemann in 1882 ^{[17]}. As an aside, this result was essentially a corollary of work that showed that is Transcendental; the connections between and are manifold. This same result also settled the problem of squaring the circle that dates to antiquity; as is Transcendental, this cannot be done.
Having addressed misconceptions 1, 2, 3, 4, 5, 6 and 7 so far. The next section will debunk misconception 8. The following one will address misconception 9, leaving us only misconception 10 to consider, which we will do in the final section.
This section presents a number of formulae for , ancient and modern. With a couple of exceptions, I will just state these rather than derive them; this article would expand exponentially (in both length and complexity) if I provided the background necessary to prove all of these equalities.
Towards the end of a previous section, we discovered one way in which Mathematicians can precisely define . This involved the concept of a limit. The same idea can generate many other formula precisely equal to . Rather than being limits of a formula, some of these are the sum of infinite series (another concept covered in Euler’s Number).
The first of these was discovered by Indian Mathematician Madhava of Sangamagrama in the 1300s. It is as follows ^{[18]}:
Where again the ellipsis denotes that the sum goes on for ever. With the way the history of Mathematics goes, results being lost and then rediscovered, this expression used to be known as the Leibniz series after the great 17th Century German Mathematician Gottfried Leibniz. It is now often called the Madhava–Leibniz series.
We can use the summation notation that we also introduced in Euler’s Number to write this more concisely as:
Madhava didn’t rest on his laurels, he also formulated the following ^{[19]}:
Which we can also write as:
An extremely famous formula for was the result of a Mathematician solving a then outstanding problem to do with the sum of reciprocals of the squares of Natural Numbers, the socalled Basel Problem. The answer proved to be:
Who was the person who provided this solution? None other that Leonhard Euler in 1734.
There are also many formulae for employing infinite products, such as:
We use a capital sigma – – to denote summation, the capital version of another Greek letter is used for products. Can you guess which one from this equivalent expression?
As stunning an example of selfreference as I have seen!
Another fruitful way of defining ; is via continued fractions. Instead of sums that go on for ever, these are fractions that go on for ever. Here is just one example:
A more direct way to define ; is via Integral Calculus ^{[20]}. Such an approach leads us to the formula:
Some of the details of this derivation are covered in the box below for those who may be interested:
Aside:
The equation for a circle of unit radius (or indeed any other radius) centered at the origin can be obtained via Pythagoras’s Theorem: If we consider a generic point on such a circle, with xcoordinate ; and ycoordinate ;, then we can form the rightangled triangle shown above, which has as its hypotenuse a radius of the circle (recall that this is of length ). Pythagoras then says that: If we wanted to write this in terms of and , we of course write: Rearranging this to express in terms of , we get: We can then calculate the length of the upper semicircle running from to using the generic formula for the length of a curve, , running between xcoordinates and : Here is the derivative of with respect to , something we defined in Euler’s Number. In our case, we have: and need to calculate its derivative. To do so, we employ something called the Chain Rule. Once more in Euler’s Number, we defined functions, we can rewrite the above by first definining two functions, and , such that: and These can be combined so we have: which means “do to the output of ”. The Chain Rule helps us with derivatives of combined functions and can be written as ^{[21]}: Note that, as both and are in general simpler than , they may be easier to differentiate separately than the combined expression. Applying the Chain Rule to our definitions of and above, we have: then, noting that and applying the rule for differentiating powers of that we again met in Euler’s Number, we have: If we combine these together using the Chain Rule definition above, we get: Plugging this into our formula for the length of a curve , we get: Recalling that the xcoordinates run from to , the length of the upper semicircle (which is of course just ), is given by: Which is the equality that we wanted to prove. 
However, my absolute favourite formula for is one that is much easier to arrive at. It starts with Euler’s Identity and progresses via some very simple rearrangements as follows:
Subtracting from both sides, we get:
Take the Natural Logarithm of both sides, the equation becomes:
Divide both sides by (which is of course equal to ^{[22]}) to get:
Multiplying both top and bottom of the righthand side by , we have:
Noting that of course , we finally have:
We have covered a lot of ways of expressing the ratio of a circle’s circumference to its diameter in the course of this section. However, we have not as yet covered the most obvious one. Let me introduce it to you: . That’s it, just . is a symbol that equates to a number, just the same way that or does. In the confusion caused by decimal expansions, infinite sums, continued fractions and definite integrals, we can sometimes lose track of the simple fact that all numbers are just symbols and that is a perfectly sound representation of the number we have been discussing in this article.
Here we get to our penultimate misconception about , that it includes, somewhere in the depths of its infinite decimal expansion, any string of numbers that you could think of. This property is typically equated with some actual number, say your telephone number, or – by mapping numbers to letters ^{[24]} – some actual text, like the Declaration of Independence or – canonically – the Complete Works of Shakespeare.
We know that the decimal expansion of any Irrational Number is a stream of digits which neither stops nor settles down to repeating a sequence. Assertions like the ones above are based on the assumption that a stream of digits with these properties must contain any finite length sequence of digits that you can name. For example, if we pick a sequence:
(which is just madeup gibberish of course), it must appear somewhere in the neverending stream of digits. This might be at the twentysix trillionth position. It might be later than that. The assumption is that you just have to wait long enough for the chosen sequence to appear.
For the decimal expansion of a generic Irrational Number, this assumption is clearly false. As a counterexample, let’s consider the Irrational Number that we constructed as our first example of the breed above:
Clearly the single digit is never going to appear in this, let alone the rest of our 30 digit sequence above.
We obviously need to think about things in a little more depth. We will do this by first considering some Probability Theory. Though some of the concepts introduced here are meant to apply in all Natural Number bases, we’ll focus on just decimals.
Let’s assume that we have a back box which randomly spits out a chain of digits in the range to . If the black box is a genuine random number generator, then we would expect that each of the ten digits, to , let’s say ^{[25]}, appears on average a tenth of the time:
We would also expect that, if we considered consecutive pairs of digits in the range to , let’s say ^{[26]}, these would appear on average one hundredth of the time:
Generalising, any string of digits would appear on average times.
Let’s think about this for a moment. If a sequence of a certain length, say 30 like the one above, appears on average every of the time, then this is the same as saying that of the time it does not appear. Or we might say that, we have to wait for an average of randomly generated digits to go by before we see our 30 digit sequence ^{[27]}. At one digit per second, that equates to:
or about years.
Our best estimate for the current age of the Universe is:
The average waiting time for our 30 digit sequence is more than two trillion times longer than this. It might take a while ^{[28]}.
To return to the main point, the essentially uniform distribution of digits, pairs of digits, triplets, etc. that we describe above is the essence of a definition of a Normal Number ^{[29]}. Here is an example of a number that is Normal in base 10:
That is the concatenation of all the Natural Numbers as a decimal fraction. This number, known as Champernowne’s Number, exhibits this randomlike distribution of its digits. Its decimal expansion clearly goes on forever and equally no sequence ever ends up repeating indefinitely. Therefore Champernowne’s Number is an Irrational Number and – in addition – is Normal base 10. Translating this into less technical language, Champernowne’s Number does indeed contain the entire text of Macbeth somewhere in its decimal expansion. While this may seem remarkable, it also contains the same text somewhere else, but with all references to Macbeth replaced by your name, or a version in which Macbeth lives happily ever after ^{[30]}.
It is however worth pointing out that, using the same numbers to text mapping that yielded Macbeth, the vast majority of this number would be total gibberish. When I say “vast majority”, I mean that – much as we found with our 30 digit number above – if you began counting at the Big Bang and took a second to read each digit starting from the decimal point, then you could well reach the present day before you stumble across:
Sadly, while we might be hoping for:
to appear next, it is overwhelmingly likely that the following text will be more like:
After a trillion Universe lifetimes, you might come across a sequence which completes the first line. After some more time goes by, you might find a series of numbers that gives you the first ten lines. Of course eventually you are guaranteed to find a sequence which yields the whole text, but the time taken to do so is most likely a number of seconds that makes TREE(3) ^{[31]} pale into insignificance.
Well this is all very interesting I hear you say, but this is not meant to be an article about Champernowne’s Number, it’s meant to be about . The $64,000 question is surely, “is Normal in base 10?” After all this build up, I appreciate that the answer may be rather anticlimactic, but – at the time of writing – we really don’t know. It is notoriously hard to determine whether a specific number is Normal or not.
Once more is not exceptional in having this property. We don’t know whether any of , , , or are Normal. With the odd way that Mathematical proof works, it can be shown that the vast majority of Irrational Numbers are Normal Numbers, we just don’t know which ones. There are only a handful of numbers that we can say are Normal and most of these have been specifically constructed to have this property.
Trillions of digits of have been calculated and a statistical analysis of these suggests that may well be normal (similar studies indicate the same about and the rest), but “suggestive” and “may well be” don’t cut a lot of ice in Mathematics. Unfortunately, while we suspect that may contain both the full text of Macbeth and all of the acceptance speeches for next year’s Oscars, we cannot definitely say that this is the case at present.
With these thoughts on the information that may or may not include, this section draws to a close. In the final section, I’ll look to summarise what we have learnt and to address our last misconception, number 10.
So, as advertised at the beginning of this article, the journey has been a long and winding one. At least from my perspective, some of the scenery along the way is just as interesting as the destination.
In terms of our list of misconceptions (crossreferenced in square brackets), we have explained that can never be equal to any Rational Number, let alone either of or [1 and 2]. Trivially we noted that, as it must be finite [3]. We were able to demonstrate that – like every other Irrational Number – ’s decimal expansion goes on forever [4] and never settles into a repeating sequence of digits [5]. We took a detour to establish that – again like every other Irrational Number – remains Irrational in any Natural Number base [6] (though obviously in base – not that any sane person would want to use this number system). As already referenced, through much of the above, we also showed that what holds for also holds for most other Irrational Numbers, most obviously surds like [7]. We had an entire section devoted to precise representations of , noting of course that the symbol is just such a representation [8]. Most recently, we explained that the jury is still out on whether or not the decimal expansion of contains your favourite piece of text or not, though indications are that maybe it does [9]. While ticking off this misconceptions has provided a framework, the Mathematical machinery we have developed and the results we have demonstrated or cited have, to my mind, been the real value of this article.
To close, let’s consider the final misconception, that the value of could be different in other Universes, or that we could play a game of What If?, as in “what would the world be like if ?”
The concept of other Universes used to be the preserve of Science Fiction. Today, several respectable theories in Cosmology and Physics suggest that the idea may not be so far fetched. Some interpretations of Quantum Mechanics ^{[32]} admit a Multiverse as does a core element of the Standard Model of Cosmology ^{[33]}. Through millennia of investigation and theorising, we have come up with a series of quantities that define key parts of the reality that we observe. Things like the Fine Structure Constant; the rest mass and other properties of the Quarks, Leptons and Bosons that make up the Standard Model of Particle Physics; the strength of the Gravitational Force; and a number of other more esoteric constatnts.
In alternative Universes, there is nothing to say that the value of these constants may not be different. Maybe protons are heavier and Gravity is weaker. Indeed one idea in Cosmology ^{[34]} suggests that – in a manner analogous to Darwinian Natural Selection – we are only here to observe the grandeur of our particular Universe precisely because it is one in which these fundamental constants have just the right values. If the values of all of these numbers could vary, surely the same goes for . The answer is a resounding “no”; let me try to explain why.
All of the constants I reference above are Physical constants, their values are derived by measurement. They are tied to our reality and in different Universes different measurements could be made. is not a Physical constant, it is a Mathematical constant. In none of our work on in this article has the charge of an electron been a key factor; the speed of light in a vacuum has never made an appearance in one of our equations. In general, while Mathematics is an uncannily useful tool for describing the Universe around us, one that “we neither understand nor deserve” ^{[35]}, it is also entirely independent of physical reality. A sentient being in any Universe we could imagine could conceive of a set of points equidistant from a central one and arrive at a definition of a circle, thereby discovering . To take a different definition, they could also determine the value of , where is the smallest positive number such that and find that it begins
Despite the best efforts of the Indiana Legislature back in 1897, we will always have:
or indeed any of the other many varied formulae for that we saw in an earlier section. Can ever be equal to anything except ? Well, as I put it in a Quora response a while back, the answer is:
 Not in some alternate timeline where Kirk is married to Spock
 Not a long time ago in a galaxy far far away
 Not in another dimension
 Not in a parallel Universe with different values of the fundamental Physical constants
 Not in a box / Not with a fox / Not in a house / Not with a mouse
Because, like the rest of Mathematics, the value of emerges from a set of logical steps, rather than from measurement, it has the same value in every Universe and indeed in none. Concepts like are in a sense supraUniversal. In a world subject to the oxymoron of constant change, I for one take some comfort in this indisputable and invariant fact.
<<  Two Escapees  
Part of the peterjamesthomas.com Maths and Science archive. 
Notes
^{ [1]}  If you want to experience these things first hand, try hanging out on in the pertinent part of Quora for an hour or so. 
^{ [2]}  Peter Trueb, based on an algorithm by Alexander J. Yee. 
^{ [3]}  Though this exhibit shows a halfturn, this is related to the confusion between diameters and radii when it comes to . 
^{ [4]}  The curly equals sign – – means “approximately equal to”. 
^{ [5]}  Archimedes actually did better than this. He came up with what we would now call an algorithm for generating estimates for using an approach I am about to explain. 
^{ [6]}  Against my normal practice, I am going to stick to degrees here rather than radians. 
^{ [7]}  Answers on a postcard for the Greek name of this figure, the best reply wins a prize. 
^{ [8]}  And of course that: This approaches from above, whereas the other expression approaches it from below. 
^{ [9]}  You can actually finesse the distinction by stating that , an equality that probably merits an article of its own at some point. 
^{ [10]}  And of course the Real Numbers, which consists of both the Rationals and Irrationals. 
^{ [11]}  Here as the number we are considering is positive, we can use the Natural Numbers in the numerator as opposed to the Integers. 
^{ [12]}  Euler’s Number. 
^{ [13]}  The Golden Ratio. 
^{ [14]}  See the section of A Brief Taxonomy of Numbers referring to Real Numbers. This actually shows that the size of the set of Real Numbers, , is a greater type of infinity than that of the Rational Numbers, . But, as the Irrational Numbers are just the set , this result follows simply. 
^{ [15]}  Wikipedia has a helpful page providing an overview of a number of proofs that is Irrational. 
^{ [16]}  is the HTML code for they shade of grey I use to fill boxes in this article. It equates to (214,214,214) in RGB format, precisely because in hexadecimal is equal to in decimal. 
^{ [17]}  Again the proof that is Transcendental is nontrivial. Wikipedia provides a proof of the more general Lindemann–Weierstrass Theorem for those who are interested. 
^{ [18]}  Madhava actually presented a more general result, this series is a special case of it. 
^{ [19]}  This second series starts to give an accurate estimate of after fewer terms than the Madhava–Leibniz series. 
^{ [20]}  Again Euler’s Number provided an introduction to the other kind of Calculus, Differential Calculus. An introduction to Integral Calculus will have to wait for another article. 
^{ [21]}  Using to denote the derivative of 
^{ [22]}  For a whistlestop introduction to , see the relevant section of A Brief Taxonomy of Numbers. For a more leisurely review, see Chapter 7 of Glimpses of Symmetry. 
^{ [23]}  Yes that is a Buffy reference: “Normal Again” – Season 6, Episode 17. See also: 25 Indispensable Business Terms 
^{ [24]}  This might be done by paring up consecutive digits and mapping them as follows: Or you could be lazy and just use the existing ASCII mapping, which requires three digits, e.g. . The mechanism doesn’t really matter of course, just that such mappings exist. 
^{ [25]}  Or any other digit, such as or . 
^{ [26]}  Or any other pair of digits, such as or . 
^{ [27]}  Of course it might be much less or much more, we are just talking about averages here. 
^{ [28]}  This might be British understatement. 
^{ [29]}  The full definition should insist that this applies in any Natural Number base. 
^{ [30]}  I am unsure about the copyright implications of this. Perhaps not an issue for older works, but surely Stephenie Meyer is going to sue as the entire Twilight Saga, but with more believable characters, will appear in the same number somewhere. 
^{ [31]}  A very big number indeed – for an introduction see TREE(3) is a big number, I mean really big by Josh Kerr. 
^{ [32]}  The Many Worlds Interpretation. 
^{ [33]}  Eternal Inflation. 
^{ [34]}  The Anthropic Principle. 
^{ [35]}  Part of a quote by Nobel Laureate, E.P. Wigner, which I have cited in full before on this site. 
Text & Images: © Peter James Thomas 2018. 