Lesson 12: The Poisson Distribution

In this lesson, we learn about another specially named discrete probability distribution, namely the Poisson distribution.

Objectives

Upon completion of this lesson, you should be able to:

12.1 - Poisson Distributions

12.1 - Poisson Distributions

Situation

Let the discrete random variable \(X\) denote the number of times an event occurs in an interval of time (or space). Then \(X\) may be a Poisson random variable with \(x=0, 1, 2, \ldots\)

Examples 12-1

  1. Let \(X\) equal the number of typos on a printed page. (This is an example of an interval of space — the space being the printed page.)
  2. Let \(X\) equal the number of cars passing through the intersection of Allen Street and College Avenue in one minute. (This is an example of an interval of time — the time being one minute.)
  3. Let \(X\) equal the number of Alaskan salmon caught in a squid driftnet. (This is again an example of an interval of space — the space being the squid driftnet.)
  4. Let \(X\) equal the number of customers at an ATM in 10-minute intervals.
  5. Let \(X\) equal the number of students arriving during office hours.
Poisson Random Variable

If \(X\) is a Poisson random variable, then the probability mass function is:

for \(x=0, 1, 2, \ldots\) and \(\lambda>0\), where \(\lambda\) will be shown later to be both the mean and the variance of \(X\).

Recall that the mathematical constant e is the unique real number such that the value of the derivative (slope of the tangent line) of the function \(f(x)=e^x\) at the point \(x=0\) is equal to 1. It turns out that the constant is irrational, but to five decimal places, it equals:

Also, note that there are (theoretically) an infinite number of possible Poisson distributions. Any specific Poisson distribution depends on the parameter \(\lambda\).

"Derivation" of the p.m.f.

Let \(X\) denote the number of events in a given continuous interval. Then \(X\) follows an approximate Poisson process with parameter \(\lambda>0\) if:

  1. The number of events occurring in non-overlapping intervals are independent.
  2. The probability of exactly one event in a short interval of length \(h=\frac\) is approximately \(\lambda h = \lambda \left(\frac\right)=\frac<\lambda>\).
  3. The probability of exactly two or more events in a short interval is essentially zero.

With these conditions in place, here's how the derivation of the p.m.f. of the Poisson distribution goes:

Now, let's make the intervals even smaller. That is, take the limit as \(n\) approaches infinity \(n\rightarrow \infty\) for fixed \(x\). Doing so, we get:

12.2 - Finding Poisson Probabilities

12.2 - Finding Poisson Probabilities

Example 12-2

printed page

Let \(X\) equal the number of typos on a printed page with a mean of 3 typos per page. What is the probability that a randomly selected page has at least one typo on it?

We can find the requested probability directly from the p.m.f. The probability that \(X\) is at least one is:

Therefore, using the p.m.f. to find \(P(X=0)\), we get:

That is, there is just over a 95% chance of finding at least one typo on a randomly selected page when the average number of typos per page is 3.

What is the probability that a randomly selected page has at most one typo on it?

The probability that \(X\) is at most one is:

Therefore, using the p.m.f., we get:

That is, there is just under a 20% chance of finding at most one typo on a randomly selected page when the average number of typos per page is 3.

Just as we used a cumulative probability table when looking for binomial probabilities, we could alternatively use a cumulative Poisson probability table, such as Table III in the back of your textbook. You should be able to use the formulas as well as the tables. If you take a look at the table, you'll see that it is three pages long. Let's just take a look at the top of the first page of the table in order to get a feel for how the table works:

In summary, to use the table in the back of your textbook, as well as that found in the back of most probability textbooks, to find cumulative Poisson probabilities, do the following:

  1. Find the column headed by the relevant \(\lambda\). Note that there are three rows containing \(\lambda\) on the first page of the table, two rows containing \(\lambda\) on the second page of the table, and one row containing \(\lambda\) on the last page of the table.
  2. Find the \(x\) in the first column on the left for which you want to find \(F(x)=P(X\le x)\).

Let's try it out on an example. If \(X\) equals the number of typos on a printed page with a mean of 3 typos per page, what is the probability that a randomly selected page has four typos on it?

Solution

The probability that a randomly selected page has four typos on it can be written as \(P(X=4)\). We can calculate \(P(X=4)\) by subtracting \(P(X\le 3)\) from \(P(X\le 4)\). To find \(P(X\le 3)\) and \(P(X\le 4)\) using the Poisson table, we:

  1. Find the column headed by \(\lambda=3\).
  2. Find the 3 in the first column on the left, since we want to find \(F(3)=P(X\le 3)\). And, find the 4 in the first column on the left, since we want to find \(F(4)=P(X\le 4)\).

Now, all we need to do is, first, read the probability value where the \(\lambda=3\) column and the \(x=3\) row intersect, and, second, read the probability value where the \(\lambda=3\) column and the \(x=4\) row intersect. What do you get?

\(\lambda = E(X)\)
x 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
0 0.905 0.819 0.741 0.670 0.607 0.549 0.497 0.449 0.407 0.368
1 0.995 0.982 0.963 0.938 0.910 0.878 0.844 0.809 0.772 0.736
2 1.000 0.999 0.996 0.992 0.986 0.970 0.966 0.953 0.937 0.920
3 1.000 1.000 1.000 0.999 0.998 0.997 0.994 0.991 0.987 0.981
4 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999 0.998 0.996
5 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999
6 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000
x 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2.0
0 0.333 0.301 0.273 0.247 0.223 0.202 0.183 0.165 0.150 0.135
1 0.699 0.663 0.627 0.592 0.558 0.525 0.493 0.463 0.434 0.403
2 0.900 0.879 0.857 0.833 0.809 0.830 0.757 0.731 0.704 0.677
3 0.974 0.966 0.957 0.946 0.934 0.921 0.907 0.981 0.875 0.857
4 0.995 0.992 0.989 0.986 0.981 0.976 0.970 0.964 0.956 0.947
5 0.999 0.998 0.998 0.997 0.996 0.994 0.992 0.990 0.987 0.983
6 1.000 1.000 1.000 0.999 0.999 0.999 0.998 0.997 0.997 0.995
7 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999 0.999
8 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000
x 2.2 2.4 2.6 2.8 3.0 3.2 3.4 3.6 3.8 4.0
0 0.111 0.091 0.074 0.061 0.050 0.051 0.033 0.027 0.022 0.018
1 0.355 0.308 0.267 0.231 0.199 0.171 0.147 0.126 0.107 0.092
2 0.623 0.570 0.518 0.469 0.423 0.380 0.340 0.303 0.269 0.238
3 0.819 0.779 0.736 0.692 0.647 0.603 0.558 0.515 0.473 0.433
4 0.928 0.904 0.887 0.848 0.815 0.781 0.744 0.706 0.668 0.629
5 0.975 0.964 0.951 0.935 0.916 0.895 0.871 0.844 0.816 0.785
6 0.993 0.988 0.983 0.976 0.966 0.955 0.942 0.927 0.909 0.889
7 0.998 0.997 0.995 0.992 0.988 0.983 0.977 0.969 0.960 0.949
8 1.000 0.990 0.999 0.998 0.993 0.994 0.992 0.988 0.984 0.979
9 1.000 1.000 1.000 0.999 0.999 0.998 0.997 0.996 0.994 0.992
10 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999 0.998 0.997
11 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999
12 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000
\(\lambda = E(X)\)
x 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
0 0.905 0.819 0.741 0.670 0.607 0.549 0.497 0.449 0.407 0.368
1 0.995 0.982 0.963 0.938 0.910 0.878 0.844 0.809 0.772 0.736
2 1.000 0.999 0.996 0.992 0.986 0.970 0.966 0.953 0.937 0.920
3 1.000 1.000 1.000 0.999 0.998 0.997 0.994 0.991 0.987 0.981
4 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999 0.998 0.996
5 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999
6 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000
x 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2.0
0 0.333 0.301 0.273 0.247 0.223 0.202 0.183 0.165 0.150 0.135
1 0.699 0.663 0.627 0.592 0.558 0.525 0.493 0.463 0.434 0.403
2 0.900 0.879 0.857 0.833 0.809 0.830 0.757 0.731 0.704 0.677
3 0.974 0.966 0.957 0.946 0.934 0.921 0.907 0.981 0.875 0.857
4 0.995 0.992 0.989 0.986 0.981 0.976 0.970 0.964 0.956 0.947
5 0.999 0.998 0.998 0.997 0.996 0.994 0.992 0.990 0.987 0.983
6 1.000 1.000 1.000 0.999 0.999 0.999 0.998 0.997 0.997 0.995
7 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999 0.999
8 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000
x 2.2 2.4 2.6 2.8 3.0 3.2 3.4 3.6 3.8 4.0
0 0.111 0.091 0.074 0.061 0.050 0.051 0.033 0.027 0.022 0.018
1 0.355 0.308 0.267 0.231 0.199 0.171 0.147 0.126 0.107 0.092
2 0.623 0.570 0.518 0.469 0.423 0.380 0.340 0.303 0.269 0.238
3 0.819 0.779 0.736 0.692 0.647 0.603 0.558 0.515 0.473 0.433
4 0.928 0.904 0.887 0.848 0.815 0.781 0.744 0.706 0.668 0.629
5 0.975 0.964 0.951 0.935 0.916 0.895 0.871 0.844 0.816 0.785
6 0.993 0.988 0.983 0.976 0.966 0.955 0.942 0.927 0.909 0.889
7 0.998 0.997 0.995 0.992 0.988 0.983 0.977 0.969 0.960 0.949
8 1.000 0.990 0.999 0.998 0.993 0.994 0.992 0.988 0.984 0.979
9 1.000 1.000 1.000 0.999 0.999 0.998 0.997 0.996 0.994 0.992
10 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999 0.998 0.997
11 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999
12 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000

The cumulative Poisson probability table tells us that finding \(P(X\le 4)=0.815\) and \(P(X\le 3)=0.647\). Therefore:

\(P(X=4)=P(X\le 4)-P(X\le 3)=0.815-0.647=0.168\)

That is, there is about a 17% chance that a randomly selected page would have four typos on it. Since it wouldn't take a lot of work in this case, you might want to verify that you'd get the same answer using the Poisson p.m.f.

What is the probability that three randomly selected pages have more than eight typos on it?

Solution

Solving this problem involves taking one additional step. Recall that \(X\) denotes the number of typos on one printed page. Then, let's define a new random variable \(Y\) that equals the number of typos on three printed pages. If the mean of \(X\) is 3 typos per page, then the mean of \(Y\) is:

Finding the desired probability then involves finding:

where \(P(Y\le 8)\) is found by looking on the Poisson table under the column headed by \(\lambda=9.0\) and the row headed by \(x=8\). What do you get?

x 6.5 7.0 7.5 8.0 8.5 9.0 9.5 10.0 10.5 11.0
0 0.002 0.001 0.001 0.000 0.000 0.000 0.000 0.000 0.000 0.000
1 0.011 0.007 0.005 0.003 0.002 0.001 0.001 0.000 0.000 0.000
2 0.043 0.030 0.020 0.014 0.009 0.006 0.004 0.003 0.002 0.001
3 0.112 0.082 0.059 0.042 0.030 0.021 0.015 0.010 0.007 0.005
4 0.224 0.173 0.132 0.100 0.074 0.055 0.040 0.029 0.021 0.015
5 0.369 0.301 0.241 0.191 0.150 0.116 0.089 0.067 0.050 0.015
6 0.527 0.450 0.378 0.313 0.256 0.207 0.165 0.130 0.102 0.079
7 0.673 0.599 0.525 0.453 0.386 0.324 0.269 0.220 0.179 0.143
8 0.792 0.729 0.662 0.593 0.523 0.456 0.392 0.333 0.279 0.232
9 0.877 0.830 0.776 0.717 0.653 0.587 0.522 0.458 0.397 0.341
10 0.933 0.901 0.862 0.816 0.763 0.706 0.645 0.583 0.521 0.460
11 0.966 0.947 0.921 0.888 0.849 0.803 0.752 0.697 0.639 0.579
12 0.984 0.973 0.957 0.936 0.909 0.876 0.836 0.792 0.742 0.689
13 0.993 0.987 0.978 0.966 0.949 0.926 0.898 0.864 0.825 0.781
14 0.997 0.994 0.990 0.983 0.973 0.959 0.940 0.917 0.888 0.854
15 0.999 0.998 0.995 0.992 0.986 0.978 0.967 0.951 0.932 0.907
16 1.000 0.999 0.998 0.996 0.993 0.989 0.982 0.973 0.960 0.944
17 1.000 1.000 0.999 0.998 0.997 0.995 0.991 0.986 0.978 0.968
18 1.000 1.000 1.000 0.999 0.999 0.998 0.096 0.993 0.988 0.982
19 1.000 1.000 1.000 1.000 0.999 0.999 0.998 0.997 0.994 0.991
20 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.998 0.997 0.995
21 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999 0.998
22 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999
23 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000
x 6.5 7.0 7.5 8.0 8.5 9.0 9.5 10.0 10.5 11.0
0 0.002 0.001 0.001 0.000 0.000 0.000 0.000 0.000 0.000 0.000
1 0.011 0.007 0.005 0.003 0.002 0.001 0.001 0.000 0.000 0.000
2 0.043 0.030 0.020 0.014 0.009 0.006 0.004 0.003 0.002 0.001
3 0.112 0.082 0.059 0.042 0.030 0.021 0.015 0.010 0.007 0.005
4 0.224 0.173 0.132 0.100 0.074 0.055 0.040 0.029 0.021 0.015
5 0.369 0.301 0.241 0.191 0.150 0.116 0.089 0.067 0.050 0.015
6 0.527 0.450 0.378 0.313 0.256 0.207 0.165 0.130 0.102 0.079
7 0.673 0.599 0.525 0.453 0.386 0.324 0.269 0.220 0.179 0.143
8 0.792 0.729 0.662 0.593 0.523 0.456 0.392 0.333 0.279 0.232
9 0.877 0.830 0.776 0.717 0.653 0.587 0.522 0.458 0.397 0.341
10 0.933 0.901 0.862 0.816 0.763 0.706 0.645 0.583 0.521 0.460
11 0.966 0.947 0.921 0.888 0.849 0.803 0.752 0.697 0.639 0.579
12 0.984 0.973 0.957 0.936 0.909 0.876 0.836 0.792 0.742 0.689
13 0.993 0.987 0.978 0.966 0.949 0.926 0.898 0.864 0.825 0.781
14 0.997 0.994 0.990 0.983 0.973 0.959 0.940 0.917 0.888 0.854
15 0.999 0.998 0.995 0.992 0.986 0.978 0.967 0.951 0.932 0.907
16 1.000 0.999 0.998 0.996 0.993 0.989 0.982 0.973 0.960 0.944
17 1.000 1.000 0.999 0.998 0.997 0.995 0.991 0.986 0.978 0.968
18 1.000 1.000 1.000 0.999 0.999 0.998 0.096 0.993 0.988 0.982
19 1.000 1.000 1.000 1.000 0.999 0.999 0.998 0.997 0.994 0.991
20 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.998 0.997 0.995
21 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999 0.998
22 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 0.999 0.999
23 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000

The cumulative Poisson probability table tells us that finding \(P(X\le 8)=0.456\). Therefore:

That is, there is a 54.4% chance that three randomly selected pages would have more than eight typos on it.

12.3 - Poisson Properties

12.3 - Poisson Properties

Just as we did for the other named discrete random variables we've studied, on this page, we present and verify four properties of a Poisson random variable.

Theorem

The probability mass function:

for a Poisson random variable \(X\) is a valid p.m.f.

Proof

Theorem

The moment generating function of a Poisson random variable \(X\) is:

Proof

Theorem

The mean of a Poisson random variable \(X\) is \(\lambda\).

Proof

Theorem

The variance of a Poisson random variable \(X\) is \(\lambda\).

Proof

12.4 - Approximating the Binomial Distribution

12.4 - Approximating the Binomial Distribution

Example 12-3

Christmas Tree lights

Five percent (5%) of Christmas tree light bulbs manufactured by a company are defective. The company's Quality Control Manager is quite concerned and therefore randomly samples 100 bulbs coming off of the assembly line. Let \(X\) denote the number in the sample that are defective. What is the probability that the sample contains at most three defective bulbs?

Solution

Can you convince yourself that \(X\) is a binomial random variable? Hmmm. let's see. there are two possible outcomes (defective or not), the 100 trials of selecting the bulbs from the assembly line can be assumed to be performed in an identical and independent manner, and the probability of getting a defective bulb can be assumed to be constant from trial to trial. So, \(X\) is indeed a binomial random variable. Well, calculating the probability is easy enough then. we just need to use the cumulative binomial table with \(n=100\) and \(p=0.05\). Oops! The table won't help us here, will it? Even many standard calculators would have trouble calculating the probability using the p.m.f.:

Using a statistical software package (Minitab), I was able to use the binomial p.m.f. to determine that:

But, if you recall the way that we derived the Poisson distribution. we started with the binomial distribution and took the limit as n approached infinity. So, it seems reasonable then that the Poisson p.m.f. would serve as a reasonable approximation to the binomial p.m.f. when your \(n\) is large (and therefore, \(p\) is small). Let's calculate \(P(X\le 3)\) using the Poisson distribution and see how close we get. Well, the probability of success was defined to be:

Therefore, the mean \(\lambda\) is:

So, we need to use our Poisson table to find \(P(X\le 3)\) when \(\lambda=100(0.05)=5\). What do you get?

x 4.2 4.4 4.6 4.8 5.0 5.2
0 0.015 0.012 0.010 0.008 0.007 0.006
1 0.078 0.033 0.056 0.048 0.040 0.034
2 0.210 0.185 0.163 0.143 0.125 0.109
3 0.395 0.359 0.326 0.294 0.265 0.238
4 0.590 0.551 0.513 0.476 0.440 0.406
5 0.753 0.720 0.686 0.651 0.616 0.581
6 0.867 0.844 0.818 0.791 0.762 0.732
7 0.936 0.921 0.905 0.887 0.867 0.845
8 0.972 0.964 0.955 0.944 0.932 0.918
9 0.989 0.985 0.980 0.975 0.968 0.960
10 0.996 0.994 0.992 0.990 0.986 0.982
11 0.999 0.998 0.997 0.996 0.995 0.993
12 1.000 0.999 0.999 0.999 0.998 0.997
13 1.000 1.000 1.000 1.000 0.999 0.999
14 1.000 1.000 1.000 1.000 1.000 1.000
15 1.000 1.000 1.000 1.000 1.000 1.000
16 1.000 1.000 1.000 1.000 1.000 1.000
x 4.2 4.4 4.6 4.8 5.0 5.2
0 0.015 0.012 0.010 0.008 0.007 0.006
1 0.078 0.033 0.056 0.048 0.040 0.034
2 0.210 0.185 0.163 0.143 0.125 0.109
3 0.395 0.359 0.326 0.294 0.265 0.238
4 0.590 0.551 0.513 0.476 0.440 0.406
5 0.753 0.720 0.686 0.651 0.616 0.581
6 0.867 0.844 0.818 0.791 0.762 0.732
7 0.936 0.921 0.905 0.887 0.867 0.845
8 0.972 0.964 0.955 0.944 0.932 0.918
9 0.989 0.985 0.980 0.975 0.968 0.960
10 0.996 0.994 0.992 0.990 0.986 0.982
11 0.999 0.998 0.997 0.996 0.995 0.993
12 1.000 0.999 0.999 0.999 0.998 0.997
13 1.000 1.000 1.000 1.000 0.999 0.999
14 1.000 1.000 1.000 1.000 1.000 1.000
15 1.000 1.000 1.000 1.000 1.000 1.000
16 1.000 1.000 1.000 1.000 1.000 1.000

The cumulative Poisson probability table tells us that finding \(P(X\le 3)=0.265\). That is, if there is a 5% defective rate, then there is a 26.5% chance that the a randomly selected batch of 100 bulbs will contain at most 3 defective bulbs. More importantly, since we have been talking here about using the Poisson distribution to approximate the binomial distribution, we should probably compare our results. When we used the binomial distribution, we deemed \(P(X\le 3)=0.258\), and when we used the Poisson distribution, we deemed \(P(X\le 3)=0.265\). Not too bad of an approximation, eh?

It is important to keep in mind that the Poisson approximation to the binomial distribution works well only when \(n\) is large and \(p\) is small. In general, the approximation works well if \(n\ge 20\) and \(p\le 0.05\), or if \(n\ge 100\) and \(p\le 0.10\).

Legend
[1] Link
Has Tooltip/Popover
Toggleable Visibility