This is “Probability Distributions for Discrete Random Variables”, section 4.2 from the book Beginning Statistics (v. 1.0).
For more information on the source of this book, or why it is available for free, please see the project's home page. You can browse or download additional books there. You may also download a PDF copy of this book (33 MB) or just this chapter (651 KB), suitable for printing or most e-readers, or a .zip file containing this book's HTML files (for use in a web browser offline).
Associated to each possible value x of a discrete random variable X is the probability $P\left(x\right)$ that X will take the value x in one trial of the experiment.
The probability distributionA list of each possible value and its probability. of a discrete random variable X is a list of each possible value of X together with the probability that X takes that value in one trial of the experiment.
The probabilities in the probability distribution of a random variable X must satisfy the following two conditions:
A fair coin is tossed twice. Let X be the number of heads that are observed.
Solution:
The possible values that X can take are 0, 1, and 2. Each of these numbers corresponds to an event in the sample space $S=\left\{hh,ht,th,tt\right\}$ of equally likely outcomes for this experiment: X = 0 to $\left\{tt\right\}$, X = 1 to $\left\{ht,th\right\}$, and X = 2 to $\left\{hh\right\}.$ The probability of each of these events, hence of the corresponding value of X, can be found simply by counting, to give
$$\begin{array}{cccc}x& 0& 1& 2\\ P\left(x\right)& 0.25& 0.50& 0.25\end{array}$$This table is the probability distribution of X.
“At least one head” is the event X ≥ 1, which is the union of the mutually exclusive events X = 1 and X = 2. Thus
$$P\left(X\ge 1\right)=P\left(1\right)+P\left(2\right)=0.50+0.25=0.75$$A histogram that graphically illustrates the probability distribution is given in Figure 4.1 "Probability Distribution for Tossing a Fair Coin Twice".
Figure 4.1 Probability Distribution for Tossing a Fair Coin Twice
A pair of fair dice is rolled. Let X denote the sum of the number of dots on the top faces.
Solution:
The sample space of equally likely outcomes is
$$\begin{array}{cccccc}11& 12& 13& 14& 15& 16\\ 21& 22& 23& 24& 25& 26\\ 31& 32& 33& 34& 35& 36\\ 41& 42& 43& 44& 45& 46\\ 51& 52& 53& 54& 55& 56\\ 61& 62& 63& 64& 65& 66\end{array}$$The possible values for X are the numbers 2 through 12. X = 2 is the event {11}, so $P\left(2\right)=1\u221536.$ X = 3 is the event {12,21}, so $P\left(3\right)=2\u221536.$ Continuing this way we obtain the table
$$\begin{array}{cccccccccccc}x& 2& 3& 4& 5& 6& 7& 8& 9& 10& 11& 12\\ P\left(x\right)& \frac{1}{36}& \frac{2}{36}& \frac{3}{36}& \frac{4}{36}& \frac{5}{36}& \frac{6}{36}& \frac{5}{36}& \frac{4}{36}& \frac{3}{36}& \frac{2}{36}& \frac{1}{36}\end{array}$$This table is the probability distribution of X.
The event X ≥ 9 is the union of the mutually exclusive events X = 9, X = 10, X = 11, and X = 12. Thus
$$P\left(X\ge 9\right)=P\left(9\right)+P\left(10\right)+P\left(11\right)+P\left(12\right)=\frac{4}{36}+\frac{3}{36}+\frac{2}{36}+\frac{1}{36}=\frac{10}{36}=0.2\stackrel{-}{7}$$Before we immediately jump to the conclusion that the probability that X takes an even value must be 0.5, note that X takes six different even values but only five different odd values. We compute
$$\begin{array}{lll}\hfill P\left(X\text{\hspace{0.17em}is\hspace{0.17em}even}\right)& =& P\left(2\right)+P\left(4\right)+P\left(6\right)+P\left(8\right)+P\left(10\right)+P\left(12\right)\\ & =& \frac{1}{36}+\frac{3}{36}+\frac{5}{36}+\frac{5}{36}+\frac{3}{36}+\frac{1}{36}=\frac{18}{36}=0.5\end{array}$$A histogram that graphically illustrates the probability distribution is given in Figure 4.2 "Probability Distribution for Tossing Two Fair Dice".
Figure 4.2 Probability Distribution for Tossing Two Fair Dice
The meanThe number $\mathrm{\Sigma}x\text{\hspace{0.17em}}P\left(x\right)$, measuring its average upon repeated trials. (also called the expected valueIts mean.) of a discrete random variable X is the number
$$\mathit{\mu}=E\left(X\right)=\mathrm{\Sigma}x\text{\hspace{0.17em}}P\left(x\right)$$The mean of a random variable may be interpreted as the average of the values assumed by the random variable in repeated trials of the experiment.
Find the mean of the discrete random variable X whose probability distribution is
$$\begin{array}{ccccc}x& \text{\u2212}2& 1& 2& 3.5\\ P\left(x\right)& 0.21& 0.34& 0.24& 0.21\end{array}$$Solution:
The formula in the definition gives
$$\begin{array}{lll}\hfill \mathit{\mu}& =& \mathrm{\Sigma}x\text{\hspace{0.17em}}P\left(x\right)\\ & =& \left(\text{\u2212}2\right)\xb70.21+\left(1\right)\xb70.34+\left(2\right)\xb70.24+\left(3.5\right)\xb70.21=1.135\end{array}$$A service organization in a large town organizes a raffle each month. One thousand raffle tickets are sold for $1 each. Each has an equal chance of winning. First prize is $300, second prize is $200, and third prize is $100. Let X denote the net gain from the purchase of one ticket.
Solution:
If a ticket is selected as the first prize winner, the net gain to the purchaser is the $300 prize less the $1 that was paid for the ticket, hence X = 300 − 1 = 299. There is one such ticket, so P(299) = 0.001. Applying the same “income minus outgo” principle to the second and third prize winners and to the 997 losing tickets yields the probability distribution:
$$\begin{array}{ccccc}x& 299& 199& 99& \text{\u2212}1\\ P\left(x\right)& 0.001& 0.001& 0.001& 0.997\end{array}$$Let W denote the event that a ticket is selected to win one of the prizes. Using the table
$$P\left(W\right)=P\left(299\right)+P\left(199\right)+P\left(99\right)=0.001+0.001+0.001=0.003$$Using the formula in the definition of expected value,
$$E\left(X\right)=299\xb70.001+199\xb70.001+99\xb70.001+\left(\text{\u2212}1\right)\xb70.997=\text{\u2212}0.4$$The negative value means that one loses money on the average. In particular, if someone were to buy tickets repeatedly, then although he would win now and then, on average he would lose 40 cents per ticket purchased.
The concept of expected value is also basic to the insurance industry, as the following simplified example illustrates.
A life insurance company will sell a $200,000 one-year term life insurance policy to an individual in a particular risk group for a premium of $195. Find the expected value to the company of a single policy if a person in this risk group has a 99.97% chance of surviving one year.
Solution:
Let X denote the net gain to the company from the sale of one such policy. There are two possibilities: the insured person lives the whole year or the insured person dies before the year is up. Applying the “income minus outgo” principle, in the former case the value of X is 195 − 0; in the latter case it is $195-\mathrm{200,000}=\text{\u2212}\mathrm{199,805}.$ Since the probability in the first case is 0.9997 and in the second case is $1-0.9997=0.0003$, the probability distribution for X is:
$$\begin{array}{ccc}x& 195& \text{\u2212}\mathrm{199,805}\\ P\left(x\right)& 0.9997& 0.0003\end{array}$$Therefore
$$E\left(X\right)=\mathrm{\Sigma}x\text{\hspace{0.17em}}P\left(x\right)=195\xb70.9997+\left(\text{\u2212}\mathrm{199,805}\right)\xb70.0003=135$$Occasionally (in fact, 3 times in 10,000) the company loses a large amount of money on a policy, but typically it gains $195, which by our computation of $E\left(X\right)$ works out to a net gain of $135 per policy sold, on average.
The variance, ${\mathit{\sigma}}^{2}$, of a discrete random variable X is the number
$${\mathit{\sigma}}^{2}=\mathrm{\Sigma}{\left(x-\mathit{\mu}\right)}^{2}\text{\hspace{0.17em}}P\left(x\right)$$which by algebra is equivalent to the formula
$${\mathit{\sigma}}^{2}=\left[\right.\mathrm{\Sigma}{x}^{2}\text{\hspace{0.17em}}P\left(x\right)\left]\right.-{\mathit{\mu}}^{2}$$The standard deviationThe number $\sqrt{{\displaystyle \mathrm{\Sigma}{(x-\mathit{\mu})}^{2}P\left(x\right)}}$ (also computed using $\sqrt{\left[{\displaystyle \mathrm{\Sigma}{x}^{2}P\left(x\right)}\text{\hspace{0.17em}}\right]-{\mathit{\mu}}^{2}}$), measuring its variability under repeated trials., σ, of a discrete random variable X is the square root of its variance, hence is given by the formulas
$$\mathit{\sigma}=\sqrt{\mathrm{\Sigma}{\left(x-\mathit{\mu}\right)}^{2}\text{\hspace{0.17em}}P\left(x\right)}=\sqrt{\left[\right.\mathrm{\Sigma}{x}^{2}\text{\hspace{0.17em}}P\left(x\right)\left]\right.-{\mathit{\mu}}^{2}}$$The variance and standard deviation of a discrete random variable X may be interpreted as measures of the variability of the values assumed by the random variable in repeated trials of the experiment. The units on the standard deviation match those of X.
A discrete random variable X has the following probability distribution:
$$\begin{array}{ccccc}x& \text{\u2212}1& 0& 1& 4\\ P\left(x\right)& 0.2& 0.5& a& 0.1\end{array}$$A histogram that graphically illustrates the probability distribution is given in Figure 4.3 "Probability Distribution of a Discrete Random Variable".
Figure 4.3 Probability Distribution of a Discrete Random Variable
Compute each of the following quantities.
Solution:
Using the formula in the definition of μ,
$$\mathit{\mu}=\mathrm{\Sigma}x\text{\hspace{0.17em}}P\left(x\right)=\left(\text{\u2212}1\right)\xb70.2+0\xb70.5+1\xb70.2+4\xb70.1=0.4$$Using the formula in the definition of ${\mathit{\sigma}}^{2}$ and the value of μ that was just computed,
$$\begin{array}{lll}\hfill {\mathit{\sigma}}^{2}& =& \mathrm{\Sigma}{\left(x-\mathit{\mu}\right)}^{2}\text{\hspace{0.17em}}P\left(x\right)\\ & =& {\left(\text{\u2212}1-0.4\right)}^{2}\xb70.2+{\left(0-0.4\right)}^{2}\xb70.5+{\left(1-0.4\right)}^{2}\xb70.2+{\left(4-0.4\right)}^{2}\xb70.1\\ & =& 1.84\end{array}$$Determine whether or not the table is a valid probability distribution of a discrete random variable. Explain fully.
$\begin{array}{ccccc}x& \text{\u2212}2& 0& 2& 4\\ P\left(x\right)& 0.3& 0.5& 0.2& 0.1\end{array}$
$\begin{array}{cccc}x& 0.5& 0.25& 0.25\\ P\left(x\right)& \text{\u2212}0.4& 0.6& 0.8\end{array}$
$\begin{array}{cccccc}x& 1.1& 2.5& 4.1& 4.6& 5.3\\ P\left(x\right)& 0.16& 0.14& 0.11& 0.27& 0.22\end{array}$
Determine whether or not the table is a valid probability distribution of a discrete random variable. Explain fully.
$\begin{array}{cccccc}x& 0& 1& 2& 3& 4\\ P\left(x\right)& \text{\u2212}0.25& 0.50& 0.35& 0.10& 0.30\end{array}$
$\begin{array}{cccc}x& 1& 2& 3\\ P\left(x\right)& 0.325& 0.406& 0.164\end{array}$
$\begin{array}{cccccc}x& 25& 26& 27& 28& 29\\ P\left(x\right)& 0.13& 0.27& 0.28& 0.18& 0.14\end{array}$
A discrete random variable X has the following probability distribution:
$$\begin{array}{cccccc}x& 77& 78& 79& 80& 81\\ P\left(x\right)& 0.15& 0.15& 0.20& 0.40& 0.10\end{array}$$Compute each of the following quantities.
A discrete random variable X has the following probability distribution:
$$\begin{array}{cccccc}x& 13& 18& 20& 24& 27\\ P\left(x\right)& 0.22& 0.25& 0.20& 0.17& 0.16\end{array}$$Compute each of the following quantities.
If each die in a pair is “loaded” so that one comes up half as often as it should, six comes up half again as often as it should, and the probabilities of the other faces are unaltered, then the probability distribution for the sum X of the number of dots on the top faces when the two are rolled is
$$\begin{array}{ccccccc}x& 2& 3& 4& 5& 6& 7\\ P\left(x\right)& \frac{1}{144}& \frac{4}{144}& \frac{8}{144}& \frac{12}{144}& \frac{16}{144}& \frac{22}{144}\end{array}$$ $$\begin{array}{cccccc}x& 8& 9& 10& 11& 12\\ P\left(x\right)& \frac{24}{144}& \frac{20}{144}& \frac{16}{144}& \frac{12}{144}& \frac{9}{144}\end{array}$$Compute each of the following.
Borachio works in an automotive tire factory. The number X of sound but blemished tires that he produces on a random day has the probability distribution
$$\begin{array}{ccccc}x& 2& 3& 4& 5\\ P\left(x\right)& 0.48& 0.36& 0.12& 0.04\end{array}$$In a hamster breeder's experience the number X of live pups in a litter of a female not over twelve months in age who has not borne a litter in the past six weeks has the probability distribution
$$\begin{array}{cccccccc}x& 3& 4& 5& 6& 7& 8& 9\\ P\left(x\right)& 0.04& 0.10& 0.26& 0.31& 0.22& 0.05& 0.02\end{array}$$The number X of days in the summer months that a construction crew cannot work because of the weather has the probability distribution
$$\begin{array}{cccccc}x& 6& 7& 8& 9& 10\\ P\left(x\right)& 0.03& 0.08& 0.15& 0.20& 0.19\end{array}$$ $$\begin{array}{ccccc}x& 11& 12& 13& 14\\ P\left(x\right)& 0.16& 0.10& 0.07& 0.02\end{array}$$Let X denote the number of boys in a randomly selected three-child family. Assuming that boys and girls are equally likely, construct the probability distribution of X.
Let X denote the number of times a fair coin lands heads in three tosses. Construct the probability distribution of X.
Five thousand lottery tickets are sold for $1 each. One ticket will win $1,000, two tickets will win $500 each, and ten tickets will win $100 each. Let X denote the net gain from the purchase of a randomly selected ticket.
Seven thousand lottery tickets are sold for $5 each. One ticket will win $2,000, two tickets will win $750 each, and five tickets will win $100 each. Let X denote the net gain from the purchase of a randomly selected ticket.
An insurance company will sell a $90,000 one-year term life insurance policy to an individual in a particular risk group for a premium of $478. Find the expected value to the company of a single policy if a person in this risk group has a 99.62% chance of surviving one year.
An insurance company will sell a $10,000 one-year term life insurance policy to an individual in a particular risk group for a premium of $368. Find the expected value to the company of a single policy if a person in this risk group has a 97.25% chance of surviving one year.
An insurance company estimates that the probability that an individual in a particular risk group will survive one year is 0.9825. Such a person wishes to buy a $150,000 one-year term life insurance policy. Let C denote how much the insurance company charges such a person for such a policy.
An insurance company estimates that the probability that an individual in a particular risk group will survive one year is 0.99. Such a person wishes to buy a $75,000 one-year term life insurance policy. Let C denote how much the insurance company charges such a person for such a policy.
A roulette wheel has 38 slots. Thirty-six slots are numbered from 1 to 36; half of them are red and half are black. The remaining two slots are numbered 0 and 00 and are green. In a $1 bet on red, the bettor pays $1 to play. If the ball lands in a red slot, he receives back the dollar he bet plus an additional dollar. If the ball does not land on red he loses his dollar. Let X denote the net gain to the bettor on one play of the game.
A roulette wheel has 38 slots. Thirty-six slots are numbered from 1 to 36; the remaining two slots are numbered 0 and 00. Suppose the “number” 00 is considered not to be even, but the number 0 is still even. In a $1 bet on even, the bettor pays $1 to play. If the ball lands in an even numbered slot, he receives back the dollar he bet plus an additional dollar. If the ball does not land on an even numbered slot, he loses his dollar. Let X denote the net gain to the bettor on one play of the game.
The time, to the nearest whole minute, that a city bus takes to go from one end of its route to the other has the probability distribution shown. As sometimes happens with probabilities computed as empirical relative frequencies, probabilities in the table add up only to a value other than 1.00 because of round-off error.
$$\begin{array}{ccccccc}x& 42& 43& 44& 45& 46& 47\\ P\left(x\right)& 0.10& 0.23& 0.34& 0.25& 0.05& 0.02\end{array}$$Tybalt receives in the mail an offer to enter a national sweepstakes. The prizes and chances of winning are listed in the offer as: $5 million, one chance in 65 million; $150,000, one chance in 6.5 million; $5,000, one chance in 650,000; and $1,000, one chance in 65,000. If it costs Tybalt 44 cents to mail his entry, what is the expected value of the sweepstakes to him?
The number X of nails in a randomly selected 1-pound box has the probability distribution shown. Find the average number of nails per pound.
$$\begin{array}{cccc}x& 100& 101& 102\\ P\left(x\right)& 0.01& 0.96& 0.03\end{array}$$Three fair dice are rolled at once. Let X denote the number of dice that land with the same number of dots on top as at least one other die. The probability distribution for X is
$$\begin{array}{cccc}x& 0& u& 3\\ P\left(x\right)& p& \frac{15}{36}& \frac{1}{36}\end{array}$$Two fair dice are rolled at once. Let X denote the difference in the number of dots that appear on the top faces of the two dice. Thus for example if a one and a five are rolled, X = 4, and if two sixes are rolled, X = 0.
A fair coin is tossed repeatedly until either it lands heads or a total of five tosses have been made, whichever comes first. Let X denote the number of tosses made.
A manufacturer receives a certain component from a supplier in shipments of 100 units. Two units in each shipment are selected at random and tested. If either one of the units is defective the shipment is rejected. Suppose a shipment has 5 defective units.
Shylock enters a local branch bank at 4:30 p.m. every payday, at which time there are always two tellers on duty. The number X of customers in the bank who are either at a teller window or are waiting in a single line for the next available teller has the following probability distribution.
$$\begin{array}{ccccc}x& 0& 1& 2& 3\\ P\left(x\right)& 0.135& 0.192& 0.284& 0.230\end{array}$$ $$\begin{array}{cccc}x& 4& 5& 6\\ P\left(x\right)& 0.103& 0.051& 0.005\end{array}$$The owner of a proposed outdoor theater must decide whether to include a cover that will allow shows to be performed in all weather conditions. Based on projected audience sizes and weather conditions, the probability distribution for the revenue X per night if the cover is not installed is
$$\begin{array}{lll}\text{Weather}\hfill & \hfill x\hfill & \hfill P\left(x\right)\hfill \\ \text{Clear}\hfill & \hfill \text{\$3000}\hfill & \hfill \text{0}\text{.61}\hfill \\ \text{Threatening}\hfill & \hfill \text{\$2800}\hfill & \hfill \text{0}\text{.17}\hfill \\ \text{Light\hspace{0.17em}rain}\hfill & \hfill \text{\$1975}\hfill & \hfill \text{0}\text{.11}\hfill \\ \text{Show-cancelling\hspace{0.17em}rain}\hfill & \hfill \text{\$0}\hfill & \hfill \text{0}\text{.11}\hfill \end{array}$$The additional cost of the cover is $410,000. The owner will have it built if this cost can be recovered from the increased revenue the cover affords in the first ten 90-night seasons.
$\begin{array}{ccccc}x& 0& 1& 2& 3\\ P\left(x\right)& 1/8& 3/8& 3/8& 1/8\end{array}$
$\begin{array}{ccccc}x& \text{\u2212}1& 999& 499& 99\\ P\left(x\right)& \frac{4987}{5000}& \frac{1}{5000}& \frac{2}{5000}& \frac{10}{5000}\end{array}$
136
$\begin{array}{ccc}x& C& C\text{\u2212}\mathrm{150,000}\\ P\left(x\right)& 0.9825& 0.0175\end{array}$
$\begin{array}{ccc}x& \text{\u2212}1& 1\\ P\left(x\right)& \frac{20}{38}& \frac{18}{38}\end{array}$
101.02
$\begin{array}{ccccccc}x& 0& 1& 2& 3& 4& 5\\ P\left(x\right)& \frac{6}{36}& \frac{10}{36}& \frac{8}{36}& \frac{6}{36}& \frac{4}{36}& \frac{2}{36}\end{array}$
$\begin{array}{cccc}x& 0& 1& 2\\ P\left(x\right)& 0.902& 0.096& 0.002\end{array}$