Normal Random Variables (6 of 6)

CUNY School of Professional Studies

Module 6: Probability and Probability Distributions

Normal Random Variables (6 of 6)

Learning OUTCOMES

Use a normal probability distribution to estimate probabilities and identify unusual events.

Now we use the simulation and the standard normal curve to find the probabilities associated with any normal density curve.

Example

Length of Human Pregnancy

The length (in days) of a randomly chosen human pregnancy is a normal random variable with μ = 266, σ = 16. So X = length of pregnancy (in days)

(a) What is the probability that a randomly chosen pregnancy will last less than 246 days?

We want P(X < 246). To find this probability, we first convert X = 246 to a z-score:

[latex]Z = \frac{246 - 266}{16} = \frac{-20}{16} = -1.25[/latex]

Now we can use the simulation to find P(Z < −1.25). This is the area under the normal probability curve to the left of Z = −1.25.

The probability that a randomly chosen pregnancy lasts less than 246 days is 0.1056. In other words, there is an 11% chance that a randomly selected pregnancy will last less than 246 days.

(b) Suppose a pregnant woman’s husband has scheduled his business trips so that he will be in town between the 235th and 295th days of her pregnancy. What is the probability that the birth will take place during that time?

Compute the z-scores for each of these x-values:

[latex]Z = \frac{235 - 266}{16} = \frac{-31}{16} = -1.94[/latex]

and

[latex]Z = \frac{295 - 266}{16} = \frac{29}{16} = 1.81[/latex]

Use the simulation to find the area under the standard normal curve between these two z-scores.

Using the simulation to find area between two z-scores

So the desired probability is 0.9387.

[latex]P(235 < X < 295) = P(-1.94 < Z < 1.81) = 0.9387[/latex]

There is about a 94% probability that he will be home for the birth. Looks like he planned well.

Try It

The previous examples all followed the same general form: Given values of a normal random variable, we found an associated probability. The two basic steps in the solution process were as follows:

Convert x-value to a z-score.
Use the simulation to find associated probability.

Process for finding associated probability given values of normal random variables

The next example is a different type of problem: Given a probability, we will find the associated value of the normal random variable. The solution process will go in reverse order.

Use a new simulation to convert statements about probabilities to statements about z-scores.
Convert z-scores to x-values.

These types of problems are informally called “work-backwards” problems. We will use a new simulation for these types of problems. The new simulation requires us to enter a probability and then gives us the associated z-score. This is backwards from the simulation we worked with previously where we entered a z-score to find a probability. We will use this simulation in the next example.

Click here to open this simulation in its own window.

Example

Work Backwards to Find X

Foot length (in inches) of a randomly chosen adult male is a normal random variable with a mean of 11 and standard deviation of 1.5. So X = foot length (inches).

(a) Suppose that an XL sock is designed to fit the largest 30% of men’s feet. What is the smallest foot length that fits an XL sock?

Step 1: Use the simulation to convert the probability to a statement about z-scores.

We want to mark off the largest 30% of the distribution, so the probability to the right of the z-score is 30%. This means that 70% of the area is to the left of the z-score.

Simulation showing corresponding z-score of 0.52

From the simulation, we can see that the corresponding z-score is 0.52.

Step 2: Now we need to convert this z-score to a foot length.

Before we calculate the length, note that the z-score is about 0.5, so the x-value will be about 0.5 standard deviations above the mean.

[latex]x = \mu + 0.5 \cdot \sigma = 11 + 0.5(1.5) = 11 + 0.75 = 11.75 \text{ inches}[/latex]

Conclusion: A foot length of 11.75 inches is the shortest foot for an XL sock.

(b) What is the first quartile for the men’s foot lengths?

Step 1: Use the simulation to convert this probability into a statement about z-scores.

We want to mark off the smallest 25% of the distribution, so the probability to the left of the z-score is 25%.

Simulation showing corresponding z-score of −0.67

From the simulation, we can see that the corresponding z-score is −0.67.

Step 2: Convert this z-score to a foot length. If X is the foot length we seek, then X is 0.67 standard deviations below the mean. That is,

[latex]x = \mu - 0.67 \cdot \sigma = 11 - 0.67(1.5) = 11 - 1.005 = 9.995 \text{ inches}[/latex]

Conclusion: The first quartile mark is 9.995 inches, so about 25% of the men’s feet are shorter than 10 inches.

Comments

In the preceding example (specifically step 2), we found the x-value by reasoning about the meaning of the z-score. We can also develop a formula for this process.

Recall the definition of z-score. In words, the z-score of an x-value is the number of standard deviations X is away from the mean. As a formula, this is

[latex]Z = \frac{x - \mu}{\sigma}[/latex]

We can solve this equation for X as follows:

[latex]\frac{x - \mu}{\sigma} = Z[/latex]

[latex]x - \mu = Z \cdot \sigma[/latex]

[latex]x = \mu + Z \cdot \sigma[/latex]

This gives us a formula for finding X from Z. You can use this formula in step 2 of a work-backwards problem.

Try It

Let’s Summarize

In “Continuous Random Variables,” we made the transition from discrete to continuous random variables. A continuous random variable is not limited to distinct values. It is a measurement such as foot length. We cannot display the probability distribution for a continuous random variable with a table or histogram. We use a density curve to assign probabilities to intervals of x-values. We use the area under the density curve to find probabilities.
We use a normal density curve to model the probability distribution for many variables, such as weight, shoe sizes, foot lengths, and other human physical characteristics. Normal curves are mathematical models. We use µ to represent the mean of a normal curve and σ to represent the standard deviation of a normal curve. We use Greek letters to remind us that the normal curve is not a distribution of real data. It is a mathematical model based on a mathematical equation. We use this mathematical model to represent the perfect bell-shaped distribution.
For a normal curve, the empirical rule for normal curves tells us that 68% of the observations fall within 1 standard deviation of the mean, 95% within 2 standard deviations of the mean, and 99.7% within 3 standard deviations of the mean.
To compare x-values from different distributions, we standardize the values by finding a z-score:

[latex]Z = \frac{x - \mu}{\sigma}[/latex]

A z-score measures how far X is from the mean in standard deviations. In other words, the z-score is the number of standard deviations X is from the mean of the distribution. For example, Z = 1 means the x-value is 1 standard deviation above the mean.
If we convert the x-values into z-scores, the distribution of z-scores is also a normal density curve. This curve is called the standard normal distribution. We use a simulation with the standard normal curve to find probabilities for any normal distribution.
We can also work backwards and find the x-value for a given probability. We used a different simulation to work backwards from probabilities to x-values. With this simulation, we found x-values corresponding to quartiles and percentiles.

Concepts in Statistics. Provided by: Open Learning Initiative. Located at: http://oli.cmu.edu. License: CC BY: Attribution

License

Icon for the Creative Commons Attribution-ShareAlike 4.0 International License