Loading...

Messages

Proposals

Stuck in your homework and missing deadline? Get urgent help in $10/Page with 24 hours deadline

Get Urgent Writing Help In Your Essays, Assignments, Homeworks, Dissertation, Thesis Or Coursework & Achieve A+ Grades.

Privacy Guaranteed - 100% Plagiarism Free Writing - Free Turnitin Report - Professional And Experienced Writers - 24/7 Online Support

The probability that z lies between 0.7 and 1.98

26/10/2021 Client: muhammad11 Deadline: 2 Day

CHAPTER 6

The Normal Distribution

Objectives

After completing this chapter, you should be able to

1Identify distributions as symmetric or skewed.

2Identify the properties of a normal distribution.

3Find the area under the standard normal distribution, given various z values.

4Find probabilities for a normally distributed variable by transforming it into a standard normal variable.

5Find specific data values for given percentages, using the standard normal distribution.

6Use the central limit theorem to solve problems involving sample means for large samples.

7Use the normal approximation to compute probabilities for a binomial variable.

Outline

Introduction

6–1Normal Distributions

6–2Applications of the Normal Distribution

6–3The Central Limit Theorem

6–4The Normal Approximation to the Binomial Distribution

Summary

Page 298

Statistics Today

What Is Normal?

Medical researchers have determined so-called normal intervals for a person’s blood pressure, cholesterol, triglycerides, and the like. For example, the normal range of systolic blood pressure is 110 to 140. The normal interval for a person’s triglycerides is from 30 to 200 milligrams per deciliter (mg/dl). By measuring these variables, a physician can determine if a patient’s vital statistics are within the normal interval or if some type of treatment is needed to correct a condition and avoid future illnesses. The question then is, How does one determine the so-called normal intervals? See Statistics Today—Revisited at the end of the chapter.

In this chapter, you will learn how researchers determine normal intervals for specific medical tests by using a normal distribution. You will see how the same methods are used to determine the lifetimes of batteries, the strength of ropes, and many other traits.

Introduction

Random variables can be either discrete or continuous. Discrete variables and their distributions were explained in Chapter 5. Recall that a discrete variable cannot assume all values between any two given values of the variables. On the other hand, a continuous variable can assume all values between any two given values of the variables. Examples of continuous variables are the heights of adult men, body temperatures of rats, and cholesterol levels of adults. Many continuous variables, such as the examples just mentioned, have distributions that are bell-shaped, and these are called approximately normally distributed variables. For example, if a researcher selects a random sample of 100 adult women, measures their heights, and constructs a histogram, the researcher gets a graph similar to the one shown in Figure 6–1(a). Now, if the researcher increases the sample size and decreases the width of the classes, the histograms will look like the ones shown in Figure 6–1(b) and (c). Finally, if it were possible to measure exactly the heights of all adult females in the United States and plot them, the histogram would approach what is called a normal distribution , shown in Figure 6–1(d). This distribution is also known as a bell curve or a Gaussian distribution, named for the German mathematician Carl Friedrich Gauss (1777–1855), who derived its equation.

Page 299

Figure 6–1

Histograms for the Distribution of Heights of Adult Women

Figure 6–2

Normal and Skewed Distributions

Objective 1

Identify distributions as symmetric or skewed.

No variable fits a normal distribution perfectly, since a normal distribution is a theoretical distribution. However, a normal distribution can be used to describe many variables, because the deviations from a normal distribution are very small. This concept will be explained further in Section 6–1.

When the data values are evenly distributed about the mean, a distribution is said to be a symmetric distribution. (A normal distribution is symmetric.) Figure 6–2(a) shows a symmetric distribution. When the majority of the data values fall to the left or right of the mean, the distribution is said to be skewed. When the majority of the data values fall to the right of the mean, the distribution is said to be a negatively or left-skewed distribution. The mean is to the left of the median, and the mean and the median are to the left of the mode. See Figure 6–2(b). When the majority of the data values fall to the left of the mean, a distribution is said to be a positively or right-skewed distribution. The mean falls to the right of the median, and both the mean and the median fall to the right of the mode. See Figure 6–2(c).

Page 300

The “tail” of the curve indicates the direction of skewness (right is positive, left is negative). These distributions can be compared with the ones shown in Figure 3–1 in Chapter 3. Both types follow the same principles.

This chapter will present the properties of a normal distribution and discuss its applications. Then a very important fact about a normal distribution called the central limit theorem will be explained. Finally, the chapter will explain how a normal distribution curve can be used as an approximation to other distributions, such as the binomial distribution. Since a binomial distribution is a discrete distribution, a correction for continuity may be employed when a normal distribution is used for its approximation.

Objective 2

Identify the properties of a normal distribution.

6–1Normal Distributions

In mathematics, curves can be represented by equations. For example, the equation of the circle shown in Figure 6–3 is x2 + y2 = r2, where r is the radius. A circle can be used to represent many physical objects, such as a wheel or a gear. Even though it is not possible to manufacture a wheel that is perfectly round, the equation and the properties of a circle can be used to study many aspects of the wheel, such as area, velocity, and acceleration. In a similar manner, the theoretical curve, called a normal distribution curve, can be used to study many variables that are not perfectly normally distributed but are nevertheless approximately normal.

Figure 6–3

Graph of a Circle and an Application

The mathematical equation for a normal distribution is

where

e ≈ 2.718 (≈ means “is approximately equal to”)

π ≈ 3.14

μ = population mean

σ = population standard deviation

This equation may look formidable, but in applied statistics, tables or technology is used for specific problems instead of the equation.

Another important consideration in applied statistics is that the area under a normal distribution curve is used more often than the values on the y axis. Therefore, when a normal distribution is pictured, the y axis is sometimes omitted.

Circles can be different sizes, depending on their diameters (or radii), and can be used to represent wheels of different sizes. Likewise, normal curves have different shapes and can be used to represent different variables.

The shape and position of a normal distribution curve depend on two parameters, the mean and the standard deviation. Each normally distributed variable has its own normal distribution curve, which depends on the values of the variable’s mean and standard deviation. Figure 6–4(a) shows two normal distributions with the same mean values but different standard deviations. The larger the standard deviation, the more dispersed, or spread out, the distribution is. Figure 6–4(b) shows two normal distributions with the same standard deviation but with different means. These curves have the same shapes but are located at different positions on the x axis. Figure 6–4(c) shows two normal distributions with different means and different standard deviations.

Page 301

Figure 6–4

Shapes of Normal Distributions

Historical Note

The discovery of the equation for a normal distribution can be traced to three mathematicians. In 1733, the French mathematician Abraham DeMoivre derived an equation for a normal distribution based on the random variation of the number of heads appearing when a large number of coins were tossed. Not realizing any connection with the naturally occurring variables, he showed this formula to only a few friends. About 100 years later, two mathematicians, Pierre Laplace in France and Carl Gauss in Germany, derived the equation of the normal curve independently and without any knowledge of DeMoivre’s work. In 1924, Karl Pearson found that DeMoivre had discovered the formula before Laplace or Gauss.

A normal distribution is a continuous, symmetric, bell-shaped distribution of a variable.

The properties of a normal distribution, including those mentioned in the definition, are explained next.

Summary of the Properties of the Theoretical Normal Distribution

1.A normal distribution curve is bell-shaped.

2.The mean, median, and mode are equal and are located at the center of the distribution.

3.A normal distribution curve is unimodal (i.e., it has only one mode).

4.The curve is symmetric about the mean, which is equivalent to saying that its shape is the same on both sides of a vertical line passing through the center.

5.The curve is continuous; that is, there are no gaps or holes. For each value of X, there is a corresponding value of Y.

6.The curve never touches the x axis. Theoretically, no matter how far in either direction the curve extends, it never meets the x axis—but it gets increasingly closer.

7.The total area under a normal distribution curve is equal to 1.00, or 100%. This fact may seem unusual, since the curve never touches the x axis, but one can prove it mathematically by using calculus. (The proof is beyond the scope of this textbook.)

8.The area under the part of a normal curve that lies within 1 standard deviation of the mean is approximately 0.68, or 68%; within 2 standard deviations, about 0.95, or 95%; and within 3 standard deviations, about 0.997, or 99.7%. See Figure 6–5, which also shows the area in each region.

The values given in item 8 of the summary follow the empirical rule for data given in Section 3–2.

You must know these properties in order to solve problems involving distributions that are approximately normal.

Page 302

Figure 6–5

Areas Under a Normal Distribution Curve

Objective 3

Find the area under the standard normal distribution, given various z values.

The Standard Normal Distribution

Since each normally distributed variable has its own mean and standard deviation, as stated earlier, the shape and location of these curves will vary. In practical applications, then, you would have to have a table of areas under the curve for each variable. To simplify this situation, statisticians use what is called the standard normal distribution .

The standard normal distribution is a normal distribution with a mean of 0 and a standard deviation of 1.

The standard normal distribution is shown in Figure 6–6.

The values under the curve indicate the proportion of area in each section. For example, the area between the mean and 1 standard deviation above or below the mean is about 0.3413, or 34.13%.

The formula for the standard normal distribution is

All normally distributed variables can be transformed into the standard normally distributed variable by using the formula for the standard score:

This is the same formula used in Section 3–3. The use of this formula will be explained in Section 6–3.

As stated earlier, the area under a normal distribution curve is used to solve practical application problems, such as finding the percentage of adult women whose height is between 5 feet 4 inches and 5 feet 7 inches, or finding the probability that a new battery will last longer than 4 years. Hence, the major emphasis of this section will be to show the procedure for finding the area under the standard normal distribution curve for any z value. The applications will be shown in Section 6–2. Once the X values are transformed by using the preceding formula, they are called z values. The z value is actually the number of standard deviations that a particular X value is away from the mean. Table E in Appendix C gives the area (to four decimal places) under the standard normal curve for any z value from –3.49 to 3.49.

Page 303

Figure 6–6

Standard Normal Distribution

Interesting Fact

Bell-shaped distributions occurred quite often in early coin-tossing and die-rolling experiments.

Finding Areas Under the Standard Normal Distribution Curve

For the solution of problems using the standard normal distribution, a four-step procedure is recommended with the use of the Procedure Table shown.

Step 1Draw the normal distribution curve and shade the area.

Step 2Find the appropriate figure in the Procedure Table and follow the directions given.

There are three basic types of problems, and all three are summarized in the Procedure Table. Note that this table is presented as an aid in understanding how to use the standard normal distribution table and in visualizing the problems. After learning the procedures, you should not find it necessary to refer to the Procedure Table for every problem.

Procedure Table

Finding the Area Under the Standard Normal Distribution Curve

1.To the left of any z value:

Look up the z value in the table and use the area given.

2.To the right of any z value:

Look up the z value and subtract the area from 1.

3.Between any two z values:

Look up both z values and subtract the corresponding areas.

Page 304

Figure 6–7

Table E Area Value for z = 1.39

Table E in Appendix C gives the area under the normal distribution curve to the left of any z value given in two decimal places. For example, the area to the left of a z value of 1.39 is found by looking up 1.3 in the left column and 0.09 in the top row. Where the two lines meet gives an area of 0.9177. See Figure 6–7.

Example 6–1

Find the area to the left of z = 1.99.

Solution

Step 1Draw the figure. The desired area is shown in Figure 6–8.

Figure 6–8

Area Under the Standard Normal Distribution Curve for Example 6–1

Step 2We are looking for the area under the standard normal distribution curve to the left of z = 1.99. Since this is an example of the first case, look up the area in the table. It is 0.9767. Hence 97.67% of the area is less than z = 1.99.

Example 6–2

Find the area to the right of z = –1.16.

Solution

Step 1Draw the figure. The desired area is shown in Figure 6–9.

Figure 6–9

Area Under the Standard Normal Distribution Curve for Example 6–2

Page 305

Step 2We are looking for the area to the right of z = –1.16. This is an example of the second case. Look up the area for z = –1.16. It is 0.3770. Subtract it from 1.000. 1.000 – 0.1230 = 0.8770. Hence 87.70% of the area under the standard normal distribution curve is to the left of z = –1.16.

Example 6–3

Find the area between z = +1.68 and z = –1.37.

Solution

Step 1Draw the figure as shown. The desired area is shown in Figure 6–10.

Figure 6–10

Area Under the Standard Normal Distribution Curve for Example 6–3

Step 2Since the area desired is between two given z values, look up the areas corresponding to the two z values and subtract the smaller area from the larger area. (Do not subtract the z values.) The area for z = +1.68 is 0.9535, and the area for z = –1.37 is 0.0853. The area between the two z values is 0. 9535 – 0.0853 = 0.8682 or 86.82%.

A Normal Distribution Curve as a Probability Distribution Curve

A normal distribution curve can be used as a probability distribution curve for normally distributed variables. Recall that a normal distribution is a continuous distribution, as opposed to a discrete probability distribution, as explained in Chapter 5. The fact that it is continuous means that there are no gaps in the curve. In other words, for every z value on the x axis, there is a corresponding height, or frequency, value.

The area under the standard normal distribution curve can also be thought of as a probability. That is, if it were possible to select any z value at random, the probability of choosing one, say, between 0 and 2.00 would be the same as the area under the curve between 0 and 2.00. In this case, the area is 0.4772. Therefore, the probability of randomly selecting any z value between 0 and 2.00 is 0.4772. The problems involving probability are solved in the same manner as the previous examples involving areas in this section. For example, if the problem is to find the probability of selecting a z value between 2.25 and 2.94, solve it by using the method shown in case 3 of the Procedure Table.

For probabilities, a special notation is used. For example, if the problem is to find the probability of any z value between 0 and 2.32, this probability is written as P(0 < z < 2.32).

Page 306

Note: In a continuous distribution, the probability of any exact z value is 0 since the area would be represented by a vertical line above the value. But vertical lines in theory have no area. So P(a ≤ z ≤ b) = P(a < z < b).

Example 6–4

Find the probability for each.

a.P(0 < z < 2.32)

b.P(z < 1.65)

c.P(z > 1.91)

Solution

a.P(0 < z < 2.32) means to find the area under the standard normal distribution curve between 0 and 2.32. First look up the area corresponding to 2.32. It is 0.9898. Then look up the area corresponding to z = 0. It is 0.500. Subtract the two areas: 0.9898 – 0.5000 = 0.4898. Hence the probability is 0.4898, or 48.98%. This is shown in Figure 6–11.

Figure 6–11

Area Under the Standard Normal Distribution Curve for Part a of Example 6–4

b.P(z < 1.65) is represented in Figure 6–12. Look up the area corresponding to z = 1.65 in Table E. It is 0.9505. Hence, P(z < 1.65) = 0.9505, or 95.05%.

Figure 6–12

Area Under the Standard Normal Distribution Curve for Part b of Example 6–4

c.P(z > 1.91) is shown in Figure 6–13. Look up the area that corresponds to z = 1.91. It is 0.9719. Then subtract this area from 1.0000. P(z < 1.91) = 1.0000 – 0.9719 = 0.0281, or 2.81%.

Page 307

Figure 6–13

Area Under the Standard Normal Distribution Curve for Part c of Example 6–4

Sometimes, one must find a specific z value for a given area under the standard normal distribution curve. The procedure is to work backward, using Table E.

Since Table E is cumulative, it is necessary to locate the cumulative area up to a given z value. Example 6–5 shows this.

Example 6–5

Find the z value such that the area under the standard normal distribution curve between 0 and the z value is 0.2123.

Solution

Draw the figure. The area is shown in Figure 6–14.

Figure 6–14

Area Under the Standard Normal Distribution Curve for Example 6–5

In this case it is necessary to add 0.5000 to the given area of 0.2123 to get the cumulative area of 0.7123. Look up the area in Table E. The value in the left column is 0.5, and the top value is 0.06, so the positive z value for the area z = 0.56.

Next, find the area in Table E, as shown in Figure 6–15. Then read the correct z value in the left column as 0.5 and in the top row as 0.06, and add these two values to get 0.56.

Figure 6–15

Finding the z Value from Table E for Example 6–5

Page 308

Figure 6–16

The Relationship Between Area and Probability

If the exact area cannot be found, use the closest value. For example, if you wanted to find the z value for an area 0.9241, the closest area is 0.9236, which gives a z value of 1.43. See Table E in Appendix C.

The rationale for using an area under a continuous curve to determine a probability can be understood by considering the example of a watch that is powered by a battery. When the battery goes dead, what is the probability that the minute hand will stop somewhere between the numbers 2 and 5 on the face of the watch? In this case, the values of the variable constitute a continuous variable since the hour hand can stop anywhere on the dial’s face between 0 and 12 (one revolution of the minute hand). Hence, the sample space can be considered to be 12 units long, and the distance between the numbers 2 and 5 is 5 – 2, or 3 units. Hence, the probability that the minute hand stops on a number between 2 and 5 is See Figure 6–16(a).

The problem could also be solved by using a graph of a continuous variable. Let us assume that since the watch can stop anytime at random, the values where the minute hand would land are spread evenly over the range of 0 through 12. The graph would then consist of a continuous uniform distribution with a range of 12 units. Now if we require the area under the curve to be 1 (like the area under the standard normal distribution), the height of the rectangle formed by the curve and the x axis would need to be . The reason is that the area of a rectangle is equal to the base times the height. If the base is 12 units long, then the height has to be since 12 · = 1.

The area of the rectangle with a base from 2 through 5 would be 3 · , or . See Figure 6–16(b). Notice that the area of the small rectangle is the same as the probability found previously. Hence the area of this rectangle corresponds to the probability of this event. The same reasoning can be applied to the standard normal distribution curve shown in Example 6–5.

Finding the area under the standard normal distribution curve is the first step in solving a wide variety of practical applications in which the variables are normally distributed. Some of these applications will be presented in Section 6–2.

Page 309

Applying the Concepts 6–1

Assessing Normality

Many times in statistics it is necessary to see if a set of data values is approximately normally distributed. There are special techniques that can be used. One technique is to draw a histogram for the data and see if it is approximately bell-shaped. (Note: It does not have to be exactly symmetric to be bell-shaped.)

The numbers of branches of the 50 top libraries are shown.

Source: The World Almanac and Book of Facts.

1.Construct a frequency distribution for the data.

2.Construct a histogram for the data.

3.Describe the shape of the histogram.

4.Based on your answer to question 3, do you feel that the distribution is approximately normal?

In addition to the histogram, distributions that are approximately normal have about 68% of the values fall within 1 standard deviation of the mean, about 95% of the data values fall within 2 standard deviations of the mean, and almost 100% of the data values fall within 3 standard deviations of the mean. (See Figure 6–5.)

5.Find the mean and standard deviation for the data.

6.What percent of the data values fall within 1 standard deviation of the mean?

7.What percent of the data values fall within 2 standard deviations of the mean?

8.What percent of the data values fall within 3 standard deviations of the mean?

9.How do your answers to questions 6, 7, and 8 compare to 68, 95, and 100%, respectively?

10.Does your answer help support the conclusion you reached in question 4? Explain.

(More techniques for assessing normality are explained in Section 6–2.) See pages 351 and 352 for the answers.

Exercises 6-1

1.What are the characteristics of a normal distribution?

2.Why is the standard normal distribution important in statistical analysis?

3.What is the total area under the standard normal distribution curve?

4.What percentage of the area falls below the mean? Above the mean?

5.About what percentage of the area under the normal distribution curve falls within 1 standard deviation above and below the mean? 2 standard deviations? 3 standard deviations?

For Exercises 6 through 25, find the area under the standard normal distribution curve.

6.Between z = 0 and z = 1.89

7.Between z = 0 and z = 0.75

8.Between z = 0 and z = –0.46

9.Between z = 0 and z = –2.07

10.To the right of z = 2.11

11.To the right of z = 0.23

12.To the left of z = –0.75

13.To the left of z = –1.43

Page 310

14.Between z = 1.23 and z = 1.90

15.Between z = 1.05 and z = 1.78

16.Between z = –0.96 and z = –0.36

17.Between z = –1.56 and z = –1.83

18.Between z = 0.24 and z = –1.12

19.Between z = – 1.53 and z = –2.08

20.To the left of z = 1.31

21.To the left of z = 2.11

22.To the right of z = –1.92

23.To the right of z = –0.25

24.To the left of z = –2.15 and to the right of z = 1.62

25.To the right of z = 1.92 and to the left of z = –0.44

In Exercises 26 through 39, find the probabilities for each, using the standard normal distribution.

26. P(0 < z < 1.96)

27. P(0 < z < 0.67)

28. P(–1.23 < z < 0)

29. P(–1.57 < z < 0)

30. P(z > 0.82)

31. P(z > 2.83)

32. P(z < –1.77)

33. P(z < –1.21)

34. P(–0.20 < z < 1.56)

35. P(–2.46 < z < 1.74)

36. P(1.12 < z < 1.43)

37. P(1.46 < z < 2.97)

38. P(z > –1.43)

39. P(z < 1.42)

For Exercises 40 through 45, find the z value that corresponds to the given area.

40.

41.

42.

43.

44.

45.

46.Find the z value to the right of the mean so that

a.54.78% of the area under the distribution curve lies to the left of it.

b.69.85% of the area under the distribution curve lies to the left of it.

c.88.10% of the area under the distribution curve lies to the left of it.

47.Find the z value to the left of the mean so that

a.98.87% of the area under the distribution curve lies to the right of it.

b.82.12% of the area under the distribution curve lies to the right of it.

c.60.64% of the area under the distribution curve lies to the right of it.

Page 311

48.Find two z values so that 48% of the middle area is bounded by them.

49.Find two z values, one positive and one negative, that are equidistant from the mean so that the areas in the two tails total the following values.

a.5%

b.10%

c.1%

Extending the Concepts

50.In the standard normal distribution, find the values of z for the 75th, 80th, and 92nd percentiles.

51.Find P(– 1 < z < 1), P(–2 < z < 2), and P(–3 < z < 3). How do these values compare with the empirical rule?

52.Find z0 such that P(z > z0) = 0.1234.

53.Find z0 such that P(–1.2 < z < z0) = 0.8671.

54.Find z0 such that P(z0 < z < 2.5) = 0.7672.

55.Find z0 such that the area between z0 and z = –0.5 is 0. 2345 (two answers).

56.Find z0 such that P(–z0 < z < z0) = 0.76.

57.Find the equation for the standard normal distribution by substituting 0 for µ and 1 for σ in the equation

58.Graph by hand the standard normal distribution by using the formula derived in Exercise 57. Let π ≈ 3.14 and e ≈ 2.718. Use X values of –2, –1.5, – 1, –0.5, 0, 5, 1, 1.5, and 2. (Use a calculator to compute the y values.)

Technology Step by Step

MINITAB

Step by Step

The Standard Normal Distribution

It is possible to determine the height of the density curve given a value of z, the cumulative area given a value of z, or a z value given a cumulative area. Examples are from Table E in Appendix C.

Find the Area to the Left of z = 1.39

1.Select Calc>Probability Distributions>Normal. There are three options.

2.Click the button for Cumulative probability. In the center section, the mean and standard deviation for the standard normal distribution are the defaults. The mean should be 0, and the standard deviation should be 1.

3.Click the button for Input Constant, then click inside the text box and type in 1.39. Leave the storage box empty.

4.Click [OK].

Page 312

Cumulative Distribution Function

Normal with mean = 0 and standard deviation = 1

x P( X <= x )

1.39 0.917736

The graph is not shown in the output.

The session window displays the result, 0.917736. If you choose the optional storage, type in a variable name such as K1. The result will be stored in the constant and will not be in the session window.

Find the Area to the Right of –2.06

1.Select Calc>Probability Distributions>Normal.

2.Click the button for Cumulative probability.

3.Click the button for Input Constant, then enter –2.06 in the text box. Do not forget the minus sign.

4.Click in the text box for Optional storage and type K1.

5.Click [OK]. The area to the left of –2.06 is stored in K1 but not displayed in the session window.

To determine the area to the right of the z value, subtract this constant from 1, then display the result.

6.Select Calc>Calculator.

a)Type K2 in the text box for Store result in:.

b)Type in the expression 1 – K1, then click [OK].

7.Select Data>Display Data. Drag the mouse over K1 and K2, then click [Select] and [OK].

The results will be in the session window and stored in the constants.

Data

Display

K1

0.0196993

K2

0.980301

8.To see the constants and other information about the worksheet, click the Project Manager icon. In the left pane click on the green worksheet icon, and then click the constants folder. You should see all constants and their values in the right pane of the Project Manager.

9.For the third example calculate the two probabilities and store them in K1 and K2.

10.Use the calculator to subtract K1 from K2 and store in K3.

The calculator and project manager windows are shown.

Page 313

Calculate a z Value Given the Cumulative Probability

Find the z value for a cumulative probability of 0.025.

1.Select Calc>Probability Distributions>Normal.

2.Click the option for Inverse cumulative probability, then the option for Input constant.

3.In the text box type .025, the cumulative area, then click [OK].

4.In the dialog box, the z value will be returned, –1.960.

Inverse Cumulative Distribution Function

Normal with mean = 0 and standard deviation = 1

P ( X <= x )

x

0.025

–1.95996

In the session window z is –1.95996.

TI–83 Plus or TI–84 Plus

Step by Step

Standard Normal Random Variables

To find the probability for a standard normal random variable:

Press 2nd [DISTR], then 2 for normalcdf(

The form is normalcdf(lower z score, upper z score).

Use E99 for ∞ (infinity) and –E99 for –∞ (negative infinity). Press 2nd [EE] to get E.

Example: Area to the right of z = 1.11

normalcdf(1.11,E99)

Example: Area to the left of z = –1.93

normalcdf(–E99, –1.93)

Example: Area between z = 2.00 and z = 2.47

normalcdf(2.00,2.47)

To find the percentile for a standard normal random variable:

Press 2nd [DISTR], then 3 for the invNorm(

The form is invNorm(area to the left of z score)

Example: Find the z score such that the area under the standard normal curve to the left of 0.7123

invNorm(.7123)

Excel

Step by Step

The Standard Normal Distribution

Finding areas under the standard normal distribution curve

Example XL6–1

Find the area to the left of z = 1.99.

In a blank cell type: =NORMSDIST(1.99)

Answer: 0.976705

Example XL6–2

Find the area to the right of z = –2.04.

In a blank cell type: = 1-NORMSDIST(–2.04)

Answer: 0.979325

Page 314

Example XL6–3

Find the area between z = –2.04 and z = 1.99.

In a blank cell type: =NORMSDIST(1.99) – NORMSDIST(–2.04)

Answer: 0.956029

Finding a z value given an area under the standard normal distribution curve

Example XL6–4

Find a z score given the cumulative area (area to the left of z) is 0.0250.

In a blank cell type: =NORMSINV(.025)

Answer: –1.95996

Objective 4

Find probabilities for a normally distributed variable by transforming it into a standard normal variable.

6–2Applications of the Normal Distribution

The standard normal distribution curve can be used to solve a wide variety of practical problems. The only requirement is that the variable be normally or approximately normally distributed. There are several mathematical tests to determine whether a variable is normally distributed. See the Critical Thinking Challenges on page 350. For all the problems presented in this chapter, you can assume that the variable is normally or approximately normally distributed.

To solve problems by using the standard normal distribution, transform the original variable to a standard normal distribution variable by using the formula

This is the same formula presented in Section 3–3. This formula transforms the values of the variable into standard units or z values. Once the variable is transformed, then the Procedure Table and Table E in Appendix C can be used to solve problems.

For example, suppose that the scores for a standardized test are normally distributed, have a mean of 100, and have a standard deviation of 15. When the scores are transformed to z values, the two distributions coincide, as shown in Figure 6–17. (Recall that the z distribution has a mean of 0 and a standard deviation of 1.)

Figure 6–17

Test Scores and Their Corresponding z Values

To solve the application problems in this section, transform the values of the variable to z values and then find the areas under the standard normal distribution, as shown in Section 6–1.

Page 315

Example 6–6

Holiday Spending

A survey by the National Retail Federation found that women spend on average $146.21 for the Christmas holidays. Assume the standard deviation is $29.44. Find the percentage of women who spend less than $160.00. Assume the variable is normally distributed.

Solution

Step 1Draw the figure and represent the area as shown in Figure 6–18.

Figure 6–18

Area Under a Normal Curve for Example 6–6

Step 2Find the z value corresponding to $160.00.

Hence $160.00 is 0.47 of a standard deviation above the mean of $146.21, as shown in the z distribution in Figure 6–19.

Figure 6–19

Area and z Values for Example 6–6

Step 3Find the area, using Table E. The area under the curve to the left of z = 0.47 is 0.6808.

Therefore 0.6808, or 68.08%, of the women spend less than $160.00 at Christmas time.

Example 6–7

Monthly Newspaper Recycling

Each month, an American household generates an average of 28 pounds of newspaper for garbage or recycling. Assume the standard deviation is 2 pounds. If a household is selected at random, find the probability of its generating

a.Between 27 and 31 pounds per month

b.More than 30.2 pounds per month

Assume the variable is approximately normally distributed.

Source: Michael D. Shook and Robert L. Shook, The Book of Odds.

Page 316

Solution a

Step 1Draw the figure and represent the area. See Figure 6–20.

Figure 6–20

Area Under a Normal Curve for Part a of Example 6–7

Historical Note

Astronomers in the late 1700s and the 1800s used the principles underlying the normal distribution to correct measurement errors that occurred in charting the positions of the planets.

Step 2Find the two z values.

Step 3Find the appropriate area, using Table E. The area to the left of z2 is 0.9332, and the area to the left of z1 is 0.3085. Hence the area between z1 and z2 is 0.9332 – 0.3085 = 0.6247. See Figure 6–21.

Figure 6–21

Area and z Values for Part a of Example 6–7

Hence, the probability that a randomly selected household generates between 27 and 31 pounds of newspapers per month is 62.47%.

Solution b

Step 1Draw the figure and represent the area, as shown in Figure 6–22.

Figure 6–22

Area Under a Normal Curve for Part b of Example 6–7

Step 2Find the z value for 30.2.

Page 317

Step 3Find the appropriate area. The area to the left of z = 1.1 is 0.8643. Hence the area to the right of z = 1.1 is 1.0000 – 0.8643 = 0.1357.

Hence, the probability that a randomly selected household will accumulate more than 30.2 pounds of newspapers is 0.1357, or 13.57%.

A normal distribution can also be used to answer questions of “How many?” This application is shown in Example 6–8.

Example 6–8

Emergency Call Response Time

The American Automobile Association reports that the average time it takes to respond to an emergency call is 25 minutes. Assume the variable is approximately normally distributed and the standard deviation is 4.5 minutes. If 80 calls are randomly selected, approximately how many will be responded to in less than 15 minutes?

Source: Michael D. Shook and Robert L. Shook, The Book of Odds.

Solution

To solve the problem, find the area under a normal distribution curve to the left of 15.

Step 1Draw a figure and represent the area as shown in Figure 6–23.

Figure 6–23

Area Under a Normal Curve for Example 6–8

Step 2Find the z value for 15.

Step 3Find the area to the left of z = –2.22. It is 0.0132.

Step 4To find how many calls will be made in less than 15 minutes, multiply the sample size 80 by 0.0132 to get 1.056. Hence, 1.056, or approximately 1, call will be responded to in under 15 minutes.

Note: For problems using percentages, be sure to change the percentage to a decimal before multiplying. Also, round the answer to the nearest whole number, since it is not possible to have 1.056 calls.

Objective 5

Find specific data values for given percentages, using the standard normal distribution.

Finding Data Values Given Specific Probabilities

A normal distribution can also be used to find specific data values for given percentages. This application is shown in Example 6–9.

Page 318

Example 6–9

Police Academy Qualifications

To qualify for a police academy, candidates must score in the top 10% on a general abilities test. The test has a mean of 200 and a standard deviation of 20. Find the lowest possible score to qualify. Assume the test scores are normally distributed.

Solution

Since the test scores are normally distributed, the test value X that cuts off the upper 10% of the area under a normal distribution curve is desired. This area is shown in Figure 6–24.

Figure 6–24

Area Under a Normal Curve for Example 6–9

Work backward to solve this problem.

Step 1Subtract 0.1000 from 1.000 to get the area under the normal distribution to the left of x: 1.0000 – 0.10000 = 0.9000.

Step 2Find the z value that corresponds to an area of 0.9000 by looking up 0.9000 in the area portion of Table E. If the specific value cannot be found, use the closest value—in this case 0.8997, as shown in Figure 6–25. The corresponding z value is 1.28. (If the area falls exactly halfway between two z values, use the larger of the two z values. For example, the area 0.9500 falls halfway between 0.9495 and 0.9505. In this case use 1.65 rather than 1.64 for the z value.)

Figure 6–25

Finding the z Value from Table E (Example 6–9)

Step 3Substitute in the formula z = (X – μ)/σ and solve for X.

A score of 226 should be used as a cutoff. Anybody scoring 226 or higher qualifies.

Interesting Fact

Americans are the largest consumers of chocolate. We spend $16.6 billion annually.

Page 319

Instead of using the formula shown in step 3, you can use the formula X = z · σ + µ. This is obtained by solving

for X as shown.

z · σ = X – µ

Multiply both sides by σ.

z · σ + µ = X

Add µ to both sides.

X = z · σ + µ

Exchange both sides of the equation.

Formula for Finding X

When you must find the value of X, you can use the following formula:

X = z · σ + µ

Example 6–10

Systolic Blood Pressure

For a medical study, a researcher wishes to select people in the middle 60% of the population based on blood pressure. If the mean systolic blood pressure is 120 and the standard deviation is 8, find the upper and lower readings that would qualify people to participate in the study.

Solution

Assume that blood pressure readings are normally distributed; then cutoff points are as shown in Figure 6–26.

Figure 6–26

Area Under a Normal Curve for Example 6–10

Figure 6–26 shows that two values are needed, one above the mean and one below the mean. To get the area to the left of the positive z value, add 0.5000 + 0.3000 = 0.8000 (30% = 0.3000). The z value closest to 0.8000 is 0.84. Substituting in the formula X = zσ + µ gives

X1 = zσ + µ = (0.84)(8) + 120 = 126.72

The area to the left of the negative z value is 20%, or 2.000. The area closest to 0.2000 is –0.84.

X2 = (–0.84)(8) + 120 = 113.28

Therefore, the middle 60% will have blood pressure readings of 113.28 < X < 126.72.

As shown in this section, a normal distribution is a useful tool in answering many questions about variables that are normally or approximately normally distributed.

Page 320

Determining Normality

A normally shaped or bell-shaped distribution is only one of many shapes that a distribution can assume; however, it is very important since many statistical methods require that the distribution of values (shown in subsequent chapters) be normally or approximately normally shaped.

There are several ways statisticians check for normality. The easiest way is to draw a histogram for the data and check its shape. If the histogram is not approximately bell-shaped, then the data are not normally distributed.

Skewness can be checked by using Pearson’s index PI of skewness. The formula is

If the index is greater than or equal to +1 or less than or equal to –1, it can be concluded that the data are significantly skewed.

In addition, the data should be checked for outliers by using the method shown in Chapter 3. Even one or two outliers can have a big effect on normality.

Examples 6–11 and 6–12 show how to check for normality.

Example 6–11

Technology Inventories

A survey of 18 high-technology firms showed the number of days’ inventory they had on hand. Determine if the data are approximately normally distributed.

Source: USA TODAY.

Solution

Step 1Construct a frequency distribution and draw a histogram for the data, as shown in Figure 6–27.

Class

Frequency

5–29

2

30–54

3

55–79

4

80–104

5

105–129

2

130–154

1

155–179

1

Figure 6–27

Histogram for Example 6–11

Page 321

Since the histogram is approximately bell-shaped, we can say that the distribution is approximately normal.

Step 2Check for skewness. For these data, = 79.5, median = 77.5, and s = 40.5. Using Pearson’s index of skewness gives

In this case, the PI is not greater than +1 or less than –1, so it can be concluded that the distribution is not significantly skewed.

Step 3Check for outliers. Recall that an outlier is a data value that lies more than 1.5 (IQR) units below Q1 or 1.5 (IQR) units above Q3. In this case, Q1 = 45 and Q3 = 98; hence, IQR = Q3 – Qx = 98 – 45 = 53. An outlier would be a data value less than 45 – 1.5(53) = –34.5 or a data value larger than 98 + 1.5(53) = 177.5. In this case, there are no outliers.

Since the histogram is approximately bell-shaped, the data are not significantly skewed, and there are no outliers, it can be concluded that the distribution is approximately normally distributed.

Example 6–12

Number of Baseball Games Played

The data shown consist of the number of games played each year in the career of Baseball Hall of Famer Bill Mazeroski. Determine if the data are approximately normally distributed.

Source: Greensburg Tribune Review.

Solution

Step 1Construct a frequency distribution and draw a histogram for the data. See Figure 6–28.

Figure 6–28

Histogram for Example 6–12

Class

Frequency

34–58

1

59–83

3

84–108

0

109–133

2

134–158

7

159–183

4

Page 322

Unusual Stat

The average amount of money stolen by a pickpocket each time is $128.

The histogram shows that the frequency distribution is somewhat negatively skewed.

Step 2Check for skewness; = 127.24, median = 143, and s = 39.87.

Since the PI is less than –1, it can be concluded that the distribution is significantly skewed to the left.

Step 3Check for outliers. In this case, Q1 = 96.5 and Q3 = 155.5. IQR = Q3 – Q1 = 155.5 – 96.5 = 59. Any value less than 96.5 – 1.5(59) = 8 or above 155.5 + 1.5(59) = 244 is considered an outlier. There are no outliers.

In summary, the distribution is somewhat negatively skewed.

Another method that is used to check normality is to draw a normal quantile plot . Quantiles, sometimes called fractiles, are values that separate the data set into approximately equal groups. Recall that quartiles separate the data set into four approximately equal groups, and deciles separate the data set into 10 approximately equal groups. A normal quantile plot consists of a graph of points using the data values for the x coordinates and the z values of the quantiles corresponding to the x values for the y coordinates. (Note: The calculations of the z values are somewhat complicated, and technology is usually used to draw the graph. The Technology Step by Step section shows how to draw a normal quantile plot.) If the points of the quantile plot do not lie in an approximately straight line, then normality can be rejected.

There are several other methods used to check for normality. A method using normal probability graph paper is shown in the Critical Thinking Challenge section at the end of this chapter, and the chi-square goodness-of-fit test is shown in Chapter 11. Two other tests sometimes used to check normality are the Kolmogorov-Smikirov test and the Lilliefors test. An explanation of these tests can be found in advanced textbooks.

Applying the Concepts 6–2

Smart People

Assume you are thinking about starting a Mensa chapter in your hometown of Visiala, California, which has a population of about 10,000 people. You need to know how many people would qualify for Mensa, which requires an IQ of at least 130. You realize that IQ is normally distributed with a mean of 100 and a standard deviation of 15. Complete the following.

1.Find the approximate number of people in Visiala who are eligible for Mensa.

2.Is it reasonable to continue your quest for a Mensa chapter in Visiala?

3.How could you proceed to find out how many of the eligible people would actually join the new chapter? Be specific about your methods of gathering data.

4.What would be the minimum IQ score needed if you wanted to start an Ultra-Mensa club that included only the top 1% of IQ scores?

See page 352 for the answers.

Page 323

Exercises 6-2

1.Admission Charge for Movies The average admission charge for a movie is $5.81. If the distribution of movie admission charges is approximately normal with a standard deviation of $0.81, what is the probability that a randomly selected admission charge is less than $3.50?

Source: New York Times Almanac.

2.Teachers’ Salaries The average annual salary for all U.S. teachers is $47,750. Assume that the distribution is normal and the standard deviation is $5680. Find the probability that a randomly selected teacher earns

a.Between $35,000 and $45,000 a year

b.More than $40,000 a year

c.If you were applying for a teaching position and were offered $31,000 a year, how would you feel (based on this information)?

Source: New York Times Almanac.

3.Population in U.S. Jails The average daily jail population in the United States is 706,242. If the distribution is normal and the standard deviation is 52,145, find the probability that on a randomly selected day, the jail population is

a.Greater than 750,000

b.Between 600,000 and 700,000

Source: New York Times Almanac.

4.SAT Scores The national average SAT score (for Verbal and Math) is 1028. If we assume a normal distribution with σ = 92, what is the 90th percentile score? What is the probability that a randomly selected score exceeds 1200?

Source: New York Times Almanac.

5.Chocolate Bar Calories The average number of calories in a 1.5-ounce chocolate bar is 225. Suppose that the distribution of calories is approximately normal with σ = 10. Find the probability that a randomly selected chocolate bar will have

a.Between 200 and 220 calories

b.Less than 200 calories

Source: The Doctor’s Pocket Calorie, Fat, and Carbohydrate Counter.

6.Monthly Mortgage Payments The average monthly mortgage payment including principal and interest is $982 in the United States. If the standard deviation is approximately $180 and the mortgage payments are approximately normally distributed, find the probability that a randomly selected monthly payment is

a.More than $1000

b.More than $1475

c.Between $800 and $1150

Source: World Almanac.

7.Professors’ Salaries The average salary for a Queens College full professor is $85,900. If the average salaries are normally distributed with a standard deviation of $11,000, find these probabilities.

a.The professor makes more than $90,000.

b.The professor makes more than $75,000.

Source: AAUP, Chronicle of Higher Education.

8.Doctoral Student Salaries Full-time Ph.D. students receive an average of $12,837 per year. If the average salaries are normally distributed with a standard deviation of $1500, find these probabilities.

a.The student makes more than $15,000.

b.The student makes between $13,000 and $14,000.

Source: U.S. Education Dept., Chronicle of Higher Education.

9.Miles Driven Annually The mean number of miles driven per vehicle annually in the United States is 12,494 miles. Choose a randomly selected vehicle, and assume the annual mileage is normally distributed with a standard deviation of 1290 miles. What is the probability that the vehicle was driven more than 15,000 miles? Less than 8000 miles? Would you buy a vehicle if you had been told that it had been driven less than 6000 miles in the past year?

Source: World Almanac.

10.Commute Time to Work The average commute to work (one way) is 25 minutes according to the 2005 American Community Survey. If we assume that commuting times are normally distributed and that the standard deviation is 6.1 minutes, what is the probability that a randomly selected commuter spends more than 30 minutes commuting one way? Less than 18 minutes?

Source: www.census.gov

11.Credit Card Debt The average credit card debt for college seniors is $3262. If the debt is normally distributed with a standard deviation of $1100, find these probabilities.

a.That the senior owes at least $1000

b.That the senior owes more than $4000

c.That the senior owes between $3000 and $4000

Source: USA TODAY.

12.Price of Gasoline The average retail price of gasoline (all types) for the first half of 2005 was 212.2 cents. What would the standard deviation have to be in order for a 15% probability that a gallon of gas costs less than $1.80?

Source: World Almanac.

13.Time for Mail Carriers The average time for a mail carrier to cover a route is 380 minutes, and the standard deviation is 16 minutes. If one of these trips is selected at random, find the probability that the carrier will have the following route time. Assume the variable is normally distributed.

Page 324

a.At least 350 minutes

b.At most 395 minutes

c.How might a mail carrier estimate a range for the time he or she will spend en route?

14.Newborn Elephant Weights Newborn elephant calves usually weigh between 200 and 250 pounds—until October 2006, that is. An Asian elephant at the Houston (Texas) Zoo gave birth to a male calf weighing in at a whopping 384 pounds! Mack (like the truck) is believed to be the heaviest elephant calf ever born at a facility accredited by the Association of Zoos and Aquariums. If, indeed, the mean weight for newborn elephant calves is 225 pounds with a standard deviation of 45 pounds, what is the probability of a newborn weighing at least 384 pounds? Assume that the weights of newborn elephants are normally distributed.

Source: www.houstonzoo.org

15.Waiting to Be Seated The average waiting time to be seated for dinner at a popular restaurant is 23.5 minutes, with a standard deviation of 3.6 minutes. Assume the variable is normally distributed. When a patron arrives at the restaurant for dinner, find the probability that the patron will have to wait the following time.

a.Between 15 and 22 minutes

b.Less than 18 minutes or more than 25 minutes

c.Is it likely that a person will be seated in less than 15 minutes?

16.Salary of Full-Time Male Professors The average salary of a male full professor at a public four-year institution offering classes at the doctoral level is $99,685. For a female full professor at the same kind of institution, the salary is $90,330. If the standard deviation for the salaries of both genders is approximately $5200 and the salaries are normally distributed, find the 80th percentile salary for male professors and for female professors.

Source: World Almanac.

17.Used Boat Prices A marine sales dealer finds that the average price of a previously owned boat is $6492. He decides to sell boats that will appeal to the middle 66% of the market in terms of price. Find the maximum and minimum prices of the boats the dealer will sell. The standard deviation is $1025, and the variable is normally distributed. Would a boat priced at $5550 be sold in this store?

18.Itemized Charitable Contributions The average charitable contribution itemized per income tax return in Pennsylvania is $792. Suppose that the distribution of contributions is normal with a standard deviation of $103. Find the limits for the middle 50% of contributions.

Source: IRS, Statistics of Income Bulletin.

19.New Home Sizes A contractor decided to build homes that will include the middle 80% of the market. If the average size of homes built is 1810 square feet, find the maximum and minimum sizes of the homes the contractor should build. Assume that the standard deviation is 92 square feet and the variable is normally distributed.

Source: Michael D. Shook and Robert L. Shook, The Book of Odds.

20.New Home Prices If the average price of a new one- family home is $246,300 with a standard deviation of $15,000, find the minimum and maximum prices of the houses that a contractor will build to satisfy the middle 80% of the market. Assume that the variable is normally distributed.

Source: New York Times Almanac.

21.Cost of Personal Computers The average price of a personal computer (PC) is $949. If the computer prices are approximately normally distributed and σ = $100, what is the probability that a randomly selected PC costs more than $1200? The least expensive 10% of personal computers cost less than what amount?

Source: New York Times Almanac.

22.Reading Improvement Program To help students improve their reading, a school district decides to implement a reading program. It is to be administered to the bottom 5% of the students in the district, based on the scores on a reading achievement exam. If the average score for the students in the district is 122.6, find the cutoff score that will make a student eligible for the program. The standard deviation is 18. Assume the variable is normally distributed.

23.Used Car Prices An automobile dealer finds that the average price of a previously owned vehicle is $8256. He decides to sell cars that will appeal to the middle 60% of the market in terms of price. Find the maximum and minimum prices of the cars the dealer will sell. The standard deviation is $1150, and the variable is normally distributed.

24.Ages of Amtrak Passenger Cars The average age of Amtrak passenger train cars is 19.4 years. If the distribution of ages is normal and 20% of the cars are older than 22.8 years, find the standard deviation.

Source: New York Times Almanac.

25.Lengths of Hospital Stays The average length of a hospital stay for all diagnoses is 4.8 days. If we assume that the lengths of hospital stays are normally distributed with a variance of 2.1, then 10% of hospital stays are longer than how many days? Thirty percent of stays are less than how many days?

Source: www.cdc.gov

26.High School Competency Test A mandatory competency test for high school sophomores has a normal distribution with a mean of 400 and a standard deviation of 100.

a.The top 3% of students receive $500. What is the minimum score you would need to receive this award?

Page 325

b.The bottom 1.5% of students must go to summer school. What is the minimum score you would need to stay out of this group?

27.Product Marketing An advertising company plans to market a product to low-income families. A study states that for a particular area, the average income per family is $24,596 and the standard deviation is $6256. If the company plans to target the bottom 18% of the families based on income, find the cutoff income. Assume the variable is normally distributed.

28.Bottled Drinking Water Americans drank an average of 23.2 gallons of bottled water per capita in 2004. If the standard deviation is 2.7 gallons and the variable is normally distributed, find the probability that a randomly selected American drank more than 25 gallons of bottled water. What is the probability that the selected person drank between 18 and 26 gallons?

Source: www.census.gov

29.Wristwatch Lifetimes The mean lifetime of a wristwatch is 25 months, with a standard deviation of 5 months. If the distribution is normal, for how many months should a guarantee be made if the manufacturer does not want to exchange more than 10% of the watches? Assume the variable is normally distributed.

30.Security Officer Stress Tolerance To qualify for security officers’ training, recruits are tested for stress tolerance. The scores are normally distributed, with a mean of 62 and a standard deviation of 8. If only the top 15% of recruits are selected, find the cutoff score.

31.In the distributions shown, state the mean and standard deviation for each. Hint: See Figures 6–5 and 6–6. Also the vertical lines are 1 standard deviation apart.

32.SAT Scores Suppose that the mathematics SAT scores for high school seniors for a specific year have a mean of 456 and a standard deviation of 100 and are approximately normally distributed. If a subgroup of these high school seniors, those who are in the National Honor Society, is selected, would you expect the distribution of scores to have the same mean and standard deviation? Explain your answer.

33.Given a data set, how could you decide if the distribution of the data was approximately normal?

34.If a distribution of raw scores were plotted and then the scores were transformed to z scores, would the shape of the distribution change? Explain your answer.

35.In a normal distribution, find σ when µ = 110 and 2.87% of the area lies to the right of 112.

36.In a normal distribution, find µ when σ is 6 and 3.75% of the area lies to the left of 85.

37.In a certain normal distribution, 1.25% of the area lies to the left of 42, and 1.25% of the area lies to the right of 48. Find µ and σ.

38.Exam Scores An instructor gives a 100-point examination in which the grades are normally distributed. The mean is 60 and the standard deviation is 10. If there are 5% A’s and 5% F’s, 15% B’s and 15% D’s, and 60% C’s, find the scores that divide the distribution into those categories.

39.Drive-in Movies The data shown represent the number of outdoor drive-in movies in the United States for a 14-year period. Check for normality.

Source: National Association of Theater Owners.

40.Cigarette Taxes The data shown represent the cigarette tax (in cents) for 30 randomly selected states. Check for normality.

Source: Commerce Clearing House.

Page 326

41.Box Office Revenues The data shown represent the box office total revenue (in millions of dollars) for a randomly selected sample of the top-grossing films in 2001. Check for normality.

Source: USA TODAY.

42.Number of Runs Made The data shown represent the number of runs made each year during Bill Mazeroski’s career. Check for normality.

Source: Greensburg Tribune Review.

Technology Step by Step

MINITAB

Step by Step

Determining Normality

There are several ways in which statisticians test a data set for normality. Four are shown here.

Construct a Histogram

Inspect the histogram for shape.

1.Enter the data in the first column of a new worksheet. Name the column Inventory.

2.Use Stat>Basic Statistics>Graphical Summary presented in Section 3–3 to create the histogram. Is it symmetric? Is there a single peak?

Check for Outliers

Inspect the boxplot for outliers. There are no outliers in this graph. Furthermore, the box is in the middle of the range, and the median is in the middle of the box. Most likely this is not a skewed distribution either.

Calculate Pearson’s Index of Skewness

The measure of skewness in the graphical summary is not the same as Pearson’s index. Use the calculator and the formula.

3.Select Calc>Calculator, then type PI in the text box for Store result in:.

4.Enter the expression: 3*(MEAN(C1)—MEDI(C1))/(STDEV(C1)). Make sure you get all the parentheses in the right place!

5.Click [OK]. The result, 0.148318, will be stored in the first row of C2 named PI. Since it is smaller than +1, the distribution is not skewed.

Construct a Normal Probability Plot

6.Select Graph>Probability Plot, then Single and click [OK].

7.Double-click C1 Inventory to select the data to be graphed.

8.Click [Distribution] and make sure that Normal is selected. Click [OK].

9.Click [Labels] and enter the title for the graph: Quantile Plot for Inventory. You may also put Your Name in the subtitle.

10.Click [OK] twice. Inspect the graph to see if the graph of the points is linear.

Page 327

These data are nearly normal.

What do you look for in the plot?

a)An “S curve” indicates a distribution that is too thick in the tails, a uniform distribution, for example.

b)Concave plots indicate a skewed distribution.

c)If one end has a point that is extremely high or low, there may be outliers.

This data set appears to be nearly normal by every one of the four criteria!

TI–83 Plus or TI–84 Plus

Step by Step

Normal Random Variables

To find the probability for a normal random variable:

Press 2nd [DISTR], then 2 for normalcdf(

The form is normalcdf(lower x value, upper x value, µ, σ)

Use E99 for ∞ (infinity) and –E99 for –∞ (negative infinity). Press 2nd [EE] to get E.

Example: Find the probability that x is between 27 and 31 when µ = 28 and σ = 2

(Example 6–7a from the text).

normalcdf(27,31,28,2)

To find the percentile for a normal random variable:

Press 2nd [DISTR], then 3 for invNorm(

The form is invNorm(area to the left of x value, µ, σ)

Example: Find the 90th percentile when µ = 200 and σ = 20 (Example 6–9 from text). invNorm(.9,200,20)

To construct a normal quantile plot:

1.Enter the data values into L1.

2.Press 2nd [STAT PLOT] to get the STAT PLOT menu.

3.Press 1 for Plot 1.

4.Turn on the plot by pressing ENTER while the cursor is flashing over ON.

5.Move the cursor to the normal quantile plot (6th graph).

6.Make sure L1 is entered for the Data List and X is highlighted for the Data Axis.

7.Press WINDOW for the Window menu. Adjust Xmin and Xmax according to the data values. Adjust Ymin and Ymax as well, Ymin = –3 and Ymax = 3 usually work fine.

8.Press GRAPH.

Using the data from the previous example gives

Since the points in the normal quantile plot lie close to a straight line, the distribution is approximately normal.

Page 328

Excel

Step by Step

Normal Quantile Plot

Excel can be used to construct a normal quantile plot in order to examine if a set of data is approximately normally distributed.

1.Enter the data from the MINITAB example into column A of a new worksheet. The data should be sorted in ascending order. If the data are not already sorted in ascending order, highlight the data to be sorted and select the Sort & Filter icon from the toolbar. Then select Sort Smallest to Largest.

2.After all the data are entered and sorted in column A, select cell B1. Type: =NORMSINV(1/(2*18)). Since the sample size is 18, each score represents , or approximately 5.6%, of the sample. Each data value is assumed to subdivide the data into equal intervals. Each data value corresponds to the midpoint of a particular subinterval. Thus, this procedure will standardize the data by assuming each data value represents the midpoint of a subinterval of width .

3.Repeat the procedure from step 2 for each data value in column A. However, for each subsequent value in column A, enter the next odd multiple of in the argument for the NORMSINV function. For example, in cell B2, type: =NORMSINV(3/(2*18)). In cell B3, type: =NORMSINV(5/(2*18)), and so on until all the data values have corresponding z scores.

4.Highlight the data from columns A and B, and select Insert, then Scatter chart. Select the Scatter with only markers (the first Scatter chart).

5.To insert a title to the chart: Left-click on any region of the chart. Select Chart Tools and Layout from the toolbar. Then select Chart Title.

6.To insert a label for the variable on the horizontal axis: Left-click on any region of the chart. Select Chart Tools and Layout form the toolbar. Then select Axis Titles>Primary Horizontal Axis Title.

The points on the chart appear to lie close to a straight line. Thus, we deduce that the data are approximately normally distributed.

Page 329

Objective 6

Use the central limit theorem to solve problems involving sample means for large samples.

6–3The Central Limit Theorem

In addition to knowing how individual data values vary about the mean for a population, statisticians are interested in knowing how the means of samples of the same size taken from the same population vary about the population mean.

Distribution of Sample Means

Suppose a researcher selects a sample of 30 adult males and finds the mean of the measure of the triglyceride levels for the sample subjects to be 187 milligrams/deciliter. Then suppose a second sample is selected, and the mean of that sample is found to be 192 milligrams/deciliter. Continue the process for 100 samples. What happens then is that the mean becomes a random variable, and the sample means 187, 192, 184, … , 196 constitute a sampling distribution of sample means .

A sampling distribution of sample means is a distribution using the means computed from all possible random samples of a specific size taken from a population.

If the samples are randomly selected with replacement, the sample means, for the most part, will be somewhat different from the population mean µ. These differences are caused by sampling error.

Sampling error is the difference between the sample measure and the corresponding population measure due to the fact that the sample is not a perfect representation of the population.

When all possible samples of a specific size are selected with replacement from a population, the distribution of the sample means for a variable has two important properties, which are explained next.

Properties of the Distribution of Sample Means

1.The mean of the sample means will be the same as the population mean.

2.The standard deviation of the sample means will be smaller than the standard deviation of the population, and it will be equal to the population standard deviation divided by the square root of the sample size.

The following example illustrates these two properties. Suppose a professor gave an 8-point quiz to a small class of four students. The results of the quiz were 2, 6, 4, and 8. For the sake of discussion, assume that the four students constitute the population. The mean of the population is

The standard deviation of the population is

The graph of the original distribution is shown in Figure 6–29. This is called a uniform distribution.

Page 330

Figure 6–29

Distribution of Quiz Scores

Historical Note

Two mathematicians who contributed to the development of the central limit theorem were Abraham DeMoivre (1667–1754) and Pierre Simon Laplace (1749–1827). DeMoivre was once jailed for his religious beliefs. After his release, DeMoivre made a living by consulting on the mathematics of gambling and insurance. He wrote two books, Annuities Upon Lives and The Doctrine of Chance.

Laplace held a government position under Napoleon and later under Louis XVIII. He once computed the probability of the sun rising to be 18,226,214/18,226,215.

Now, if all samples of size 2 are taken with replacement and the mean of each sample is found, the distribution is as shown.

Homework is Completed By:

Writer Writer Name Amount Client Comments & Rating
Instant Homework Helper

ONLINE

Instant Homework Helper

$36

She helped me in last minute in a very reasonable price. She is a lifesaver, I got A+ grade in my homework, I will surely hire her again for my next assignments, Thumbs Up!

Order & Get This Solution Within 3 Hours in $25/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 3 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 6 Hours in $20/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 6 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 12 Hours in $15/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 12 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

6 writers have sent their proposals to do this homework:

Instant Assignments
Professor Smith
Fatimah Syeda
Top Grade Essay
Top Quality Assignments
Exam Attempter
Writer Writer Name Offer Chat
Instant Assignments

ONLINE

Instant Assignments

As per my knowledge I can assist you in writing a perfect Planning, Marketing Research, Business Pitches, Business Proposals, Business Feasibility Reports and Content within your given deadline and budget.

$39 Chat With Writer
Professor Smith

ONLINE

Professor Smith

I have read your project description carefully and you will get plagiarism free writing according to your requirements. Thank You

$40 Chat With Writer
Fatimah Syeda

ONLINE

Fatimah Syeda

I am a professional and experienced writer and I have written research reports, proposals, essays, thesis and dissertations on a variety of topics.

$49 Chat With Writer
Top Grade Essay

ONLINE

Top Grade Essay

I have assisted scholars, business persons, startups, entrepreneurs, marketers, managers etc in their, pitches, presentations, market research, business plans etc.

$50 Chat With Writer
Top Quality Assignments

ONLINE

Top Quality Assignments

This project is my strength and I can fulfill your requirements properly within your given deadline. I always give plagiarism-free work to my clients at very competitive prices.

$18 Chat With Writer
Exam Attempter

ONLINE

Exam Attempter

I reckon that I can perfectly carry this project for you! I am a research writer and have been writing academic papers, business reports, plans, literature review, reports and others for the past 1 decade.

$21 Chat With Writer

Let our expert academic writers to help you in achieving a+ grades in your homework, assignment, quiz or exam.

Similar Homework Questions

+971561686603 Abortion pills in Dubai/Abu Dhabi-mifepristone & misoprostol in DUBAI - I had a box of crayons poem - Determination of the solubility product constant - Why is chemistry considered the central science - Business law chapter 18 test answers - Another word for risk free - Eliminate reduce raise create grid - What are the key elements of tesla motors strategy - Project closure document sample - 2000 mb to gb - Hris cost benefit analysis matrix - Module 10 discussion Ethical Dilemma - What are the 3 prophecies in macbeth act 4 - Winnie the pooh depression test - C++ battleship game - Psychology Discussion SUPER EASY AND FAST PLZ HELP - Boyle's law and scuba diving - 4-1 Discussion: Patents - Transition curve in highway - A consolidation rotates data to display alternative presentations of the data. - E-prescribing. What were some advantages and disadvantages? - How to download any song from YouTube? - Mental status exam social work - Leccion 5 contextos activities answers - Interpersonal communication everyday encounters 9th edition citation - Pork stir-fry jamie oliver - Criminal justice - Mental Health (ADHD) Attention Deficit Hyperactivity Disorder - Predicting consumer tastes with big data at gap case solution - Vestigial structures in humans - Triangle Shirtwaist Fire - Eight stones recruitment pte ltd - Cognitive psychology attention quiz - Examples of levers in sport - P4P Programs - Is mt st helens a hotspot - Marine corps five paragraph order - Capstone Research Companion - Deliverable 4 - Electricity, Magnetism, and Light Compare/Contrast Paper - Managment - Thomas friedman thanks for being late pdf - Teachers aide course brisbane - Social Studies and the Arts Unit Plan - Lesson 1 the meaning of oxidation and reduction - TOM3010 - The secret to efficient teamwork is ridiculously simple - Marvins miami ok weekly ad - Evaluating Use of Literature and Problem Statement - Clawson's diamond model of leadership - Crime and punishment part 4 chapter 5 summary - Recycled plastic tiles machine - Sample soap note for normal physical exam - Focusing on accuracy and trying not to fall behind are especially important for - Experiment 1 observation of mitosis in a plant cell - Calculate number of possible outcomes - Pyramid of giza diorama - Radians and degrees worksheet - 02.05 the bill of rights assessment - Engage le cordon bleu - The great wall of china worksheet answers - Stone age necklace craft - 8 - Irresponsible pursuit of paradise 2nd - Melting point experiment lab report introduction - Soc 110 teamwork collaboration and conflict resolution - T of PN - Artificial intelligence george f luger 6th edition pdf download - Weekly Post #2 - Classification and division essay - Case study on moral status fetal abnormality - FIN/571: Corporate Finance - Multiple step income statements show - Artifact of human creative expression examples - Computer Science - Chamberlain college of nursing program outcome 5 - Big Data Paper - Sle x aro ale - Boa and merrill lynch merger - Philosophy in nursing definition - University of canterbury email - Caps Assignment W1 - List of nfl quarterbacks by salary - Health Policy Questions - Georgia tech chemical engineering curriculum - 306/20 garden street south yarra - Analyzing Transcripts - Importance of gps in civil engineering - Four frames of organization examples - Disadvantage of asexual reproduction - As nzs 3500 download free - W4D1 - MENTAL HEALTH - Infotech in a global summary - Discussion - Bobby rio conversation escalation - Discussion - Advantages and disadvantages of category captain - Singapore air wheelchair assistance - Ian freeman vs frank mir full fight - Social Stratification Part II: Race