CONTENTS Chapter 1 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 2 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 3 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 4 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 5 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 6 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 7 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key
Chapter 8 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 9 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 10 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 11 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 12 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 13 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 14 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key Chapter 15 Test Form A Test Form A Answer Key Test Form B Test Form B Answer Key
CHAPTER 1 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Answer true or false. 1) The information we gather with experiments and with surveys is collectively called data. A) True B) False
1)
Provide an appropriate response. 2) A survey of 1500 American households found that 33% of the respondents own a computer. Is this value a parameter or a statistic? B) statistic A) parameter
2)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 3) The three main aspects of statistics are , and
,
3)
.
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Answer true or false. 4) A large company would like to know what percentage of it's employees would utilize an on-site daycare facility. Rather than asking it's 5000 employees, each employee is assigned a number from 1 to 5000 and then a computer program is used to randomly select 100 numbers between 1 and 5000. These employees are then contacted and asked whether they would use an on-site daycare facility. The method used to collect the sample of 100 responses produces a random sample. A) True B) False
4)
5) Based on 12,000 responses from 55,000 questionnaires sent to its alumni, a major university estimated that the annual salary of its alumni was $98,500 per year. This sample was collected using random sampling. A) False B) True
5)
6) Inferential statistics are used when data are available only for a sample; however, descriptive statistics are used when data are available for either a sample or a population. A) False B) True
6)
Copyright © 2017 Pearson Education, Inc. 1
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 7) Drug X, a drug that claims to treat male pattern scalp hair loss, was administered for 12 months to over 1800 men aged 18 to 41 with mild to moderate amounts of ongoing hair loss. Whether they were receiving drug X or a placebo (a pill containing no medication), all men were given a medicated shampoo. In general, men who took drug X maintained or increased the number of visible scalp hairs; while scalp hair counts in men who took the placebo continued to decrease. This concluded that drug X is effective in maintaining or increasing the amount of scalp hair in men. Which statement of this example can be referred to as descriptive statistics? 8) A recent poll asked 1,245 registered voters nationwide, "Which party do you think can do a better job of handling immigration issues?" 31% of the respondents answered "Republicans." With a margin of error of ±3%, it is estimated that between 28 and 34 percent of registered voters nationwide feel that the Republican party can do a better job of handling immigration issues. Identify which part of this example is inferential.
7)
8)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 9) A study attempted to estimate the proportion of Florida residents who were willing to spend more tax dollars on protecting the Florida beaches from environmental disasters. Twenty-four hundred Florida residents were surveyed. Which of the following is the population of interest in the study? A) all Florida residents who were willing to spend more tax dollars on protecting the beaches from environmental disasters. B) all Florida residents C) all Florida residents who lived along the beaches D) the 2400 Florida residents surveyed E) all residents of the United States 10) A short application program for performing a specific task is called a/an A) sampling. B) time series. C) parameter. D) applet. E) None of these. Answer true or false. 11) In a recent television survey, participants were asked to answer "yes" or "no" to the question "Are you in favor of the death penalty?". Six thousand five hundred responded "yes" while 3700 responded "no". There was a fifty-cent charge for the call. The sampling technique used produces a random sample. A) True B) False
Copyright © 2017 Pearson Education, Inc. 2
9)
10)
11)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 12) The ____________________ is the set of all subjects of interest; while the ____________________ is a subset of the entire set of interest. Provide an appropriate response. 13) A radio show host asks listeners whether they believe that human activity is altering the global climate. Of those calling in, only 20% responded by saying yes. Is it safe to infer that only 20% of the general population believes that human activity is altering the global climate?
12)
13)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Determine whether the summary measure is better described as a parameter or a statistic. 14) The average age of 25 congressional members who were selected at random from all U.S. Congressmen A) Parameter B) Statistic Select the most appropriate answer. 15) The estimation of the population average age of registered voters in the state of Ohio based on the sample average age of 1,000 registered voters in that state and its corresponding margin of error is an example of A) a statistic. B) deductive statistics. C) a parameter. D) inferential statistics. E) descriptive statistics. 16) The following statement refers to which aspect of a statistical study: "A meteorologist constructs a graph showing the total precipitation in Phoenix, Arizona in each of the months of a given year"? A) Design B) Description C) Inference Answer true or false. 17) Parameter values are usually known. A) True
14)
15)
16)
17) B) False
Provide an appropriate response. 18) Using an computer to mimic what would actually happen if you selected a sample and used statistics in real life is called A) time series B) database C) simulation D) random sampling E) mini-tab Answer true or false. 19) Random sampling enables the sample to be a good reflection of the population. A) True B) False Copyright © 2017 Pearson Education, Inc. 3
18)
19)
Determine whether the summary measure is better described as a parameter or a statistic. 20) The average height of horse jockeys B) Statistic A) Parameter
20)
The owners of a coffee shop conducted a taste test to determine whether its customers preferred a new coffee brand to the current one sold by the shop. Customers who were willing to participate were given small samples of each of the two brands in random order and were asked to select which one they preferred without knowing the brand. Of the 100 participating customers, 90% chose the new brand. Based on these results, the owners determined that a majority of their customers preferred the new brand and therefore switched their coffee supplier. 21) Randomizing the order in which the samples of each brand were given to each 21) customer refers to which aspect of statistics? A) Description B) Investigation C) Design D) Inference E) None of these Select the most appropriate answer. 22) The following statement refers to which aspect of a statistical study: "The average age of the students in a statistics class is 25 years"? A) Description B) Inference C) Design
22)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 23) _____________________ is the art and science of learning from data.
23)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 24) Making decisions and predictions based on the data refers to which aspect of statistics? A) Design B) Description C) Sampling D) Inference E) None of these Answer true or false. 25) To estimate the percentage of defective items a machine is producing, a quality control analyst inspects the first 100 items produced in a day. This technique produces a random sample. A) True B) False
Copyright © 2017 Pearson Education, Inc. 4
24)
25)
Answer Key Testname: CHAPTER 1 FORM A TEST
1) A 2) B 3) design, description, inference 4) A 5) A 6) B 7) Scalp hair counts in the men who took drug X maintained or increased; scalp hair counts in the men who took the placebo decreased 8) It is estimated that between 28 and 34 percent of registered voters nationwide feel that the Republican party can do a better job of handling immigration issues. 9) B 10) D 11) B 12) population; sample 13) No. Those who respond to this question are unlikely to be representative of the general population for two reasons. First, listeners to the show are unlikely to be representative of the general population, as they are likely to share the views of the radio show host, and may not represent a wide spectrum of views. Second, those listeners who respond voluntarily to the question are likely to be those who feel the most strongly about the issue. Their views are unlikely to be representative of all listeners to the show let alone of the general population. 14) B 15) D 16) B 17) B 18) C 19) A 20) A 21) C 22) A 23) Statistics 24) D 25) B
Copyright © 2017 Pearson Education, Inc. 5
CHAPTER 1 FORM B TEST Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) A radio show host asks listeners whether they believe that human activity is altering the global climate. Of those calling in, only 20% responded by saying yes. Is it safe to infer that only 20% of the general population believes that human activity is altering the global climate?
1)
2) Define the terms population and sample.
2)
3) Last year, 25% of students at a certain college smoked regularly. A researcher wants to determine whether an anti-smoking campaign was effective in reducing smoking at the college. If the proportion of students smoking regularly were still 25%, would it be surprising to find that only 15% of students smoked in a sample of size 20? in a sample of size 200? In either case would it be reasonable to assume that there had been a decrease in the proportion of students smoking regularly?
3)
Copyright © 2017 Pearson Education, Inc. 1
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 4) In a survey of students at a certain college, 677 students were asked whether they believed in life after death and whether they believed in reincarnation. The results are summarized in the table below.
4)
Frequency Distribution Reincarnation Life After Death (Column Percent, (Column Percent, Response Number of cases) Number of cases) Yes, Definitely 9.5%, 64 37.7%, 255 Yes, Probably 19.9%, 135 31.0%, 210 No, Probably Not 43.0%, 291 19.2%, 130 No, Definitely Not 27.6%, 187 12.1%, 82 Column Total 100.0%, 677 100.0%, 677 How does the percentage of "yes, definitely" responses for life after death compare with the percentage of "yes, definitely" responses for reincarnation? A) The percentage who responded "yes, definitely" when asked whether they believed in life after death (37.7%) was slightly higher than the percentage who responded "yes, definitely" when asked whether the believed in reincarnation (31.0%). B) The percentage who responded "yes, definitely" when asked whether they believed in life after death (37.7%) was roughly four times the percentage who responded "yes, definitely" when asked whether the believed in reincarnation (9.5%). C) The percentage who responded "yes, definitely" when asked whether they believed in life after death (19.9%) was roughly twice the percentage who responded "yes, definitely" when asked whether the believed in reincarnation (9.5%). D) The percentage who responded "yes, definitely" when asked whether they believed in life after death (255%) was roughly four times the percentage who responded "yes, definitely" when asked whether the believed in reincarnation (64%). SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 5) What is statistics?
5)
6) A recent poll asked 1,245 registered voters nationwide, ʺWhich party do you think can do a better job of handling immigration issues?ʺ 31% of the respondents answered ʺRepublicans.ʺ With a margin of error of ±3%, it is estimated that between 28 and 34 percent of registered voters nationwide feel that the Republican party can do a better job of handling immigration issues. Identify which part of this example is inferential.
6)
Copyright © 2017 Pearson Education, Inc. 2
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 7) A study attempted to estimate the proportion of Florida residents who were willing to spend more tax dollars on protecting the Florida beaches from environmental disasters. Twenty-four hundred Florida residents were surveyed. Which of the following is the population of interest in the study? A) all residents of the United States B) all Florida residents who were willing to spend more tax dollars on protecting the beaches from environmental disasters. C) the 2400 Florida residents surveyed D) all Florida residents who lived along the beaches E) all Florida residents Provide an appropriate response. 8) The average salary of all automotive workers is $42,000. Is this value a parameter or a statistic? A) statistic B) parameter Select the most appropriate answer. 9) In a survey, 71% of 1052 adults polled answered "Yes" to the question "Do you believe the theory that increased carbon dioxide and other gases released into the atmosphere will, if unchecked, lead to global warming and an increase in average temperatures?" The 71% responding "Yes" to the question is an example of A) a statistic B) a parameter C) the sample D) the population E) random sampling
7)
8)
9)
10) In a survey, 71% of 1052 adults polled answered "Yes" to the question "Do you believe the theory that increased carbon dioxide and other gases released into the atmosphere will, if unchecked, lead to global warming and an increase in average temperatures?" The sample consists of A) all American adults B) the proportion of adults polled who responded "Yes" to the question C) the proportion of American adults who are predicted to respond "Yes" to the question D) the 1052 polled adults E) 71% of American adults
10)
11) Making decisions and predictions based on the data refers to which aspect of statistics? A) Design B) Sampling C) None of these D) Inference E) Description
11)
Copyright © 2017 Pearson Education, Inc. 3
12) The following statement refers to which aspect of a statistical study: "From past figures, it is predicted that 47% of the registered voters in Virginia will vote in the June primaryʺ.? A) Design B) Description C) Inference
12)
13) In a survey, 71% of 1052 adults polled nationwide answered "Yes" to the question "Do you believe the theory that increased carbon dioxide and other gases released into the atmosphere will, if unchecked, lead to global warming and an increase in average temperatures?" The predicted proportion of all American adults who would respond "Yes" to the question is an example of A) a parameter B) the population C) random sampling D) the sample E) a statistic
13)
14) A survey asks "would you like to see more or less government spending on natural disasters?" Of the 1496 respondents, 723 responded "more" or "much more". The population of interest consists of A) the 723 respondents who responded "more" or "much more" B) the proportion of respondents who responded "more" or "much more" C) the proportion of American adults who would respond "more" or "much more" D) all American adults E) the 1496 respondents
14)
15) The following statement refers to which aspect of a statistical study: "Based on previous clients, a marriage counselor concludes that the majority of marriages that begin with cohabitation before marriage will result in divorce"? A) Description B) Design C) Inference
15)
16) A manufacturer of cellular phones has decided that an assembly line is operating satisfactorily if less than 6% of the phones manufactured per day are defective. To check the quality of a day's production, the company decides to randomly sample 30 phones from a day's production and test for defects. Define the population of interest to the manufacturer. A) all defective cellular phones manufactured by the company B) the 6% of the cellular phones that were defective C) the 30 cellular phones that were sampled and tested D) all cellular phones manufactured during the day in question E) the 30 responses: defective or not defective
16)
17) The following statement refers to which aspect of a statistical study: "Based on a study of 25 hospitals nationwide, researchers have concluded that there is a relationship between smoking cigarettes and contracting emphysema"? A) Description B) Inference C) Design
17)
Copyright © 2017 Pearson Education, Inc. 4
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 18) A ____________________ is a numerical summary of the population; while a ____________________ is a numerical summary of the sample.
18)
19) The basis of ____________________ is that each subject in the population has the same chance of being in the sample.
19)
20) _____________________ is the art and science of learning from data.
20)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Answer true or false. 21) A lobbyist for a major airspace firm assigns a number to each legislator and then uses a computer to randomly generate ten numbers. The lobbyist contacts the legislators corresponding to these numbers. This technique produces a random sample. A) True B) False
21)
22) Inferential statistics are used when data are available only for a sample; however, descriptive statistics are used when data are available for either a sample or a population. A) True B) False
22)
23) A lobbyist for a major airspace firm wants to get the opinion of state legislators on an issue of importance to his industry. The lobbyist contacts 10 state legislators at a restaurant during a lunch break and each one is polled about the issue. This technique produces a random sample. A) False B) True
23)
24) In a recent television survey, participants were asked to answer ʺyesʺ or ʺnoʺ to the question ʺAre you in favor of the death penalty?ʺ. Six thousand five hundred responded ʺyesʺ while 3700 responded ʺnoʺ. There was a fifty-cent charge for the call. The sampling technique used produces a random sample. A) True B) False
24)
Copyright © 2017 Pearson Education, Inc. 5
The owners of a coffee shop conducted a taste test to determine whether its customers preferred a new coffee brand to the current one sold by the shop. Customers who were willing to participate were given small samples of each of the two brands in random order and were asked to select which one they preferred without knowing the brand. Of the 100 participating customers, 90% chose the new brand. Based on these results, the owners determined that a majority of their customers preferred the new brand and therefore switched their coffee supplier. 25) Predicting the preference of all of the coffee shop customers based on the taste test 25) results refers to which aspect of statistics? A) Design B) Description C) Investigation D) Inference E) None of these
Copyright © 2017 Pearson Education, Inc. 6
Answer Key Testname: CHAPTER 1 FORM B TEST
1) No. Those who respond to this question are unlikely to be representative of the general population for two reasons. First, listeners to the show are unlikely to be representative of the general population, as they are likely to share the views of the radio show host, and may not represent a wide spectrum of views. Second, those listeners who respond voluntarily to the question are likely to be those who feel the most strongly about the issue. Their views are unlikely to be representative of all listeners to the show let alone of the general population. 2) The population is the total set of subjects in which we are interested. A sample is the subset of the population for whom we have data. 3) No, it would not be surprising to find that only 15% of students smoked in a sample of size 20. When the sample size is small, there is a lot of variability in the sample proportions and a sample proportion can easily lie far from the population proportion. In this case, it would not be reasonable to assume that there had been a decrease in the proportion of students smoking regularly. Yes, it would be surprising to find that only 15% of students smoked in a sample of size 200. When the sample size is large, the sample proportions tend to lie closer to the population proportion. If the proportion of students smoking regularly were still 25%, it would be surprising to find that only 15% of students smoked in a sample of size 200. In this case, it would be reasonable to assume that there had been a decrease in the proportion of students smoking regularly. 4) B 5) Statistics is the art and science of learning from data. 6) It is estimated that between 28 and 34 percent of registered voters nationwide feel that the Republican party can do a better job of handling immigration issues. 7) E 8) B 9) A 10) A 11) D 12) C 13) A 14) D 15) C 16) D 17) B 18) parameter; statistic 19) random sampling 20) Statistics 21) A 22) A 23) A 24) B 25) D
Copyright © 2017 Pearson Education, Inc. 7
CHAPTER 2 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. The heights (in inches) of 30 adult males are listed below. A frequency distribution show the frequency and relative frequency using five classes. 70 72 71 70 69 73 69 68 70 71 67 71 70 74 69 68 71 71 71 72 69 71 68 67 73 74 70 71 69 68 Height (in inches) 67.0-68.4 68.5-69.9 70.0-71.4 71.5-72.9 73.0-74.4
Frequency 6 5 13 2 4
Relative Frequency 0.20 0.167 0.433 0.067 0.133
1) Which category of heights represents the mode? A) 67.0-68.4 B) 68.5-69.9 C) 71.5-72.9 D) 70.0-71.4 E) 73.0-74.4
1)
2) What proportion of the 30 adult males had heights less than 70 inches? A) 16.7% B) 36.7 C) 0.367 D) 0.167
2) E) 0.433
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 3) A recent survey investigated exposure to tobacco and alcohol use in a series of G-rated animated films. Data on the total tobacco exposure time (in seconds) is below. 223 165
176 74
548 9
37 2
158 6
51 23
299 206
37 9
Find the Five-Number Summary of Positions.
Copyright © 2017 Pearson Education, Inc. 1
11
3)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 4) Test scores for a history class had a mean of 79 with a standard deviation of 4.5. Test scores for a physics class had a mean of 69 with a standard deviation of 3.7. Suppose a student gets a 68 on the history test and a 87 on the physics test. Calculate the z-score for each test. On which test did the student perform better? A) physics; -2.44 B) history; 4.86 C) physics; 4.86 D) history; 2.44 E) history; -2.44
4)
5) Parking at a large university has become a major issue. University administrators would like to determine the average time it takes a student to find a parking spot in a university lot. Students who are willing to participate in the study were asked to record the time between entering campus and pulling into a parking spot. Which of the following would not be appropriate for displaying the parking time data? A) Pie chart B) Stem-and-leaf plot C) Box plot D) Histogram E) None of these should be used.
5)
6) A competency test has scores with a mean of 69 and a standard deviation of 4. A histogram of the data shows that the distribution is normal. Use the Empirical Rule to find the percentage of scores between 61 and 77. A) 68% B) 77% C) 50% D) 95% E) 99.7%
6)
Copyright © 2017 Pearson Education, Inc. 2
7) Twenty-four workers were surveyed and asked how long it takes them to travel to work each day. The data below are given in minutes. 20 35 42 52 65 20 60 49 24 37 23 24 22 20 41 25 28 27 50 47 58 30 32 48 Which of the following shows the data in a stem-and-leaf plot? A) 2 0002344578 3 0257 4 12789 5 028 6 05 B) 2 0002344578 3 0257 4 12789 5 028 6 0 C) 2 002344578 3 0257 4 12789 5 028 6 05 D) 2 00002344578 3 0257 4 12789 5 028 6 05 E) 2 000234457 3 02578 4 12789 5 028 6 05
Copyright © 2017 Pearson Education, Inc. 3
7)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 8) The following frequency histogram provides average SO 2 (sulfur dioxide) emission rates from utility and industrial boilers (lb/million Btu) for 47 states (data for Idaho, Alaska, and Hawaii omitted).
8)
a. Identify the intervals of emission rates used for the plot. b. Describe the shape of the distribution. c. What information can you get from the dot plot or stem-and-leaf plot of these data that you cannot get from this plot? d. This histogram shows frequencies. If you were to construct a histogram using the percentages for each interval, how (if at all) would the shape of this histogram change? 9) The table below summarizes total enrollment and female enrollment at a pilot training college for the years 2005 through 2012. The table has been used to construct two different graphs displayed below the table. Summarize the information that is available from each of the graphs and discuss the advantages and disadvantages of each graph. Enrollment at Pilot Training College
Copyright © 2017 Pearson Education, Inc. 4
9)
Year Total Number Number of of Students Female Students 2005 283 20 2006 275 22 2007 265 22 2008 258 24 2009 252 25 2010 248 25 2011 245 27 2012 242 28 Enrollment at Pilot Training College 300
Number of Students
250 200 150 100 50
1
2 3 4 5 6 Years Since 2004
7
8
- - - - Total enrollment Female enrollment
Female Enrollment at Pilot Training College
Copyright © 2017 Pearson Education, Inc. 5
Number of Female Students
40 32 24 16 8
1
2
3 4 5 6 Years Since 2004
7
8
10) A sample of 324 randomly selected doctors was asked to indicate the category that best described how often they used the Internet. The results follow.
10)
Internet Usage Pattern Count Never 31 Rarely (about 3 times per year) 15 Occasionally (about once a month) 52 Often (about once a week) 109 Daily 117 a. Construct a pie chart for these data. b. In creating a bar graph of these data, would it be more useful to list the patterns as given in the table above or in the order of a Pareto chart? MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 11) Brandon kept track of the number of hours he spent exercising each week for four months. The results are shown below. Find the mean number of hours Brandon spent exercising per week. Round your answer to two decimal places. 7.50 8.20 7.10 7.90 8.00 7.50 7.80 7.10 7.30 7.50 7.90 8.90 7.10 8.20 8.20 8.20 8.00 7.80 A) 8.01
B) 7.30
C) 8.25
D) 7.79
Copyright © 2017 Pearson Education, Inc. 6
E) 7.38
11)
12) Each year advertisers spend billions of dollars purchasing commercial time on network sports television. A recent article listed the top 10 leading spenders (in millions of dollars) over a 6 month period: Company A Company B Company C Company D Company E
$72.0 63.1 54.7 54.3 29.0
Company F Company G Company H Company I Company J
12)
$26.9 25.0 23.9 23.0 20.0
Which of the following graphs would not be appropriate for displaying this data? A) Stem-and-leaf plot B) Histogram C) Pie chart D) Dot plot E) None of these should be used. A graphical display of a data set is given. Identify the overall shape of the distribution. 13) The ages of a group of patients being treated at one hospital for osteoporosis are summarized in the frequency histogram below.
Which of the following best describes the shape of the distribution? A) Skewed to the right B) Skewed to the left C) Symmetric D) Multimodal E) Bimodal
Copyright © 2017 Pearson Education, Inc. 7
13)
Provide an appropriate response. 14) A safety engineer wishes to use the following data to show the number of deaths in a year from the collision of passenger cars with trucks on a particular highway. Year 1 2 3 4 5 6 7 8
14)
Number of Deaths 12 17 22 21 16 13 11 12
What is the mode of the number of deaths? A) 22 B) 12 C) 15.5
D) 16
E) 13
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 15) The table below shows the unemployment rate in one city from 2003 to 2012.
15)
Year 2003 2004 2005 2006 2007 2008 2009 2010 2 Unemployment Rate (Percent) 5.90 5.78 5.45 5.28 5.06 4.88 4.80 4.63 4 a. Construct a time plot for these data. b. Is there a trend? If so, what kind? c. Would a histogram more clearly describe the above dataset? Explain. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 16) Use the following summary information for a data set of 100 observations to determine whether the data set is likely to be bell-shaped, skewed to the right or skewed to the left. Mean = 120, s=22, Minimum=103, Maximum=170 A) skewed to the left B) bell-shaped C) skewed to the right D) unable to determine from the information given
Copyright © 2017 Pearson Education, Inc. 8
16)
17) The following is a time plot of wine exports (in millions of gallons) in a certain country for the past 15 years. Is there a trend evident in the data?
17)
A) yes, increasing trend B) no trend evident C) yes, decreasing trend A sample of fifty motorists was taken on a Federal highway where the speed limit was 60 miles per hour. A dot plot of their speeds is shown below.
18) What is the mode for speed? A) 70 B) 60 C) 7 D) 55 E) none of these
18)
19) What proportion of the motorists were speeding? A) 2 B) 0.22 C) 0.72
19) D) 0.04
E) 0.18
Answer true or false. 20) Bar graphs and pie charts are graphical methods that are often used in summarizing quantitative data. A) True B) False
Copyright © 2017 Pearson Education, Inc. 9
20)
Select the most appropriate answer. 21) Which of the following is a discrete variable? A) time it takes to drive to work B) weight of a newborn baby C) none of these D) number of phones per household E) amount of coffee in an 8-ounce cup
21)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 22) A variable is called ____________________ if each observation belongs to one of a set of categories. 23) The ____________________ is the balance point of the data values; while, the _____________________ is the midpoint of the ordered data values.
22)
23)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A sporting goods retailer conducted a customer survey to determine its customers primary reason for shopping at their store. The results are shown in the graph below.
24) Is the variable "reason for shopping at our store" categorical or quantitative? A) Categorical B) Quantitative
24)
The following data show the number of laps run by each participant in a timed running race: 46 65 55 43 51 48 57 30 43 49 32 56 25) If the stems are 3, 4, 5 and 6, how many leaves are on the "4 stem"? A) 4 B) 5 C) 1 D) 0
Copyright © 2017 Pearson Education, Inc. 10
25)
Answer Key Testname: CHAPTER 2 FORM A TEST
1) D 2) C 3) minimum = 2 seconds, Q1 = 10 seconds, median = 51 seconds, Q3 = 191 seconds, and maximum = 548 seconds 4) C 5) A 6) D 7) A 8) a. 0 to 0.49, 0.5 to 0.99, 1.0 to 1.49, 1.5 to 1.99, 2.0 to 2.49, 2.5 to 2.99, 3.0 to 3.49, 3.5 to 3.99, 4.0 to 4.49, 4.5 to 4.99; b. The distribution is skewed to the right. c. You can get the actual data values from a dot plot or stem-and-leaf plot. d. The shape would not change. 9) The first graph shows the total numbers of students for each year as well as the number of female students. We can see the downward trend in overall enrollment, the slight upward trend in female enrollment and that female enrollment is small relative to total enrollment. However, with both total and female enrollment on the same graph, since female enrollment is small relative to total enrollment, the scale is not suitable for female enrollment and the upward trend in female enrollment is not very clear. This upward trend is much clearer from the second graph which shows female enrollment alone, However this graph gives no indication of how female enrollment compares to total enrollment. 10) a.
b.
Since the categories of Internet usage pattern have a natural order from never to daily, it makes more sense to leave the categories in this natural order rather than ordering them from the tallest bar to the shortest bar.
11) D 12) C 13) B 14) B
Copyright © 2017 Pearson Education, Inc. 11
Answer Key Testname: CHAPTER 2 FORM A TEST
15) a.
Unemployment Rate 2003-2012
Unemployment Rate (Percent)
2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 Year b. There is a clear decreasing trend over time; c. No, a histogram would not depict the trend in this dataset. 16) C 17) A 18) D 19) E 20) B 21) D 22) categorical 23) mean; median 24) A 25) B
Copyright © 2017 Pearson Education, Inc. 12
CHAPTER 2 FORM B TEST Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) The following frequency histogram provides average SO 2 (sulfur dioxide) emission rates from utility and industrial boilers (lb/million Btu) for 47 states (data for Idaho, Alaska, and Hawaii omitted).
a. Identify the intervals of emission rates used for the plot. b. Describe the shape of the distribution. c. What information can you get from the dot plot or stem-and-leaf plot of these data that you cannot get from this plot? d. This histogram shows frequencies. If you were to construct a histogram using the percentages for each interval, how (if at all) would the shape of this histogram change?
Copyright © 2017 Pearson Education, Inc. 1
1)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 2) Parking at a large university has become a major issue. University administrators would like to determine the average time it takes a student to find a parking spot in a university lot. Students who are willing to participate in the study were asked to record the time between entering campus and pulling into a parking spot. Which of the following would not be appropriate for displaying the parking time data? A) Stem-and-leaf plot B) Histogram C) None of these should be used. D) Box plot E) Pie chart
2)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 3) A sample of 324 randomly selected doctors was asked to indicate the category that best described how often they used the Internet. The results follow.
3)
Internet Usage Pattern Count Never 31 Rarely (about 3 times per year) 15 Occasionally (about once a month) 52 Often (about once a week) 109 Daily 117 a. Construct a pie chart for these data. b. In creating a bar graph of these data, would it be more useful to list the patterns as given in the table above or in the order of a Pareto chart? 4) The table below shows the unemployment rate in one city from 2003 to 2012. Year 2003 2004 2005 2006 2007 2008 2009 2010 2 Unemployment Rate (Percent) 5.90 5.78 5.45 5.28 5.06 4.88 4.80 4.63 4 a. Construct a time plot for these data. b. Is there a trend? If so, what kind? c. Would a histogram more clearly describe the above dataset? Explain.
Copyright © 2017 Pearson Education, Inc. 2
4)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 5) Use the following summary information for a data set of 100 observations to 5) determine whether the data set is likely to be bell - shaped, skewed to th right or skewed to the left. Mean = 120, s=22, Minimum=37, Maximum=136 A) bell-shaped B) skewed to the left C) skewed to the right D) unable to determine from the information given 6) Brandon kept track of the number of hours he spent exercising each week for four months. The results are shown below. Find the mean number of hours Brandon spent exercising per week. Round your answer to two decimal places.
6)
7.50 8.20 7.10 7.90 8.00 7.50 7.80 7.10 7.30 7.50 7.90 8.90 7.10 8.20 8.20 8.20 8.00 7.80 A) 7.38
B) 8.01
C) 7.30
D) 7.79
E) 8.25
7) Use the following summary information for a data set of 100 observations to determine whether the data set is likely to be bell-shaped, skewed to the right or skewed to the left. Mean = 120, s=22, Minimum=103, Maximum=170 A) skewed to the right B) bell-shaped C) unable to determine from the information given D) skewed to the left
Copyright © 2017 Pearson Education, Inc. 3
7)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 8) The following data represent the number of grams of fat in various breakfast foods.
8)
Breakfast Food Fat (in grams) Muffin and egg sandwich 12 Muffin, egg, and ham sandwich 22 Muffin, egg, and bacon sandwich 27 Muffin and sausage sandwich 22 Bagel, egg, and ham sandwich 25 Bagel, egg, and bacon sandwich 30 Bagel, egg, and sausage sandwich 32 Bagel, egg, sausage, and cheese sandwich 37 Bagel, egg, ham, and cheese sandwich 27 Bagel, egg, bacon, and cheese sandwich 31 Bagel 11 Pancakes platter 16 Pancakes and eggs platter 21 Pancakes, eggs, and bacon platter 32 Yogurt 2 Construct a dot plot for these data. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 9) SAT verbal scores are normally distributed with a mean of 433 and a standard deviation of 90. Use the Empirical Rule to determine what percent of the scores lie between 433 and 523. A) 47.5% B) 68% C) 51% D) 49.9% E) 34%
9)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 10) The table below summarizes total enrollment and female enrollment at a pilot training college for the years 2005 through 2012. The table has been used to construct two different graphs displayed below the table. Summarize the information that is available from each of the graphs and discuss the advantages and disadvantages of each graph. Enrollment at Pilot Training College Year Total Number Number of of Students Female Students 2005 283 20 2006 275 22 2007 265 22 2008 258 24 2009 252 25 2010 248 25 2011 245 27 2012 242 28 Copyright © 2017 Pearson Education, Inc. 4
10)
Enrollment at Pilot Training College 300
Number of Students
250 200 150 100 50
1
2 3 4 5 6 Years Since 2004
7
8
- - - - Total enrollment Female enrollment
Female Enrollment at Pilot Training College
Number of Female Students
40 32 24 16 8
1
2
3 4 5 6 Years Since 2004
7
8
Copyright © 2017 Pearson Education, Inc. 5
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 11) Each year advertisers spend billions of dollars purchasing commercial time on network sports television. A recent article listed the top 10 leading spenders (in millions of dollars) over a 6 month period: Company A Company B Company C Company D Company E
$72.0 63.1 54.7 54.3 29.0
Company F Company G Company H Company I Company J
$26.9 25.0 23.9 23.0 20.0
Which of the following graphs would not be appropriate for displaying this data? A) None of these should be used. B) Pie chart C) Stem-and-leaf plot D) Histogram E) Dot plot
Copyright © 2017 Pearson Education, Inc. 6
11)
12) Twenty-four workers were surveyed and asked how long it takes them to travel to work each day. The data below are given in minutes.
12)
20 35 42 52 65 20 60 49 24 37 23 24 22 20 41 25 28 27 50 47 58 30 32 48 Which of the following shows the data in a stem-and-leaf plot? A) 2 000234457 3 02578 4 12789 5 028 6 05 B) 2 002344578 3 0257 4 12789 5 028 6 05 C) 2 0002344578 3 0257 4 12789 5 028 6 0 D) 2 0002344578 3 0257 4 12789 5 028 6 05 E) 2 00002344578 3 0257 4 12789 5 028 6 05 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 13) A recent survey investigated exposure to tobacco and alcohol use in a series of G-rated animated films. Data on the total tobacco exposure time (in seconds) is below. 223 165
176 74
548 9
37 2
158 6
51 23
299 206
37 9
Find the Five-Number Summary of Positions.
Copyright © 2017 Pearson Education, Inc. 7
11
13)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 14) A safety engineer wishes to use the following data to show the number of deaths in a year from the collision of passenger cars with trucks on a particular highway. Year 1 2 3 4 5 6 7 8
14)
Number of Deaths 12 17 22 21 16 13 11 12
What is the mode of the number of deaths? A) 13 B) 22 C) 16
D) 15.5
E) 12
15) The following is a time plot of wine exports (in millions of gallons) in a certain country for the past 15 years. Is there a trend evident in the data?
15)
A) yes, decreasing trend B) no trend evident C) yes, increasing trend The following data show the number of laps run by each participant in a timed running race: 46 65 55 43 51 48 57 30 43 49 32 56 16) If the stems are 3, 4, 5 and 6, how many leaves are on the "4 stem"? A) 0 B) 5 C) 1 D) 4
Copyright © 2017 Pearson Education, Inc. 8
16)
Select the most appropriate answer. 17) Which of the following is a discrete variable? A) weight of a newborn baby B) none of these C) number of phones per household D) amount of coffee in an 8-ounce cup E) time it takes to drive to work
17)
Classify as categorical or qualitative data. 18) A survey of automobiles parked in the student and staff lots at a large college recorded the make and model of the automobiles. The variable ʺmakeʺ is: A) Quantitative B) Categorical
18)
A sample of fifty motorists was taken on a Federal highway where the speed limit was 60 miles per hour. A dot plot of their speeds is shown below.
19) What is the mode for speed? A) 70 B) 55 C) 60 D) 7 E) none of these
19)
Answer true or false. 20) Bar graphs and pie charts are graphical methods that are often used in summarizing quantitative data. A) True B) False
20)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 21) A variable is called ____________________ if each observation belongs to one of a set of categories. 22) The ____________________ is the balance point of the data values; while, the _____________________ is the midpoint of the ordered data values.
Copyright © 2017 Pearson Education, Inc. 9
21)
22)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A sporting goods retailer conducted a customer survey to determine its customers primary reason for shopping at their store. The results are shown in the graph below.
23) Is the variable ʺreason for shopping at our storeʺ categorical or quantitative? A) Quantitative B) Categorical
23)
The heights (in inches) of 30 adult males are listed below. A frequency distribution show the frequency and relative frequency using five classes. 70 72 71 70 69 73 69 68 70 71 67 71 70 74 69 68 71 71 71 72 69 71 68 67 73 74 70 71 69 68 Height (in inches) 67.0-68.4 68.5-69.9 70.0-71.4 71.5-72.9 73.0-74.4
Frequency 6 5 13 2 4
Relative Frequency 0.20 0.167 0.433 0.067 0.133
24) What proportion of the 30 adult males had heights less than 70 inches? A) 0.367 B) 36.7 C) 16.7% D) 0.433 25) Which category of heights represents the mode? A) 70.0-71.4 B) 67.0-68.4 C) 68.5-69.9 D) 73.0-74.4 E) 71.5-72.9
Copyright © 2017 Pearson Education, Inc. 10
24) E) 0.167 25)
Answer Key Testname: CHAPTER 2 FORM B TEST
1) a. 0 to 0.49, 0.5 to 0.99, 1.0 to 1.49, 1.5 to 1.99, 2.0 to 2.49, 2.5 to 2.99, 3.0 to 3.49, 3.5 to 3.99, 4.0 to 4.49, 4.5 to 4.99; b. The distribution is skewed to the right. c. You can get the actual data values from a dot plot or stem-and-leaf plot. d. The shape would not change. 2) E 3) a.
b.
Since the categories of Internet usage pattern have a natural order from never to daily, it makes more sense to leave the categories in this natural order rather than ordering them from the tallest bar to the shortest bar.
Copyright © 2017 Pearson Education, Inc. 11
Answer Key Testname: CHAPTER 2 FORM B TEST
4) a.
Unemployment Rate 2003-2012
Unemployment Rate (Percent)
2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 Year b. There is a clear decreasing trend over time; c. No, a histogram would not depict the trend in this dataset. 5) B 6) D 7) A 8)
9) E 10) The first graph shows the total numbers of students for each year as well as the number of female students. We can see the downward trend in overall enrollment, the slight upward trend in female enrollment and that female enrollment is small relative to total enrollment. However, with both total and female enrollment on the same graph, since female enrollment is small relative to total enrollment, the scale is not suitable for female enrollment and the upward trend in female enrollment is not very clear. This upward trend is much clearer from the second graph which shows female enrollment alone, However this graph gives no indication of how female enrollment compares to total enrollment. 11) B 12) D 13) minimum = 2 seconds, Q1 = 10 seconds, median = 51 seconds, Q3 = 191 seconds, and maximum = 548 seconds 14) E 15) C 16) B 17) C 18) B 19) B Copyright © 2017 Pearson Education, Inc. 12
Answer Key Testname: CHAPTER 2 FORM B TEST
20) B 21) categorical 22) mean; median 23) B 24) A 25) A
Copyright © 2017 Pearson Education, Inc. 13
CHAPTER 3 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Answer true or false. 1) The value of the correlation is always between 0 and 1. A) True B) False
1)
Provide an appropriate response. 2) In order for a data point to be considered influential, which of the following must hold? I) the point has a large residual for the regression line fitted including that point II) its x value must be relatively low or high compared to the rest of the data III) the point has a large residual for the regression line fitted without using that data point A) I and II B) III only C) II only D) I only Determine which plot shows the strongest linear correlation. 3) A)
2)
E) II and III
3) B)
y
y
x
x
C)
D) y
y
x
x
Copyright © 2017 Pearson Education, Inc. 1
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 4) When two explanatory variables are both associated with a response variable but are also associated with each other, there is said to be ____________________.
4)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Answer true or false. 5) If the absolute value of the correlation is approximately one, then the points lie close to a line that slopes upward or downward. A) True B) False 6) The advantage of a side-by-side bar graph is that it allows for easy comparison of the explanatory variable groups with respect to values on the response variable. A) False B) True SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 7) A researcher is investigating the association between blood pressure and "workaholism" in a certain population. She classifies someone who works more than 60 hours per week as a workaholic. She records the income level and blood pressure (high or normal) for each participant and whether or not they can be classified as a "workaholic." The data are summarized in the table below.
Workaholic Yes No
Low HBP NBP 25 75 25 80
Income Group Middle HBP NBP 23 87 18 72
7)
High HBP NBP 26 134 9 51
Consider each income group separately and find the proportion with high blood pressure among the workaholics and the proportion with high blood pressure among the non-workaholics. Then determine the proportion of workaholics overall who have high blood pressure and the proportion of non-workaholics overall who have high blood pressure (ignoring information about income group). Explain how these data satisfy Simpson's paradox. How would you explain what is responsible for this result? 8) A large manufacturer hires many handicapped workers and keeps track of both their type of handicap and their level of performance. a. b.
Identify the two variables. Identify the response variable and the explanatory variable.
Copyright © 2017 Pearson Education, Inc. 2
8)
5)
6)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 9) In a negative association between two quantitative variables, A) none of these. B) y tends to increase as x decreases. C) y tends to increase as x increases. D) the movement of x does not affect the movement of y. E) y tends to decrease as x decreases.
9)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Solve the problem. 10) Use a scatterplot to display the data below. All measurements are in milligrams per cigarette.
10)
Brand Tar Nicotine A 16 1.2 B 13 1.1 C 16 1.2 D 18 1.4 E 6 0.6
Construct a scatterplot for the data. 11) Below are the gold medal performances in the men's high jump at an athletic competition held in one city from 2006 to 2012.
11)
Year High Jump (in.) 2006 85.3 2007 85.8 2008 88.3 2009 87.8 2010 88.5 2011 92.5 2012 92.3
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 12) A researcher records for each participant in a clinical trial the amount of alcohol they consume per month and their cholesterol level. Indicate whether each variable is categorical or quantitative. A) alcohol: categorical; cholesterol level: quantitative B) alcohol: categorical; cholesterol level: categorical C) alcohol: quantitative; cholesterol level: quantitative D) alcohol: quantitative; cholesterol level: categorical Copyright © 2017 Pearson Education, Inc. 3
12)
Select the most appropriate answer. 13) The ____________________ is the outcome variable on which comparisons are made. A) explanatory variable B) Both B and C C) response variable D) lurking variable E) predictor variable Provide an appropriate response. 14) The relationship between the number of games won by a minor league baseball team and the average attendance at their home games is analyzed. A regression to predict the average attendance from the number of games won has an r = 0.73. Interpret this statistic. A) No association B) Positive, fairly strong linear relationship. 73% of the variation in average attendance is explained by the number of games won. C) Negative, fairly strong linear relationship. 53.29% of the variation in average attendance is explained by the number of games won. D) Positive, fairly strong linear relationship. 53.29% of the variation in average attendance is explained by the number of games won. E) Positive, weak linear relationship. 7.29% of the variation in average attendance is explained by the number of games won. 15) A researcher recorded for each of a number of recent high school and college graduates the number of years of education and their annual salary when they started their first job. The scatter plot is shown below together with the regression equation. The point (14, 4) was excluded when obtaining the regression equation.
Salary (Thousands of Dollars)
y 48
32
16
4 8 12 16 Years of Education
20
x
Is the point (14, 4) an outlier on x? Is it an outlier on y? Is it a regression outlier? A) not an outlier on x, outlier on y, regression outlier B) not an outlier on x, outlier on y, not a regression outlier C) not an outlier on x, not an outlier on y, regression outlier D) outlier on x, outlier on y, regression outlier
Copyright © 2017 Pearson Education, Inc. 4
13)
14)
15)
^
16) A regression line for predicting the selling prices of homes in Chicago is y = 168 + 102x, where x is the square footage of the house. Interpret the residual for a house with 1800 square feet that recently sold for $200,000. A) The house sold for $16,232 less than was to be expected from the regression equation. B) The house sold for $16,400 less than was to be expected from the regression equation. C) The house sold for $16,400 more than was to be expected from the regression equation. D) The house sold for $16,064 more than was to be expected from the regression equation. E) The house sold for $16,232 more than was to be expected from the regression equation. Fill in the missing information. _ _ x s y sy r x 17) ?
?
18
^
y = a + bx
16)
17)
^
4 -0.5 y = 30 - 4x
A) x = 48; sx = -18.00 B) x = 3; sx = 1.00 C) x = 12; sx = 2.00 D) x = 3; sx = 0.50 E) x = 12; sx = 1.00 Provide an appropriate response. 18) For 14 baseball teams, the correlation with number of wins in the regular season is 0.51 for shutouts, 0.61 for hits made, -0.70 for runs allowed and -0.56 for homeruns allowed. Which variable has the weakest linear association with number of wins? A) homeruns allowed B) hits made C) runs allowed D) shutouts 19) Almost all of the acidity of soda pop comes from the phosphoric acid which is added to give them a sharper flavor. Is there an association between the pH of the soda and the amount of phosphoric acid (in grams)? The correlation between pH and phosphoric acid is -0.991. Describe the association. A) Very strong linear association in a negative direction B) Strong linear association in a positive direction C) No evidence of association D) Weak linear association in a positive direction E) Weak linear association in a negative direction ^
20) A regression line for predicting Internet usage (%) for 39 countries is y = -3.61 + 1.55x, where x is the per capita GDP, in thousands of dollars, and y is Internet usage. Interpret the residual for one of the 39 countries with per capita GDP of $15,000 and actual Internet use of 20 percent. A) The actual Internet usage for this country is 0.36% higher than expected from the regression equation. B) The actual Internet usage for this country is 3.6% higher than expected from the regression equation. C) The actual Internet usage for this country is 0.36% lower than expected from the regression equation. D) The actual Internet usage for this country is 3.25% higher than expected from the regression equation. E) The actual Internet usage for this country is 3.25% lower than expected from the regression equation.
Copyright © 2017 Pearson Education, Inc. 5
18)
19)
20)
21) A teacher recorded for each of five students their score on a quiz (x) and their score on a project (y). A scatter plot of the points and the corresponding regression line are shown below. 21
21)
y
Score on Project
18 15 12 9 6 3 2 4 6 8 10 12 14 16 18 20 Score on Quiz
x
Is the point (18, 19) an outlier on x? Is it an outlier on y? Is it a regression outlier? A) outlier on x, not an outlier on y, not a regression outlier B) outlier on x, outlier on y, regression outlier C) outlier on x, outlier on y, not a regression outlier D) not an outlier on x, not an outlier on y, not a regression outlier SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 22) The fact that the direction of an association between two variables can change after we include a third variable and analyze the data at separate levels of that variable is known as ____________________.
22)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Determine the type of association apparent in the following scatterplot. 23)
A) Linear association, moderately strong association B) Moderately strong association, negative association C) Little or no association D) Negative association, linear association E) Linear association Copyright © 2017 Pearson Education, Inc. 6
23)
The following scatterplot shows the percentage of the vote a presidential candidate received in an election according to the voter's income level based on an exit poll of voters. The income levels 1-8 correspond to the following income classes: 1=Under $15,000; 2=$15-30,000; 3=$30-50,000; 4=$50-75,000; 5=$75-100,000; 6=$100-150,000; 7=$150-200,000; 8=$200,000 or more.
24) Which of the following describes the linear association between income level and percentage of the vote won by the candidate? A) weak, positive B) no linear association C) weak, negative D) strong, negative E) strong, positive SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 25) A recent survey polled a sample of male Mexican-American adolescents to determine if there is an association between smoking and unhealthy behaviors. Below is the data. Each boy, age 10-18, was classified according to smoking status and his response to a question asking whether he liked to do risky things.
Likes Risky Things Doesnʹt Like Risky Things
Smoking Status Smoker Nonsmoker 45 46 36 153
a. Create a side-by-side bar graph that compares smoking status with respect to risky things status. b. Summarize the results of the side-by-side bar graph. c. Describe how the graph would look if there was not an association between smoking and unhealthy behaviors among Mexican -American male adolescents.
Copyright © 2017 Pearson Education, Inc. 7
25)
24)
Answer Key Testname: CHAPTER 3 FORM A TEST
1) B 2) E 3) A 4) confounding 5) A 6) B 7) For low income: 25% high blood pressure for workaholics versus 23.8% high blood pressure for non-workaholics For middle income: 20.9% high blood pressure for workaholics versus 20% high blood pressure for non-workaholics For high income: 16.3% high blood pressure for workaholics versus 15% high blood pressure for non-workaholics All income groups combined: 20% high blood pressure for workaholics.versus 20.4% for non-workaholics Within each income group, the percentage with high blood pressure is higher for workaholics. However, when income groups are combined the percentage with high blood pressure is higher for non-workaholics. Answers may vary. Possible explanation: When the income groups are combined, workaholism becomes confounded with income level. Many of the workaholics are in the high-income group and high incomes bring greater access to other options which tend to lower blood pressure, such as health care, a healthy diet, gym membership, retreats, massage, etc. The large number of high-income people among the workaholics reduces the percentage with high blood pressure in the workaholic group. 8) a. The two variables are type of handicap and level of performance; b. response variable = level of performance, explanatory variable = type of handicap 9) B 10)
Copyright © 2017 Pearson Education, Inc. 8
Answer Key Testname: CHAPTER 3 FORM A TEST
11)
2006 2007 2008 2009 2010 2011 2012 Year 12) C 13) C 14) D 15) A 16) E 17) D 18) D 19) A 20) A 21) C 22) Simpson's paradox 23) B 24) D
Copyright © 2017 Pearson Education, Inc. 9
Answer Key Testname: CHAPTER 3 FORM A TEST
25) a.
b. The graph clearly shows that the proportion of Mexican-American male adolescents that likes risky things is higher for those Mexican-American male adolescents who are smokers. The proportion of Mexican-American male adolescent smokers that likes risky things is almost two and a half times as high as the proportion of Mexican-American male adolescent nonsmokers that likes risky things; c. If there was not an association, the two bars in the ʺlikes risky thingsʺ group would be the same height and the two bars in the ʺdoesnʹt like risky thingsʺ group would be the same height.
Copyright © 2017 Pearson Education, Inc. 10
CHAPTER 3 FORM B TEST Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) A researcher is investigating the association between blood pressure and "workaholism" in a certain population. She classifies someone who works more than 60 hours per week as a workaholic. She records the income level and blood pressure (high or normal) for each participant and whether or not they can be classified as a "workaholic." The data are summarized in the table below.
Workaholic Yes No
Low HBP NBP 25 75 25 80
Income Group Middle HBP NBP 23 87 18 72
1)
High HBP NBP 26 134 9 51
Consider each income group separately and find the proportion with high blood pressure among the workaholics and the proportion with high blood pressure among the non-workaholics. Then determine the proportion of workaholics overall who have high blood pressure and the proportion of non-workaholics overall who have high blood pressure (ignoring information about income group). Explain how these data satisfy Simpson's paradox. How would you explain what is responsible for this result? 2) A recent survey polled a sample of male Mexican-American adolescents to determine if there is an association between smoking and unhealthy behaviors. Below is the data. Each boy, age 10-18, was classified according to smoking status and his response to a question asking whether he liked to do risky things.
Likes Risky Things Doesnʹt Like Risky Things
2)
Smoking Status Smoker Nonsmoker 45 46 36 153
a. Create a side-by-side bar graph that compares smoking status with respect to risky things status. b. Summarize the results of the side-by-side bar graph. c. Describe how the graph would look if there was not an association between smoking and unhealthy behaviors among Mexican -American male adolescents. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. ^
3) A regression line for predicting Internet usage (%) for 39 countries is y = -3.61 + 1.55x, where x is the per capita GDP, in thousands of dollars, and y is Internet usage. Interpret the residual for one of the 39 countries with per capita GDP of $15,000 and actual Internet use of 20 percent. A) The actual Internet usage for this country is 3.25% lower than expected from the regression equation. B) The actual Internet usage for this country is 3.6% higher than expected from the regression equation. C) The actual Internet usage for this country is 3.25% higher than expected from the regression equation. D) The actual Internet usage for this country is 0.36% higher than expected from the regression equation. E) The actual Internet usage for this country is 0.36% lower than expected from the regression equation. Copyright © 2017 Pearson Education, Inc. 1
3)
4) The relationship between the number of games won by a minor league baseball team and the average attendance at their home games is analyzed. A regression to predict the average attendance from the number of games won has an r = 0.73. Interpret this statistic. A) Positive, fairly strong linear relationship. 53.29% of the variation in average attendance is explained by the number of games won. B) Negative, fairly strong linear relationship. 53.29% of the variation in average attendance is explained by the number of games won. C) Positive, weak linear relationship. 7.29% of the variation in average attendance is explained by the number of games won. D) Positive, fairly strong linear relationship. 73% of the variation in average attendance is explained by the number of games won. E) No association
4)
5) A teacher recorded for each of five students their score on a quiz (x) and their score on a project (y). A scatter plot of the points and the corresponding regression line are shown below.
5)
21
y
Score on Project
18 15 12 9 6 3 2 4 6 8 10 12 14 16 18 20 Score on Quiz
x
Is the point (18, 19) an outlier on x? Is it an outlier on y? Is it a regression outlier? A) outlier on x, outlier on y, regression outlier B) outlier on x, not an outlier on y, not a regression outlier C) outlier on x, outlier on y, not a regression outlier D) not an outlier on x, not an outlier on y, not a regression outlier ^
6) A regression line for predicting the selling prices of homes in Chicago is y = 168 + 102x, where x is the square footage of the house. Interpret the residual for a house with 1800 square feet that recently sold for $200,000. A) The house sold for $16,400 less than was to be expected from the regression equation. B) The house sold for $16,064 more than was to be expected from the regression equation. C) The house sold for $16,232 more than was to be expected from the regression equation. D) The house sold for $16,400 more than was to be expected from the regression equation. E) The house sold for $16,232 less than was to be expected from the regression equation.
Copyright © 2017 Pearson Education, Inc. 2
6)
7) Almost all of the acidity of soda pop comes from the phosphoric acid which is added to give them a sharper flavor. Is there an association between the pH of the soda and the amount of phosphoric acid (in grams)? The correlation between pH and phosphoric acid is -0.991. Describe the association. A) Strong linear association in a positive direction B) Very strong linear association in a negative direction C) Weak linear association in a negative direction D) Weak linear association in a positive direction E) No evidence of association
7)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 8) A large manufacturer hires many handicapped workers and keeps track of both their type of handicap and their level of performance. a. b.
8)
Identify the two variables. Identify the response variable and the explanatory variable.
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 9) A researcher recorded for each of a number of recent high school and college graduates the number of years of education and their annual salary when they started their first job. The scatter plot is shown below together with the regression equation. The point (14, 4) was excluded when obtaining the regression equation.
9)
Salary (Thousands of Dollars)
y 48
32
16
4 8 12 16 Years of Education
20
x
Is the point (14, 4) an outlier on x? Is it an outlier on y? Is it a regression outlier? A) outlier on x, outlier on y, regression outlier B) not an outlier on x, outlier on y, not a regression outlier C) not an outlier on x, not an outlier on y, regression outlier D) not an outlier on x, outlier on y, regression outlier 10) For 14 baseball teams, the correlation with number of wins in the regular season is 0.51 for shutouts, 0.61 for hits made, -0.70 for runs allowed and -0.56 for homeruns allowed. Which variable has the weakest linear association with number of wins? A) homeruns allowed B) shutouts C) hits made D) runs allowed
Copyright © 2017 Pearson Education, Inc. 3
10)
11) A researcher records for each participant in a clinical trial the amount of alcohol they consume per month and their cholesterol level. Indicate whether each variable is categorical or quantitative. A) alcohol: quantitative; cholesterol level: quantitative B) alcohol: categorical; cholesterol level: categorical C) alcohol: quantitative; cholesterol level: categorical D) alcohol: categorical; cholesterol level: quantitative
11)
12) In order for a data point to be considered influential, which of the following must hold?
12)
I) the point has a large residual for the regression line fitted including that point II) its x value must be relatively low or high compared to the rest of the data III) the point has a large residual for the regression line fitted without using that data point A) II only B) I and II C) III only D) II and III
E) I only
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 13) The fact that the direction of an association between two variables can change after we include a third variable and analyze the data at separate levels of that variable is known as ____________________. 14) When two explanatory variables are both associated with a response variable but are also associated with each other, there is said to be ____________________.
13)
14)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Answer true or false. 15) If the absolute value of the correlation is approximately one, then the points lie close to a line that slopes upward or downward. A) False B) True
15)
16) The value of the correlation is always between 0 and 1. A) True B) False
16)
17) The advantage of a side-by-side bar graph is that it allows for easy comparison of the explanatory variable groups with respect to values on the response variable. A) False B) True
17)
Select the most appropriate answer. 18) The ____________________ is the outcome variable on which comparisons are made. A) predictor variable B) Both B and C C) explanatory variable D) lurking variable E) response variable
Copyright © 2017 Pearson Education, Inc. 4
18)
19) If a positive association exists between two quantitative variables, A) the movement of x does not affect the movement of y. B) none of these. C) y tends to decrease as x increases. D) y tends to decrease as x decreases. E) y tends to increase as x decreases. Determine which plot shows the strongest linear correlation. 20) A)
19)
20) B)
y
y
x
x
C)
D) y
y
x
x
Copyright © 2017 Pearson Education, Inc. 5
Determine the type of association apparent in the following scatterplot. 21)
21)
A) Linear association, moderately strong association B) Linear association C) Little or no association D) Moderately strong association, negative association E) Negative association, linear association SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Construct a scatterplot for the data. 22) Below are the gold medal performances in the men's high jump at an athletic competition held in one city from 2006 to 2012.
22)
Year High Jump (in.) 2006 85.3 2007 85.8 2008 88.3 2009 87.8 2010 88.5 2011 92.5 2012 92.3
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Fill in the missing information. _ _ x s x y sy r 23) ?
?
18
^
y = a + bx
23)
^
4 -0.5 y = 30 - 4x
A) x = 48; sx = -18.00 B) x = 3; sx = 1.00 C) x = 12; sx = 1.00 D) x = 3; sx = 0.50 E) x = 12; sx = 2.00
Copyright © 2017 Pearson Education, Inc. 6
The following scatterplot shows the percentage of the vote a presidential candidate received in an election according to the voter's income level based on an exit poll of voters. The income levels 1-8 correspond to the following income classes: 1=Under $15,000; 2=$15-30,000; 3=$30-50,000; 4=$50-75,000; 5=$75-100,000; 6=$100-150,000; 7=$150-200,000; 8=$200,000 or more.
24) Which of the following describes the linear association between income level and percentage of the vote won by the candidate? A) strong, negative B) weak, negative C) no linear association D) strong, positive E) weak, positive SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Solve the problem. 25) Use a scatterplot to display the data below. All measurements are in milligrams per cigarette. Brand Tar Nicotine A 16 1.2 B 13 1.1 C 16 1.2 D 18 1.4 E 6 0.6
Copyright © 2017 Pearson Education, Inc. 7
25)
24)
Answer Key Testname: CHAPTER 3 FORM B TEST
1) For low income: 25% high blood pressure for workaholics versus 23.8% high blood pressure for non-workaholics For middle income: 20.9% high blood pressure for workaholics versus 20% high blood pressure for non-workaholics For high income: 16.3% high blood pressure for workaholics versus 15% high blood pressure for non-workaholics All income groups combined: 20% high blood pressure for workaholics.versus 20.4% for non-workaholics Within each income group, the percentage with high blood pressure is higher for workaholics. However, when income groups are combined the percentage with high blood pressure is higher for non-workaholics. Answers may vary. Possible explanation: When the income groups are combined, workaholism becomes confounded with income level. Many of the workaholics are in the high-income group and high incomes bring greater access to other options which tend to lower blood pressure, such as health care, a healthy diet, gym membership, retreats, massage, etc. The large number of high-income people among the workaholics reduces the percentage with high blood pressure in the workaholic group. 2) a.
b. The graph clearly shows that the proportion of Mexican-American male adolescents that likes risky things is higher for those Mexican-American male adolescents who are smokers. The proportion of Mexican-American male adolescent smokers that likes risky things is almost two and a half times as high as the proportion of Mexican-American male adolescent nonsmokers that likes risky things; c. If there was not an association, the two bars in the ʺlikes risky thingsʺ group would be the same height and the two bars in the ʺdoesnʹt like risky thingsʺ group would be the same height. 3) D 4) A Copyright © 2017 Pearson Education, Inc. 8
Answer Key Testname: CHAPTER 3 FORM B TEST
5) C 6) C 7) B 8) a. The two variables are type of handicap and level of performance; b. response variable = level of performance, explanatory variable = type of handicap 9) D 10) B 11) A 12) D 13) Simpson's paradox 14) confounding 15) B 16) B 17) B 18) E 19) D 20) A 21) D 22)
2006 2007 2008 2009 2010 2011 2012 Year 23) D 24) A
Copyright © 2017 Pearson Education, Inc. 9
Answer Key Testname: CHAPTER 3 FORM B TEST
25)
Copyright © 2017 Pearson Education, Inc. 10
CHAPTER 4 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) Explain whether an experiment or an observational study would be more appropriate to investigate whether meditation is effective in lowering blood pressure. A) Experiment; in an observational study it would be too difficult to monitor participants' behavior. B) Experiment; it would be easier to control for the effects of lurking variables such as diet and exercise. C) Experiment; it would be unethical to deprive people of a potentially beneficial treatment over a long period of time. D) Experiment; an observational study would take too long. E) Observational study; it would be easier to control for the effects of lurking variables such as diet and exercise. 2) A computer network manager wants to test the reliability of some new and expensive fiber-optic Ethernet cables that the computer department just received. The computer department received 4 boxes containing 30 cables each. The manager does not have the time to test every cable in each box. The manager will choose one box at random and test 6 cables chosen randomly within that box. What is the population of interest? A) The one box that was chosen at random from the 4 boxes B) 120 cables C) The 6 cables chosen randomly for testing D) 60 cables E) The 4 boxes
1)
2)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Identify the flaw(s) in the experiment or study described. 3) At one hospital, 674 women were diagnosed with breast cancer. Five years later, 88% of the Caucasian women and 83% of the African American women were still alive. A researcher concludes that being Caucasian causes women with breast cancer to have an increased chance of surviving five years. Why is this conclusion not justified?
3)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 4) Of the 5000 outpatients released from a local hospital in the past year, one hundred were contacted and asked their opinion on the care they received. Select the first five patients who belong to the simple random sample. 16348
76938
90169
51392
55887
71015
09209
79157
A) 163, 487, 693, 169, 513 B) 1, 6, 3, 4, 8 C) 16, 34, 69, 38, 13 D) 163, 169, 15, 92, 97 E) 1634, 3890, 1695, 1392, 1509
Copyright © 2017 Pearson Education, Inc. 1
4)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 5) What is the difference between a retrospective study and a prospective study?
5)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Identify the specified elements of the experiment. 6) In a clinical trial, 780 participants suffering from high blood pressure were randomly assigned to one of three groups. Over a one-month period, the first group received a low dosage of an experimental drug, the second group received a high dosage of the drug, and the third group received a placebo. The diastolic blood pressure of each participant was measured at the beginning and at the end of the period and the change in blood pressure was recorded. Identify the explanatory variable. A) The experimental drug B) The one-month period of the experiment C) High blood pressure, low blood pressure D) Diastolic blood pressure at the start, diastolic blood pressure at the end E) Placebo, low drug dosage, high drug dosage Identify the bias. 7) A magazine publisher mails a survey to every subscriber asking about the quality of its subscription service. From which of the following is this study most likely to suffer? A) Nonresponse bias B) Both undercoverage and sampling bias C) Sampling bias D) Undercoverage E) Both nonresponse bias and undercoverage Provide an appropriate response. 8) Explain whether an experiment or an observational study would be more appropriate to investigate whether acupuncture can help people to heal from sports-related injuries. A) Experiment; it would be unethical to subject people to a potentially harmful treatment over a long period of time. B) Experiment; it would be easier to control for the effects of lurking variables such as exercise. C) Experiment; an observational study would take too long. D) Observational study; an experiment would take too long. E) Observational study; it would be easier to control for the effects of lurking variables such as exercise. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Identify the flaw(s) in the experiment or study described. 9) A pharmaceutical company has developed a medication that they believe will help to reduce the pain of arthritis. They would like to test the medication at two different dosage levels. They design an experiment as follows to test the medication. They will obtain a group of volunteers who suffer from arthritis. A doctor from the pharmaceutical company will evaluate each patient's condition at the start of the experiment. Volunteers will be randomly assigned to one of three groups. Each day for the duration of the experiment, patients in group 1 will receive a low dose of the medication, patients in group 2 will receive a higher dose of the medication, and patients in group 3 will receive a placebo. After a suitable amount of time (two months, for example), the same doctor will evaluate each patient's progress. Based on the amount of inflammation and the patient's report on the amount of pain, the doctor will give each patient a numerical score to represent their improvement. The amount of improvement for the three groups would then be compared. They will also ensure that the technicians administering the supplements are unaware of which patients receive a low dose, a high dose, or a placebo. Identify the flaw(s) in this experiment.
Copyright © 2017 Pearson Education, Inc. 2
9)
6)
7)
8)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 10) An educational researcher used school records to determine that, in one school district, 84% of children living in two-parent homes graduated high school while 75% of children living in single-parent homes graduated high school. What is the explanatory variable? A) Whether the child lived with one or two parents and whether or not they graduated high school B) Graduation rate C) Whether the child lived with two parents or with a single parent D) Whether the child graduated from high school Answer the question. 11) A clothing manufacturer would like to determine whether celebrity endorsements influence a customer's decision to buy its products. In order to estimate this potential influence, the manufacturer premieres its latest series of commercials privately to two separate groups of potential buyers. One group will see a series of commercials without celebrities, and one group will view a series of commercials with celebrities. The potential buyers are further divided by gender.
10)
11)
Who is in the control group? A) The men and women who see the commercials with celebrities. B) The 100 potential buyers. C) The women who see the commercials without celebrities. D) The men and women who see the commercials without celebrities. E) The men who see the commercials without celebrities. Select the most appropriate answer. 12) Experiments where two observations (one for each treatment) are recorded for each subject are called A) Matched Pair Designs B) Randomized Block Designs C) Cluster Designs D) Completely Randomized Designs E) Stratified Designs List all possible samples from the specified population. 13) Given a group of students: Allen (A), Brenda (B), Chad (C), Dorothy (D), and Eric (E), list all of the possible samples (without replacement) of size four that can be obtained from the group. A) ABCD, ABCE, ACDE, ADEB, BCDE, BCEA, BDEA, CABD, CEDB, DACE B) ABCD, ABCE, ABDE C) ABCD, ABCE, ACDE, ADEB D) ABCD, ABCE, ACDE, ADEB, BCDE E) ABCD
Copyright © 2017 Pearson Education, Inc. 3
12)
13)
Select the most appropriate answer. 14) The most common type of convenience sample is A) a systematic sample. B) a cluster sample. C) a simple random sample. D) a volunteer sample. E) a stratified sample.
14)
Identify the type of observational study. 15) A statistical analyst obtains data concerning ankle injuries by examining hospital records from the past 3 years. A) Retrospective B) Case-control C) Prospective D) Cross-sectional E) Census Provide an appropriate response. 16) A magazine publisher mails a survey to every subscriber asking about the quality of its subscription service. Describe the population of interest. A) all American adults who read the magazine B) the respondents to the survey C) all American adults D) all subscribers to the magazine Answer true or false. 17) Nonresponse bias occurs when the subject gives an incorrect response. A) True B) False
Identify which type of sampling is used. 19) A lobbyist for a major airspace firm assigns a number to each legislator and then uses a computer to randomly generate ten numbers. The lobbyist contacts the legislators corresponding to these numbers. What sampling technique was used? A) Simple random sample B) Stratified random sample C) Convenience sample D) Matched pair sample E) Cluster random sample SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
Copyright © 2017 Pearson Education, Inc. 4
16)
17)
18) Sampling bias occurs when some sampled subjects cannot be reached or refuse to participate or fail to answer some questions. A) False B) True
Fill in the blank. 20) Bias that results from the sampling method, such as nonrandom sampling or undercoverage, is called ____________________.
15)
20)
18)
19)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Identify the study as observational or experimental. 21) To determine whether children born with fetal alcohol syndrome have a less than average IQ, 200 children between the ages of 8 and 10 who were born with the syndrome were given an IQ test and their scores were compared to the average IQ of children aged 8 to 10 who were born healthy. A) Experimental B) Observational Select the most appropriate answer. 22) A method of sampling which divides the population into separate groups and then selects a simple random sample from each group is called ____________________. A) simple random sampling B) cluster random sampling C) stratified random sampling D) systematic random sampling E) convenience sampling Identify which type of sampling is used. 23) The student dean of a university uses a computer to randomly select 500 student identification numbers then interviews the students corresponding to those identification numbers. A) Matched pair sample B) Stratified random sample C) Cluster random sample D) Simple random sample E) Convenience sample
21)
22)
23)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 24) When the difference between the results for the two treatments in an experimental study is so large that it would be rare to see such a difference by ordinary random variation among the experimental units, the results are said to be ____________________.
24)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 25) Explain whether an experiment or an observational study would be more appropriate to investigate whether women are more likely than men to suffer from anxiety. A) Observational study; it is unethical to subject study participants to anxiety over a long period of time. B) Experiment; an observational study would take too long. C) Experiment; it would be easier to control for lurking variables such as number of hours worked per week. D) Observational study; it would be easier to control for lurking variables such as number of hours worked per week. E) Observational study; gender cannot be assigned as a treatment.
Copyright © 2017 Pearson Education, Inc. 5
25)
Answer Key Testname: CHAPTER 4 FORM A TEST
1) B 2) B 3) Since there is no random assignment, there is no way to know that being Caucasian causes women with breast cancer to have an increased chance of surviving five years; there may have been lurking variables. For example, Caucasian women may be more affluent and able to afford better health care or better nutrition. They may have a less stressful lifestyle, or maybe they don't have to work as much. They may live in more affluent areas where there is less pollution and less noise. 4) E 5) A retrospective study looks into the past, while a prospective study follows its subjects into the future. 6) E 7) A 8) B 9) The doctor evaluating the patients' progress is still aware of which treatment patients received. The doctor may be biased and influenced by his or her knowledge of whether the patient received the medication or a placebo. This is especially important because the doctor works for the pharmaceutical company and has a vested interest in finding that the medication is effective. 10) C 11) D 12) A 13) D 14) D 15) A 16) D 17) B 18) A 19) A 20) sampling bias 21) B 22) C 23) D 24) statistically significant 25) E
Copyright © 2017 Pearson Education, Inc. 6
CHAPTER 4 FORM B TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) Explain whether an experiment or an observational study would be more appropriate to investigate whether acupuncture can help people to heal from sports-related injuries. A) Observational study; an experiment would take too long. B) Experiment; it would be unethical to subject people to a potentially harmful treatment over a long period of time. C) Observational study; it would be easier to control for the effects of lurking variables such as exercise. D) Experiment; it would be easier to control for the effects of lurking variables such as exercise. E) Experiment; an observational study would take too long. 2) Of the 5000 outpatients released from a local hospital in the past year, one hundred were contacted and asked their opinion on the care they received. Select the first five patients who belong to the simple random sample. 16348
76938
90169
51392
55887
71015
09209
1)
2)
79157
A) 163, 487, 693, 169, 513 B) 163, 169, 15, 92, 97 C) 1634, 3890, 1695, 1392, 1509 D) 16, 34, 69, 38, 13 E) 1, 6, 3, 4, 8 3) A magazine publisher mails a survey to every subscriber asking about the quality of its subscription service. Describe the population of interest. A) all American adults who read the magazine B) all subscribers to the magazine C) all American adults D) the respondents to the survey
3)
4) Explain whether an experiment or an observational study would be more appropriate to investigate whether women are more likely than men to suffer from anxiety. A) Experiment; it would be easier to control for lurking variables such as number of hours worked per week. B) Observational study; it would be easier to control for lurking variables such as number of hours worked per week. C) Observational study; gender cannot be assigned as a treatment. D) Experiment; an observational study would take too long. E) Observational study; it is unethical to subject study participants to anxiety over a long period of time.
4)
5) An educational researcher used school records to determine that, in one school district, 84% of children living in two-parent homes graduated high school while 75% of children living in single-parent homes graduated high school. What is the explanatory variable? A) Whether the child graduated from high school B) Whether the child lived with one or two parents and whether or not they graduated high school C) Whether the child lived with two parents or with a single parent D) Graduation rate
5)
Copyright © 2017 Pearson Education, Inc. 1
6) A computer network manager wants to test the reliability of some new and expensive fiber-optic Ethernet cables that the computer department just received. The computer department received 4 boxes containing 30 cables each. The manager does not have the time to test every cable in each box. The manager will choose one box at random and test 6 cables chosen randomly within that box. What is the population of interest? A) 60 cables B) The 4 boxes C) The one box that was chosen at random from the 4 boxes D) 120 cables E) The 6 cables chosen randomly for testing
6)
7) Explain whether an experiment or an observational study would be more appropriate to investigate whether meditation is effective in lowering blood pressure. A) Experiment; in an observational study it would be too difficult to monitor participants' behavior. B) Experiment; it would be easier to control for the effects of lurking variables such as diet and exercise. C) Observational study; it would be easier to control for the effects of lurking variables such as diet and exercise. D) Experiment; it would be unethical to deprive people of a potentially beneficial treatment over a long period of time. E) Experiment; an observational study would take too long.
7)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 8) What is the difference between a retrospective study and a prospective study?
8)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Answer true or false. 9) Sampling bias occurs when some sampled subjects cannot be reached or refuse to participate or fail to answer some questions. A) False B) True 10) Nonresponse bias occurs when the subject gives an incorrect response. A) False B) True Select the most appropriate answer. 11) The most common type of convenience sample is A) a volunteer sample. B) a stratified sample. C) a simple random sample. D) a systematic sample. E) a cluster sample.
9)
10)
11)
12) Experiments where two observations (one for each treatment) are recorded for each subject are called A) Completely Randomized Designs B) Randomized Block Designs C) Matched Pair Designs D) Cluster Designs E) Stratified Designs
Copyright © 2017 Pearson Education, Inc. 2
12)
13) A method of sampling which divides the population into separate groups and then selects a simple random sample from each group is called ____________________. A) convenience sampling B) cluster random sampling C) stratified random sampling D) systematic random sampling E) simple random sampling
13)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 14) When the difference between the results for the two treatments in an experimental study is so large that it would be rare to see such a difference by ordinary random variation among the experimental units, the results are said to be ____________________. 15) Bias that results from the sampling method, such as nonrandom sampling or undercoverage, is called ____________________.
14)
15)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Identify the bias. 16) A magazine publisher mails a survey to every subscriber asking about the quality of its subscription service. From which of the following is this study most likely to suffer? A) Both nonresponse bias and undercoverage B) Sampling bias C) Nonresponse bias D) Undercoverage E) Both undercoverage and sampling bias Identify which type of sampling is used. 17) The student dean of a university uses a computer to randomly select 500 student identification numbers then interviews the students corresponding to those identification numbers. A) Cluster random sample B) Convenience sample C) Simple random sample D) Stratified random sample E) Matched pair sample 18) A lobbyist for a major airspace firm assigns a number to each legislator and then uses a computer to randomly generate ten numbers. The lobbyist contacts the legislators corresponding to these numbers. What sampling technique was used? A) Cluster random sample B) Matched pair sample C) Convenience sample D) Simple random sample E) Stratified random sample
Copyright © 2017 Pearson Education, Inc. 3
16)
17)
18)
Answer the question. 19) A clothing manufacturer would like to determine whether celebrity endorsements influence a customer's decision to buy its products. In order to estimate this potential influence, the manufacturer premieres its latest series of commercials privately to two separate groups of potential buyers. One group will see a series of commercials without celebrities, and one group will view a series of commercials with celebrities. The potential buyers are further divided by gender.
19)
Who is in the control group? A) The men and women who see the commercials with celebrities. B) The men and women who see the commercials without celebrities. C) The men who see the commercials without celebrities. D) The 100 potential buyers. E) The women who see the commercials without celebrities. List all possible samples from the specified population. 20) Given a group of students: Allen (A), Brenda (B), Chad (C), Dorothy (D), and Eric (E), list all of the possible samples (without replacement) of size four that can be obtained from the group. A) ABCD, ABCE, ABDE B) ABCD, ABCE, ACDE, ADEB C) ABCD, ABCE, ACDE, ADEB, BCDE, BCEA, BDEA, CABD, CEDB, DACE D) ABCD E) ABCD, ABCE, ACDE, ADEB, BCDE
20)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Identify the flaw(s) in the experiment or study described. 21) At one hospital, 674 women were diagnosed with breast cancer. Five years later, 88% of the Caucasian women and 83% of the African American women were still alive. A researcher concludes that being Caucasian causes women with breast cancer to have an increased chance of surviving five years. Why is this conclusion not justified?
Copyright © 2017 Pearson Education, Inc. 4
21)
22) A pharmaceutical company has developed a medication that they believe will help to reduce the pain of arthritis. They would like to test the medication at two different dosage levels. They design an experiment as follows to test the medication. They will obtain a group of volunteers who suffer from arthritis. A doctor from the pharmaceutical company will evaluate each patient's condition at the start of the experiment. Volunteers will be randomly assigned to one of three groups. Each day for the duration of the experiment, patients in group 1 will receive a low dose of the medication, patients in group 2 will receive a higher dose of the medication, and patients in group 3 will receive a placebo. After a suitable amount of time (two months, for example), the same doctor will evaluate each patient's progress. Based on the amount of inflammation and the patient's report on the amount of pain, the doctor will give each patient a numerical score to represent their improvement. The amount of improvement for the three groups would then be compared. They will also ensure that the technicians administering the supplements are unaware of which patients receive a low dose, a high dose, or a placebo. Identify the flaw(s) in this experiment.
22)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Identify the specified elements of the experiment. 23) In a clinical trial, 780 participants suffering from high blood pressure were randomly assigned to one of three groups. Over a one-month period, the first group received a low dosage of an experimental drug, the second group received a high dosage of the drug, and the third group received a placebo. The diastolic blood pressure of each participant was measured at the beginning and at the end of the period and the change in blood pressure was recorded. Identify the explanatory variable. A) The one-month period of the experiment B) Placebo, low drug dosage, high drug dosage C) High blood pressure, low blood pressure D) The experimental drug E) Diastolic blood pressure at the start, diastolic blood pressure at the end Identify the study as observational or experimental. 24) To determine whether children born with fetal alcohol syndrome have a less than average IQ, 200 children between the ages of 8 and 10 who were born with the syndrome were given an IQ test and their scores were compared to the average IQ of children aged 8 to 10 who were born healthy. A) Observational B) Experimental Identify the type of observational study. 25) A statistical analyst obtains data concerning ankle injuries by examining hospital records from the past 3 years. A) Prospective B) Census C) Cross-sectional D) Case-control E) Retrospective
Copyright © 2017 Pearson Education, Inc. 5
23)
24)
25)
Answer Key Testname: CHAPTER 4 FORM B TEST
1) D 2) C 3) B 4) C 5) C 6) D 7) B 8) A retrospective study looks into the past, while a prospective study follows its subjects into the future. 9) A 10) A 11) A 12) C 13) C 14) statistically significant 15) sampling bias 16) C 17) C 18) D 19) B 20) E 21) Since there is no random assignment, there is no way to know that being Caucasian causes women with breast cancer to have an increased chance of surviving five years; there may have been lurking variables. For example, Caucasian women may be more affluent and able to afford better health care or better nutrition. They may have a less stressful lifestyle, or maybe they don't have to work as much. They may live in more affluent areas where there is less pollution and less noise. 22) The doctor evaluating the patients' progress is still aware of which treatment patients received. The doctor may be biased and influenced by his or her knowledge of whether the patient received the medication or a placebo. This is especially important because the doctor works for the pharmaceutical company and has a vested interest in finding that the medication is effective. 23) B 24) A 25) E
Copyright © 2017 Pearson Education, Inc. 6
CHAPTER 5 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. List the outcomes comprising the specified event. 1) When a quarter is tossed four times, 16 outcomes are possible. HHHH HTHH THHH TTHH
HHHT HTHT THHT TTHT
HHTH HTTH THTH TTTH
1)
HHTT HTTT THTT TTTT
Here, for example, HTTH represents the outcome that the first toss is heads, the next two tosses are tails, and the fourth toss is heads. List the outcomes that comprise the event of obtaining the same face on the first three tosses. A) HHHH, TTTT B) HHH, TTT C) HHHH, HHHT, TTTH, TTTT D) HHHT, TTTH E) HHHT, TTTH, HTTT, THHH SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 2) The census reported the frequencies in the following table for occupations of Americans 16 years and older.
2)
a. Find the probability that a randomly selected worker is in an occupation other than farming, fishing, or forestry. b. Given that a randomly selected worker is in service or in sales and office, find the probability they are in sales and office. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 3) For two events A and B, P(A) = 0.4, P(B) = 0.3, and P(A and B) = 0. It follows that A and B are A) disjoint but not independent. B) complementary. C) independent but not disjoint. D) neither disjoint nor independent. E) both disjoint and independent. Copyright © 2017 Pearson Education, Inc. 1
3)
Provide an appropriate response. 4) Mr. Smithʹs gardener is not dependable; the probability that he will forget to water the rosebush during Smithʹs absence is 2/3. The rosebush is in questionable condition anyhow; if watered, the probability of its withering is 1/2, but if it is not watered, the probability of its withering is 3/4. Upon returning, Smith finds that the rosebush has withered. What is the probability that the gardener did not water the rosebush? A) None of these. 2 B) 3 C)
1 2
D)
3 4
E)
1 3
5) You flip a coin x number of times and calculate the probability of heads as (number of heads)/x. What happens to this probability as x gets larger? B) It stays the same, 0.50 A) It gets larger C) It gets closer and closer to 0.50 D) It gets smaller Find the indicated probability. 6) If two balanced die are rolled, the possible outcomes can be represented as follows. (1, 1) (1, 2) (1, 3) (1, 4) (1, 5) (1, 6)
(2, 1) (2, 2) (2, 3) (2, 4) (2, 5) (2, 6)
(3, 1) (3, 2) (3, 3) (3, 4) (3, 5) (3, 6)
(4, 1) (4, 2) (4, 3) (4, 4) (4, 5) (4, 6)
(5, 1) (5, 2) (5, 3) (5, 4) (5, 5) (5, 6)
6)
D)
1 6
E)
7 36
List the outcomes comprising the specified event. 7) Three board members for a nonprofit organization will be selected from a group of five people. The board members will be selected by drawing names from a hat. The names of the five possible board members are Allison, Betty, Charlie, Dave, and Emily. The possible outcomes can be represented as follows. ABD BCD
ABE BCE
ACD BDE
5)
(6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6)
Determine the probability that the sum of the dice is 7. 5 2 7 B) C) A) 36 9 18
ABC ADE
4)
ACE CDE
Here, for example, ABC represents the outcome that Allison, Betty, and Charlie are selected to be on the board. List the outcomes that comprise the event that Betty and Emily are selected. A) ABE, BCE, BDE B) ABC, ABD, ABE, ACE, ADE, BCD, BCE, BDE, CDE C) ABE, BDE D) BE E) ABE, BCE
Copyright © 2017 Pearson Education, Inc. 2
7)
Draw a Venn diagram and shade the described events. 8) From a finite sample, events A, B, and C are disjoint. Shade the collection A or B or C. B) A)
C)
8)
D)
Find the probability of the given event. 9) A random spinner has equal-sized regions numbered 1 through 18. The spinner stops on an even number or a multiple of 3. 2 1 1 B) C) 15 D) 1 E) A) 3 3 2 Determine whether the events are disjoint. 10) A card is selected randomly from a deck of 52. The events A, B, and C are defined as follows.
10)
A = event the card selected is a heart B = event the card selected is a club C = event the card selected is an ace Are the events A, B, and C disjoint? A) No
B) Yes
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 11) A production process of computer parts uses two machines, one old machine and one new machine. If the old machine is used, the probability that a defective part is produced is 0.13. If the new machine is used, the probability that a defective part is produced is 0.04. Moreover, the new machine produces parts 4 times as fast as the old machine does. What is the probability that a randomly selected part produced by this process is defective?
Copyright © 2017 Pearson Education, Inc. 3
9)
11)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 12) If A and B are independent events, with P(A) =
2 1 , P(B) = , then P(A or B) 5 5
12)
A) cannot be determined from the given information. 3 B) equals . 5 C) equals
13 . 25
D) equals
3 . 25
E) equals
3 . 10
Draw a Venn diagram and shade the described events. 13) From a finite sample, events A, B, and C are not disjoint. Shade the collection A and B and C. B) A)
C)
13)
D)
Answer true or false. 14) Two trials are independent if they have no outcomes in common. A) False B) True
Copyright © 2017 Pearson Education, Inc. 4
14)
List the outcomes comprising the specified event. 15) When a coin is tossed two times, the possible outcomes are HH HT TH TT where ʺHʺ represents a head and ʺTʺ represents a tail. List the outcomes that comprise the complement of the event A = at least one head appears. A) TT B) HT, TH C) TT, HH D) TT, HH, HT, TH E) HH, HT, TH
15)
Suppose P(C) =0 .048, P(M and C) = 0.044, and P(M or C) = 0.524. Find the indicated probability. 16) P(M) A) 0.472 B) 0.480 C) 0.524 D) 0.528 E) 0.520
16)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 17) When entering data into a computer, the probability that a student will make at most three mistakes per 1,000 keystrokes is 0.71, and the probability of making anywhere from 4 to 6 mistakes per 1,000 keystrokes is 0.22. Find the probabilities that in 1,000 keystrokes the student will make a. b.
17)
at most 6 mistakes more than 3 mistakes
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 18) According to a survey, 41.5% of babies born in the U.S. were still being breastfed at 6 months of age. If 4 children who are born in the U.S. are randomly selected, what is the probability that none of them are breastfed for at least 6 months? A) none of these B) 0.12 C) 0.585 D) 1 E) 0.03 Solve the problem. 19) Find the probability that in a class of 26 students, at least two have the same birthday. Express the probability as a decimal rounded to four decimal places. A) 0.5982 B) 0.5687 C) 0.4018 D) 0.6269 20) A patient is told that a test for a certain disease is 92% accurate. The way this is worded, this could mean that (a) 92% of those with the disease test positive, (b) 92% of those without the disease test negative, (c) 92% of those who test positive have the disease, or (d) 92% of those who test negative do not have the disease. Let D denote {person has the disease}, and let P denote {person tests positive}. Using these events and their complements, express as a conditional probability the event that a person who tests positive does not have the disease. B) P(Dc Pc) C) P(P Dc) D) P(Pc D) E) P(Dc P) A) P(D Pc)
Copyright © 2017 Pearson Education, Inc. 5
18)
19)
20)
Provide an appropriate response. 21) A prix fixed menu offers a choice of 2 appetizers, 4 main courses and 3 desserts. If a tree diagram is used to list all possible meal combinations from the prix fixed menu, how many branches will there be? A) 8 B) 12 C) 9 D) 24 E) none of these List the outcomes comprising the specified event. 22) When a quarter is tossed four times, 16 outcomes are possible.
21)
22)
HHHH HHHT HHTH HHTT HTHH HTHT HTTH HTTT THHH THHT THTH THTT TTHHTTHT TTTH TTTT Here, for example, HTTH represents the outcome that the first toss is heads, the next two tosses are tails, and the fourth toss is heads. List the outcomes that comprise the event of tossing exactly three tails. A) HTTT, THTT, TTTH B) HTTT, THTT, TTHT, TTTH, TTTT C) HTTT, THTT, TTHT, TTTH D) TTTH E) THTT, TTHT, TTTH Provide an appropriate response. 23) Ten white balls, 20 blue balls and 20 red balls are placed in an urn. If two balls are drawn, with replacement, what is the probability of drawing two white balls? A) 0.04 B) 0.01 C) 0.36 D) 0.037
23)
Draw a tree diagram to represent the problem. At the end of each branch use symbols to represent the event that the branch corresponds to and give the probability of the event. 24) 24) Two cards are selected randomly without replacement from a standard deck of 52 cards. The color of each card (red or black) is recorded. Draw a tree diagram showing the possible outcomes and their probabilities for this problem. A)
Copyright © 2017 Pearson Education, Inc. 6
B)
C)
D) None of these Find the probability using complements. 25) Suppose that on a particular multiple choice question, 96% of the students answered correctly. What is the probability that a randomly selected student answered the question incorrectly? A) 0.96 B) 1.04 C) 24.00 D) 0.04 E) 0.48
Copyright © 2017 Pearson Education, Inc. 7
25)
Answer Key Testname: CHAPTER 5 FORM A TEST
1) C 2) a. 0.9927; b. 0.6423 3) A 4) D 5) C 6) D 7) A 8) D 9) B 10) A 11) P(D) = P(N and D) + P(O and D) = 0.032 + 0.026 = 0.058 12) C 13) B 14) A 15) A 16) E 17) a. 0.93; b. 0.29 18) B 19) A 20) E 21) D 22) C 23) A 24) A 25) D
Copyright © 2017 Pearson Education, Inc. 8
CHAPTER 5 FORM B TESTS Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) The census reported the frequencies in the following table for occupations of Americans 16 years and older.
1)
a. Find the probability that a randomly selected worker is in an occupation other than farming, fishing, or forestry. b. Given that a randomly selected worker is in service or in sales and office, find the probability they are in sales and office. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 2)
2) You flip a coin x number of times and calculate the probability of heads as (number of heads)/x. What happens to this probability as x gets larger? A) It gets closer and closer to 0.50 B) It gets smaller C) It gets larger D) It stays the same, 0.50 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 3) A production process of computer parts uses two machines, one old machine and one new machine. If the old machine is used, the probability that a defective part is produced is 0.13. If the new machine is used, the probability that a defective part is produced is 0.04. Moreover, the new machine produces parts 4 times as fast as the old machine does. What is the probability that a randomly selected part produced by this process is defective?
3)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 4) According to a survey, 41.5% of babies born in the U.S. were still being breastfed at 6 months of age. If 4 children who are born in the U.S. are randomly selected, what is the probability that none of them are breastfed for at least 6 months? A) 0.03 B) none of these C) 0.585 D) 0.12 E) 1
Copyright © 2017 Pearson Education, Inc. 1
4)
Answer true or false. 5) Two trials are independent if they have no outcomes in common. A) False B) True Find the probability of the given event. 6) A single fair die is rolled. The number on the die is a 3 or a 5. 1 1 1 B) C) A) 36 3 2
5)
6) D) 2
1 E) 6
Provide an appropriate response. 7) Mr. Smithʹs gardener is not dependable; the probability that he will forget to water the rosebush during Smithʹs absence is 2/3. The rosebush is in questionable condition anyhow; if watered, the probability of its withering is 1/2, but if it is not watered, the probability of its withering is 3/4. Upon returning, Smith finds that the rosebush has withered. What is the probability that the gardener did not water the rosebush? 1 A) 3
7)
B) None of these. 3 C) 4 D)
1 2
E)
2 3
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 8) Among the 12 applicants for managerial positions at a chain of movie theaters, 8 are college graduates and 4 are high school graduates. If three of the applicants are randomly selected,
8)
a. Identify the possible outcomes in terms of whether a selected applicant is a college graduate (C) or a high school graduate (H). b. Find the probability for each possible outcome and verify that these probabilities follow the two basic rules of the probabilities for the outcomes in a sample space. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 9) A prix fixed menu offers a choice of 2 appetizers, 4 main courses and 3 desserts. If a tree diagram is used to list all possible meal combinations from the prix fixed menu, how many branches will there be? A) 12 B) 9 C) 8 D) 24 E) none of these
Copyright © 2017 Pearson Education, Inc. 2
9)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 10) When entering data into a computer, the probability that a student will make at most three mistakes per 1,000 keystrokes is 0.71, and the probability of making anywhere from 4 to 6 mistakes per 1,000 keystrokes is 0.22. Find the probabilities that in 1,000 keystrokes the student will make a. b.
10)
at most 6 mistakes more than 3 mistakes
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 11) Ten white balls, 20 blue balls and 20 red balls are placed in an urn. If two balls are drawn, with replacement, what is the probability of drawing two white balls? A) 0.037 B) 0.04 C) 0.01 D) 0.36 Solve the problem. 12) Find the probability that in a class of 31 students, at least two have the same birthday. Express the probability as a decimal rounded to four decimal places. A) 0.7533 B) 0.7063 C) 0.7305 D) 0.2695 13) A patient is told that a test for a certain disease is 92% accurate. The way this is worded, this could mean that (a) 92% of those with the disease test positive, (b) 92% of those without the disease test negative, (c) 92% of those who test positive have the disease, or (d) 92% of those who test negative do not have the disease. Let D denote {person has the disease}, and let P denote {person tests positive}. Using these events and their complements, express as a conditional probability the event that a person who tests positive does not have the disease. B) P(Dc Pc) C) P(D Pc) D) P(P Dc) E) P(Dc P) A) P(Pc D) Find the probability of the given event. 14) A random spinner has equal-sized regions numbered 1 through 18. The spinner stops on an even number or a multiple of 3. 2 1 1 B) 1 C) 15 D) E) A) 3 3 2 Find the indicated probability. 15) If two balanced die are rolled, the possible outcomes can be represented as follows. (1, 1) (1, 2) (1, 3) (1, 4) (1, 5) (1, 6)
(2, 1) (2, 2) (2, 3) (2, 4) (2, 5) (2, 6)
(3, 1) (3, 2) (3, 3) (3, 4) (3, 5) (3, 6)
(4, 1) (4, 2) (4, 3) (4, 4) (4, 5) (4, 6)
(5, 1) (5, 2) (5, 3) (5, 4) (5, 5) (5, 6)
11)
12)
13)
14)
15)
(6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6)
Determine the probability that the sum of the dice is 7. 1 7 7 B) C) A) 6 18 36
D)
5 36
E)
2 9
Draw a tree diagram to represent the problem. At the end of each branch use symbols to represent the event that the branch corresponds to and give the probability of the event.
Copyright © 2017 Pearson Education, Inc. 3
16) Two cards are selected randomly without replacement from a standard deck of 52 cards. The color of each card (red or black) is recorded. Draw a tree diagram showing the possible outcomes and their probabilities for this problem. A)
B)
C)
D) None of these
Copyright © 2017 Pearson Education, Inc. 4
16)
Draw a Venn diagram and shade the described events. 17) From a finite sample, events A, B, and C are disjoint. Shade the collection A or B or C. B) A)
C)
17)
D)
18) From a finite sample, events A, B, and C are not disjoint. Shade the collection A and B and C. B) A)
C)
D)
Copyright © 2017 Pearson Education, Inc. 5
18)
List the outcomes comprising the specified event. 19) Three board members for a nonprofit organization will be selected from a group of five people. The board members will be selected by drawing names from a hat. The names of the five possible board members are Allison, Betty, Charlie, Dave, and Emily. The possible outcomes can be represented as follows. ABC ADE
ABD BCD
ABE BCE
ACD BDE
19)
ACE CDE
Here, for example, ABC represents the outcome that Allison, Betty, and Charlie are selected to be on the board. List the outcomes that comprise the event that Betty and Emily are selected. A) BE B) ABE, BDE C) ABE, BCE D) ABE, BCE, BDE E) ABC, ABD, ABE, ACE, ADE, BCD, BCE, BDE, CDE 20) When a quarter is tossed four times, 16 outcomes are possible. HHHH HTHH THHH TTHH
HHHT HTHT THHT TTHT
HHTH HTTH THTH TTTH
20)
HHTT HTTT THTT TTTT
Here, for example, HTTH represents the outcome that the first toss is heads, the next two tosses are tails, and the fourth toss is heads. List the outcomes that comprise the event of obtaining the same face on the first three tosses. A) HHHH, HHHT, TTTH, TTTT B) HHH, TTT C) HHHH, TTTT D) HHHT, TTTH, HTTT, THHH E) HHHT, TTTH 21) When a quarter is tossed four times, 16 outcomes are possible. HHHH HTHH THHH TTHH
HHHT HTHT THHT TTHT
HHTH HTTH THTH TTTH
HHTT HTTT THTT TTTT
Here, for example, HTTH represents the outcome that the first toss is heads, the next two tosses are tails, and the fourth toss is heads. List the outcomes that comprise the event of tossing exactly three tails. A) HTTT, THTT, TTTH B) THTT, TTHT, TTTH C) HTTT, THTT, TTHT, TTTH D) TTTH E) HTTT, THTT, TTHT, TTTH, TTTT
Copyright © 2017 Pearson Education, Inc. 6
21)
Select the most appropriate answer. 22) For two events A and B, P(A) = 0.4, P(B) = 0.3, and P(A and B) = 0. It follows that A and B are A) independent but not disjoint. B) both disjoint and independent. C) complementary. D) disjoint but not independent. E) neither disjoint nor independent. Find the probability using complements. 23) Suppose that on a particular multiple choice question, 96% of the students answered correctly. What is the probability that a randomly selected student answered the question incorrectly? A) 0.04 B) 1.04 C) 0.48 D) 24.00 E) 0.96 Determine whether the events are disjoint. 24) A card is selected randomly from a deck of 52. The events A, B, and C are defined as follows.
22)
23)
24)
A = event the card selected is a heart B = event the card selected is a club C = event the card selected is an ace Are the events A, B, and C disjoint? A) No
B) Yes
Suppose P(C) =0 .048, P(M and C) = 0.044, and P(M or C) = 0.524. Find the indicated probability. 25) P(M) A) 0.480 B) 0.472 C) 0.524 D) 0.520 E) 0.528
Copyright © 2017 Pearson Education, Inc. 7
25)
Answer Key Testname: CHAPTER 5 FORM B TEST
1) a. 0.9927; b. 0.6423 2) A 3) P(D) = P(N and D) + P(O and D) = 0.032 + 0.026 = 0.058 4) D 5) A 6) C 7) C 8) a. {CCC, CCH, CHC, CHH, HCC, HCH, HHC, HHH}; b. P(CCC) = 336/1320, P(CCH) = P(CHC) = P(HCC) = 224/1320, P(CHH) = P(HCH) = P(HHC) = 96/1320, P(HHH) =24/1320; The probability of each individual outcome is between 0 and 1. The total of the probabilities of all of the individual outcomes is 1. 9) D 10) a. 0.93; b. 0.29 11) B 12) C 13) E 14) D 15) B 16) C 17) A 18) C 19) D 20) A 21) C 22) D 23) A 24) A 25) D
Copyright © 2017 Pearson Education, Inc. 8
CHAPTER 6 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Find the indicated probability. 1) Suppose that 11% of people are left handed. If 6 people are selected at random, what is the probability that exactly 2 of them are left handed? A) 0.0076 B) 0.2278 C) 0.1139 D) 0.0566 E) 0.0121 Find the standard deviation of the binomial random variable. 2) The probability of winning a certain lottery is 1/51,949. For people who play 560 times, find the standard deviation for the random variable X, the number of wins. A) 0.0108 B) 0.1038 C) 0.1137 D) 0.1223 E) 2.4569 Find the specified probability distribution of the binomial random variable. 3) In one city, 21% of the population is under 25 years of age. Three people are selected at random from the city. Find the probability distribution of X, the number among the three that are under 25 years of age. x P(X = x) 1 0.21 A) 2 0.0441 3 0.0213 x P(X = x) 0 0.4930 B) 1 0.3932 2 0.0925 3 0.0213 x P(X = x) 1 0.21 C) 2 0.0441 3 0.0093 x P(X = x) 0 0.4930 D) 1 0.3932 2 0.1045 3 0.0093 x P(X = x) 0 0.4930 E) 1 0.1311 2 0.0348 3 0.0093
Copyright © 2017 Pearson Education, Inc. 1
1)
2)
3)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 4) Serum cholesterol is an important risk factor for coronary disease. The level of serum cholesterol is approximately normally distributed with a mean of 219 mg/dL and a standard deviation of 50 mg/dL. If the clinically desirable range for serum cholesterol is < 200 mg/dL, what is the probability that a randomly selected person will have a clinically desirable level of serum cholesterol?
4)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Find the indicated probability for the normally distributed variable. 5) The weekly salaries of elementary school teachers in one state are normally distributed with a mean of $490 and a standard deviation of $45. What is the probability that a randomly selected elementary school teacher earns more than $525 a week? A) 0.2177 B) 0.7823 C) 0.2823 D) 0.1003 E) 0.4354 Find the mean of the binomial random variable. 6) The probability is 0.7 that a person shopping at a certain store will spend less than $20. For random samples of 28 customers, find the mean number of shoppers who spend less than $20. A) 19.6 B) 18.2 C) 8.4 D) 14.0 E) 6.0 Obtain the probability distribution of the random variable. 7) When a coin is tossed four times, sixteen equally likely outcomes are possible as shown below: HHHH HHHT HHTH HHTT HTHH HTHT HTTH HTTT THHH THHT THTH THTT TTHH TTHT TTTH TTTT Let X denote the total number of tails obtained in the four tosses. Find the probability distribution of the random variable X. Leave your probabilities in fraction form. A) x P(X = x) 0 0 1 1/4 2 3/8 3 1/4 4 1/16 B) x P(X = x) 0 1/16 1 1/4 2 3/8 3 1/4 4 1/16 C) x P(X = x) 1 1/4 2 7/16 3 1/4 4 1/16 Copyright © 2017 Pearson Education, Inc. 2
5)
6)
7)
D) x P(X = x) 0 1/16 1 1/8 2 3/8 3 1/8 4 1/16 E) x P(X = x) 0 1/16 1 3/16 2 1/2 3 3/16 4 1/16 Find the mean of the given probability distribution. 8) The random variable X is the number of houses sold by a realtor in a single month at a particular real estate office. Its probability distribution is given in the table below. x P(X = x) 0 0.24 1 0.01 2 0.12 3 0.16 4 0.01 5 0.14 6 0.11 7 0.21 A) 3.40 B) 3.35 C) 3.50 D) 3.60 Use a table of areas to find the specified area under the standard normal curve. 9) The area that lies to the right of -1.82 A) 0.0344 B) 0.4828 C) 0.9656 D) 0.4656
9) E) -0.0344
Find the indicated probability. 10) An archer is able to hit the bull's-eye 55% of the time. If she shoots 8 arrows, what is the probability that she gets exactly 4 bull's-eyes? Assume each shot is independent of the others. A) 0.0915 B) 0.2627 C) 0.5254 D) 0.0038 E) 0.1719 Obtain the probability distribution of the random variable. 11) When two balanced dice are rolled, 36 equally likely outcomes are possible as shown below. (1, 1) (1, 2) (1, 3) (1, 4) (1, 5) (1, 6) (2, 1) (2, 2) (2, 3) (2, 4) (2, 5) (2, 6) (3, 1) (3, 2) (3, 3) (3, 4) (3, 5) (3, 6) (4, 1) (4, 2) (4, 3) (4, 4) (4, 5) (4, 6) (5, 1) (5, 2) (5, 3) (5, 4) (5, 5) (5, 6) (6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6) Let X denote the smaller of the two numbers or the common value if doubles are thrown. Find the probability distribution of X. Leave your probabilities in fraction form.
Copyright © 2017 Pearson Education, Inc. 3
8)
10)
11)
A) x P(X = x) 1 1/6 2 1/6 3 1/6 4 1/6 5 1/6 6 1/6 B) x P(X = x) 1 5/18 2 1/4 3 7/36 4 5/36 5 1/9 6 1/36 C) x P(X = x) 1 11/36 2 1/8 3 7/36 4 5/36 5 1/18 6 1/36 D) x P(X = x) 1 11/36 2 1/4 3 7/36 4 5/36 5 1/12 6 1/36 E) x P(X = x) 1 5/18 2 2/9 3 1/6 4 1/9 5 1/18 6 0 Find the indicated probability. 12) A multiple choice test has 10 questions each of which has 4 possible answers, only one of which is correct. If Judy, who forgot to study for the test, guesses on all questions, what is the probability that she will answer exactly 3 questions correctly? A) 0.2503 B) 0.2816 C) 0.0021 D) 0.5006 E) 0.0156
Copyright © 2017 Pearson Education, Inc. 4
12)
Determine whether the conditions for the normal approximation to the binomial are satisfied. 13) In one town, 2% of adults are HIV positive. A researcher selects a random sample of 400 adults from a population of 23,250. X = number of people in the sample who are HIV positive. In this scenario, the conditions for the binomial distribution are satisfied. Is it reasonable to use the normal approximation to determine an interval within which X will almost certainly fall? Explain why or why not. A) No, np is less than 15. B) No, the trials are not independent as sampling is without replacement. C) Yes, sampling is without replacement but the sample size is less than 10% of population. D) Yes, as n is large. Find the mean of the binomial random variable. 14) According to a survey on drug use, 54.3% of males have never used marijuana. Based on this percentage, what is the expected number of males who have used marijuana for samples of size 100? A) 54.3 B) 24.8 C) 45.7 D) 5 E) 50 Find the mean of the given probability distribution. 15) The percentage of families by size in the U.S. is given in the table below.
13)
14)
15)
(note: the category "7" actually represents 7 or more but "7" will be used for calculation purposed) A) 2.5 B) 0.17 C) 3.1 D) 2 E) 4.5 Find the mean of the binomial random variable. 16) According to a poll taken in February of 2008, 67% of respondents disapproved of the overall job that the President was doing. Based on this poll, for samples of size 200, what is the mean number of adults who disapprove of the overall job that the President is doing? A) 134 B) 100 C) 67 D) 6.65 E) 44.22 17) A die is rolled 10 times and the number of times that two shows on the up face is counted. If this experiment is repeated many times, find the mean for the random variable X, the number of twos thrown out of ten tosses. A) 3.33 B) 2.5 C) 8.33 D) 2.98 E) 1.67
Copyright © 2017 Pearson Education, Inc. 5
16)
17)
Find the mean of the given probability distribution. 18) According to a survey, the percentage of households by number of vehicles is given in the following table:
18)
(note, the category "5" actually represents 5 or more, but we will use "5" for ease of calculation) A) 1.88 B) 2.5 C) 188.2 D) 1.5 E) 2 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 19) Serum cholesterol is an important risk factor for coronary disease. The level of serum cholesterol is approximately normally distributed with a mean of 219 mg/dL and a standard deviation of 50 mg/dL. If the clinically desirable range for serum cholesterol is < 200 mg/dL and serum cholesterol levels of over 250 mg/dL indicate a high-enough risk for heart disease to warrant treatment, what is the probability that a randomly selected person will have a borderline high serum cholesterol level (that is, > 200, but < 250 mg/dL)?
19)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Determine whether the conditions for the binomial distribution are satisfied. If the conditions are not satisfied, state why not. 20) X = number of Democrats in a sample of 500 voters selected randomly from a total of 2560 registered voters 20) in one town. 42% of the 2560 registered voters are Democrats. A) Conditions for binomial distribution are not satisfied. Trials are not independent. Sampling is without replacement and the sample size is less than 20% of the population. B) Conditions for binomial distribution are satisfied. C) Conditions for binomial distribution are not satisfied. np is larger than 15. D) Conditions for binomial distribution are not satisfied. Trials are not independent. Sampling is without replacement and sample size is more than 10% of the population. Find the standard deviation of the binomial random variable. 21) According to a college survey, 22% of all students work full time. Find the standard deviation for the random variable X, the number of students who work full time in samples of size 16. A) 3.52 B) 1.88 C) 1.98 D) 1.66 E) 2.75
21)
Determine whether the conditions for the binomial distribution are satisfied. If the conditions are not satisfied, state why not. 22) A polling agency in one city randomly selects 1000 registered voters from a pool of 385,600 registered 22) voters of whom 17% are African American. X = number of African Americans in the sample. A) Conditions for binomial distribution are not satisfied. There are more than two possible outcomes per trial. B) Conditions for binomial distribution are not satisfied. Trials are not independent. Sampling is without replacement and sample size is less than 10% of population. C) Conditions for binomial distribution are not satisfied. The probability of success is not the same on each trial. D) Conditions for binomial distribution are satisfied.
Copyright © 2017 Pearson Education, Inc. 6
Use a table of areas for the standard normal curve to find the required z-score. 23) Find the z-score for which the area under the standard normal curve to its left is 0.96 A) 1.03 B) -1.38 C) -1.75 D) 1.75
E) 1.82
24) Find the z-score having area 0.86 to its right under the standard normal curve. A) -1.08 B) 0.5557 C) 0.8051 D) 1.08
E) -0.5557
23)
24)
Find the standard deviation of the binomial random variable. 25) A die is rolled 18 times and the number of twos that come up is tallied. If this experiment is repeated many times, find the standard deviation for the random variable X, the number of twos. A) 1.537 B) 2.5 C) 1.622 D) 1.581 E) 1.73
Copyright © 2017 Pearson Education, Inc. 7
25)
Answer Key Testname: CHAPTER 6 FORM A TEST
1) C 2) B 3) D 4) 0.3520 5) A 6) A 7) B 8) D 9) C 10) B 11) D 12) A 13) A 14) C 15) C 16) A 17) E 18) A 19) 0.3804 20) D 21) D 22) D 23) D 24) A 25) D
Copyright © 2017 Pearson Education, Inc. 8
CHAPTER 6 FORM B TEST Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) Serum cholesterol is an important risk factor for coronary disease. The level of serum cholesterol is approximately normally distributed with a mean of 219 mg/dL and a standard deviation of 50 mg/dL. If the clinically desirable range for serum cholesterol is < 200 mg/dL and serum cholesterol levels of over 250 mg/dL indicate a high-enough risk for heart disease to warrant treatment, what is the probability that a randomly selected person will have a borderline high serum cholesterol level (that is, > 200, but < 250 mg/dL)? 2) Serum cholesterol is an important risk factor for coronary disease. The level of serum cholesterol is approximately normally distributed with a mean of 219 mg/dL and a standard deviation of 50 mg/dL. If the clinically desirable range for serum cholesterol is < 200 mg/dL, what is the probability that a randomly selected person will have a clinically desirable level of serum cholesterol?
1)
2)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Use a table of areas to find the specified area under the standard normal curve. 3) The area that lies to the right of -1.82 A) 0.0344 B) -0.0344 C) 0.9656 D) 0.4656
3) E) 0.4828
Find the indicated probability. 4) A multiple choice test has 10 questions each of which has 4 possible answers, only one of which is correct. If Judy, who forgot to study for the test, guesses on all questions, what is the probability that she will answer exactly 3 questions correctly? A) 0.0156 B) 0.0021 C) 0.2503 D) 0.5006 E) 0.2816
4)
5) Suppose that 11% of people are left handed. If 6 people are selected at random, what is the probability that exactly 2 of them are left handed? A) 0.0566 B) 0.0076 C) 0.2278 D) 0.1139 E) 0.0121
5)
6) An archer is able to hit the bull's-eye 55% of the time. If she shoots 8 arrows, what is the probability that she gets exactly 4 bull's-eyes? Assume each shot is independent of the others. A) 0.5254 B) 0.2627 C) 0.0038 D) 0.1719 E) 0.0915
6)
Find the mean of the binomial random variable. 7) According to a poll taken in February of 2008, 67% of respondents disapproved of the overall job that the President was doing. Based on this poll, for samples of size 200, what is the mean number of adults who disapprove of the overall job that the President is doing? A) 67 B) 44.22 C) 100 D) 6.65 E) 134 8) A die is rolled 10 times and the number of times that two shows on the up face is counted. If this experiment is repeated many times, find the mean for the random variable X, the number of twos thrown out of ten tosses. A) 2.98 B) 8.33 C) 2.5 D) 3.33 E) 1.67
Copyright © 2017 Pearson Education, Inc. 1
7)
8)
9) According to a survey on drug use, 54.3% of males have never used marijuana. Based on this percentage, what is the expected number of males who have used marijuana for samples of size 100? A) 24.8 B) 45.7 C) 54.3 D) 5 E) 50
9)
10) The probability is 0.7 that a person shopping at a certain store will spend less than $20. For random samples of 28 customers, find the mean number of shoppers who spend less than $20. A) 14.0 B) 18.2 C) 8.4 D) 19.6 E) 6.0
10)
Find the indicated probability for the normally distributed variable. 11) The weekly salaries of elementary school teachers in one state are normally distributed with a mean of $490 and a standard deviation of $45. What is the probability that a randomly selected elementary school teacher earns more than $525 a week? A) 0.4354 B) 0.7823 C) 0.1003 D) 0.2177 E) 0.2823 Obtain the probability distribution of the random variable. 12) When two balanced dice are rolled, 36 equally likely outcomes are possible as shown below. (1, 1) (1, 2) (1, 3) (1, 4) (1, 5) (1, 6) (2, 1) (2, 2) (2, 3) (2, 4) (2, 5) (2, 6) (3, 1) (3, 2) (3, 3) (3, 4) (3, 5) (3, 6) (4, 1) (4, 2) (4, 3) (4, 4) (4, 5) (4, 6) (5, 1) (5, 2) (5, 3) (5, 4) (5, 5) (5, 6) (6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6) Let X denote the smaller of the two numbers or the common value if doubles are thrown. Find the probability distribution of X. Leave your probabilities in fraction form. A) x P(X = x) 1 11/36 2 1/8 3 7/36 4 5/36 5 1/18 6 1/36 B) x P(X = x) 1 11/36 2 1/4 3 7/36 4 5/36 5 1/12 6 1/36 C) x P(X = x) 1 5/18 2 2/9 3 1/6 4 1/9 5 1/18 6 0
Copyright © 2017 Pearson Education, Inc. 2
11)
12)
D) x P(X = x) 1 5/18 2 1/4 3 7/36 4 5/36 5 1/9 6 1/36 E) x P(X = x) 1 1/6 2 1/6 3 1/6 4 1/6 5 1/6 6 1/6 13) When a coin is tossed four times, sixteen equally likely outcomes are possible as shown below: HHHH HHHT HHTH HHTT HTHH HTHT HTTH HTTT THHH THHT THTH THTT TTHH TTHT TTTH TTTT Let X denote the total number of tails obtained in the four tosses. Find the probability distribution of the random variable X. Leave your probabilities in fraction form. A) x P(X = x) 1 1/4 2 7/16 3 1/4 4 1/16 B) x P(X = x) 0 0 1 1/4 2 3/8 3 1/4 4 1/16 C) x P(X = x) 0 1/16 1 1/4 2 3/8 3 1/4 4 1/16
Copyright © 2017 Pearson Education, Inc. 3
13)
D) x P(X = x) 0 1/16 1 3/16 2 1/2 3 3/16 4 1/16 E) x P(X = x) 0 1/16 1 1/8 2 3/8 3 1/8 4 1/16 Determine whether the conditions for the binomial distribution are satisfied. If the conditions are not satisfied, state why not. 14) X = number of Democrats in a sample of 500 voters selected randomly from a total of 2560 registered voters 14) in one town. 42% of the 2560 registered voters are Democrats. A) Conditions for binomial distribution are not satisfied. np is larger than 15. B) Conditions for binomial distribution are not satisfied. Trials are not independent. Sampling is without replacement and the sample size is less than 20% of the population. C) Conditions for binomial distribution are not satisfied. Trials are not independent. Sampling is without replacement and sample size is more than 10% of the population. D) Conditions for binomial distribution are satisfied. 15) A polling agency in one city randomly selects 1000 registered voters from a pool of 385,600 registered voters of whom 17% are African American. X = number of African Americans in the sample. A) Conditions for binomial distribution are not satisfied. Trials are not independent. Sampling is without replacement and sample size is less than 10% of population. B) Conditions for binomial distribution are satisfied. C) Conditions for binomial distribution are not satisfied. The probability of success is not the same on each trial. D) Conditions for binomial distribution are not satisfied. There are more than two possible outcomes per trial.
Copyright © 2017 Pearson Education, Inc. 4
15)
Find the specified probability distribution of the binomial random variable. 16) In one city, 21% of the population is under 25 years of age. Three people are selected at random from the city. Find the probability distribution of X, the number among the three that are under 25 years of age. x P(X = x) 1 0.21 A) 2 0.0441 3 0.0213 x P(X = x) 0 0.4930 B) 1 0.3932 2 0.0925 3 0.0213 x P(X = x) 1 0.21 C) 2 0.0441 3 0.0093 x P(X = x) 0 0.4930 D) 1 0.1311 2 0.0348 3 0.0093 x P(X = x) 0 0.4930 E) 1 0.3932 2 0.1045 3 0.0093 Use a table of areas for the standard normal curve to find the required z-score. 17) Find the z-score for which the area under the standard normal curve to its left is 0.96 A) 1.82 B) 1.03 C) -1.38 D) -1.75 18) Find the z-score having area 0.86 to its right under the standard normal curve. A) 0.5557 B) 0.8051 C) 1.08 D) -0.5557
16)
17) E) 1.75 18) E) -1.08
Find the standard deviation of the binomial random variable. 19) A die is rolled 18 times and the number of twos that come up is tallied. If this experiment is repeated many times, find the standard deviation for the random variable X, the number of twos. A) 2.5 B) 1.581 C) 1.622 D) 1.537 E) 1.73
19)
20) According to a college survey, 22% of all students work full time. Find the standard deviation for the random variable X, the number of students who work full time in samples of size 16. A) 3.52 B) 1.88 C) 1.66 D) 2.75 E) 1.98
20)
21) The probability of winning a certain lottery is 1/51,949. For people who play 560 times, find the standard deviation for the random variable X, the number of wins. A) 0.1038 B) 2.4569 C) 0.0108 D) 0.1137 E) 0.1223
21)
Copyright © 2017 Pearson Education, Inc. 5
Determine whether the conditions for the normal approximation to the binomial are satisfied. 22) In one town, 2% of adults are HIV positive. A researcher selects a random sample of 400 adults from a population of 23,250. X = number of people in the sample who are HIV positive. In this scenario, the conditions for the binomial distribution are satisfied. Is it reasonable to use the normal approximation to determine an interval within which X will almost certainly fall? Explain why or why not. A) No, the trials are not independent as sampling is without replacement. B) Yes, as n is large. C) No, np is less than 15. D) Yes, sampling is without replacement but the sample size is less than 10% of population. Find the mean of the given probability distribution. 23) According to a survey, the percentage of households by number of vehicles is given in the following table:
22)
23)
(note, the category "5" actually represents 5 or more, but we will use "5" for ease of calculation) A) 2 B) 188.2 C) 2.5 D) 1.88 E) 1.5 24) The random variable X is the number of houses sold by a realtor in a single month at a particular real estate office. Its probability distribution is given in the table below. x P(X = x) 0 0.24 1 0.01 2 0.12 3 0.16 4 0.01 5 0.14 6 0.11 7 0.21 A) 3.60 B) 3.40 C) 3.35 D) 3.50
24)
25) The percentage of families by size in the U.S. is given in the table below.
25)
(note: the category "7" actually represents 7 or more but "7" will be used for calculation purposed) A) 2.5 B) 2 C) 3.1 D) 4.5 E) 0.17
Copyright © 2017 Pearson Education, Inc. 6
Answer Key Testname: CHAPTER 6 FORM B TEST
1) 0.3804 2) 0.3520 3) C 4) C 5) D 6) B 7) E 8) E 9) B 10) D 11) D 12) B 13) C 14) C 15) B 16) E 17) E 18) E 19) B 20) C 21) A 22) C 23) D 24) A 25) C
Copyright © 2017 Pearson Education, Inc. 7
CHAPTER 7 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. For samples of the specified size from the population described, find the mean and standard deviation of the sampling distribution of the sample mean x. 1) The mean and the standard deviation of the sampled population are, respectively, 113.9 and 32.1. 1) n = 64 A) mean= 113.9; standard deviation =0.5 B) mean= 4.0; standard deviation = 113.9 C) mean= 113.9; standard deviation = 32.1 D) mean= 113.9; standard deviation = 4.0 E) mean= 113.9; standard deviation = 2.3 Provide an appropriate response. 2) Assume that the heights of adult Caucasian women have a mean of 63.6 inches and a standard deviation of 2.5 inches. If 100 women are randomly selected, find the probability that they have a mean height greater than 63.0 inches. A) not enough information to determine B) 0.9918 C) 0.8989 D) 0.2881 E) 0.0082
2)
For samples of the specified size from the population described, find the mean and standard deviation of the sampling distribution of the sample mean x. 3) The mean and the standard deviation of the sampled population are, respectively, 77.4 and 4.0. 3) n = 225 A) mean = 20.6; standard deviation = 0.3 B) mean = 77.4; standard deviation = 0.3 C) mean =0.3; standard deviation =77.4 D) mean = 20.6; standard deviation = 0.8 E) mean = 77.4; standard deviation = 0.02 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 4) A recent poll of 1000 British adults asked, "If there were a referendum on the issue, would you favor Britain becoming a republic or remaining a monarchy?" Suppose that the population proportion favoring the monarchy equals 0.70. (This was, in fact, the value for the sample proportion.) For a random sample of 1000 residents, let X denote the number in this category. a. Find the mean of the probability distribution of X. b. Find the standard deviation of the probability distribution of X. Round to four decimal places. c. What range of values falls within 3 standard deviations of the mean? Round to four decimal places. Explain why it is unlikely that X will fall outside this interval.
Copyright © 2017 Pearson Education, Inc. 1
4)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 5) In an exit poll, for which the sample size is 3400, the sampling distribution of the sample proportion voting for the incumbent has mean 0.496 and standard deviation 0.009. Find an interval of values within which the sample proportion will almost certainly fall. A) (0.478, 0.514) B) (0.487, 0.505) C) (0.469, 0.523) D) (0.496, 0.523)
5)
6) The body temperatures of adults have a mean of 98.6 ° F and a standard deviation of 0.60 ° F. If 36 adults are randomly selected, find the probability that their mean body temperature is greater than 98.4° F. A) 0.8188 B) not enough information to determine C) 0.0228 D) 0.9772 E) 0.9360
6)
7) Assume that the heights of adult Caucasian women have a mean of 63.6 inches and a standard deviation of 2.5 inches. If 75 women are randomly selected, find the probability that they have a mean height between 63 and 65 inches. A) not enough information to determine B) 0.9811 C) 0.3071 D) 0.0188 E) 0.2119
7)
Is the observed sample proportion unusual? 8) Based on previous studies, researchers believe that 6% of children are born with a gene that is linked to a certain childhood disease. If the researchers test 950 newborns for the presence of this gene, would it be unlikely for them to find fewer than 25 children with the gene? Answer by calculating the appropriate z-score. Round to the nearest hundredth when necessary. A) No, z = -0.005 B) Yes, z = -4.41 C) No, z = -6.59 D) Yes, z = -6.59 E) No, z = -4.41 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 9) Distinguish between a population distribution, a data distribution, and a sampling distribution.
Copyright © 2017 Pearson Education, Inc. 2
9)
8)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Is the observed sample proportion unusual? 10) A candy company claims that its jelly bean mix contains 15% blue jelly beans. Suppose that the candies are packaged at random in small bags containing about 200 jelly beans. If you receive a bag with 40 blue jelly beans, would you be doubtful of the companyʹs claim? Answer by calculating the appropriate z-score. Round to the nearest hundredth when necessary. A) No, z = 1.77 B) Yes, z = 1.77 C) No, z = 1.98 D) Yes, z = 9.9 E) Yes, z = 1.98 11) Assume that 20% of students at a university wear contact lenses. We randomly pick 200 students. Would it be unusual to obtain a sample proportion of 22%? Answer by calculating the appropriate z-score. Round to the nearest hundredth when necessary. A) No, z = 0.68 B) No, z = 0.71 C) No, z = 25 D) Yes, z = 25 E) Yes, z = 0.71 Provide an appropriate response. 12) In an exit poll, for which the sample size is 3079, the sampling distribution of the sample proportion voting for the incumbent has mean 0.491 and standard deviation 0.013. Would a
10)
11)
12)
^
sample proportion from the exit poll of p = 0.449 be a plausible value expected in the exit poll? Why? A) Yes. It lies within two standard deviations of the mean sample proportion. B) No. It does not lie within one standard deviation of the mean sample proportion. C) Yes. It lies within three standard deviations of the mean sample proportion. D) No. It does not lie within three standard deviations of the mean sample proportion. 13) In one region, the September energy consumption levels for single-family homes had a mean of 1050 kWh and a standard deviation of 218 kWh. If 50 different homes are randomly selected, find the probability that their mean energy consumption level for September is greater than 1075 kWh. A) 0.4180 B) 0.2910 C) 0.2090 D) 0.4562 E) 0.0438
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 14) The sampling distribution of sample means is from a highly skewed population with μ = 4.47 and σ = 1.40. For repeated random samples of size 100 from this population: a. Find the mean and standard deviation of the sampling distribution of the sample mean. b. Explain why the sampling distribution of the sample mean is bell-shaped, even though the population was highly skewed.
Copyright © 2017 Pearson Education, Inc. 3
14)
13)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 15) The closer p is to 0 or 1, the larger n must be in order for the sampling distribution of the sample proportion to A) have a standard deviation equal to [p(1 - p)/n]. B) have a standard deviation equal to [n(p)(1 - p)]. C) be approximately normal. D) be approximately binomial. E) be centered at p. Provide an appropriate response. 16) The body temperatures of adults have a mean of 98.6°F and a standard deviation of 0.60° F. Describe the center and variability of the sampling distribution of the sample mean for a random sample of 50 adults. A) center = 98.6, variability = 0.07 B) center = 0.60, variability = 0.07 C) center = 98.6, variability = 0.14 D) center = 98.6, variability = 0.08 E) center = 98.6, variability = 0.008
15)
16)
17) The body temperatures of adults have a mean of 98.6°F and a standard deviation of 0.60° F. Describe the center and variability of the sampling distribution of the sample mean for a random sample of 75 adults. A) center = 98.6, variability = 0.60 B) center = 98.6, variability = 0.14 C) center = 0.60, variability = 0.07 D) center = 98.6, variability = 0.07 E) center = 98.6, variability = 0.008
17)
18) In one region, the September energy consumption levels for single-family homes had a mean of 1050 kWh and a standard deviation of 218 kWh. Describe the center and variability of the sampling distribution of the sample mean for a random sample of 80 single-family homes from this region. A) center = 1050, variability = 24.37 B) center = 1050, variability = 218 C) center = 1050, variability = 30.83 D) center = 1050, variability = 92.49 E) center = 1050, variability = 61.66
18)
19) The gestation time for humans has a mean of 266 days and a standard deviation of 25 days. If 100 women are randomly selected, find the probability that they have a mean pregnancy between 266 days and 268 days. A) not enough information to determine B) 0.2881 C) 0.7881 D) 0.5517 E) 0.2119
19)
Copyright © 2017 Pearson Education, Inc. 4
Select the most appropriate answer. 20) A student group at a university conducted a poll to determine what percentage of students would vote republican in the upcoming presidential election. The poll randomly surveyed 500 students, 275 of which were female. If 54% of the universityʹs 8200 students are female, which of the following identifies the population distribution of gender at this university? A) mean = 0.54, standard deviation = 0.006 B) P(1) = 0.54, P(0) = 0.46 where 1 = female and 0 = male C) mean = 0.55, standard deviation = 0.02 D) 275 1s, 225 0s, where 1 = female and 0 = male E) P(1) = 0.55, P(0) = 0.45 where 1 = female and 0 = male Provide an appropriate response. 21) Assume that blood pressure readings have a mean of 120 and a standard deviation of 8. If 100 people are randomly selected, find the probability that their mean blood pressure will be greater than 122. A) 0.8615 B) 0.9938 C) 0.8819 D) 0.0062 E) not enough information to determine
^
sample proportion from the exit poll of p = 0.371 be a plausible value expected in the exit poll? Why? A) Yes. It lies within two standard deviations of the mean sample proportion. B) No. It does not lie within three standard deviations of the mean sample proportion. C) No. It does not lie within one standard deviation of the mean sample proportion. D) Yes. It lies within three standard deviations of the mean sample proportion. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 23)
a. Find the probability that the sample mean acreage falls within 100 acres of the population mean acreage. b. If the researcher can increase n above 64, will the probability that the sample mean falls within 100 acres of the population mean increase or decrease? Why? 24) A recent poll was taken to gauge the approval of adults for the European currency. Of the 1000 people sampled in a particular country in the European Union, consider the sample proportion of people who indicate approval of the euro. a. Find the mean and standard deviation of the sampling distribution for this sample proportion, if the population proportion equals 0.67. b. What shape would you expect this sampling distribution to have? Explain. Within what limits would the sample proportion almost certainly fall? Explain.
Copyright © 2017 Pearson Education, Inc. 5
21)
22)
22) In an exit poll, for which the sample size is 2781, the sampling distribution of the sample proportion voting for the incumbent has mean 0.417 and standard deviation 0.014. Would a
23) To estimate the mean acreage of ranches in a certain province a researcher plans to obtain the acreage for a random sample of 64 farms. Results from an earlier study suggest that 800 acres is a reasonable guess for the standard deviation of ranch size.
20)
24)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Find the mean and standard deviation of the sampling distribution of the proportion. 25) Assume that 26% of students at a university wear contact lenses. We randomly pick 300 students. Describe the sampling distribution model of the proportion of students in this group who wear contact lenses. A) mean = 74%; standard deviation = 2.5% B) mean = 74%; standard deviation = 1.1% C) mean = 26%; standard deviation = 1.1% D) mean = 26%; standard deviation = 2.5% E) There is not enough information to describe the distribution.
Copyright © 2017 Pearson Education, Inc. 6
25)
Answer Key Testname: CHAPTER 7 FORM A TEST
1) D 2) B 3) B 4) a. 700; b. 14.4914; c. 656.5258 to 743.4742; It is unlikely that X will fall outside this interval b/c np and n(1 - p) are both much greater than 15 which implies that X is approximately normally distributed. Therefore, the probability is very close to 1.0 that the number in this category would fall within 3 standard deviations of 700. 5) C 6) D 7) B 8) B 9) A population distribution is the probability distribution from which we take the sample. A data distribution is the distribution of the sample data. A sampling distribution is the probability distribution of a sample statistic. 10) C 11) B 12) D 13) C 14) a. mean = 4.47, standard deviation = 0.14; b. The central limit theorem holds no matter what the shape of the population distribution. It states that for random samples of sufficiently large size (at least about 30), the sampling distribution of the sample means is approximately normal. 15) C 16) D 17) D 18) A 19) B 20) B 21) D 22) B 23) a. 0.6826; b. It will increase, because a larger sample size results in a smaller standard error of the sample mean. 24) a. mean = 0.67, standard deviation = 0.0149; b. The sampling distribution would be approximately normal, b/c both np and n(1 p) are much larger than 15. The sample proportion would almost certainly fall within 3(0.0149) = 0.0447 of the mean of 0.67, because the probability is very close to 1.0 that the sample proportion would fall within 3 standard deviations of the mean of 0.67 given that the sample proportions approximately follow a normal distribution. 25) D
Copyright © 2017 Pearson Education, Inc. 7
CHAPTER 7 FORM B TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Find the mean and standard deviation of the sampling distribution of the proportion. 1) Based on past experience, a bank believes that 8% of the people who receive loans will not make payments on time. The bank has recently approved 600 loans. Describe the sampling distribution model of the proportion of clients in this group who may not make timely payments. A) There is not enough information to describe the distribution. B) mean = 8%; standard deviation = 0.3% C) mean = 8%; standard deviation = 1.1% D) mean = 92%; standard deviation = 0.3% E) mean = 92%; standard deviation = 1.1% Provide an appropriate response. 2) In one region, the September energy consumption levels for single-family homes had a mean of 1050 kWh and a standard deviation of 218 kWh. Describe the center and variability of the sampling distribution of the sample mean for a random sample of 50 single-family homes from this region. A) center = 1050, variability = 218 B) center = 1050, variability = 30.83 C) center = 1050, variability = 4.36 D) center = 1050, variability = 61.66 E) center = 1050, variability = 92.49 3) The body temperatures of adults have a mean of 98.6°F and a standard deviation of 0.60° F. Describe the center and variability of the sampling distribution of the sample mean for a random sample of 75 adults. A) center = 98.6, variability = 0.008 B) center = 98.6, variability = 0.60 C) center = 0.60, variability = 0.07 D) center = 98.6, variability = 0.07 E) center = 98.6, variability = 0.14 Find the mean and standard deviation of the sampling distribution of the proportion. 4) A candy company claims that its jelly bean mix contains 21% blue jelly beans. Suppose that the candies are packaged at random in small bags containing about 400 jelly beans. Describe the sampling distribution model of the proportion of blue jelly beans in a bag. A) mean = 21%; standard deviation = 2.0% B) There is not enough information to describe the distribution. C) mean = 79%; standard deviation = 0.8% D) mean = 79%; standard deviation = 2.0% E) mean = 21%; standard deviation = 0.8% Provide an appropriate response. 5) In an exit poll, for which the sample size is 2797, the sampling distribution of the sample proportion voting for the incumbent has mean 0.554 and standard deviation 0.009. Find an interval of values within which the sample proportion will almost certainly fall. A) (0.554, 0.581) B) (0.545, 0.563) C) (0.536, 0.572) D) (0.527, 0.581)
Copyright © 2017 Pearson Education, Inc. 1
1)
2)
3)
4)
5)
6) Assume that the heights of adult Caucasian women have a mean of 63.6 inches and a standard deviation of 2.5 inches. If 100 women are randomly selected, find the probability that they have a mean height greater than 63.0 inches. A) 0.9918 B) 0.8989 C) 0.0082 D) not enough information to determine E) 0.2881
6)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 7) To estimate the mean acreage of ranches in a certain province a researcher plans to obtain the acreage for a random sample of 64 farms. Results from an earlier study suggest that 800 acres is a reasonable guess for the standard deviation of ranch size.
7)
a. Find the probability that the sample mean acreage falls within 100 acres of the population mean acreage. b. If the researcher can increase n above 64, will the probability that the sample mean falls within 100 acres of the population mean increase or decrease? Why? MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 8) In an exit poll, for which the sample size is 3400, the sampling distribution of the sample proportion voting for the incumbent has mean 0.496 and standard deviation 0.009. Find an interval of values within which the sample proportion will almost certainly fall. A) (0.496, 0.523) B) (0.469, 0.523) C) (0.487, 0.505) D) (0.478, 0.514)
8)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 9) A recent poll of 1000 British adults asked, "If there were a referendum on the issue, would you favor Britain becoming a republic or remaining a monarchy?" Suppose that the population proportion favoring the monarchy equals 0.70. (This was, in fact, the value for the sample proportion.) For a random sample of 1000 residents, let X denote the number in this category.
9)
a. Find the mean of the probability distribution of X. b. Find the standard deviation of the probability distribution of X. Round to four decimal places. c. What range of values falls within 3 standard deviations of the mean? Round to four decimal places. Explain why it is unlikely that X will fall outside this interval. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 10) In one region, the September energy consumption levels for single-family homes had a mean of 1050 kWh and a standard deviation of 218 kWh. If 50 different homes are randomly selected, find the probability that their mean energy consumption level for September is greater than 1075 kWh. A) 0.2910 B) 0.2090 C) 0.0438 D) 0.4180 E) 0.4562
Copyright © 2017 Pearson Education, Inc. 2
10)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 11) In North America, female adult heights are approximately normal with μ = 65 inches and σ = 3.5 inches. The heights of 50 females were measured at a national collegiate volleyball tournament. The sample mean height was found to be 70 inches.
11)
a. Using the population parameters given above, what is the probability of obtaining a sample mean height of 70 inches or higher with a random sample of n = 50? b. Does this probability make you question the population mean stated for female heights? Justify why you believe this sample mean may not be representative of the population of female heights. 12) Distinguish between a population distribution, a data distribution, and a sampling distribution.
12)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 13) The gestation time for humans has a mean of 266 days and a standard deviation of 25 days. If 100 women are randomly selected, find the probability that they have a mean pregnancy between 266 days and 268 days. A) 0.2119 B) 0.5517 C) not enough information to determine D) 0.2881 E) 0.7881
13)
14) Assume that blood pressure readings have a mean of 120 and a standard deviation of 8. If 100 people are randomly selected, find the probability that their mean blood pressure will be greater than 122. A) 0.8819 B) 0.8615 C) 0.0062 D) 0.9938 E) not enough information to determine
14)
Find the mean and standard deviation of the sampling distribution of the proportion. 15) A realtor has been told that 43% of homeowners in a city prefer to have a finished basement. She surveys a group of 300 homeowners randomly chosen from her client list. Describe the sampling distribution model of the proportion of homeowners in this sample who prefer a finished basement. A) mean = 43%; standard deviation = 1.4% B) mean = 57%; standard deviation = 2.9% C) mean = 57%; standard deviation = 1.4% D) mean = 43%; standard deviation = 2.9% E) There is not enough information to describe the distribution.
Copyright © 2017 Pearson Education, Inc. 3
15)
Provide an appropriate response. 16) Assume that the heights of adult Caucasian women have a mean of 63.6 inches and a standard deviation of 2.5 inches. If 75 women are randomly selected, find the probability that they have a mean height between 63 and 65 inches. A) 0.9811 B) 0.2119 C) 0.3071 D) 0.0188 E) not enough information to determine
16)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 17) A recent poll was taken to gauge the approval of adults for the European currency. Of the 1000 people sampled in a particular country in the European Union, consider the sample proportion of people who indicate approval of the euro.
17)
a. Find the mean and standard deviation of the sampling distribution for this sample proportion, if the population proportion equals 0.67. b. What shape would you expect this sampling distribution to have? Explain. Within what limits would the sample proportion almost certainly fall? Explain. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 18) The body temperatures of adults have a mean of 98.6° F and a standard deviation of 0.60° F. If 36 adults are randomly selected, find the probability that their mean body temperature is greater than 98.4° F. A) 0.9772 B) 0.9360 C) 0.0228 D) 0.8188 E) not enough information to determine
18)
19) The body temperatures of adults have a mean of 98.6°F and a standard deviation of 0.60° F. Describe the center and variability of the sampling distribution of the sample mean for a random sample of 50 adults. A) center = 98.6, variability = 0.08 B) center = 98.6, variability = 0.07 C) center = 98.6, variability = 0.008 D) center = 0.60, variability = 0.07 E) center = 98.6, variability = 0.14
19)
Select the most appropriate answer. 20) The closer p is to 0 or 1, the larger n must be in order for the sampling distribution of the sample proportion to A) have a standard deviation equal to [n(p)(1 - p)]. B) be approximately binomial. C) be approximately normal. D) have a standard deviation equal to [p(1 - p)/n]. E) be centered at p.
Copyright © 2017 Pearson Education, Inc. 4
20)
Is the observed sample proportion unusual? 21) Assume that 20% of students at a university wear contact lenses. We randomly pick 200 students. Would it be unusual to obtain a sample proportion of 22%? Answer by calculating the appropriate z-score. Round to the nearest hundredth when necessary. A) No, z = 0.68 B) No, z = 0.71 C) Yes, z = 0.71 D) No, z = 25 E) Yes, z = 25
21)
22) A candy company claims that its jelly bean mix contains 15% blue jelly beans. Suppose that the candies are packaged at random in small bags containing about 200 jelly beans. If you receive a bag with 40 blue jelly beans, would you be doubtful of the companyʹs claim? Answer by calculating the appropriate z-score. Round to the nearest hundredth when necessary. A) Yes, z = 9.9 B) No, z = 1.77 C) Yes, z = 1.77 D) Yes, z = 1.98 E) No, z = 1.98
22)
23) Based on previous studies, researchers believe that 6% of children are born with a gene that is linked to a certain childhood disease. If the researchers test 950 newborns for the presence of this gene, would it be unlikely for them to find fewer than 25 children with the gene? Answer by calculating the appropriate z-score. Round to the nearest hundredth when necessary. A) No, z = -6.59 B) No, z = -4.41 C) Yes, z = -6.59 D) Yes, z = -4.41 E) No, z = -0.005
23)
Find the mean and standard deviation of the sampling distribution of the proportion. 24) Assume that 26% of students at a university wear contact lenses. We randomly pick 300 students. Describe the sampling distribution model of the proportion of students in this group who wear contact lenses. A) mean = 74%; standard deviation = 1.1% B) mean = 26%; standard deviation = 1.1% C) There is not enough information to describe the distribution. D) mean = 26%; standard deviation = 2.5% E) mean = 74%; standard deviation = 2.5%
24)
For samples of the specified size from the population described, find the mean and standard deviation of the sampling distribution of the sample mean x. 25) The mean and the standard deviation of the sampled population are, respectively, 77.4 and 4.0. 25) n = 225 A) mean =0.3; standard deviation =77.4 B) mean = 77.4; standard deviation = 0.3 C) mean = 20.6; standard deviation = 0.3 D) mean = 77.4; standard deviation = 0.02 E) mean = 20.6; standard deviation = 0.8
Copyright © 2017 Pearson Education, Inc. 5
Answer Key Testname: CHAPTER 7 FORM B TEST
1) C 2) B 3) D 4) A 5) D 6) A 7) a. 0.6826; b. It will increase, because a larger sample size results in a smaller standard error of the sample mean. 8) B 9) a. 700; b. 14.4914; c. 656.5258 to 743.4742; It is unlikely that X will fall outside this interval b/c np and n(1 - p) are both much greater than 15 which implies that X is approximately normally distributed. Therefore, the probability is very close to 1.0 that the number in this category would fall within 3 standard deviations of 700. 10) B 11) a. approximately zero; b. Yes, if this sample is representative of the population of heights of adult females in North America. The sampling distribution of the sample mean is approximately normal by the central limit theorem with a mean of 65 inches and a standard error of 0.5 inches. Therefore, approximately 99.7% of all possible sample means for all samples of size 50 drawn from this population will fall between 63.5 inches and 66.5 inches assuming 65 inches is the true mean height of adult females in North America. If 65 inches is the true mean height of adult females in North America, it would be extremely unlikely to observe a sample mean of 70 inches. 12) A population distribution is the probability distribution from which we take the sample. A data distribution is the distribution of the sample data. A sampling distribution is the probability distribution of a sample statistic. 13) D 14) C 15) D 16) A 17) a. mean = 0.67, standard deviation = 0.0149; b. The sampling distribution would be approximately normal, b/c both np and n(1 p) are much larger than 15. The sample proportion would almost certainly fall within 3(0.0149) = 0.0447 of the mean of 0.67, because the probability is very close to 1.0 that the sample proportion would fall within 3 standard deviations of the mean of 0.67 given that the sample proportions approximately follow a normal distribution. 18) A 19) A 20) C 21) B 22) E 23) D 24) D 25) B
Copyright © 2017 Pearson Education, Inc. 6
CHAPTER 8 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Find the margin of error 1) In a survey of 2300 T.V. viewers, 690 said they watch network news programs. Find the margin of error for the 95% confidence interval used to estimate the population proportion. A) 0.0140 B) 0.0374 C) 0.0187 D) 0.0245 E) 0.0215 Select the most appropriate answer. 2) In an effort to monitor the level of lead in the air after an explosion at a battery factory, the following lead readings were taken for 6 days following the explosion (in ug/ m3 ). What is the point estimate for the population mean lead level in the air over the 6 days following the explosion? Monday 5.40 A) 1.54
Tuesday 1.10 B) 0.42
Wednesday 0.42
Thursday 0.73
C) 0.73
Friday 0.48 D) 0.50
1)
2)
Saturday 1.10 E) 2.91
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 3) When it is difficult to derive a standard error or confidence interval formula by using mathematical techniques, the method is used.
3)
4) A city council votes to appropriate funds for a new civic auditorium. The mayor of the city threatens to veto this decision unless it can be shown that a majority of citizens would use it at least twice a year. The council commissions a poll of city residents. For a random sample of 400 residents, 230 say they would use the facility at least twice a year. Find a 95% confidence interval for the proportion of all residents of the town who would say they would use the proposed auditorium at least twice a year. Interpret the interval and advise the mayor.
4)
5) How tall is your average English classmate? To determine this, you measure the height of a random sample of 15 of your 100 fellow students, finding a 95% confidence interval for the mean height of 67.25 to 69.75 inches.
5)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Use the given degree of confidence and sample data to construct a confidence interval for the population proportion. 6) A survey of 865 voters in one state reveals that 408 favor approval of an issue before the legislature. 6) Construct a 95% confidence interval for the proportion of all voters in the state who favor approval. A) (0.438, 0.505) B) (0.431, 0.512) C) (0.423, 0.520) D) (0.469, 0.475) E) (0.444, 0.500)
Copyright © 2017 Pearson Education, Inc. 1
Select the most appropriate answer. 7) When determining the sample size for estimating a population proportion for a given level of confidence and a desired margin of error, the closer to 0.50 that p is estimated to be A) the larger the sample size required. B) the farther from 0.50 that 1 - p is estimated to be. C) has no effect on the sample size required. D) has an undeterminable effect on the sample size required. E) the smaller the sample size required.
7)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 8) Alarmed at the rising gas prices in your town, you decide to estimate the average gas price for a gallon of regular gas. From your sample of 25 gas stations, you calculate a 90% confidence interval of ($2.99, $3.11)
8)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 9) In monitoring lead in the air after an explosion at a battery factory, it is found that the amounts of lead (in ug/m3) over a 6 day period had a standard error of 1.91. Find the margin of error that corresponds to a 95% confidence interval. A) none of these B) 5.65 C) 3.74 D) 1.91 E) 95 Find the sample size 10) The drying times for a certain type of cement are normally distributed with a standard deviation of 73 minutes. A researcher wishes to estimate the mean drying time for this type of cement. Find the sample size needed to assure with 68% confidence that the sample mean will not differ from the population mean by more than 4 minutes. A) 3 B) 19 C) 330 D) 178 E) 5 Select the most appropriate answer. 11) The large sample confidence interval formula for estimating p should only be used when ^
^
B) n(1 - p ) ≥ 15 ^
C) np ≥ 15 ^
D) np ≥ 15 or n(1 - p ) ≥ 15 E) n ≥ 30 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 12) Thirty randomly selected students took the statistics final. If the sample mean was 82 and the standard deviation was 12.2, construct a 99% confidence interval for the mean score of all students. 13) A laboratory tested twelve chicken eggs and found that the mean amount of cholesterol was 246 milligrams with s = 11.7 milligrams. Construct a 95% confidence interval for the true mean cholesterol content of all such eggs. Copyright © 2017 Pearson Education, Inc. 2
10)
11)
^
A) np ≥ 15 and n(1 - p ) ≥ 15
^
9)
12)
13)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 14) Which of the following statements about the t distribution is true? A) It assumes the population is normally distributed. B) It approaches the normal distribution as the sample size decreases. C) It has less area in the tails than does the normal distribution. D) It is used to construct confidence intervals for the population mean when the population standard deviation is known. E) It assumes the population is not normally distributed.
14)
Find the margin of error 15) In a survey of 280 adults over 50, 75% said they were taking vitamin supplements. Find the margin of error for this survey if we want a 99% confidence in our estimate of the percentage of adults over 50 who take vitamin supplements. A) 5.07% B) 6.66% C) 6.03% D) 13.3% E) 7.00%
15)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 16) A sample of 81 calculus students at a large college had a mean mathematics ACT score of 26 with a standard deviation of 6. Find a 95% confidence interval for the mean mathematics ACT score for all calculus students at this college.
16)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Find the margin of error 17) A recent poll of 500 residents in a large town found that only 36% were in favor of a proposed referendum to build a new high school. Find the margin of error for this poll if we want 95% confidence in our estimate of the percentage of residents in favor of this proposed referendum. A) 2.5% B) 8.42% C) 5.53% D) 4.21% E) 3.52%
17)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 18) Physiologists often use the forced vital capacity as a way to assess a person's ability to move air in and out of their lungs. A researcher wishes to estimate the forced vital capacity of people suffering from asthma. A random sample of 15 asthmatics yields the following data on forced vital capacity, in liters.
18)
5.1 5.3 4.4 3.9 4.3 3.3 3.6 4.3 3.4 3.1 3.2 3.5 4.8 4.0 5.1 Use the data to obtain a point estimate of the mean forced vital capacity for all asthmatics. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Find the margin of error 19) A survey found that 89% of a random sample of 1024 American adults approved of cloning endangered animals. Find the margin of error for this survey if we want 90% confidence in our estimate of the percentage of American adults who approve of cloning endangered animals. A) 1.92% B) 1.61% C) 1.10% D) 16.5% E) 4.85%
Copyright © 2017 Pearson Education, Inc. 3
19)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 20) A sociologist develops a test to measure attitudes about public transportation, and 27 randomly selected subjects are given the test. Their mean score is 76.2 and their standard deviation is 21.4. Construct the 95% confidence interval for the mean score of all such subjects.
20)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 21) In a poll of registered voters nationwide, the responses to the question ʺWho do you blame the most for the recent increase in gasoline prices: oil producing countries, oil companies, the President, Americans who drive vehicles that use a lot of gasoline, or normal supply and demand pressures?ʺ are given in the table that follows. Based on these responses, find a point estimate for the population proportion who would answer ʺthe President.ʺ
21)
Oil Companies 736 342 the President 222 Supply & Demand Oil Producing Countries 188 68 American Drivers Total 1556 A) 311 B) 0.22 C) 0.47 D) 0.34 E) none of these 22) In practice a
is an estimated standard deviation of a sampling distribution.
22)
A) population standard deviation B) sample standard deviation C) margin of error D) standard error. E) none of these 23) A 90% confidence interval for the mean percentage of airline reservations being canceled on the day of the flight is (1.3%, 5.1%). What is the point estimate of the mean percentage of reservations that are canceled on the day of the flight? A) 1.90% B) 3.20% C) 3.80% D) 5.10% E) 2.55% Select the most appropriate answer. 24) The width of a confidence interval estimate for a proportion is A) wider for 90% confidence than for 95% confidence. B) narrower when the sample proportion is 0.10 than when the sample proportion is 0.45. C) narrower for a sample size of 50 than for a sample size of 100. D) wider when the sample proportion is 0.95 than when the sample proportion is 0.55. E) narrowest when the sample proportion is 0.5.
Copyright © 2017 Pearson Education, Inc. 4
23)
24)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 25) A poll asked whether marijuana should be legalized for medical purposes. 72% said definitely yes, 20% said probably, 2% said probably not, 5% said definitely not, and 2% had no opinion. a. Assuming that this was a random sample, construct a 95% confidence interval for the population proportion who would answer definitely yes or probably. Can you conclude that a majority of all Canadians would answer this way? Explain. b. Check that the sample size was large enough to construct the interval in (a).
Copyright © 2017 Pearson Education, Inc. 5
25)
Answer Key Testname: CHAPTER 8 FORM A TEST
1) C 2) A 3) bootstrap 4) The 95% CI for p is (0.527, 0.623). None of the numbers in the confidence interval fall at or below 0.50. So we infer that more than half the population would use the new civic auditorium at least twice a year and advise the mayor not to veto the city council's decision. 5) 1.25 inches 6) A 7) A 8) $0.06 9) C 10) C 11) A 12) (75.86, 88.14) 13) (238.6, 253.4) 14) A 15) B 16) (24.7, 27.3) 17) D 18) 4.09 liters 19) B 20) (67.7, 84.7) 21) B 22) D 23) B 24) B 25) a. A 95% CI for p is (0.896, 0.0944). Yes, since none of the values in the CI are at or below 0.50, one can conclude that a ^
^
majority of all Canadians would answer this way; b. Both np = 500(0.92) = 460 and n(1 - p ) = 500(0.08) = 40 are at least 15, so the CI is valid.
Copyright © 2017 Pearson Education, Inc. 6
CHAPTER 8 FORM B TEST Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) A poll asked whether marijuana should be legalized for medical purposes. 72% said definitely yes, 20% said probably, 2% said probably not, 5% said definitely not, and 2% had no opinion. a. Assuming that this was a random sample, construct a 95% confidence interval for the population proportion who would answer definitely yes or probably. Can you conclude that a majority of all Canadians would answer this way? Explain. b. Check that the sample size was large enough to construct the interval in (a).
1)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 2) Which of the following statements about the t distribution is true? A) It assumes the population is normally distributed. B) It approaches the normal distribution as the sample size decreases. C) It has less area in the tails than does the normal distribution. D) It is used to construct confidence intervals for the population mean when the population standard deviation is known. E) It assumes the population is not normally distributed.
2)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 3) Physiologists often use the forced vital capacity as a way to assess a person's ability to move air in and out of their lungs. A researcher wishes to estimate the forced vital capacity of people suffering from asthma. A random sample of 15 asthmatics yields the following data on forced vital capacity, in liters. 5.1 5.3 4.4 3.9 4.3 3.3 3.6 4.3 3.4 3.1 3.2 3.5 4.8 4.0 5.1 Use the data to obtain a point estimate of the mean forced vital capacity for all asthmatics.
Copyright © 2017 Pearson Education, Inc. 1
3)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 4) In a poll of registered voters nationwide, the responses to the question ʺWho do you blame the most for the recent increase in gasoline prices: oil producing countries, oil companies, the President, Americans who drive vehicles that use a lot of gasoline, or normal supply and demand pressures?ʺ are given in the table that follows. Based on these responses, find a point estimate for the population proportion who would answer ʺthe President.ʺ
4)
Oil Companies 736 342 the President 222 Supply & Demand Oil Producing Countries 188 American Drivers 68 Total 1556 A) 0.47 B) 311 C) 0.22 D) none of these E) 0.34 5) In monitoring lead in the air after an explosion at a battery factory, it is found that the amounts of lead (in ug/m3) over a 6 day period had a standard error of 1.91. Find the margin of error that corresponds to a 95% confidence interval. A) 95 B) none of these C) 1.91 D) 3.74 E) 5.65
5)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 6) A sample of 81 calculus students at a large college had a mean mathematics ACT score of 26 with a standard deviation of 6. Find a 95% confidence interval for the mean mathematics ACT score for all calculus students at this college.
6)
7) A laboratory tested twelve chicken eggs and found that the mean amount of cholesterol was 246 milligrams with s = 11.7 milligrams. Construct a 95% confidence interval for the true mean cholesterol content of all such eggs.
7)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 8) A 90% confidence interval for the mean percentage of airline reservations being canceled on the day of the flight is (1.3%, 5.1%). What is the point estimate of the mean percentage of reservations that are canceled on the day of the flight? A) 3.80% B) 5.10% C) 3.20% D) 2.55% E) 1.90% SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 9) Thirty randomly selected students took the statistics final. If the sample mean was 82 and the standard deviation was 12.2, construct a 99% confidence interval for the mean score of all students.
Copyright © 2017 Pearson Education, Inc. 2
9)
8)
10) Alarmed at the rising gas prices in your town, you decide to estimate the average gas price for a gallon of regular gas. From your sample of 25 gas stations, you calculate a 90% confidence interval of ($2.99, $3.11)
10)
11) A sociologist develops a test to measure attitudes about public transportation, and 27 randomly selected subjects are given the test. Their mean score is 76.2 and their standard deviation is 21.4. Construct the 95% confidence interval for the mean score of all such subjects.
11)
12) When it is difficult to derive a standard error or confidence interval formula by using mathematical method is used. techniques, the
12)
13) A city council votes to appropriate funds for a new civic auditorium. The mayor of the city threatens to veto this decision unless it can be shown that a majority of citizens would use it at least twice a year. The council commissions a poll of city residents. For a random sample of 400 residents, 230 say they would use the facility at least twice a year. Find a 95% confidence interval for the proportion of all residents of the town who would say they would use the proposed auditorium at least twice a year. Interpret the interval and advise the mayor.
13)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 14) In practice a
14)
is an estimated standard deviation of a sampling distribution.
A) margin of error B) population standard deviation C) standard error. D) sample standard deviation E) none of these SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 15) How tall is your average English classmate? To determine this, you measure the height of a random sample of 15 of your 100 fellow students, finding a 95% confidence interval for the mean height of 67.25 to 69.75 inches.
15)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 16) The width of a confidence interval estimate for a proportion is A) wider for 90% confidence than for 95% confidence. B) wider when the sample proportion is 0.95 than when the sample proportion is 0.55. C) narrower for a sample size of 50 than for a sample size of 100. D) narrower when the sample proportion is 0.10 than when the sample proportion is 0.45. E) narrowest when the sample proportion is 0.5. 17) When determining the sample size for estimating a population proportion for a given level of confidence and a desired margin of error, the closer to 0.50 that p is estimated to be A) the smaller the sample size required. B) the larger the sample size required. C) has no effect on the sample size required. D) the farther from 0.50 that 1 - p is estimated to be. E) has an undeterminable effect on the sample size required.
Copyright © 2017 Pearson Education, Inc. 3
16)
17)
18) The large sample confidence interval formula for estimating p should only be used when A) n ≥ 30 ^
18)
^
B) np ≥ 15 and n(1 - p ) ≥ 15 ^
C) n(1 - p ) ≥ 15 ^
^
D) np ≥ 15 or n(1 - p ) ≥ 15 ^
E) np ≥ 15 19) In an effort to monitor the level of lead in the air after an explosion at a battery factory, the following lead readings were taken for 6 days following the explosion (in ug/ m3 ). What is the point estimate for the population mean lead level in the air over the 6 days following the explosion? Monday 5.40 A) 2.91
Tuesday 1.10 B) 1.54
Wednesday 0.42
Thursday 0.73
C) 0.73
Friday 0.48 D) 0.42
19)
Saturday 1.10 E) 0.50
Find the margin of error 20) A recent poll of 500 residents in a large town found that only 36% were in favor of a proposed referendum to build a new high school. Find the margin of error for this poll if we want 95% confidence in our estimate of the percentage of residents in favor of this proposed referendum. A) 4.21% B) 3.52% C) 5.53% D) 8.42% E) 2.5%
20)
21) In a survey of 280 adults over 50, 75% said they were taking vitamin supplements. Find the margin of error for this survey if we want a 99% confidence in our estimate of the percentage of adults over 50 who take vitamin supplements. A) 5.07% B) 7.00% C) 13.3% D) 6.03% E) 6.66%
21)
22) In a survey of 2300 T.V. viewers, 690 said they watch network news programs. Find the margin of error for the 95% confidence interval used to estimate the population proportion. A) 0.0245 B) 0.0215 C) 0.0140 D) 0.0374 E) 0.0187
22)
23) A survey found that 89% of a random sample of 1024 American adults approved of cloning endangered animals. Find the margin of error for this survey if we want 90% confidence in our estimate of the percentage of American adults who approve of cloning endangered animals. A) 1.61% B) 1.92% C) 16.5% D) 1.10% E) 4.85%
23)
Use the given degree of confidence and sample data to construct a confidence interval for the population proportion. 24) A survey of 865 voters in one state reveals that 408 favor approval of an issue before the legislature. 24) Construct a 95% confidence interval for the proportion of all voters in the state who favor approval. A) (0.444, 0.500) B) (0.469, 0.475) C) (0.431, 0.512) D) (0.438, 0.505) E) (0.423, 0.520)
Copyright © 2017 Pearson Education, Inc. 4
Find the sample size 25) The drying times for a certain type of cement are normally distributed with a standard deviation of 73 minutes. A researcher wishes to estimate the mean drying time for this type of cement. Find the sample size needed to assure with 68% confidence that the sample mean will not differ from the population mean by more than 4 minutes. A) 3 B) 5 C) 330 D) 178 E) 19
Copyright © 2017 Pearson Education, Inc. 5
25)
Answer Key Testname: CHAPTER 8 FORM B TEST
1) a. A 95% CI for p is (0.896, 0.0944). Yes, since none of the values in the CI are at or below 0.50, one can conclude that a ^
^
majority of all Canadians would answer this way; b. Both np = 500(0.92) = 460 and n(1 - p ) = 500(0.08) = 40 are at least 15, so the CI is valid. 2) A 3) 4.09 liters 4) C 5) D 6) (24.7, 27.3) 7) (238.6, 253.4) 8) C 9) (75.86, 88.14) 10) $0.06 11) (67.7, 84.7) 12) bootstrap 13) The 95% CI for p is (0.527, 0.623). None of the numbers in the confidence interval fall at or below 0.50. So we infer that more than half the population would use the new civic auditorium at least twice a year and advise the mayor not to veto the city council's decision. 14) C 15) 1.25 inches 16) D 17) B 18) B 19) B 20) A 21) E 22) E 23) A 24) D 25) C
Copyright © 2017 Pearson Education, Inc. 6
CHAPTER 9 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Determine the null and alternative hypotheses. 1) In the past, the mean running time for a certain type of radio battery has been 9.6 hours. The manufacturer has introduced a change in the production method and wants to perform a hypothesis test to determine whether the mean running time has changed as a result. A) H0 : μ > 9.6hours
1)
Ha: μ > 9.6 hours B) H0 : μ = 9.6 hours Ha: μ > 9.6 hours C) H0 : μ ≥ 9.6 hours Ha: μ = 9.6 hours D) H0 : μ ≠ 9.6 hours Ha: μ = 9.6 hours E) H0 : μ = 9.6 hours Ha: μ ≠ 9.6 hours Assume that a simple random sample has been selected from a normally distributed population. Find the test statistic t. 2) Test the claim that the mean lifetime of a particular car engine is greater than 220,000 miles. Sample data are 2) summarized as n = 23, x = 226,450 miles, and s = 11,500 miles. Use a significance level of α = 0.01. Find the test statistic t. A) -2.69 B) 2.24 C) 12.9 D) 2.69 E) -2.24 Find the P-value for the indicated hypothesis test. 3) A manufacturer claims that fewer than 6% of its fax machines are defective. In a random sample of 100 such fax machines, 5% are defective. Find the P-value for testing the manufacturer's claim. A) 0.06 B) 0.17 C) 0.33 D) 0.34 E) 0.16 Provide an appropriate response. 4) Suppose 1000 tests are run to test a null hypothesis using α = 0.05. If the null hypothesis is true, about how many of these tests would you expect to show statistically significant results? A) 1000 B) 50 C) 0 D) cannot be determined from the information given E) 5 For the given significance test, determine the probability of a Type II error or the power, as specified. 5) Suppose we wish to test H0 : p = 0.25 against H1 : p > 0.25 using α = 0.05. If p is actually equal to 0.4, what is the power of the test assuming n = 55? A) 0.73 B) 0.93
C) 0.21
D) 0.27
Copyright © 2017 Pearson Education, Inc. 1
E) 0.79
3)
4)
5)
Select the most appropriate answer. 6) Given Ha ≠ p . What is the P-value if the test statistics is calculated to be z = -0.12? 0 A) 0.90 B) 0.10 C) 0.95 D) 0.05
6) E) 0.12
7) A small P-value indicates which of the following? I) the parameter value indicated by the null hypothesis is not plausible II) the null hypothesis should be rejected in favor of the alternative hypothesis III) the difference between the parameter value and the value specified in the null hypothesis is practically significant A) I only B) I, II, and III C) II only D) III only E) both I and II Provide an appropriate response. 8) A study uses a random sample of size 9. The test statistic for testing H0 : μ = 12 versus Ha : μ > 12
7)
8)
is t = 1.8. Find the approximate P-value. A) 0.025 B) 0.95 C) Cannot be determined without the sample standard deviation. D) 0.10 E) 0.05 For the given sample data and null hypothesis, compute the value of the test statistic, z 9) 410 people were asked if they were satisfied with their jobs. 37% of the responses were affirmative. H0 : p = 0.30 A) 3.09 B) 0.04 C) 0.15 D) 2.61 E) 4.12
9)
For the given significance test, explain the meaning of a Type I error, a Type II error, or a correct decision as specified. 10) A health insurer has determined that the "reasonable and customary" fee for a certain medical procedure is 10) $1200. They suspect that the average fee charged by one particular clinic for this procedure is higher than $1200. The insurer performs a significance test to determine whether their suspicion is correct using α = 0.05. The hypotheses are: H0 : μ = $1200 Ha: μ > $1200 If the P-value is 0.09 and a decision error is made, what type of error is it? Explain. A) Type I error. We conclude that the average fee charged for the procedure is not higher than $1200 when it actually is higher. B) Type II error. We conclude that the average fee charged for the procedure is higher than $1200 when it actually is not higher. C) Type II error. We conclude that the average fee charged for the procedure is not higher than $1200 when it actually is higher. D) Type I error. We conclude that the average fee charged for the procedure is higher than $1200 when it actually is not higher.
Copyright © 2017 Pearson Education, Inc. 2
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 11) A sample was used to obtain a 95% confidence interval for the population mean age of graduate students at a large university. The 95% confidence interval for μ was (23, 27). If the same sample had been used to test the null hypothesis that the population mean age is equal to 28 versus the alternative hypothesis that the population mean age differs from 28, would the null hypothesis be rejected at α = 0.05? Explain.
11)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. For the given significance test, determine the probability of a Type II error or the power, as specified. 12) Suppose we wish to test H0 : p = 0.5 against H1 : p < 40 using α = 0.05. If p is actually equal to 0.4, what is the probability of a type II error assuming n = 150? A) 0.79 B) 0.05 C) 0.69
D) 0.31
12)
E) 0.21
For the given significance test, explain the meaning of a Type I error, a Type II error, or a correct decision as specified. 13) At one school, the average amount of time tenth-graders spend watching television each week is 21.6 hours. 13) The principal introduces a campaign to encourage the students to watch less television. One year later, the principal performs a significance test using α = 0.05 to determine whether the average amount of time spent watching television per week has decreased. The hypotheses are: H0 : μ = 21.6 hours Ha: μ < 21.6 hours If the P-value = 0.04 and a decision error is made, what type of error is it? Explain. A) Type II error. We conclude that the average amount of time spent watching television each week is 21.6 hours when it is in fact less. B) Type II error. We conclude that the average amount of time spent watching television each week is less than 21.6 hours when it in fact is not. C) Type I error. We conclude that the average amount of time spent watching television each week is less than 21.6 hours when it in fact is not. D) Type I error. We conclude that the average amount of time spent watching television each week is 21.6 hours when it is in fact less. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 14) About 4% of Americans are vegetarians. For a random sample of 400 students at University X, suppose that none are vegetarians. a. Set up null and alternative hypotheses for testing whether the proportion of vegetarians is the same or different at that university than nationwide. Check that the sample size is large enough for the test. b. Find the sample proportion, standard error, and test statistic. c. Find the P-value. Is there strong evidence that the proportion of vegetarians at the university differs from 0.04?
Copyright © 2017 Pearson Education, Inc. 3
14)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 15) The test statistic for testing H0 : μ = 100 against Ha : μ ≠ 100 was t = 3.3, with P-value 0.001. Which
15)
of the following is the appropriate conclusion? A) Since the P-value is very small, we are unable to conclude that μ = 100. B) Since the P-value is very small, we conclude that μ ≠ 100. C) Since the P-value is very small, we conclude that μ = 100. D) Since the P-value is very small, we are unable to conclude that μ ≠ 100. E) In order to interpret the results, we must first know the sample size. 16) A researcher claims that less than 20% of American adults are allergic to pollen. In a random sample of 100 adults, 15% indicate that they have such an allergy. Calculate the test statistic z for the population proportion. A) none of these B) 1.96 C) -1.25 D) 1.25 E) 1.645
16)
17) Which of the following would be an appropriate null hypothesis? A) The population mean is equal to 3.4. B) The population mean is not equal to 3.4. C) The sample mean is greater than 3.4. D) The population mean is greater than 3.4. E) The sample mean is equal to 3.4.
17)
Find the P-value for the indicated hypothesis test. 18) A medical school claims that more than 28% of its students plan to go into general practice. It is found that among a random sample of 130 of the school's students, 39% of them plan to go into general practice. Find the P-Value for testing the school's claim. A) 0.0280 B) 0.3461 C) 0.3078 D) 0.0026 E) 0.1635 Classify the conclusion of the significance test as a Type I error, a Type II error, or No error. 19) A health insurer has determined that the "reasonable and customary" fee for a certain medical procedure is $1200. They suspect that the average fee charged by one particular clinic for this procedure is higher than $1200. The insurer wants to perform a significance test to determine whether their suspicion is correct. The hypotheses are: H0 : μ = $1200 Ha: μ > $1200 Suppose that the results of the sample lead to rejection of the null hypothesis. Classify that conclusion as a Type I error, a Type II error, or a correct decision, if in fact the average fee charged by the clinic is $1200 . A) No error B) Type I error C) Type II error
Copyright © 2017 Pearson Education, Inc. 4
18)
19)
State hypotheses for the significance test. 20) Last year, the results of a survey at one college suggested that 28% of students smoked regularly. This year, after an intensive college-wide anti-smoking campaign, a researcher wishes to investigate whether the proportion of smokers has changed. Let p represent the proportion of students who smoke regularly today. State hypotheses for a significance test, letting the alternative hypothesis reflect the possibility that the proportion of students who smoke today is different from the proportion last year. A) H0 : p ≠ 28% B) H0 : p > 28% C) H0 : p = 28% D) H0 : p = 28% Ha: p = 28%
Ha: p < 28%
Ha: p ≥ 28%
Ha: p ≠ 28%
Select the most appropriate answer. 21) Which of the following would be an appropriate alternative hypothesis? A) The sample proportion is less than 0.41. B) The sample proportion is not equal to 0.41. C) The population proportion is equal to 0.41. D) The population proportion is less than 0.41. E) The sample proportion is equal to 0.41.
21)
Determine the null and alternative hypotheses. 22) A health insurer has determined that the "reasonable and customary" fee for a certain medical procedure is $1200. They suspect that the average fee charged by one particular clinic for this procedure is higher than $1200. The insurer wants to perform a hypothesis test to determine whether their suspicion is correct. A) H0 : μ > $1200 Ha: μ < $1200 B) H0 : μ = $1200 Ha: μ ≥ $1200 C) H0 : μ = $1200 Ha: μ < $1200 D) H0 : μ = $1200 Ha: μ > $1200 E) H0 : μ > $1200 Ha: μ = $1200 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 23) The mean score for all U.S. high school seniors taking the SAT college entrance exam equals 500. A study is conducted to see whether a different mean applies to Canadian seniors. For a random sample of 100 Canadian seniors, suppose the mean and standard deviation on this exam equal 508 and 100. a. b.
Set up hypotheses for a significance test, and compute the test statistic. The P-value is 0.43. Interpret it, and make a decision about H0 , using a significance level of
0.05. c. If the decision in (b) was in error, what type of error is it? d. A 95% confidence interval for μ is (488.2, 527.8). Show the correspondence between the decision in the test and whether 500 falls in this confidence interval.
Copyright © 2017 Pearson Education, Inc. 5
20)
23)
22)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Assume that a simple random sample has been selected from a normally distributed population. State the final conclusion. 24) Test the claim that for the population of female college students at a particular university, the mean weight is 24) given by μ = 132 lb. Sample data are summarized as n = 20, x = 137 lb, and s = 14.2 lb. Use a significance level of α = 0.1. H0 : μ = 132 Ha: μ ≠ 132 State your conclusion about H0 . A) t = -1.57, do not reject H0 B) t = 1.57, do not reject H0 C) z = 1.57, do not reject H0 D) t = 1.57, reject H0 E) t = 7.04, reject H0 Select the most appropriate answer. 25) A research company claims that more than 55% of Americans regularly watch public access television. You decide to test this claim and ask a random sample of 425 Americans if they watch these programs regularly. Of the 425, 255 respond yes. Calculate the test statistic z for the population proportion. A) 1.645 B) -2.07 C) -1.96 D) 2.07 E) none of these
Copyright © 2017 Pearson Education, Inc. 6
25)
Answer Key Testname: CHAPTER 9 FORM A TEST
1) E 2) D 3) D 4) B 5) E 6) A 7) A 8) E 9) A 10) C 11) Yes, because all of the values in the confidence interval differ from 28. 12) E 13) C
^
^
14) a. H0 : p = 0.04, Ha: p ≠ 0.04, both np and n(1-p ) are at least 15; b. p = 0, SE(p ) = 0.0098, z = -4.08; c. P-value < 0.0001, There 0 0 is sufficient evidence at α = 0.05 that the proportion of vegetarians at the university differs from 0.04. Note: Because the P-value is so small, the result is statistically significant at α = 0.01 as well. 15) B 16) C 17) A 18) D 19) B 20) D 21) D 22) D 23) a. H0 : μ = 500, Ha: μ ≠ 500, t = 0.8; b. the approximate probability that x takes a value of greater than or equal to 508 or less than or equal to 492 if H0 is true is 0.43, at α = 0.05, P-value > α, fail to reject H0 , there is not sufficient evidence that a different mean applies to Canadian seniors; c. Type II error; d. The 95% CI does contain the H0 value. This indicates (as the two-sided test at a 0.05 significance level indicated) that there is not sufficient evidence that a different mean applies to Canadian seniors. 24) B 25) D
Copyright © 2017 Pearson Education, Inc. 7
CHAPTER 9 FORM B TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Assume that a simple random sample has been selected from a normally distributed population. Find the test statistic t. 1) Test the claim that the mean lifetime of a particular car engine is greater than 220,000 miles. Sample data are 1) summarized as n = 23, x = 226,450 miles, and s = 11,500 miles. Use a significance level of α = 0.01. Find the test statistic t. A) -2.24 B) 2.69 C) -2.69 D) 12.9 E) 2.24 For the given significance test, determine the probability of a Type II error or the power, as specified. 2) Suppose we wish to test H0 : p = 0.5 against H1 : p < 40 using α = 0.05. If p is actually equal to 0.4, what is the probability of a type II error assuming n = 150? A) 0.21 B) 0.05 C) 0.69
D) 0.79
E) 0.31
3) Suppose we wish to test H0 : p = 0.25 against H1 : p > 0.25 using α = 0.05. If p is actually equal to 0.4, what is the power of the test assuming n = 55? A) 0.73 B) 0.93
C) 0.21
D) 0.27
2)
3)
E) 0.79
Select the most appropriate answer. 4) A small P-value indicates which of the following? I) the parameter value indicated by the null hypothesis is not plausible II) the null hypothesis should be rejected in favor of the alternative hypothesis III) the difference between the parameter value and the value specified in the null hypothesis is practically significant A) III only B) I, II, and III C) II only D) I only E) both I and II
4)
5) Which of the following would be an appropriate null hypothesis? A) The sample mean is greater than 3.4. B) The sample mean is equal to 3.4. C) The population mean is not equal to 3.4. D) The population mean is equal to 3.4. E) The population mean is greater than 3.4.
5)
6) Given Ha ≠ p . What is the P-value if the test statistics is calculated to be z = -0.12? 0 A) 0.05 B) 0.90 C) 0.10 D) 0.95
6)
Copyright © 2017 Pearson Education, Inc. 1
E) 0.12
7) A research company claims that more than 55% of Americans regularly watch public access television. You decide to test this claim and ask a random sample of 425 Americans if they watch these programs regularly. Of the 425, 255 respond yes. Calculate the test statistic z for the population proportion. A) 2.07 B) 1.645 C) -1.96 D) none of these E) -2.07
7)
8) The test statistic for testing H0 : μ = 100 against Ha : μ ≠ 100 was t = 3.3, with P-value 0.001. Which
8)
of the following is the appropriate conclusion? A) Since the P-value is very small, we are unable to conclude that μ ≠ 100. B) Since the P-value is very small, we conclude that μ ≠ 100. C) Since the P-value is very small, we conclude that μ = 100. D) In order to interpret the results, we must first know the sample size. E) Since the P-value is very small, we are unable to conclude that μ = 100. 9) Which of the following would be an appropriate alternative hypothesis? A) The sample proportion is not equal to 0.41. B) The population proportion is less than 0.41. C) The sample proportion is less than 0.41. D) The population proportion is equal to 0.41. E) The sample proportion is equal to 0.41.
9)
Assume that a simple random sample has been selected from a normally distributed population. State the final conclusion. 10) Test the claim that for the population of female college students at a particular university, the mean weight is 10) given by μ = 132 lb. Sample data are summarized as n = 20, x = 137 lb, and s = 14.2 lb. Use a significance level of α = 0.1. H0 : μ = 132 Ha: μ ≠ 132 State your conclusion about H0 . A) t = 1.57, reject H0 B) t = 1.57, do not reject H0 C) t = 7.04, reject H0 D) z = 1.57, do not reject H0 E) t = -1.57, do not reject H0 Select the most appropriate answer. 11) A researcher claims that less than 20% of American adults are allergic to pollen. In a random sample of 100 adults, 15% indicate that they have such an allergy. Calculate the test statistic z for the population proportion. A) 1.96 B) -1.25 C) 1.25 D) none of these E) 1.645
Copyright © 2017 Pearson Education, Inc. 2
11)
Find the P-value for the indicated hypothesis test. 12) A medical school claims that more than 28% of its students plan to go into general practice. It is found that among a random sample of 130 of the school's students, 39% of them plan to go into general practice. Find the P-Value for testing the school's claim. A) 0.0026 B) 0.1635 C) 0.3078 D) 0.3461 E) 0.0280 13) A manufacturer claims that fewer than 6% of its fax machines are defective. In a random sample of 100 such fax machines, 5% are defective. Find the P-value for testing the manufacturer's claim. A) 0.16 B) 0.17 C) 0.33 D) 0.34 E) 0.06 Provide an appropriate response. 14) Suppose 1000 tests are run to test a null hypothesis using α = 0.05. If the null hypothesis is true, about how many of these tests would you expect to show statistically significant results? A) 5 B) cannot be determined from the information given C) 50 D) 1000 E) 0
12)
13)
14)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 15) The mean score for all U.S. high school seniors taking the SAT college entrance exam equals 500. A study is conducted to see whether a different mean applies to Canadian seniors. For a random sample of 100 Canadian seniors, suppose the mean and standard deviation on this exam equal 508 and 100. a. b.
15)
Set up hypotheses for a significance test, and compute the test statistic. The P-value is 0.43. Interpret it, and make a decision about H0 , using a significance level of
0.05. c. If the decision in (b) was in error, what type of error is it? d. A 95% confidence interval for μ is (488.2, 527.8). Show the correspondence between the decision in the test and whether 500 falls in this confidence interval. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 16) A study uses a random sample of size 9. The test statistic for testing H0 : μ = 12 versus Ha : μ > 12 is t = 1.8. Find the approximate P-value. A) Cannot be determined without the sample standard deviation. B) 0.95 C) 0.025 D) 0.05 E) 0.10 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 17) A sample was used to obtain a 95% confidence interval for the population mean age of graduate students at a large university. The 95% confidence interval for μ was (23, 27). If the same sample had been used to test the null hypothesis that the population mean age is equal to 28 versus the alternative hypothesis that the population mean age differs from 28, would the null hypothesis be rejected at α = 0.05? Explain.
Copyright © 2017 Pearson Education, Inc. 3
17)
16)
18) About 4% of Americans are vegetarians. For a random sample of 400 students at University X, suppose that none are vegetarians.
18)
a. Set up null and alternative hypotheses for testing whether the proportion of vegetarians is the same or different at that university than nationwide. Check that the sample size is large enough for the test. b. Find the sample proportion, standard error, and test statistic. c. Find the P-value. Is there strong evidence that the proportion of vegetarians at the university differs from 0.04? MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Classify the conclusion of the significance test as a Type I error, a Type II error, or No error. 19) A health insurer has determined that the "reasonable and customary" fee for a certain medical procedure is $1200. They suspect that the average fee charged by one particular clinic for this procedure is higher than $1200. The insurer wants to perform a significance test to determine whether their suspicion is correct. The hypotheses are: H0 : μ = $1200
19)
Ha: μ > $1200 Suppose that the results of the sample lead to rejection of the null hypothesis. Classify that conclusion as a Type I error, a Type II error, or a correct decision, if in fact the average fee charged by the clinic is $1200 . A) No error B) Type II error C) Type I error For the given significance test, explain the meaning of a Type I error, a Type II error, or a correct decision as specified. 20) At one school, the average amount of time tenth-graders spend watching television each week is 21.6 hours. 20) The principal introduces a campaign to encourage the students to watch less television. One year later, the principal performs a significance test using α = 0.05 to determine whether the average amount of time spent watching television per week has decreased. The hypotheses are: H0 : μ = 21.6 hours Ha: μ < 21.6 hours If the P-value = 0.04 and a decision error is made, what type of error is it? Explain. A) Type I error. We conclude that the average amount of time spent watching television each week is less than 21.6 hours when it in fact is not. B) Type I error. We conclude that the average amount of time spent watching television each week is 21.6 hours when it is in fact less. C) Type II error. We conclude that the average amount of time spent watching television each week is 21.6 hours when it is in fact less. D) Type II error. We conclude that the average amount of time spent watching television each week is less than 21.6 hours when it in fact is not.
Copyright © 2017 Pearson Education, Inc. 4
21) A health insurer has determined that the "reasonable and customary" fee for a certain medical procedure is $1200. They suspect that the average fee charged by one particular clinic for this procedure is higher than $1200. The insurer performs a significance test to determine whether their suspicion is correct using α = 0.05. The hypotheses are: H0 : μ = $1200
21)
Ha: μ > $1200 If the P-value is 0.09 and a decision error is made, what type of error is it? Explain. A) Type II error. We conclude that the average fee charged for the procedure is higher than $1200 when it actually is not higher. B) Type I error. We conclude that the average fee charged for the procedure is not higher than $1200 when it actually is higher. C) Type I error. We conclude that the average fee charged for the procedure is higher than $1200 when it actually is not higher. D) Type II error. We conclude that the average fee charged for the procedure is not higher than $1200 when it actually is higher. Determine the null and alternative hypotheses. 22) In the past, the mean running time for a certain type of radio battery has been 9.6 hours. The manufacturer has introduced a change in the production method and wants to perform a hypothesis test to determine whether the mean running time has changed as a result. A) H0 : μ = 9.6 hours
22)
Ha: μ ≠ 9.6 hours B) H0 : μ = 9.6 hours Ha: μ > 9.6 hours C) H0 : μ ≠ 9.6 hours Ha: μ = 9.6 hours D) H0 : μ ≥ 9.6 hours Ha: μ = 9.6 hours E) H0 : μ > 9.6hours Ha: μ > 9.6 hours 23) A health insurer has determined that the "reasonable and customary" fee for a certain medical procedure is $1200. They suspect that the average fee charged by one particular clinic for this procedure is higher than $1200. The insurer wants to perform a hypothesis test to determine whether their suspicion is correct. A) H0 : μ > $1200 Ha: μ = $1200 B) H0 : μ = $1200 Ha: μ > $1200 C) H0 : μ > $1200 Ha: μ < $1200 D) H0 : μ = $1200 Ha: μ ≥ $1200 E) H0 : μ = $1200 Ha: μ < $1200
Copyright © 2017 Pearson Education, Inc. 5
23)
For the given sample data and null hypothesis, compute the value of the test statistic, z 24) 410 people were asked if they were satisfied with their jobs. 37% of the responses were affirmative. H0 : p = 0.30 A) 3.09 B) 4.12 C) 2.61 D) 0.15 E) 0.04 State hypotheses for the significance test. 25) Last year, the results of a survey at one college suggested that 28% of students smoked regularly. This year, after an intensive college-wide anti-smoking campaign, a researcher wishes to investigate whether the proportion of smokers has changed. Let p represent the proportion of students who smoke regularly today. State hypotheses for a significance test, letting the alternative hypothesis reflect the possibility that the proportion of students who smoke today is different from the proportion last year. A) H0 : p = 28% B) H0 : p = 28% C) H0 : p ≠ 28% D) H0 : p > 28% Ha: p ≥ 28%
Ha: p ≠ 28%
Ha: p = 28%
Copyright © 2017 Pearson Education, Inc. 6
Ha: p < 28%
24)
25)
Answer Key Testname: CHAPTER 9 FORM B TEST
1) B 2) A 3) E 4) D 5) D 6) B 7) A 8) B 9) B 10) B 11) B 12) A 13) D 14) C 15) a. H0 : μ = 500, Ha: μ ≠ 500, t = 0.8; b. the approximate probability that x takes a value of greater than or equal to 508 or less than or equal to 492 if H0 is true is 0.43, at α = 0.05, P-value > α, fail to reject H0 , there is not sufficient evidence that a different mean applies to Canadian seniors; c. Type II error; d. The 95% CI does contain the H0 value. This indicates (as the two-sided test at a 0.05 significance level indicated) that there is not sufficient evidence that a different mean applies to Canadian seniors. 16) D 17) Yes, because all of the values in the confidence interval differ from 28. ^
^
18) a. H0 : p = 0.04, Ha: p ≠ 0.04, both np and n(1-p ) are at least 15; b. p = 0, SE(p ) = 0.0098, z = -4.08; c. P-value < 0.0001, There 0 0 is sufficient evidence at α = 0.05 that the proportion of vegetarians at the university differs from 0.04. Note: Because the P-value is so small, the result is statistically significant at α = 0.01 as well. 19) C 20) A 21) D 22) A 23) B 24) A 25) B
Copyright © 2017 Pearson Education, Inc. 7
CHAPTER 10 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Use the paired t-interval procedure to obtain the required confidence interval for the mean difference. Assume that the conditions and assumptions for inference are satisfied. 1) Ten families are randomly selected and their daily water usage (in gallons) before and after viewing a 1) conservation video. Construct a 90% confidence interval for the mean of the difference of the "before" minus the "after" times if d(after-before) = -4.8 and sd = 5.2451 Before 33 33 38 33 35 35 40 40 40 31 After 34 28 25 28 35 33 31 28 35 33 A) (1.8,7.8) B) (2.1,7.5) C) (2.5,7.1)
D) (1.5,8.1)
E) (3.8,5.8)
Find the appropriate test statistic/p-value. 2) Do motivation levels between mid-level and upper-level managers differ? A randomly selected group of each were administered a survey, which measures motivation for upward mobility. The scores are summarized below:
Sample Size Mean Score Standard Deviation
upper-level 173 77.4 11.06
2)
mid-level 109 79.71 6.43
Given that the P-value is 0.03, which of the following is the appropriate conclusion? A) At α = .02, reject H0 . B) At α = .01, reject H0 . C) At α = .02, accept H0 . D) At α = .025, fail to reject H0 . E) At α = .025, reject H0 . Provide an appropriate response. 3) An agricultural company would like to predict cotton yield per acre in a certain area using rainfall (in inches). Identify the response variable. A) variety of cotton grown B) the agricultural company C) rainfall in inches D) farm size E) yield per acre
Copyright © 2017 Pearson Education, Inc. 1
3)
Interpret the given confidence interval. 4) A survey was conducted to determine the difference in gasoline mileage for two truck models. A random sample was taken for each model of truck, and the mean gasoline mileage, in miles per gallon, was calculated. A 98% confidence interval for the difference in the mean mileage for model A trucks minus the mean mileage for model B trucks, μA - μB, was determined to be (2.5, 4.7).
4)
A) 98% of model A trucks have a gas mileage that is between 2.5 and 4.7 miles per gallon higher than model B trucks. B) The probability that a randomly selected Model A truck will have a gas mileage that is lower than a randomly selected model B truck is 0.98. C) Based on this sample, we are 98% confident that the average mileage for model B trucks is between 2.5 and 4.7 miles per gallon higher than the average mileage for model A trucks. D) The probability that a randomly selected Model A truck will have a gas mileage that is higher than a randomly selected model B truck is 0.98. E) Based on this sample, we are 98% confident that the average mileage for model A trucks is between 2.5 and 4.7 miles per gallon higher than the average mileage for model B trucks. ^
^
From the sample statistics, find the value of p1 - p2 , the point estimate of the difference of proportions. Unless otherwise indicated, round to the nearest thousandth when necessary. 5) n 1 = 1927 n 2 = 541 x1 = 689
5)
x2 = 160
A) 0.382 B) 0.214 C) none of these D) 0.062 E) -0.62 Provide an appropriate response. 6) Do motivation levels between mid-level and upper-level managers differ? A randomly selected group of each were administered a survey, which measures motivation for upward mobility. The scores are summarized below:
Sample Size Mean Score Standard Deviation
upper-level 73 77.4 10.06
6)
mid-level 109 79.71 6.43
Assuming equal population standard deviations, calculate the test statistic for determining whether the mean scores differ for upper-level and mid-level managers. A) -1.89 B) -63.69 C) none of these D) -1.74 E) -0.29 Answer true or false. 7) The F test for comparing standard deviations is robust. A) False B) True
Copyright © 2017 Pearson Education, Inc. 2
7)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 8) When testing for a difference between the means of a treatment group and a placebo group, the computer display below is obtained. Variable: TREATMENT N Mean Std Dev
Std Error
treatment placebo
0.403 0.453
50 65.107 50 66.183
2.847 3.205
8)
Use technology to find a 95% confidence interval for the difference of the mean of the treatment and the mean of the placebo group.
9) Using the formula se =
s1 2 s2 2 + , show that the standard error of (x 1 - x 2 ) is greater than both n1 n2
9)
se(x 1 ) and se(x 2 ). MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 10) The recent admissions for the humanities and sciences at a certain college are given in the table below. What percentage of men who applied within the two areas was accepted? What percentage of women who applied within the two areas was accepted?
Humanities Accepted Rejected Sciences Accepted Rejected Total
Men
Women
190 140
503 422
160 210 700
27 48 1000
A) 21% of men and 31% of women who applied within the two areas were accepted. B) 23% of men and 2.7% of women who applied within the two areas were accepted. C) 50% of men and 53% of women who applied within the two areas were accepted. D) 27% of men and 50% of women who applied within the two areas were accepted. E) 51% of men and 45% of women who applied within the two areas were accepted.
Copyright © 2017 Pearson Education, Inc. 3
10)
Interpret the given confidence interval. 11) A researcher was interested in comparing the salaries of female and male employees of a particular company. Independent random samples of female employees (sample 1) and male employees (sample 2) were taken to calculate the mean salary, in dollars per week, for each group. A 90% confidence interval for the difference, μ1 - μ2 , between the mean weekly salary of all female
11)
employees and the mean weekly salary of all male employees was determined to be (-$110, $10). A) 90% of the time females at this company make less than males. B) Based on these data, we are 90% confident that female employees at this company average between $110 less and $10 more per week than the male employees. C) The probability that a randomly selected female employee at this company makes between $110 less and $10 more per week than a randomly selected male employee is 0.9. D) Since 0 is contained in the interval, the probability that male employees at this company earn the same as females at this company is 0.9. E) Based on these data, we are 90% confident that male employees at this company average between $110 less and $10 more per week than the female employees. ^
^
From the sample statistics, find the value of p1 - p2 , the point estimate of the difference of proportions. Unless otherwise indicated, round to the nearest thousandth when necessary. 12) A survey asked respondents whether marijuana should be made legal. A 95% confidence interval for PA - PB
12)
is given by (0.08, 0.14) where PA is the proportion of respondents who answered "legal" in state A and PB is the proportion of respondents who responded "legal" in state B. Based on the 95% confidence interval, what can we conclude about the percentage of respondents who favor legalization in state B versus state A? A) Since all of the values in the confidence interval are greater than 0, we can conclude that the percentage in favor of legalization was greater in state A than it was in state B. B) Since all of the values in the confidence interval are greater than 0, we can conclude that the percentage in favor of legalization was greater in state B than it was in state A. C) Since all of the values in the confidence interval are less than 1, we can conclude that there is a significant difference between the percentage in favor of legalization in state B and the percentage in favor of legalization in state A. D) Since all of the values in the confidence interval are less than 1, we are unable to conclude that there is a significant difference between the percentage in favor of legalization in state B and the percentage in favor of legalization in state A. 13) n 1 = 100
n 2 = 100
13)
x1 = 34
x2 = 30 A) 0.02 B) -0.04 C) none of these D) 0.04 E) -0.02
Answer true or false. 14) The width of the confidence interval for (μ1 - μ2 ) increases as s 1 and s 2 increase. A) True
B) False
Copyright © 2017 Pearson Education, Inc. 4
14)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 15) Three editors of large-city newspapers are selected at random and asked to rate on a score of 0 to 100 the fairness of the news media in political reporting (0 corresponds to very unfair). The same question is posed to three editors of small-town papers. The large-city editors gave scores of 55, 90, and 95; while, the small-town editors gave scores of 40, 60, and 80. The table shows a computer printout for results of a two-sample comparison of means. Variable: FAIRNESS PAPER N Mean
Std Dev
Std Error
large small
21.794 20.000
12.583 11.547
3 3
80.00 60.00
15)
Use technology to find a 95% confidence interval for the difference of the means. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 16) For 12 pairs of females, the reported means are 24.8 on the well-being measure for the children of alcoholics and 29.0 for the control group. A t test statistic of 2.67 for the test comparing the means was obtained. Assuming that this is the result of a dependent-samples analysis testing for a difference between the group means, report the P-value. A) 0.0076 B) 0.0152 C) 0.01 < P-value < 0.02 D) 0.02 < P-value < 0.05 E) 0.005 < P-value < 0.01 Provide an appropriate response. 17) The data given were collected to determine whether the life span of randomly selected mammals, in years, can be predicted from their corresponding gestation periods. Gestation 8 2.1 1.3 1 11.5 5.3 3.8 24.3 Life span 30 13 8 4 28 11 12 42 Identify the explanatory variable A) climate B) life span C) months D) gestation E) mammal breed
16)
17)
^
From the sample statistics, find the value of the pooled estimate p used. 18) n 1 = 100 n 2 = 100 ^
p1 = 0.1
^
p2 = 0.12
A) none of these B) 0.33 C) 0.11 D) 0.22 E) 0.0022
Copyright © 2017 Pearson Education, Inc. 5
18)
Provide an appropriate response. 19) Do motivation levels between mid-level and upper-level managers differ? A randomly selected group of each were administered a survey, which measures motivation for upward mobility. The scores are summarized below: upper-level Sample Size 73 Mean Score 77.4 Standard Deviation 10.06
19)
mid-level 109 79.71 6.43
Assuming equal population standard deviations, find the P-value for testing that the mean scores differ for upper-level and mid-level managers. Interpret using a 5% significance level. A) P-value = 0.03; since the P-value < 0.05, we reject the null hypothesis. B) P-value = 0.04; since the P-value < 0.05, we reject the null hypothesis. C) P-value = 0.06; since the P-value > 0.05, we fail to reject the null hypothesis. D) P-value = 0.08; since the P-value > 0.05, we fail to reject the null hypothesis. ^
^
From the sample statistics, find the value of p1 - p2 , the point estimate of the difference of proportions. Unless otherwise indicated, round to the nearest thousandth when necessary. 20) A study was conducted to compare the effectiveness of two weight loss strategies for obese participants. The proportion of obese clients who lost at least 10% of their body weight was compared for the two strategies. The resulting 98% confidence interval for p 1 - p 2 is (-0.13, 0.09). Give an interpretation of this confidence interval. A) There is a 98% probability that the proportion of obese clients losing weight under strategy 1 is between 13% less and 9% more than the proportion of obese clients losing weight under strategy 2. B) There is a 98% probability that the proportion of obese clients losing weight under strategy 2 is between 13% less and 9% more than the proportion of obese clients losing weight under strategy 1. C) If samples were repeatedly drawn from the same populations under the same circumstances, the true population difference (p 1 - p 2 ) would be between -0.13 and 0.09 98% of the time. D) We are 98% confident that the proportion of obese clients losing weight under strategy 1 is between 13% less and 9% more than the proportion of obese clients losing weight under strategy 2. E) We are 98% confident that the proportion of obese clients losing weight under strategy 2 is between 13% less and 9% more than the proportion of obese clients losing weight under strategy 1.
Copyright © 2017 Pearson Education, Inc. 6
20)
Provide an appropriate response. 21) The recent admissions for the humanities and sciences at a certain college are given in the table below. What percentage of men applying to the humanities was accepted? What percentage of women applying to the humanities was accepted?
Humanities Accepted Rejected Sciences Accepted Rejected Total
Men
Women
190 140
503 422
160 210 700
27 48 1000
21)
A) In the humanities, 11% of males were accepted whereas 30% of females were accepted. B) In the humanities, 27% of males were accepted whereas 50% of females were accepted. C) In the humanities, 50% of males were accepted and 53% of females were accepted. D) In the humanities, 27% of males were accepted whereas 73% of females were accepted. E) In the humanities, 58% of males were accepted whereas only 54% of females were accepted. 22) A 95% confidence interval for the difference in means for a collection of paired sample data is (0, 3.4). Based on the same sample, a traditional significance test fails to support the claim of μd > 0. What can you conclude about the significance level αÁ (αÁ = 1 - .95) of the hypothesis test? A) α > 0.05 B) α < 0.05 C) α = 0.95 D) α = 0.05
22)
E) α = 0.01
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 23) Denote the observation for subject i by x at time 1 and by y at time 2. Let d = y - x . Letting x, i i i i i
23)
y, and x d denote the means, show that x d = y - x. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 24) At the end of a highschool football season two teams are compared to determine which team had a higher percentage of successful scoring attempts after a touchdown. The data is summarized in the following table: Type of Attempt Field Goal 2-Point Conversion Overall
Team A 89% (25/29) 40% (2/5) 79% (27/34)
Team B 86% (31/36) 0% (0/2) 82% (31/38)
Identify the control variable. A) Type of attempt B) 2-Point conversion C) Team A D) Team B E) Field Goal
Copyright © 2017 Pearson Education, Inc. 7
24)
25) At the end of a highschool football season two teams are compared to determine which team had a higher percentage of successful scoring attempts after a touchdown. The data is summarized in the following table: Type of Attempt Field Goal 2-Point Conversion Overall
Team A 89% (25/29) 40% (2/5) 79% (27/34)
Team B 86% (31/36) 0% (0/2) 82% (31/38)
Identify the explanatory variable. A) Whether or not the team scored B) Which Team C) None of these D) Type of attempt E) Number of attempts
Copyright © 2017 Pearson Education, Inc. 8
25)
Answer Key Testname: CHAPTER 10 FORM A TEST
1) A 2) D 3) E 4) E 5) D 6) A 7) A 8) (-2.279, 0.1273) 9) Since se(x 1 - x 2 ) =
[se(x 1 )]2 + [se(x 2 )]2 , se(x 1 - x 2 ) > se(x 1 ). To see this, square each side of the inequality
[se(x 1 )]2 + [se(x 2 )]2 > se(x 1 ) to get [se(x 1 )]2 + [se(x 2 )]2 > [se(x 1 )]2 . Subtract [se(x 1 )]2 from each side of the inequality [se(x 1 )]2 + [se(x 2 )]2 > [se(x 1 )]2 to get [se(x 2 )]2 > 0. Take the square root of each side of the inequality [se(x 2 )]2 > 0 to get se(x 2 ) > 0. Since x 2 is a random variable which varies from sample to sample or since x 2 is not a constant, se(x 2 ) must be greater than 0. Therefore, se(x 1 - x 2 ) > se(x 1 ). Likewise, it can be shown that se(x 1 - x 2 ) > se(x 2 ). se(x 2 ) < se(x 1 - x 2 ). 10) C 11) B 12) A 13) D 14) A 15) (-27.55, 67.55) 16) D 17) D 18) C 19) C 20) D 21) E 22) B 23) x d = (∑d )/n = [∑(y - x ) ]/n = ( ∑y - ∑x )/n = ( ∑y )/n - ( ∑x )/n = y - x. i i i i i i i 24) A 25) B
Copyright © 2017 Pearson Education, Inc. 9
CHAPTER 10 FORM B TEST Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) Using the formula se =
s1 2 s2 2 + , show that the standard error of (x 1 - x 2 ) is greater than both n1 n2
1)
se(x 1 ) and se(x 2 ). MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 2) A 95% confidence interval for the difference in means for a collection of paired sample data is (0, 3.4). Based on the same sample, a traditional significance test fails to support the claim of μd > 0. What can you conclude about the significance level αÁ (αÁ = 1 - .95) of the hypothesis test? A) α = 0.05 B) α > 0.05 C) α < 0.05 D) α = 0.01
E) α = 0.95
Interpret the given confidence interval. 3) A researcher was interested in comparing the salaries of female and male employees of a particular company. Independent random samples of female employees (sample 1) and male employees (sample 2) were taken to calculate the mean salary, in dollars per week, for each group. A 90% confidence interval for the difference, μ1 - μ2 , between the mean weekly salary of all female employees and the mean weekly salary of all male employees was determined to be (-$110, $10). A) 90% of the time females at this company make less than males. B) Since 0 is contained in the interval, the probability that male employees at this company earn the same as females at this company is 0.9. C) The probability that a randomly selected female employee at this company makes between $110 less and $10 more per week than a randomly selected male employee is 0.9. D) Based on these data, we are 90% confident that male employees at this company average between $110 less and $10 more per week than the female employees. E) Based on these data, we are 90% confident that female employees at this company average between $110 less and $10 more per week than the male employees.
Copyright © 2017 Pearson Education, Inc. 1
2)
3)
Provide an appropriate response. 4) The recent admissions for the humanities and sciences at a certain college are given in the table below. What percentage of men who applied within the two areas was accepted? What percentage of women who applied within the two areas was accepted?
Humanities Accepted Rejected Sciences Accepted Rejected Total
Men
Women
190 140
503 422
160 210 700
27 48 1000
4)
A) 50% of men and 53% of women who applied within the two areas were accepted. B) 27% of men and 50% of women who applied within the two areas were accepted. C) 51% of men and 45% of women who applied within the two areas were accepted. D) 23% of men and 2.7% of women who applied within the two areas were accepted. E) 21% of men and 31% of women who applied within the two areas were accepted. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 5) Denote the observation for subject i by x at time 1 and by y at time 2. Let d = y - x . Letting x, i i i i i
5)
y, and x d denote the means, show that x d = y - x. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 6) At the end of a highschool football season two teams are compared to determine which team had a higher percentage of successful scoring attempts after a touchdown. The data is summarized in the following table: Type of Attempt Field Goal 2-Point Conversion Overall
Team A 89% (25/29) 40% (2/5) 79% (27/34)
Team B 86% (31/36) 0% (0/2) 82% (31/38)
Identify the control variable. A) Team A B) 2-Point conversion C) Field Goal D) Type of attempt E) Team B
Copyright © 2017 Pearson Education, Inc. 2
6)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 7) Three editors of large-city newspapers are selected at random and asked to rate on a score of 0 to 100 the fairness of the news media in political reporting (0 corresponds to very unfair). The same question is posed to three editors of small-town papers. The large-city editors gave scores of 55, 90, and 95; while, the small-town editors gave scores of 40, 60, and 80. The table shows a computer printout for results of a two-sample comparison of means. Variable: FAIRNESS PAPER N Mean
Std Dev
Std Error
large small
21.794 20.000
12.583 11.547
3 3
80.00 60.00
7)
Use technology to find a 95% confidence interval for the difference of the means. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 8) The recent admissions for the humanities and sciences at a certain college are given in the table below. What percentage of men applying to the humanities was accepted? What percentage of women applying to the humanities was accepted?
Humanities Accepted Rejected Sciences Accepted Rejected Total
Men
Women
190 140
503 422
160 210 700
27 48 1000
A) In the humanities, 27% of males were accepted whereas 50% of females were accepted. B) In the humanities, 58% of males were accepted whereas only 54% of females were accepted. C) In the humanities, 27% of males were accepted whereas 73% of females were accepted. D) In the humanities, 11% of males were accepted whereas 30% of females were accepted. E) In the humanities, 50% of males were accepted and 53% of females were accepted. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 9) When testing for a difference between the means of a treatment group and a placebo group, the computer display below is obtained. Variable: TREATMENT N Mean Std Dev
Std Error
treatment placebo
0.403 0.453
50 65.107 50 66.183
2.847 3.205
Use technology to find a 95% confidence interval for the difference of the mean of the treatment and the mean of the placebo group.
Copyright © 2017 Pearson Education, Inc. 3
9)
8)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 10) Do motivation levels between mid-level and upper-level managers differ? A randomly selected group of each were administered a survey, which measures motivation for upward mobility. The scores are summarized below:
Sample Size Mean Score Standard Deviation
upper-level 73 77.4 10.06
10)
mid-level 109 79.71 6.43
Assuming equal population standard deviations, calculate the test statistic for determining whether the mean scores differ for upper-level and mid-level managers. A) -0.29 B) -1.89 C) none of these D) -63.69 E) -1.74 11) An agricultural company would like to predict cotton yield per acre in a certain area using rainfall (in inches). Identify the response variable. A) rainfall in inches B) yield per acre C) the agricultural company D) farm size E) variety of cotton grown
11)
12) The data given were collected to determine whether the life span of randomly selected mammals, in years, can be predicted from their corresponding gestation periods. Gestation 8 2.1 1.3 1 11.5 5.3 3.8 24.3 Life span 30 13 8 4 28 11 12 42 Identify the explanatory variable A) life span B) gestation C) mammal breed D) climate E) months
12)
Copyright © 2017 Pearson Education, Inc. 4
13) At the end of a highschool football season two teams are compared to determine which team had a higher percentage of successful scoring attempts after a touchdown. The data is summarized in the following table: Type of Attempt Field Goal 2-Point Conversion Overall
Team A 89% (25/29) 40% (2/5) 79% (27/34)
13)
Team B 86% (31/36) 0% (0/2) 82% (31/38)
Identify the explanatory variable. A) Which Team B) None of these C) Type of attempt D) Number of attempts E) Whether or not the team scored 14) Do motivation levels between mid-level and upper-level managers differ? A randomly selected group of each were administered a survey, which measures motivation for upward mobility. The scores are summarized below:
Sample Size Mean Score Standard Deviation
upper-level 73 77.4 10.06
14)
mid-level 109 79.71 6.43
Assuming equal population standard deviations, find the P-value for testing that the mean scores differ for upper-level and mid-level managers. Interpret using a 5% significance level. A) P-value = 0.04; since the P-value < 0.05, we reject the null hypothesis. B) P-value = 0.08; since the P-value > 0.05, we fail to reject the null hypothesis. C) P-value = 0.03; since the P-value < 0.05, we reject the null hypothesis. D) P-value = 0.06; since the P-value > 0.05, we fail to reject the null hypothesis. Find the appropriate test statistic/p-value. 15) Do motivation levels between mid-level and upper-level managers differ? A randomly selected group of each were administered a survey, which measures motivation for upward mobility. The scores are summarized below:
Sample Size Mean Score Standard Deviation
upper-level 173 77.4 11.06
mid-level 109 79.71 6.43
Given that the P-value is 0.03, which of the following is the appropriate conclusion? A) At α = .02, accept H0 . B) At α = .02, reject H0 . C) At α = .025, fail to reject H0 . D) At α = .01, reject H0 . E) At α = .025, reject H0 .
Copyright © 2017 Pearson Education, Inc. 5
15)
^
^
From the sample statistics, find the value of p1 - p2 , the point estimate of the difference of proportions. Unless otherwise indicated, round to the nearest thousandth when necessary. 16) n 1 = 100 n 2 = 100 x1 = 34
16)
x2 = 30
A) -0.04 B) 0.04 C) -0.02 D) 0.02 E) none of these 17) n 1 = 1927 x1 = 689
n 2 = 541 x2 = 160
17)
A) 0.214 B) -0.62 C) 0.062 D) 0.382 E) none of these 18) A study was conducted to compare the effectiveness of two weight loss strategies for obese participants. The proportion of obese clients who lost at least 10% of their body weight was compared for the two strategies. The resulting 98% confidence interval for p 1 - p 2 is (-0.13, 0.09). Give an interpretation of this confidence
18)
interval. A) If samples were repeatedly drawn from the same populations under the same circumstances, the true population difference (p 1 - p 2 ) would be between -0.13 and 0.09 98% of the time. B) There is a 98% probability that the proportion of obese clients losing weight under strategy 2 is between 13% less and 9% more than the proportion of obese clients losing weight under strategy 1. C) We are 98% confident that the proportion of obese clients losing weight under strategy 1 is between 13% less and 9% more than the proportion of obese clients losing weight under strategy 2. D) We are 98% confident that the proportion of obese clients losing weight under strategy 2 is between 13% less and 9% more than the proportion of obese clients losing weight under strategy 1. E) There is a 98% probability that the proportion of obese clients losing weight under strategy 1 is between 13% less and 9% more than the proportion of obese clients losing weight under strategy 2. 19) A survey asked respondents whether marijuana should be made legal. A 95% confidence interval for PA - PB is given by (0.08, 0.14) where PA is the proportion of respondents who answered "legal" in state A and PB is the proportion of respondents who responded "legal" in state B. Based on the 95% confidence interval, what can we conclude about the percentage of respondents who favor legalization in state B versus state A? A) Since all of the values in the confidence interval are less than 1, we are unable to conclude that there is a significant difference between the percentage in favor of legalization in state B and the percentage in favor of legalization in state A. B) Since all of the values in the confidence interval are greater than 0, we can conclude that the percentage in favor of legalization was greater in state B than it was in state A. C) Since all of the values in the confidence interval are less than 1, we can conclude that there is a significant difference between the percentage in favor of legalization in state B and the percentage in favor of legalization in state A. D) Since all of the values in the confidence interval are greater than 0, we can conclude that the percentage in favor of legalization was greater in state A than it was in state B.
Copyright © 2017 Pearson Education, Inc. 6
19)
Interpret the given confidence interval. 20) A survey was conducted to determine the difference in gasoline mileage for two truck models. A random sample was taken for each model of truck, and the mean gasoline mileage, in miles per gallon, was calculated. A 98% confidence interval for the difference in the mean mileage for model A trucks minus the mean mileage for model B trucks, μA - μB, was determined to be (2.5, 4.7).
20)
A) 98% of model A trucks have a gas mileage that is between 2.5 and 4.7 miles per gallon higher than model B trucks. B) The probability that a randomly selected Model A truck will have a gas mileage that is higher than a randomly selected model B truck is 0.98. C) Based on this sample, we are 98% confident that the average mileage for model B trucks is between 2.5 and 4.7 miles per gallon higher than the average mileage for model A trucks. D) The probability that a randomly selected Model A truck will have a gas mileage that is lower than a randomly selected model B truck is 0.98. E) Based on this sample, we are 98% confident that the average mileage for model A trucks is between 2.5 and 4.7 miles per gallon higher than the average mileage for model B trucks. Answer true or false. 21) The F test for comparing standard deviations is robust. A) False B) True
21)
22) The width of the confidence interval for (μ1 - μ2 ) increases as s 1 and s 2 increase. A) False
22)
B) True ^
From the sample statistics, find the value of the pooled estimate p used. 23) n 1 = 100 n 2 = 100 ^
p1 = 0.1
23)
^
p2 = 0.12
A) 0.33 B) 0.0022 C) 0.22 D) none of these E) 0.11 Use the paired t-interval procedure to obtain the required confidence interval for the mean difference. Assume that the conditions and assumptions for inference are satisfied. 24) Ten families are randomly selected and their daily water usage (in gallons) before and after viewing a 24) conservation video. Construct a 90% confidence interval for the mean of the difference of the "before" minus the "after" times if d(after-before) = -4.8 and sd = 5.2451 Before 33 33 38 33 35 35 40 40 40 31 After 34 28 25 28 35 33 31 28 35 33 A) (2.1,7.5) B) (3.8,5.8) C) (2.5,7.1)
D) (1.8,7.8)
Copyright © 2017 Pearson Education, Inc. 7
E) (1.5,8.1)
Select the most appropriate answer. 25) For 12 pairs of females, the reported means are 24.8 on the well-being measure for the children of alcoholics and 29.0 for the control group. A t test statistic of 2.67 for the test comparing the means was obtained. Assuming that this is the result of a dependent-samples analysis testing for a difference between the group means, report the P-value. A) 0.02 < P-value < 0.05 B) 0.005 < P-value < 0.01 C) 0.01 < P-value < 0.02 D) 0.0076 E) 0.0152
Copyright © 2017 Pearson Education, Inc. 8
25)
Answer Key Testname: CHAPTER 10 FORM B TEST
1) Since se(x 1 - x 2 ) =
[se(x 1 )]2 + [se(x 2 )]2 , se(x 1 - x 2 ) > se(x 1 ). To see this, square each side of the inequality
[se(x 1 )]2 + [se(x 2 )]2 > se(x 1 ) to get [se(x 1 )]2 + [se(x 2 )]2 > [se(x 1 )]2 . Subtract [se(x 1 )]2 from each side of the inequality [se(x 1 )]2 + [se(x 2 )]2 > [se(x 1 )]2 to get [se(x 2 )]2 > 0. Take the square root of each side of the inequality [se(x 2 )]2 > 0 to get se(x 2 ) > 0. Since x 2 is a random variable which varies from sample to sample or since x 2 is not a constant, se(x 2 ) must be greater than 0. Therefore, se(x 1 - x 2 ) > se(x 1 ). Likewise, it can be shown that se(x 1 - x 2 ) > se(x 2 ). se(x 2 ) < se(x 1 - x 2 ). 2) C 3) E 4) A 5) x d = (∑d )/n = [∑(y - x ) ]/n = ( ∑y - ∑x )/n = ( ∑y )/n - ( ∑x )/n = y - x. i i i i i i i 6) D 7) (-27.55, 67.55) 8) B 9) (-2.279, 0.1273) 10) B 11) B 12) B 13) A 14) D 15) C 16) B 17) C 18) C 19) D 20) E 21) A 22) B 23) E 24) D 25) A
Copyright © 2017 Pearson Education, Inc. 9
CHAPTER 11 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. State the null and alternative hypothesis. 1) A sample survey obtains information on characteristics of book readers. A book reader is defined to be anyone who has read one or more books in the six months prior to the survey; a non-book reader is defined to be anyone who read newspapers or magazines but no books in the six months prior to the survey; a nonreader is defined to be anyone who did not read a book, newspaper, or magazine in the six months prior to the survey. The following data were obtained from a random sample of 1429 persons 16 years old and over in an effort to determine whether or not the proportions of book readers, non-book readers, and non-readers are the same for each income bracket. State the hypotheses for a chi-squared test of independence between the two variables, classification and household income.
A) H0 : The classifications have the same distribution for each household income bracket. Ha: The classifications do not have the same distribution for each household income bracket. B) H0 : Household income and book readership are independent. Ha: Household income and book readership are not dependent. C) H0 : There is no relationship between household income and book readership. Ha: Household income and book readership are not related. D) H0 : There is a relationship between household income and book readership. Ha: Household income and book readership are not related.
Copyright © 2017 Pearson Education, Inc. 1
1)
Provide an appropriate response. 2)
2)
The contingency table above shows the blood types of a sample of patients cross classified by sex. How many people in the sample have blood type B? A) 29 B) 58 C) 16 D) 13 E) 112 3) At a high school debate tournament, half of the teams were asked to wear suits and ties and the rest were asked to wear jeans and t-shirts. The wins and losses according to attire are given in the table below. Using α = 0.05, test the hypothesis that the proportion of wins is the same for teams wearing suits as for teams wearing jeans. Win Loss Suit 22 28 T-shirt 28 22 H0 : The proportion of wins is the same for teams wearing suits as for teams wearing jeans. Ha: The proportions are different. Test statistic: χ2 = 1.440. Critical value: χ2 = 3.841. A) Since the test statistic is less than the critical value, we reject H0 and conclude that the proportion of wins differs for the teams wearing suits and those wearing jeans. B) Since the test statistic is less than the critical value, we fail to reject H0 . There is not enough evidence to conclude that the proportion of wins differs for the teams wearing suits versus those wearing jeans. C) Since the test statistic > 1, we reject H0 and conclude that the proportion of wins differs for the teams wearing suits and those wearing jeans. D) Since the critical value > 1, we reject H0 and conclude that the proportion of wins differs for the teams wearing suits and those wearing jeans. E) Since the test statistic is less than the critical value, we accept H0 and conclude that the proportion of wins is the same for the teams wearing suits and for those wearing jeans.
Copyright © 2017 Pearson Education, Inc. 2
3)
State the null and alternative hypothesis. 4) A researcher performed a study to determine whether an association exists between sex and blood type. He obtained the following sample data. State the hypotheses for a chi-squared test of independence between the two variables.
4)
A) H0 : Gender and blood type are independent. Ha: Gender and blood type are not independent. B) H0 : Gender and blood type are dependent. Ha: Gender and blood type are independent. C) H0 : There is a relationship between gender and blood type. Ha: There is no relationship between gender and blood type. D) H0 : Gender and blood type are dependent. Ha: Gender and blood type are not dependent. Use the contingency table to estimate expected cell count. 5) The contingency table below shows the results of a random sample of 300 state representatives that was conducted to see whether their opinions on a bill are related to their party affiliation. Assuming the row and column classification are independent, find an estimate for the expected cell count of row 2, column 2. Round your answer to tenths.
5)
Opinion Party Approve Disapprove No Opinion Republican 63 30 21 Democrat 75 36 27 Independent 15 24 9 A) 34.2
B) 36
C) 33.3
D) 41.4
E) 70.4
Select the most appropriate answer. 6) As the degrees of freedom increase, the mean and standard deviation of the chi-squared distribution A) Increase and decrease, respectively. B) decrease and increase, respectively. C) are both unaffected. D) increase and increase, respectively. E) decrease and decrease, respectively.
Copyright © 2017 Pearson Education, Inc. 3
6)
Provide an appropriate response. 7) The two-way table below summarizes data from a survey at a small liberal arts college:
7)
New Hires Tenure Adjunct Total Men 20 24 44 Women 23 18 41 Total 43 42 85 What is the probability (rounded to two decimal places) that a tenure-track new hire is female? A) 0.27 B) 0.56 C) 0.51 D) 0.43 E) 0.53 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 8) A survey of senior high school students in Dayton, Ohio produced the following contingency table relating the students' alcohol use and marijuana use.
ALCOHOL USE
Yes No
8)
MARIJUANA USE Yes No 955 994 5 322
Treating marijuana use as the response variable, find and interpret the relative risk. 9) The side effect of flu syndrome was reported by 8 people taking Botox and 2 people taking a placebo. For comparing these, software reports the following results:
9)
Rows: Treatment Columns: Flu
Botox Placebo
No 397 130
Yes 8 2
Fisher's exact test: P-Value = 1 a. b. c.
Specify the parameters that are in the null hypothesis for this test. Conduct the test. Why was Fisher's exact test used instead of the chi-squared test?
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Use the chi-squared table. 10) Use the appropriate table to find P(χ2 ≥ 7.38) assuming df = 2. A) 0.050 B) 0.500 C) 0.025
10) D) 0.975
Copyright © 2017 Pearson Education, Inc. 4
E) 0.950
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. A chi-square test has been performed on the given data, with the results shown. Use a statistical package to calculate the standardized residuals for the data and comment on your findings. 11) 11) A researcher wishes to test whether the proportion of college students who smoke is the same in four different colleges. She randomly selects 100 students from each college and records the number that smoke. The results are shown below.
Smoke Don't smoke
College A College B College C College D 17 26 11 34 83 74 89 66
Test statistic: χ2 = 17.832 P-value = 0.0004763 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 12) Which of the following is NOT a property of the relative risk? A) The relative risk is a number between 0 and 1. B) The relative risk can equal any nonnegative number. C) Two values for the relative risk represent the same strength of association, but in opposite directions, when one value is the reciprocal of the other. D) Values farther from 1.0 represent stronger associations. E) When p 1 = p 2 , the variables are independent and the relative risk = 1.0. Calculate χ2 from contingency table. 13) Responses to a survey question are broken down according to gender and the sample results are given below. Using a significance level of 0.05, calculate the chi-squared test statistic for testing the claim that response and gender are independent.
Male Female
Yes No Undecided 25 50 15 20 30 10
A) 5.991
B) 6.502
C) 2.706
D) 3.841
Copyright © 2017 Pearson Education, Inc. 5
E) 0.579
12)
13)
Provide an appropriate response. 14) At a high school debate tournament, half of the teams were asked to wear suits and ties and the rest were asked to wear jeans and t-shirts. The wins and losses according to attire are given in the table below. Win Loss Suit 22 28 T-shirt 28 22 Estimate the difference in the proportion of wins for those wearing suits versus those wearing t-shirts. Interpret. A) 0.79; for the sample, those wearing suits were less likely to win. B) -0.12; for the sample, the number of wins was higher for those wearing t-shirts than for those wearing suits. C) 0.50; for the sample, it is equally likely that those wearing t-shirts or those wearing suits will win. D) 0.79; for the sample, those wearing t-shirts were less likely to win. E) -0.12; for the sample, the number of wins was higher for those wearing suits than for those wearing t-shirts.
Copyright © 2017 Pearson Education, Inc. 6
14)
15) O Female 42 Sex Male Total 95
Blood Type A B AB Total 55 5 117 48 15 5 121 30 10
15)
The contingency table above shows the blood types of a sample of patients cross classified by sex. Fill in the missing entries. A) Blood Type O A B AB Total Female 42 55 64 5 117 Sex Male 79 48 15 5 121 Total 95 103 30 10 238 B)
O Female 42 Sex Male 53 Total 95
Blood Type A B AB Total 55 45 5 117 48 15 5 121 103 30 10 476 Blood Type A B AB Total 55 15 5 117 48 15 5 121 103 30 10 238
O Female 42 Sex Male 137 Total 95
Blood Type A B AB Total 55 45 5 117 48 15 5 121 103 30 10 238
O Female 42 Sex Male 53 Total 95
Blood Type A B AB Total 55 15 5 117 48 15 5 121 103 30 10 476
O Female 42 Sex Male 137 Total 95 C)
D)
E)
Use the contingency table to estimate expected cell count. 16) A sports researcher is interested in determining if there is a relationship between the number of home team and visiting team wins and different sports. A random sample of 526 games is selected and the results are given below. Assuming the row and column classification are independent, find an estimate for the expected cell count of row 2, column 2. Round your answer to tenths.
Home team wins Visiting team wins A) 65.75
Football Basketball Soccer Baseball 38 157 26 82 29 97 22 75 B) 146.3 C) 20.3
D) 97
Copyright © 2017 Pearson Education, Inc. 7
E) 107.7
16)
State the null hypothesis to test for independence. 17) Tests for adverse reactions to a new drug yielded the results given in the table below. At the 0.05 significance level, state the null hypothesis for testing the claim that the treatment (drug or placebo) is independent of the reaction (whether or not headaches were experienced).
17)
Drug Placebo Headaches 11 7 No headaches 73 91 A) H0 : Treatment and reaction are independent.
B) H0 : Treatment and reaction are dependent.
C) Ha: Treatment and reaction are dependent.
D) Ha: Treatment and reaction are independent.
Provide an appropriate response. 18) The contingency table below shows the income level for a random sample of adults cross classified by sex.
18)
Find the conditional distribution of the variable "income bracket" for men. A) Low: 14.7%; Middle: 19.2%; High: 18.6% B) Male: 52.6%; Female: 47.4% C) Male: 59.2%; Female: 40.8% D) Low: 28%; Middle: 36.6%; High: 35.4% E) Low: 46.9%; Middle: 51.7%; High: 59.2% Use the contingency table. 19) A sports researcher is interested in determining if there is a relationship between the number of home team and visiting team wins among different sports. A random sample of 526 games is selected and the number of wins of each type by sport are listed below.
Home team wins Visiting team wins
19)
Football Basketball Soccer Baseball 39 156 25 83 31 98 19 75
Calculate the chi-square test statistic χ2 to test the claim that the number of home team and visiting team wins is independent of the sport. A) 4.192 B) 2.919 C) 1.226 D) 5.391 E) 3.290 Group the bivariate data into a contingency table. 20) The table below provides data on sex, political party affiliation, and income bracket for a sample of people questioned during a poll. Group the bivariate data for the two variables ʺsexʺ and ʺincome bracketʺ into a contingency table. Sex M F F M
Political Party Income Bracket Rep Dem Dem Dem
High Middle Middle Low Copyright © 2017 Pearson Education, Inc. 8
20)
F M F M M F M F F M M F M F M F M M F
Other Rep Rep Rep Dem Rep Dem Rep Dem Dem Rep Dem Rep Other Other Dem Dem Rep Dem
Middle Low High High High Low High Middle Middle Middle Low High Low High Middle Low Middle Low Middle
A)
B)
C)
D) None of these E)
Copyright © 2017 Pearson Education, Inc. 9
Provide an appropriate response. 21) A researcher wishes to test the effectiveness of a flu vaccination. 150 people are vaccinated, 180 people are vaccinated with a placebo, and 100 people are not vaccinated. The number in each group who later caught the flu was recorded. The results are shown below.
21)
Vaccinated Placebo Control Caught the flu 8 19 21 Did not catch the flu 142 161 79 Find and interpret the relative risk of catching the flu, comparing those who were vaccinated to those in the control group. A) 0.21; those who were vaccinated were 0.21 times as likely to contract the flu as those in the control group B) 0.16; those who were vaccinated were 0.16 times as likely to contract the flu as those in the control group C) 0.25; those who were vaccinated were 0.25 times as likely to contract the flu as those in the control group D) 0.25; those who were in the control group were 0.25 times as likely to contract the flu as those who were vaccinated E) 0.38; those who were vaccinated were 0.38 times as likely to contract the flu as those in the control group 22) The two-way table below summarizes data from a survey at a small liberal arts college:
22)
New Hires Tenure Adjunct Total Men 19 25 44 Women 25 17 42 Total 44 42 86 What is the probability (rounded to two decimal places) that a randomly selected new hire is a tenure-track woman? A) 0.25 B) 0.29 C) 0.60 D) 0.20 E) 0.49 Select the most appropriate answer. 23) When the null hypothesis in the chi-squared test of independence of two categorical variables is true, the standardized residuals have approximately a A) Student's t distribution. B) chi-squared distribution. C) conditional distribution. D) standard normal distribution. E) binomial distribution.
Copyright © 2017 Pearson Education, Inc. 10
23)
Calculate χ2 from contingency table. 24) Use the sample data below to calculate the chi-squared test statistic for testing whether car color affects the likelihood of being in an accident. Use a significance level of 0.01. Red Blue White Car has been in accident 28 33 36 Car has not been in accident 23 22 30 A) 0.579
B) 6.635
C) 9.210
D) 0.050
E) 0.4287
Group the bivariate data into a contingency table. 25) The table below provides data on sex, political party affiliation, and income bracket for a sample of people questioned during a poll. Group the bivariate data for the two variables ʺpolitical partyʺ and ʺincome bracketʺ into a contingency table. Sex M F F M F M F M M F M F F M M F M F M F M M F
Political Party Income Bracket Rep Dem Dem Dem Other Rep Rep Rep Dem Rep Dem Rep Dem Dem Rep Dem Rep Other Other Dem Dem Rep Dem
High Middle Middle Low Middle Low High High High Low High Middle Middle Middle Low High Low High Middle Low Middle Low Middle
A)
Copyright © 2017 Pearson Education, Inc. 11
24)
25)
B)
C)
D)
E) None of these
Copyright © 2017 Pearson Education, Inc. 12
Answer Key Testname: CHAPTER 11 FORM A TEST
1) A 2) A 3) B 4) A 5) D 6) D 7) E 8) Treating marijuana use as the response variable and the adverse outcome as yes, the relative risk is 32.0. Therefore, the proportion of students using alcohol who also used marijuana was 32 times the proportion of students not using alcohol who used marijuana. A relative risk of 32 represents a strong association. 9) a. H0 : p 1 = p 2 , where p 1 = proportion of people taking Botox who experienced the side effect of flu syndrome and p 2 = proportion of people taking the placebo who experienced the side effect of flu syndrome; b. (1) Assumptions: two binary categorical variables, randomization; (2) H0 : p = p , Ha: p ≠ p ; (4) P-value = 1; (5) Since the P-value is above 0.05, we can 1 2 1 2 not reject H0 . We cannot conclude that an association exists between taking Botox and experiencing the side effect of flu syndrome at α = 0.05; c. Fisher's exact test was used instead of the chi-squared test, because not all of the expected cell counts were at least 5. 10) C 11) We reject the null hypothesis that the proportion of smokers is the same at all four colleges. The standardized residuals are given by -1.394 1.115 -3.066 3.345 . 1.394 -1.115 3.066 -3.345 The residuals for smokers at college D and non-smokers at college C are both greater than 3 and larger than expected under the null hypothesis of independence. The residuals for smokers at college C and non-smokers at college D are both less than -3 and smaller than expected under the null hypothesis of independence. 12) A 13) E 14) B 15) C 16) E 17) A 18) D 19) E 20) B 21) C 22) B 23) D 24) E 25) C
Copyright © 2017 Pearson Education, Inc. 13
CHAPTER 11 FORM B TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. State the null and alternative hypothesis. 1) A sample survey obtains information on characteristics of book readers. A book reader is defined to be anyone who has read one or more books in the six months prior to the survey; a non-book reader is defined to be anyone who read newspapers or magazines but no books in the six months prior to the survey; a nonreader is defined to be anyone who did not read a book, newspaper, or magazine in the six months prior to the survey. The following data were obtained from a random sample of 1429 persons 16 years old and over in an effort to determine whether or not the proportions of book readers, non-book readers, and non-readers are the same for each income bracket. State the hypotheses for a chi-squared test of independence between the two variables, classification and household income.
A) H0 : The classifications have the same distribution for each household income bracket. Ha: The classifications do not have the same distribution for each household income bracket. B) H0 : Household income and book readership are independent. Ha: Household income and book readership are not dependent. C) H0 : There is a relationship between household income and book readership. Ha: Household income and book readership are not related. D) H0 : There is no relationship between household income and book readership. Ha: Household income and book readership are not related.
Copyright © 2017 Pearson Education, Inc. 1
1)
Provide an appropriate response. 2)
2)
The contingency table above shows the blood types of a sample of patients cross classified by sex. How many people in the sample have blood type B? A) 16 B) 29 C) 112 D) 58 E) 13 3) At a high school debate tournament, half of the teams were asked to wear suits and ties and the rest were asked to wear jeans and t-shirts. The wins and losses according to attire are given in the table below. Using α = 0.05, test the hypothesis that the proportion of wins is the same for teams wearing suits as for teams wearing jeans. Win Loss Suit 22 28 T-shirt 28 22 H0 : The proportion of wins is the same for teams wearing suits as for teams wearing jeans. Ha: The proportions are different. Test statistic: χ2 = 1.440. Critical value: χ2 = 3.841. A) Since the test statistic > 1, we reject H0 and conclude that the proportion of wins differs for the teams wearing suits and those wearing jeans. B) Since the critical value > 1, we reject H0 and conclude that the proportion of wins differs for the teams wearing suits and those wearing jeans. C) Since the test statistic is less than the critical value, we reject H0 and conclude that the proportion of wins differs for the teams wearing suits and those wearing jeans. D) Since the test statistic is less than the critical value, we fail to reject H0 . There is not enough evidence to conclude that the proportion of wins differs for the teams wearing suits versus those wearing jeans. E) Since the test statistic is less than the critical value, we accept H0 and conclude that the proportion of wins is the same for the teams wearing suits and for those wearing jeans.
Copyright © 2017 Pearson Education, Inc. 2
3)
State the null and alternative hypothesis. 4) A researcher performed a study to determine whether an association exists between sex and blood type. He obtained the following sample data. State the hypotheses for a chi-squared test of independence between the two variables.
4)
A) H0 : Gender and blood type are independent. Ha: Gender and blood type are not independent. B) H0 : There is a relationship between gender and blood type. Ha: There is no relationship between gender and blood type. C) H0 : Gender and blood type are dependent. Ha: Gender and blood type are not dependent. D) H0 : Gender and blood type are dependent. Ha: Gender and blood type are independent. Use the contingency table to estimate expected cell count. 5) The contingency table below shows the results of a random sample of 300 state representatives that was conducted to see whether their opinions on a bill are related to their party affiliation. Assuming the row and column classification are independent, find an estimate for the expected cell count of row 2, column 2. Round your answer to tenths.
5)
Opinion Party Approve Disapprove No Opinion Republican 63 30 21 Democrat 75 36 27 Independent 15 24 9 A) 41.4
B) 70.4
C) 36
D) 34.2
E) 33.3
Select the most appropriate answer. 6) As the degrees of freedom increase, the mean and standard deviation of the chi-squared distribution A) decrease and increase, respectively. B) decrease and decrease, respectively. C) are both unaffected. D) increase and increase, respectively. E) Increase and decrease, respectively.
Copyright © 2017 Pearson Education, Inc. 3
6)
Provide an appropriate response. 7) The two-way table below summarizes data from a survey at a small liberal arts college:
7)
New Hires Tenure Adjunct Total Men 20 24 44 Women 23 18 41 Total 43 42 85 What is the probability (rounded to two decimal places) that a tenure-track new hire is female? A) 0.43 B) 0.56 C) 0.51 D) 0.53 E) 0.27 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 8) A survey of senior high school students in Dayton, Ohio produced the following contingency table relating the students' alcohol use and marijuana use.
ALCOHOL USE
Yes No
8)
MARIJUANA USE Yes No 955 994 5 322
Treating marijuana use as the response variable, find and interpret the relative risk. 9) The side effect of flu syndrome was reported by 8 people taking Botox and 2 people taking a placebo. For comparing these, software reports the following results:
9)
Rows: Treatment Columns: Flu
Botox Placebo
No 397 130
Yes 8 2
Fisher's exact test: P-Value = 1 a. b. c.
Specify the parameters that are in the null hypothesis for this test. Conduct the test. Why was Fisher's exact test used instead of the chi-squared test?
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Use the chi-squared table. 10) Use the appropriate table to find P(χ2 ≥ 7.38) assuming df = 2. A) 0.975 B) 0.025 C) 0.500
10) D) 0.950
Copyright © 2017 Pearson Education, Inc. 4
E) 0.050
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. A chi-square test has been performed on the given data, with the results shown. Use a statistical package to calculate the standardized residuals for the data and comment on your findings. 11) 11) A researcher wishes to test whether the proportion of college students who smoke is the same in four different colleges. She randomly selects 100 students from each college and records the number that smoke. The results are shown below.
Smoke Don't smoke
College A College B College C College D 17 26 11 34 83 74 89 66
Test statistic: χ2 = 17.832 P-value = 0.0004763 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 12) Which of the following is NOT a property of the relative risk? A) Two values for the relative risk represent the same strength of association, but in opposite directions, when one value is the reciprocal of the other. B) When p 1 = p 2 , the variables are independent and the relative risk = 1.0.
12)
C) Values farther from 1.0 represent stronger associations. D) The relative risk is a number between 0 and 1. E) The relative risk can equal any nonnegative number. Calculate χ2 from contingency table. 13) Responses to a survey question are broken down according to gender and the sample results are given below. Using a significance level of 0.05, calculate the chi-squared test statistic for testing the claim that response and gender are independent.
Male Female
Yes No Undecided 25 50 15 20 30 10
A) 2.706
B) 3.841
C) 0.579
D) 5.991
Copyright © 2017 Pearson Education, Inc. 5
E) 6.502
13)
Provide an appropriate response. 14) At a high school debate tournament, half of the teams were asked to wear suits and ties and the rest were asked to wear jeans and t-shirts. The wins and losses according to attire are given in the table below. Win Loss Suit 22 28 T-shirt 28 22 Estimate the difference in the proportion of wins for those wearing suits versus those wearing t-shirts. Interpret. A) -0.12; for the sample, the number of wins was higher for those wearing t-shirts than for those wearing suits. B) -0.12; for the sample, the number of wins was higher for those wearing suits than for those wearing t-shirts. C) 0.79; for the sample, those wearing t-shirts were less likely to win. D) 0.79; for the sample, those wearing suits were less likely to win. E) 0.50; for the sample, it is equally likely that those wearing t-shirts or those wearing suits will win.
Copyright © 2017 Pearson Education, Inc. 6
14)
15) O Female 42 Sex Male Total 95
Blood Type A B AB Total 55 5 117 48 15 5 121 30 10
15)
The contingency table above shows the blood types of a sample of patients cross classified by sex. Fill in the missing entries. A) Blood Type O A B AB Total Female 42 55 45 5 117 Sex Male 137 48 15 5 121 Total 95 103 30 10 238 B) O Female 42 Sex Male 53 Total 95 C)
Blood Type A B AB Total 55 15 5 117 48 15 5 121 103 30 10 238
O Female 42 Sex Male 53 Total 95
Blood Type A B AB Total 55 45 5 117 48 15 5 121 103 30 10 476 Blood Type A B AB Total 55 15 5 117 48 15 5 121 103 30 10 476
O Female 42 Sex Male 79 Total 95
Blood Type A B AB Total 55 64 5 117 48 15 5 121 103 30 10 238
O Female 42 Sex Male 137 Total 95 D)
E)
Use the contingency table to estimate expected cell count. 16) A sports researcher is interested in determining if there is a relationship between the number of home team and visiting team wins and different sports. A random sample of 526 games is selected and the results are given below. Assuming the row and column classification are independent, find an estimate for the expected cell count of row 2, column 2. Round your answer to tenths.
Home team wins Visiting team wins A) 97
Football Basketball Soccer Baseball 38 157 26 82 29 97 22 75 B) 107.7 C) 65.75
D) 20.3
Copyright © 2017 Pearson Education, Inc. 7
E) 146.3
16)
State the null hypothesis to test for independence. 17) Tests for adverse reactions to a new drug yielded the results given in the table below. At the 0.05 significance level, state the null hypothesis for testing the claim that the treatment (drug or placebo) is independent of the reaction (whether or not headaches were experienced).
17)
Drug Placebo Headaches 11 7 No headaches 73 91 A) Ha: Treatment and reaction are dependent.
B) H0 : Treatment and reaction are independent.
C) H0 : Treatment and reaction are dependent.
D) Ha: Treatment and reaction are independent.
Provide an appropriate response. 18) The contingency table below shows the income level for a random sample of adults cross classified by sex.
18)
Find the conditional distribution of the variable "income bracket" for men. A) Male: 59.2%; Female: 40.8% B) Low: 14.7%; Middle: 19.2%; High: 18.6% C) Low: 28%; Middle: 36.6%; High: 35.4% D) Male: 52.6%; Female: 47.4% E) Low: 46.9%; Middle: 51.7%; High: 59.2% Use the contingency table. 19) A sports researcher is interested in determining if there is a relationship between the number of home team and visiting team wins among different sports. A random sample of 526 games is selected and the number of wins of each type by sport are listed below.
Home team wins Visiting team wins
Football Basketball Soccer Baseball 39 156 25 83 31 98 19 75
Calculate the chi-square test statistic χ2 to test the claim that the number of home team and visiting team wins is independent of the sport. A) 4.192 B) 1.226 C) 3.290 D) 5.391 E) 2.919
Copyright © 2017 Pearson Education, Inc. 8
19)
Provide an appropriate response. 20) A researcher wishes to test the effectiveness of a flu vaccination. 150 people are vaccinated, 180 people are vaccinated with a placebo, and 100 people are not vaccinated. The number in each group who later caught the flu was recorded. The results are shown below.
20)
Vaccinated Placebo Control Caught the flu 8 19 21 Did not catch the flu 142 161 79 Find and interpret the relative risk of catching the flu, comparing those who were vaccinated to those in the control group. A) 0.16; those who were vaccinated were 0.16 times as likely to contract the flu as those in the control group B) 0.21; those who were vaccinated were 0.21 times as likely to contract the flu as those in the control group C) 0.38; those who were vaccinated were 0.38 times as likely to contract the flu as those in the control group D) 0.25; those who were vaccinated were 0.25 times as likely to contract the flu as those in the control group E) 0.25; those who were in the control group were 0.25 times as likely to contract the flu as those who were vaccinated 21) The two-way table below summarizes data from a survey at a small liberal arts college:
21)
New Hires Tenure Adjunct Total Men 19 25 44 Women 25 17 42 Total 44 42 86 What is the probability (rounded to two decimal places) that a randomly selected new hire is a tenure-track woman? A) 0.20 B) 0.60 C) 0.25 D) 0.49 E) 0.29 Group the bivariate data into a contingency table. 22) The table below provides data on sex, political party affiliation, and income bracket for a sample of people questioned during a poll. Group the bivariate data for the two variables ʺsexʺ and ʺincome bracketʺ into a contingency table. Sex M F F M F M F M M F M F F
Political Party Income Bracket Rep Dem Dem Dem Other Rep Rep Rep Dem Rep Dem Rep Dem
High Middle Middle Low Middle Low High High High Low High Middle Middle Copyright © 2017 Pearson Education, Inc. 9
22)
M M F M F M F M M F
Dem Rep Dem Rep Other Other Dem Dem Rep Dem
Middle Low High Low High Middle Low Middle Low Middle
A)
B)
C)
D) None of these E)
Select the most appropriate answer. 23) When the null hypothesis in the chi-squared test of independence of two categorical variables is true, the standardized residuals have approximately a A) chi-squared distribution. B) binomial distribution. C) Student's t distribution. D) standard normal distribution. E) conditional distribution.
Copyright © 2017 Pearson Education, Inc. 10
23)
Calculate χ2 from contingency table. 24) Use the sample data below to calculate the chi-squared test statistic for testing whether car color affects the likelihood of being in an accident. Use a significance level of 0.01. Red Blue White Car has been in accident 28 33 36 Car has not been in accident 23 22 30 A) 0.4287
B) 9.210
C) 0.050
D) 0.579
E) 6.635
Group the bivariate data into a contingency table. 25) The table below provides data on sex, political party affiliation, and income bracket for a sample of people questioned during a poll. Group the bivariate data for the two variables ʺpolitical partyʺ and ʺincome bracketʺ into a contingency table. Sex M F F M F M F M M F M F F M M F M F M F M M F
Political Party Income Bracket Rep Dem Dem Dem Other Rep Rep Rep Dem Rep Dem Rep Dem Dem Rep Dem Rep Other Other Dem Dem Rep Dem
High Middle Middle Low Middle Low High High High Low High Middle Middle Middle Low High Low High Middle Low Middle Low Middle
Copyright © 2017 Pearson Education, Inc. 11
24)
25)
A)
B)
C) None of these D)
E)
Copyright © 2017 Pearson Education, Inc. 12
Answer Key Testname: CHAPTER 11 FORM B TEST
1) A 2) B 3) D 4) A 5) A 6) D 7) D 8) Treating marijuana use as the response variable and the adverse outcome as yes, the relative risk is 32.0. Therefore, the proportion of students using alcohol who also used marijuana was 32 times the proportion of students not using alcohol who used marijuana. A relative risk of 32 represents a strong association. 9) a. H0 : p 1 = p 2 , where p 1 = proportion of people taking Botox who experienced the side effect of flu syndrome and p 2 = proportion of people taking the placebo who experienced the side effect of flu syndrome; b. (1) Assumptions: two binary categorical variables, randomization; (2) H0 : p = p , Ha: p ≠ p ; (4) P-value = 1; (5) Since the P-value is above 0.05, we can 1 2 1 2 not reject H0 . We cannot conclude that an association exists between taking Botox and experiencing the side effect of flu syndrome at α = 0.05; c. Fisher's exact test was used instead of the chi-squared test, because not all of the expected cell counts were at least 5. 10) B 11) We reject the null hypothesis that the proportion of smokers is the same at all four colleges. The standardized residuals are given by -1.394 1.115 -3.066 3.345 . 1.394 -1.115 3.066 -3.345 The residuals for smokers at college D and non-smokers at college C are both greater than 3 and larger than expected under the null hypothesis of independence. The residuals for smokers at college C and non-smokers at college D are both less than -3 and smaller than expected under the null hypothesis of independence. 12) D 13) C 14) A 15) B 16) B 17) B 18) C 19) C 20) D 21) E 22) B 23) D 24) A 25) D
Copyright © 2017 Pearson Education, Inc. 13
CHAPTER 12 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The index (x) of exposure to radioactive waste and the cancer mortality rates (y)(deaths per 100,000) were recorded for nine different Texas counties. The regression analysis is given below. Use the analysis to find the slope of the regression line. Predictor Coef SE(Coef) Constant 114.7156 8.045663 Index of Exp 9.231456 1.418787 A) 9.231456 B) 8.045663
T P 14.25807 0.00000198 6.506584 0.000332 C) 114.7156
D) 1.418787
E) 14.25807
Answer true or false. 2) The parameter β in the straight-line model μ = α + βx represents a multiplicative effect on the mean of y for y a one-unit increase in x. A) True B) False Select the most appropriate answer. 3) The error involved in estimating the population mean of y at a particular value of x is ____________________ the error involved in predicting an individual value of y at that same particular value of x. A) less than B) equal to C) less than or equal to D) greater than or equal to E) greater than Provide an appropriate response. 4) The typing speeds (x) (in words per minute) and reading speeds (y)(in words per minute) of nine randomly selected secretaries were recorded. Use the regression analysis provided below to predict the reading speed for a typing speed of 10 words per minute. Round your answer to the nearest word. ^
The regression equation is y = 290.2093 + 3.502052x R-sq = 12.385% 9 - 2 = 7 degrees of freedom Predictor Coef Constant 290.2093 Typing speed 3.502052 A) 330 words per minute B) 352 words per minute C) 290 words per minute D) 250 words per minute E) 322 words per minute
SE(Coef) 215.4116 3.520579
T 1.347231 0.994737
P 0.219884 0.352998
Copyright © 2017 Pearson Education, Inc. 1
1)
2)
3)
4)
5) A tennis ball is dropped from 15 different heights, x, (in inches) and the height of the bounce, y, is recorded ^
5)
(in inches). A regression analysis gives the the model y = -0.4 + 0.70x. A tennis ball dropped 71 inches bounced 53.3 inches. What is the residual for this bounce height? A) 2.8 inches B) 4 inches C) 5 inches D) -4 inches E) 49.3 inches Select the most appropriate answer. 6) To use regression in an inferential manner, which of the following assumptions is/are necessary? A) The population means of y at different values of x have a straight -line relationship with x. B) The population values of y at each value of x have the same standard deviation. C) The population values of y at each value of x follow a normal distribution. D) The data were gathered using randomization, such as random sampling or a randomized experiment. E) All of these. Answer true or false. 7) The predictor x is relatively closer to its mean than the predicted y is to its mean. A) True B) False Provide an appropriate response. 8) Records were kept on the relationship between the rainfall (in inches) and the yield of cotton (bushels per ^
6)
7)
8)
acre). The equation of the least squares regression line is found to be y = 4.267 + 4.379x. Use the regression line to predict the yield of cotton for an area with a rainfall of 10 inches. 10.5 8.8 13.4 12.5 18.8 10.3 7.0 15.6 16.0 Rain fall (in inches), x Yield (bushels per acre), y 50.5 46.2 58.8 59.0 82.4 49.2 31.9 76.0 78.8 A) 31 bushels per acre B) 45 bushels per acre C) 50 bushels per acre D) 48 bushels per acre E) 47 bushels per acre 9) Suppose that for a certain country, the population size can be predicted by the exponential regression model ^ y = 2.21 × 1.08x where y is the population size (in millions) and x is the number of decades since 2000. What is the predicted population size for 2030? Round to two decimal places. A) 7.16 million B) 2.78 million C) 22.24 million D) 71.60 million E) 2.58 million
Copyright © 2017 Pearson Education, Inc. 2
9)
10) Ten college students were randomly selected. Their grade point averages (GPAs) when they entered the program were between 3.5 and 4.0. The students' GPAs on entering the program (x) and their current GPAs (y) were recorded. The regression analysis is given below. Use the analysis to find the equation of the regression line. Predictor
Coef
SE(Coef)
Constant Entering GPA
3.584756 0.090953
0.078183 0.022162
T
10)
P
45.85075 5.66 × 10-11 4.103932 0.003419
^
A) y = 3.584756x + 0.090953 ^
B) y = 45.85075 + 0.090953x ^
C) y = 3.584756 + 0.022162x ^
D) y = 3.584756 - 0.022162x ^
E) y = 3.584756 + 0.090953x Select the most appropriate answer. 11) For exponential regression, a plot of
versus the x-values is approximately linear.
A) the exponential of the y-values C) the logarithm of the y-values
B) the y-values D) the residuals
Solve the problem. 12) Fell running, a popular sport in England, involves running long distances over mountainous terrain. The times for 28 female runners to complete a particular 10-mile course are recorded along with the amount of time each runner had spent training for the race in the month prior to the event. The regression of time to complete the race (in minutes) on training time (in hours) has the following regression output: Predictor Constant Training time
Coef SE Coef 129.103 7.109 -0.1475 0.01593
11)
T 18.16 -9.26
12)
P 0.000 0.000
Interpret the y-intercept of the prediction equation. A) A runner who has trained for one hour, is predicted to complete the race in 128.9555 minutes. B) For each additional hour of training time, the predicted time to complete the race will decrease by 0.1475 minutes. C) A runner who has not trained at all for the race is predicted to complete the race in 128.9555 minutes. D) A runner who has not trained at all for the race is predicted to complete the race in 129.103 minutes. Provide an appropriate response. 13) In order for a prediction interval for y or a confidence interval for μ to be valid, which of the following must hold? I. The true relationship between x and y must be close to linear. II. The variability of the y-values must be about the same at each fixed x value. III. The variability of the x-values must be about the same at each fixed y value. A) both I and II B) III only C) I only D) I, II, and III E) II only
Copyright © 2017 Pearson Education, Inc. 3
13)
Fill in the blank.
^ 2 14) The sum ∑(y -y) is called the sum of squared ___________.
A) total
B) regression
C) residuals
^
14) D) errors
E) mean
^
15) The difference y - y between an observed outcome y and its predicted value y is called a/an ______________. D) equation A) residual B) constant C) mean
15) E) slope
Provide an appropriate response. 16) Fast food is often considered unhealthy because much fast food is high in both fat and sodium. Using the table below find the correlation between fat and sodium. Round your answer to three decimals places. Fat in grams (x) 19 31 34 35 39 39 43 Sodium in milligrams (y) 920 1500 1310 860 1180 940 1260 A) 0.210 B) 0.335 C) 0.199 D) -0.199 E) 0.040 17) The typing speeds (x) (in words per minute) and reading speeds (y)(in words per minute) of nine randomly selected secretaries were recorded. Use the regression analysis provided below to calculate the residual for a typist with a typing speed of 50 words per minute and a reading speed of 480 words per minute.
16)
17)
^
The regression equation is y = 290.2093 + 3.502052x R-sq = 12.385% 9 - 2 = 7 degrees of freedom Predictor Constant Typing speed A) 465.31
Coef SE(Coef) 290.2093 215.4116 3.502052 3.520579 B) 14.69
T P 1.347231 0.219884 0.994737 0.352998 C) 304.9
D) -304.9
E) -14.69
Use technology to provide an appropriate response. 18) Managers rate employees according to job performance (x) and attitude (y). The results for several randomly selected employees are given below. Determine the regression line for this data. Round to the values to two decimal places. x 59 63 65 69 77 76 69 70 64 58 y 72 67 78 82 87 92 83 87 78 75
18)
^
A) y = 2.81 + 1.35x ^
B) y = 92.3 - 0.669x ^
C) y = -47.3 + 2.02x ^
D) y = 11.7x + 1.02 ^
E) y = 11.7 + 1.02x Provide an appropriate response. 19) The data given below are the gestation periods, in months, of randomly selected mammals and their corresponding life span, in years. Gestation 8 2.1 1.3 1 11.5 5.3 3.8 24.3 Life span 30 13 8 4 28 11 12 42 Identify the explanatory variable C) gestation D) months E) life span A) climate B) mammals
Copyright © 2017 Pearson Education, Inc. 4
19)
Answer true or false. 20) The slope describes the strength of linear association. A) True
20) B) False
Provide an appropriate response. 21) The data given are the gestation periods, in months, of randomly selected mammals and their corresponding life span, in years. Calculate the correlation coefficient r. Round your answer to three decimal places. Gestation (x) 8 2.1 1.3 1 11.5 5.3 3.8 24.3 Life span (y) 30 12 6 3 25 12 10 40 A) 0.916 B) -0.839 C) -0.916 D) 1 E) 0.839
21)
Select the most appropriate answer. 22) A table that reports the sums of squares used in regression analysis is called a(n) ____________________ table. A) sums of squares B) residual C) analysis of variance D) contingency E) regression Provide an appropriate response. 23) The data below are the ages and systolic blood pressures (measured in millimeters of mercury) of 9 randomly ^
22)
23)
selected adults. The equation of the least squares regression line is found to be y = 60.46 + 1.488x. Use the regression line to predict the systolic blood pressure of a person who is 52 years old. Round to the nearest millimeter. 38 41 45 48 51 53 57 61 65 Age, x 116 120 123 131 142 145 148 150 152 Pressure, y A) 150 mmHg B) 138 mmHg C) 130 mmHg D) 80 mmHg E) 141 mmHg 24) The regression equation relating dexterity scores (x) and productivity scores (y) for the randomly selected ^
employees of a company is y = 5.50 + 1.91x. Find the estimated mean of the productivity scores for all those who had dexterity scores of x = 37. Round to the nearest tenth. A) 58.2 B) 70.7 C) 56.3 D) 76.2 E) 205.4 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 25) An observation in regression analysis is influential when what two things happen?
Copyright © 2017 Pearson Education, Inc. 5
25)
24)
Answer Key Testname: CHAPTER 12 FORM A TEST
1) A 2) B 3) A 4) B 5) B 6) E 7) B 8) D 9) B 10) E 11) C 12) D 13) A 14) C 15) A 16) C 17) B 18) E 19) C 20) B 21) A 22) C 23) B 24) D 25) (1) Its x value is low or high compared to the rest of the data and (2) It does not fall in the straight-line pattern that the rest of the data have.
Copyright © 2017 Pearson Education, Inc. 6
CHAPTER 12 FORM B TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The typing speeds (x) (in words per minute) and reading speeds (y)(in words per minute) of nine randomly selected secretaries were recorded. Use the regression analysis provided below to predict the reading speed for a typing speed of 10 words per minute. Round your answer to the nearest word.
1)
^
The regression equation is y = 290.2093 + 3.502052x R-sq = 12.385% 9 - 2 = 7 degrees of freedom Predictor Coef Constant 290.2093 Typing speed 3.502052 A) 330 words per minute B) 352 words per minute C) 290 words per minute D) 322 words per minute E) 250 words per minute
SE(Coef) 215.4116 3.520579
T 1.347231 0.994737
P 0.219884 0.352998
2) The index (x) of exposure to radioactive waste and the cancer mortality rates (y)(deaths per 100,000) were recorded for nine different Texas counties. The regression analysis is given below. Use the analysis to find the slope of the regression line. Predictor Coef SE(Coef) Constant 114.7156 8.045663 Index of Exp 9.231456 1.418787 A) 1.418787 B) 14.25807
T P 14.25807 0.00000198 6.506584 0.000332 C) 9.231456
D) 8.045663
2)
E) 114.7156
3) Suppose that for a certain country, the population size can be predicted by the exponential regression model ^ y = 2.21 × 1.08x where y is the population size (in millions) and x is the number of decades since 2000.
3)
What is the predicted population size for 2030? Round to two decimal places. A) 2.78 million B) 2.58 million C) 22.24 million D) 71.60 million E) 7.16 million 4) In order for a prediction interval for y or a confidence interval for μ to be valid, which of the following must hold? I. The true relationship between x and y must be close to linear. II. The variability of the y-values must be about the same at each fixed x value. III. The variability of the x-values must be about the same at each fixed y value. A) I only B) II only C) III only D) I, II, and III E) both I and II Copyright © 2017 Pearson Education, Inc. 1
4)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 5) An observation in regression analysis is influential when what two things happen?
5)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 6) A tennis ball is dropped from 15 different heights, x, (in inches) and the height of the bounce, y, is recorded ^
6)
(in inches). A regression analysis gives the the model y = -0.4 + 0.70x. A tennis ball dropped 71 inches bounced 53.3 inches. What is the residual for this bounce height? A) 5 inches B) -4 inches C) 2.8 inches D) 4 inches E) 49.3 inches 7) The regression equation relating dexterity scores (x) and productivity scores (y) for the randomly selected ^
7)
employees of a company is y = 5.50 + 1.91x. Find the estimated mean of the productivity scores for all those who had dexterity scores of x = 37. Round to the nearest tenth. A) 76.2 B) 56.3 C) 58.2 D) 205.4 E) 70.7 8) Fast food is often considered unhealthy because much fast food is high in both fat and sodium. Using the table below find the correlation between fat and sodium. Round your answer to three decimals places. Fat in grams (x) 19 31 34 35 39 39 43 Sodium in milligrams (y) 920 1500 1310 860 1180 940 1260 A) -0.199 B) 0.199 C) 0.335 D) 0.040 E) 0.210
8)
9) The data given are the gestation periods, in months, of randomly selected mammals and their corresponding life span, in years. Calculate the correlation coefficient r. Round your answer to three decimal places. Gestation (x) 8 2.1 1.3 1 11.5 5.3 3.8 24.3 Life span (y) 30 12 6 3 25 12 10 40 A) 0.916 B) -0.916 C) 0.839 D) 1 E) -0.839
9)
10) Ten college students were randomly selected. Their grade point averages (GPAs) when they entered the program were between 3.5 and 4.0. The students' GPAs on entering the program (x) and their current GPAs (y) were recorded. The regression analysis is given below. Use the analysis to find the equation of the regression line.
10)
Predictor
Coef
SE(Coef)
Constant Entering GPA
3.584756 0.090953
0.078183 0.022162
T
P
45.85075 5.66 × 10-11 4.103932 0.003419
^
A) y = 3.584756 - 0.022162x ^
B) y = 45.85075 + 0.090953x ^
C) y = 3.584756 + 0.022162x ^
D) y = 3.584756 + 0.090953x ^
E) y = 3.584756x + 0.090953
Copyright © 2017 Pearson Education, Inc. 2
11) The data below are the ages and systolic blood pressures (measured in millimeters of mercury) of 9 randomly ^
11)
selected adults. The equation of the least squares regression line is found to be y = 60.46 + 1.488x. Use the regression line to predict the systolic blood pressure of a person who is 52 years old. Round to the nearest millimeter. 38 41 45 48 51 53 57 61 65 Age, x 116 120 123 131 142 145 148 150 152 Pressure, y A) 141 mmHg B) 80 mmHg C) 150 mmHg D) 138 mmHg E) 130 mmHg 12) Records were kept on the relationship between the rainfall (in inches) and the yield of cotton (bushels per ^
12)
acre). The equation of the least squares regression line is found to be y = 4.267 + 4.379x. Use the regression line to predict the yield of cotton for an area with a rainfall of 10 inches. 10.5 8.8 13.4 12.5 18.8 10.3 7.0 15.6 16.0 Rain fall (in inches), x Yield (bushels per acre), y 50.5 46.2 58.8 59.0 82.4 49.2 31.9 76.0 78.8 A) 48 bushels per acre B) 47 bushels per acre C) 45 bushels per acre D) 31 bushels per acre E) 50 bushels per acre 13) The data given below are the gestation periods, in months, of randomly selected mammals and their corresponding life span, in years. Gestation 8 2.1 1.3 1 11.5 5.3 3.8 24.3 Life span 30 13 8 4 28 11 12 42 Identify the explanatory variable E) life span A) gestation B) climate C) months D) mammals Select the most appropriate answer. 14) To use regression in an inferential manner, which of the following assumptions is/are necessary? A) The data were gathered using randomization, such as random sampling or a randomized experiment. B) The population means of y at different values of x have a straight -line relationship with x. C) The population values of y at each value of x follow a normal distribution. D) All of these. E) The population values of y at each value of x have the same standard deviation. Provide an appropriate response. 15) The typing speeds (x) (in words per minute) and reading speeds (y)(in words per minute) of nine randomly selected secretaries were recorded. Use the regression analysis provided below to calculate the residual for a typist with a typing speed of 50 words per minute and a reading speed of 480 words per minute. ^
The regression equation is y = 290.2093 + 3.502052x R-sq = 12.385% 9 - 2 = 7 degrees of freedom Predictor Constant Typing speed A) -304.9
Coef SE(Coef) 290.2093 215.4116 3.502052 3.520579 B) -14.69
T P 1.347231 0.219884 0.994737 0.352998 C) 465.31
D) 304.9
Copyright © 2017 Pearson Education, Inc. 3
E) 14.69
13)
14)
15)
Select the most appropriate answer. 16) A table that reports the sums of squares used in regression analysis is called a(n) ____________________ table. A) residual B) sums of squares C) regression D) analysis of variance E) contingency
16)
17) The error involved in estimating the population mean of y at a particular value of x is ____________________ the error involved in predicting an individual value of y at that same particular value of x. A) less than or equal to B) equal to C) less than D) greater than E) greater than or equal to
17)
18) For exponential regression, a plot of
18)
versus the x-values is approximately linear.
A) the residuals C) the y-values
B) the logarithm of the y-values D) the exponential of the y-values
Fill in the blank.
^ 2 19) The sum ∑(y -y) is called the sum of squared ___________.
A) regression
B) total
C) residuals
^
19) D) errors
E) mean
^
20) The difference y - y between an observed outcome y and its predicted value y is called a/an ______________. A) constant B) slope C) equation D) residual
20) E) mean
Answer true or false. 21) The predictor x is relatively closer to its mean than the predicted y is to its mean. A) True B) False
21)
22) The parameter β in the straight-line model μ = α + βx represents a multiplicative effect on the mean of y for y a one-unit increase in x. A) True B) False
22)
23) The slope describes the strength of linear association. A) True
23) B) False
Copyright © 2017 Pearson Education, Inc. 4
Solve the problem. 24) Fell running, a popular sport in England, involves running long distances over mountainous terrain. The times for 28 female runners to complete a particular 10-mile course are recorded along with the amount of time each runner had spent training for the race in the month prior to the event. The regression of time to complete the race (in minutes) on training time (in hours) has the following regression output: Predictor Constant Training time
Coef SE Coef 129.103 7.109 -0.1475 0.01593
T 18.16 -9.26
24)
P 0.000 0.000
Interpret the y-intercept of the prediction equation. A) For each additional hour of training time, the predicted time to complete the race will decrease by 0.1475 minutes. B) A runner who has not trained at all for the race is predicted to complete the race in 129.103 minutes. C) A runner who has not trained at all for the race is predicted to complete the race in 128.9555 minutes. D) A runner who has trained for one hour, is predicted to complete the race in 128.9555 minutes. Use technology to provide an appropriate response. 25) Managers rate employees according to job performance (x) and attitude (y). The results for several randomly selected employees are given below. Determine the regression line for this data. Round to the values to two decimal places. x 59 63 65 69 77 76 69 70 64 58 y 72 67 78 82 87 92 83 87 78 75 ^
A) y = -47.3 + 2.02x ^
B) y = 11.7x + 1.02 ^
C) y = 2.81 + 1.35x ^
D) y = 92.3 - 0.669x ^
E) y = 11.7 + 1.02x
Copyright © 2017 Pearson Education, Inc. 5
25)
Answer Key Testname: CHAPTER 12 FORM B TEST
1) B 2) C 3) A 4) E 5) (1) Its x value is low or high compared to the rest of the data and (2) It does not fall in the straight-line pattern that the rest of the data have. 6) D 7) A 8) B 9) A 10) D 11) D 12) A 13) A 14) D 15) E 16) D 17) C 18) B 19) C 20) D 21) B 22) B 23) B 24) B 25) E
Copyright © 2017 Pearson Education, Inc. 6
CHAPTER 13 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) A researcher found a significant relationship between a person's age, x 1 , the number of hours a person works
1)
per week, x 2 , and the number of accidents, y, the person has per year. The relationship can be represented by ^
the multiple regression equation y = -3.2 + 0.012x 1 + 0.23x 2 . Predict the number of accidents per year (to the nearest whole number) for a person aged 41 who works 31 hours per week. A) 4 B) 3 C) 5 D) 7
E) 6
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 2) Suppose that the prediction equation relating number of years of education to gender (x 1 = 1 for
2)
^
males, x 1 = 0 for females) is y = 13.0 + 1.0 x 1 . a. Find the estimated mean education for (i) females, (ii) males. b. Explain how to interpret the coefficient of the indicator variable. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 3) What factors affect the sale price of oceanside condominium units? To answer this question, the following data were recorded for each of the n = 105 units sold at auction: y = sale price (in thousands of dollars) x 1 = floor height (1, 2, 3, . . ., 8) x 2 = 1 if ocean view, 0 is bay view
^
The least squares prediction equation is y = 17.770 - 0.073 x 1 + 3.137 x 2 . Interpret the estimated effect of floor height. A) Expected sale price increases by $73 for each one floor increase in floor height for the same view type. B) Expected sale price decreases by $73 for each one floor increase in floor height for the same view type. C) Expected sale price decreases by $0.73 for each one floor increase in floor height for the same view type. D) The correlation between selling price and floor height is -0.073. E) Expected sale price increases by $0.73 for each one floor increase in floor height for the same view type.
Copyright © 2017 Pearson Education, Inc. 1
3)
4) A family doctor wants to examine the variables that affect his female patients' total cholesterol. He randomly selects 14 of his female patients and asks them to determine their average daily consumption of saturated fat. The output from MINITAB gives the following regression analysis: Predictor Coef SE Coef Constant 90.84 15.99 AGE 1.0142 0.2427 FAT 3.2443 0.6632 R-Sq = 84.7%
T 5.68 4.18 4.89
4)
P 0.000 0.002 0.000
Interpret the coefficient for age. A) For every 1 year increase in age, total cholesterol is expected to increase by 91.8542, holding average daily consumption of saturated fat constant. B) For every 1 year increase in age, total cholesterol is expected to increase by 1.0142, holding average daily consumption of saturated fat constant. C) For every 1 unit increase in total cholesterol, age is expected to be 1.0142 years greater, holding average daily consumption of saturated fat constant. D) For every 1 year increase in age, total cholesterol is expected to increase by 90.84, holding average daily consumption of saturated fat constant. E) For every 1 year increase in age, total cholesterol is expected to increase by 0.2427, holding average daily consumption of saturated fat constant. Determine whether the scatter plot shows little or no association, a negative association, a linear association, a moderately strong association, or a very strong association (multiple associations are possible). 5) 5)
A) Positive association, linear association, moderately strong association B) Linear association, moderately strong association C) Little or no association D) Moderately strong association E) Positive association
Copyright © 2017 Pearson Education, Inc. 2
Provide an appropriate response. 6) A logistic regression model was fit to a random sample of Americans of voting age to describe the relationship between the probability of voting in the 2004 presidential election, p, and a personʹs age, x. Part of the logistic regression analysis from Minitab follows.
6)
Predictor Coef SE Coef Z P Constant -0.811552 0.200876 -4.04 0.000 Age 0.0251255 0.0047960 5.24 0.000 At what age does the estimated probability of voting in the 2004 presidential election equal 0.50? Round your answer to the nearest year. A) 32 B) 42 C) 31 D) 33 E) None of these 7) Use the following computer data, which refers to bear measurements, to answer the question.
7)
Dependent variable is Weight R-Sq = 96.9% Predictor Coef Constant -285.21 Age -1.3838 Head Width -11.24 Neck 28.594
SE Coef T 78.45 -3.64 0.9022 -1.53 20.88 -0.54 5.870 4.87
P 0.022 0.200 0.619 0.007
Write the equation of the regression model. A) Weight = -285.21 - 1.3838 Age - 11.24 Head Width + 28.594 Neck B) Weight = -285.21 + 78.45 Age - 3.64 Head Width + 0.022 Neck C) Weight = -285.21 + 1.3838 Age + 11.24 Head Width + 28.594 Neck D) Weight = 78.45 - 0.9022 Age - 20.88 Head Width + 5.870 Neck E) Weight = 78.45 + 0.9022 Age + 20.88 Head Width + 5.870 Neck 8) Suppose you fit the multiple regression model μy = α + β1 x 1 + β2 x 2 to n = 28 data values and obtain the ^
prediction equation y = 4 + 1.57 x 1 + 1.15 x 2 . The estimates of the standard deviations of the sampling ^
^
distributions of β 1 and β2 are 1.96 and 0.63, respectively. Calculate the test statistic and P-value for testing H0 : β 2 = 0 against Ha: β 2 ≠ 0. A) t = 1.83; 0.025 < P-value < 0.05 B) t = 0.80; P-value > 0.20 C) t = 9.66; P-value< 0.002 D) t = 1.83; 0.05 < P-value < 0.10 E) t = 0.80; P-value > 0.10 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 9) Explain why the overall F test is done before the individual t tests in a multiple regression analysis.
Copyright © 2017 Pearson Education, Inc. 3
9)
8)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 10) In order to predict the length of time a house will remain on the market until it is sold in a particular county, the selling price (x 1 ), square footage (x 2 ) and whether or not the house is in the city (x 3 = 1 for city and 0
10)
otherwise) were recorded for a random sample of 250 homes. What is the regression model for a house that is located in the city? A) μ = α + β1 x 1 + β2 x 2 ^
B) y = α + β1 x 1 + β2 x 2 + β3 C) μ = α + β3 x 3 D) μ = α + β1 x 1 + β2 x 2 + β3 E) μ = α + β1 x 1 + β2 x 2 + β3 x 3 Provide an appropriate response. 11) Use the multiple regression equation given below to predict the y-value for the given values of the ^
11)
independent variables. The regression line is y = 6101 + 31.8 x 1 - 0.589 x 2 . Estimate y for x 1 = 1517 and x 2 = 1558. Round to the nearest thousandth. A) 47322.938
B) 49158.262
C) 53,423.938
D) 53455.738
E) 55259.262
12) Use the following computer data, which refers to bear measurements, to answer the question.
12)
Dependent variable is Weight R-Sq = 96.9% Predictor Coef Constant -285.21 Age -1.3838 Head Width -11.24 Neck 28.594
SE Coef T 78.45 -3.64 0.9022 -1.53 20.88 -0.54 5.870 4.87
P 0.022 0.200 0.619 0.007
Which measurement is the best predictor of weight, after allowing for the linear effects of the other variables in the model? A) Sex B) Constant C) Age D) Neck E) Head Width 13) A regression equation was fit relating a student's weight to the number of hours they spent on the computer, x 1 , the number of hours they spent watching television, x 2 , and the number of hours they spent on the ^
telephone, x 3 . The fitted equation is y = 125.5 + 3.26 x 1 +4.27 x 2 + 1.75 x 3 . What is a student's predicted weight if the student spends 4 hours on the computer, 2 hours viewing television, and 3 hours on the telephone? A) 152.33 B) 153.22 C) 151.83 D) 154.35 E) 155.86
Copyright © 2017 Pearson Education, Inc. 4
13)
14) What factors affect the sale price of oceanside condominium units? To answer this question, the following data were recorded for each of the n = 105 units sold at auction: y = sale price (in thousands of dollars) x 1 = floor height (1, 2, 3, . . ., 8)
14)
x 2 = 1 if ocean view, 0 is bay view Source DF SS MS F prob>F Constant 2 19.08 8.04 4.14 0.025 Error 102 197.96 1.94 Total 104 217.04 R-sq = 88.0% Assume you are testing the null hypothesis H0 : β1 = β2 = 0 at α = 0.05. State your conclusion for H0 . A) Since the F-value is greater than 0.05, do not reject H0 . There is not enough evidence to conclude that any of the predictors have an effect on sale price. B) Since the P-value is less than 0.05, reject H0 and conclude that at least one of the predictors has an effect on sale price. C) Since the P-value is less than 0.05, reject H0 and conclude that all of the predictors have an effect on sale price. D) Since R2 is large, reject H0 and conclude that at least one of the predictors has an effect on sale price. E) Since the P-value is less than 0.05, reject H0 and conclude that both of the predictors are highly correlated to sale price. Select the most appropriate answer. 15) Average resting heart rate for adults between the ages of 18 and 65 is to be estimated based on the person's gender (x 1 = 0 for female, 1 for male), the person's age (x 2 ) and their weight (x 3 ). What is the regression model for an adult male weighing 185 pounds? ^
A) y = α + β1 + β2 x 2 + 185β3 B) μ = α + β1 + β2 x 2 + 185β3 C) μ = α + β2 x 2 + 185β3 D) μ = α + β1 x 1 + β2 x 2 + β3 x 3 E) μ = α + β1 + 185β3
Copyright © 2017 Pearson Education, Inc. 5
15)
Determine whether the scatter plot shows little or no association, a negative association, a linear association, a moderately strong association, or a very strong association (multiple associations are possible). 16) 16)
A) Linear association, very strong association B) Positive association, linear association, very strong association C) Positive association, moderately strong association D) Linear association, moderately strong association E) Positive association, linear association Provide an appropriate response. 17) A computer utility was applied to data collected in a study of generalized health index as a function of weight/height ratio, cholesterol, and hours of sleep. A partial output is shown below Predictor Constant Weight/Height (lb/in x 1 )
Coef 65 2.4
SE Coef T P 6.431 7.32 0.000 1.495 0.12 0.765
Cholesterol x 2
0.06
0.235
0.21
0.549
Hours sleep x 3
1.4
0.199
8.27
0.000
R-sq = 93.1% Regression
F = 221.12
p = 0.000
Determine the proper conclusion about the use of the multiple regression equation. A) It is useful because of the high R2 value and low P-value. B) It is not useful because of the low R2 value and high P-value. C) It is not useful because of the high R2 value and low P-value. D) It is useful because of the low R2 value and low P-value. E) It is useful because of the low R2 value and high P-value.
Copyright © 2017 Pearson Education, Inc. 6
17)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 18) For the"house selling prices" data file, the table shows Minitab output for a regression model with y = selling price of home, x 1 = size of home, x 2 = number of bedrooms, and x 3 = region
18)
(1 = NW, 0 = other). Regression of selling price on size of home, bedrooms, and region Predictor Constant size Bedrooms region
Coef -1522 82.466 -6651 29651
SE Coef T P 17848 -0.09 0.932 7.573 10.89 0.000 6439 -1.03 0.304 7996 3.71 0.000
a. Report the prediction equation. By setting x 3 = 1 and then 0, construct the two separate equations for homes in the NW and for homes not in the NW. b. Explain how to interpret the coefficient of x 3 in the prediction equation. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 19) For a sample of 20 cars of a certain model, a car dealer records selling price, age of car, and mileage at time of sale. He performs initially a simple regression of selling price (y) on age of car (x) and obtains R2 = 0.78. He then performs a second regression in which he includes mileage as a second predictor and finds that R2
19)
increases by only a little to 0.82. What can we reasonably conclude? I. Mileage is highly positively correlated with age. If age is already included as a predictor, including mileage doesn't add much new power for predicting selling price. II. Mileage is not strongly correlated with selling price. III. A regression of selling price on mileage alone would have a very small R2 . A) II and III B) I, II, and III C) I and II D) I only 20) Which of the following graphs shows interaction between x1 and x2 in their effects on y? a)
b)
A) b B) a C) a and b D) neither a nor b E) cannot be determined from the information given
Copyright © 2017 Pearson Education, Inc. 7
20)
21) Which of the following graphs shows interaction between x1 and x2 in their effects on y? a)
21)
b)
A) a B) cannot be determined from the information given C) a and b D) b E) neither a nor b Select the most appropriate answer. 22) Which of the following is not true regarding R2 ? A) If R2 = 1 then each predicted y value is equal to the corresponding observed y value.
22)
B) The closer R2 is to 1, the better the explanatory variables collectively predict y. C) If R2 = 0 then each predicted y value is equal to the mean of the y values. D) Adding correlated variables to the model will not increase R2 . E) R2 is independent of the units of measurement. Provide an appropriate response. 23) Use the multiple regression equation given below to predict the y-value for the given values of the ^
independent variables. The regression line is y = 9.876 + 0.173 x 1 + 0.051 x 2 . Estimate y for x 1 = 48 and x 2 = 16. Round to the nearest thousandth. A) 18.996
B) 17.364
C) 9.120
D) 19.001
Copyright © 2017 Pearson Education, Inc. 8
E) 19.000
23)
24) A computer utility gives the following results of a multiple regression analysis. Find the regression equation. Predictor Constant x1
Coef SE Coef 8.882 0.299 0.201 0.022
T 18.667 5.228
x2
0.049
1.771
S = 0.437
R-sq = 88.1%
0.041
24)
^
A) y = 8.882 + 0.201 x 1 ^
B) y = 8.882 + 0.201 x 1 + 0.049 x 2 ^
C) y = 0.299 + 0.022 x 1 + 0.041 x 2 ^
D) y = 18.667+ 5.228 x 1 + 1.771 x 2 ^
E) y = 0.201 x 1 + 0.049 x 2 25) What factors affect the sale price of oceanside condominium units? To answer this question, the following data were recorded for each of the n = 105 units sold at auction: y = sale price (in thousands of dollars) x 1 = floor height (1, 2, 3, . . ., 8) x 2 = 1 if ocean view, 0 is bay view
^
The least squares prediction equation is y = 17.770 - 0.073 x 1 + 3.137 x 2 . Interpret the estimated effect of an ocean view. A) The average sale price for units with an ocean view is $20,907, holding floor height constant. B) The expected sale price increases by $3137 for units with an ocean view, holding floor height constant. C) The expected sale price increases by $3.14 for units with an ocean view, holding floor height constant. D) The correlation between sale price and view is 3.137. E) The expected sale price increases by $20,907 for units with an ocean view, holding floor height constant.
Copyright © 2017 Pearson Education, Inc. 9
25)
Answer Key Testname: CHAPTER 13 FORM A TEST
1) A 2) a. (i) 13.0 (ii) 14.0; b. The predicted number of years of education is 1 year more for males than for females. 3) B 4) B 5) A 6) A 7) A 8) D 9) The F test result tells us whether there is sufficient evidence to make it worthwhile to consider the individual effects. When there are many explanatory variables, doing the F test first provides protection from doing lots of t tests and having one of them be significant merely by random variation when, in fact, there truly are no effects in the population. 10) D 11) C 12) D 13) A 14) B 15) B 16) B 17) A ^
^
^
18) a. y = -1522 + 82.466x 1 - 6651x 2 + 29651x 3 ; NW: y = 28129 + 82.466x 1 - 6651x 2 ; other: y = -1522 + 82.466x 1 - 6651x 2 ; b. The predicted selling price of a home in the NW is $29,651 more than the predicted selling price of a home in other areas. 19) D 20) C 21) E 22) D 23) A 24) B 25) B
Copyright © 2017 Pearson Education, Inc. 10
CHAP[TER 13 FORM B TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) What factors affect the sale price of oceanside condominium units? To answer this question, the following data were recorded for each of the n = 105 units sold at auction: y = sale price (in thousands of dollars) x 1 = floor height (1, 2, 3, . . ., 8) x 2 = 1 if ocean view, 0 is bay view
1)
^
The least squares prediction equation is y = 17.770 - 0.073 x 1 + 3.137 x 2 . Interpret the estimated effect of floor height. A) Expected sale price decreases by $73 for each one floor increase in floor height for the same view type. B) The correlation between selling price and floor height is -0.073. C) Expected sale price decreases by $0.73 for each one floor increase in floor height for the same view type. D) Expected sale price increases by $0.73 for each one floor increase in floor height for the same view type. E) Expected sale price increases by $73 for each one floor increase in floor height for the same view type. 2) A computer utility was applied to data collected in a study of generalized health index as a function of weight/height ratio, cholesterol, and hours of sleep. A partial output is shown below Predictor Constant Weight/Height (lb/in x 1 )
Coef 65 2.4
SE Coef T P 6.431 7.32 0.000 1.495 0.12 0.765
Cholesterol x 2
0.06
0.235
0.21
0.549
Hours sleep x 3
1.4
0.199
8.27
0.000
R-sq = 93.1% Regression
F = 221.12
p = 0.000
Determine the proper conclusion about the use of the multiple regression equation. A) It is not useful because of the high R2 value and low P-value. B) It is useful because of the low R2 value and low P-value. C) It is useful because of the low R2 value and high P-value. D) It is useful because of the high R2 value and low P-value. E) It is not useful because of the low R2 value and high P-value.
Copyright © 2017 Pearson Education, Inc. 1
2)
3) A computer utility gives the following results of a multiple regression analysis. Find the regression equation. Predictor Constant x1
Coef SE Coef 8.882 0.299 0.201 0.022
T 18.667 5.228
x2
0.049
1.771
S = 0.437
R-sq = 88.1%
0.041
3)
^
A) y = 0.299 + 0.022 x 1 + 0.041 x 2 ^
B) y = 18.667+ 5.228 x 1 + 1.771 x 2 ^
C) y = 8.882 + 0.201 x 1 ^
D) y = 8.882 + 0.201 x 1 + 0.049 x 2 ^
E) y = 0.201 x 1 + 0.049 x 2 4) Use the multiple regression equation given below to predict the y-value for the given values of the ^
4)
independent variables. The regression line is y = 6101 + 31.8 x 1 - 0.589 x 2 . Estimate y for x 1 = 1517 and x 2 = 1558. Round to the nearest thousandth. A) 47322.938
B) 53455.738
C) 55259.262
D) 53,423.938
E) 49158.262
5) Use the following computer data, which refers to bear measurements, to answer the question. Dependent variable is Weight R-Sq = 96.9% Predictor Coef Constant -285.21 Age -1.3838 Head Width -11.24 Neck 28.594
SE Coef T 78.45 -3.64 0.9022 -1.53 20.88 -0.54 5.870 4.87
P 0.022 0.200 0.619 0.007
Which measurement is the best predictor of weight, after allowing for the linear effects of the other variables in the model? A) Neck B) Head Width C) Constant D) Sex E) Age
Copyright © 2017 Pearson Education, Inc. 2
5)
6) What factors affect the sale price of oceanside condominium units? To answer this question, the following data were recorded for each of the n = 105 units sold at auction: y = sale price (in thousands of dollars) x 1 = floor height (1, 2, 3, . . ., 8)
6)
x 2 = 1 if ocean view, 0 is bay view Source DF SS MS F prob>F Constant 2 19.08 8.04 4.14 0.025 Error 102 197.96 1.94 Total 104 217.04 R-sq = 88.0% Assume you are testing the null hypothesis H0 : β1 = β2 = 0 at α = 0.05. State your conclusion for H0 . A) Since R2 is large, reject H0 and conclude that at least one of the predictors has an effect on sale price. B) Since the P-value is less than 0.05, reject H0 and conclude that at least one of the predictors has an effect on sale price. C) Since the P-value is less than 0.05, reject H0 and conclude that all of the predictors have an effect on sale price. D) Since the P-value is less than 0.05, reject H0 and conclude that both of the predictors are highly correlated to sale price. E) Since the F-value is greater than 0.05, do not reject H0 . There is not enough evidence to conclude that any of the predictors have an effect on sale price. 7) A researcher found a significant relationship between a person's age, x 1 , the number of hours a person works
7)
per week, x 2 , and the number of accidents, y, the person has per year. The relationship can be represented by ^
the multiple regression equation y = -3.2 + 0.012x 1 + 0.23x 2 . Predict the number of accidents per year (to the nearest whole number) for a person aged 41 who works 31 hours per week. A) 3 B) 5 C) 7 D) 6
E) 4
8) For a sample of 20 cars of a certain model, a car dealer records selling price, age of car, and mileage at time of sale. He performs initially a simple regression of selling price (y) on age of car (x) and obtains R2 = 0.78. He then performs a second regression in which he includes mileage as a second predictor and finds that R2 increases by only a little to 0.82. What can we reasonably conclude? I. Mileage is highly positively correlated with age. If age is already included as a predictor, including mileage doesn't add much new power for predicting selling price. II. Mileage is not strongly correlated with selling price. III. A regression of selling price on mileage alone would have a very small R2 . A) II and III B) I only C) I and II D) I, II, and III SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 9) Suppose that the prediction equation relating number of years of education to gender (x 1 = 1 for ^
males, x 1 = 0 for females) is y = 13.0 + 1.0 x 1 . a. Find the estimated mean education for (i) females, (ii) males. b. Explain how to interpret the coefficient of the indicator variable.
Copyright © 2017 Pearson Education, Inc. 3
9)
8)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 10) A family doctor wants to examine the variables that affect his female patients' total cholesterol. He randomly selects 14 of his female patients and asks them to determine their average daily consumption of saturated fat. The output from MINITAB gives the following regression analysis: Predictor Coef SE Coef Constant 90.84 15.99 AGE 1.0142 0.2427 FAT 3.2443 0.6632 R-Sq = 84.7%
T 5.68 4.18 4.89
10)
P 0.000 0.002 0.000
Interpret the coefficient for age. A) For every 1 unit increase in total cholesterol, age is expected to be 1.0142 years greater, holding average daily consumption of saturated fat constant. B) For every 1 year increase in age, total cholesterol is expected to increase by 0.2427, holding average daily consumption of saturated fat constant. C) For every 1 year increase in age, total cholesterol is expected to increase by 91.8542, holding average daily consumption of saturated fat constant. D) For every 1 year increase in age, total cholesterol is expected to increase by 90.84, holding average daily consumption of saturated fat constant. E) For every 1 year increase in age, total cholesterol is expected to increase by 1.0142, holding average daily consumption of saturated fat constant. 11) A logistic regression model was fit to a random sample of Americans of voting age to describe the relationship between the probability of voting in the 2004 presidential election, p, and a person's age, x. Part of the logistic regression analysis from Minitab follows.
11)
Predictor Coef SE Coef Z P Constant -0.811552 0.200876 -4.04 0.000 Age 0.0251255 0.0047960 5.24 0.000 At what age does the estimated probability of voting in the 2004 presidential election equal 0.50? Round your answer to the nearest year. A) 42 B) 32 C) 31 D) 33 E) None of these 12) Use the multiple regression equation given below to predict the y-value for the given values of the ^
independent variables. The regression line is y = 9.876 + 0.173 x 1 + 0.051 x 2 . Estimate y for x 1 = 48 and x 2 = 16. Round to the nearest thousandth. A) 19.001
B) 19.000
C) 18.996
D) 9.120
Copyright © 2017 Pearson Education, Inc. 4
E) 17.364
12)
13) Suppose you fit the multiple regression model μy = α + β1 x 1 + β2 x 2 to n = 28 data values and obtain the
13)
^
prediction equation y = 4 + 1.57 x 1 + 1.15 x 2 . The estimates of the standard deviations of the sampling ^
^
distributions of β 1 and β2 are 1.96 and 0.63, respectively. Calculate the test statistic and P-value for testing H0 : β 2 = 0 against Ha: β 2 ≠ 0. A) t = 1.83; 0.025 < P-value < 0.05 B) t = 0.80; P-value > 0.20 C) t = 9.66; P-value< 0.002 D) t = 0.80; P-value > 0.10 E) t = 1.83; 0.05 < P-value < 0.10 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 14) Explain why the overall F test is done before the individual t tests in a multiple regression analysis.
14)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 15) What factors affect the sale price of oceanside condominium units? To answer this question, the following data were recorded for each of the n = 105 units sold at auction: y = sale price (in thousands of dollars) x 1 = floor height (1, 2, 3, . . ., 8) x 2 = 1 if ocean view, 0 is bay view
15)
^
The least squares prediction equation is y = 17.770 - 0.073 x 1 + 3.137 x 2 . Interpret the estimated effect of an ocean view. A) The expected sale price increases by $3.14 for units with an ocean view, holding floor height constant. B) The average sale price for units with an ocean view is $20,907, holding floor height constant. C) The correlation between sale price and view is 3.137. D) The expected sale price increases by $3137 for units with an ocean view, holding floor height constant. E) The expected sale price increases by $20,907 for units with an ocean view, holding floor height constant. 16) Use the following computer data, which refers to bear measurements, to answer the question. Dependent variable is Weight R-Sq = 96.9% Predictor Coef Constant -285.21 Age -1.3838 Head Width -11.24 Neck 28.594
SE Coef T 78.45 -3.64 0.9022 -1.53 20.88 -0.54 5.870 4.87
P 0.022 0.200 0.619 0.007
Write the equation of the regression model. A) Weight = -285.21 + 1.3838 Age + 11.24 Head Width + 28.594 Neck B) Weight = -285.21 + 78.45 Age - 3.64 Head Width + 0.022 Neck C) Weight = -285.21 - 1.3838 Age - 11.24 Head Width + 28.594 Neck D) Weight = 78.45 + 0.9022 Age + 20.88 Head Width + 5.870 Neck E) Weight = 78.45 - 0.9022 Age - 20.88 Head Width + 5.870 Neck
Copyright © 2017 Pearson Education, Inc. 5
16)
17) Which of the following graphs shows interaction between x1 and x2 in their effects on y? a)
17)
b)
A) neither a nor b B) b C) cannot be determined from the information given D) a and b E) a 18) Which of the following graphs shows interaction between x1 and x2 in their effects on y? a)
b)
A) b B) cannot be determined from the information given C) a and b D) a E) neither a nor b
Copyright © 2017 Pearson Education, Inc. 6
18)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 19) For the"house selling prices" data file, the table shows Minitab output for a regression model with y = selling price of home, x 1 = size of home, x 2 = number of bedrooms, and x 3 = region
19)
(1 = NW, 0 = other). Regression of selling price on size of home, bedrooms, and region Predictor Constant size Bedrooms region
Coef -1522 82.466 -6651 29651
SE Coef T P 17848 -0.09 0.932 7.573 10.89 0.000 6439 -1.03 0.304 7996 3.71 0.000
a. Report the prediction equation. By setting x 3 = 1 and then 0, construct the two separate equations for homes in the NW and for homes not in the NW. b. Explain how to interpret the coefficient of x 3 in the prediction equation. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 20) A regression equation was fit relating a student's weight to the number of hours they spent on the computer, x 1 , the number of hours they spent watching television, x 2 , and the number of hours they spent on the
20)
^
telephone, x 3 . The fitted equation is y = 125.5 + 3.26 x 1 +4.27 x 2 + 1.75 x 3 . What is a student's predicted weight if the student spends 4 hours on the computer, 2 hours viewing television, and 3 hours on the telephone? A) 153.22 B) 155.86 C) 151.83 D) 154.35 E) 152.33 Determine whether the scatter plot shows little or no association, a negative association, a linear association, a moderately strong association, or a very strong association (multiple associations are possible). 21) 21)
A) Positive association, moderately strong association B) Positive association, linear association C) Linear association, moderately strong association D) Linear association, very strong association E) Positive association, linear association, very strong association
Copyright © 2017 Pearson Education, Inc. 7
22)
22)
A) Moderately strong association B) Little or no association C) Linear association, moderately strong association D) Positive association E) Positive association, linear association, moderately strong association Select the most appropriate answer. 23) Average resting heart rate for adults between the ages of 18 and 65 is to be estimated based on the person's gender (x 1 = 0 for female, 1 for male), the person's age (x 2 ) and their weight (x 3 ). What is the regression
23)
model for an adult male weighing 185 pounds? A) μ = α + β1 x 1 + β2 x 2 + β3 x 3 B) μ = α + β2 x 2 + 185β3 C) μ = α + β1 + β2 x 2 + 185β3 D) μ = α + β1 + 185β3 ^
E) y = α + β1 + β2 x 2 + 185β3 24) Which of the following is not true regarding R2 ?
24)
A) Adding correlated variables to the model will not increase R2 . B) If R2 = 0 then each predicted y value is equal to the mean of the y values. C) The closer R2 is to 1, the better the explanatory variables collectively predict y. D) R2 is independent of the units of measurement. E) If R2 = 1 then each predicted y value is equal to the corresponding observed y value. 25) In order to predict the length of time a house will remain on the market until it is sold in a particular county, the selling price (x 1 ), square footage (x 2 ) and whether or not the house is in the city (x 3 = 1 for city and 0 otherwise) were recorded for a random sample of 250 homes. What is the regression model for a house that is located in the city? A) μ = α + β3 x 3 B) μ = α + β1 x 1 + β2 x 2 + β3 C) μ = α + β1 x 1 + β2 x 2 ^
D) y = α + β1 x 1 + β2 x 2 + β3 E) μ = α + β1 x 1 + β2 x 2 + β3 x 3
Copyright © 2017 Pearson Education, Inc. 8
25)
Answer Key Testname: CHAPTER 13 FORM B TEST
1) A 2) D 3) D 4) D 5) A 6) B 7) E 8) B 9) a. (i) 13.0 (ii) 14.0; b. The predicted number of years of education is 1 year more for males than for females. 10) E 11) B 12) C 13) E 14) The F test result tells us whether there is sufficient evidence to make it worthwhile to consider the individual effects. When there are many explanatory variables, doing the F test first provides protection from doing lots of t tests and having one of them be significant merely by random variation when, in fact, there truly are no effects in the population. 15) D 16) C 17) A 18) C ^
^
^
19) a. y = -1522 + 82.466x 1 - 6651x 2 + 29651x 3 ; NW: y = 28129 + 82.466x 1 - 6651x 2 ; other: y = -1522 + 82.466x 1 - 6651x 2 ; b. The predicted selling price of a home in the NW is $29,651 more than the predicted selling price of a home in other areas. 20) E 21) E 22) E 23) C 24) A 25) B
Copyright © 2017 Pearson Education, Inc. 9
CHAPTER 14 FORM A TEST Name___________________________________
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) Show how to construct a multiple regression model for the analysis of mean income over a three-way classification of gender (male, female), race (white, black), and type of job (blue-collar, white-collar, service). Interpret the parameters in the model. Assume that there is no interaction.
1)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. State the null hypothesis to test for independence. 2) A golf organization wants to know which brand of golf ball goes the farthest when hit by a driver. Three brands of golf balls are compared by using a well-known golf instructor to randomly hit each brand of ball five times. The total distance in yards is measured after each ball comes to rest. A) H0 : μ1 ≠ μ2 ≠ μ3 vs. the alternative that all brand means are equal
2)
B) H0 : μ1 = μ2 = μ3 = μ4 = μ5 vs. the alternative that not all hit means are equal C) H0 : μ1 = μ2 = μ3 vs. the alternative that not all hit means are equal D) H0 : μ1 = μ2 = μ3 vs. the alternative that not all brand means are equal E) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 ≠ μ5 vs. the alternative that all hit means are equal Determine the degrees of freedom. 3) An independent research firm conducts a study to compare the taste of four new sports drinks. Five people are randomly assigned to each of the four drinks. Each person tastes the drink and judges it on a scale from one to five. How many degrees of freedom (df1 ) does the group sum of squares have? How many degrees
3)
of freedom (df2 ) for the error sum of squares? A) df1 = 3; df2 = 16 B) df1 = 16; df2 = 3 C) df1 = 3; df2 = 17 D) df1 = 4; df2 = 17 E) df1 = 4; df2 = 16 Using the table, find the F-value for the numerator and denominator degrees of freedom. Use α = 0.05. 4) Using the table, find F-value for df1 = 3 and df2 = 12 for α = 0.05.
4)
A) 3.59 B) none of these C) 3.29 D) 8.74 E) 3.49 5) Using the table, find F-value for df1 = 6 and df2 = 20 for α = 0.05. A) 2.60
B) 2.85
C) 2.63
D) 2.47
Copyright © 2017 Pearson Education, Inc. 1
5) E) 2.74
6) Using the table, find F-value for df1 = 12 and df2 = 15 for α = 0.05.
6)
A) 2.53 B) none of these C) 2.48 D) 2.69 E) 2.13 Select the most appropriate answer. 7) Which of the following are assumptions of the analysis of variance F test for comparing population means of several groups? A) The random samples drawn from each group are independent. B) Each group has a normal population distribution. C) Each group has the same standard deviation. D) All of these. E) None of these. 8) If the average crop yields are to be compared among 4 different varieties of seeds and 2 different irrigation techniques, which of the following methods of analysis seems most appropriate? A) a one-way ANOVA B) a two-way ANOVA C) a logistic regression analysis D) a factorial ANOVA E) a linear regression analysis
Copyright © 2017 Pearson Education, Inc. 2
7)
8)
Use the MiniTab display. 9) A manager records the production output of three employees who each work on three different machines for three different days. The sample results are given below and the Minitab results follow.
Machine
I II III
A 23, 27, 29 25, 26, 24 28, 25, 26
Employee B 30, 27, 25 24, 29, 26 25, 27, 23
C 18, 20, 22 19, 16, 14 15, 11, 17
ANALYSIS OF VARIANCE ITEMS SOURCE DF MACHINE 2 EMPLOYEE 2 INTERACTION 4 ERROR 18 TOTAL 26
SS 34.67 504.67 26.67 98.00 664.00
MS 17.33 252.33 6.67 5.44
Using a 0.05 significance level, test the claim that the interaction between employee and machine has no effect on the number of items produced. State the null hypothesis. A) H0 : There is no interaction effect. B) None of these. C) Ha : There is no interaction effect. D) Ha : There is an interaction effect. E) H0 : There is an interaction effect. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 10) The null hypothesis of equality of means for a factor is rejected in a two-way ANOVA. Does this imply that the hypothesis will be rejected in a one-way ANOVA F test, if the data are collapsed over the levels of the second variable? Explain.
Copyright © 2017 Pearson Education, Inc. 3
10)
9)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 11) A golf organization wants to know which brand of golf ball goes the farthest when hit by a driver. Three brands of golf balls are compared by using a well-known golf instructor to randomly hit each brand of ball five times. The total distance in yards is measured after each ball comes to rest. Calculate the F-ratio. Can you reject H0 using α = 0.05? The ANOVA table is listed below.
11)
Analysis of Variance Source SS df MS F-ratio Brand 1405.375 2 702.688 Error 1467.556 12 122.296 Total 2872.931 A) F-Ratio = 5.746; Do not reject H0 B) F-Ratio = 0.174; Reject H0 C) F-Ratio = 0.174; Do not reject H0 D) F-Ratio = 5.746; Reject H0 E) F-Ratio = 0.958; Do not reject H0 State the null hypothesis to test for independence. 12) An industrial psychologist is investigating the effects of work environment on employee attitudes. A group of 20 recently hired sales trainees were randomly assigned to one of four different "home rooms" - five trainees per room. Each room is identical except for wall color. The four colors used were light green, light blue, gray and red. The psychologist wants to know whether room color has an effect on attitude, and, if so, wants to compare the mean attitudes of the trainees assigned to the four room colors. At the end of the training program, the attitude of each trainee was measured on a 60-pt. scale (the lower the score, the poorer the attitude). The data was subjected to a one-way analysis of variance. ONE-WAY ANOVA FOR ATTITUDE BY COLOR SOURCE DF SS MS F P GROUP 3 1678.15 559.3833 59.03782 0.0000 ERROR 16 151.6 9.475 TOTAL 19 1829.75 Give the null hypothesis A) H0 : μ1 = μ2 = μ3 = μ4 = μ5 , where the μi represent attitude means for the ith person in each room B) H0 : μgreen = μblue = μgray = μred , where the μ's represent mean attitudes for the four rooms C) H0 : x 1 = x 2 = x 3 = x 4 , where the x's represent the room colors D) H0 : p green = p blue = p gray = p red , where the p's represent the proportion with the corresponding attitude E) none of these
Copyright © 2017 Pearson Education, Inc. 4
12)
Provide an appropriate response. 13) A birdhunterʹs society is interested in knowing whether three different brands of turkey calls are equally effective in attracting gobblers. Each turkey call is randomly assigned to five hunters, and the order of each experiment is also randomized. An experiment consists of a hunter using one brand three times and waiting fifteen minutes. The number of gobblers attracted by the calls is counted. An ANOVA F-test rejected the hypothesis of equal means. Each pair of means were then compared, and the 95% confidence intervals are shown below:
13)
A) 1 vs. 2 (-2.620, -0.180) B) 1 vs. 3 (-0.661, 1.461) C) 2 vs. 3 (0.739, 2.861) Which tests show a significant difference between the means? A) A B) A,C C) B
D) none
E) C
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 14) Discuss the robustness of the ANOVA F test.
14)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. State the null hypothesis to test for independence. 15) A chemist is interested in comparing four different catalysts by measuring reaction times. Each catalyst is tested five times, with each experiment randomly assigned one of the four catalysts. A) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 ≠ μ5 vs. the alternative that all the test means are equal
15)
B) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 vs. the alternative that all the catalyst means are equal C) H0 : μ1 = μ2 = μ3 = μ4 vs. the alternative that not all the test means are equal D) H0 : μ1 = μ2 = μ3 = μ4 vs. the alternative that not all the catalyst means are equal E) H0 : μ1 = μ2 = μ3 = μ4 = μ5 vs. the alternative that not all the test means are equal 16) A telemarketing company wants to compare the effectiveness of five of its employees by looking at the percentage of survey questions completed. Data from four past surveys are randomly chosen for each of the employees, and the percentage of completed questions is calculated. A) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 vs. the alternative that all survey means are equal B) H0 : μ1 = μ2 = μ3 = μ4 vs. the alternative that not all employee means are equal C) H0 : μ1 = μ2 = μ3 = μ4 vs. the alternative that not all survey means are equal D) H0 : μ1 = μ2 = μ3 = μ4 = μ5 vs. the alternative that not all employee means are equal E) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 ≠ μ5 vs. the alternative that all employee means are equal
Copyright © 2017 Pearson Education, Inc. 5
16)
Provide an appropriate response. 17) An industrial engineer wants to test the effect of three different ways of assembling a part on the total assembly time. Five people are randomly assigned to each of the three assembly methods, and the total assembly time (in seconds) is recorded. Calculate the F-ratio. Can you reject H0 using α = 0.05? The
17)
ANOVA table is listed below. Analysis of Variance Source SS df Method 985535.907 2 Error 276844.275 12 Total 1262380.182 A) F-Ratio = 3.560; Reject H0
MS 492767.954 23070.356
F-ratio
B) F-Ratio = 21.359; Do not reject H0 C) F-Ratio = 0.047; Don not reject H0 D) F-Ratio = 21.359; Reject H0 E) F-Ratio = 0.047; Reject H0 Select the most appropriate answer. 18) If the average weight losses are to be compared among 3 different diets, which of the following methods of analysis seems most appropriate? A) a two-way ANOVA B) a logistic regression analysis C) a three-way ANOVA D) a one-way ANOVA E) a linear regression analysis Provide an appropriate response. 19) In a one-factor ANOVA having three group levels with five observations in each group, the between group degrees of freedom is equal to A) 4 B) 15 C) 2 D) 10 E) 12 20) A printing company is interested in comparing the speed of four different desktop printers; four different documents are used for the experiment. Each test consists of randomly assigning one printer to one of the documents. The document is printed and the time to print in seconds is recorded. Each printer/document combination is tested. The F-value from an ANOVA is 6.173. What is the F value having a P-value of 0.05? A) 6.39 B) 3.49 C) 3.24 D) 3.59 E) 3.01
Copyright © 2017 Pearson Education, Inc. 6
18)
19)
20)
Use the MiniTab display. 21) A manager records the production output of three employees who each work on three different machines for three different days. The sample results are given below and the Minitab results follow.
Machine
I II III
Employee A B 31, 34, 32 29, 23, 22 19, 26, 22 35, 33, 30 21, 18, 26 20, 23, 24
21)
C 21, 20, 24 25, 19, 23 36, 37, 31
ANALYSIS OF VARIANCE ITEMS SOURCE MACHINE EMPLOYEE INTERACTION ERROR TOTAL
DF 2 2 4 18 26
SS 1.19 5.85 710.81 160.00 877.85
MS .59 2.93 177.70 8.89
Using a 0.05 significance level, test the claim that the interaction between employee and machine has no effect on the number of items produced. Calculate the F-statistic for this test. Round your answer to four decimal places. A) 4.4426 B) 2.9300 C) 0.0198 D) 0.2014 E) 19.9888 Provide an appropriate response. 22) A chemist is interested in comparing four different catalysts by measuring reaction times. Each catalyst is tested five times, with each experiment randomly assigned one of the four catalysts. Calculate the F-ratio. Can you reject H0 using α = 0.05? The ANOVA table is listed below. Analysis of Variance Source SS df MS F-ratio Catalyst 9.310 3 3.103 Error 79.320 16 4.958 Total 88.630 A) F-Ratio = 1.598; Do not reject H0 B) F-Ratio = 1.598; Reject H0 C) F-Ratio = 0.626; Do not reject H0 D) F-Ratio = 0.626; Reject H0 E) F-Ratio = 0.117; Do not reject H0
Copyright © 2017 Pearson Education, Inc. 7
22)
Given below are the analysis of variance results from a Minitab display. Assume that you want to use a 0.05 significance level in testing the null hypothesis that the different samples come from populations with the same mean. 23) 23) Source DF SS MS F p Factor 3 30 10.00 1.6 0.264 Error 8 50 6.25 Total 11 80 What can you conclude about the equality of the population means? A) Reject the null hypothesis since the p-value is greater than the significance level. We conclude that all of the factor means differ. B) Do not reject the null hypothesis since the p-value is greater than the significance level. There is not enough evidence to show that the factor means are unequal. C) Reject the null hypothesis since the p-value is greater than the significance level. We conclude that at least two of the factor means differ. D) No conclusion can be made. E) Do not reject the null hypothesis since the p-value is greater than the significance level. We conclude that the factor means are equal. Use the MiniTab display. 24) A manager records the production output of three employees who each work on three different machines for three different days. The sample results are given below and the Minitab results follow.
I Machine II III
A 23, 27, 29 25, 26, 24 28, 25, 26
Employee B 30, 27, 25 24, 29, 26 25, 27, 23
C 18, 20, 22 19, 16, 14 15, 11, 17
ANALYSIS OF VARIANCE ITEMS SOURCE MACHINE EMPLOYEE INTERACTION ERROR TOTAL
DF 2 2 4 18 26
SS 34.67 504.67 26.67 98.00 664.00
MS 17.33 252.33 6.67 5.44
Assume that the number of items produced is not affected by an interaction between employee and machine. Using a 0.05 significance level, test the claim that the choice of employee has no effect on the number of items produced. Calculate the F-statistic for this test. Round your answer to four decimal places. A) 3.1857 B) 1.2261 C) 0.3538 D) 3.5500 E) 46.3842
Copyright © 2017 Pearson Education, Inc. 8
24)
25) A manager records the production output of three employees who each work on three different machines for three different days. The sample results are given below and the Minitab results follow.
Machine
I II III
Employee A B 23, 27, 29 30, 27, 25 25, 26, 24 24, 29, 26 28, 25, 26 25, 27, 23
C 18, 20, 22 19, 16, 14 15, 11, 17
ANALYSIS OF VARIANCE ITEMS SOURCE MACHINE EMPLOYEE INTERACTION ERROR TOTAL
DF 2 2 4 18 26
SS 34.67 504.67 26.67 98.00 664.00
MS 17.33 252.33 6.67 5.44
Assume that the number of items produced is not affected by an interaction between employee and machine. Using a 0.05 significance level, test the claim that the machine has no effect on the number of items produced. Calculate the F-statistic for this test. Round your answer to four decimal places. A) 46.3842 B) 2.5982 C) 3.5500 D) 0.3538 E) 3.1857
Copyright © 2017 Pearson Education, Inc. 9
25)
Answer Key Testname: CHAPTER 14 FORM A TEST
1) a. We need 4 indicator variables to indicate the two genders, the two races, and the three types of jobs. The first indicator variable is x 1 = 1 for female = 0 otherwise. The second indicator variable is x 2 = 1 for black = 0 otherwise. The third indicator variable is x 3 = 1 for white-collar = 0 otherwise. The fourth indicator variable is x 4 = 1 for service = 0 otherwise. With these indicator variables, the multiple regression equation for the mean income is μy = α + β 1 x 1 + β 2 x 2 + β 3 x 3 + β 4 x 4 . α is the mean for male, white, blue collar workers; β1 is added to the mean if the worker is female; β2 is added to the mean if the worker is black; β 3 is added to the mean if the worker is white-collar and β 4 is added to the mean if the worker is a service worker. 2) D 3) A 4) E 5) A 6) C 7) D 8) B 9) A 10) No. Once the second factor is no longer controlled, the effect of the first factor on the response variable could change because the residual variation could change. 11) D 12) B 13) B 14) Since the ANOVA F test is robust to moderate breakdowns in the population normality and equal standard deviation assumptions, in practice it is used unless (i) graphical methods show extreme skew for the response variable or (ii) the largest group standard deviation is more than about double the smallest group standard deviation and the sample sizes are unequal. 15) D 16) D 17) D 18) D 19) C 20) B 21) E 22) C 23) B 24) E 25) E
Copyright © 2017 Pearson Education, Inc. 10
CHAPTER 14 FORM B TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. State the null hypothesis to test for independence. 1) An industrial psychologist is investigating the effects of work environment on employee attitudes. A group of 20 recently hired sales trainees were randomly assigned to one of four different ʺhome roomsʺ - five trainees per room. Each room is identical except for wall color. The four colors used were light green, light blue, gray and red. The psychologist wants to know whether room color has an effect on attitude, and, if so, wants to compare the mean attitudes of the trainees assigned to the four room colors. At the end of the training program, the attitude of each trainee was measured on a 60-pt. scale (the lower the score, the poorer the attitude). The data was subjected to a one -way analysis of variance.
1)
ONE-WAY ANOVA FOR ATTITUDE BY COLOR SOURCE DF SS MS F P GROUP 3 1678.15 559.3833 59.03782 0.0000 ERROR 16 151.6 9.475 TOTAL 19 1829.75 Give the null hypothesis A) H0 : x1 = x2 = x3 = x4 , where the xʹs represent the room colors B) none of these C) H0 : μgreen = μblue = μgray = μred, where the μʹs represent mean attitudes for the four rooms D) H0 : pgreen = pblue = pgray = pred, where the pʹs represent the proportion with the corresponding attitude E) H0 : μ1 = μ2 = μ3 = μ4 = μ5 , where the μi represent attitude means for the ith person in each room 2) A golf organization wants to know which brand of golf ball goes the farthest when hit by a driver. Three brands of golf balls are compared by using a well-known golf instructor to randomly hit each brand of ball five times. The total distance in yards is measured after each ball comes to rest. A) H0 : μ1 ≠ μ2 ≠ μ3 vs. the alternative that all brand means are equal
2)
B) H0 : μ1 = μ2 = μ3 vs. the alternative that not all brand means are equal C) H0 : μ1 = μ2 = μ3 vs. the alternative that not all hit means are equal D) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 ≠ μ5 vs. the alternative that all hit means are equal E) H0 : μ1 = μ2 = μ3 = μ4 = μ5 vs. the alternative that not all hit means are equal 3) A chemist is interested in comparing four different catalysts by measuring reaction times. Each catalyst is tested five times, with each experiment randomly assigned one of the four catalysts. A) H0 : μ1 = μ2 = μ3 = μ4 vs. the alternative that not all the test means are equal B) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 vs. the alternative that all the catalyst means are equal C) H0 : μ1 = μ2 = μ3 = μ4 vs. the alternative that not all the catalyst means are equal D) H0 : μ1 = μ2 = μ3 = μ4 = μ5 vs. the alternative that not all the test means are equal E) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 ≠ μ5 vs. the alternative that all the test means are equal
Copyright © 2017 Pearson Education, Inc. 1
3)
4) A telemarketing company wants to compare the effectiveness of five of its employees by looking at the percentage of survey questions completed. Data from four past surveys are randomly chosen for each of the employees, and the percentage of completed questions is calculated. A) H0 : μ1 = μ2 = μ3 = μ4 vs. the alternative that not all survey means are equal
4)
B) H0 : μ1 = μ2 = μ3 = μ4 = μ5 vs. the alternative that not all employee means are equal C) H0 : μ1 = μ2 = μ3 = μ4 vs. the alternative that not all employee means are equal D) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 ≠ μ5 vs. the alternative that all employee means are equal E) H0 : μ1 ≠ μ2 ≠ μ3 ≠ μ4 vs. the alternative that all survey means are equal Using the table, find the F-value for the numerator and denominator degrees of freedom. Use α = 0.05. 5) Using the table, find F-value for df1 = 3 and df2 = 12 for α = 0.05.
5)
A) none of these B) 3.29 C) 3.59 D) 3.49 E) 8.74 6) Using the table, find F-value for df1 = 6 and df2 = 20 for α = 0.05.
6)
A) 2.60 B) none of these C) 3.84 D) 2.85 E) 2.63 7) Using the table, find F-value for df1 = 12 and df2 = 15 for α = 0.05. A) 2.48 B) none of these C) 2.69 D) 2.53 E) 2.13
Copyright © 2017 Pearson Education, Inc. 2
7)
Use the MiniTab display. 8) A manager records the production output of three employees who each work on three different machines for three different days. The sample results are given below and the Minitab results follow.
Machine
I II III
A 31, 34, 32 19, 26, 22 21, 18, 26
Employee B 29, 23, 22 35, 33, 30 20, 23, 24
8)
C 21, 20, 24 25, 19, 23 36, 37, 31
ANALYSIS OF VARIANCE ITEMS SOURCE MACHINE EMPLOYEE INTERACTION ERROR TOTAL
DF 2 2 4 18 26
SS 1.19 5.85 710.81 160.00 877.85
MS .59 2.93 177.70 8.89
Using a 0.05 significance level, test the claim that the interaction between employee and machine has no effect on the number of items produced. State the null hypothesis. A) None of these. B) Ha : There is no interaction effect. C) H0 : There is an interaction effect. D) Ha : There is an interaction effect. E) H0 : There is no interaction effect. 9) A manager records the production output of three employees who each work on three different machines for three different days. The sample results are given below and the Minitab results follow.
I Machine II III
A 23, 27, 29 25, 26, 24 28, 25, 26
Employee B 30, 27, 25 24, 29, 26 25, 27, 23
C 18, 20, 22 19, 16, 14 15, 11, 17
ANALYSIS OF VARIANCE ITEMS SOURCE MACHINE EMPLOYEE INTERACTION ERROR TOTAL
DF 2 2 4 18 26
SS 34.67 504.67 26.67 98.00 664.00
MS 17.33 252.33 6.67 5.44
Assume that the number of items produced is not affected by an interaction between employee and machine. Using a 0.05 significance level, test the claim that the choice of employee has no effect on the number of items produced. Calculate the F-statistic for this test. Round your answer to four decimal places. A) 1.2261 B) 0.3538 C) 46.3842 D) 3.5500 E) 3.1857
Copyright © 2017 Pearson Education, Inc. 3
9)
10) A manager records the production output of three employees who each work on three different machines for three different days. The sample results are given below and the Minitab results follow.
Machine
I II III
Employee A B 23, 27, 29 30, 27, 25 25, 26, 24 24, 29, 26 28, 25, 26 25, 27, 23
10)
C 18, 20, 22 19, 16, 14 15, 11, 17
ANALYSIS OF VARIANCE ITEMS SOURCE MACHINE EMPLOYEE INTERACTION ERROR TOTAL
DF 2 2 4 18 26
SS 34.67 504.67 26.67 98.00 664.00
MS 17.33 252.33 6.67 5.44
Assume that the number of items produced is not affected by an interaction between employee and machine. Using a 0.05 significance level, test the claim that the machine has no effect on the number of items produced. Calculate the F-statistic for this test. Round your answer to four decimal places. A) 0.3538 B) 46.3842 C) 2.5982 D) 3.5500 E) 3.1857 Provide an appropriate response. 11) A birdhunterʹs society is interested in knowing whether three different brands of turkey calls are equally effective in attracting gobblers. Each turkey call is randomly assigned to five hunters, and the order of each experiment is also randomized. An experiment consists of a hunter using one brand three times and waiting fifteen minutes. The number of gobblers attracted by the calls is counted. An ANOVA F-test rejected the hypothesis of equal means. Each pair of means were then compared, and the 95% confidence intervals are shown below: A) 1 vs. 2 (-2.620, -0.180) B) 1 vs. 3 (-0.661, 1.461) C) 2 vs. 3 (0.739, 2.861) Which tests show a significant difference between the means? A) C B) A,C C) A
D) none
Copyright © 2017 Pearson Education, Inc. 4
E) B
11)
Use the MiniTab display. 12) A manager records the production output of three employees who each work on three different machines for three different days. The sample results are given below and the Minitab results follow.
Machine
I II III
Employee A B 31, 34, 32 29, 23, 22 19, 26, 22 35, 33, 30 21, 18, 26 20, 23, 24
12)
C 21, 20, 24 25, 19, 23 36, 37, 31
ANALYSIS OF VARIANCE ITEMS SOURCE MACHINE EMPLOYEE INTERACTION ERROR TOTAL
DF 2 2 4 18 26
SS 1.19 5.85 710.81 160.00 877.85
MS .59 2.93 177.70 8.89
Using a 0.05 significance level, test the claim that the interaction between employee and machine has no effect on the number of items produced. Calculate the F-statistic for this test. Round your answer to four decimal places. A) 4.4426 B) 19.9888 C) 0.2014 D) 0.0198 E) 2.9300 Provide an appropriate response. 13) A printing company is interested in comparing the speed of four different desktop printers; four different documents are used for the experiment. Each test consists of randomly assigning one printer to one of the documents. The document is printed and the time to print in seconds is recorded. Each printer/document combination is tested. The F-value from an ANOVA is 6.173. What is the F value having a P-value of 0.05? A) 3.59 B) 3.01 C) 6.39 D) 3.49 E) 3.24
13)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 14) Discuss the robustness of the ANOVA F test.
14)
15) Show how to construct a multiple regression model for the analysis of mean income over a three-way classification of gender (male, female), race (white, black), and type of job (blue-collar, white-collar, service). Interpret the parameters in the model. Assume that there is no interaction.
15)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 16) In a one-factor ANOVA having three group levels with five observations in each group, the between group degrees of freedom is equal to A) 10 B) 2 C) 15 D) 12 E) 4
Copyright © 2017 Pearson Education, Inc. 5
16)
17) A golf organization wants to know which brand of golf ball goes the farthest when hit by a driver. Three brands of golf balls are compared by using a well-known golf instructor to randomly hit each brand of ball five times. The total distance in yards is measured after each ball comes to rest. Calculate the F-ratio. Can you reject H0 using α = 0.05? The ANOVA table is listed below.
17)
Analysis of Variance Source SS df MS F-ratio Brand 1405.375 2 702.688 Error 1467.556 12 122.296 Total 2872.931 A) F-Ratio = 0.174; Do not reject H0 B) F-Ratio = 0.958; Do not reject H0 C) F-Ratio = 0.174; Reject H0 D) F-Ratio = 5.746; Do not reject H0 E) F-Ratio = 5.746; Reject H0 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 18) The null hypothesis of equality of means for a factor is rejected in a two-way ANOVA. Does this imply that the hypothesis will be rejected in a one-way ANOVA F test, if the data are collapsed over the levels of the second variable? Explain.
18)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 19) An industrial engineer wants to test the effect of three different ways of assembling a part on the total assembly time. Five people are randomly assigned to each of the three assembly methods, and the total assembly time (in seconds) is recorded. Calculate the F-ratio. Can you reject H0 using α = 0.05? The ANOVA table is listed below. Analysis of Variance Source SS df Method 985535.907 2 Error 276844.275 12 Total 1262380.182 A) F-Ratio = 3.560; Reject H0
MS 492767.954 23070.356
F-ratio
B) F-Ratio = 0.047; Reject H0 C) F-Ratio = 0.047; Don not reject H0 D) F-Ratio = 21.359; Do not reject H0 E) F-Ratio = 21.359; Reject H0
Copyright © 2017 Pearson Education, Inc. 6
19)
20) A chemist is interested in comparing four different catalysts by measuring reaction times. Each catalyst is tested five times, with each experiment randomly assigned one of the four catalysts. Calculate the F-ratio. Can you reject H0 using α = 0.05? The ANOVA table is listed below.
20)
Analysis of Variance Source SS df MS F-ratio Catalyst 9.310 3 3.103 Error 79.320 16 4.958 Total 88.630 A) F-Ratio = 1.598; Do not reject H0 B) F-Ratio = 0.117; Do not reject H0 C) F-Ratio = 0.626; Reject H0 D) F-Ratio = 1.598; Reject H0 E) F-Ratio = 0.626; Do not reject H0 Select the most appropriate answer. 21) Which of the following are assumptions of the analysis of variance F test for comparing population means of several groups? A) Each group has a normal population distribution. B) Each group has the same standard deviation. C) None of these. D) All of these. E) The random samples drawn from each group are independent.
21)
22) If the average crop yields are to be compared among 4 different varieties of seeds and 2 different irrigation techniques, which of the following methods of analysis seems most appropriate? A) a two-way ANOVA B) a factorial ANOVA C) a logistic regression analysis D) a one-way ANOVA E) a linear regression analysis
22)
23) If the average weight losses are to be compared among 3 different diets, which of the following methods of analysis seems most appropriate? A) a linear regression analysis B) a two-way ANOVA C) a three-way ANOVA D) a one-way ANOVA E) a logistic regression analysis
23)
Copyright © 2017 Pearson Education, Inc. 7
Given below are the analysis of variance results from a Minitab display. Assume that you want to use a 0.05 significance level in testing the null hypothesis that the different samples come from populations with the same mean. 24) 24) Source DF SS MS F p Factor 3 30 10.00 1.6 0.264 Error 8 50 6.25 Total 11 80 What can you conclude about the equality of the population means? A) Reject the null hypothesis since the p-value is greater than the significance level. We conclude that all of the factor means differ. B) Reject the null hypothesis since the p-value is greater than the significance level. We conclude that at least two of the factor means differ. C) Do not reject the null hypothesis since the p-value is greater than the significance level. We conclude that the factor means are equal. D) Do not reject the null hypothesis since the p-value is greater than the significance level. There is not enough evidence to show that the factor means are unequal. E) No conclusion can be made. Determine the degrees of freedom. 25) An independent research firm conducts a study to compare the taste of four new sports drinks. Five people are randomly assigned to each of the four drinks. Each person tastes the drink and judges it on a scale from one to five. How many degrees of freedom (df1 ) does the group sum of squares have? How many degrees of freedom (df2 ) for the error sum of squares? A) df1 = 4; df2 = 16 B) df1 = 3; df2 = 16 C) df1 = 16; df2 = 3 D) df1 = 4; df2 = 17 E) df1 = 3; df2 = 17
Copyright © 2017 Pearson Education, Inc. 8
25)
Answer Key Testname: CHAPTER 14 FORM B TEST
1) C 2) B 3) C 4) B 5) D 6) A 7) A 8) E 9) C 10) E 11) B 12) B 13) D 14) Since the ANOVA F test is robust to moderate breakdowns in the population normality and equal standard deviation assumptions, in practice it is used unless (i) graphical methods show extreme skew for the response variable or (ii) the largest group standard deviation is more than about double the smallest group standard deviation and the sample sizes are unequal. 15) a. We need 4 indicator variables to indicate the two genders, the two races, and the three types of jobs. The first indicator variable is x 1 = 1 for female = 0 otherwise. The second indicator variable is x 2 = 1 for black = 0 otherwise. The third indicator variable is x 3 = 1 for white-collar = 0 otherwise. The fourth indicator variable is x 4 = 1 for service = 0 otherwise. With these indicator variables, the multiple regression equation for the mean income is μy = α + β 1 x 1 + β 2 x 2 + β 3 x 3 + β 4 x 4 . α is the mean for male, white, blue collar workers; β1 is added to the mean if the worker is female; β2 is added to the mean if the worker is black; β 3 is added to the mean if the worker is white-collar and β 4 is added to the mean if the worker is a service worker. 16) B 17) E 18) No. Once the second factor is no longer controlled, the effect of the first factor on the response variable could change because the residual variation could change. 19) E 20) E 21) D 22) A 23) D 24) D 25) B
Copyright © 2017 Pearson Education, Inc. 9
CHAPTER 15 FORM A TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) SAT scores for students selected randomly from two different schools are shown below. Find W, the sum of the ranks for the larger sample, to test the claim that the scores for the two schools have the same distribution. School A School B 550 480 670 460 580 620 400 700 520 380 680 570 540 740 560 660 500 480 360 560 650 600 550 A) 11.91 B) 0.17
C) 12.08
D) 131
1)
E) 145
Select the most appropriate answer. 2) A researcher wants to know if the time spent in prison for a particular type of crime was the same for men and women. A random sample of men and women were each asked to give the length of sentence received. The data, in years, are listed below. Which of the following tests can be used to test the claim that there is no difference in the sentence received by each sex if normality cannot be assumed?
2)
Men 10 22 16 18 19 26 Women 9 12 9 14 26 12 Men 14 22 12 19 23 24 Women 34 8 10 13 17 27 A) Wilcoxon test B) Wilcoxon signed-rank test C) none of these D) Sign test E) t-test Provide an appropriate response. 3) A local school district is concerned about the number of school days missed by its teachers due to illness. A random sample of 10 teachers is selected. The numbers of days absent in one year is listed below. An incentive program is offered in an attempt to decrease the number of days absent. The number of days absent in one year after the incentive program is listed below. Use the Wilcoxon signed-rank test to find the test statistic ws to test the claim that the incentive program cuts down on the number of days missed by teachers. Use α = 0.05. The difference is defined as the After Incentive-Before Incentive. Teacher Days Absent Before Incentive Days Absent After Incentive A) -3.16
1
2
3
4
5
6
7
8
9
10
6
8
7
7
9
5
2
6
4
3
4
7
7 5 B) -2
8
3
0
7 2 C) 7
3 D) 2
Copyright © 2017 Pearson Education, Inc. 1
E) 1
3)
4) Verbal SAT scores for students randomly selected from two different schools are listed below. Use the Wilcoxon rank sum test to find W, the sum of the ranks for the larger sample, to test the claim that there is no difference in the scores from each school.
4)
School 1 School 2 560 530 780 500 450 690 490 760 540 440 720 600 590 790 620 700 560 540 600 740 760 640 650 550 A) 128.5
B) 60
C) 171.5
D) 75.5
E) 3.58
5) Students in Statistics 102 were asked to report their final exam score (a 200-point exam) in Statistics 101. These were then paired with the scores on record. The higher of the two scores is listed in the tables below ("Rep" for reported and "Rec" for recorded). At α = 0.05, determine the test statistic to support the claim that there is a difference between reported scores and recorded scores. Define p to be the population proportion who reported a larger score than was recorded. 9 10 11 12 Student 1 2 3 4 5 6 7 8 Rep Rec Rep Rec Rep Rec Rec Rec Rep Rep Rep Same A) 3 B) -1.508 C) 0 D) -1.732
E) 8
6) The grade point averages of students participating in sports at a college are to be compared. The data are listed below. Find the average rank for the tennis GPAs to be used in the Kruskal-Wallis test for testing the claim that there is no difference in the distributions of the populations. Tennis Golf 3.4 2 2.8 2.3 2.7 3.5 3.7 2.1 3.3 2.5 2.3 2.2 A) 4.08
Swimming 2.9 3.2 3 2.7 2.7 2.6 B) 3.5
C) 3.03
D) 12.25
C) 95.5
D) 143
E) 8.61
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 8) List two cases when nonparametric methods are especially useful.
Copyright © 2017 Pearson Education, Inc. 2
6)
E) 73.5
7) A teacher uses two different CAI programs to remediate students. Results for each group on a standardized test are listed in a table below. Use the Wilcoxon rank sum test to find W, the sum of the ranks for the smaller sample, to test the claim that there is no difference in the results from each program. Program I Program II 60 75 61 63 66 89 68 77 86 69 64 70 84 80 81 87 72 82 59 78 73 91 93 94 95 A) 235 B) 90
5)
8)
7)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 9) A person who commutes to work is choosing between two different routes. He tries the first route 11 times and the second route 12 times and records the time of each trip. The results (in minutes) are shown below. Find W, the difference between sample mean ranks for the two groups, to test the claim that the times for both routes have the same distribution. Route 1 Route 2 35 42 41 41 46 38 33 48 40 49 53 36 39 50 46 51 57 53 36 40 45 50 55 A) -6.62 B) 182 C) 8.55 D) 94 E) 15.17
9)
10) A researcher wishes to determine whether there is a difference in the average age of elementary school, high school, and community college teachers. Teachers are randomly selected. Their ages are recorded below. Find the mean rank for high school teachers that are used in the Kruskal-Wallis test.
10)
Elementary School High School Community College Teachers Teachers Teachers 24 37 40 29 42 46 28 39 37 53 48 62 38 43 46 26 32 36 A) 381 B) 10.58 C) 63.5
D) 36
E) 40.2
11) 11 female employees and 11 male employees are randomly selected from one company and their weekly salaries are recorded. The salaries (in dollars) are shown below. Software reports a small-sample one-sided P-value of 0.008. Interpret. Female Male 350 420 470 410 460 650 385 675 520 545 720 810 540 400 550 660 500 880 450 640 700 750 A) If there were no difference in weekly salaries among men and women, the probability of obtaining a sample as extreme as that observed is 0.008. There is strong evidence that the average salaries differ. B) The probability that a randomly selected female earns the same as a randomly selected male is 0.008. There is strong evidence that the average male salary is higher. C) If there were no difference in weekly salaries among men and women, the probability of obtaining a sample as extreme as that observed is 0.008. There is strong evidence that the average male salary is higher. D) The probability that a randomly selected male earns more than a randomly selected female is 0.008. There is strong evidence that the average male salary is higher. E) The probability of obtaining a sample where the average female salary is greater than or equal to the average male salary is 0.008. There is strong evidence that the average male salary is higher.
Copyright © 2017 Pearson Education, Inc. 3
11)
Select the most appropriate answer. 12) If the P-value of the Kruskal-Wallis test is small, to find out which pairs of groups significantly differ the Kruskal-Wallis test should be followed up by
12)
I. the Wilcoxon test to compare each pair of groups. II. constructing a confidence interval for the difference between the population medians for each pair of groups. III. constructing a confidence interval for the difference between the population means for each pair of groups. A) III only B) both I and III C) both I and II D) I only E) II only Answer true or false. 13) The Wilcoxon signed-rank test improves on the sign test by taking into account the sizes of the differences and not merely their sign. A) True B) False SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 14) To compare the length of stay (in days) in the hospital (H) for patients with the same diagnosis at two different hospitals, the following data were collected. H #1 H #2
21 86
10 27
32 10
60 68
8 87
44 76
29 125
5 60
13 35
26 73
33 96
44
a. Why might a t test not be very useful in this case? b. Explain how to find the ranks for the Wilcoxon test by showing which of the 24 observations get ranks 1, 2, 23, and 24. c. The test statistic for the Wilcoxon test is W = 83.5 and the P-value = 0.0019 (adjusted for ties), interpret.
Copyright © 2017 Pearson Education, Inc. 4
238
14)
13)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 15) 11 female employees and 11 male employees are randomly selected from one company and their weekly salaries are recorded. The salaries (in dollars) are shown below. Find W, the sum of the ranks for the smaller sample, to test the claim that salaries for female and male employees of the company have the same distribution. Female Male 350 420 470 410 460 650 385 675 520 545 720 810 540 400 550 660 500 880 450 640 700 750 A) 163 B) 90 C) 14.82 D) 8.18 E) -6.64
15)
16) Nine students took the SAT test. Their scores are listed below. Later on, they took a test preparation course and retook the SAT. Their new scores are listed below. Use the Wilcoxon signed-ranks test to find the test statistic ws to test the claim that the test preparation had no effect on their scores. Use α = 0.05. The
16)
difference is defined as the before score-after score. Student 1 2 3 4 5 6 Before Score 1040 1070 1000 1080 1170 910 After Score 1060 1070 990 1120 1200 920 A) -32 B) 15 C) -4
7 830 820
8 930 970
9 870 890 D) 4
E) 32
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill in the blank. 17) The t test for comparing two means is to the one-way ANOVA F test as the ____________________ test is to the Kruskal-Wallis test.
17)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 18) A weight-lifting coach claims that a weight-lifter can increase strength by taking vitamin E. To test the theory, the coach randomly selects 9 athletes and gives them a strength test using a bench press. Thirty days later, after regular training supplemented by vitamin E, they are tested again. The table below lists whether the athlete was stronger before or after taking vitamin E. Using the sign test for the claim that the vitamin E supplement is effective in increasing the athlete's strength, find the p-value. Athlete 1 2 3 4 5 6 7 8 9 After After Same Before After After After Before After A) 0.157 B) 0.164 C) 0.05 D) 0.289
Copyright © 2017 Pearson Education, Inc. 5
E) 0.145
18)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 19) The data below represent the number of computer units sold per week by a random sample of 8 salespersons before and after a bonus plan was introduced. The difference is defined as after minus before. A paired t-test to determine whether the bonus plan had significantly increased sales on average yielded a P-value of 0.03. 54 53 -1
Before After Difference
25 30 5
80 79 -1
76 79 3
63 70 7
82 90 8
94 92 -2
19)
72 81 9
a. Suppose the assumption of normality for the differences was not valid and conduct a Wilcoxon signed-rank test. b. Does the test in part (a) agree with the paired t-test? If not, which result is more reliable? MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 20) Nine students took the SAT test. Later on, they took a test preparation course and retook the SAT. Their new scores are listed below. For each of the nine students, the table below lists which test had the higher score, before the preparation test or after. Using the sign test for the claim that the test preparation had no effect on their scores, find the p-value. Student 1 2 3 4 5 6 7 8 9 After Same Before After After After Before After After A) 0.157 B) 0.085 C) 0.05 D) 0.289
E) 0.145
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 21) To study whether the sport of a college athlete is associated with college GPA, the following data were collected.
The following represents the MINITAB output for the Kruskal-Wallis test on these data. Interpret these results. Kruskal-Wallis Test: gpa versus sport sport golf swimming tennis
N 6 6 6
Median 2.550 3.100 3.350
H = 5.13
DF = 2
Rank 5.6 10.7 12.3
P = 0.077
(adjusted for ties)
Copyright © 2017 Pearson Education, Inc. 6
21)
20)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 22) A teacher uses two different CAI programs to remediate students. Results for each group on a standardized test are listed in a table below. Software reports a small-sample two-sided P-value of 0.003. Interpret.
22)
Program I Program II 60 75 61 63 66 89 68 77 86 69 64 70 84 80 81 87 72 82 59 78 73 91 93 94 95 A) If there were no difference between programs, the probability of obtaining a sample as extreme as that observed would be 0.003. We conclude that Program II gives better results. B) The probability that the programs give different results is 0.003. There is strong evidence that the programs give the same results. C) The probability that the programs give the same results is 0.003. We conclude that Program II gives better results. D) The probability that the programs give the same results is 0.003. There is strong evidence that the programs give different results. E) If there were no difference between programs, the probability of obtaining a sample as extreme as that observed would be 0.003. There is strong evidence that the programs give different results. Select the most appropriate answer. 23) The nonparametric test for comparing mean ranks of several groups with independent random samples is the A) Wilcoxon test B) t-test C) Kruskal-Wallis test D) Sign test E) Wilcoxon signed-rank test 24) Which of the following is a nonparametric test for comparing two groups with matched-pairs data when the difference can be ranked? A) Wilcoxon signed-rank test B) sign test C) Wilcoxon test D) Bonferroni test E) Kruskal-Wallis test Provide an appropriate response. 25) A pharmaceutical company wishes to test a new drug with the expectation of lowering cholesterol levels. Ten subjects are randomly selected and their cholesterol levels are recorded. The subjects were placed on the drug for a period of 6 months, after which their cholesterol levels were tested again. The table below lists whether the subject's cholesterol was lower before or after taking the drug. Using the sign test for the company's claim that the drug lowers cholesterol levels, find the test statistic. Subject 1 2 3 4 5 6 7 8 9 10 After After Before After After Same After After After After A) 1 B) 1.90 C) 2.33 D) 0.11
Copyright © 2017 Pearson Education, Inc. 7
E) 8
23)
24)
25)
Answer Key Testname: CHAPTER 15 FORM A TEST
1) E 2) A 3) D 4) C 5) B 6) D 7) B 8) 1) When the data are ranks for the subjects rather than quantitative measurements; 2) When it's inappropriate to assume normality and the ordinary statistical method is not robust when the normality assumption is violated. 9) A 10) B 11) C 12) C 13) A 14) a. The box plots above do not show any substantial skew, but there is an extreme outlier for the H #2 group. One patient in that group had a very long stay, 238 days. The Wilcoxon test is not affected by an outlier; b. Rank 1 goes to the shortest stay, so the value 5 gets rank 1. The second shortest stay is 8, which gets rank 2. The longest stay is 238, which gets rank 24. The next longest stay is 125, which gets rank 23; c. The P-value shows strong evidence against the null hypothesis that the distribution of stay times is identical for the two hospitals. The stay times tend to be longer for patients staying at hospital #2. 15) B 16) D 17) Wilcoxon 18) E 19) a. 1) Assumptions: Random sample of matched pairs for which the differences of observations have a symmetric population distribution and can be ranked; 2) H0 : Population median of difference scores is 0 and Ha: Population median of difference scores is greater than 0; 3) test statistic = 30; 4) P-value = 0.054; 5) Since the P-value > 0.05, there is not sufficient evidence that the bonus plan had significantly increased sales on average; b. No, the parametric test rejects the null hypothesis in favor of the alternative which states that the number of computer units sold per week increased after the bonus plan was introduced. The nonparametric test did not find sufficient evidence to reject the null hypothesis of no difference. Assuming the assumption of normality is not met, the non parametric test is more reliable. 20) D 21) If H0 : identical population distributions for the three groups were true, the Kruskal Wallis test statistic would have an approximate chi-squared distribution with df = 2. The test statistic is H = 5.13. The P-value, the right-tail probability above 5.13, is 0.077. At a usual 0.05 level of significance, it is plausible that GPA is independent of sport. The sample median GPAs are not very different. 22) E 23) C 24) A 25) C
Copyright © 2017 Pearson Education, Inc. 8
CHAPTER 15 FORM B TEST Name___________________________________
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Select the most appropriate answer. 1) A researcher wants to know if the time spent in prison for a particular type of crime was the same for men and women. A random sample of men and women were each asked to give the length of sentence received. The data, in years, are listed below. Which of the following tests can be used to test the claim that there is no difference in the sentence received by each sex if normality cannot be assumed?
1)
Men 10 22 16 18 19 26 Women 9 12 9 14 26 12 Men 14 22 12 19 23 24 Women 34 8 10 13 17 27 A) Sign test B) Wilcoxon test C) Wilcoxon signed-rank test D) t-test E) none of these 2) The nonparametric test for comparing mean ranks of several groups with independent random samples is the A) Kruskal-Wallis test B) Wilcoxon signed-rank test C) t-test D) Wilcoxon test E) Sign test
2)
3) Which of the following is a nonparametric test for comparing two groups with matched-pairs data when the difference can be ranked? A) Kruskal-Wallis test B) Wilcoxon signed-rank test C) sign test D) Wilcoxon test E) Bonferroni test
3)
Copyright © 2017 Pearson Education, Inc. 1
4) If the P-value of the Kruskal-Wallis test is small, to find out which pairs of groups significantly differ the Kruskal-Wallis test should be followed up by
4)
I. the Wilcoxon test to compare each pair of groups. II. constructing a confidence interval for the difference between the population medians for each pair of groups. III. constructing a confidence interval for the difference between the population means for each pair of groups. A) both I and III B) III only C) I only D) II only E) both I and II Provide an appropriate response. 5) Nine students took the SAT test. Their scores are listed below. Later on, they took a test preparation course and retook the SAT. Their new scores are listed below. Use the Wilcoxon signed-ranks test to find the test statistic ws to test the claim that the test preparation had no effect on their scores. Use α = 0.05. The
5)
difference is defined as the before score-after score. Student 1 2 3 4 5 6 Before Score 1040 1070 1000 1080 1170 910 After Score 1060 1070 990 1120 1200 920 A) -32 B) 15 C) 32
7 830 820
8 930 970
9 870 890 D) -4
E) 4
6) The grade point averages of students participating in sports at a college are to be compared. The data are listed below. Find the average rank for the tennis GPAs to be used in the Kruskal-Wallis test for testing the claim that there is no difference in the distributions of the populations. Tennis Golf 3.4 2 2.8 2.3 2.7 3.5 3.7 2.1 3.3 2.5 2.3 2.2 A) 4.08
Swimming 2.9 3.2 3 2.7 2.7 2.6 B) 3.5
C) 73.5
D) 12.25
E) 3.03
7) A local school district is concerned about the number of school days missed by its teachers due to illness. A random sample of 10 teachers is selected. The numbers of days absent in one year is listed below. An incentive program is offered in an attempt to decrease the number of days absent. The number of days absent in one year after the incentive program is listed below. Use the Wilcoxon signed-rank test to find the test statistic ws to test the claim that the incentive program cuts down on the number of days missed by teachers. Use α = 0.05. The difference is defined as the After Incentive-Before Incentive. Teacher Days Absent Before Incentive Days Absent After Incentive A) 7
1
2
3
4
5
6
7
8
9
10
6
8
7
7
9
5
2
6
4
3
4
7
7 5 B) 2
8
3
0
7 2 C) 1
3 D) -3.16
Copyright © 2017 Pearson Education, Inc. 2
6)
E) -2
7)
8) A teacher uses two different CAI programs to remediate students. Results for each group on a standardized test are listed in a table below. Use the Wilcoxon rank sum test to find W, the sum of the ranks for the smaller sample, to test the claim that there is no difference in the results from each program. Program I Program II 60 75 61 63 66 89 68 77 86 69 64 70 84 80 81 87 72 82 59 78 73 91 93 94 95 A) 143 B) 235
C) 95.5
D) 8.61
E) 90
9) A researcher wishes to determine whether there is a difference in the average age of elementary school, high school, and community college teachers. Teachers are randomly selected. Their ages are recorded below. Find the mean rank for high school teachers that are used in the Kruskal-Wallis test. Elementary School High School Community College Teachers Teachers Teachers 24 37 40 29 42 46 28 39 37 53 48 62 38 43 46 26 32 36 A) 10.58 B) 381 C) 40.2
D) 36
8)
9)
E) 63.5
10) A person who commutes to work is choosing between two different routes. He tries the first route 11 times and the second route 12 times and records the time of each trip. The results (in minutes) are shown below. Find W, the difference between sample mean ranks for the two groups, to test the claim that the times for both routes have the same distribution. Route 1 Route 2 35 42 41 41 46 38 33 48 40 49 53 36 39 50 46 51 57 53 36 40 45 50 55 A) 94 B) 15.17 C) 182 D) -6.62 E) 8.55
10)
11) 11 female employees and 11 male employees are randomly selected from one company and their weekly salaries are recorded. The salaries (in dollars) are shown below. Find W, the sum of the ranks for the smaller sample, to test the claim that salaries for female and male employees of the company have the same distribution. Female Male 350 420 470 410 460 650 385 675 520 545 720 810 540 400 550 660 500 880 450 640 700 750 A) -6.64 B) 163 C) 8.18 D) 14.82 E) 90
11)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 12) List two cases when nonparametric methods are especially useful.
Copyright © 2017 Pearson Education, Inc. 3
12)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 13) A teacher uses two different CAI programs to remediate students. Results for each group on a standardized test are listed in a table below. Software reports a small-sample two-sided P-value of 0.003. Interpret.
13)
Program I Program II 60 75 61 63 66 89 68 77 86 69 64 70 84 80 81 87 72 82 59 78 73 91 93 94 95 A) The probability that the programs give the same results is 0.003. We conclude that Program II gives better results. B) The probability that the programs give different results is 0.003. There is strong evidence that the programs give the same results. C) If there were no difference between programs, the probability of obtaining a sample as extreme as that observed would be 0.003. There is strong evidence that the programs give different results. D) If there were no difference between programs, the probability of obtaining a sample as extreme as that observed would be 0.003. We conclude that Program II gives better results. E) The probability that the programs give the same results is 0.003. There is strong evidence that the programs give different results. 14) A pharmaceutical company wishes to test a new drug with the expectation of lowering cholesterol levels. Ten subjects are randomly selected and their cholesterol levels are recorded. The subjects were placed on the drug for a period of 6 months, after which their cholesterol levels were tested again. The table below lists whether the subject's cholesterol was lower before or after taking the drug. Using the sign test for the company's claim that the drug lowers cholesterol levels, find the test statistic. Subject 1 2 3 4 5 6 7 8 9 10 After After Before After After Same After After After After A) 0.11 B) 8 C) 1 D) 2.33
E) 1.90
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 15) To study whether the sport of a college athlete is associated with college GPA, the following data were collected.
The following represents the MINITAB output for the Kruskal-Wallis test on these data. Interpret these results. Kruskal-Wallis Test: gpa versus sport sport golf swimming tennis
N 6 6 6
Median 2.550 3.100 3.350
H = 5.13
DF = 2
Rank 5.6 10.7 12.3
P = 0.077
(adjusted for ties)
Copyright © 2017 Pearson Education, Inc. 4
15)
14)
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 16) A weight-lifting coach claims that a weight-lifter can increase strength by taking vitamin E. To test the theory, the coach randomly selects 9 athletes and gives them a strength test using a bench press. Thirty days later, after regular training supplemented by vitamin E, they are tested again. The table below lists whether the athlete was stronger before or after taking vitamin E. Using the sign test for the claim that the vitamin E supplement is effective in increasing the athlete's strength, find the p-value. Athlete 1 2 3 4 5 6 7 8 9 After After Same Before After After After Before After A) 0.164 B) 0.289 C) 0.157 D) 0.05
16)
E) 0.145
17) Verbal SAT scores for students randomly selected from two different schools are listed below. Use the Wilcoxon rank sum test to find W, the sum of the ranks for the larger sample, to test the claim that there is no difference in the scores from each school.
17)
School 1 School 2 560 530 780 500 450 690 490 760 540 440 720 600 590 790 620 700 560 540 600 740 760 640 650 550 A) 171.5
B) 3.58
C) 128.5
D) 75.5
E) 60
18) 11 female employees and 11 male employees are randomly selected from one company and their weekly salaries are recorded. The salaries (in dollars) are shown below. Software reports a small-sample one-sided P-value of 0.008. Interpret. Female Male 350 420 470 410 460 650 385 675 520 545 720 810 540 400 550 660 500 880 450 640 700 750 A) If there were no difference in weekly salaries among men and women, the probability of obtaining a sample as extreme as that observed is 0.008. There is strong evidence that the average salaries differ. B) The probability of obtaining a sample where the average female salary is greater than or equal to the average male salary is 0.008. There is strong evidence that the average male salary is higher. C) The probability that a randomly selected male earns more than a randomly selected female is 0.008. There is strong evidence that the average male salary is higher. D) If there were no difference in weekly salaries among men and women, the probability of obtaining a sample as extreme as that observed is 0.008. There is strong evidence that the average male salary is higher. E) The probability that a randomly selected female earns the same as a randomly selected male is 0.008. There is strong evidence that the average male salary is higher.
Copyright © 2017 Pearson Education, Inc. 5
18)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 19) To compare the length of stay (in days) in the hospital (H) for patients with the same diagnosis at two different hospitals, the following data were collected. H #1 H #2
21 86
10 27
32 10
60 68
8 87
44 76
29 125
5 60
13 35
26 73
33 96
44
19)
238
a. Why might a t test not be very useful in this case? b. Explain how to find the ranks for the Wilcoxon test by showing which of the 24 observations get ranks 1, 2, 23, and 24. c. The test statistic for the Wilcoxon test is W = 83.5 and the P-value = 0.0019 (adjusted for ties), interpret. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 20) Nine students took the SAT test. Later on, they took a test preparation course and retook the SAT. Their new scores are listed below. For each of the nine students, the table below lists which test had the higher score, before the preparation test or after. Using the sign test for the claim that the test preparation had no effect on their scores, find the p-value. Student 1 2 3 4 5 6 7 8 9 After Same Before After After After Before After After A) 0.289 B) 0.05 C) 0.145 D) 0.157
Copyright © 2017 Pearson Education, Inc. 6
E) 0.085
20)
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 21) The data below represent the number of computer units sold per week by a random sample of 8 salespersons before and after a bonus plan was introduced. The difference is defined as after minus before. A paired t-test to determine whether the bonus plan had significantly increased sales on average yielded a P-value of 0.03. Before After Difference
54 53 -1
25 30 5
80 79 -1
76 79 3
63 70 7
82 90 8
94 92 -2
21)
72 81 9
a. Suppose the assumption of normality for the differences was not valid and conduct a Wilcoxon signed-rank test. b. Does the test in part (a) agree with the paired t-test? If not, which result is more reliable? MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 22) Students in Statistics 102 were asked to report their final exam score (a 200-point exam) in Statistics 101. These were then paired with the scores on record. The higher of the two scores is listed in the tables below ("Rep" for reported and "Rec" for recorded). At α = 0.05, determine the test statistic to support the claim that there is a difference between reported scores and recorded scores. Define p to be the population proportion who reported a larger score than was recorded. 9 10 11 12 Student 1 2 3 4 5 6 7 8 Rep Rec Rep Rec Rep Rec Rec Rec Rep Rep Rep Same A) -1.508 B) -1.732 C) 3 D) 0
E) 8
23) SAT scores for students selected randomly from two different schools are shown below. Find W, the sum of the ranks for the larger sample, to test the claim that the scores for the two schools have the same distribution. School A School B 550 480 670 460 580 620 400 700 520 380 680 570 540 740 560 660 500 480 360 560 650 600 550 A) 12.08 B) 11.91
C) 131
D) 145
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
Copyright © 2017 Pearson Education, Inc. 7
23)
E) 0.17
Answer true or false. 24) The Wilcoxon signed-rank test improves on the sign test by taking into account the sizes of the differences and not merely their sign. A) True B) False
Fill in the blank. 25) The t test for comparing two means is to the one-way ANOVA F test as the ____________________ test is to the Kruskal-Wallis test.
22)
25)
24)
Answer Key Testname: CHAPTER 15 FORM B TEST
1) B 2) A 3) B 4) E 5) E 6) D 7) B 8) E 9) A 10) D 11) E 12) 1) When the data are ranks for the subjects rather than quantitative measurements; 2) When it's inappropriate to assume normality and the ordinary statistical method is not robust when the normality assumption is violated. 13) C 14) D 15) If H0 : identical population distributions for the three groups were true, the Kruskal Wallis test statistic would have an approximate chi-squared distribution with df = 2. The test statistic is H = 5.13. The P-value, the right-tail probability above 5.13, is 0.077. At a usual 0.05 level of significance, it is plausible that GPA is independent of sport. The sample median GPAs are not very different. 16) E 17) A 18) D 19) a. The box plots above do not show any substantial skew, but there is an extreme outlier for the H #2 group. One patient in that group had a very long stay, 238 days. The Wilcoxon test is not affected by an outlier; b. Rank 1 goes to the shortest stay, so the value 5 gets rank 1. The second shortest stay is 8, which gets rank 2. The longest stay is 238, which gets rank 24. The next longest stay is 125, which gets rank 23; c. The P-value shows strong evidence against the null hypothesis that the distribution of stay times is identical for the two hospitals. The stay times tend to be longer for patients staying at hospital #2. 20) A 21) a. 1) Assumptions: Random sample of matched pairs for which the differences of observations have a symmetric population distribution and can be ranked; 2) H0 : Population median of difference scores is 0 and Ha: Population median of difference scores is greater than 0; 3) test statistic = 30; 4) P-value = 0.054; 5) Since the P-value > 0.05, there is not sufficient evidence that the bonus plan had significantly increased sales on average; b. No, the parametric test rejects the null hypothesis in favor of the alternative which states that the number of computer units sold per week increased after the bonus plan was introduced. The nonparametric test did not find sufficient evidence to reject the null hypothesis of no difference. Assuming the assumption of normality is not met, the non parametric test is more reliable. 22) A 23) D 24) A 25) Wilcoxon
Copyright © 2017 Pearson Education, Inc. 8