MAT 300 Unit 2 Challenges Sophia Click below link for Answers https://www.sobtell.com/q/tutorial/default/206896-mat-300-unit-2-challenges-sophia https://www.sobtell.com/q/tutorial/default/206896-mat-300-unit-2-challenges-sophia
CHALLENEG 1 The following table shows the types of homes in which residents were living in 2010.
To accurately display the large apartment building category on a pie chart, an angle that is approximately ________ degrees would need to be created. a.) 131 b.) 79 c.) 94 d.) 13
Recall that pie charts are 360 degrees and each piece of pie should represent the percent of the pie it takes up. So for a large apartment with a frequency of 1408 and a total of 6400, the fraction of the pie would then be:
The following table shows the types of homes in which residents were living in 2010.
To accurately display the mobile home category on a pie chart, an angle that is approximately ________ degrees would need to be created. a.) 79 b.) 94 c.) 13
d.) 131 he following table shows the types of homes in which residents were living in 2010.
To accurately display the single-family detached home category on a pie chart, an angle that is approximately ________ degrees would need to be created. a.) 94 b.) 131 c.) 13 d.) 79
Recall that pie charts are 360 degrees and each piece of pie should represent the percent of the pie it takes up. So with single detached homes with a frequency of 2330 and a total of 6400, the fraction of the pie would then be:
The following histogram represents quiz scores and uses the following interval bins: 1-5, 6-10, 11-15, 16-20, 21-25, 26-30, 31-35, 36-40, 41-45, and 46-50. Select the true statement for this histogram. a.) The quiz score bin interval 11-15 contains the most students, while bin interval 46-50 contains the fewest. b.) The quiz score bin interval 11-15 contains the most students, while bin interval 26-30 contains the fewest. c.) The quiz score bin interval 46-50 contains the most students, while bin interval 26-30 contains the fewest. d.)
The quiz score bin interval 1-5 contains the most students, while bin interval 46-50 contains the fewest.
The height of the histogram represents the frequency, with each frequency listed inside each. The largest bin interval is 11-15 with 10 quizzes inside this interval and the fewest is in 46-50 with only 2 quiz scores. The following histogram represents quiz scores and uses the following interval bins: 1-5, 6-10, 11-15, 16-20, 21-25, 26-30, 31-35, and 36-40. Select the true statement for this histogram. a.) The quiz score bin interval 6-10 contains the most students, while bin interval 26-30 contains the fewest. b.) The quiz score bin interval 11-15 contains the most students, while bin interval 36-40 contains the fewest. c.) The quiz score bin interval 16-20 contains the most students, while bin interval 1-5 contains the fewest. d.) The quiz score bin interval 36-40 contains the most students, while bin interval 1-5 contains the fewest.
The height of the histogram represents the frequency, with each frequency listed inside each. The largest bin interval is 11-15 with 15 quizzes inside this interval and the fewest is in 36-40 with only 2 quiz scores. he following histogram represents quiz scores and uses the following interval bins: 1-5, 6-10, 11-15, 16-20, 21-25, 26-30, 31-35, and 36-40. Select the true statement for this histogram. a.) The quiz score bin interval 1-5 contains the most students, while bin interval 36-40 contains the fewest. b.) The quiz score bin interval 11-15 contains the most students, while bin interval 36-40 contains the fewest. c.)
The quiz score bin interval 6-10 contains the most students, while bin interval 21-25 contains the fewest. d.) The quiz score bin interval 11-15 contains the most students, while bin interval 21-25 contains the fewest.
The height of the histogram represents the frequency, with each frequency listed inside each. The largest bin interval is 11-15 with 13 quizzes inside this interval and the fewest is in 36-40 with only 2 quiz scores. The time series diagram shows the change of temperature recorded over a successive interval of time. Which segment of the graph indicates the smallest difference in the temperature? a.) 2 P.M. to 3 P.M. b.) 1 P.M. to 2 P.M. c.) 10 A.M. to 11 A.M. d.) 11 A.M. to 12 P.M.
Recall that in a time series plot, if the intervals are equal length the smallest difference will be when the graph has the flattest slope. No slope would mean a horizontal line. In this case the flattest line or smallest slope occurs from 10am to 11am. The time series diagram shows the total distance traveled by Kelvin on his car trip. Which segment of the graph indicates no change in distance traveled? a.) 12 P.M. - 1 P.M. b.) 2 P.M. to 3 P.M. c.) 1 P.M. to 2 P.M. d.)
10 A.M. to 11 A.M.
Recall that in a time series plot no change in distance will be when the graph has no slope. No slope would mean a horizontal line. In this case no slope occurs from 12pm to 1pm. The time series diagram shows the snowfall recorded at successive intervals of time.
The graph below shows two different types of plans published by a cell phone company. Ryan claims that the cost of Plan A and plan B differ a lot. Which statement best explains why Ryan was misled by the graph? a.) The scale of intervals on the vertical axis is inconsistent. b.) The scale of intervals on the horizontal axis is inconsistent. c.) The graph of Plan A shows a gradual increase in price. d.) The graph of Plan B shows a rapid increase in price.
If you examine both graphs, you will notice the graphs are the same. Since the scaling is different on the vertical axis (also called the y axis), it leads you to believe the graphs are different. The bar graph below shows the net sales of a magazine company over the years. Based on the graph, Dan claims that the net sales of magazines in 2008 are twice the sales in 2006.
Which statement explains why the graph is misleading? a.) The scale of intervals on the horizontal axis is inconsistent. b.) The vertical axis should be in terms of number of magazines instead of net sales. c.)
The vertical scale made the bar of 2008 look twice the bar of 2006. d.) The horizontal axis should be in terms of months instead of years. a.)In. Since the graph goes up a year at time, they are not inconsistent. Due to the graph starting at 400 and not 0, it makes the 2008 look twice as large. 2006 has net sales of $800,000, while in 2008 the net sales are $1,200,000. So it is only 1.5 times as large. The bar graph below shows the profits made by industry over the years. Based on the graph, Richard claims that the profits made in 2008 are four times the profits made in 2006. Which statement explains why the graph is misleading? a.) The vertical axis should be in terms of net sales instead of profits. b.) The vertical scale made the bar of 2006 appear to be a fourth of the size of 2008. c.) The scale of intervals on the horizontal axis is inconsistent. d.) The horizontal axis should be in terms of months instead of years. c.)In. Since the graph goes up a year at time, they are consistent. Since the horizontal axis starts at 1000 the height of the graphs are misleading. It makes it appear as if 2008 is four times the size of 2006, when in reality 2006 has a profit of $1,500,000, while in 2008 the profit is $3,000,000. So it is only twice as large. Which segment of the graph indicates the largest difference in the snowfall depth? a.) 4 hours to 6 hours b.) 2 hours to 4 hours c.) 6 hours to 8 hours
d.) 8 hours to 10 hours
Recall that in a time series plot, if the intervals are equal length the largest difference will be when the graph has the steepest slope. It could be increasing or decreasing. In this case the largest slope is seen in 8 to 10 hours. The stem-and-leaf plot below shows the quiz scores of 50 students. How many students scored lower than 10 on the test? a.) 15 students b.) 13 students c.) 16 students d.) 12 students In a stem and leaf you can get the exact values by looking at the stem and leaf. The number of leaves indicate 1 number. So for scores less than 10 it is the entire first stem, which has 12 values. These indicate scores of 2, 3, 3, 4, 5, 5, 5, 5, 6, 7, 8, and 9, all of which are below 10. he stem-and-leaf plot below shows the quiz scores of 50 students. How many students scored higher than 25 on the test? a.) 6 students b.) 14 students c.) 9 students d.) 3 students In a stem and leaf you can get the exact values by looking at the stem and leaf. The number of leaves indicate 1 number. We can see that the number of scores 25 or more
are the 3 values in the 20's group (28, 29, 29) and all 6 in the 30's group (30, 32, 32, 32, 36, 38) for a total of 9. The stem-and-leaf plot below shows the quiz scores of 50 students. How many students scored 22 on the test? a.) 4 students b.) 7 students c.) 6 students d.) 1 student In a stem and leaf you can get the exact values by looking at the stem and leaf. The number of leaves indicate 1 number. So for 22 we can see there is only one 2 in the stem of 2, which means there is only 1 student with that score. The data below shows the number of songs downloaded over 15 successive weekends.5, 10, 6, 5, 6, 10, 8, 10, 10, 5, 8, 9, 7, 6, 5 Which dot plot represents the data? a.)
b.)
c.)
d.) Recall that a dotplot places a dot above the value along a number line. Each dot generally represents 1 value. So the dotplot that repesents this data should have 4 dots at 5, 3 dots at 6 and so on. This is the figure that illusrates that. The data below shows the number of text messages received by a group of 25 students in a day.10, 8, 13, 14, 15, 14, 15, 14, 13, 12, 12, 14, 11, 14, 13, 14, 15, 16, 13, 16, 15, 16, 15, 14, 11 Which dot plot represents the data? a.)
b.)
c.)
d.) Recall that a dotplot places a dot above the value along a number line. Each dot generally represents 1 value. So the dotplot that repesents this data should have 1 dot at 8, 1 dot at 10, 2 dots at 11, and so on. This is the figure that illusrates that. he data below shows the height (in cm) of students in a class.136, 135, 145, 142, 141, 142, 135, 132, 133, 145, 142, 140, 141, 136, 148, 140, 143, 145, 135, 136 Which dot plot represents the data? a.)
b.)
c.)
d.) Recall that a dotplot places a dot above the value along a number line. Each dot generally represents 1 value. So the dotplot that repesents this data should have 1 dot at 132, 1 dot at 133, 3 dots at 135, and so on. This is the figure that illustrates that. The frequency table shows the types of homes in which residents were living in 2010. The relative frequency of residents living in single family detached homes in 2010 is __________. Answer choices are rounded to the nearest percentage. a.) 8% b.) 23% c.) 87% d.)
36% In order to get relative frequency we take the number in the class divided by the total. So the relative frequency single family detached homes is: The frequency table shows the types of homes in which residents were living in 2010. The relative frequency of residents living in mobile homes in 2010 is __________. Answer choices are rounded to the nearest percentage, a.) 8% b.) 4% c.) 23% d.) 36% In order to get relative frequency, we take the number in the class divided by the total. So the relative frequency of mobile homes is: he frequency table shows the types of homes in which residents were living in 2010. The relative frequency of residents living in large apartment buildings in 2010 is __________. Answer choices are rounded to the nearest percentage. a.) 14% b.) 70% c.) 22% d.) 60% In order to get relative frequency we take the number in the class/total. So the relative frequency of large apartment buildings is: The following histogram represents quiz scores and uses the interval bins: 1-5, 6-10, 1115, 16-20, 21-25, 26-30, 31-35, and 36-40. For students who earned a score of 21 or higher on the quiz, the cumulative frequency is __________ and the relative cumulative frequency is __________.
a.) 3; 6% b.) 6; 24% c.) 24; 12% d.) 12; 24% Recall that for cumulative frequency, we add up all the values that fall above or below a given bin of data. So, if we want all scores more than 21 it should be the last four bins with a total of: To get the relative frequency we take the total in the bins for scores of 21 and above divided by the total of all bins: The following histogram represents quiz scores and uses the interval bins: 1-5, 6-10, 1115, 16-20, 21-25, 26-30, 31-35, and 36-40. For students who earned a score of 21 or higher on the quiz, the cumulative frequency is __________ and the relative cumulative frequency is __________. a.) 3; 6% b.) 6; 24% c.) 24; 12% d.) 12; 24% Recall that for cumulative frequency, we add up all the values that fall above or below a given bin of data. So, if we want all scores more than 21 it should be the last four bins with a total of: To get the relative frequency we take the total in the bins for scores of 21 and above divided by the total of all bins: The following histogram represents quiz scores and uses the interval bins: 1-5, 6-10, 1115, 16-20, 21-25, 26-30, 31-35, and 36-40. For students who earned a score of 20 or lower on the quiz, the cumulative frequency is __________ and the relative cumulative frequency is __________.
a.) 16; 76% b.) 76; 38% c.) 38; 76% d.) 9; 18% Recall that for cumulative frequency we add up all the values that fall above or below a given bin of data. So, if we want all scores less than 20 it should be the first four bins with a total of: To get the relative frequency we take the total in the bins for scores of 20 and lower divided by the total of all bins: In Go to the Next Question he following histogram represents quiz scores and uses the interval bins: 1-5, 6-10, 1115, 16-20, 21-25, 26-30, 31-35, and 36-40. For students who earned a score of 15 or lower on the quiz, the cumulative frequency is __________ and the relative cumulative frequency is __________. a.) 29; 58% b.) 58; 29% c.) 13; 26% d.) 26; 58% Recall that for cumulative frequency, we add up all the values that fall above or below a given bin of data. So, if we want all scores 15 or less it should be the first 3 bins with a total of: To get the relative frequency we take the total in the bins for scores of 15 and lower divided by the total of all bins: he table below shows the time spent by four students on a science project.
Which stack plot best represents the time spent for the science project? a.)
b.)
c.)
d.) c.)In. We can see that this graph cannot be since on Monday there is a total of 90 minutes, but it appears that Tina has 20 minutes, when she should only have 15. Recall a stack plot shows the cumulative amount in different groups. So each of the colored regions should represent the amounts in each group. We can verify that for each person and the total, all the values are , with Monday at a total of 90 minutes with Susan spending 20, Tina spending 15, Den spending 35, and John spending 20. We can then verify each day with this same process. The table below shows the number of blood donors from a recent donation campaign. Which stack plot best represents the blood donors? a.)
b.)
c.)
d.) Recall a stack plot shows the cumulative amount in different groups. So each of the colored regions should represent the amounts in each group. When looking at this figure starting with blood type 0 we can see 25 donors for those younger than 30 and 35 donors for those older than 30 for a total of 60 donors. The other blood types can also be verified in this same way. The table below shows the number of visitors to a museum over a period of 5 days. Which stack plot best represents the visitors to the museum? a.)
b.)
c.)
d.)
a.)In. When looking at this figure, starting with children, it appears as if the first column for children is 120, but the total for Monday is far less than 250. We can see this cannot be the figure.
Recall a stack plot shows the cumulative amount in different groups. So each of the colored regions should represent the amounts in each group. When looking at this figure starting with children and then moving to adults, these bars represent accurately the values for children and adults in the table. For example, on Monday, there were 120 children and 130 adults, a total of 250 total visitors. 1 Erica is performing an experiment that requires her to weigh multiple samples. The masses of her samples are found to be normally distributed with a mean of 157g and a standard deviation of 5.2g. If Erica wants to convert her data to a standard normal distribution, which of the following statements is true?
The new mean would be 0, and the standard deviation would be 5.
The new mean would be 1, and the standard deviation would be 0.
The new mean would be 0, and the standard deviation would be 1.
The new mean would be 1, and the standard deviation would be 5.
The mean and standard deviation for a standard normal is always 0 and 1. It is standardized since it takes all the raw data and converts them into z-scores.
Standard Normal Distribution
2 The formula for standard deviation is Find the standard deviation of the following data set that has a mean of 5: 2, 4, 4, 4, 5, 5, 5, 7, 9 1 0 2 4 First, determine the variance of the data: If we recall the standard deviation (SD) is the square root of the variance, so we can note SD Standard Deviation 3 Stan is looking at the statistics for his favorite baseball player, who has hit 25, 26, 32, 38, 43, 40, 28, 32, 34, and 42 home runs in ten seasons. Using this data set, match each value with the description. Mean Median Mode A. 32 B. 33 C. 34 The mode is the most frequently occurring data point. The number 32 occurs two times, which is more than any other number's frequency. The mean is equal to the sum of the values divided by how many numbers there are. When we add the total number of home runs, we get 340. There were 10 seasons, so the mean can be calculated by the following calculation: The median is the middle value once the data is ordered.
Since there are an even number of values, we take the average of the 2 middle values of 32 and 34 to get a mean of 33: Mean, Median, and Mode 4 A workplace gave an “Employee Culture Survey” in which 500 employees rated their agreement with the statement, “I feel respected by those I work for.” Rating Frequency Strongly Agree
156
Agree 114 Neutral
99
Disagree
88
Strongly Disagree
43
The relative frequency of people who strongly agree with the statement is __________. 8.6% 31.2% 16% 54% To get the relative frequency, we take the frequency of the value and divide it by the total number. So in this case for strongly agree, the relative frequency would be: Frequency Tables 5 Let x stand for the percentage of an individual student's math test score. 100 students were sampled at a time. The population mean is 75 percent and the population standard deviation is 12 percent. What are the mean and standard deviation of the sampling distribution of sample means? mean = 75, standard deviation = 12
mean = 7.5, standard deviation = 1.2
mean = 7.5, standard deviation = 12
mean = 75, standard deviation = 1.2
The mean of the sampling distribution should be the true population mean, which would be 75 percent. The standard deviation of the sampling distribution is equal to the population standard deviation divided by the square root of the sample size: Center and Variation of a Sampling Distribution 6 Match each term with its corresponding definition. A number that describes the middle of a set of data
A qualitative statement about how the data looks after it has been plotted A number that describes how far the data is from the middle A. Shape B. Center C. Spread Recall that center describes the middle. Spread tells us how the data is distributed and is generally a measure from center (such as variance and standard deviation). Shape describes what the plotted data looks like and is a qualitative measure because it simply describes the shape. Data Analysis 7 Consider the histogram showing the heights of individuals on a basketball team. Which of the following is the difference in height between the shortest player on the team and the tallest player on the team? 4 inches 81 inches 9 inches 75 inches
The shortest height is 75 inches. In the histogram, we can see the frequency is 3, indicating that there are 3 players at that height. Similarly the tallest height is 84 inches. There are 2 players that tall. So the difference in height from the largest to the smallest player is from 75 to 84 inches, or 9 inches. Histograms 8 Which of the following statements is NOT true? The Central Limit Theorem is applicable only for data sets comprising 30 or more samples. For the Central Limit Theorem to be true, you must have a large sample, the underlying population must be normally distributed, and the standard deviation should not be finite. According to the Central Limit Theorem, the mean of the sampling distribution is equal to the population mean. For a large enough sample size, the Central Limit Theorem states that the sample means of repeated samples of a population are normally distributed. The Central Limit Theorem (CLT) puts no restrictions on the type of population you draw from. It could be normal, uniform, skewed, etc. So, the CLT does not say you must draw from a normal population. It also requires that the variance is finite, which simply means it must be defined. Shape of a Sampling Distribution 9 Sara wonders what percentage of her students answered over 75% of the quiz questions ly. The relative cumulative frequency of students who earned a score of 31 or higher on the quiz is __________%. 88% 6% 44% 12% To get the relative frequency of 31 or higher, we need to find the cumulative number of 31 or more. We simply add up any bin that has the number 31 or more, such as the bin that shows scores of 31-35 and 36-40. This would be:
To get relative frequency, we will take this cumulative number and divide it by the total number of students. Cumulative Frequency 10 A baseball scout recorded the type of pitch a pitcher threw during a game and whether it was thrown for a strike or a ball. Which of the following is a true statement about the stacked bar chart? The pitcher threw under 100 total pitches. The pitcher threw about the same number of strikes when throwing changeups as he did throwing fastballs. The pitcher’s most accurate pitch (highest percent strikes) is the curveball. Over half of the sliders the pitcher threw were balls. Recall that strikes are the lighter color. So if we look at the lighter part of each graph, we can note strikes for changeup is 10 to 35 or about 25 strikes. For fastball it is 22 to 47 or 25 strikes. So there are about the same number of strikes for both of these pitches. Stack Plots 11 The graph in the figure shows the Gross Domestic Product (GDP) from 2008-2011. The segment of the graph that corresponds to almost no GDP growth at all is __________. C A D B In this graph, a segment that is horizontal would show no change in the consumer price index. This occurs at segment C. Line Charts and Time-Series Diagrams 12 Joe is playing a game in which he has to roll two six-sided dice. In his past ten rolls, he has rolled a sum of one 2, two 5s, three 7s, two 8s, one 10, and one 12. The weighted mean of all of Joe’s dice rolls is __________. 6.8 7.3
7.5 7.1 In order to get the weighted average, we use the following formula: The weight for each value is the number of times a value counts towards the total. For example, the value 2 occurred once, the value of 5 occurred twice, etc. Weighted Mean 13 Which of the following statements ly describes a measure of center? The mean and median can be used to summarize any quantitative data. The mean is unaffected by extreme values in a small data set. The median is calculated by adding all of the values in a data set and then dividing by the total number of values. There can only be one mode in any given set of data. The median can be used to describe any qualitative data. Recall that for data that is quantitative, if the dataset is finite there is always a defined mean and median. There is not always a mode. Measures of Center 14 Which of the following statements ly describes the variance of a data set? The variance has the same units as the standard deviation. The variance is calculated using the median. The variance is the square of the standard deviation. The variance has the same units as the data set. The variance is the square root of the standard deviation. In order to go from the variance to the standard deviation (SD), we take the square root. Conversely, to get the variance from the SD we simply square the SD. Standard Deviation 15 The data below shows the number of text messages received by a group of students in a day. How many students received 10 to 13 messages? 4
6 9 5 The dot plot shows us how many values are at each point. To find the number of students who received 10 to 13 messages, count the number of x's in at each value: 1 student received 10 messages, 2 students received 11 messages, 2 students received 12 messages, and 4 students received 13 messages. If we count up the number of x's from 10 and 13 values, we see that there are 9 x's, or 9 students. Dot Plots 16 Using the box-and-whisker plot, match each description with the value. First Quartile Second Quartile Third Quartile A. 52 B. 70 C. 33 D. 29 E. 40 Recall the box shows us Q1, Q2, and Q3. The ends of the box are Q1 and Q3 with the lowest edge (33) being Q1 and the highest (52) being Q3. The line in the box is Q2 or the median, which is 40. Five Number Summary and Boxplots 17 Jenova has scored ten standardized tests with scores of 65, 88, 46, 72, 77, 90, 95, 59, 66, and 83. The standardized test score that represents the sixtieth percentile is __________.
66 59 65 77 If we note that there are 10 values, so the 60th percentile can be found with the following calculation: This tells us that we need to find the 6th ordered value. The 6th value is 77. Percentiles 18 Which of the following statements is NOT true about the normal distribution? The normal distribution is symmetric about the mean. A large portion of the data is located near the center in a normal distribution. The normal distribution can be described as “bell-shaped.� The normal distribution is an example of a bimodal distribution. We only need to know the mean and standard deviation in order to completely describe a normal distribution. The normal distribution is a bell-shaped symmetic distribution with only one peak. So it is not bi-modal (i.e. 2 modes), but is unimodal, which means it has 1 mode. Normal Distribution 19 An outlier is which of the following? Any value in a data set that is larger than twice the mean value A value in a data set that is the highest or lowest of the values in the data set A value in a data set that is significantly higher or lower than most of the values in the data set A value in a data set that is only significantly lower than most of the values in the data set Any value in a data set that is larger than twice the median value An outlier is data that doesn't fit with other data. It is either much larger or much smaller than the other data. Outliers and Modified Boxplots
20 Rick is an engineer testing the stress required to break samples of steel. He measured the failure stress of 50 samples and found the mean failure stress to be 350 MPa, with a standard deviation of 25 MPa. If the distribution is normal, the percentage of the data that lies within two standard deviations of the mean is approximately __________. 99.7% 5% 95% 68% The normal distribution follows the empirical rule, which tells us that within 2 standard deviations we should find 95% of the data. 68-95-99.7 Rule 21 Matt just received his test back. He scored a 78 out of a possible 90 points. His teacher told him the mean score on the test was a 70, with a standard deviation of 5. Matt’s z-score for the test was __________. 2.4 -2.4 1.6 -1.6 Recall that the z-score can be calculated with the following formula: The given value is 78 points, the mean is 70 points, and the standard deviation is 5 points. Plug these values in to get the following z-score: This also tells us that 78 is 1,6 z-scores or standard deviations above the mean. Standard Scores and Z-Scores 22 Katherine, Jonathan, and Ryan are very competitive friends who went bowling. Afterwards, two of them decided to make bar graphs to plot their scores. Who do you think made Graph 1 and why? Jonathan, because he wanted to make the scores appear reasonably close. Ryan, because he wanted to accurately show each person’s score. Katherine, because she wanted to make the scores appear reasonably close.
Jonathan, because he wanted to make the scores appear very different. Katherine, because she wanted to make the scores appear very different. Although we cannot know for sure, it looks like the bar that is most different between the 2 is for Katherine. She has a much larger value in graph 1. Since they are competitive, it would be logical to assume she overestimated her score to look the best and thus created graph 1. Misleading Graphical Displays 23 Determine if each graph is positively skewed, negatively skewed, or symmetrical. = Answer = In Answer Positively (Right) Skewed Distribution
Negatively (Left) Skewed
Symmetrical
Recall that skew tells us the direction of the tail. So a tail to the right is right skewed, while a tail to the left implies left skewed. If the graph is the same on both sides, we refer to it as symmetric. Finally, a graph with 2 peaks is bimodal. Shapes of Distribution 24 Consider the times (in seconds) that it took children and adults to solve a Rubik’s cube at a competition. What does the circled section represent? It took 7 children 12 seconds to solve the Rubik’s cube. It took 7 children 21 seconds to complete the Rubik’s cube. It took 21 children 7 seconds to solve the Rubik’s cube. Two children took over 70 seconds to solve the Rubik’s cube. Recall, that a stem and leaf shows the data in stem and leaf form. So, the stem of 7 implies that this is in the tens, so 70 seconds. So the 1 and 2 mean 70+1 or 71 seconds and 70+2 = 72 seconds. This means we can say 2 children took a bit over 70 seconds to complete the cube. Stem-and-Leaf Plots 25 Which of the following two statements are true? The range is found by subtracting the minimum value from the maximum value.
The interquartile range is better than the standard deviation to describe skewed data sets. The interquartile range covers the middle 75% of the data set. The range is found by subtracting the maximum value from the minimum value. The definition of the range is simply the max - min value, which is subtracting the min from the max. Range and Interquartile Range (IQR) 26 In a poll of 216 voters, 134 said they would vote for the candidate from Party X, 52 said they would vote for the candidate from Party Y, and 30 said they would vote for the candidate from Party Z. If a pie chart were to be made showing the support for each candidate, the smallest central angle would be ________ degrees. 87 50 30 52 Recall that to get the angle for something in a pie chart we use the following formula: So in this case, the smallest central angle will be associated with the candidate with the least about of votes, which would be from 30 votes for Party Z. The central angle for the candidate from Party Z would be: So the smallest angle would be 50 degrees. 1 The dotplot below shows the number of text messages received by a group of students in a day. How many students received less than 15 messages? 15 20 4 9 If we sum up the X's that represent an individual receiving less than 15 messages, we need to include the number of students who received 14 messages, 13 messages, 12 messages, etc. This looks like: So there are 15 students who received less than 15 messages.
Dot Plots 2 Select the statement that ly describes a normal distribution. It is a positively skewed distribution, as the extreme values are greater than the median. It is a symmetric distribution, as the mean and the median are the same. It is a negatively skewed distribution, as the extreme values are less than the median. It is a uniform distribution, as all of the values have equal frequency. A normal distribution is a bell-shaped and symmetric distribution. So it has a smooth peak, which tells us the mean and median are the same. Normal Distribution 3 In which of these cases should the median be used? When the data has extreme values When the data has nominal values When the data has small variance When data has no outliers Since the mean uses the actual values in the data, it is most affected by outliers and skewness. So, we only want to use the mean when the data is symmetric as a measure of centrality. When the data is skewed or has extreme values, the median is a better measure since it is not as sensitive to these values. Measures of Center 4 Let x stand for the percentage of an individual student's math test score. 64 students were sampled at a time. The population mean is 78 percent and the population standard deviation is 14 percent. What is the standard deviation of the sampling distribution of sample means? 64 1.75 0.22 14 The standard deviation of the sampling distribution is Center and Variation of a Sampling Distribution
5 The formula for standard deviation of a sample is: Find the standard deviation of the following data set that has a mean of 6.75: 4, 6, 7, 10 Answer choices are rounded to the hundredths place. 2.17 6.50 2.50 6.25 If we first note the mean of the data is 27/4 = 6.75, we can then get the variance of the data and note that it is: If we note that the standard deviation(SD) is simply the square root of the variance, then the SD = Standard Deviation 6 The first quartile (Q1) value from the above box plot is __________. 29 52 40 33 Note the value for Q1 is the left edge of the box, which is 33. Five Number Summary and Boxplots 7 In a poll of 300 preschoolers, 125 said they preferred chocolate ice cream, 71 said they preferred vanilla, 100 said they preferred cookies & cream, and 4 said they had never eaten ice cream. If a pie chart were to be made showing the preference for each flavor, the central angle for the chocolate ice cream sector would be __________. 124째 5째 150째 41째
Recall that to get the angle for something in a pie chart we use the following formula: So in this case, the central angle for the chocolate ice cream sector would be: Bar Graphs and Pie Charts 8 Which of the following statements is true?
The Central Limit Theorem is applicable only for data sets comprising thirty or more samples.
The Central Limit Theorem is applicable only for data sets comprising less than thirty samples.
The Central Limit Theorem is applicable only for data sets comprising more than thirty samples.
The Central Limit Theorem is applicable only for data sets comprising exactly thirty samples.
Recall that the Central Limit Theorem outlines that when the sample size is large, then the distribution of sample means will be approximately normal. For most distributions, the sample size is n ≼ 30, meaning a sample of 30 or more observations is considered a good sample size.
Shape of a Sampling Distribution
9 For a class reading competition, the students were asked to read a book. Mike, Jack, and Rayon discussed the numbers of pages they read on the first day. One of them made a graph to represent the data.
Who made the graph, and why?
Mike, because he wanted to accurately show the amount read by each person.
Mike, because he wanted to make the amount read by each person appear reasonably close.
Rayon, because he wanted to make the amount read by each person appear very different.
Jack, because he wanted to make it look like he read significantly more than the others.
Since there was a competition, the person who most likely made this graph would want to represent themselves favorably. Since Jack has the most pages, it would probably be him.
Misleading Graphical Displays
10 A baseball scout recorded the type of pitch a pitcher threw during a game and whether it was thrown for a strike or a ball.
Which of the following statements about the stack plot is true?
The pitcher threw the curveball most often.
The pitcher's least accurate pitch (lowest percentage of strikes) is the slider.
The pitcher threw the changeup the most often.
The pitcher's least accurate pitch (lowest percentage of strikes) is the curveball.
For curveballs, it seems like roughly 50% are thrown for a strike. In the other pitches, the percentage of pitches thrown for strikes all seem to be greater than 50% since the strike area is more than half of the total pitches.
Stack Plots
11 Which of the following statements about a positively skewed distribution is true?
The distribution of the data features two modes.
The mean, median, and mode have the same values.
The distribution of the data tails to the right of the median.
The distribution of the data tails to the left of the median.
Skewness refers to how the data trends to the left or right. If a dataset is skewed, it is not symmetric. The direction of the tail of a distribution tells you which direction the skew lies. If there is positive skew, this implies the skew is to the right. If the distribution trends to the right, it will have a mean that is larger than the median due to those higher values.
Shapes of Distribution
12 Consider the histogram showing the heights of individuals on a basketball team.
How many players are taller than 78 inches?
4
12
6
3
If we sum up the frequencies for heights of people who are taller than 78 inches, the totals are:
So there are 12 people taller than 78 inches.
Histograms
13 The midterm exam scores obtained by boys and girls in a class are listed in the table below.
What does the circled section represent?
Eight boys scored over 10 marks on the exam.
Eight boys scored 12 marks on the exam.
Two boys scored between 80 and 89 marks on the exam.
Twelve boys scored 8 marks on the exam.
If we recall that the stem and leaf can give us the actual values in the data set, then the circle corresponds to 81 and 82. We can then note that there are two scores from boys between 80 and 89.
Stem-and-Leaf Plots
14 Mike's electronics store sold 12, 10, 9, 5, 14, 10, and 10 cell phones on each of the seven days of a week. The mean number of cell phones sold by the store for the week was __________.
9 phones
5 phones
10 phones
70 phones
To get the median we first order the data and take the middle value. The ordered values are:
Since there are an odd number (n=7) of values we simply take the middle, which is 10.
Mean, Median, and Mode
15 In a survey to rate the pizzas served by a pizza parlor, 250 people rated their agreement with the statement, “The pizzas here are one of the best I’ve ever had.” The answers were put into a table. Rating Frequency Strongly Agree
27
Agree 50 Neutral
75
Disagree
54
Strongly Disagree
44
The relative frequency of people who strongly agree with the statement is __________.
20%
10.8%
27%
17.6%
To get the relative frequency, we take the frequency of the value and divide it by the total number. So in this case for strongly agree, the relative frequency would be:
Frequency Tables
16 Select the false statement about standard deviation.
It is a measure of how spread out the values of a data set are.
It is calculated using the mean.
It is the square root of variance.
It is the average of the squared differences of the values from the mean.
Recall that the standard deviation (SD) is equal to the square root of the variance and the variance is the average of the squared distances from the mean. So, the standard deviation is the average distance to the mean.
Standard Deviation
17 Ralph records the time it takes for each of his classmates to run around the track one time. As he analyzes the data on the graph, he notices very little variation between his classmates’ times. Which component of data analysis is Ralph observing?
An outlier in the data set
The overall shape of the data
The center of the data set
The overall spread of the data
Since Ralph is looking at the variation of data, this is examining the spread of the data.
Data Analysis
18 Hannah noted the height of each student in her class and found that the mean height of the students is 56 inches, with a standard deviation of 1.2 inches. The height of one of the students, James, is 59 inches.
What is the z-score for James' height?
2.5
-2.5
-3.6
3.6
Recall that the z-score can be calculated with the following formula:
The given value is 59 inches, the mean is 56 inches, and the standard deviation is 1.2 inches. Plug these values in to get the following z-score:
This also tells us that 59 is 2.5 z-scores or standard deviations above the mean.
Standard Scores and Z-Scores
19 Sara wonders what percentage of her students answered at least half of the quiz questions inly.
The cumulative frequency of students who earned a score of 20 or lower on the quiz is __________.
68
34
54
27
To get cumulative frequency of 20 or less, we simply add up any bin that has the number 20 or less, such as the bin that shows scores of 1-5, 6-10, 11-15, and 16-20.
This would be:
Cumulative Frequency
20 At Brent's school, the final grade for his U.S. History course is weighted as follows: Tests: 30% Quizzes: 50% Homework: 20% Brent has an average of 82% on his tests, 94% on his quizzes, and 50% on his homework. What is Brent's weighted average?
74.8%
75.3%
69.6%
81.6%
In order to get the weighted average we use the following formula:
Weighted Mean
21 Jerry graded seven standardized tests with scores of 60, 74, 41, 87, 94, 79, and 57. Which standardized test score represents the 50th percentile?
41
57
74
79
The 50th percentile will be the median, or middle number. Make sure to first order the data.
The middle number is the 4th value, or 74.
Percentiles
22 Which of the following statements is true?
A standard normal curve has a mean of 0 and a standard deviation of 2.
A standard normal curve has a mean of 1 and a standard deviation of 2.
A standard normal curve has a mean of 0 and a standard deviation of 1.
A standard normal curve has a mean of 1 and a standard deviation of 1.
The standard normal distribution is a normal distribution that is centered at 0 and a standard deviation of 1.
Standard Normal Distribution
23 Naomi weighed 50 patients for a medical study using a scale that measures to the nearest whole pound. She then calculated the mean weight as 176 pounds, with a standard deviation of 12 pounds. If the distribution is normal, what percent of the data lies between 140 pounds and 212 pounds?
68%
99.7%
34%
95%
Recall that if the data is normal, then the 68-95-99.7 rule applies which states that 68% of all data points fall within one standard deviation of the mean, 95% of all data points fall within two standard deviations of the mean, and 99.7% of all data points fall within three standard deviations of the mean.
140 pounds and 212 pounds are both 36 pounds from the mean of 176 pounds, which is the same as three standard deviations (12 pounds * 3) in either direction. This tells us that 99.7% of the data should lie between 140 cm to 212 cm.
68-95-99.7 Rule
24 A data set has its first and third quartiles as 9 and 17, respectively. Which of the following data points would be considered an outlier for the data set?
41
27
3
17
To find an outlier we note the lower bound and upper bound for outliers are:
First, find the interquartile range:
Next, plug this value in for IQR, along with Q1 and Q3 to find the lower and upper bounds.
Anything lower than -3 and larger than 29 will be an outlier. 41 is outside of these bounds and is an outlier.
Outliers and Modified Boxplots
25 The graph below shows the change in passenger load factor for all scheduled airlines in the United States over different months of a year.
Which segment of the graph indicates no change in the passenger load?
C
B
D
A
In the graph, when the segment is horizontal, this shows no change in the percentage of passengers. This would indicate no change in passenger load. This is segment B.
Line Charts and Time-Series Diagrams
26 Select the statement that is TRUE.
The interquartile range covers 100% of a data set.
The interquartile range is calculated by adding the first quartile with the third quartile.
The interquartile range is the average value of a data set.
The interquartile range is calculated by subtracting the first quartile from the third quartile.
Recall that the interquartile range is the difference between Q3 and Q1 and can be calculated by subtracting the first quartile from the third quartile:
Range and Interquartile Range (IQR)