CK-12 Flexbooks on Bivariate Data

Study Guides

Big Picture Bivariate data is often represented with scatterplots and line plots. The main reason to display bivariate data in such a way is to find a relationship between the two variables. The relationship is described through the correlation coefficient. Often, transformations on the data must be done so that the correlation coefficient can be used.

Key Terms Bivariate: Two variables. Scatterplot: A graph where each point represents a pair of measurements (two variables). Correlation: The relationship between bivariate data. Correlation Coefficient: A number that describes the correlation (relation) between bivariate data. Linear Regression: Using data to calculate a line that best fits that data. The line can be used to make predictions. Residual: The distance between the observed value and the expected value.

Displaying Bivariate Data Bivariate data is primarily examined to show some sort of relationship between two variables. Bivariate data usually has an independent variable and a dependent variable. The independent variable influences the dependent variable. Time is often the independent variable. We are often interested in the change a variable exhibits over time. We can see if there is any relationship between the variables by showing the data in a scatterplot. You may often see scatterplots as a series of disconnected points. They are a good way of representing bivariate data. The independent variable is on the x-axis while the dependent variable is on the y-axis.

Probability & Statistics

Bivariate Data

Line plots are also used to show change over time. A line plot is basically a scatterplot where the dots are connected chronologically (in order by time).

Correlation Three important characteristics of bivariate data:

• shape (linear, exponential, etc.) • direction • strength We are usually most interested in finding if there is any correlation in the data. The correlation describes the direction of the direction. One way to visualize the correlation is with a scatterplot. We can describe the correlation as: • positive correlation: positive slope • negative correlation: negative slope

• zero correlation: points do not have a linear trend

straight line - can be positive or negative

The more linear the data is, the stronger the linear correlation. Another way to view the strength of this correlation is to draw an ellipse (oval) around all of the data. The narrower or skinnier the ellipse is, the stronger the linear correlation.

This guide was created by Lizhi Fan and Jin Yu. To learn more about the student authors, visit http://www.ck12.org/about/about-us/team/interns.

Page 1 of 3 v1.1.9.2012

Disclaimer: this study guide was not created to replace your textbook and is for classroom or individual use only.

• perfect correlation: points on a scatterplot lie on a

Turn static files into dynamic content formats.

Create a flipbook