Perplex
  • Lessons
  • Problems
  • Speed Run
  • Practice Tests
  • Skill Checklist
  • Review Videos
  • All Content
  • Landing Page
  • Sign Up
  • Login
  • Perplex
    IB Math AIHL
    /
    Bivariate Statistics
    /

    Skills

    Skill Checklist

    Track your progress across all skills in your objective. Mark your confidence level and identify areas to focus on.

    Track your progress:

    Don't know

    Working on it

    Confident

    📖 = included in formula booklet • 🚫 = not in formula booklet

    Track your progress:

    Don't know

    Working on it

    Confident

    📖 = included in formula booklet • 🚫 = not in formula booklet

    Bivariate Statistics

    Skill Checklist

    Track your progress across all skills in your objective. Mark your confidence level and identify areas to focus on.

    10 Skills Available

    Track your progress:

    Don't know

    Working on it

    Confident

    📖 = included in formula booklet • 🚫 = not in formula booklet

    Track your progress:

    Don't know

    Working on it

    Confident

    📖 = included in formula booklet • 🚫 = not in formula booklet

    Linear Regression

    7 skills
    Plotting approximate best fit line
    SL 4.4

    Best fit lines can also be drawn approximately by eye. We start by finding the average x and y, giving the point (xˉ,yˉ​). We then take a ruler and place it on this point, and adjust the slope until we find a reasonable best fit line.


    Powered by Desmos

    Regression line y on x
    SL 4.4

    Linear regression is a statistical method used to model the relationship between two variables when data is given as pairs of points (x,y). We fit a straight line (called the regression line) that minimizes the average vertical distance from the points:

    Powered by Desmos


    The general equation of the regression line is:

    y=ax+b

    where a is the slope and b is the y-intercept.


    The values of a and b can be found using a calculator:

    • Use Stat>Edit to fill in x- and y-values into L1​ and L2​.

    • Then, press Stat, right arrow to the CALC menu, and select 4:LinReg(ax+b).

    Pearson's Product-Moment Correlation Coefficient
    SL 4.4

    Pearson's product-moment correlation coefficient, denoted by r, measures the strength and direction of a linear relationship between two numerical variables x and y. Its value always lies between −1 and +1:

    • r=+1: perfect positive linear relationship

    • r=−1: perfect negative linear relationship

    • r=0: no linear relationship

    A positive value means y generally increases as x increases; a negative value means y generally decreases as x increases. The closer r is to ±1, the stronger the linear relationship.


    If you clickmode, scroll to STAT DIAGNOSTICS , hover over ON, and click ENTER, then any time you perform a linear regression, the calculator will provide Pearson's coefficient in addition to the regression line.

    Predicting y from x
    SL 4.4

    Once we have a regression line y=ax+b, we can use it to predict y by plugging in a value of x.

    Danger of extrapolation
    SL 4.4

    When using a regression line to predict y from x, we need to be aware of the danger of extrapolation. This occurs when we try to predict y for a value of x far outside the range of x values in our data. For such an x, we cannot trust that the relationship is the same.

    Limitations of predicting x from y
    SL 4.4

    While it is possible to use a regression line y=ax+b to predict x with

    x=ay−b​,

    this is not a reliable process. The best fit line is determined to minimize the difference between the real y’s and the predicted y’s,so the difference between real and predicted values for x may be much larger.

    Spearman’s rank correlation coefficient
    SL AI 4.10

    Spearman's rank correlation coefficient rs​ measures how close data is to be monotonic - either solely increasing or solely decreasing. Spearman's coefficient is calculated by ranking x- and y-values from least to greatest and performing a linear regression on the ranked lists.


    If rs​=1, then the data is perfectly monotonic increasing: as one variable increases, the other does too.

    If rs​=−1, then the data is perfectly monotonic decreasing: as one variable increases, the other decreases.


    Spearman's coefficient is useful because it captures all monotonic relationships - unlike Pearson's coefficient, which only measures linear monotonic relationships. Consequently, Spearman's coefficient is not as sensitive to outliers as Pearson's coefficient.

    Non-linear regression & residuals

    3 skills