Linear Models

appex-01-linear-models
$$Y = \beta_0 + \beta_1 X + \epsilon$$
We estimate this with

$$\hat{y} = \hat{\beta}_0 + \hat{\beta}_1 x$$
$$Y_i = \beta_0 + \beta_1 X_i + \epsilon_i, \qquad \epsilon_i \sim N(0, \sigma^2)$$

Writing out all $n$ observations:

$$\begin{aligned}
Y_1 &= \beta_0 + \beta_1 X_1 + \epsilon_1 \\
Y_2 &= \beta_0 + \beta_1 X_2 + \epsilon_2 \\
&\;\;\vdots \\
Y_n &= \beta_0 + \beta_1 X_n + \epsilon_n
\end{aligned}$$

In matrix form:

$$\begin{bmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_n \end{bmatrix} =
\underbrace{\begin{bmatrix} 1 & X_1 \\ 1 & X_2 \\ \vdots & \vdots \\ 1 & X_n \end{bmatrix}}_{\mathbf{X}:\ \text{design matrix}}
\begin{bmatrix} \beta_0 \\ \beta_1 \end{bmatrix} +
\begin{bmatrix} \epsilon_1 \\ \epsilon_2 \\ \vdots \\ \epsilon_n \end{bmatrix}$$
What are the dimensions of $\mathbf{X}$?
$$\underbrace{\begin{bmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_n \end{bmatrix}}_{\mathbf{Y}:\ \text{vector of responses}} =
\begin{bmatrix} 1 & X_1 \\ 1 & X_2 \\ \vdots & \vdots \\ 1 & X_n \end{bmatrix}
\underbrace{\begin{bmatrix} \beta_0 \\ \beta_1 \end{bmatrix}}_{\boldsymbol\beta:\ \text{vector of parameters}} +
\underbrace{\begin{bmatrix} \epsilon_1 \\ \epsilon_2 \\ \vdots \\ \epsilon_n \end{bmatrix}}_{\boldsymbol\epsilon:\ \text{vector of error terms}}$$

What are the dimensions of $\boldsymbol\beta$? Of $\boldsymbol\epsilon$? Of $\mathbf{Y}$?
$$\mathbf{Y} = \mathbf{X}\boldsymbol\beta + \boldsymbol\epsilon$$
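In R, the design matrix for a model can be inspected directly with `model.matrix()`; a minimal sketch with made-up values:

```r
# a small illustrative predictor (values made up)
x <- c(1.2, 3.4, 5.6)

# model.matrix() builds the design matrix for a formula:
# a column of 1s for the intercept, then the predictor
model.matrix(~ x)
```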
$$\begin{bmatrix} \hat{y}_1 \\ \hat{y}_2 \\ \vdots \\ \hat{y}_n \end{bmatrix} =
\begin{bmatrix} 1 & x_1 \\ 1 & x_2 \\ \vdots & \vdots \\ 1 & x_n \end{bmatrix}
\begin{bmatrix} \hat\beta_0 \\ \hat\beta_1 \end{bmatrix}$$

$$\hat{y}_i = \hat\beta_0 + \hat\beta_1 x_i$$
How are $\hat\beta_0$ and $\hat\beta_1$ chosen? What are we minimizing?
We minimize the sum of squared errors, $\sum \epsilon_i^2$. How could we re-write this with $y_i$ and $x_i$?
Let's put this back in matrix form:

$$\sum \epsilon_i^2 = \begin{bmatrix} \epsilon_1 & \epsilon_2 & \dots & \epsilon_n \end{bmatrix}
\begin{bmatrix} \epsilon_1 \\ \epsilon_2 \\ \vdots \\ \epsilon_n \end{bmatrix} = \boldsymbol\epsilon^T\boldsymbol\epsilon$$

What can we replace $\boldsymbol\epsilon$ with? (Hint: look back a few slides)
$$\sum \epsilon_i^2 = (\mathbf{Y} - \mathbf{X}\boldsymbol\beta)^T(\mathbf{Y} - \mathbf{X}\boldsymbol\beta)$$
OKAY! So this is the thing we are trying to minimize with respect to $\boldsymbol\beta$:

$$(\mathbf{Y} - \mathbf{X}\boldsymbol\beta)^T(\mathbf{Y} - \mathbf{X}\boldsymbol\beta)$$

In calculus, how do we minimize things?
Taking the derivative with respect to $\boldsymbol\beta$ and setting it equal to zero gives the normal equations:

$$\mathbf{X}^T\mathbf{Y} = \mathbf{X}^T\mathbf{X}\hat{\boldsymbol\beta}$$

$$\begin{bmatrix} \hat\beta_0 \\ \hat\beta_1 \end{bmatrix} = (\mathbf{X}^T\mathbf{X})^{-1}\mathbf{X}^T\mathbf{Y}$$
$$\hat{\mathbf{Y}} = \mathbf{X}\hat{\boldsymbol\beta} = \underbrace{\mathbf{X}(\mathbf{X}^T\mathbf{X})^{-1}\mathbf{X}^T}_{\text{hat matrix}}\mathbf{Y}$$
Why do you think this is called the "hat matrix"?
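The closed-form solution can be verified numerically; a quick sketch with simulated data (the names and true coefficients here are illustrative, not from the sparrow data):

```r
set.seed(42)
n <- 50
x <- rnorm(n)
y <- 2 + 3 * x + rnorm(n)   # true beta0 = 2, beta1 = 3

X <- cbind(1, x)            # design matrix: column of 1s, then x

# beta-hat = (X'X)^{-1} X'Y
beta_hat <- solve(t(X) %*% X) %*% t(X) %*% y

# matches the coefficients lm() reports
coef(lm(y ~ x))
```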
We can generalize this beyond just one predictor:

$$\begin{bmatrix} \hat\beta_0 \\ \hat\beta_1 \\ \vdots \\ \hat\beta_p \end{bmatrix} = (\mathbf{X}^T\mathbf{X})^{-1}\mathbf{X}^T\mathbf{Y}$$

What are the dimensions of the design matrix, $\mathbf{X}$, now?
$$\mathbf{X} = \begin{bmatrix}
1 & X_{11} & X_{12} & \dots & X_{1p} \\
1 & X_{21} & X_{22} & \dots & X_{2p} \\
\vdots & \vdots & \vdots & & \vdots \\
1 & X_{n1} & X_{n2} & \dots & X_{np}
\end{bmatrix}$$
The coefficient for $x$ is $\hat\beta$ (95% CI: $LB_{\hat\beta}$, $UB_{\hat\beta}$). A one-unit increase in $x$ yields an expected increase in $y$ of $\hat\beta$, holding all other variables constant.
$$\text{Var}(\hat{\boldsymbol\beta}) = \sigma^2(\mathbf{X}^T\mathbf{X})^{-1}$$

$$SE(\hat\beta_1)^2 = \frac{\sigma^2}{\sum_{i=1}^n (x_i - \bar{x})^2}$$
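The variance formula can be checked against R's `vcov()`; a sketch with simulated data (all names and values illustrative):

```r
set.seed(1)
n <- 30
x <- rnorm(n)
y <- 1 + 2 * x + rnorm(n)
fit <- lm(y ~ x)

X <- model.matrix(fit)                  # design matrix of the fit
sigma2 <- sum(resid(fit)^2) / (n - 2)   # estimate of sigma^2

sigma2 * solve(t(X) %*% X)              # same as vcov(fit)
```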
Let's look at a sample of 116 sparrows from Kent Island. We are interested in the relationship between Weight and Wing Length.

A quick pause for R
library(tidyverse)

The broom package takes the messy output of built-in R functions, such as lm, and turns them into tidy data frames.

library(broom)
park(drive(start_car(find("keys")), to = "campus"))
find("keys") %>% start_car() %>% drive(to = "campus") %>% park()
To send results to a function argument other than the first one, or to use the previous result for multiple arguments, use the . placeholder:
starwars %>% filter(species == "Human") %>% lm(mass ~ height, data = .)
## 
## Call:
## lm(formula = mass ~ height, data = .)
## 
## Coefficients:
## (Intercept)       height  
##     -116.58         1.11
How can we quantify how much we'd expect the slope to differ from one random sample to another?
lm(Weight ~ WingLength, data = Sparrows) %>%
  tidy()

## # A tibble: 2 x 5
##   term        estimate std.error statistic  p.value
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>
## 1 (Intercept)    1.37     0.957       1.43 1.56e- 1
## 2 WingLength     0.467    0.0347     13.5  2.62e-25

How do we interpret this?
How do we know what values of this statistic are worth paying attention to?
lm(Weight ~ WingLength, data = Sparrows) %>%
  tidy(conf.int = TRUE)

## # A tibble: 2 x 7
##   term        estimate std.error statistic  p.value conf.low conf.high
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>    <dbl>     <dbl>
## 1 (Intercept)    1.37     0.957       1.43 1.56e- 1   -0.531     3.26 
## 2 WingLength     0.467    0.0347     13.5  2.62e-25    0.399     0.536
How are these statistics distributed under the null hypothesis?
The distribution of test statistics we would expect given the null hypothesis is true, $\beta_1 = 0$, is a t-distribution with $n - 2$ degrees of freedom.
How can we compare this line to the distribution under the null?
The p-value is the probability of getting a statistic as extreme or more extreme than the observed test statistic, given the null hypothesis is true.
The proportion of area less than 1.5
pt(1.5, df = 18)
## [1] 0.9245248
The proportion of area greater than 1.5
pt(1.5, df = 18, lower.tail = FALSE)
## [1] 0.07547523
The proportion of area greater than 1.5 or less than -1.5.
pt(1.5, df = 18, lower.tail = FALSE) * 2
## [1] 0.1509505
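The same calculation applies to the sparrow slope: with 116 sparrows, the t-distribution has n - 2 = 114 degrees of freedom, and the observed statistic is 13.5, so the two-sided p-value is:

```r
# two-sided p-value for the observed t-statistic of 13.5
# (vanishingly small, in line with the p.value column above)
pt(13.5, df = 114, lower.tail = FALSE) * 2
```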
If we use the same sampling method to select different samples and compute an interval estimate for each sample, we would expect the true population parameter ($\beta_1$) to fall within the interval estimates 95% of the time.
$$\hat\beta_1 \pm t^* \times SE_{\hat\beta_1}$$
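Plugging in the slope's estimate (0.467) and std.error (0.0347) from the sparrow model, with df = 114, closely reproduces the confidence limits tidy() reports (small differences come from rounding the inputs):

```r
t_star <- qt(0.975, df = 114)        # critical value for a 95% CI

# estimate +/- t* x SE for the slope
0.467 + c(-1, 1) * t_star * 0.0347   # close to the reported (0.399, 0.536)
```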
lm(Weight ~ WingLength, data = Sparrows) %>%
  tidy(conf.int = TRUE)

## # A tibble: 2 x 7
##   term        estimate std.error statistic  p.value conf.low conf.high
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>    <dbl>     <dbl>
## 1 (Intercept)    1.37     0.957       1.43 1.56e- 1   -0.531     3.26 
## 2 WingLength     0.467    0.0347     13.5  2.62e-25    0.399     0.536
Using the information here, how could I predict a new sparrow's weight if I knew the wing length was 30?

lm(Weight ~ WingLength, data = Sparrows) %>%
  tidy()

## # A tibble: 2 x 5
##   term        estimate std.error statistic  p.value
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>
## 1 (Intercept)    1.37     0.957       1.43 1.56e- 1
## 2 WingLength     0.467    0.0347     13.5  2.62e-25
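Using the estimates from the output (intercept 1.37, slope 0.467), the prediction is a direct plug-in:

```r
# predicted weight for a wing length of 30
1.37 + 0.467 * 30   # 15.38
```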
What is the residual sum of squares again?

$$RSS = \sum (y_i - \hat{y}_i)^2$$

And the total sum of squares:

$$TSS = \sum (y_i - \bar{y})^2$$
What could we use to determine whether at least one predictor is useful?

$$F = \frac{(TSS - RSS)/p}{RSS/(n - p - 1)} \sim F_{p,\, n-p-1}$$

We can use an F-statistic!
lm(Weight ~ WingLength, data = Sparrows) %>%
  glance()

## # A tibble: 1 x 11
##   r.squared adj.r.squared sigma statistic  p.value    df logLik   AIC   BIC
##       <dbl>         <dbl> <dbl>     <dbl>    <dbl> <int>  <dbl> <dbl> <dbl>
## 1     0.614         0.611  1.40      181. 2.62e-25     2  -203.  411.  419.
## # … with 2 more variables: deviance <dbl>, df.residual <int>
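Since $R^2 = 1 - RSS/TSS$, the F formula can be rewritten in terms of $R^2$ as $F = \frac{R^2/p}{(1 - R^2)/(n - p - 1)}$. Plugging in the values from glance() (with n = 116 sparrows and p = 1 predictor) recovers the reported statistic:

```r
r2 <- 0.614   # r.squared from glance()
n  <- 116     # sparrows in the sample
p  <- 1       # one predictor (WingLength)

# F = (R^2 / p) / ((1 - R^2) / (n - p - 1))
(r2 / p) / ((1 - r2) / (n - p - 1))   # approximately 181
```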
Refer to Chapter 3 for more details on these topics if you need a refresher.
Linear Models

In your RStudio Cloud session:

library(tidyverse), then library(broom)
Fit a linear model predicting mpg from am
Use the tidy() function to see the beta coefficients
Use the glance() function to see the model fit statistics