Getting started

Go to the course organization on GitHub: https://github.com/sta-363-s20.
Find the repo whose name starts with hw-02 and ends with your team name (this should be the only hw-02 repo available to you).
In the repo, click the green Clone or download button and select Use HTTPS. Then click the clipboard icon to copy the repo URL.
If using RStudio Cloud, go to RStudio Cloud and into the course workspace. Create a New Project from Git Repo. You will need to click on the down arrow next to the New Project button to see this option.
If using RStudio Pro, create a new project by clicking File > New Project. Then click Version Control and Git/GitHub.
Copy and paste the URL of your assignment repo into the dialog box.
Hit OK, and you’re good to go!
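
For anyone who prefers a terminal, the cloning steps above can be sketched roughly as follows. This is only an illustration: the repo name `hw-02-your-team-name` is a made-up placeholder, and the helper `clone_repo` is a hypothetical name, not something provided by the course. It also assumes that git is installed and that you have access to the course organization.

```shell
# Assumption: placeholder repo name; substitute the hw-02 repo that ends with your team name.
REPO_NAME="hw-02-your-team-name"

# HTTPS clone URL, in the form copied by the clipboard icon on GitHub.
REPO_URL="https://github.com/sta-363-s20/${REPO_NAME}.git"

# Hypothetical helper: clone the repo unless the directory already exists,
# then list the project files.
clone_repo() {
  if [ ! -d "$REPO_NAME" ]; then
    git clone "$REPO_URL"
  fi
  ls "$REPO_NAME"
}

echo "Run clone_repo to fetch: $REPO_URL"
```

Nothing is downloaded until `clone_repo` is called. The resulting folder can then be opened as an existing-directory project in RStudio.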

For parts (a) and (b), indicate which of i. through iv. is correct. Justify your answer.

(a.) The lasso, relative to least squares, is:

(i.) More flexible and hence will give improved prediction accuracy when its increase in bias is less than its decrease in variance.
(ii.) More flexible and hence will give improved prediction accuracy when its increase in variance is less than its decrease in bias.
(iii.) Less flexible and hence will give improved prediction accuracy when its increase in bias is less than its decrease in variance.
(iv.) Less flexible and hence will give improved prediction accuracy when its increase in variance is less than its decrease in bias.

(b.) Repeat (a) for ridge regression relative to least squares.

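Suppose we estimate the regression coefficients in a linear regression model by minimizing

$$\sum_{i=1}^{n}\Big(y_i - \beta_0 - \sum_{j=1}^{p}\beta_j x_{ij}\Big)^2 \quad \text{subject to} \quad \sum_{j=1}^{p}|\beta_j| \le s$$

for a particular value of $s$. For parts (a) through (e), indicate which of i. through v. is correct. Justify your answer.

(a.) As we increase $s$ from 0, the training RSS will: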

(i.) Increase initially, and then eventually start decreasing in an inverted U shape.
(ii.) Decrease initially, and then eventually start increasing in a U shape.
(iii.) Steadily increase.
(iv.) Steadily decrease.
(v.) Remain constant.

(b.) Repeat (a) for test RSS.

(c.) Repeat (a) for variance.

(d.) Repeat (a) for (squared) bias.

(e.) Repeat (a) for the irreducible error.

Suppose we estimate the regression coefficients in a linear regression model by minimizing