Main References & Credits
Readings:
- Tibshirani
- Andrew Ng coursera
- http://egarpor.github.io/index.html
- https://onlinecourses.science.psu.edu/stat505/node/49
- Bishop
- Site using Rstudio, Rmarkdown, bookdown. Pdf compiled using latex.
- The icons used in the notes were designed by madebyoliver, freepik and roundicons from Flaticon.
Also known as SSE: Sum of Squared Errors.↩︎
They are unique and always exist. They can be obtained by solving \(\frac{\partial}{\partial \beta_0}\text{RSS}(\beta_0,\beta_1)=0\) and \(\frac{\partial}{\partial \beta_1}\text{RSS}(\beta_0,\beta_1)=0\).↩︎
If \(\beta_1 = 0\) this means \(COV(X,Y)=0\). Remember that if the covariance is null that doesn’t necessarily means that \(X\) and \(Y\) are independent, this means there is no linear relationship between them, they are maybe independents or they have other type of relationships.↩︎
Recall that SSR is different from RSS (Residual Sum of Squares)↩︎
Recall that SSE and RSS (for \((\hat \beta_0,\hat \beta_1)\)) are just different names for referring to the same quantity: \(\text{SSE}=\sum_{i=1}^n\left(Y_i-\hat Y_i\right)^2=\sum_{i=1}^n\left(Y_i-\hat \beta_0-\hat \beta_1X_i\right)^2=\mathrm{RSS}\left(\hat \beta_0,\hat \beta_1\right)\).↩︎
The \(F_{n,m}\) distribution arises as the quotient of two independent random variables \(\chi^2_n\) and \(\chi^2_m\), \(\frac{\chi^2_n/n}{\chi^2_m/m}\).↩︎
Important to be sure that \(\hat{\beta}\) is minimising RSS.↩︎
Recal that ESS is the explained sum of squares, ESS = TSS - RSS.↩︎
More complex – included here just for clarification of the
anova
’s output.↩︎Recall that \(R^2 = 1- \frac{\text{RSS}}{\text{TSS}}\)↩︎
It is defined as \(R_{adj}^2 = 1- \frac{\text{RSS}/(n-p-1)}{\text{TSS}/(n-1)} = 1- \frac{\text{RSS}}{\text{TSS}}\times\frac{n-1}{n-p-1}\)↩︎
in the formula, \(\log\) is the natural logarithm \(\ln\)↩︎
Old Faithful, is a hydrothermal geyser in Yellowstone National Park in the state of Wyoming, U.S.A., and is a popular tourist attraction. Its name stems from the supposed regularity of its eruptions. The data set comprises 272 observations, each of which represents a single eruption and contains two variables corresponding to the duration in minutes of the eruption, and the time until the next eruption, also in minutes.↩︎
Source: the famous MOOC Statistical Learning↩︎
Source: Trevor Hastie’s website↩︎
Source: Marvin Wright’s talk from Why R? 2019↩︎
An Introduction to Recursive Partitioning Using the
rpart
Routines - Details of therpart
package.↩︎rpart.plot
Package - Detailed manual on plotting withrpart
using therpart.plot
package.↩︎For classification a suggestion is
mtry
= \(\sqrt{p}\).↩︎generalized boosted models package↩︎
For classification, the suggested
mtry
for a random forest is \(\sqrt{p}\).↩︎Old Faithful, is a hydrothermal geyser in Yellowstone National Park in the state of Wyoming, U.S.A., and is a popular tourist attraction. Its name stems from the supposed regularity of its eruptions. The data set comprises 272 observations, each of which represents a single eruption and contains two variables corresponding to the duration in minutes of the eruption, and the time until the next eruption, also in minutes.↩︎
Made by Joseph J. Allaire https://github.com/jjallaire↩︎