Definition

Our objective is to explore the effect of regressor $x$ on the response variable $y$ . In this context we define our model as the following:

y = h (x, ε)

Here, y is the dependent and observable variable.

All of our assumptions are based on the $ε$ , which is the error term in the model. It is also called as residual. Our assumptions are:

$E [ε] = 0$
$V (ε) = σ^{2}$
$C o v (ε_{i}, ε_{j}) = 0$ , $i \neq = j$
$ε \sim N I D (0, σ^{2})$

Observe that $y$ is a function of a random variable, $ε$ . Thus $y$ itself is a random variable.

In the case of SLR we have one scalar response variable and one regressor. We define our model as following:

y = β_{0} + β_{1} x + ε

One can interpret $β_{0}$ as intercept and $β_{1}$ as the slope of the fitted line.

At the time we construct our model, we do not know parameters $β_{0}$ and $β_{1}$ and our aim is to estimate these from the data using Least Squares Estimation.

Model Analysis

Estimation by LSE

We estimate $β_{0}$ , $β_{1}$ and $σ^{2}$ . To do so we say that data is paired as $(x_{i}, y_{i})$ . Observe that

ε_{i} = y_{i} - (β_{0} + β_{1} x)

We define sum of squared errors as

SSE = i = 1 \sum n ε_{i}^{2}

LSE for $β_{0}$

\frac{d}{d β _{0}} SSE = 0

\frac{d}{d β _{0}} i = 1 \sum n ε^{2} = 0

\frac{d}{d β _{0}} i = 1 \sum n (y_{i} - β_{0} - β_{1} x_{i})^{2} = 0

Derivative of a sum is sum of the derivatives.

i = 1 \sum n \frac{d}{d β _{0}} (y_{i} - β_{0} - β_{1} x_{i})^{2} = 0

i = 1 \sum n - 2 (y_{i} - β_{0} - β_{1} x_{i}) = 0

- 2 i = 1 \sum n y_{i} - β_{0} - β_{1} x_{i} = 0

i = 1 \sum n y_{i} - i = 1 \sum n β_{0} - i = 1 \sum n β_{1} x_{i} = 0

i = 1 \sum n y_{i} - i = 1 \sum n β_{1} x_{i} = n β_{0}

β_{0} = i = 1 \sum n \frac{y _{i}}{n} - i = 1 \sum n \frac{β _{1} x _{i}}{n}

\hat{β_{0}} = \overset{y}{ˉ} - \hat{β_{1}} \overset{x}{ˉ}

LSE for $β_{1}$

i = 1 \sum n \frac{d}{d β _{1}} (y_{i} - β_{0} - β_{1} x_{i})^{2} = 0

i = 1 \sum n - 2 x_{i} (y_{i} - β_{0} - β_{1} x_{i}) = 0

i = 1 \sum n x_{i} y_{i} - x_{i} β_{0} - β_{1} x_{i}^{2} = 0

i = 1 \sum n x_{i} y_{i} - β_{0} i = 1 \sum n x_{i} - β_{1} i = 1 \sum n x_{i}^{2} = 0

i = 1 \sum n x_{i} y_{i} - (\overset{y}{ˉ} - \hat{β_{1}} \overset{x}{ˉ}) i = 1 \sum n x_{i} - β_{1} i = 1 \sum n x_{i}^{2} = 0

i = 1 \sum n x_{i} y_{i} - \overset{y}{ˉ} i = 1 \sum n x_{i} + \hat{β_{1}} \overset{x}{ˉ} i = 1 \sum n x_{i} - β_{1} i = 1 \sum n x_{i}^{2} = 0

\hat{β_{1}} (\overset{x}{ˉ} i = 1 \sum n x_{i} - i = 1 \sum n x_{i}^{2}) = \overset{y}{ˉ} i = 1 \sum n x_{i} - i = 1 \sum n x_{i} y_{i}

\hat{β_{1}} = \frac{( y ˉ \sum _{i = 1}^{n} x _{i} - \sum _{i = 1}^{n} x _{i} y _{i} )}{( x ˉ \sum _{i = 1}^{n} x _{i} - \sum _{i = 1}^{n} x _{i}^{2} )}

Substitute $\sum x_{i} = \overset{x}{ˉ} n$

\hat{β_{1}} = \frac{\sum _{i = 1}^{n} ( x _{i} - x ˉ ) ( y _{i} - y ˉ )}{\sum _{i = 1}^{n} ( x _{i} - x ˉ ) ^{2}}

\hat{β_{1}} = \frac{C o v ( x , y )}{V ( x )}

\hat{β_{1}} = \frac{S _{X Y}}{S _{XX}}

where

S_{X Y} = i = 1 \sum n (x_{i} - \overset{x}{ˉ}) (y_{i} - \overset{y}{ˉ})

LSE for $σ^{2}$

\hat{σ^{2}} = \frac{SSE}{n - 2}

Here $n - 2$ is the degrees of freedom.

Distribution of least squares estimates

We have estimated $β_{0}$ , $β_{1}$ and $σ^{2}$ using random samples from the data. Thus, they are random variables too.

Since the LSE is BLUE we have:

$β_{0}$

$E [β_{0}] = β_{0}$
$V (β_{0}) = (\frac{σ ^{2} ^}{S _{XX}} \frac{\sum ^{n} x _{i}^{2}}{n})$
$SE (β_{0}) = (\frac{σ ^{2} ^}{S _{XX}} \frac{\sum ^{n} x _{i}^{2}}{n})$ Then

β_{0} \sim N β_{0}, (\frac{σ ^{2} ^}{S _{XX}} \frac{\sum ^{n} x _{i}^{2}}{n})

$β_{1}$

$E [β_{1}] = β_{1}$
$V (β_{1}) = \frac{σ ^{2} ^}{S _{XX}}$
$SE (β_{1}) = \frac{σ ^{2} ^}{S _{XX}}$ Then

β_{1} \sim N β_{1}, \frac{σ ^{2} ^}{S _{XX}}

t-values

T_{i} = \frac{β _{i} ^ - β _{i} ˉ}{SE ( β _{i} )} \sim t_{(n - 2)}

Goodness-of-fit

Question

We question whether the data match the model or not.

One would say that the model is a good fit for the data if

\overset{y_{i}}{^} \approx y_{i}

Thus

ε_{i} \approx 0

Sums of squares

$SST = \sum^{n} (y_{i} - \overset{y}{ˉ})^{2}$ , total deviation in $y$ .

$SSE = \sum^{n} (y_{i} - \overset{y_{i}}{^})^{2}$ , sum of residuals.

$SSR = SST - SSE$ , deviation caused by regression.

If $\overset{y_{i}}{^} \approx y_{i}$ then $SSE \approx 0$ .

Coefficient of determination

It is defined as:
$R^{2} = 1 - \frac{SSE}{SST}$
$R^{2}$ represents the share due to $x$ in total variation in $y$ . So $R^{2} = 0.95$ means that 95% variation in $y$ is due to $x$ and 5% of the variation is due to model residuals. Such a model is considered as a good fit.

Observe that $SSE \sim 0 ⟹ R^{2} \sim 1$ Thus we say

$R^{2} \sim 1 ⟹$ model is a good-fit

$R^{2} \sim 0 ⟹$ model may not be a poor-fit. You need to conduct goodness-of-fit test.

Goodness-of-fit Test

Hypothesis

$H_{0}$ : $β_{1} = 0$ , means that $x$ has no effect on $y$ . $H_{1}$ : $β_{1} \neq =_{0}$ , $x$ has effect on $y$ . We test this hypothesis with ANOVA.

We construct our ANOVA table:

Source	DoF	Sum of Squares	Mean of SS	$F^{*}$
Regression	1	$SSR$	$MSR = \frac{SSR}{1}$	$\frac{MSR}{MSE}$
Error	$n - 2$	$SSE$	$MSE = \frac{SSE}{n - 2}$
Total	$n - 1$	$SST$
Thus one can deduct that:

F^{*} = \frac{SSR}{\frac{SSE}{n - 2}} = \frac{( n - 2 ) ( SST - SSE )}{SSE}

F^{*} \sim F_{(α, 1, n - 2)}

Usual $α$ (significance level) values are:

0.01
0.05
0.10

Rule of decision

Let $c v = F_{(α, 1, n - 2})$

Reject $H_{0}$ if $F^{*} \geq c v$ . Thus conclude that $x$ has statistically significant effect on $y$ at $α$ significance level.

Fail to reject $H_{0}$ if $F^{*} < c v$ . Therefore say that there is not enough evidence to state the effect of $x$ on $y$ .

Melih Akay 🦾

Explorer

Recent writing

Wheelie Roadmap

Invertibility of a matrix

About me

Simple Linear Regression

Definition

Model Analysis

Estimation by LSE

LSE for $β_{0}$

LSE for $β_{1}$

LSE for $σ^{2}$

Distribution of least squares estimates

$β_{0}$

$β_{1}$

t-values

Goodness-of-fit

Goodness-of-fit Test

Test of hypothesis

Graph View

Table of Contents

Backlinks

Melih Akay 🦾

Explorer

Recent writing

Wheelie Roadmap

Invertibility of a matrix

About me

Simple Linear Regression

Definition

Model Analysis

Estimation by LSE

LSE for β0​

LSE for β1​

LSE for σ2

Distribution of least squares estimates

β0​

β1​

t-values

Goodness-of-fit

Goodness-of-fit Test

Test of hypothesis

Graph View

Table of Contents

Backlinks

LSE for $β_{0}$

LSE for $β_{1}$

LSE for $σ^{2}$

$β_{0}$

$β_{1}$