Mihai Ion
  • Home
  • Research
  • Teaching
  • CV
  1. FIN 421
  2. Lectures
  3. L10_11: Statistical models of returns
  • Teaching
  • FIN 525
    • Lectures
      • L00: Jupyter basics
      • L01: Introduction
      • L02: Variables, types, operators
      • L03: Data structures
      • L04: Conditionals, loops
      • L05: Functions, packages
      • L06: Pandas intro
      • L07: Pandas I/O
      • L08: Pandas filtering
      • L09: Pandas data cleaning
      • L10: Merging, reshaping datasets
      • L11: Dates, lags, sorting
      • L12: Descriptive stats
      • L13: Conditional stats, outliers
      • L14: Conditional stats applied
      • L15: Linear regression intro
      • L16: Linear regression applications
      • L17: Panel regression intro
      • L18: Robust panel regression
      • L19: Robust timeseries regression
      • L20: Backtesting - data prep
      • L21: Backtesting - sumstats
      • L22: Backtesting -returns
      • L23: Backtesting - risk adjustment
  • FIN 421
    • Lectures
      • L01: Introduction
      • L02: Analyzing past returns
      • L03: Modeling future returns
      • L04: Portfolio theory intro
      • L05: Optimal capital allocation
      • L06: Tangency portfolios
      • L07_08: Optimal asset allocation
      • L09: Review
      • L10_11: Statistical models of returns
      • L12: CAPM
      • L13: Cost of equity
      • L14: Bond pricing
      • L15: Bond yields
      • L16: Bond risk
      • L17: Valuation data processing
      • L18_19: Multiples valuation
      • L20_21: Dividend discount models
      • L22_23: Discounted cash flow analysis
      • L24: Valuation sensitivity analysis
      • L25: Options intro
      • L26: Risk management with options

On this page

  • The single-factor model
  • Application
    • Interpreting the results: coefficients, t-statistics, and pvalues
  • Systematic vs idiosyncratic risk
  • Multi-factor models

L10_11: Statistical models of returns

The single-factor model

We have seem in previous lectures that some portion of a stock’s total risk can be diversified away when the stock is part of a larger portfolio. This portion is often referred to as the stock’s idiosyncratic risk. The remaining portion, which can not be diversified away is often called its systematic risk.

The single-factor model is a statistical model which assumes that all sources of systematic risk (economic factors that affect all firms to some extent) are captured by the returns on the market portfolio. If this is the case, then firm excess returns can be written as a linear function of market excess returns (for every firm \(i\) and time \(t\)):

\[R_{i,t} - R_{f,t} = \alpha_i + \beta_i (R_{m,t} - R_{f,t}) + \epsilon_{i,t}\]

where: - \(R_{i,t}\) is the return of firm \(i\) at time \(t\) - \(R_{m,t}\) is the return of the market \(i\) at time \(t\) - \(R_{f,t}\) is the risk-free rate at time \(t\)

  • \(\beta_i (R_{m,t} - R_{f,t})\) is the portion of the firm’s return that is caused by systematic shocks

  • \(\epsilon_{i,t}\) is the portion of the firm’s return that is caused by idiosyncratic (firm-specific) shocks

  • \(\beta_i\) tells us if the firm has more or less systematic risk than the average firm in the economy (average \(\beta\) = 1)

  • \(\alpha_i\) tells us if the firm has offered a higher or lower return (on average) than what was warranted by its exposure to the market (its \(\beta\))

    • If markets are efficient and the market perfectly captures all systematic shocks, then \(\alpha\) should be zero for all firms at all times

Application

Use monthly data for TSLA from 2011 to 2020, to calculate its \(\alpha\), \(\beta\), and the percentage of its risk that can be diversified away. For data on market returns and the risk-free rate, use the data on Ken French’s website and download the monthly factors file.

#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#

Interpreting the results: coefficients, t-statistics, and pvalues

The “Intercept” row contains information about the firm’s \(\alpha\) and the “Mkt-RF” row contains information about the firm’s \(\beta\). The \(\alpha\) and \(\beta\) coefficient estimates themselves are in the “Coefficients” column.

The t-statistics for the two coefficients are in the “t Stat” column. Loosely speaking a t-statistic that larger than 2 or smaller than -2 allows us to conclude that the corresponding coefficient is statistically different from 0 (i.e. reject the null hypothesis that the coefficient is 0).

The p-values are in the “P-value” column. P-values lower than 0.05 allow us to conclude that the corresponding coefficient is statistically different from 0 with the 5% confidence level (i.e. reject the null hypothesis that the coefficient is 0).

The next two columns after the p-value give us the 95% confidence interval for each coefficient.

TSLA’s alpha has a p-value of that is smaller than 5% so we can conclude that it’s alpha is statistically significantly different from 0 at the 95% confidence level (but not at the 99% confidence level). The 5% is a common “significance level” used in hypothesis testing (also called the Type 1 error, or, confusingly for us, the alpha of the test). One minus the significance level is called the “confidence level” of the test.

The fact that the alpha is positive (and statistically different from 0) means that, based on the single-index model, TSLA seems to be undervalued. A negative alpha would mean the stock in overvalued.

If we can not reject the null hypothesis that alpha is 0, the conclusion is NOT that alpha = 0 and therefore the stock is correctly valued (since we can never “accept” a null hypothesis, we can only fail to reject). The conclusion is that we do not have enough evidence to claim that the stock is either undervalued or overvalued (which is not the same thing as saying that we have enough evidence to claim that the stock is correctly valued).

Systematic vs idiosyncratic risk

The regression R-squared coefficient tells us the percentage of the firm’s total risk (variance) that is systematic (can not be diversified away).

One minus the R-squared gives us the percentage of the firm’s risk that is idiosyncratic (can be diversified away).

The breakdown of the firm’s total risk into systematic and idiosyncratic can also be easily calculated as below:

The regression equation

\[R_{i,t} - R_{f,t} = \alpha_i + \beta_i (R_{m,t} - R_{f,t}) + \epsilon_{i,t}\]

implies

\[Var(R_{i,t} - R_{f,t}) = Var(\alpha_i) + Var(\beta_i (R_{m,t} - R_{f,t})) + Var(\epsilon_{i,t})\]

which reduces to

\[Var(R_{i,t} - R_{f,t}) = \beta_{i}^2 Var(R_{m,t} - R_{f,t}) + Var(\epsilon_{i,t})\]

Which says total risk equals systematic risk plus idiosyncratic risk.

#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#

Multi-factor models

Regress excess returns of TSLA on excess returns on the market as well as the SMB and HML factors:

\[R_{i,t} - R_{f,t} = \alpha_i + \beta_{mkt} (R_{m,t} - R_{f,t}) + \beta_{smb} R_{smb,t} + \beta_{hml} R_{hml,t} + \eta_{i,t}\]

  1. What are the alpha and the beta(s) of TSLA in this regression?
  2. What percentage of the firm’s total risk can be diversified away?
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#