class: center, middle, inverse, title-slide # Programming with Data ## Session 4: Forecasting with Linear Regressions ### Dr. Wang Jiwei ### Master of Professional Accounting --- class: inverse, center, middle <!-- Load in primary data set --> # Application: Revenue prediction --- ## The question > What factors can help us to forecast revenue of a company for budgeting, reporting, valuation, and other purposes? - Case: <a target="_blank" href="https://eng.uber.com/transforming-financial-forecasting-machine-learning/">Uber's financial forecasting with DS and ML</a> .pull-left[ .center[<img src="../../../Figures/UberForecasting.png" height="300px">] ] .pull-right[ - Other interesting readings from Uber - [Finance Computation Platform](https://eng.uber.com/ubers-finance-computation-platform/) - [Fraud Detection](https://eng.uber.com/fraud-detection/) - [Internal Audit](https://eng.uber.com/ml-internal-audit/) ] --- ## Weather data? Satellite images? - Case: <a target="_blank" href="https://skymapglobal.com/remote-sensing-applications-in-the-financial-sector/">Weather as a Commodity</a> .pull-left[ - "Hedge Funds now employ a variety of techniques to track weather fluctuation in order to get a cutting edge over competitors and to increase their profit margins." - "A legacy has been created by RS Metrics LLC., a provider of satellite imagery and quantitative analysis, ever since they forecasted Walmart’s second quarter customer traffic in 2011 using satellite images of parking lot traffic measurements." - **What other creative data might there be?** ] .pull-right[ .center[<img src="../../../Figures/qin_huaihe.png" height="400px">] ] --- ## Forecasting application - Forecast sales of a real estate company in Singapore - using financial and non-financial data: - company's own data - other companies' data - macro economic data .center[<img src="../../../Figures/CRA.jpg" height="300px">] --- class: inverse, center, middle # Linear models --- ## What is a linear model? 
- Revisit the following model $$ \hat{y}=\alpha + \beta \hat{x} + \varepsilon $$ - This simplest model tries to predict some outcome `\(\hat{y}\)` as a function of an input `\(\hat{x}\)` - `\(\hat{y}\)` in our case is a firm's revenue in a given year - `\(\hat{x}\)` could be a firm's assets in a given year or any other factors we can identify - `\(\alpha\)` and `\(\beta\)` are coefficients to be estimated (solved for) - `\(\varepsilon\)` is the error in the measurement > This is an *OLS* model -- **O**rdinary **L**east **S**quares regression --- ## Example > Let's predict UOL's revenue .pull-left[ <img src="../../../Figures/UOL.png" alt="UOL Group Limited"> <img src="../../../Figures/UOL8.png" alt="UOL Group Limited"> ] .pull-right[ - **COMPUSTAT** has data for UOL since 1989 (till 2019 for this example) - more missing data before 1994 - numbers in Millions ```r # revt: Revenue, at: Assets summary(uol[ , c("revt", "at")]) ``` ``` ## revt at ## Min. : 94.78 Min. : 1218 ## 1st Qu.: 213.05 1st Qu.: 3052 ## Median : 464.99 Median : 3520 ## Mean : 774.38 Mean : 6510 ## 3rd Qu.:1212.26 3rd Qu.: 9044 ## Max. :2397.34 Max.
:20664 ``` ] --- ## Linear models in R - To run a linear model, use [`lm()`](https://rdrr.io/r/stats/lm.html) - The first argument is a formula for your model, where tilde `~` is used in place of an equals sign - The left side is what you want to predict - The right side is inputs for prediction, separated by `+` - `y ~ x1 + x2 + x3` - The second argument is the data to use - Additional variations for the [formula](https://rdrr.io/r/stats/formula.html): - Functions transforming inputs (as vectors), such as `log()` - Fully interacting variables using asterisk/star `*` - i.e., `A*B` includes A, B, and A times B in the model - Interactions using colon `:` - i.e., `A:B` just includes A times B in the model ```r # Example: lm(revt ~ at, data = uol) ``` --- ## Example: UOL ```r mod1 <- lm(revt ~ at, data = uol) summary(mod1) ``` ``` ## ## Call: ## lm(formula = revt ~ at, data = uol) ## ## Residuals: ## Min 1Q Median 3Q Max ## -212.45 -98.13 -48.29 53.50 949.34 ## ## Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 10.101598 60.085716 0.168 0.868 ## at 0.117403 0.007031 16.698 <2e-16 *** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## Residual standard error: 216.7 on 29 degrees of freedom ## Multiple R-squared: 0.9058, Adjusted R-squared: 0.9025 ## F-statistic: 278.8 on 1 and 29 DF, p-value: < 2.2e-16 ``` > An additional $1 in assets is associated with $0.12 more in revenue --- ## What's Ordinary Least Squares? <img src="Session_4s_files/figure-html/unnamed-chunk-5-1.png" width="100%" style="display: block; margin: auto;" /> --- ## Zoom in on OLS output - **Residuals**: actual value of y minus what the model predicted ```r summary(uol$revt - mod1$fitted.values) ``` ``` ## Min. 1st Qu. Median Mean 3rd Qu. Max. ## -212.45 -98.13 -48.29 0.00 53.50 949.34 ``` - **Estimate**: estimated coefficients that minimize the sum of squared errors/residuals - **Std.
Error**: the Residual Standard Error (see below) divided by the square root of the sum of squared deviations of that particular x variable from its mean. - **t value**: Estimate divided by Std. Error - **Pr(>|t|)**: the probability of seeing an estimate at least this extreme if the true coefficient were 0 (H0), a.k.a. the `p-value` - **Residual standard error**: a tweaked standard deviation of the residual/error ```r #Residual Standard error (Like Standard Deviation) k = length(mod1$coefficients) - 1 #number of x excluding intercept n = length(mod1$residuals) #number of data SSE = sum(mod1$residuals**2) #sum of squared error sqrt(SSE/(n - (1 + k))) #Residual Standard Error ``` ``` ## [1] 216.7404 ``` --- ## Zoom in on OLS output - **Multiple R-squared**: how much variance of Y is explained by X ```r #Multiple R-Squared SSY = sum((uol$revt - mean(uol$revt))**2) # total sum of squares of Y (SSY - SSE)/SSY ``` ``` ## [1] 0.905791 ``` - **Adjusted R-Squared**: R-squared controlled for number of X and data ```r #Adjusted R-Squared 1-(SSE/SSY)*(n-1)/(n-(k+1)) ``` ``` ## [1] 0.9025424 ``` - **F-Statistic**: a “global” test that checks if at least one coefficient is nonzero ```r #F-Statistic #Ho: All coefficients are zero #Ha: At least one coefficient is nonzero ((SSY-SSE)/k) / (SSE/(n - (k + 1))) ``` ``` ## [1] 278.8262 ``` --- ## Example: UOL - This model wasn't so interesting... - Bigger firms have more revenue -- this is a given - How about... revenue *growth*?
- And *change* in assets - i.e., Asset growth `$$\Delta x_t = \frac{x_t}{x_{t-1}} - 1$$` --- ## Calculating changes in R - The easiest way is using [`package:tidyverse`](https://tidyverse.tidyverse.org)'s [`package:dplyr`](https://dplyr.tidyverse.org) - [`lag()`](https://dplyr.tidyverse.org/reference/lead-lag.html) function along with [`mutate()`](https://dplyr.tidyverse.org/reference/mutate.html) - [`package:data.table`](https://r-datatable.com) is also popular but I prefer [`package:dplyr`](https://dplyr.tidyverse.org) - In base R, you can also construct the lagged vector manually ```r # tidyverse with pipe %>% uol <- uol %>% mutate(revt_growth1 = revt / lag(revt, order_by = fyear) - 1) # which is equivalent to uol <- mutate(uol, revt_growth2 = revt / lag(revt, order_by = fyear) - 1) # Base R way, [-n] to remove the nth element from a vector uol$revt_growth3 = uol$revt / c(NA, uol$revt[-length(uol$revt)]) - 1 identical(uol$revt_growth1, uol$revt_growth3) ``` ``` ## [1] TRUE ``` ```r # magrittr %<>% to combine <- and %>% library(magrittr) uol %<>% mutate(revt_growth4 = revt / lag(revt) - 1) identical(uol$revt_growth1, uol$revt_growth4) ``` ``` ## [1] TRUE ``` --- ## A note on lag() and lead() - <a target="_blank" href="https://dplyr.tidyverse.org/reference/lead-lag.html">`lag() or lead()`</a> finds the "previous" or "next" values in a vector. - Very useful for comparing values ahead of or behind the current values.
- The dataset must be sorted by the key (eg, time for time series data) ```r # Use order_by if data not already ordered dff <- data.frame(year = 2001:2003, value = (1:3) ^ 2) scrambled <- dff[sample(nrow(dff)), ] wrong <- mutate(scrambled, prev = lag(value)) arrange(wrong, year) ``` ``` ## year value prev ## 1 2001 1 9 ## 2 2002 4 NA ## 3 2003 9 4 ``` ```r right <- mutate(scrambled, prev = lag(value, order_by = year)) arrange(right, year) ``` ``` ## year value prev ## 1 2001 1 NA ## 2 2002 4 1 ## 3 2003 9 4 ``` --- ## A note on mutate() - <a target="_blank" href="https://dplyr.tidyverse.org/reference/mutate.html">`mutate()`</a> adds variables to an existing data frame - Also [mutate multiple columns](https://dplyr.tidyverse.org/reference/mutate_all.html) - `mutate_all()` applies a transformation to all values in a data frame and adds these to the data frame - `mutate_at()` does this for a set of specified variables - `mutate_if()` transforms all variables matching a condition - Such as `is.numeric` - Mutate can be very powerful when making more complex variables - For instance: Calculating growth within company in a multi-company data frame (cross-sectional with time series data, ie, panel data) - Do Exercise 1 in the <a target="_blank" href="Session_4s_Exercise.html#Exercise_1:_Using_mutate()"> R Practice </a> --- ## Example: UOL with changes ```r # Make the other needed change uol <- uol %>% mutate(at_growth = at / lag(at) - 1) # From dplyr # Rename our revenue growth variable uol <- rename(uol, revt_growth = revt_growth1) # From dplyr # Run the OLS model mod2 <- lm(revt_growth ~ at_growth, data = uol) summary(mod2) ``` ``` ## ## Call: ## lm(formula = revt_growth ~ at_growth, data = uol) ## ## Residuals: ## Min 1Q Median 3Q Max ## -0.56897 -0.12016 -0.01099 0.15012 0.42991 ## ## Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 0.08443 0.05215 1.619 0.1167 ## at_growth 0.55576 0.26591 2.090 0.0458 * ## --- ## Signif. 
codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## Residual standard error: 0.237 on 28 degrees of freedom ## (1 observation deleted due to missingness) ## Multiple R-squared: 0.135, Adjusted R-squared: 0.1041 ## F-statistic: 4.368 on 1 and 28 DF, p-value: 0.04582 ``` --- ## Example: UOL with changes - `\(\Delta\)`Assets doesn't capture `\(\Delta\)`Revenue so well - Perhaps change in total assets is a bad choice? - Or perhaps we need to expand our model? --- ## Scaling up! $$ \hat{y}=\alpha + \beta_1 \hat{x}_1 + \beta_2 \hat{x}_2 + \ldots + \varepsilon $$ - OLS doesn't need to be restricted to just 1 input! - Not unlimited though (yet) - Number of inputs must be less than the number of observations minus 1 - Each `\(\hat{x}_i\)` is an input in our model - Each `\(\beta_i\)` is something we will solve for - `\(\hat{y}\)`, `\(\alpha\)`, and `\(\varepsilon\)` are the same as before > We have... 823 variables from Compustat Global alone! - Let's just add them all? - This is a very machine-learning mindset - We only have 31 observations... - 31 << 823... > Now what? --- ## Scaling up our model > Building a model requires careful thought! - What makes sense to add to our model? > This is where having accounting and business knowledge comes in! - Some potential sources to consider: - Direct accounting relations - Financing? Capex? R&D? Other expenditures? - Business management and corporate structure - Some management characteristics may matter - Corporate governance may also matter - Economics - Macro econ: trade, economic growth, population, weather - Micro econ: Other related firms like suppliers and customers - Legal factors - Any changes in law? Favorable or not? - Market factors - Interest rates, cost of capital, foreign exchange? > Any other factors? 
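Recall the point above that 31 observations cannot support 823 inputs. A minimal sketch with made-up data (all names and numbers hypothetical) shows what happens when a model has more coefficients than observations:

```r
# Sketch with made-up data: more coefficients than observations.
# The design matrix cannot have full column rank, so OLS has no unique
# solution; lm() reports NA for the coefficients it cannot pin down --
# a sign the model is over-parameterized.
set.seed(42)
toy <- as.data.frame(matrix(rnorm(5 * 8), nrow = 5))
names(toy) <- c("y", paste0("x", 1:7))   # 5 observations, 7 candidate inputs
coef(lm(y ~ ., data = toy))              # x5, x6, x7 come back NA
```

With an intercept plus 7 slopes but only 5 observations, `lm()` estimates the first 4 slopes and returns `NA` for the rest, which is why careful variable selection matters.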
--- ## Scaling up our model - One possible improvement: ```r # lct: short term liabilities, che: cash and equivalents, ebit: EBIT # list(name = ~f(.)) for repeated functions/formula # broom::tidy(): to report a more concise summary using the broom package uol <- uol %>% mutate_at(vars(lct, che, ebit), list(growth = ~(. / lag(.) - 1))) mod3 <- lm(revt_growth ~ lct_growth + che_growth + ebit_growth, data = uol) broom::tidy(mod3) ``` ``` ## # A tibble: 4 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 (Intercept) 0.0685 0.0457 1.50 0.146 ## 2 lct_growth 0.237 0.0699 3.39 0.00222 ## 3 che_growth -0.114 0.0882 -1.29 0.209 ## 4 ebit_growth 0.0386 0.0213 1.81 0.0812 ``` ```r broom::glance(mod3) ``` ``` ## # A tibble: 1 x 12 ## r.squared adj.r.squared sigma statistic p.value df logLik AIC BIC ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 0.338 0.261 0.215 4.42 0.0122 3 5.67 -1.34 5.66 ## # ... with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int> ``` --- class: inverse, center, middle # Formalizing testing --- ## Why formalize? - Our current approach has been ad hoc - What is our goal? - How will we know if we have achieved it? - Formalization provides more rigor .center[<img src = "../../../Figures/whyformalize.jpg">] --- ## Scientific method 1. Question - What are we trying to determine? - Fundamentally, the question is asked/answered to solve your business problems 2. Hypothesis - What do we think will happen? Make a statement - "If X, then Y" - e.g., "If capital expenditures increase, revenue will increase." - A good hypothesis is based on information in prior research, i.e., a hypothesis typically follows a thorough *literature review* - Null hypothesis, a.k.a. `\(H_0\)` - Typically: The statement *doesn't* work - Alternative hypothesis, a.k.a. `\(H_1\)` or `\(H_A\)` - The statement *does* work (and perhaps how it works) 3. Research design - What exactly will we test? How to measure X and Y?
- Formalize a statistical model 4. Testing - Test the model 5. Analysis - Did it work? --- ## Test statistics - Testing a coefficient: - Use a `\(t\)` test (fewer assumptions about normality, unknown population s.d., more commonly used) or `\(z\)` test (known population s.d.) - Testing a model as a whole - `\(F\)`-test, check *adjusted* R squared as well - Testing across models - Chi squared ($\chi^2$) test - Vuong test (comparing `\(R^2\)`) - <a href="https://en.wikipedia.org/wiki/Akaike_information_criterion">Akaike Information Criterion</a> (AIC) (Comparing MLEs, lower is better) > All of these have p-values, except for AIC --- class: inverse, center, middle # Revisiting the previous problem --- ## Formalizing our last test 1. Question - `\(~\)` 2. Hypotheses - `\(H_0\)`: - `\(H_1\)`: 3. Research design - Individual variables: - Model: 4. Testing: - `\(~\)` --- ## Formalizing our last test 1. Question - Can we predict changes in revenue using a firm's accounting information? 2. Hypotheses - `\(H_0\)`: Our variables do not predict UOL's change in revenue - `\(H_1\)`: Our variables help to predict UOL's change in revenue 3. Research design - Individual variables - Growth in current liabilities (+) - Growth in cash and cash equivalent (+) - Growth in EBIT (+) - Model: OLS 4. Testing: - t-test for coefficients and F-test for model --- ## Is this model better? ```r anova(mod2, mod3, test = "Chisq") ``` ``` ## Analysis of Variance Table ## ## Model 1: revt_growth ~ at_growth ## Model 2: revt_growth ~ lct_growth + che_growth + ebit_growth ## Res.Df RSS Df Sum of Sq Pr(>Chi) ## 1 28 1.5721 ## 2 26 1.2035 2 0.36861 0.01865 * ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ``` > A bit better at `\(p<0.05\)` - This means our model with change in current liabilities, cash, and EBIT appears to be better than the model with change in assets. > Note: the p-value gives the probability that the two models explain the same amount of variance.
If the first (smaller) model actually fit better, the Sum of Sq would be negative; comparing the models' RSS directly tells the same story. --- class: inverse, center, middle # Panel data --- ## Expanding our methodology - Why should we limit ourselves to 1 firm's data? - The nature of data analysis is such: > Adding more data usually helps improve predictions - Assuming: - The data isn't of low quality (too noisy) - The data is relevant - Any differences can be reasonably controlled for --- ## Expanding our question - Previously: Can we predict revenue using a firm's accounting information? - This is simultaneous, and thus is not forecasting - Now: Can we predict *future* revenue using a firm's accounting information? - By trying to predict ahead, we are now in the realm of forecasting - What do we need to change? - `\(\hat{y}\)` will need to be 1 year in the future --- ## First things first - When using a lot of data, it is important to make sure the data is clean - In our case, we may want to remove any very small firms ```r # Keep firms with at least $1M (local currency) in assets and positive revenue # df contains all real estate companies excluding North America df_clean <- filter(df, df$at > 1, df$revt > 0) # We cleaned out 2,177 observations! print(c(nrow(df), nrow(df_clean))) ``` ``` ## [1] 34156 31979 ``` ```r # Another useful cleaning function: # Replaces NaN, Inf, and -Inf with NA for all numeric variables! df_clean <- df_clean %>% mutate_if(is.numeric, list(~replace(., !is.finite(.), NA))) ``` - [`is.finite()`](https://rdrr.io/r/base/is.finite.html) returns a vector of the same length as x, whose jth element is TRUE if x[j] is finite (i.e., it is not one of the values NA, NaN, Inf or -Inf) and FALSE otherwise.
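To see what that `mutate_if()` + `is.finite()` cleaning step does, here is the same pattern applied to a tiny made-up data frame (column names and values hypothetical):

```r
# Toy sketch of the cleaning step above: mutate_if() with is.finite()
# replaces every NaN, Inf, and -Inf in the numeric columns with NA.
library(dplyr)
toy <- data.frame(at = c(5, NaN, Inf, 2), revt = c(1, 2, -Inf, 4))
toy_clean <- toy %>%
  mutate_if(is.numeric, list(~replace(., !is.finite(.), NA)))
toy_clean  # rows 2 and 3 now hold NA where the non-finite values were
```

Regular `NA` values also fail `is.finite()`, so they stay `NA` -- the transformation is safe to run even on already-clean columns.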
--- ## Looking back at the prior models ```r uol <- uol %>% mutate(revt_lead = lead(revt)) # From dplyr forecast1 <- lm(revt_lead ~ lct + che + ebit, data = uol) library(broom) # To display regression outputs in a tidy fashion tidy(forecast1) # present regression output ``` ``` ## # A tibble: 4 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 (Intercept) 64.0 127. 0.505 0.618 ## 2 lct 0.392 0.237 1.65 0.111 ## 3 che 0.141 0.330 0.425 0.674 ## 4 ebit 2.03 1.04 1.96 0.0613 ``` ```r glance(forecast1) # present regression statistics ``` ``` ## # A tibble: 1 x 12 ## r.squared adj.r.squared sigma statistic p.value df logLik AIC BIC ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 0.746 0.717 369. 25.5 0.0000000663 3 -218. 446. 453. ## # ... with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int> ``` > The model is significant but not the coefficients. We can do better. --- ## Expanding the prior model ```r forecast2 <- lm(revt_lead ~ revt + act + che + lct + dp + ebit , data = uol) tidy(forecast2) ``` ``` ## # A tibble: 7 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 (Intercept) 75.2 97.1 0.775 0.446 ## 2 revt 1.63 0.318 5.11 0.0000356 ## 3 act 0.212 0.168 1.26 0.219 ## 4 che 0.264 0.290 0.912 0.371 ## 5 lct -0.238 0.190 -1.25 0.223 ## 6 dp -1.45 4.42 -0.328 0.746 ## 7 ebit -3.28 1.12 -2.91 0.00780 ``` - Revenue (revt) to capture stickiness of revenue - Current assets (act) & Cash (che) to capture asset base - Current liabilities (lct) to capture payments due - Depreciation (dp) to capture decrease in real estate asset values - EBIT to capture operational performance --- ## Expanding the prior model ```r glance(forecast2) ``` ``` ## # A tibble: 1 x 12 ## r.squared adj.r.squared sigma statistic p.value df logLik AIC BIC ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 0.923 0.903 216. 46.1 1.11e-11 6 -200. 416. 427. ## # ...
with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int> ``` ```r anova(forecast1, forecast2, test = "Chisq") ``` ``` ## Analysis of Variance Table ## ## Model 1: revt_lead ~ lct + che + ebit ## Model 2: revt_lead ~ revt + act + che + lct + dp + ebit ## Res.Df RSS Df Sum of Sq Pr(>Chi) ## 1 26 3548955 ## 2 23 1074135 3 2474820 1.84e-11 *** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ``` > This is better (Adj. `\(R^2\)`, `\(\chi^2\)`, AIC). --- ## Panel data - Panel data refers to data with the following characteristics: - There is a time dimension - There is at least 1 other dimension to the data (firm, country, etc.) - Special cases: - A panel where all dimensions have the same number of observations is called *balanced* - Otherwise we call it *unbalanced* - A panel missing the time dimension is *cross-sectional* - A panel missing the other dimension(s) is a *time series* - Format: - Long: Indexed by all dimensions (e.g., country-year) - Wide: Indexed only by other dimensions (e.g., country only) --- ## tidyr makes reshaping easy - Depending on the data source, you may need to reshape the data from wide to long (or long to wide). - The <a target="_blank" href="https://tidyr.tidyverse.org">`package:tidyr`</a> (loaded as part of the tidyverse) provides the `gather()` function to do so.
<img src="../../../Figures/gather.png" height="400px"> --- ## Wide versus long data ```r university_wide # randomly generated numbers ``` ``` ## university rand.2016 rand.2017 rand.2018 ## 1 SMU 44 51 92 ## 2 NTU 44 51 92 ## 3 NUS 44 51 92 ``` ```r # convert wide to long dataset library(tidyr); library(dplyr) university_long <- university_wide %>% gather(year, rand, rand.2016:rand.2018) %>% mutate(year = as.numeric(gsub("rand.", "", year))) %>% arrange(desc(year)) university_long ``` ``` ## university year rand ## 1 SMU 2018 92 ## 2 NTU 2018 92 ## 3 NUS 2018 92 ## 4 SMU 2017 51 ## 5 NTU 2017 51 ## 6 NUS 2017 51 ## 7 SMU 2016 44 ## 8 NTU 2016 44 ## 9 NUS 2016 44 ``` <!-- Long-form datasets are often required for advanced statistical analysis and graphing. For example, if you wanted to run a regression with year and/or country fixed effects, you would have to structure your data in long form. Furthermore, many graphing packages, including ggplot, rely on your data being in long form. see https://sejdemyr.github.io/r-tutorials/basics/wide-and-long/ --> <!-- # gather() takes three arguments. The first two specify a key-value pair: year is the key and rand the value. The third argument specifies which variables in the original data to convert into the key-value combination (in this case, all variables from rand.2016 to rand.2018). --> --- ## All SG real estate companies ```r # group_by - without it, lead() will pull from the subsequent firm!
# ungroup() tells R that we finished grouping df_clean <- df_clean %>% group_by(isin) %>% mutate(revt_lead = lead(revt)) %>% ungroup() ``` - Do Exercises 2 and 3 of the <a target="_blank" href="Session_4s_Exercise.html#Exercise_2:_Using_mutate()_and_lead()">R Practice</a> .center[<img src="../../../Figures/Grouping.png" height="300px">] --- ## All SG real estate companies ```r forecast3 <- lm(revt_lead ~ revt + act + che + lct + dp + ebit, data = df_clean[df_clean$fic == "SGP", ]) tidy(forecast3) ``` ``` ## # A tibble: 7 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 (Intercept) 21.5 11.6 1.86 6.39e- 2 ## 2 revt 0.537 0.0579 9.26 1.07e-18 ## 3 act 0.00999 0.0405 0.247 8.05e- 1 ## 4 che 0.480 0.118 4.07 5.59e- 5 ## 5 lct 0.218 0.0612 3.56 4.20e- 4 ## 6 dp 4.38 0.960 4.56 6.67e- 6 ## 7 ebit -1.13 0.238 -4.72 3.17e- 6 ``` --- ## All SG real estate companies ```r glance(forecast3) ``` ``` ## # A tibble: 1 x 12 ## r.squared adj.r.squared sigma statistic p.value df logLik AIC BIC ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 0.836 0.833 206. 352. 1.30e-159 6 -2850. 5717. 5749. ## # ... with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int> ``` > Lower adjusted `\(R^2\)` -- This is worse? Why? - Note: `\(\chi^2\)` can only be used for models on the same data - Same for AIC --- ## Worldwide real estate companies ```r forecast4 <- lm(revt_lead ~ revt + act + che + lct + dp + ebit , data = df_clean) tidy(forecast4) ``` ``` ## # A tibble: 7 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 (Intercept) 220. 579. 0.379 7.04e- 1 ## 2 revt 1.05 0.00634 165. 
0 ## 3 act -0.0234 0.00539 -4.33 1.50e- 5 ## 4 che 0.0203 0.0269 0.756 4.49e- 1 ## 5 lct 0.0553 0.00866 6.39 1.82e-10 ## 6 dp 0.172 0.186 0.927 3.54e- 1 ## 7 ebit 0.126 0.0652 1.94 5.29e- 2 ``` --- ## Worldwide real estate companies ```r glance(forecast4) ``` ``` ## # A tibble: 1 x 12 ## r.squared adj.r.squared sigma statistic p.value df logLik AIC BIC ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 0.947 0.947 40818. 15138. 0 6 -61343. 122702. 122754. ## # ... with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int> ``` > Higher adjusted `\(R^2\)` -- better! - Note: `\(\chi^2\)` can only be used for models on the same data - Same for AIC --- ## Model accuracy > Why is one model better while another is worse? - Ranking: 1. Worldwide real estate model 2. UOL model 3. Singapore real estate model -- > Different sources of noise, amounts of data --- class: inverse, center, middle # Dealing with noise --- ## Noise > Statistical noise is random error in the data - Many sources of noise: - Other factors not included in the model - Error in measurement - Accounting measurement! - Unexpected events / shocks > Noise is OK, but the more we remove, the better!
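One large, removable source of noise in multi-firm data is a firm-specific level in the outcome. A sketch with made-up numbers (two hypothetical firms, firm B shifted up by exactly 10) shows how per-firm intercepts -- dummy variables created by `factor()` -- absorb that level:

```r
# Sketch with made-up data: a firm-specific level in y is "noise" that
# per-firm intercepts (dummies from factor()) soak up completely here.
toy <- data.frame(
  firm = rep(c("A", "B"), each = 3),
  x    = rep(1:3, 2),
  y    = c(2, 3, 4, 12, 13, 14)  # firm B sits exactly 10 higher
)
fe <- lm(y ~ x + factor(firm), data = toy)
coef(fe)
# (Intercept)             x  factor(firm)B
#           1             1             10
```

With the firm dummy included, the slope on `x` is estimated within firms; without it, the 10-unit gap between firms would contaminate the estimate.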
--- ## Removing noise: Singapore model - Different companies may behave slightly differently (but time-invariant) - Control for this using a [*Fixed Effect*](https://www.econometrics-with-r.org/10-3-fixed-effects-regression.html) of companies - Note: ISIN uniquely identifies companies - factor(isin): (n-1) dummy variables - FE equivalent to unique intercept for each company ```r forecast3.1 <- lm(revt_lead ~ revt + act + che + lct + dp + ebit + factor(isin), data = df_clean[df_clean$fic == "SGP", ]) # n=7 to prevent outputting every fixed effect print(tidy(forecast3.1), n = 7) ``` ``` ## # A tibble: 30 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 (Intercept) -0.00946 36.8 -0.000257 1.00 ## 2 revt 0.403 0.0712 5.66 0.0000000293 ## 3 act 0.0486 0.0453 1.07 0.284 ## 4 che 0.276 0.139 1.99 0.0472 ## 5 lct 0.239 0.0656 3.65 0.000300 ## 6 dp 4.86 1.05 4.63 0.00000487 ## 7 ebit -1.07 0.269 -3.98 0.0000825 ## # ... with 23 more rows ``` --- ## Removing noise: Singapore model ```r glance(forecast3.1) ``` ``` ## # A tibble: 1 x 12 ## r.squared adj.r.squared sigma statistic p.value df logLik AIC BIC ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 0.852 0.841 201. 77.9 8.39e-144 29 -2828. 5719. 5844. ## # ... with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int> ``` ```r anova(forecast3, forecast3.1, test = "Chisq") ``` ``` ## Analysis of Variance Table ## ## Model 1: revt_lead ~ revt + act + che + lct + dp + ebit ## Model 2: revt_lead ~ revt + act + che + lct + dp + ebit + factor(isin) ## Res.Df RSS Df Sum of Sq Pr(>Chi) ## 1 416 17663454 ## 2 393 15915304 23 1748150 0.006616 ** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 
0.1 ' ' 1 ``` > Model 3.1 is better --- ## Another way to do fixed effects - The library [`package:lfe`](https://github.com/sgaure/lfe) has `felm()`: **f**ixed **e**ffects **l**inear **m**odel - Better for 2 or more factors with thousands of levels, otherwise `lm` should be better - `lfe` is designed to produce the same results as `lm` would if run with the full set of dummies - We will see a future example which cannot be handled by `lm()` ```r library(lfe) forecast3.2 <- felm(revt_lead ~ revt + act + che + lct + dp + ebit | factor(isin), data = df_clean[df_clean$fic == "SGP", ]) tidy(forecast3.2) ``` ``` ## # A tibble: 6 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 revt 0.403 0.0712 5.66 0.0000000293 ## 2 act 0.0486 0.0453 1.07 0.284 ## 3 che 0.276 0.139 1.99 0.0472 ## 4 lct 0.239 0.0656 3.65 0.000300 ## 5 dp 4.86 1.05 4.63 0.00000487 ## 6 ebit -1.07 0.269 -3.98 0.0000825 ``` --- ## A faster way to do fixed effects - The library [`package:fixest`](https://lrberge.github.io/fixest/) has [`feols()`](https://lrberge.github.io/fixest/reference/feols.html): **f**ixed **e**ffects **ols** - similar to `lfe` but claims to be [much faster](https://lrberge.github.io/fixest/) - `lfe` and `fixest` produce the same results for OLS ```r library(fixest) forecast3.3 <- feols(revt_lead ~ revt + act + che + lct + dp + ebit | factor(isin), data = df_clean[df_clean$fic == "SGP", ]) summary(forecast3.3) ``` ``` ## OLS estimation, Dep. Var.: revt_lead ## Observations: 423 ## Fixed-effects: factor(isin): 24 ## Standard-errors: Clustered (factor(isin)) ## Estimate Std. Error t value Pr(>|t|) ## revt 0.403058 0.189383 2.128300 0.044248 * ## act 0.048569 0.088568 0.548387 0.588710 ## che 0.276009 0.174173 1.584700 0.126693 ## lct 0.239423 0.162586 1.472600 0.154416 ## dp 4.857000 1.494900 3.248900 0.003539 ** ## ebit -1.070700 0.662245 -1.616800 0.119564 ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.'
0.1 ' ' 1 ## RMSE: 193.1 Adj. R2: 0.840904 ## Within R2: 0.771772 ``` --- ## Why exactly would we use FE? .pull-left[ - Fixed effects are used when the average of `\(\hat{y}\)` varies by some group in our data - In our problem, the average revenue of each firm is different, see histogram below - Fixed effects absorb this difference <img src="Session_4s_files/figure-html/unnamed-chunk-31-1.png" width="100%" style="display: block; margin: auto;" /> ] .pull-right[ - Further reading: - Introductory Econometrics by Jeffrey M. Wooldridge .center[<img src="../../../Figures/Econometrics_text.jpg" height="350px">] ] --- ## What else can we do? > What else could we do to improve our prediction model? --- class: inverse, center, middle # Macro data --- ## Macro data sources - For Singapore: <a href="https://data.gov.sg">Data.gov.sg</a> - Covers: Economy, education, environment, finance, health, infrastructure, society, technology, transport - For real estate in Singapore: URA's REALIS system - Access through the library - [WRDS](https://wrds-www.wharton.upenn.edu/) has some as well - For US: <a href="data.gov">data.gov</a>, as well as many agency websites - Like <a href="https://www.bls.gov/data/">BLS</a> or the <a href="https://fred.stlouisfed.org/">Federal Reserve</a> .center[<img src="../../../Figures/WRDS.png" height="50px"> <img src="../../../Figures/dataSG.svg" height="50px"> <img src="../../../Figures/dataUS.png" height="50px">] --- ## Loading macro data - Singapore business expectations data (from <a target = "_blank" href="https://data.gov.sg/dataset/business-expectations-for-the-services-sector?view_id=b412d801-9097-4e62-acae-899b9db28ca8&resource_id=4779dc47-673a-42a3-896f-7bfc90315c09">data.gov.sg</a>) ```r expectations %>% arrange(level_2, level_3, desc(year)) %>% # sort the data select(year, quarter, level_2, level_3, value) %>% datatable(options = list(pageLength = 3), rownames=FALSE) ```
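Before transforming the real expectations data, the group-average pattern can be sketched on a made-up quarterly table (years and values hypothetical): group by year, compute the within-group mean, and keep one row per group.

```r
# Toy sketch of the group-average pattern: average a quarterly series
# within each year, then keep one row per year.
library(dplyr)
toy <- data.frame(year  = c(1995, 1995, 1996, 1996),
                  value = c(20, 24, 10, 24))
toy_avg <- toy %>%
  group_by(year) %>%                         # one group per year
  mutate(avg = mean(value, na.rm = TRUE)) %>%  # yearly average
  slice(1)                                   # first row of each group
toy_avg  # avg is 22 for 1995 and 17 for 1996
```

`summarize(avg = mean(value))` would give the same yearly averages; the `mutate()` + `slice(1)` form additionally keeps the other columns of the retained row.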
--- ## Transforming macro data ```r # extract out F&I only, calculate annual average value expectations_avg <- expectations %>% filter(level_2 == "Financial & Insurance") %>% # Keep F&I sector group_by(year) %>% # Group data by year mutate(fin_sentiment=mean(value, na.rm=TRUE)) %>% # Calculate yearly average slice(1) # Take only 1 row per group head(expectations_avg) ``` ``` ## # A tibble: 6 x 7 ## # Groups: year [6] ## quarter level_1 level_2 level_3 value year fin_sentiment ## <dbl> <chr> <chr> <chr> <dbl> <dbl> <dbl> ## 1 1 Total Services Sector Financial ~ Banks & F~ 22 1995 23.8 ## 2 1 Total Services Sector Financial ~ Banks & F~ 23 1996 17 ## 3 1 Total Services Sector Financial ~ Banks & F~ 24 1997 -0.25 ## 4 1 Total Services Sector Financial ~ Banks & F~ -30 1998 -38 ## 5 1 Total Services Sector Financial ~ Banks & F~ -24 1999 15.8 ## 6 1 Total Services Sector Financial ~ Banks & F~ 64 2000 18.6 ``` - At this point, we can merge with our accounting data --- ## dplyr makes merging easy - For merging, use <a target="_blank" href="https://www.guru99.com/r-dplyr-tutorial.html">`package:dplyr`</a>'s `*_join()` commands - `left_join()` and `right_join()` for merging a dataset into another - `inner_join()` for keeping only matched observations - `full_join()` for keeping all observations from both datasets, matched or not .pull-left[ .center[<img src="../../../Figures/left_join.png" height="300px">] ] .pull-right[ .center[<img src="../../../Figures/right_join.png" height="300px">] ] --- ## dplyr makes merging easy - `inner_join()` vs.
`full_join()` .pull-left[ .center[<img src="../../../Figures/inner_join.png" height="300px">] ] .pull-right[ .center[<img src="../../../Figures/full_join.png" height="300px">] ] --- ## Merging example > Merge the finance sentiment data into our accounting data ```r # subset out our data, since our macro data is Singapore-specific df_SG <- df_clean %>% filter(fic == "SGP") # Create year in df_SG (date is given by datadate as YYYYMMDD) df_SG$year <- round(df_SG$datadate / 10000, digits = 0) # Combine datasets # Notice how it automatically figures out to join by "year" df_SG_macro <- left_join(df_SG, expectations_avg[ , c("year", "fin_sentiment")]) ``` ``` ## Joining, by = "year" ``` --- class: inverse, center, middle # Predicting with macro data --- ## Building in macro data - First try: Just add it in ```r macro1 <- lm(revt_lead ~ revt + act + che + lct + dp + ebit + fin_sentiment, data = df_SG_macro) tidy(macro1) ``` ``` ## # A tibble: 8 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 (Intercept) 19.1 13.8 1.39 1.66e- 1 ## 2 revt 0.532 0.0599 8.88 2.47e-17 ## 3 act 0.0119 0.0421 0.283 7.78e- 1 ## 4 che 0.483 0.124 3.89 1.16e- 4 ## 5 lct 0.216 0.0635 3.41 7.19e- 4 ## 6 dp 4.42 0.992 4.46 1.08e- 5 ## 7 ebit -1.12 0.247 -4.55 7.10e- 6 ## 8 fin_sentiment 0.302 0.561 0.538 5.91e- 1 ``` > It isn't significant. Why is this?
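
---

## Checking variable scales

One quick diagnostic when a coefficient comes out insignificant is to compare the spread of each regressor. A sketch with simulated data (the real `df_SG_macro` is not reproduced here, so every number below is hypothetical):

```r
# Simulated stand-in: dollar-scale revenue vs a bounded sentiment index
set.seed(42)
df_toy <- data.frame(
  revt          = rlnorm(200, meanlog = 6),  # dollar-scale, wide spread
  fin_sentiment = runif(200, -38, 44.65)     # narrow, bounded index
)
sapply(df_toy, sd)  # spreads differ by an order of magnitude or more
```

When one regressor varies far less than the outcome and the other regressors, its estimated effect is easily drowned out unless we rescale it.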
--- ## Scale matters - All of our firm data is on the same scale as revenue: dollars within a given firm - But `fin_sentiment` is on a much smaller, firm-invariant scale (ranging from -38 to 44.65) - one sentiment value corresponds to many revenue points, as the left chart below shows - Need to scale (standardize or normalize) this to fit the problem - Do Exercise 4 of the <a target="_blank" href="Session_4s_Exercise.html#Exercise_4:_A_simple_scatterplot_with_ggplot2">R Practice</a> on visualization using ggplot2 .pull-left[ ```r df_SG_macro %>% ggplot(aes(y = revt_lead, x = fin_sentiment)) + geom_point() ``` <img src="Session_4s_files/figure-html/unnamed-chunk-39-1.png" width="100%" style="display: block; margin: auto;" /> ] .pull-right[ ```r df_SG_macro %>% ggplot(aes(y = revt_lead, x = scale(fin_sentiment) * revt)) + geom_point() ``` <img src="Session_4s_files/figure-html/unnamed-chunk-40-1.png" width="100%" style="display: block; margin: auto;" /> ] --- ## Feature scaling - There are various ways to <a target="_blank" href="https://en.wikipedia.org/wiki/Feature_scaling">scale variables/features</a>. In general, one way is to scale to a standard normal distribution ("standardization") and the other is to scale to the range [0, 1] ("normalization") - Standardization (or <a target="_blank" href="https://www.codecademy.com/articles/normalization">Z-score normalization</a>): features are rescaled so that they have the properties of a standard normal distribution with zero mean ($\mu = 0$) and unit standard deviation ($\sigma = 1$). - Standard scores (also called <a target="_blank" href="https://en.wikipedia.org/wiki/Standard_score">z scores</a>) are calculated as follows: `$$z = \frac{x - \mu}{\sigma}$$` - A z score measures how many S.D. a value lies below or above the variable's mean, so the unit of a z score is one S.D.
of the variable --- ## The normal distribution <img src="../../../Figures/The_Normal_Distribution.svg" height="500px"> --- ## The scale() function in Base R - [scale()](https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/scale) function centers/scales the columns of a numeric matrix. - [package:standardize](https://cran.r-project.org/web/packages/standardize/vignettes/using-standardize.html) is another option. ```r # Scale creates z-scores with 0 mean and 1 sd df_SG_macro$fin_sent_scaled <- scale(df_SG_macro$fin_sentiment) summary(df_SG_macro[ , c("fin_sentiment", "fin_sent_scaled")]) ``` ``` ## fin_sentiment fin_sent_scaled.V1 ## Min. :-38.0000 Min. :-2.61441 ## 1st Qu.: -0.4667 1st Qu.:-0.57333 ## Median : 12.4500 Median : 0.12909 ## Mean : 10.0762 Mean : 0.00000 ## 3rd Qu.: 17.0000 3rd Qu.: 0.37652 ## Max. : 44.6500 Max. : 1.88014 ## NA's :68 NA's :68 ``` .pull-left[ <img src="Session_4s_files/figure-html/unnamed-chunk-42-1.png" width="100%" style="display: block; margin: auto;" /> ] .pull-right[ <img src="Session_4s_files/figure-html/unnamed-chunk-43-1.png" width="100%" style="display: block; margin: auto;" /> ] --- ## Scaled macro data - z-score normalization and scale by revenue ```r # Scale creates z-scores df_SG_macro$fin_sent_scaled <- scale(df_SG_macro$fin_sentiment) macro3 <- lm(revt_lead ~ revt + act + che + lct + dp + ebit + fin_sent_scaled:revt, data=df_SG_macro) # fin_sent_scaled:revt = fin_sent_scaled x revt tidy(macro3) ``` ``` ## # A tibble: 8 x 5 ## term estimate std.error statistic p.value ## <chr> <dbl> <dbl> <dbl> <dbl> ## 1 (Intercept) 21.8 12.0 1.81 7.09e- 2 ## 2 revt 0.533 0.0593 8.99 1.04e-17 ## 3 act 0.0220 0.0417 0.527 5.98e- 1 ## 4 che 0.419 0.125 3.35 8.74e- 4 ## 5 lct 0.227 0.0628 3.62 3.39e- 4 ## 6 dp 3.89 0.999 3.90 1.14e- 4 ## 7 ebit -0.949 0.252 -3.77 1.86e- 4 ## 8 revt:fin_sent_scaled 0.0907 0.0315 2.88 4.25e- 3 ``` --- ## Model comparisons ```r baseline <- lm(revt_lead ~ revt + act + che + lct + dp + ebit, data = 
df_SG_macro[!is.na(df_SG_macro$fin_sentiment), ]) glance(baseline) ``` ``` ## # A tibble: 1 x 12 ## r.squared adj.r.squared sigma statistic p.value df logLik AIC BIC ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 0.835 0.833 211. 333. 6.69e-151 6 -2712. 5441. 5473. ## # ... with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int> ``` ```r glance(macro3) ``` ``` ## # A tibble: 1 x 12 ## r.squared adj.r.squared sigma statistic p.value df logLik AIC BIC ## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 0.839 0.836 210. 292. 2.15e-151 7 -2708. 5434. 5470. ## # ... with 3 more variables: deviance <dbl>, df.residual <int>, nobs <int> ``` > Adjusted `\(R^2\)` and AIC are slightly better with macro data --- ## Model comparisons ```r anova(baseline, macro3, test = "Chisq") ``` ``` ## Analysis of Variance Table ## ## Model 1: revt_lead ~ revt + act + che + lct + dp + ebit ## Model 2: revt_lead ~ revt + act + che + lct + dp + ebit + fin_sent_scaled:revt ## Res.Df RSS Df Sum of Sq Pr(>Chi) ## 1 394 17617000 ## 2 393 17253888 1 363112 0.004029 ** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ``` > The macro model fits significantly better than the baseline model! --- ## Takeaway 1. Adding macro data can help explain some exogenous variation in a model - Exogenous meaning outside of the firms, in this case 2. Scaling is very important - Not scaling properly can hide some effects from being visible -- > Interpreting the macro variable - For every 1 S.D. increase in `fin_sentiment` (18.4 points) - Revenue stickiness increases by ~9% - Over the range of data (-38 to 44.65)... - The revenue stickiness effect ranges from -23.7% to +17% --- class: inverse, center, middle # Validation: Is it better?
--- ## Validation - Ideal: - Withhold the last year (or a few) of data when building the model - Check performance on the *hold out sample* - Sometimes acceptable: - Withhold a random sample of data when building the model - Check performance on the *hold out sample* > This is the basic idea of machine learning validation; we will cover it more formally in future topics --- ## Estimation - As we never constructed a hold out sample, let's end by estimating UOL's *2019* revenue - We forecast one year ahead, so fyear = 2018 produces the 2019 forecast ```r p_uol <- predict(forecast2, uol[uol$fyear == 2018, ]) p_base <- predict(baseline, df_SG_macro[df_SG_macro$isin == "SG1S83002349" & df_SG_macro$fyear == 2018,]) p_macro <- predict(macro3, df_SG_macro[df_SG_macro$isin == "SG1S83002349" & df_SG_macro$fyear == 2018,]) p_world <- predict(forecast4, df_clean[df_clean$isin == "SG1S83002349" & df_clean$fyear == 2018,]) preds <- c(p_uol, p_base, p_macro, p_world) names(preds) <- c("UOL 2019 UOL", "UOL 2019 Base", "UOL 2019 Macro", "UOL 2019 World") preds ``` ``` ## UOL 2019 UOL UOL 2019 Base UOL 2019 Macro UOL 2019 World ## 2325.326 2379.430 2401.298 2882.955 ``` --- ## Visualizing our prediction - I plot the 2019 forecast separately from the other years' forecasts - I also plot the actual revenue for comparison - Click a legend entry to mute that series
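
---

## Hold-out sketch

The ideal validation described above, fitting on early years and testing on withheld years, can be sketched with simulated data. Everything below (years, coefficients, noise) is hypothetical:

```r
set.seed(1)
n <- 120
df <- data.frame(fyear = rep(1989:2018, each = 4))  # 4 toy firms per year
df$revt      <- rlnorm(n, meanlog = 6)              # simulated revenue
df$revt_lead <- 0.9 * df$revt + rnorm(n, sd = 50)   # next-year revenue

train <- df[df$fyear <  2016, ]  # fit on early years only
test  <- df[df$fyear >= 2016, ]  # withhold recent years

mod  <- lm(revt_lead ~ revt, data = train)
rmse <- function(v1, v2) sqrt(mean((v1 - v2)^2, na.rm = TRUE))
rmse(test$revt_lead, predict(mod, test))  # out-of-sample error
```

Comparing this out-of-sample RMSE across candidate models is a fairer horse race than comparing in-sample fit.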
--- ## In Sample Accuracy ```r # series data is calculated, see the R code file # Root Mean Square Error (RMSE), i.e. # the standard deviation of the residuals (prediction errors) rmse <- function(v1, v2) { sqrt(mean((v1 - v2)^2, na.rm = T)) } rmse_all <- c(rmse(actual_series, uol_series), rmse(actual_series, base_series), rmse(actual_series, macro_series), rmse(actual_series, world_series)) names(rmse_all) <- c("UOL 2019 UOL", "UOL 2019 Base", "UOL 2019 Macro", "UOL 2019 World") rmse_all ``` ``` ## UOL 2019 UOL UOL 2019 Base UOL 2019 Macro UOL 2019 World ## 189.2207 273.6917 300.6274 349.6541 ``` > Why is UOL the best for in sample? -- > The UOL-only model is trained to minimize error for UOL alone, so it is potentially overfitted, meaning it may not predict well *out of sample*. Out-of-sample prediction is much more useful than in-sample fit, however. --- class: inverse, center, middle # Summary of Session 4 --- ## For next week - Try to replicate the code - Start to explore your group project data - Continue your Datacamp career track - Second individual assignment - Do this one individually! - Submission and feedback on eLearn --- ## R Coding Style Guide Style is subjective and arbitrary, but it is important to follow a generally accepted style if you want to share code with others. I suggest [The tidyverse style guide](https://style.tidyverse.org/), which is also adopted by [Google](https://google.github.io/styleguide/Rguide.html) with some modification - Highlights of **the tidyverse style guide**: - *File names*: end with .R - *Identifiers*: variable_name, function_name, try not to use "."
as it is used by Base R's S3 method dispatch - *Line length*: 80 characters - *Indentation*: two spaces, no tabs (RStudio by default converts tabs to spaces; you may change this under global options) - *Spacing*: x = 0, not x=0; no space before a comma, but always place one after a comma - *Curly braces {}*: first on same line, last on own line - *Assignment*: use `<-`, not `=` nor `->` - *Semicolon (;)*: don't use; I used one once in the interest of space - *return()*: use explicit returns in functions; by default a function returns its last evaluated expression - *File paths*: use a [relative file path](https://www.w3schools.com/html/html_filepaths.asp) such as "../../filename.csv" rather than an absolute path such as "C:/mydata/filename.csv". Backslashes need to be escaped as `\\` --- ## R packages used in this slide This slide was prepared on 2021-09-23 from Session_4s.Rmd with R version 4.1.1 (2021-08-10) Kick Things on Windows 10 x64 build 18362 😃. The attached packages used in this slide are: ``` ## plotly DT fixest lfe Matrix broom magrittr ## "4.9.4.1" "0.18" "0.9.0" "2.8-7" "1.3-4" "0.7.9" "2.0.1" ## forcats stringr dplyr purrr readr tidyr tibble ## "0.5.1" "1.4.0" "1.0.7" "0.3.4" "2.0.1" "1.1.3" "3.1.3" ## ggplot2 tidyverse kableExtra knitr ## "3.3.5" "1.3.1" "1.3.4" "1.33" ```