Fit time series models to data and interpret fitted parameters
Fit an \(AR(p)\) model to simulated data
Explain the difference between parameters of the data generating process and estimates
Calculate confidence intervals for AR coefficient estimates
Interpret AR coefficient estimates in the context of the source and nature of historical data
Check model adequacy using diagnostic plots like correlograms of residuals
Compare AR fitted models to an underlying data generating process
Explain the limitations of stochastic model fitting as evidence for or against real-world arguments
Preparation
Read Sections 4.6-4.7
Learning Journal Exchange (10 min)
Review another student’s journal
What would you add to your learning journal after reading another student’s?
What would you recommend the other student add to their learning journal?
Sign the Learning Journal review sheet for your peer
Class Activity: Fitting a Simulated \(AR(1)\) Model with Zero Mean (5 min)
We will use simulation to demonstrate how AR models are fitted. We will fit two different \(AR(1)\) models and an \(AR(2)\) model. The advantage of using simulation is that we know how the time series was constructed: we know the model that was used and the actual values of its parameters, so we can see how close our estimated parameter values are to the true values.
Simulate an \(AR(1)\) Time Series
In this simulation, we first simulate data from the \(AR(1)\) model \[
x_t = 0.75 ~ x_{t-1} + w_t
\] where \(w_t\) is a white noise process with variance 1.
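The code below gives a minimal sketch of how this simulation and fit could be carried out; the seed, the series length, and the object names sim_ts and sim_ar are illustrative choices, not values taken from the original analysis.

library(fable)     # AR() models; attaches fabletools for model(), tidy(), etc.
library(tsibble)   # tsibble objects

set.seed(123)   # illustrative seed
sim_ts <- tsibble(
  t = 1:200,   # illustrative series length
  x = as.numeric(arima.sim(model = list(ar = 0.75), n = 200, sd = 1)),
  index = t
)

sim_ar <- sim_ts |>
  model(AR(x ~ order(1)))   # fit an AR(1) model to the simulated series

tidy(sim_ar)   # coefficient estimates and standard errors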
The estimate of the parameter \(\alpha_1\) (i.e. the fitted value of the parameter \(\alpha_1\)) is \(\hat \alpha_1 = 0.72\).
When R fits an AR model, the mean of the time series is subtracted from the data before the parameter values are estimated. If R determines that the mean of the time series is not significantly different from zero, the estimated mean is omitted from the output.
Because the mean is subtracted from the time series before the parameter values are estimated, R is using the model \[
z_t = \alpha_1 ~ z_{t-1} + w_t
\] where \(z_t = x_t - \mu\) and \(\mu\) is the mean of the time series.
Check Your Understanding
Answer the following questions with your partner.
Use the expression for \(z_t\) above to solve for \(x_t\) in terms of \(x_{t-1}\), \(\mu\), \(\alpha_1\), and \(w_t\).
What does your model reduce to when \(\mu = 0\)?
Explain to your partner why this correctly models a time series with mean \(\mu\).
We replace the parameter \(\mu\) with its estimator \(\hat \mu = \bar x\). We also replace \(\alpha_1\) with the fitted value from the output \(\hat \alpha_1\). This gives us the fitted model: \[
\hat x_t = \bar x + \hat \alpha_1 ~ (x_{t-1} - \bar x)
\]
R does not report the estimate of the mean of the process, \(\hat \mu = 0.019\), because it is not significantly different from zero. One could therefore argue that we should not use a model that contains the mean and should instead use the simpler fitted model with only one parameter:
\[
\hat x_t = 0.72 ~ x_{t-1}
\]
Confidence Interval for the Model Parameter
The P-value reported in the R output tests the null hypothesis that \(\alpha_1=0\). This is not helpful in this context. We are interested in the plausible values of \(\alpha_1\), not whether or not it differs from zero. For this reason, we consider a confidence interval and disregard the P-value.
We can compute an approximate 95% confidence interval for \(\alpha_1\) as: \[
\left(
\hat \alpha_1 - 2 \cdot SE_{\hat \alpha_1}
, ~
\hat \alpha_1 + 2 \cdot SE_{\hat \alpha_1}
\right)
\] where \(\hat \alpha_1\) is our parameter estimate and \(SE_{\hat \alpha_1}\) is the standard error of the estimate. Both of these values are given in the R output.
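If the model object is available, the interval can be computed directly from the tidy() output. The sketch below uses the illustrative object name sim_ar from the earlier code and assumes the AR coefficient is labelled ar1 in that output:

library(dplyr)   # filter()

est <- tidy(sim_ar) |>
  filter(term == "ar1")   # assumes the AR coefficient is labelled "ar1"

c(lower = est$estimate - 2 * est$std.error,
  upper = est$estimate + 2 * est$std.error)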
So, our 95% confidence interval for \(\alpha_1\) is: \[
\left(
0.72 - 2 \cdot 0.022
, ~
0.72 + 2 \cdot 0.022
\right)
\] or \[
\left(
0.676
, ~
0.764
\right)
\] Note that the confidence interval contains \(\alpha_1 = 0.75\), the value of the parameter we used in our simulation. The process of estimating the parameter worked well. In practice, we will not know the true value of \(\alpha_1\), but the confidence interval gives us a range of plausible values for it.
Residuals
For an \(AR(1)\) model where the mean of the time series is not statistically significantly different from 0, the residuals are computed as \[\begin{align*}
r_t
&= x_t - \hat x_t \\
&= x_t - \left[ 0.72 ~ x_{t-1} \right]
\end{align*}\]
We can easily obtain these residual values in R:
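(A sketch, using the illustrative model object sim_ar from the earlier code; residuals() on the fitted model returns a tsibble with a .resid column.)

sim_resid <- residuals(sim_ar)
head(sim_resid)                       # first few residuals; the first is NA
var(sim_resid$.resid, na.rm = TRUE)   # variance of the residuals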
The variance of the residuals is \(0.982\). This is very close to the actual value used in the simulation: \(\sigma^2 = 1\).
Class Activity: Fitting a Simulated \(AR(1)\) Model with Non-Zero Mean (5 min)
Simulate an \(AR(1)\) Time Series
It is easy to conceive of situations where the mean of an AR process, \(\mu\), is not zero. The model we have been fitting is \[
x_t = \mu + \alpha_1 ~ \left( x_{t-1} - \mu \right) + w_t
\] where \(\mu\) and \(\alpha_1\) are constants, and \(w_t\) is a white noise process with variance \(\sigma^2\).
This model can be simplified by combining like terms. \[\begin{align*}
x_t
&= \mu + \alpha_1 ~ \left( x_{t-1} - \mu \right) + w_t \\
&= \underbrace{\mu - \alpha_1 ~ (\mu)}_{\alpha_0} + \alpha_1 ~ \left( x_{t-1} \right) + w_t \\
&= \alpha_0 + \alpha_1 ~ \left( x_{t-1} \right) + w_t
\end{align*}\]
Suppose the mean of the \(AR(1)\) process is \(\mu = 50\). We will set \(\alpha_1 = 0.75\), and \(\sigma^2 = 5\) for this simulation. After specifying these numbers, the model becomes: \[\begin{align*}
x_t
&= 50 + 0.75 ~ ( x_{t-1} - 50 ) + w_t \\
&= 50 - 0.75 ~ ( 50 ) + 0.75 ~ x_{t-1} + w_t \\
&= 12.5 + 0.75 ~ x_{t-1} + w_t
\end{align*}\] where \(w_t\) is a white noise process with variance \(\sigma^2 = 5\).
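A sketch of simulating and fitting this process is given below; the seed, the series length, and the object names sim50_ts and sim50_ar are illustrative:

set.seed(456)   # illustrative seed
sim50_ts <- tsibble(
  t = 1:200,
  x = 50 + as.numeric(arima.sim(model = list(ar = 0.75), n = 200, sd = sqrt(5))),
  index = t
)

sim50_ar <- sim50_ts |>
  model(AR(x ~ order(1)))

tidy(sim50_ar)   # estimates of the constant term and the AR coefficient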
The estimate of the constant (intercept) term \(\alpha_0\) is \(\hat \alpha_0 = 14.091\), and the estimate of the AR coefficient is \(\hat \alpha_1 = 0.719\).
Fitting the model \[
x_t = \alpha_0 + \alpha_1 ~ x_{t-1} + w_t
\] we get \[\begin{align*}
\hat x_t
&= \hat \alpha_0 + \hat \alpha_1 ~ x_{t-1} \\
&= 14.091 +
0.719
~ x_{t-1}
\end{align*}\]
Confidence Intervals for the Model Parameters
We can compute approximate 95% confidence intervals for \(\alpha_0\) and \(\alpha_1\):
\[
\left(
\hat \alpha_i - 2 \cdot SE_{\hat \alpha_i}
, ~
\hat \alpha_i + 2 \cdot SE_{\hat \alpha_i}
\right)
\] where \(\hat \alpha_i\) is our estimate of the parameter \(\alpha_i\) for \(i \in \{0,1\}\), and \(SE_{\hat \alpha_i}\) is the standard error of the respective estimate.
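These intervals can be computed directly from the fitted model; the sketch below uses the illustrative object name sim50_ar from the earlier code:

library(dplyr)   # mutate(), select()

tidy(sim50_ar) |>
  mutate(lower = estimate - 2 * std.error,
         upper = estimate + 2 * std.error) |>
  select(term, estimate, lower, upper)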
\[
\left(
0.675
, ~
0.763
\right)
\] The confidence interval for \(\alpha_1\) contains \(\alpha_1 = 0.75\), the true value.
Both intervals captured the true values used in the simulation. The process of estimating the parameters worked well. In practice, we will not know the values of \(\alpha_0\) and \(\alpha_1\), but the confidence intervals give us ranges of plausible values. About 95% of the time, an interval constructed this way will capture the true parameter value.
Residuals
The residuals in this model are computed as \[\begin{align*}
r_t
&= x_t - \hat x_t \\
&= x_t -
\left[
14.091 +
0.719
~ x_{t-1}
\right]
\end{align*}\]
The variance of the residuals is \(4.911\), which is near the actual parameter value: \(\sigma^2 = 5\).
Class Activity: Fitting a Simulated \(AR(2)\) Model (10 min)
Simulate an \(AR(2)\) Time Series
In this section, we will simulate data from the following \(AR(2)\) process: \[
x_t = 2 + 0.5 ~ x_{t-1} + 0.4 ~ x_{t-2} + w_t
\] where \(w_t\) is a discrete white noise process with variance \(\sigma^2 = 9\).
Check Your Understanding
Use the \(AR(2)\) process above to answer the following questions.
Is this \(AR(2)\) process stationary? (Hint: The characteristic polynomial is formed only from the terms that involve \(x_t\), \(x_{t-1}\), and \(x_{t-2}\); the constant does not appear in it.)
Rewrite the model in the form \[
x_t = \mu + \alpha_1 ~ ( x_{t-1} - \mu) + \alpha_2 ~ ( x_{t-2} - \mu) + w_t
\] Identify the value of each of the coefficients (\(\mu\), \(\alpha_1\), and \(\alpha_2\)).
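The sketch below shows one way data from this \(AR(2)\) process could be simulated and fitted; the seed, the series length, and the object names are illustrative:

set.seed(789)   # illustrative seed
n <- 300        # illustrative series length
w <- rnorm(n, sd = 3)           # white noise with variance 9
x <- numeric(n)
x[1:2] <- 2 / (1 - 0.5 - 0.4)   # start at the process mean, 20
for (t in 3:n) {
  x[t] <- 2 + 0.5 * x[t - 1] + 0.4 * x[t - 2] + w[t]
}

ar2_ts <- tsibble(t = 1:n, x = x, index = t)
ar2_ar <- ar2_ts |>
  model(AR(x ~ order(2)))

tidy(ar2_ar)   # estimates of the constant term and the two AR coefficients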
The estimates of the parameter values are: \(\hat \alpha_0 = 2.333\), \(\hat \alpha_1 = 0.478\), and \(\hat \alpha_2 = 0.408\). This means that our fitted model can be expressed as: \[
\hat x_t = 2.333 + 0.478 ~ x_{t-1} + 0.408 ~ x_{t-2}
\]
We can compute an approximate 95% confidence interval for \(\alpha_i\) as: \[
\left(
\hat \alpha_i - 2 \cdot SE_{\hat \alpha_i}
, ~
\hat \alpha_i + 2 \cdot SE_{\hat \alpha_i}
\right)
\] where \(\hat \alpha_i\) is our estimate of the \(i^{th}\) parameter and \(SE_{\hat \alpha_i}\) is the standard error of the respective estimate. These values are given in the R output.
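As a sketch, the estimates, standard errors, and intervals could be obtained as follows (the object name ar2_ar comes from the earlier illustrative simulation):

tidy(ar2_ar) |>
  mutate(lower = estimate - 2 * std.error,
         upper = estimate + 2 * std.error)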
Explain why there are no residuals for times \(t=1\) and \(t=2\).
The variance of the residuals is 8.857. This is close to 9, the parameter used in the simulation.
Small-Group Activity: Global Warming (20 min)
The time plot below illustrates the change in global surface temperature compared to the long-term average observed from 1951 to 1980. (Source: NASA/GISS.)
Show the code
temps_ts <- rio::import("https://byuistats.github.io/timeseries/data/global_temparature.csv") |>
  as_tsibble(index = year)

temps_ts |>
  autoplot(.vars = change) +
  labs(
    x = "Year",
    y = "Temperature Change (Celsius)",
    title = paste0("Change in Mean Annual Global Temperature (",
                   min(temps_ts$year), "-", max(temps_ts$year), ")")
  ) +
  theme_minimal() +
  theme(plot.title = element_text(hjust = 0.5))
Using the PACF to Choose \(p\) for an \(AR(p)\) Process
In the previous lesson, we noted that the partial correlogram can be used to assess the number of parameters in an AR model. Here is a partial correlogram for the change in the mean annual global temperature.
Show the code
pacf(temps_ts$change)
Check Your Understanding
Working with your partner, do the following:
We will apply an \(AR(p)\) model. What value of \(p\) is suggested by the pacf?
Using the value of \(p\) you identified, fit an \(AR(p)\) model to the global temperature data. State the fitted \(AR(p)\) model in the form \[\hat x_t = \cdots\]
Obtain 95% confidence intervals for each of the parameters. Which are significantly different from zero?
Give the first three residual values (skipping the NAs).
What is the estimate of \(\sigma^2\)?
Make a correlogram for the residuals. Does it appear that your model has fully explained the variation in the temperatures?
Fitting Models (Dynamic Number of Parameters)
You may have concluded that \(p=3\) might be insufficient for modeling these data. We now explore a technique that lets R choose \(p\) for us.
If we specify order(1:9) in the model statement, R fits \(AR(p)\) models for each order up to \(p=9\) and returns the one that fits best according to an information criterion.
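A sketch of this model statement is shown below. It assumes the data are in temps_ts with the temperature change stored in the column change (as in the plotting code above) and uses the object name global_ar that later code in this lesson refers to:

global_ar <- temps_ts |>
  model(AR(change ~ order(1:9)))   # consider AR orders 1 through 9

tidy(global_ar)   # coefficients of the selected model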
R returned an \(AR(6)\) model for this time series.
Check Your Understanding
Working with your partner, do the following:
State the fitted \(AR(p)\) model in the form \[\hat x_t = \cdots\]
Obtain 95% confidence intervals for each of the parameters. Which are significantly different from zero?
Give the first three residual values (skipping the NAs).
What is the estimate of \(\sigma^2\)?
Make a correlogram for the residuals. Does it appear that your model has fully explained the variation in the temperatures? Justify your answer.
Stationarity of the \(AR(p)\) Model
With the exception of a lone seemingly spurious autocorrelation, there are no significant values of the acf of the residuals in the \(AR(6)\) model. This suggests that the model accounts for the variation in the time series.
Check Your Understanding
Write the characteristic equation for the \(AR(6)\) model you developed.
Click on the link below to obtain a more precise version of the characteristic equation, then solve the characteristic equation by any method.
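One numerical approach is sketched below. It assumes the AR coefficients can be pulled from the fitted model with tidy(), with terms labelled ar1, ar2, and so on, and it mirrors the polyroot() call used later in this lesson:

library(dplyr)     # filter(), pull()
library(stringr)   # str_detect()

alphas <- tidy(global_ar) |>
  filter(str_detect(term, "^ar")) |>   # keep the AR coefficients, not the constant
  pull(estimate)

polyroot(c(1, -alphas)) |> abs()   # all moduli greater than 1 indicates a stationary model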
Class Activity: Forecasting with an \(AR(p)\) Model (5 min)
We now use the model to forecast the mean temperature difference for the next 50 years.
Show the code
temps_forecast <- global_ar |>
  forecast(h = "50 years")

temps_forecast |>
  autoplot(temps_ts, level = 95) +
  geom_line(aes(y = .fitted, color = "Fitted"),
            data = augment(global_ar)) +
  scale_color_discrete(name = "") +
  labs(
    x = "Year",
    y = "Temperature Change (Celsius)",
    title = paste0("Change in Mean Annual Global Temperature (",
                   min(temps_ts$year), "-", max(temps_ts$year), ")"),
    subtitle = paste0("50-Year Forecast Based on our AR(",
                      tidy(global_ar) |> as_tibble() |> dplyr::select(term) |> tail(1) |> right(1),
                      ") Model")
  ) +
  theme_minimal() +
  theme(
    plot.title = element_text(hjust = 0.5),
    plot.subtitle = element_text(hjust = 0.5)
  )
Check Your Understanding
Does this forecast seem appropriate for the data? Why or why not?
Class Activity: Comparison to the Results in Section 4.6.3 of the Book (5 min)
In Sections 1.4.5 and 4.6.3 of the textbook, the authors present a similar dataset on the mean annual temperatures on Earth through 2005. Here is a time plot of their data:
Show the code
global_ts <- tibble(x = scan("data/global.dat")) |>
  mutate(
    date = seq(ymd("1856-01-01"), by = "1 months", length.out = n()),
    year = year(date),
    year_month = tsibble::yearmonth(date)
  ) |>
  summarise(x = mean(x), .by = year) |>
  as_tsibble(index = year)

global_ts |>
  autoplot(.vars = x) +
  labs(
    x = "Year",
    y = "Temperature Change (Celsius)",
    title = paste0("Change in Mean Annual Global Temperature (",
                   min(global_ts$year), "-", max(global_ts$year), ")"),
    subtitle = "Data from Textbook Sections 1.4.5 and 4.6.3"
  ) +
  theme_minimal() +
  theme(
    plot.title = element_text(hjust = 0.5),
    plot.subtitle = element_text(hjust = 0.5)
  )
The absolute values of the solutions of the characteristic equation are:
Show the code
polyroot(c(1, -alphas)) |> abs() |> round(3)
[1] 1.011 1.755 1.453 1.453
Check Your Understanding
Is the textbook’s model stationary?
In the textbook, the author stated, “The correlogram of the residuals has only one (marginally) significant value at lag 27, so the underlying residual series could be white noise (Fig. 4.14). Thus the fitted AR(4) model (Equation (4.24)) provides a good fit to the data. As the AR model has no deterministic trend component, the trends in the data can be explained by serial correlation and random variation, implying that it is possible that these trends are stochastic (or could arise from a purely stochastic process). Again we emphasise that this does not imply that there is no underlying reason for the trends. If a valid scientific explanation is known, such as a link with the increased use of fossil fuels, then this information would clearly need to be included in any future forecasts of the series.”
What is the author saying?
How would you respond to this statement?
Here is a plot of the forecasted values for the next 100 years, based on the textbook’s model:
Show the code
# global_ar_book <- global_ts |>
#   model(AR(x ~ order(4)))

temps_forecast_book <- global_ar_book |>
  forecast(h = "100 years")

temps_forecast_book |>
  autoplot(global_ts, level = 95) +
  # geom_line(aes(y = .mean, color = "Fitted"),
  #           data = augment(global_ar_book)) +
  # scale_color_discrete(name = "") +
  labs(
    x = "Year",
    y = "Temperature Change (Celsius)",
    title = paste0("Change in Mean Annual Global Temperature (",
                   min(temps_ts$year), "-", max(temps_ts$year), ")"),
    subtitle = "100-Year Forecast Based on the Book's AR(4) Model"
  ) +
  theme_minimal() +
  theme(
    plot.title = element_text(hjust = 0.5),
    plot.subtitle = element_text(hjust = 0.5)
  )
Check Your Understanding
Compare and contrast the results you observed in the two global temperature time series.
We will apply an \(AR(p)\) model. What value of \(p\) is suggested by the pacf?
Solution:
\[p=3\]
Using the value of \(p\) you identified, fit an \(AR(p)\) model to the global temperature data. State the fitted \(AR(p)\) model in the form \[\hat x_t = \cdots\]
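A sketch of obtaining the fit is shown below (the object name global_ar_3 is illustrative); the estimates in the tidy() output are then substituted into the form \(\hat x_t = \cdots\):

global_ar_3 <- temps_ts |>
  model(AR(change ~ order(3)))   # AR(3) fit to the temperature changes

tidy(global_ar_3)   # coefficients and standard errors for the fitted model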
The estimate of \(\sigma^2\) is \(\hat \sigma^2 = 0.01\).
Make a correlogram for the residuals. Does it appear that your model has fully explained the variation in the temperatures? Justify your answer.
Solution:
Show the code
residuals(global_ar) |>
  ACF(lag_max = 50) |>
  autoplot(.vars = .resid) +
  labs(
    title = paste0("ACF of the Residuals from the AR(",
                   tidy(global_ar) |> as_tibble() |> dplyr::select(term) |> tail(1) |> right(1),
                   ") Model")
  ) +
  theme_minimal() +
  theme(plot.title = element_text(hjust = 0.5))
There is only one significant autocorrelation, at \(k=34\). This is probably a Type I error; spurious significant autocorrelations like this are expected about 5% of the time. None of the other autocorrelations are significant, particularly among the smaller values of \(k\). It appears that this model has fully explained the variation in the temperatures.