- The paper demonstrates that integrating GARCH models with GRU networks improves volatility forecasting accuracy using metrics like MSE and MAE.
- It leverages multiple GARCH variants alongside GRU layers to capture complex market dynamics across assets such as the S&P 500, Bitcoin, and gold.
- Empirical results indicate that model performance varies by asset and risk tolerance, highlighting the need for tailored approaches for VaR and ES predictions.
Hybrid GARCH and Deep Learning Models for Financial Forecasting
This paper explores a hybrid approach to financial volatility and risk forecasting, integrating traditional GARCH models with deep learning techniques, specifically GRU neural networks. The authors evaluate this hybrid methodology using daily logarithmic returns from the S&P 500 index, Bitcoin, and gold, and assess the performance of the models via Value-at-Risk (VaR) and Expected Shortfall (ES) metrics.
Methodology
The methodology combines GARCH models, including standard GARCH, EGARCH, GJR-GARCH, and APARCH, with GRU neural networks.
GARCH Models
The paper employs four GARCH specifications to model conditional variance:
- GARCH: Defined by the equation:
ht=a0+i=1∑qαiϵt−i2+j=1∑pβjht−j
where ht is the conditional variance, ϵt are error terms, and αi and βj are parameters subject to constraints ensuring positive variance.
- GJR-GARCH: Which incorporates an indicator function to capture asymmetric responses to positive and negative shocks:
ht=a0+i=1∑qαiϵt−i2+i=1∑qθiIt−iϵt−i2+j=1∑pβjht−j
where It−i equals 1 if ϵt−i≤0 and 0 otherwise, and θi are parameters capturing the asymmetric effect.
- EGARCH: Uses the logarithm of the conditional variance to allow for asymmetry:
ln(ht)=a0+i=1∑qαi{θzt−i+γ[∣zt−i∣−E(∣zt−i∣)]}+j=1∑pβjln(ht−j)
where zt are standardized residuals, and θ and γ are parameters capturing the size and sign effects, respectively.
- APARCH: Which generalizes other ARCH variants by including a power term:
htδ=a0+i=1∑qαi(∣ϵt−i∣−γiϵt−i)δ+j=1∑pβjht−jδ
where δ is a power term, and γi allows for asymmetry.
The error term zt is modeled using normal, Student's t-, and skewed Student's t-distributions. Estimation is performed via maximum likelihood using the rugarch package in R.
GRU and Hybrid Models
The paper uses GRU networks, a simplified version of LSTM networks, for the neural network component. The GRU network cells are defined by the following equations:
zt=σ(Wzxt+Uzot−1+bz)r_t = \sigma(W^r x_t + U^r o_{t-1} + b^r)o~t=tanh(Wxt+U(rt⊙ot−1)+b)o_t = (1 - z_t) \odot o_{t-1} + z_t \odot \tilde{o}_t</p><p>wherex_tistheinputvector,o_tistheoutputvector,z_tistheupdategatevector,andr_tistheresetgatevector.ThehybridGARCH−GRUmodelincorporatesGARCHvolatilityforecastsasinputstotheGRUnetwork.ThenetworkistrainedtopredicttheGarman−Klass−Yang−Zhang(GKYZ)volatilityestimator,modifiedtoincludeovernightpricegaps.TheGKYZvolatilityestimatorisgivenby:</p><p>\sigma_i^2 = \frac{1}{n} \sum_{i=1}^{n} \left\{ \frac{1}{2} (\ln(H_i) - \ln(L_i))^2 - (2\ln(2) - 1)(\ln(C_i) - \ln(O_i))^2 \right\}</p><p>whereO_i,H_i,L_i,andC_iaretheopen,high,low,andclosepricesattimei$.
The GRU network architecture consists of three layers with 512, 256, and 128 neurons, respectively, and a single-neuron dense output layer. ReLU activation, dropout regularization (0.3), and L2 kernel regularization (0.00001) are applied. The Adam optimizer is used with a learning rate of 0.0009, and the loss function is the mean squared error between GKYZ volatility estimates and network predictions.
Evaluation Metrics
The volatility forecasts are evaluated using MSE, MAE, and HMSE. The Diebold-Mariano test is used to assess the statistical significance of differences in MSEs between GARCH and GARCH-GRU models. Risk measure accuracy is assessed using VaR and ES, with backtesting performed via the Kupiec test and the Christoffersen test for VaR, and the McNeil and Frey test for ES.
Empirical Analysis
The empirical analysis uses daily logarithmic returns for the S&P 500, Bitcoin, and gold, spanning from 2009 to 2020. A rolling window scheme is employed, with a window size of 504 days for GARCH models and rolling training sets of 1008 observations for the neural network stage.
Results for S&P 500
The EGARCH-GRU model with a skewed Student's t-distribution yields the best MSE. Combining GARCH models with GRU networks generally enhances forecast accuracy, with statistically significant Diebold-Mariano test p-values. The GJR-GARCH-GRU models with normal and skewed Student's t-distributions achieve accurate VaR hit coverage for the 5% tolerance level, while the APARCH model with a skewed Student's t-distribution performs best for the 1% tolerance.
Results for Bitcoin
The APARCH-GRU structure with a normal distribution performs best in terms of point volatility forecasts. However, the improvement in volatility forecasts does not consistently translate to superior VaR prediction performance. "Sheer" APARCH models produce VaR estimates closest to the expected hit ratio.
Results for Gold
A hybrid EGARCH-GRU model with a Student's t-distribution produces the most accurate volatility forecasts. However, the gains from hybridization do not unanimously improve VaR prediction. Hybrid models perform best at the 5% tolerance level, while "sheer" GARCH models perform best for the 1% VaR prediction.
Conclusion
The paper concludes that hybrid GARCH-GRU models can improve financial volatility forecasting. However, the improved volatility forecasts do not always translate into better VaR and ES predictions. The choice of the best model depends on the specific asset and tolerance level. The authors suggest that future research should focus on redefining the target function underlying the GRU components in the hybrid models, specifically for VaR and ES prediction tasks.