Regression modelling of spatiotemporal extreme U.S. wildfires via partially-interpretable neural networks
Abstract: Risk management in many environmental settings requires an understanding of the mechanisms that drive extreme events. Useful metrics for quantifying such risk are extreme quantiles of response variables conditioned on predictor variables that describe, e.g., climate, biosphere and environmental states. Typically these quantiles lie outside the range of observable data and so, for estimation, require specification of parametric extreme value models within a regression framework. Classical approaches in this context utilise linear or additive relationships between predictor and response variables and suffer in either their predictive capabilities or computational efficiency; moreover, their simplicity is unlikely to capture the truly complex structures that lead to the creation of extreme wildfires. In this paper, we propose a new methodological framework for performing extreme quantile regression using artificial neural networks, which are able to capture complex non-linear relationships and scale well to high-dimensional data. The "black box" nature of neural networks means that they lack the desirable trait of interpretability often favoured by practitioners; thus, we unify linear and additive regression methodology with deep learning to create partially-interpretable neural networks that can be used for statistical inference but retain high prediction accuracy. To complement this methodology, we further propose a novel point process model for extreme values which overcomes the finite lower-endpoint problem associated with the generalised extreme value class of distributions. Efficacy of our unified framework is illustrated on U.S. wildfire data with a high-dimensional predictor set, and we demonstrate vast improvements in predictive performance over linear and spline-based regression techniques.
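To make the idea of a partially-interpretable network concrete, the following is a minimal sketch and not the authors' implementation. It assumes that a model parameter is built from an intercept, a linear term on a chosen subset of "interpretable" predictors, and a neural-network term on the remaining predictors, passed through a log link. The function name `pinn_predictor`, the single hidden layer, the ReLU activation, the log link, and all dimensions and weights are hypothetical choices made for illustration only.

```python
import numpy as np


def relu(z):
    """Rectified linear unit, applied elementwise."""
    return np.maximum(z, 0.0)


def pinn_predictor(x_lin, x_nn, beta0, beta, W1, b1, w2):
    """Pre-link predictor: intercept + linear part + neural-network part.

    x_lin : (n, p_lin) predictors given an interpretable linear effect
    x_nn  : (n, p_nn) predictors handled by the black-box network
    """
    linear_part = x_lin @ beta          # one readable coefficient per predictor
    hidden = relu(x_nn @ W1 + b1)       # single hidden layer (assumed architecture)
    nn_part = hidden @ w2               # unconstrained non-linear contribution
    return beta0 + linear_part + nn_part


# Hypothetical dimensions and randomly drawn (untrained) weights.
rng = np.random.default_rng(0)
n, p_lin, p_nn, width = 5, 2, 4, 8
x_lin = rng.normal(size=(n, p_lin))
x_nn = rng.normal(size=(n, p_nn))
beta0 = 0.5
beta = np.array([1.5, -0.7])
W1 = rng.normal(size=(p_nn, width))
b1 = rng.normal(size=width)
w2 = rng.normal(size=width)

eta = pinn_predictor(x_lin, x_nn, beta0, beta, W1, b1, w2)
scale = np.exp(eta)  # assumed log link, keeping a scale-type parameter positive

# Interpretability check: raising the first interpretable predictor by one
# unit shifts the pre-link predictor by exactly beta[0], whatever the
# network part is doing.
x_shift = x_lin.copy()
x_shift[:, 0] += 1.0
eta_shift = pinn_predictor(x_shift, x_nn, beta0, beta, W1, b1, w2)
effect = eta_shift - eta
```

In this sketch the coefficients in `beta` keep their usual regression reading, while the network term is free to absorb complex structure in the remaining predictors. In practice all weights would be estimated jointly by minimising a loss derived from the chosen extreme value model, a step omitted here.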