Quantification of continuous flood hazard using random forest classification and flood insurance claims at large spatial scales: a pilot study in southeast Texas

<p>Pre-disaster planning and mitigation necessitate detailed spatial information about flood hazards and their associated risks. In the US, the Federal Emergency Management Agency (FEMA) Special Flood Hazard Area (SFHA) provides important information about areas subject to flooding during the...

Full description

Bibliographic Details
Main Authors: W. Mobley, A. Sebastian, R. Blessing, W. E. Highfield, L. Stearns, S. D. Brody
Format: Article
Language:English
Published: Copernicus Publications 2021-03-01
Series:Natural Hazards and Earth System Sciences
Online Access:https://nhess.copernicus.org/articles/21/807/2021/nhess-21-807-2021.pdf
Description
Summary:<p>Pre-disaster planning and mitigation necessitate detailed spatial information about flood hazards and their associated risks. In the US, the Federal Emergency Management Agency (FEMA) Special Flood Hazard Area (SFHA) provides important information about areas subject to flooding during the 1 <span class="inline-formula"><i>%</i></span> riverine or coastal event. The binary nature of flood hazard maps obscures the distribution of property risk inside of the SFHA and the residual risk outside of the SFHA, which can undermine mitigation efforts. Machine learning techniques provide an alternative approach to estimating flood hazards across large spatial scales at low computational expense. This study presents a pilot study for the Texas Gulf Coast region using random forest classification to predict flood probability across a 30 523 km<span class="inline-formula"><sup>2</sup></span> area. Using a record of National Flood Insurance Program (NFIP) claims dating back to 1976 and high-resolution geospatial data, we generate a continuous flood hazard map for 12 US Geological Survey (USGS) eight-digit hydrologic unit code (HUC) watersheds. Results indicate that the random forest model predicts flooding with a high sensitivity (area under the curve, AUC: 0.895), especially compared to the existing FEMA regulatory floodplain. Our model identifies 649 000 structures with at least a 1 <span class="inline-formula"><i>%</i></span> annual chance of flooding, roughly 3 times more than are currently identified by FEMA as flood-prone.</p>
ISSN:1561-8633
1684-9981