ISSN 1003-8035 CN 11-2852/P
    陈建平,辛亚波,王泽鹏,等. 样本选取对地质灾害易发性评价的影响−以山西柳林县为例[J]. 中国地质灾害与防治学报,2024,35(3): 152-162. DOI: 10.16031/j.cnki.issn.1003-8035.202210037
    引用本文: 陈建平,辛亚波,王泽鹏,等. 样本选取对地质灾害易发性评价的影响−以山西柳林县为例[J]. 中国地质灾害与防治学报,2024,35(3): 152-162. DOI: 10.16031/j.cnki.issn.1003-8035.202210037
    CHEN Jianping,XIN Yabo,WANG Zepeng,et al. Effect of sample selection on the susceptibility assessment of geological hazards: A case study in Liulin County, Shanxi Province[J]. The Chinese Journal of Geological Hazard and Control,2024,35(3): 152-162. DOI: 10.16031/j.cnki.issn.1003-8035.202210037
    Citation: CHEN Jianping,XIN Yabo,WANG Zepeng,et al. Effect of sample selection on the susceptibility assessment of geological hazards: A case study in Liulin County, Shanxi Province[J]. The Chinese Journal of Geological Hazard and Control,2024,35(3): 152-162. DOI: 10.16031/j.cnki.issn.1003-8035.202210037

    样本选取对地质灾害易发性评价的影响以山西柳林县为例

    Effect of sample selection on the susceptibility assessment of geological hazards: A case study in Liulin County, Shanxi Province

    • 摘要: 非地质灾害样本的合理选取对地质灾害易发性预测准确度的提高具有重要意义。文章以柳林县为例,选取适宜的影响因子,基于GIS技术采用随机森林模型进行易发性评价。以地质灾害与非地质灾害比例为1∶1、1∶1.5、1∶3、1∶5、1∶10和非地质灾害点距已知灾害点100,500,800,1000 m为选取条件交叉结合共创建20组模型进行分析。结果表明:(1)通过误差指标、混淆矩阵和ROC曲线检验,样本比例和距已知灾害点距离变化对地质灾害易发性评价结果有较大影响。随着样本比例变小,距已知灾害点距离增加,各模型平均绝对误差和均方根误差整体下降,准确率整体上升。各模型ROC曲线下面积值均大于0.8,均有较好的预测效果。当样本比例小于1∶3时,距已知灾害点距离增加对模型误差和准确率影响较小,变化趋于稳定。综合判断样本比例为1∶10、距已知灾害点1000 m为最适合研究区模型。(2)高和极高易发区主要分布在中部及北部道路和河流两侧的地区,是柳林县防灾减灾的重点区。(3)样本选取差异导致易发性结果不同主要是因为建模过程中随机森林模型对数据特征的采集及判断发生变化,样本是否具有代表性发生变化。这些研究成果对当防灾减灾工作的实施具有重要意义。

       

      Abstract: The rational selection of non-geological hazard samples is of great significance to improve the accuracy of geological hazard susceptibility prediction. This study uses Liulin County as a case study, where appropriate impact factors were selected, and the random forest (RF) model was employed for susceptibility assessment based on GIS technology. A total of twenty sets of models were created by varying the ratio of geological hazard to non-geological hazard points (1∶1, 1∶1.5, 1∶3, 1∶5 and 1∶10) and the distance from non-geological hazard points to known hazard points (100,500,800,1000 m). The results demonstrate that: (1) Through error index, confusion matrix, and ROC curve tests, the sample proportion and distance from the known hazard point significantly influenced the geological hazard susceptibility evaluation. As the sample proportion decreased and the distance from known hazard points increased, the overall MAE and RMSE of the models decreased, while the overall ACC increased. All models achieved AUC value greater than 0.8, indicating excellent predictive performance. When the sample proportion was less than 1∶3, the increasing distance from the known hazard points on model error and accuracy became less pronounced, stabilizing the results. The most suitable model for the study area was found to have a sample ratio of 1∶10 and a distance of 1000 m from known hazard points. (2) High and very high susceptibility areas were primarily located in the central and northern regions, adjacent to roads and rivers, making them key areas for hazard prevention and reduction in Liulin County. (3) Differences in sample selection led to varying susceptibility results mainly due to changes in the RF model's data feature collection and judgment during the modeling process, as well as the representativeness of the samples. These research findings hold significant implications for the implementation of hazard prevention and reduction measures.

       

    /

    返回文章
    返回