Comparison of eight filter-based feature selection methods for monthly streamflow forecasting - Three case studies on CAMELS data sets

Ren Kun; Fang Wei; Qu Jihong; Zhang Xia; Shi Xiaoyu

首页> 外文期刊>Journal of Hydrology >Comparison of eight filter-based feature selection methods for monthly streamflow forecasting - Three case studies on CAMELS data sets

【24h】

Comparison of eight filter-based feature selection methods for monthly streamflow forecasting - Three case studies on CAMELS data sets

机译：每月流流预测的八个基于滤波器的特征选择方法的比较 - 骆驼数据集的三种案例研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, there has been an increased emphasis on employing data-driven models to forecast streamflow. However, in these data-driven models used for forecasting monthly streamflow, the performances of filter-based feature selection (FFS) methods have not been studied in detail. In this study, we investigated the effectiveness of eight common FFS methods, namely, linear Pearson correlation, partial linear Pearson correlation (PCI), mutual information (MI), conditional MI, partial MI, maximal relevance minimal redundancy Pearson correlation, maximal relevance minimal redundancy MI and gamma WA methods, on three regression models, namely multiple linear regression (MLR), ensemble extreme learning machine (enELM) and k-nearest neighbor (KNN) regression, for real-world one-month-ahead streamflow forecasting. The study was conducted on three cases from the Catchment Attributes and Meteorology for Large-sample Studies (CAMELS) data sets. Furthermore, two termination criterion (TC) methods, the Hampel WA and resampling, were comparatively analyzed. The results of this study highlight three important findings. First, there was no dominant FFS method that coupled with enELM or KNN. Second, when resampling was applied to select a final model in the candidate combinations of the eight FFS methods and three regression models, PCI was the most favorable FFS method for the final model. Finally, the Hampel test TC was superior to the resampling TC in terms of stability and anti-overfitting. These findings have significant practical reference value for real-world monthly streamflow forecasting.

机译：最近，在采用数据驱动模型之前，增加了预测流流程。然而，在用于预测月度流流的这些数据驱动模型中，尚未详细研究基于滤波器的特征选择（FFS）方法的性能。在这项研究中，我们研究了八个常见的FFS方法的有效性，即线性Pearson相关性，部分线性Pearson相关性（PCI），互信息（MI），条件MI，部分MI，最大相关性最小冗余Pearson相关性，最大相关性最小冗余MI和伽玛WA方法，在三个回归模型，即多元线性回归（MLR），集合极限学习机（ENELM）和K最近邻（KNN）回归，用于现实世界的一个月前的流流预测。该研究在三种情况下从集水区属性和气象进行了大型样本研究（骆驼）数据集。此外，相对分析了两个终止标准（TC）方法，汉普尔瓦和重采样。本研究的结果突出了三个重要发现。首先，没有主导的FFS方法，与ENELM或KNN联系。其次，当应用重新采样时，在八个FFS方法和三个回归模型的候选组合中选择最终模型，PCI是最终模型的最有利的FFS方法。最后，在稳定性和抗过度拟合方面，汉普尔测试TC优于重采样TC。这些发现对现实世界每月流流程预测具有显着的实际参考价值。

著录项

来源
《Journal of Hydrology》 |2020年第2020期|共15页
作者
Ren Kun; Fang Wei; Qu Jihong; Zhang Xia; Shi Xiaoyu;
展开▼
作者单位

North China Univ Water Resources &

Elect Power Zhengzhou 450045 Peoples R China;

Xian Univ Technol Xian 710048 Peoples R China;

North China Univ Water Resources &

Elect Power Zhengzhou 450045 Peoples R China;

North China Univ Water Resources &

Elect Power Zhengzhou 450045 Peoples R China;

Xian Univ Technol Xian 710048 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类水文科学（水界物理学）;
关键词
Feature selection; Filter method; Termination criterion; Streamflow forecasting;

机译：特征选择;过滤方法;终止标准;流流预测;

相似文献

外文文献
中文文献
专利

1. Comparison of eight filter-based feature selection methods for monthly streamflow forecasting - Three case studies on CAMELS data sets [J] . Ren Kun, Fang Wei, Qu Jihong, Journal of Hydrology . 2020,第期

机译：每月流流预测的八个基于滤波器的特征选择方法的比较 - 骆驼数据集的三种案例研究
2. Monthly Streamflow Forecasting Using EEMD-Lasso-DBN Method Based on Multi-Scale Predictors Selection [J] . Haibo Chu, Jiahua Wei, Jun Qiu Water . 2018,第10期

机译：基于多尺度预测器选择的EEMD-Lasso-DBN方法的月流量预测
3. FEATURE SELECTION BASED ON COMPACTNESS AND SEPARABILITY: COMPARISON WITH FILTER-BASED METHODS [J] . Chen Chien-Hsing Computational Intelligence . 2014,第3期

机译：基于紧凑性和可分离性的特征选择：与基于过滤器的方法比较
4. Comparison of feature selection methods applied to Barkhausen noise data set [C] . Aki Sorsa, Kauko Leiviska IFAC World Congress . 2011

机译：应用于Barkhausen噪声数据集的特征选择方法的比较
5. Monthly and seasonal streamflow forecasting in the Rio Grande basin. [D] . Shalamu, Abudu. 2009

机译：里奥格兰德盆地的月度和季节性流量预报。
6. Analysis of the real EADGENE data set: Comparison of methods and guidelines for data normalisation and selection of differentially expressed genes (Open Access publication) [O] . Florence Jaffrézic, Dirk-Jan de Koning, Paul J Boettcher, 2007

机译：实际EADGENE数据集的分析：数据归一化和差异表达基因选择的方法和指南的比较（开放获取出版物）
7. Comparison of Stochastic Models in Forecasting Monthly Streamflow in Rivers: A Case Study of River Nile and Its Tributaries [O] . Mohammed A. Elganiny, Alaa Esmaeil Eldwer 2016

机译：随机模型在河流中每月流动流量的比较 - 以河流河流及其支流为例

Comparison of eight filter-based feature selection methods for monthly streamflow forecasting - Three case studies on CAMELS data sets

摘要

著录项

相似文献

相关主题

期刊订阅