首页> 外文会议>World Environmental and Water Resources Congress >Identifying Key Factors Affecting Harmful Algal Blooms in Western Lake Erie from the Perspective of Machine Learning
【24h】

Identifying Key Factors Affecting Harmful Algal Blooms in Western Lake Erie from the Perspective of Machine Learning

机译:从机器学习角度识别影响西湖伊利有害藻类盛开的关键因素

获取原文

摘要

The quality of natural waters is of critical importance for public health, welfare, sustainable development, and ecological systems. Unfortunately, Lake Erie has been facing a persistent crisis of harmful algal blooms (HABs) since the 1960s. The most annual occurrence of HABs at Lake Erie predominately occurs at the western region between the months of May and August. HABs are exhibited by the excessive growth of cyanobacteria (i.e., blue-green algae), which often produce toxins such as microcystins. The goal of our research is to identify key factors affecting HABs in western Lake Erie from the perspective of machine learning, which is a powerful tool for predicting the output from the input features. Specifically, chlorophyll-a is usually a direct indicator of the severity of HABs and is applied as the output target in machine learning. We collected data from both on site and remote sensing. We found that several features of remote sensing were not reliable and thus applied only on-site data as input features in machine learning. After comparing 12 popular machine learning algorithms, we found that the random forests model had the best performance in predicting the value of chlorophyll-a, with a R~2 score of 0.84. Moreover, with the help from machine learning, we could identify the quantitative importance among input features and find that particulate organic carbon is the most correlated factor to chlorophyll-a. Furthermore, we showed that the machine learning model was location dependent and that different structures of a machine learning model should be applied to different locations to predict HABs.
机译:自然水域质量对公共卫生,福利,可持续发展和生态系统至关重要。不幸的是,自20世纪60年代以来,伊利湖一直面临有害藻类盛开(HABS)的持续危机。伊利湖最多的汉族疾病的疾病主要发生在五月和八月之间的西部地区。患有的肌腱(即蓝绿藻)的过度生长,它们通常会产生毒素如微囊藻毒素。我们研究的目标是从机器学习角度识别影响西湖伊利羊肉的关键因素,这是一种强大的工具,用于预测输入功能的输出。具体而言,叶绿素-A通常是HABS严重程度的直接指示器,并作为机器学习中的输出目标应用。我们从现场和遥感中收集数据。我们发现,遥感的几个特征不可靠,因此仅应用了机器学习中的输入功能。在比较12个受欢迎的机器学习算法之后,我们发现随机森林模型在预测叶绿素-A的值方面具有最佳性能,R〜2得分为0.84。此外,在机器学习的帮助下,我们可以识别输入特征之间的定量重要性,并发现颗粒状有机碳是叶绿素-A最相关的因素。此外,我们表明机器学习模型取决于位置,并且机器学习模型的不同结构应该应用于不同地点以预测疾病。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号