...
首页> 外文期刊>Environmental Science & Technology >Using Machine Learning to Classify Bioactivity for 3486 Per- and Polyfluoroalkyl Substances (PFASs) from the OECD List
【24h】

Using Machine Learning to Classify Bioactivity for 3486 Per- and Polyfluoroalkyl Substances (PFASs) from the OECD List

机译:使用机器学习对OECD清单中3486种全氟和多氟烷基物质(PFAS)的生物活性进行分类

获取原文
获取原文并翻译 | 示例
           

摘要

A recent OECD report estimated that more than 4000 per- and polyfluorinated alkyl substances (PFASs) have been produced and used in a broad range of industrial and consumer applications. However, little is known about the potential hazards (e.g., bioactivity, bioaccumulation, and toxicity) of most PFASs. Here, we built machine-learning-based quantitative structure-activity relationship (QSAR) models to predict the bioactivity of those PFASs. By examining a number of available molecular data sets, we constructed the first PFAS-specific database that contains the bioactivity information on 1012 PFASs for 26 bioassays. On the basis of the collected PFAS data set, we trained 5 different machine learning models that cover a variety of conventional models (e.g., random forest and multitask neural network (MNN)) and advanced graph-based models (e.g., graph convolutional network). Those models were evaluated based on the validation data set. Both MNN and graph-based models demonstrated the best performance. The average of the best area-under-the-curve score for each bioassay is 0.916. For predictions on the OECD list, most of the biologically active PFASs have perfluoroalkyl chain lengths less than 12 and are categorized into fluorotelomer-related compounds and perfluoroalkyl acids and their precursors.
机译:经合组织最近的一份报告估计,已经生产出4000多种全氟和多氟烷基物质(PFAS),并广泛用于工业和消费领域。但是,对于大多数PFAS的潜在危害(例如,生物活性,生物蓄积性和毒性)知之甚少。在这里,我们建立了基于机器学习的定量结构-活性关系(QSAR)模型,以预测这些PFAS的生物活性。通过检查许多可用的分子数据集,我们构建了第一个特定于PFAS的数据库,其中包含1026种PFAS的生物活性信息,可进行26次生物测定。在收集的PFAS数据集的基础上,我们训练了5种不同的机器学习模型,这些模型涵盖了各种常规模型(例如,随机森林和多任务神经网络(MNN))和基于高级图的模型(例如,图卷积网络) 。这些模型是根据验证数据集进行评估的。 MNN模型和基于图形的模型均显示出最佳性能。每种生物测定的最佳曲线下面积分数的平均值为0.916。为了在OECD清单上进行预测,大多数具有生物活性的PFAS的全氟烷基链长均小于12,并分为与氟调聚物有关的化合物和全氟烷基酸及其前体。

著录项

  • 来源
    《Environmental Science & Technology》 |2019年第23期|13970-13980|共11页
  • 作者

    Cheng Weixiao; Ng Carla A.;

  • 作者单位

    Univ Pittsburgh Dept Civil & Environm Engn Pittsburgh PA 15261 USA;

    Univ Pittsburgh Dept Civil & Environm Engn Pittsburgh PA 15261 USA|Univ Pittsburgh Grad Sch Publ Hlth Dept Environm & Occupat Hlth Pittsburgh PA 15261 USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);美国《生物学医学文摘》(MEDLINE);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号