Q-Learning based Maximum Power Point Tracking Control for Microbial Fuel Cell

Li-ping Fan; Xiang Feng

摘要

Microbial fuel cell (MFC) is a promising technology for wastewater treatment with simultaneousbioenergy production. To improve the power generation efficiency of MFCs, maximum power pointtracking control is a good choice. Three kinds of Q-Learning-based maximum power point trackingcontrol scheme based on ε-greedy exploration, Boltzmann exploration and greedy policy are proposedfor MFCs. The results show that the maximum power point tracking control based on Q-Learning hasbetter power tracking capabilities than perturbation and observation method. With the introduction of QLearning based on greedy policy, the time required for MFC to stabilize at the maximum power point isgreatly shortened by setting the action list of Q-Learning reasonably. In this case, the whole processfrom start-up to stabilization at the maximum power point was 42.9% faster than that of MFC using εgreedy exploration, and 50% faster than that of MFC using Boltzmann exploration. Q-Learningalgorithm based on greedy policy is an effective method to realize MPPT in MFC system.

机译：微生物燃料电池（MFC）是具有同时生产的废水处理的有希望的技术。为了提高MFCS的发电效率，最大功率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率频率距离是一个不错的选择。基于ε贪婪勘探，Boltzmann探索和贪婪政策的三种基于Q学习的最大功率点追踪控制计划是针对MFCS的。结果表明，基于Q学习Hastbetter电源跟踪能力的最大功率点跟踪控制比扰动和观察方法。随着基于贪婪政策的QLearning，MFC在最大功率点稳定的时间通过合理地设置Q-Leature的动作列表来缩短。在这种情况下，在最大功率点以最大功率点启动到稳定的整个过程比MFC的速度快42.9％，使用Boltzmann探索的MFC速度比MFC快50％。基于贪婪策略的Q-MearnalGorithgorithgorithgoriThirm是实现MFC系统中MPPT的有效方法。

Q-Learning based Maximum Power Point Tracking Control for Microbial Fuel Cell

摘要

著录项

相关主题

期刊订阅