机译:脱南策略学习的自适应动态编程算法的平行框架
Southeast Univ Sch Automat Nanjing 210096 Peoples R China|Southeast Univ Minist Educ Key Lab Measurement & Control Complex Syst Engn Nanjing 210096 Peoples R China;
Southeast Univ Sch Automat Nanjing 210096 Peoples R China|Southeast Univ Minist Educ Key Lab Measurement & Control Complex Syst Engn Nanjing 210096 Peoples R China;
Southeast Univ Sch Automat Nanjing 210096 Peoples R China|Southeast Univ Minist Educ Key Lab Measurement & Control Complex Syst Engn Nanjing 210096 Peoples R China;
Optimal control; Heuristic algorithms; Nonlinear systems; Stability analysis; Learning systems; Convergence; Dynamic programming; Adaptive dynamic programming (ADP); off-policy learning; policy gradient; sample efficiency;
机译:离散时间系统非零和游戏的基于非策略的自适应动态规划方法
机译:在线学习改进的N步值梯度学习自适应动态规划算法
机译:并行确定性动态规划和分层自适应遗传算法在水库调度优化中的应用
机译:基于遗传算法的并行学习自适应动态规划研究
机译:基于AdaBoost算法的近实时,高度可扩展,并行和分布式的自适应对象检测和再训练框架
机译:物联网环境中动态图像分类的自适应深度学习框架
机译:重新审视本地自适应能力框架:从非洲实施研究和编程框架的实施
机译:自适应异步并行全局优化算法的动态调度策略。