首页> 外国专利> DESIGN LEARNING: LEARNING DESIGN POLICIES BASED ON INTERACTIONS

DESIGN LEARNING: LEARNING DESIGN POLICIES BASED ON INTERACTIONS

机译:设计学习:基于交互的学习设计策略

摘要

Systems, methods, and articles of manufacture for learning design policies based on user interactions. One example includes determining a first task for an environment, receiving data from a plurality of data sources, determining a first time step associated with the received data, determining a plurality of candidate actions for the determined first time step, computing a respective probability value of each candidate action achieving the first task at the first time step based on a first machine learning (ML) model, determining that a first candidate action has a greater probability value for achieving the first task at the first time step relative to the remaining plurality of candidate actions, determining that the first candidate action has not been implemented in the environment at the first time step, and generating an indication specifying to implement the first candidate action as part of a policy to achieve the first task.
机译:用于基于用户交互学习设计策略的系统,方法和产品。一个示例包括确定针对环境的第一任务,从多个数据源接收数据,确定与所接收的数据相关联的第一时间步长,确定所确定的第一时间步长的多个候选动作,计算的相应概率值。基于第一机器学习(ML)模型在第一时间步完成第一任务的每个候选动作,确定相对于其余的多个第一动作在第一时间步实现第一任务的概率值更大候选动作,确定在第一时间步骤中尚未在环境中实施第一候选动作,并生成指示实施第一候选动作的指示,作为实现第一任务的策略的一部分。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号