首页> 外国专利> DISTRIBUTED RANDOM FOREST TRAINING WITH A PREDICTOR TRAINED TO BALANCE TASKS

DISTRIBUTED RANDOM FOREST TRAINING WITH A PREDICTOR TRAINED TO BALANCE TASKS

机译:分布式随机森林训练,带有预测员的平衡任务训练

摘要

In one embodiment, a device distributes sets of training records from a training dataset for a random forest-based classifier among a plurality of workers of a computing cluster. Each worker determines whether it can perform a node split operation locally on the random forest by comparing a number of training records at the worker to a predefined threshold. The device determines, for each of the split operations, a data size and entropy measure of the training records to be used for the split operation. The device applies a machine learning-based predictor to the determined data size and entropy measure of the training records to be used for the split operation, to predict its completion time. The device coordinates the workers of the computing cluster to perform the node split operations in parallel such that the node split operations in a given batch are grouped based on their predicted completion times.
机译:在一个实施例中,一种设备在用于计算集群的多个工人之间的,基于随机森林的分类器的训练数据集中分配训练记录的集合。每个工作人员通过将工作人员处的培训记录数与预定阈值进行比较,确定是否可以在随机森林上本地执行节点拆分操作。设备为每个拆分操作确定要用于拆分操作的训练记录的数据大小和熵度量。该设备将基于机器学习的预测器应用于要用于拆分操作的训练记录的确定数据大小和熵度量,以预测其完成时间。设备协调计算群集的工作人员以并行执行节点拆分操作,以便基于给定批处理中的节点拆分操作的预测完成时间对其分组。

著录项

  • 公开/公告号US2020111030A1

    专利类型

  • 公开/公告日2020-04-09

    原文格式PDF

  • 申请/专利权人 CISCO TECHNOLOGY INC.;

    申请/专利号US201816152578

  • 发明设计人 RADEK STAROSTA;JAN BRABEC;LUKAS MACHLICA;

    申请日2018-10-05

  • 分类号G06N99;G06N7;

  • 国家 US

  • 入库时间 2022-08-21 11:19:11

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号