首页> 外国专利> 一种训练样本有效性检测方法、计算机设备及计算机非易失性存储介质

一种训练样本有效性检测方法、计算机设备及计算机非易失性存储介质

机译:一种训练样本有效性检测方法、计算机设备及计算机非易失性存储介质

摘要

A training sample validity detection method, a computer device, and a computer non-volatile storage medium, relating to the technical field of artificial intelligence. The method comprises: acquiring multiple extended questions, wherein each extended question is associated with a corresponding preset standard question (S101); randomly dividing the multiple expansion questions into preset copies of sample sets and dividing the preset copies of sample sets into a training set and a cross-validation set according to a preset ratio (S102); training a classification model by using the training set (S103); using the classification model to label the multiple extended questions in the cross-validation set by adopting a cross-validation method until all the extended questions are labeled (S104); acquiring the labeling results of all the extended questions output by the classification model (S105); and obtaining abnormal extended questions according to the labeling results, the labeling results of the abnormal extended questions being different from the associated preset standard questions (S106). The present application can solve the problem of low efficiency of training sample validity detection in the prior art.
机译:一种训练样本有效性检测方法,计算机设备和计算机非易失性存储介质,涉及人工智能技术领域。该方法包括:获取多个扩展问题,其中每个扩展问题与对应的预设标准问题相关联(S101);将多个扩展问题随机分为样本集的预设副本,并根据预设比例将样本集的预设副本分为训练集和交叉验证集(S102);通过使用训练集训练分类模型(S103);使用分类模型,通过采用交叉验证方法,对交叉验证集合中的多个扩展问题进行标注,直到所有扩展问题都被标注为止(S104);获取分类模型输出的所有扩展问题的标注结果(S105);根据标签结果获取异常扩展问题,所述异常扩展问题的标签结果与关联的预设标准问题不同(S106)。本发明可以解决现有技术中训练样本有效性检测效率低的问题。

著录项

获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号