首页> 外国专利> SYSTEMS AND METHODS FOR QUICKLY SEARCHING DATASETS BY INDEXING SYNTHETIC DATA GENERATING MODELS

SYSTEMS AND METHODS FOR QUICKLY SEARCHING DATASETS BY INDEXING SYNTHETIC DATA GENERATING MODELS

机译:通过索引合成数据生成模型快速搜索数据集的系统和方法

摘要

Systems and methods for searching datasets and classifying datasets are disclosed. For example, a system may include one or more memory units storing instructions and one or more processors configured to execute the instructions to perform operations. The operations may include receiving a test dataset from a client device and generating a test data model output using a data model, based on the test dataset. The operations may include processing test data model output by implementing an encoding method, a factorizing method, and/or a vectorizing method. The operations may include retrieving a reference data model output from a dataset index, based on a reference dataset. The operations may include generating a similarity metric based on the reference data model output and the test data model output. The operations may include classifying the test dataset based on the similarity metric and transmitting, to the client device, information comprising the classification.
机译:公开了用于搜索数据集和对数据集进行分类的系统和方法。例如,一种系统可以包括一个或多个存储指令的存储器单元以及一个或多个被配置为执行指令以执行操作的处理器。所述操作可以包括:从客户端设备接收测试数据集;以及基于测试数据集,使用数据模型来生成测试数据模型输出。所述操作可以包括通过实施编码方法,因数分解方法和/或向量化方法来处理输出的测试数据模型。所述操作可以包括基于参考数据集检索从数据集索引输出的参考数据模型。所述操作可以包括基于参考数据模型输出和测试数据模型输出来生成相似性度量。该操作可以包括基于相似性度量对测试数据集进行分类,以及将包括该分类的信息发送至客户端设备。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号