首页> 外国专利> Augmented dataset representation using a taxonomy which accounts for similarity and dissimilarity between each record in the dataset and a user's similarity-biased intuition

Augmented dataset representation using a taxonomy which accounts for similarity and dissimilarity between each record in the dataset and a user's similarity-biased intuition

机译:使用分类法增强数据集表示,该分类法说明了数据集中的每个记录与用户的偏向性直觉之间的相似性和不相似性

摘要

A computerized method of representing a dataset with a taxonomy includes obtaining a dataset comprising a plurality of records, the dataset being characterized by a vocabulary and each of the plurality of records being characterized by at least one term within the vocabulary; identifying nearest neighbors for each term within the vocabulary; imputing a degree of membership for each nearest neighbor identified for each term within the vocabulary; augmenting the obtained dataset with the imputed degree of membership; and generating a taxonomy of the augmented dataset.
机译:一种用分类法表示数据集的计算机化方法,包括获得包括多个记录的数据集,该数据集以词汇表为特征,并且多个记录中的每一个以词汇表中的至少一个术语为特征;确定词汇表中每个术语的最近邻居;为词汇表中为每个术语确定的每个最近邻居估算隶属度;用推定的隶属度扩充获得的数据集;并生成扩充数据集的分类法。

著录项

相似文献

  • 专利
  • 外文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号