
System and method for automated selection of best description from descriptions extracted from a plurality of data sources using numeric comparison and textual centrality measure


Abstract

Techniques are described for collecting descriptions of an entity from different data sources and using a numeric comparison and a textual centrality measure to automatically select a best description. In one implementation, a method includes: retrieving a real property description dataset, the dataset including descriptions of the real property from multiple data sources; extracting, from each of the descriptions, numbers that identify the property; performing a numerical comparison of the numbers extracted from each of the descriptions to determine whether any description needs to be discarded from further consideration; applying a text cleaning process to normalize the descriptions; performing a textual centrality measure of the remaining descriptions to determine a level of agreement of each remaining description with each of the other remaining descriptions; and using at least the textual centrality measure to select a description. The selected description may be used to populate a document.
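The steps listed in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the abstract does not say how numbers are compared or how centrality is computed, so the sketch assumes that a description is discarded when its set of extracted numbers differs from the most common set, and that centrality is the mean Jaccard token overlap with the other remaining descriptions. All function names and the sample descriptions are hypothetical.

```python
import re
from collections import Counter


def extract_numbers(description):
    """Return the set of digit runs found in a description."""
    return frozenset(re.findall(r"\d+", description))


def clean(description):
    """Normalize text: lowercase, drop punctuation, return the token set."""
    return frozenset(re.findall(r"[a-z0-9]+", description.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets (assumed similarity measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def select_description(descriptions):
    """Pick the description that agrees most with the others."""
    if not descriptions:
        raise ValueError("at least one description is required")

    # Numeric comparison (assumed rule): keep only descriptions whose
    # extracted numbers match the most common number set.
    number_sets = [extract_numbers(d) for d in descriptions]
    majority, _ = Counter(number_sets).most_common(1)[0]
    remaining = [d for d, n in zip(descriptions, number_sets) if n == majority]
    if len(remaining) == 1:
        return remaining[0]

    # Text cleaning, then centrality as mean similarity to the others.
    tokens = [clean(d) for d in remaining]

    def centrality(i):
        others = [jaccard(tokens[i], tokens[j])
                  for j in range(len(tokens)) if j != i]
        return sum(others) / len(others)

    best = max(range(len(remaining)), key=centrality)
    return remaining[best]


# Hypothetical sample data: the last entry has a transposed lot number.
descriptions = [
    "LOT 12, BLOCK 3, OAK HILLS TRACT 4501",
    "Lot 12 Block 3 of Oak Hills, Tract 4501",
    "Lot 12, Block 3, Oak Hills Tract No. 4501, County Records",
    "Lot 21, Block 3, Oak Hills Tract 4501",
]
print(select_description(descriptions))
```

In this sample, the fourth description is dropped by the numeric comparison because its lot number disagrees with the other three, and the first description is selected because its cleaned tokens overlap most with the other two that remain. A production system would likely need a more tolerant numeric rule and a richer similarity measure than the ones assumed here.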

Bibliographic data

  • Publication number: US10997403B1
  • Publication date: 2021-05-04
  • Original format: PDF
  • Applicant/assignee: FIRST AMERICAN FINANCIAL CORPORATION
  • Application number: US201816226370
  • Inventors: MARK FLEMING; YAN QIU
  • Filing date: 2018-12-19
  • Classification: G06K9; G06F17/18; G06K9/62; G06F40/205; G06F40/289
  • Country: US
  • Date indexed: 2022-08-24 18:31:46
