首页>
外国专利>
System and method for automated selection of best description from descriptions extracted from a plurality of data sources using numeric comparison and textual centrality measure
System and method for automated selection of best description from descriptions extracted from a plurality of data sources using numeric comparison and textual centrality measure
展开▼
机译:从使用数字比较和文本中心度测量从多个数据源提取的描述中自动选择最佳描述的系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
Techniques are described for collecting descriptions of an entity from different data sources and using a numeric comparison and textual centrality measure to automatically select a best description. In one implementation, a method includes: retrieving a real property description dataset, the real property description dataset including descriptions from multiple data sources that describe the real property; extracting, from each of the descriptions, numbers that identify the property; performing a numerical comparison of the numbers extracted from each of the descriptions to determine if any descriptions needs to be discarded from further consideration; applying a text cleaning process to normalize the descriptions; and performing a textual centrality measure of remaining descriptions to determine a level agreement of each of the remaining descriptions with each of the other remaining descriptions; and using at least the textual centrality measure to select a description. The selected description may be used to populate a document.
展开▼