首页>
外国专利>
METHODS AND SYSTEMS FOR IDENTIFYING A LEVEL OF SIMILARITY BETWEEN A FILTERING CRITERION AND A DATA ITEM WITHIN A SET OF STREAMED DOCUMENTS
METHODS AND SYSTEMS FOR IDENTIFYING A LEVEL OF SIMILARITY BETWEEN A FILTERING CRITERION AND A DATA ITEM WITHIN A SET OF STREAMED DOCUMENTS
展开▼
机译:在一组流化文档中识别过滤条件和数据项之间相似度的方法和系统
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method enables identification of a similarity level between a user-provided data item and a data item within a set of data documents. The method includes a representation generator determining, for each term in an enumeration of terms, occurrence information. The representation generator generates, for each term, a sparse distributed representation (SDR) using the occurrence information. The method includes receiving, by a filtering module, a filtering criterion including at least one of a security-based term or a brand-based term. The method includes generating, by the representation generator, for the filtering criterion, at least one SDR. The method includes generating, by the representation generator, for a first of a plurality of streamed documents received from a data source, a compound SDR. The method includes determining, by a similarity engine, a distance between the filtering criterion SDR and the compound SDR. The method includes acting on the document, based upon the distance. FIG. 15
展开▼