首页> 外国专利> Text search program using a non-search keyword dictionary for the search keyword dictionary, server and methods

Text search program using a non-search keyword dictionary for the search keyword dictionary, server and methods

机译:使用非搜索关键字字典作为搜索关键字字典的文本搜索程序,服务器和方法

摘要

PPROBLEM TO BE SOLVED: To provide a text classification program, server and method for preventing text information that is not illegal or harmful from being classified into an illegal/harmful category when keywords registered in advance are used for the classification. PSOLUTION: Multiple legitimate learned text information which do not belong to a specific category, and multiple illegitimate learned text information which belong to the specific category are stored in a learned text storage means. Learned text information including a search keyword is searched. Modification keywords for the search keyword are extracted. A legitimacy rate of each modification keyword is calculated as the number of pieces of legitimate learned text information to the number of all the pieces of learned text information. The modification keywords whose legitimacy rates are not lower than a predetermined threshold are registered as non-search keywords to generate a non-search keyword dictionary. The text information including the non-search keyword as the modification keyword for the search keyword is prevented from being searched. PCOPYRIGHT: (C)2011,JPO&INPIT
机译:

要解决的问题:提供一种文本分类程序,服务器和方法,用于防止在使用预先注册的关键字进行分类时将非非法或有害文本信息分类为非法/有害类别。

解决方案:多个不属于特定类别的合法学习文本信息和多个属于特定类别的非法学习文本信息被存储在学习文本存储装置中。搜索包括搜索关键字的学习到的文本信息。提取用于搜索关键字的修改关键字。将每个修改关键词的合法性率计算为合法学习文本信息的数量相对于所有学习文本信息的数量。将合法性不低于预定阈值的修改关键字注册为非搜索关键字,以生成非搜索关键字词典。防止搜索包括非搜索关键字作为搜索关键字的修改关键字的文本信息。

版权:(C)2011,日本特许厅&INPIT

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号