
A process for the thematic classification of documents, a module for thematic classification, and a search engine containing such a module


Abstract

A method of thematically classifying documents, in particular for building or updating thematic databases (42) for a search engine, comprises the following steps: selecting documents representative of each theme; identifying, within the selected documents, elements that are characteristic of each theme; allocating a coefficient (R) to each identified element, the coefficient representing the relevance of that element to the corresponding theme; and, for each document (50) to be classified, identifying the characteristic elements it contains and, for each theme to which those elements correspond, using their coefficients to calculate a value representing the relevance of the theme to the document (50), in order to decide whether the document relates to that theme.
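The steps in the abstract can be illustrated with a minimal Python sketch. The abstract does not say what an "element" is, how the coefficient R is computed, or how the final decision is made, so everything concrete here is an assumption made for illustration: elements are taken to be lowercase words, R is the share of a word's occurrences that fall in a theme's representative documents, the relevance value is the sum of matching coefficients divided by document length, and the decision is a fixed threshold. All function names and parameters are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    # Assumption: an "element" is a lowercase alphabetic word.
    return re.findall(r"[a-z]+", text.lower())


def build_theme_coefficients(representative_docs, min_share=0.5):
    """Map each theme to {element: R} from its representative documents.

    Assumption: R is the fraction of an element's occurrences that fall
    in this theme; elements at or below `min_share` are not treated as
    characteristic of the theme.
    """
    counts = {
        theme: Counter(tok for doc in docs for tok in tokenize(doc))
        for theme, docs in representative_docs.items()
    }
    totals = Counter()
    for theme_counts in counts.values():
        totals.update(theme_counts)

    coefficients = {}
    for theme, theme_counts in counts.items():
        coefficients[theme] = {
            tok: n / totals[tok]
            for tok, n in theme_counts.items()
            if n / totals[tok] > min_share
        }
    return coefficients


def theme_relevance(document, coefficients):
    """Return {theme: relevance value} for one document to classify."""
    tokens = tokenize(document)
    if not tokens:
        return {theme: 0.0 for theme in coefficients}
    # Assumption: relevance is the length-normalised sum of coefficients.
    return {
        theme: sum(coefs.get(tok, 0.0) for tok in tokens) / len(tokens)
        for theme, coefs in coefficients.items()
    }


def classify(document, coefficients, threshold=0.3):
    """Return the themes whose relevance value reaches the threshold."""
    scores = theme_relevance(document, coefficients)
    return [theme for theme, score in scores.items() if score >= threshold]


if __name__ == "__main__":
    representative = {
        "sports": ["football match goal team", "team wins match"],
        "finance": ["stock market bank", "bank raises interest rates market"],
    }
    coefs = build_theme_coefficients(representative)
    print(classify("the team scored a goal in the match", coefs))
```

A document may be assigned to several themes or to none, which matches the abstract's per-theme decision. In practice the threshold and the weighting scheme would need tuning against real data.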
