首页> 外国专利> Method and apparatus for electronically extracting application specific multidimensional information from a library of searchable documents and for providing the application specific information to a user application

Method and apparatus for electronically extracting application specific multidimensional information from a library of searchable documents and for providing the application specific information to a user application

机译:用于从可搜索文档库中电子提取特定于应用程序的多维信息并向用户应用程序提供特定于应用程序的信息的方法和装置

摘要

An apparatus and method are disclosed for electronically extracting application specific multidimensional information from a library of electronically searchable documents, wherein at least one dimension of the information is a category, which may comprise an automatic document miner in communication with the contents of the library and adapted to electronically extract relevant documents from the library; an E-Space filter creator adapted to create from the extracted relevant documents a category specific representation of the extracted relevant documents comprising the E-Space filter; a document selector adapted to utilize the E-Space filter to separate the extracted relevant documents into member documents and non-member documents and to discard the non-member documents; and an application specific multidimensional information extractor adapted to extract occurrences of application specific multidimensional information from the member documents. The apparatus and method may also comprise an application specific multidimensional information verification unit adapted to verify the extraction of application specific multidimensional information from the member documents, and a database storing the application specific multidimensional information adapted to provide an application running on a user computing device access to the application specific multidimensional information. The automatic document miner may comprise at least one seeded network search agent. The E-Space filter creator may comprise a concept definer adapted to create a concept of the application specific multidimensional information and may utilize a latent index sequencer. The application specific word extractor may comprise a concept based key-word extractor.
机译:公开了一种用于从电子可搜索文档的库中电子提取应用特定的多维信息的装置和方法,其中信息的至少一个维是类别,其可以包括与该库的内容通信并经过修改的自动文档挖掘器从图书馆以电子方式提取相关文件; E-Space过滤器创建器,其适于从提取的相关文档中创建包括E-Space过滤器的提取的相关文档的类别特定表示;一个文档选择器,用于利用E-Space过滤器将提取出的相关文档分为成员文档和非成员文档,并丢弃非成员文档;专用多维信息提取器,其适于从成员文档中提取专用多维信息的出现。该设备和方法还可以包括:专用多维信息验证单元,其适于验证从成员文档中提取专用多维信息;以及数据库,其存储该专用多维信息,以提供运行在用户计算设备访问上的应用程序。特定于应用程序的多维信息。自动文档挖掘器可以包括至少一个种子网络搜索代理。 E-空间过滤器创建器可以包括概念定义器,该概念定义器适于创建应用专用的多维信息的概念,并且可以利用潜在索引定序器。专用词提取器可以包括基于概念的关键词提取器。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号