首页> 外文期刊>Information Processing & Management >A framework for understanding Latent Semantic Indexing (LSI) performance
【24h】

A framework for understanding Latent Semantic Indexing (LSI) performance

机译:理解潜在语义索引(LSI)性能的框架

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper we present a theoretical model for understanding the performance of Latent Semantic Indexing (LSI) search and retrieval application. Many models for understanding LSI have been proposed. Ours is the first to study the values produced by LSI in the term by dimension vectors. The framework presented here is based on term co-occurrence data. We show a strong correlation between second-order term co-occurrence and the values produced by the Singular Value Decomposition (SVD) algorithm that forms the foundation for LSI. We also present a mathematical proof that the SVD algorithm encapsulates term co-occurrence information.
机译:在本文中,我们提出了一种理论模型,用于理解潜在语义索引(LSI)搜索和检索应用程序的性能。已经提出了许多用于理解LSI的模型。我们是第一个研究维度向量中LSI产生的值的人。此处介绍的框架基于术语共现数据。我们显示了二阶项共现与奇异值分解(SVD)算法产生的值之间的密切相关,奇异值分解(SVD)算法构成了LSI的基础。我们还提供了SVD算法封装术语共现信息的数学证明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号