Journal: ACM Transactions on Information Systems

Cross-Lingual Topic Discovery From Multilingual Search Engine Query Log



Abstract

Today, major commercial search engines operate multinationally, providing web search services to millions of users who compose search queries in different languages. Hence, the search engine query log, which serves as the backbone of many search engine applications, records millions of users' search histories in a wide spectrum of human languages and demonstrates a strong multilingual phenomenon. However, despite its salience, the multilingual nature of a search engine query log is usually ignored by existing works, which typically treat query log entries in different languages as orthogonal and independent. This oversimplified assumption heavily distorts the underlying structure of web search data. In this article, we pioneer the recognition of the multilingual nature of a query log and make the first attempt to cross the language barrier in query logs. We propose a novel model named
