Conference: International Workshop of the Initiative for the Evaluation of XML Retrieval

TopX 2.0 at the INEX 2008 Efficiency Track: A (Very) Fast Object-Store for Top-k-Style XML Full-Text Search

Abstract

For the INEX Efficiency Track 2008, we were just in time to finish and evaluate our brand-new TopX 2.0 prototype. Complementing our long-running effort on efficient top-k query processing on top of a relational back-end, we have now switched to a compressed object-oriented storage for text-centric XML data with direct access to customized inverted files, along with a complete reimplementation of the engine in C++. Our INEX 2008 experiments demonstrate efficiency gains of up to a factor of 30 compared to the previous Java/JDBC-based TopX 1.0 implementation over a relational back-end. TopX 2.0 achieves overall runtimes of less than 51 seconds for the entire batch of 568 Efficiency Track topics in their content-and-structure (CAS) version and less than 29 seconds for the content-only (CO) version, using a top-15, focused (i.e., non-overlapping) retrieval mode. This corresponds to an average of merely 89 ms per CAS query and 49 ms per CO query.
