首页> 外文期刊>International journal of web services research >Service Class Driven Dynamic Data Source Discovery with DynaBot
【24h】

Service Class Driven Dynamic Data Source Discovery with DynaBot

机译:用DynaBot服务等级驱动的动态数据源发现

获取原文
获取原文并翻译 | 示例
           

摘要

Dynamic Web data sources on the Deep Web provide intuitive access to real-time information and large data repositories anywhere that Web access is available. Although recent studies suggest that the dynamic Web is larger and growing faster than static Web, dynamic content is often ignored by existing search engine indexers owing to technical challenges inherent in searching dynamic sources. To address these challenges, we present DynaBot, a service-centric crawler for discovering and clustering Deep Web sources. DynaBot has three unique characteristics. First, DynaBot utilizes a service class model implemented through the construction of service class descriptions (SCDs). Second, DynaBot employs a modular architecture for focused crawling of the Deep Web. Third, DynaBot incorporates algorithms for efficiently probing, discovering, and clustering Deep Web sources through SCD-based service analysis. Experimental results demonstrate DynaBot's effectiveness and suggest techniques for efficiently managing service discovery given the immense scale of the Deep Web.
机译:Deep Web上的动态Web数据源可提供对Web可用访问的实时信息和大数据存储库的直观访问。尽管最近的研究表明动态Web比静态Web更大并且增长速度更快,但是由于搜索动态资源固有的技术挑战,动态内容经常被现有的搜索引擎索引器忽略。为了解决这些挑战,我们介绍了DynaBot,这是一个以服务为中心的搜寻器,用于发现和群集Deep Web源。 DynaBot具有三个独特的特征。首先,DynaBot利用通过构建服务类描述(SCD)实现的服务类模型。其次,DynaBot采用模块化架构来集中抓取Deep Web。第三,DynaBot结合了算法,可通过基于SCD的服务分析有效地探测,发现和群集深层Web源。实验结果证明了DynaBot的有效性,并提出了在Deep Web规模巨大的情况下有效管理服务发现的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号