Service Class Driven Dynamic Data Source Discovery with DynaBot

Daniel Rocco; James Caverlee; Ling Liu; Terence Critchlow

首页> 外文期刊>International journal of web services research >Service Class Driven Dynamic Data Source Discovery with DynaBot

【24h】

Service Class Driven Dynamic Data Source Discovery with DynaBot

机译：用DynaBot服务等级驱动的动态数据源发现

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Dynamic Web data sources on the Deep Web provide intuitive access to real-time information and large data repositories anywhere that Web access is available. Although recent studies suggest that the dynamic Web is larger and growing faster than static Web, dynamic content is often ignored by existing search engine indexers owing to technical challenges inherent in searching dynamic sources. To address these challenges, we present DynaBot, a service-centric crawler for discovering and clustering Deep Web sources. DynaBot has three unique characteristics. First, DynaBot utilizes a service class model implemented through the construction of service class descriptions (SCDs). Second, DynaBot employs a modular architecture for focused crawling of the Deep Web. Third, DynaBot incorporates algorithms for efficiently probing, discovering, and clustering Deep Web sources through SCD-based service analysis. Experimental results demonstrate DynaBot's effectiveness and suggest techniques for efficiently managing service discovery given the immense scale of the Deep Web.

机译：Deep Web上的动态Web数据源可提供对Web可用访问的实时信息和大数据存储库的直观访问。尽管最近的研究表明动态Web比静态Web更大并且增长速度更快，但是由于搜索动态资源固有的技术挑战，动态内容经常被现有的搜索引擎索引器忽略。为了解决这些挑战，我们介绍了DynaBot，这是一个以服务为中心的搜寻器，用于发现和群集Deep Web源。 DynaBot具有三个独特的特征。首先，DynaBot利用通过构建服务类描述（SCD）实现的服务类模型。其次，DynaBot采用模块化架构来集中抓取Deep Web。第三，DynaBot结合了算法，可通过基于SCD的服务分析有效地探测，发现和群集深层Web源。实验结果证明了DynaBot的有效性，并提出了在Deep Web规模巨大的情况下有效管理服务发现的技术。

著录项

来源
《International journal of web services research》 |2007年第3期|26-48|共23页
作者
Daniel Rocco; James Caverlee; Ling Liu; Terence Critchlow;
展开▼
作者单位

University of West Georgia, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
deep web; dynamic web data; service discovery; web crawling;

机译：深网;动态网络数据;服务发现;网络搜寻;

相似文献

外文文献
中文文献
专利

1. Data Integration: Data-driven Discovery from Diverse Data Sources [J] . Genevera Allen Genetic epidemiology. . 2019,第7期

机译：数据集成：来自不同数据源的数据驱动的发现
2. Intergrating academic library services directly into classroom intruction through discovery tools: Bringing library resources into the online classroom [J] . Frierson E., Virtue A. Computers in Libraries . 2013,第7期

机译：通过发现工具将高校图书馆服务直接集成到课堂教学中：将图书馆资源引入在线课堂
3. Linked Open Data for Context-aware Services: Analysis, Classification and Context Data Discovery [J] . Moritz von Hoffen, Abdulbaki Uzun International journal of semantic computing . 2014,第4期

机译：用于上下文感知服务的链接开放数据：分析，分类和上下文数据发现
4. Semantic Knowledge Discovery and Data-Driven Logical Reasoning from Heterogeneous Data Sources [C] . Claudia dAmato, Volha Bryl, Luciano Serafini Uncertainty reasoning for the semantic web III . 2011

机译：异构数据源的语义知识发现和数据驱动的逻辑推理
5. Developing a framework to support data exchange from heterogeneous data sources via Industry Foundation Classes (IFC) and web services. [D] . Danso-Amoako, Mark Owusu. 2006

机译：开发一个框架，以支持通过行业基础类（IFC）和Web服务从异构数据源进行数据交换。
6. TBIO-27. GABRIELLA MILLER KIDS FIRST DATA RESOURCE CENTER ADVANCING GENETIC RESEARCH IN CHILDHOOD CANCER AND STRUCTURAL BIRTH DEFECTS THROUGH LARGE SCALE INTEGRATED DATA-DRIVEN DISCOVERY AND CLOUD-BASED PLATFORMS FOR COLLABORATIVE ANALYSIS [O] . Allison P Heath, Pichai Raman, Yuankun Zhu, 2018

机译：TBIO-27。 GABRIELLA MILLER率先通过大规模集成数据驱动的发现和基于云的平台进行儿童癌症和结构出生缺陷的遗传学研究
7. Basic technologies of web services framework for research, discovery, and processing the disparate massive Earth observation data from heterogeneous sources [O] . V. Savorskiy, E. Lupyan, I. Balashov, 2014

机译：用于研究，发现和处理来自异构来源的不同大规模地球观测数据的Web服务框架的基本技术

Service Class Driven Dynamic Data Source Discovery with DynaBot

摘要

著录项

相似文献

相关主题

期刊订阅