A Model-Based Approach for Crawling Rich Internet Applications

MUSTAFA EMRE DINCTURK; GUY-VINCENT JOURDAN; GREGOR V. BOCHMANN; IOSIF VIOREL ONUT

Abstract

New Web technologies, like AJAX, result in more responsive and interactive Web applications, sometimes called Rich Internet Applications (RIAs). Crawling techniques developed for traditional Web applications are not sufficient for crawling RIAs. The inability to crawl RIAs is a problem that needs to be addressed for at least making RIAs searchable and testable. We present a new methodology, called "model-based crawling", that can be used as a basis to design efficient crawling strategies for RIAs. We illustrate model-based crawling with a sample strategy, called the "hypercube strategy". The performances of our model-based crawling strategies are compared against existing standard crawling strategies, including breadth-first, depth-first, and a greedy strategy. Experimental results show that our model-based crawling approach is significantly more efficient than these standard strategies.

Bibliographic details

Source
ACM Transactions on the Web, 2014, no. 3, pp. 19.1-19.39 (39 pages)
Authors
MUSTAFA EMRE DINCTURK; GUY-VINCENT JOURDAN; GREGOR V. BOCHMANN; IOSIF VIOREL ONUT
Author affiliations

EECS, University of Ottawa, 800 King Edward Avenue, Ottawa, ON, K1N 6N5, Canada;

EECS, University of Ottawa, 800 King Edward Avenue, Ottawa, ON, K1N 6N5, Canada;

EECS, University of Ottawa, 800 King Edward Avenue, Ottawa, ON, K1N 6N5, Canada;

Research and Development, IBM Security AppScan Enterprise, IBM, 770 Palladium Drive, Ottawa, ON, K2V 1C8, Canada;

Original format: PDF
Language: English
Keywords
Crawling; rich Internet applications; AJAX; modeling; dynamic analysis; DOM;

Similar literature

1. MODEL-BASED RICH INTERNET APPLICATIONS CRAWLING: 'MENU' AND 'PROBABILITY' MODELS [J]. Suryakant Choudhary, Emre Dincturk, Seyed Mirtaheri. Journal of Web Engineering, 2014, no. 3&4.
2. Model-based approach for semantic-driven deployment of containerized applications to support future internet services and architectures [J]. Nenad Petrović. Serbian Journal of Electrical Engineering, 2019, no. 1.
3. A Statistical Approach for Efficient Crawling of Rich Internet Applications [C]. Mustafa Emre Dincturk, Suryakant Choudhary, Gregor von Bochmann. International Conference on Web Engineering, 2012.
4. Model-based Crawling - An Approach to Design Efficient Crawling Strategies for Rich Internet Applications [D]. Mustafa Emre Dincturk. 2013.
5. Model-based approach for predicting the impact of genetic modifications on product yield in biopharmaceutical manufacturing: Application to influenza vaccine production [O]. Stefanie Duvigneau, Robert Dürr, Tanja Laske. 2020.
6. Indexing Rich Internet Applications Using Components-Based Crawling [O]. Ali Moosavi, Salman Hooshmand, Sara Baghbanzadeh. 2014.
