首页> 外文学位 >Model-based Crawling - An Approach to Design Efficient Crawling Strategies for Rich Internet Applications.

【24h】

Model-based Crawling - An Approach to Design Efficient Crawling Strategies for Rich Internet Applications.

机译：基于模型的爬网-一种为富Internet应用程序设计有效的爬网策略的方法。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Rich Internet Applications (RIAs) are a new generation of web applications that break away from the concepts on which traditional web applications are based. RIAs are more interactive and responsive than traditional web applications since RIAs allow client-side scripting (such as JavaScript) and asynchronous communication with the server (using AJAX). Although these are improvements in terms of user-friendliness, there is a big impact on our ability to automatically explore (crawl) these applications. Traditional crawling algorithms are not sufficient for crawling RIAs. We should be able to crawl RIAs in order to be able to search their content and build their models for various purposes such as reverse-engineering, detecting security vulnerabilities, assessing usability, and applying model-based testing techniques. One important problem is designing efficient crawling strategies for RIAs. It seems possible to design crawling strategies more efficient than the standard crawling strategies, the Breadth-First and the Depth-First. In this thesis, we explore the possibilities of designing efficient crawling strategies. We use a general approach that we called Model-based Crawling and present two crawling strategies that are designed using this approach. We show by experimental results that model-based crawling strategies are more efficient than the standard strategies.

机译：富Internet应用程序（RIA）是新一代的Web应用程序，它摆脱了传统Web应用程序所基于的概念。与传统的Web应用程序相比，RIA更具交互性和响应能力，因为RIA允许客户端脚本（例如JavaScript）和与服务器的异步通信（使用AJAX）。尽管这些都是用户友好性方面的改进，但是对我们自动浏览（爬网）这些应用程序的能力有很大影响。传统的爬网算法不足以对RIA进行爬网。我们应该能够抓取RIA，以便能够出于各种目的（例如反向工程，检测安全漏洞，评估可用性以及应用基于模型的测试技术）搜索其内容并构建其模型。一个重要的问题是为RIA设计有效的爬网策略。设计爬网策略似乎比标准爬网策略“广度优先”和“深度优先”更有效。在本文中，我们探索了设计有效爬网策略的可能性。我们使用一种称为基于模型的爬网的通用方法，并介绍使用这种方法设计的两种爬网策略。我们通过实验结果表明，基于模型的爬网策略比标准策略更有效。

著录项

作者
Dincturk, Mustafa Emre.;
展开▼
作者单位

University of Ottawa (Canada).;

展开▼
授予单位 University of Ottawa (Canada).;
学科 Computer Science.
学位 Ph.D.
年度 2013
页码 164 p.
总页数 164
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A Model-Based Approach for Crawling Rich Internet Applications [J] . MUSTAFA EMRE DINCTURK, GUY-VINCENT JOURDAN, GREGOR V. BOCHMANN, ACM transactions on the web . 2014,第3期

机译：基于模型的爬网富Internet应用程序
2. MODEL-BASED RICH INTERNET APPLICATIONS CRAWLING: 'MENU' AND 'PROBABILITY' MODELS [J] . SURYAKANT CHOUDHARY, EMRE DINCTURK, SEYED MIRTAHERI, Journal of web engineering . 2014,第3a4期

机译：基于模型的富互联网应用程序抓取：“菜单”和“概率”模型
3. Research on crawling mechanism and policy for crawling product information from mobile internet [J] . Shu Wang, Jia Chen, Chonghuan Xu International journal of computing science and mathematics . 2017,第6期

机译：从移动互联网爬网产品信息的爬网机制和策略研究
4. A Strategy for Efficient Crawling of Rich Internet Applications [C] . Kamara Benjamin, Gregor von Bochmann, Mustafa Emre Dincturk, Web engineering . 2011

机译：有效爬网丰富Internet应用程序的策略
5. The Design and Development of uBranch Bot, an Untethered, Branch-Crawling, Caterpillar-Inspired, Soft Robot [D] . Rozen-Levy, Shane. 2019

机译：uBranch Bot的设计和开发，这是一种不受束缚，分支爬行，毛毛虫启发的软机器人
6. An Efficient Approach for Web Indexing of Big Data through Hyperlinks in Web Crawling [O] . R. Suganya Devi, D. Manjula, R. K. Siddharth 2015

机译：通过Web爬网中的超链接对大数据进行Web索引的一种有效方法
7. Indexing Rich Internet Applications Using Components-Based Crawling [O] . Ali Moosavi, Salman Hooshmand, Sara Baghbanzadeh, 2014

机译：使用基于组件的爬网索引富Internet应用程序
8. Evolutionary Design and Simulation of a Tube Crawling Inspection Robot [R] . Craft, M. , Howsman, T. , ONeil, D. 2002

机译：管道爬行检测机器人的进化设计与仿真

Model-based Crawling - An Approach to Design Efficient Crawling Strategies for Rich Internet Applications.

摘要

著录项

相似文献

相关主题

期刊订阅