首页> 外文期刊>Malaysian Journal of Computer Science >CRAWLING AJAX-BASED WEB APPLICATIONS: EVOLUTION AND STATE-OF-THE-ART
【24h】

CRAWLING AJAX-BASED WEB APPLICATIONS: EVOLUTION AND STATE-OF-THE-ART

机译:爬行基于AJAX的Web应用程序:演化和最新技术

获取原文
           

摘要

The innovation of AJAX resulted in more responsive, interactive and faster web applications due to the clever amalgamation of JavaScript, HTML, and Cascading Style Sheets (CSS). However, from the user perspective, this achievement places many challenges before web search engines. One major challenge is due to the complexities in crawling such web applications because multiple states are associated with one uniform resource locator (URL) that cause a mismatch with search model of web search engines, where a web document is uniquely identified by a single unique URL with a single state. Crawling AJAX-based web applications means giving strength and capability to web search engines so that information produced in these highly-interactive web applications is downloaded and indexed. The need here is to investigate the technicalities of AJAX that shatter the metaphor of a web page which the current web search engine utilize during crawling in order to improve the capabilities of web search engines. Although some academic tools have been developed, they produce some false positives which greatly affect the performance of web search engine. We aim to investigate AJAX and AJAX-based web applications as well as the state-of-the-art in crawling these applications along with some prominent issues, challenges and recommendations
机译:由于JavaScript,HTML和级联样式表(CSS)的巧妙融合,AJAX的创新导致了响应更快,交互性更强的Web应用程序。但是,从用户的角度来看,这一成就对Web搜索引擎提出了许多挑战。一个主要挑战是由于抓取此类Web应用程序的复杂性,因为多个状态与一个统一资源定位符(URL)相关联,这导致与Web搜索引擎的搜索模型不匹配,其中Web文档由单个唯一URL唯一标识具有单一状态。爬行基于AJAX的Web应用程序意味着赋予Web搜索引擎强大的功能,以便下载和索引这些高度交互的Web应用程序中生成的信息。这里需要研究AJAX的技术,该技术打破了网页的隐喻,当前的Web搜索引擎在抓取过程中利用了该隐喻,以提高Web搜索引擎的功能。尽管已经开发了一些学术工具,但它们会产生一些误报,从而极大地影响网络搜索引擎的性能。我们旨在调查AJAX和基于AJAX的Web应用程序,以及对这些应用程序进行爬网的最新技术以及一些突出的问题,挑战和建议

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号