On Structure-based Web Data Extraction: The Model, Method and Application

俞方桦; 戴玮; 陈家训

Abstract

Web data extraction obtains valuable data from the vast information resources of the World Wide Web according to a predefined pattern, processing and classifying the data found on the Web. This paper presents a formalization of the Web data extraction procedure, together with a description of the crawling and extraction algorithm. Based on this formalization, it introduces TIDL, an XML-based page structure description language, covering its object model, its HTML object reference model, and its tag definitions. Finally, it describes the Web Integration Services Kit (WISK), a Web data gathering and querying application built on Internet agent technology.

Bibliographic record

Source
《东华大学学报：英文版》 (Journal of Donghua University, English Edition) | 2000, Issue 4 | pp. 103-106 | 4 pages
Authors
俞方桦; 戴玮; 陈家训
Original format: PDF
Text language: chi
Chinese Library Classification: TP393.09
Keywords
World Wide Web; Web mining; data extraction; HTML; XML

