首页>
外国专利>
BUILDING OF A WEB CORPUS WITH THE HELP OF A REFERENCE WEB CRAWL
BUILDING OF A WEB CORPUS WITH THE HELP OF A REFERENCE WEB CRAWL
展开▼
机译:建立具有参考网络草稿的网络公司
展开▼
页面导航
摘要
著录项
相似文献
摘要
PURPOSE: A construction of web corpus by the help of a reference web crawl is provided to prevent the delay of a resource extraction by using downloaded resources usable at the web crawl instead of an index program to download resources in a web. CONSTITUTION: A web crawler (WC) transmits a query to a reference web crawl agent (RWCA) and the query includes the identifier of a resource. The web crawler receives a response from the web crawl agent. If the response does not include a resource identified by the identifier, the web crawler downloads the resource from a web site corresponding to the identifier for adding the resource to a web corpus (WCD). If the response includes the resource identified by the identifier, the resource is added to the web corpus.
展开▼