首页>
外国专利>
BUILDING OF A WEB CORPUS WITH THE HELP OF A REFERENCE WEB CRAWL
BUILDING OF A WEB CORPUS WITH THE HELP OF A REFERENCE WEB CRAWL
展开▼
机译:建立具有参考网络草稿的网络公司
展开▼
页面导航
摘要
著录项
相似文献
摘要
Computer-implemented method for building a web corpus (WCD) comprising thestepsof:- sending by a web crawler (WC) a query to a reference web crawl agent(RWCA), thisquery containing a least one identifier of a resource,- receiving by the web crawler (WC) a response from the reference web crawlagent(RWCA);- if this response does not contain the resource identified by theidentifier, downloadingby the web crawler (WC) the resource from the website (WS) corresponding totheidentifier and adding the resource to the web corpus (WCD; and- if this response contains the resource identified by the identifier,adding the resourceto the web corpus (WCD).
展开▼