
What is a WWW robot?


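I haven't written anything for several days and I'm bored, so here is a reposted English reference. I couldn't be bothered to translate it, so make do with the original.

What is a robot? The original text follows.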

What is a WWW robot?
A robot is a program that automatically traverses the Web's hypertext structure by retrieving a document, and recursively retrieving all documents that are referenced.
Note that "recursive" here doesn't limit the definition to any specific traversal algorithm; even if a robot applies some heuristic to the selection and order of documents to visit and spaces out requests over a long period of time, it is still a robot.
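
To make the definition concrete, here is a minimal sketch of such a traversal in Python. It is only an illustration, not how any particular robot is written: the names crawl, LinkExtractor and max_documents are made up for this example, and fetch is a hypothetical callable that maps a URL to its HTML text (or None on failure). It uses a breadth-first queue, which is just one of the many traversal orders the definition allows.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkExtractor(HTMLParser):
    """Collects the href targets of <a> tags in one document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch, max_documents=100):
    """Retrieve start_url, then every document reachable from it.

    fetch is a hypothetical callable (an assumption of this sketch):
    it takes a URL and returns the HTML text, or None on failure.
    """
    seen = {start_url}            # URLs already queued, to avoid loops
    queue = deque([start_url])    # FIFO queue gives breadth-first order
    visited = []                  # documents actually retrieved, in order
    while queue and len(visited) < max_documents:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        visited.append(url)
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop "#fragment" parts.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return visited
```

Passing the fetch function in as a parameter keeps the traversal logic separate from the network code, so the same loop can be tried against a small in-memory dictionary of pages before it is pointed at a real site.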

Normal Web browsers are not robots, because they are operated by a human, and don't automatically retrieve referenced documents (other than inline images).

Web robots are sometimes referred to as Web Wanderers, Web Crawlers, or Spiders. These names are a bit misleading, as they give the impression that the software itself moves between sites like a virus. This is not the case: a robot simply visits sites by requesting documents from them.
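
In other words, a "visit" is nothing more than an ordinary HTTP request sent from the machine the robot runs on. A rough sketch using Python's standard urllib.request module is below; the names build_request, fetch and the "ExampleRobot/0.1" User-Agent string are hypothetical, chosen only for this illustration.

```python
import urllib.request

# Hypothetical robot name, used only in this sketch.
USER_AGENT = "ExampleRobot/0.1"


def build_request(url):
    """Build a plain GET request that identifies the robot by name."""
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})


def fetch(url, timeout=10):
    """Request one document and return its raw bytes.

    The robot itself never leaves its own machine; only this request
    and the server's response travel over the network.
    """
    with urllib.request.urlopen(build_request(url), timeout=timeout) as response:
        return response.read()
```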

Copyright 2005-2008 Wuhuifeng.Com. All Rights Reserved. 京ICP备05006557号