Webmaster/SEO News


Users Viewing This Thread
 1 Guest(s)
amatorbalikci.net

Post Reply  Post Thread 
Robots.txt file. A handy tool.
Author Message
seoguru.bbrij
Senior Member
****


Posts: 500
Group: Registered
Joined: Sep 2007
Status: Offline
Reputation: 0

Country:
Post: #1
Robots.txt file. A handy tool.

A robot is a program that automatically traverses the Web's hypertext structure by retrieving a document, and recursively retrieving all documents that are referenced.

Note that "recursive" here doesn't limit the definition to any specific traversal algorithm; even if a robot applies some heuristic to the selection and order of documents to visit and spaces out requests over a long space of time, it is still a robot.

Normal Web browsers are not robots, because they are operated by a human, and don't automatically retrieve referenced documents (other than inline images).


Robot Exclusion :

The robots exclusion standard, also known as the Robots Exclusion Protocol or robots.txt protocol is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a website which is, otherwise, publicly viewable.

Robots are often used by search engines to categorize and archive web sites, or by webmasters to proofread source code. The standard complements Sitemaps, a robot inclusion standard for websites.

09-09-2007 10:18 AM
Find all posts by this user Quote this message in a reply
Post Reply  Post Thread 

View a Printable Version
Send this Thread to a Friend
Subscribe to this Thread | Add Thread to Favorites

Forum Jump: