![]() |
Rim Thoughts And Ideas |
![]() |
||
|
|
Robots.Txt FileRobots.txt Can Improve Your ListingMost of you know what a robots.txt file is and what is does but just in case: A robots.txt file is used to tell search engines which files and which directories should not be indexed. However, a robots.txt file only works with search engines that looks for the file. Bots looking to gleam e-mail addresses or graphics will usually ignore the robots.txt file and it’s a waste of time to try excluding them. Note: Why Use A Robots.txt File? Also, if you have a famed site or are using SSI or some other type of include page feature for navigation, these navigation-only pages should also be excluded. While navigation-only pages can be indexed, they provide no real information for your visitors or about your site. By excluding poor content pages you will create a cleaner search engine listing and reduce the useless, uninformative pages that show up for your site. Not only will search engines appreciate this but knowledgeable web users will also. How To Exclude Pages: You should also exclude any custom error pages you create for you site, such as a custom 404 page. While these custom pages provide a more profession look than the standard browser or server fed page, they are not useful as part of your search engine listing. Where Not Use A Robots.txt file: A Robots.txt File Example: # File comments can go here Note: The "*" indicates the file applies to all engines. The # symbol indicates that line is a comment and will be ignored by search engine bots. In of the above example, three directories and one file are being excluded from all search engine bots. You can exclude all bots, allow all bots, or only allow selected bots - the choice is yours. If you wish to exclude (disallow) a particular robot spider: User-agent: grub Add the above two line to exclude any other robot but change the name after the User-agent: tag. Where To Place Your Robots.txt File: Helpful Links:
|
|
||
|
Copyright 2002 - 2008 By Rim Thoughts Site Owner |