shinebeach.com

Robocops

 

Author: Philip Nicosia

The robots.txt protocol, also called the robots exclusion standard, is designed to keep web spiders out of part of a website. It is often used as a security or privacy measure, but it is really the equivalent of hanging a Keep Out sign on your door.

This protocol is used by website administrators when there are sections or files that they would rather not have accessed by the rest of the world. These could include employee lists or files that are being circulated internally. For example, the White House website has used robots.txt to block any inquiries on speeches by the Vice President, a photo essay of the First Lady, and profiles of the 9/11 victims.

How does the protocol work? The administrator lists the files that shouldn't be scanned in a plain text file named robots.txt and places that file in the top-level directory of the website. The robots.txt protocol was created by consensus in June 1994 by members of the robots mailing list (robots-request@nexor.co.uk). For most of its history there was no official standards body or RFC for the protocol (it was only written up as RFC 9309 in 2022), and it is still difficult to legislate or mandate that the protocol be followed. In fact, the file is treated as strictly advisory, and offers no guarantee that the listed contents won't be read.
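As a rough illustration of the mechanism described above, the sketch below feeds a small robots.txt file to Python's standard `urllib.robotparser` module and asks whether a crawler may fetch a few pages. The file contents, the `ExampleBot` user agent, and the `www.example.com` URLs are all made up for the example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, as it might sit at http://www.example.com/robots.txt
ROBOTS_TXT = """\
User-agent: *
Disallow: /internal/
Disallow: /staff-list.html
"""

def build_parser(robots_text):
    """Parse robots.txt text into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser

parser = build_parser(ROBOTS_TXT)

# A cooperative crawler asks before it fetches each URL.
print(parser.can_fetch("ExampleBot", "http://www.example.com/index.html"))         # True
print(parser.can_fetch("ExampleBot", "http://www.example.com/internal/memo.txt"))  # False
print(parser.can_fetch("ExampleBot", "http://www.example.com/staff-list.html"))    # False
```

Nothing in this check is enforced by the web server. The crawler itself has to choose to run it and to respect the answer.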

In effect, robots.txt requires cooperation by the web spider and even the reader, since anything that is uploaded to the internet becomes publicly available. You aren't locking them out of those pages; you are only asking them to stay away, and it takes very little for them to ignore these instructions. Computer hackers can also easily reach the files and retrieve information, and because robots.txt is itself publicly readable, it even shows them where to look. So the rule of thumb is: if it's that sensitive, it shouldn't be on your website to begin with.
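To make the point concrete, here is a minimal sketch of how little effort it takes to turn a robots.txt file into a list of the paths its owner would rather keep quiet. The helper name `disallowed_paths` and the sample file are invented for illustration, and the parsing is deliberately simplistic.

```python
def disallowed_paths(robots_text):
    """Collect every non-empty Disallow path found in robots.txt text."""
    paths = []
    for raw_line in robots_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        if line.lower().startswith("disallow:"):
            path = line.split(":", 1)[1].strip()
            if path:  # an empty Disallow blocks nothing
                paths.append(path)
    return paths

# Hypothetical file contents
SAMPLE = """\
User-agent: *
Disallow: /internal/   # staff only
Disallow:
Disallow: /drafts/
"""

print(disallowed_paths(SAMPLE))  # ['/internal/', '/drafts/']
```

Anyone can request the file and run something like this, which is why robots.txt should never be the only thing standing between the public and sensitive material.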

Care should be taken, however, to ensure that robots.txt doesn't block search engine robots from other areas of the website. Doing so will dramatically affect your search engine ranking, as the search engines rely on their robots to count the keywords, review meta tags, titles and crossheads, and even register the hyperlinks.
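One common way to block far more than intended is a stray `/` on a Disallow line. The sketch below, which uses Python's standard `urllib.robotparser` with a made-up `ExampleBot` agent and `www.example.com` host, contrasts a rule that shuts robots out of the whole site with an empty rule that blocks nothing. The helper `blocks_site_root` is a hypothetical name, and it only tests the root URL as a quick proxy.

```python
from urllib.robotparser import RobotFileParser

def blocks_site_root(robots_text, agent="ExampleBot"):
    """Return True if the rules stop `agent` from fetching the site root."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return not parser.can_fetch(agent, "http://www.example.com/")

# Every path starts with "/", so this rule matches the entire site.
BLOCK_ALL = "User-agent: *\nDisallow: /"

# An empty Disallow value means nothing is off limits.
ALLOW_ALL = "User-agent: *\nDisallow:"

print(blocks_site_root(BLOCK_ALL))  # True
print(blocks_site_root(ALLOW_ALL))  # False
```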

One misplaced character can have catastrophic effects. For example, robots.txt patterns are matched by simple prefix comparison, so care should be taken to make sure that patterns matching directories have the final '/' character appended; otherwise all files with names starting with that string will match, rather than just those in the intended directory.
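The trailing-slash pitfall can be reproduced with Python's standard `urllib.robotparser`, which compares paths by prefix. In this sketch the `/private` directory, the page names, the `ExampleBot` agent, and the helper `is_allowed` are all assumptions made for the demonstration.

```python
from urllib.robotparser import RobotFileParser

def is_allowed(rule_path, url_path, agent="ExampleBot"):
    """Check one URL path against a single 'Disallow: <rule_path>' rule."""
    parser = RobotFileParser()
    parser.parse(["User-agent: *", f"Disallow: {rule_path}"])
    return parser.can_fetch(agent, "http://www.example.com" + url_path)

# Without the trailing slash, anything that merely starts with "/private" is blocked.
print(is_allowed("/private", "/private/report.html"))   # False (intended)
print(is_allowed("/private", "/private-offers.html"))   # False (unintended)

# With the trailing slash, only the directory's contents are blocked.
print(is_allowed("/private/", "/private/report.html"))  # False (intended)
print(is_allowed("/private/", "/private-offers.html"))  # True
```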

To avoid these problems, consider submitting your site to a search engine spider simulator, also called a search engine robot simulator. These simulators, which can be bought or downloaded from the internet, use the same processes and strategies as the different search engines and give you a dry run of how they will read your site. They will tell you which pages are skipped, which links are ignored, and which errors are encountered. Since the simulators will also reenact how the bots will follow your hyperlinks, you'll see if your robots.txt file is interfering with the search engine's ability to read through all the necessary pages.
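A full simulator fetches pages and follows links, but the robots.txt part of the dry run can be approximated offline. The sketch below is not how any particular simulator works; it simply sorts a list of URLs into crawled and skipped using Python's standard `urllib.robotparser`. The function name `simulate_crawl`, the rules, the URLs, and the `ExampleBot` agent are all hypothetical.

```python
from urllib.robotparser import RobotFileParser

def simulate_crawl(robots_text, urls, agent):
    """Report which URLs a rule-abiding robot would crawl or skip."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    report = {"crawled": [], "skipped": []}
    for url in urls:
        key = "crawled" if parser.can_fetch(agent, url) else "skipped"
        report[key].append(url)
    return report

# Hypothetical rules: the second line was probably meant to hide one page only.
RULES = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /products
"""

SITE_URLS = [
    "http://www.example.com/index.html",
    "http://www.example.com/products/widget.html",
    "http://www.example.com/cgi-bin/search",
    "http://www.example.com/about.html",
]

report = simulate_crawl(RULES, SITE_URLS, "ExampleBot")
for url in report["skipped"]:
    print("skipped:", url)
```

Here the product page shows up in the skipped list, which is the kind of surprise a dry run is meant to catch before a real search engine visits.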

It's also important to review your robots.txt files yourself, which will enable you to spot any problems and correct them before you submit your site to real search engines.

Author Bio:

Polyphonics.eu.com specializes in the different genres of ringtones including all the latest real tones.

© 2006-2008 www.shinebeach.com All Rights Reserved Worldwide.