Home SEO Tools Software Submission Service Website Scripts Blog Forum Help Desk

Traffic Geek Forum: What is Robots.txt - Traffic Geek Forum

Jump to content

Page 1 of 1
  • You cannot start a new topic
  • You cannot reply to this topic

What is Robots.txt

#1 User is offline   webexpert 

  • Member
  • PipPip
  • Group: Members
  • Posts: 14
  • Joined: 15-September 09

Posted 20 September 2009 - 01:49 PM

I want to know what is Robots.txt file. and its uses..........
0

#2 User is offline   complimedia 

  • Newbie
  • Pip
  • Group: Members
  • Posts: 2
  • Joined: 21-September 09

Post icon  Posted 21 September 2009 - 02:54 AM

A robots.txt file is used to tell search engine spiders (or bots) what content they should access or stay out of. For example you have two directories on your website like so: example.com/public and example.com/private and you only want the search engines to crawl and index the pages and content in the public folder you would use the robots.txt file. Personally I would also block the private file from being accessed by anyone by using some server side security such as the .htaccess file.

Here is the coded example.

# robots.txt for http://www.example.com/

User-agent: *
Disallow: /private/ #


Learn more at The Web Robots Pages and follow me on twitter @montanaflynn

Quote

WWW Robots (also called wanderers or spiders) are programs that traverse many pages in the World Wide Web by recursively retrieving linked pages. For more information see the robots page.
In 1993 and 1994 there have been occasions where robots have visited WWW servers where they weren't welcome for various reasons. Sometimes these reasons were robot specific, e.g. certain robots swamped servers with rapid-fire requests, or retrieved the same files repeatedly. In other situations robots traversed parts of WWW servers that weren't suitable, e.g. very deep virtual trees, duplicated information, temporary information, or cgi-scripts with side-effects (such as voting).

These incidents indicated the need for established mechanisms for WWW servers to indicate to robots which parts of their server should not be accessed. This standard addresses this need with an operational solution.

0

#3 User is offline   Joe smith 

  • Advanced Member
  • PipPipPip
  • Group: Members
  • Posts: 31
  • Joined: 20-September 09

Posted 23 September 2009 - 12:48 AM

Robot.txt is server side file. and this is a notepad file.......this file we put in CGI Bin directory. it helps to allow or disallow to any search engine to visit your site or not.
0

#4 User is offline   Jhon 

  • Member
  • PipPip
  • Group: Members
  • Posts: 21
  • Joined: 20-September 09

Posted 23 September 2009 - 01:15 AM

Robots.txt is very important file for any website. it give the permission to any search engine you can visit the site or not.
0

#5 User is offline   webseo 

  • Member
  • PipPip
  • Group: Members
  • Posts: 10
  • Joined: 15-September 09

Posted 24 September 2009 - 12:48 PM

Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. it very helpful
0

#6 User is offline   deboo 

  • Newbie
  • Pip
  • Group: Members
  • Posts: 8
  • Joined: 15-September 09

Posted 27 September 2009 - 12:54 PM

You can get detailed explanation of robot.txt at http://www.robotstxt...robotstxt.html. I had the same question few weeks ago and was looking for answers. I found this website very useful.
0

#7 User is offline   jyoti singh 

  • Member
  • PipPip
  • Group: Members
  • Posts: 10
  • Joined: 15-September 09

Posted 04 October 2009 - 01:46 PM

this is a text file.
its used to permit the search engine crawlers to visit your site or not.
0

#8 User is offline   dev singh 

  • Member
  • PipPip
  • Group: Members
  • Posts: 10
  • Joined: 15-October 09

Posted 16 October 2009 - 05:30 AM

"Robots.txt" is a regular text file
Here's a basic "robots.txt":

User-agent: Googlebot
allow: /

User-agent: Googlebot-Image
Disallow: /
0

#9 User is offline   versanigam 

  • Member
  • PipPip
  • Group: Members
  • Posts: 10
  • Joined: 21-October 09

Posted 23 October 2009 - 02:53 AM

A robots.txt file is a text file. and it is helpful for your site.
you can handle that which search engine is come in your site and other is not come.

User-agent: search engine crawler name
allow: /

or disallow
0

#10 User is offline   nachiketkhanna 

  • Member
  • PipPip
  • Group: Members
  • Posts: 10
  • Joined: 07-November 09

Posted 08 November 2009 - 04:00 AM

Robot.txt is the text file it give facility to allow and disallow the search engine to visit the site.
0

Page 1 of 1
  • You cannot start a new topic
  • You cannot reply to this topic

2 User(s) are reading this topic
0 members, 2 guests, 0 anonymous users