Magento Forum - UK Magento Forum (Unofficial)

An unofficial but dedicated Magento Forum for Magento Users, Designers and Developers

Robots.txt File

A forum for Magento users to discuss tips, issues etc with regards to actually using the Magento front and back end systems.

Robots.txt File

Postby davef » Mon Jan 31, 2011 2:42 pm

Hi,

I notice Magento does not create a robots.txt file as standard. Does anyone have a standard robots.txt they can post or is there no such things or even Magento doesn't need one?
davef
 
Posts: 9
Joined: Tue Mar 30, 2010 8:02 am

Re: Robots.txt File

Postby Dx3Webs » Mon Jan 31, 2011 4:07 pm

No one seems to agree on a definitive robots.txt file.. this one has been doing the rounds of various boards and blogs and seems as good as any.
Code: Select all
# $Id: robots.txt,v magento-specific 2010/28/01 18:24:19 goba Exp $
#
# robots.txt
#
# This file is to prevent the crawling and indexing of certain parts
# of your site by web crawlers and spiders run by sites like Yahoo!
# and Google. By telling these "robots" where not to go on your site,
# you save bandwidth and server resources.
#
# This file will be ignored unless it is at the root of your host:
# Used:    http://example.com/robots.txt
# Ignored: http://example.com/site/robots.txt
#
# For more information about the robots.txt standard, see:
# http://www.robotstxt.org/wc/robots.html
#
# For syntax checking, see:
# http://www.sxw.org.uk/computing/robots/check.html

# Website Sitemap
Sitemap: http://www.mydomain.com/sitemap.xml

# Crawlers Setup
User-agent: *
Crawl-delay: 10

# Allowable Index
Allow: /*?p=
Allow: /index.php/blog/
Allow: /catalog/seo_sitemap/category/
Allow:/catalogsearch/result/

# Directories
Disallow: /404/
Disallow: /app/
Disallow: /cgi-bin/
Disallow: /downloader/
Disallow: /includes/
Disallow: /js/
Disallow: /lib/
Disallow: /magento/
Disallow: /media/
Disallow: /pkginfo/
Disallow: /report/
Disallow: /skin/
Disallow: /stats/
Disallow: /var/

# Paths (clean URLs)
Disallow: /index.php/
Disallow: /catalog/product_compare/
Disallow: /catalog/category/view/
Disallow: /catalog/product/view/
Disallow: /catalogsearch/
Disallow: /checkout/
Disallow: /control/
Disallow: /contacts/
Disallow: /customer/
Disallow: /customize/
Disallow: /newsletter/
Disallow: /poll/
Disallow: /review/
Disallow: /sendfriend/
Disallow: /tag/
Disallow: /wishlist/

# Files
Disallow: /cron.php
Disallow: /cron.sh
Disallow: /error_log
Disallow: /install.php
Disallow: /LICENSE.html
Disallow: /LICENSE.txt
Disallow: /LICENSE_AFL.txt
Disallow: /STATUS.txt

# Paths (no clean URLs)
Disallow: /*.js$
Disallow: /*.css$
Disallow: /*.php$
Disallow: /*?p=*&
Disallow: /*?SID=
Dx3webs
UK based Optimised Magento Managed Hosting : Full Managed service with Unlimited technical support
Contact Us | Dx3webs Magento Demo Store
User avatar
Dx3Webs
 
Posts: 305
Joined: Wed Jul 14, 2010 11:04 am
Location: Lincoln, UK

Re: Robots.txt File

Postby davef » Mon Jan 31, 2011 5:03 pm

Many thanks for your reply and the robots.txt details

Regards,

Dave
davef
 
Posts: 9
Joined: Tue Mar 30, 2010 8:02 am


Return to Magento Users

cron