@hans2103
Last active December 7, 2016 23:40

Revisions

  1. hans2103 revised this gist Mar 16, 2015. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion robots.txt
    @@ -1,5 +1,5 @@
    # $Id: robots.txt,v magento-specific 2010/28/01 18:24:19 goba Exp $
   -# Edited: 2014/10/30 by Byte Internet
   +# Edited: 2014/10/30 by Byte.nl
    #
    # robots.txt
    #
  2. hans2103 revised this gist Mar 16, 2015. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion robots.txt
    @@ -1,5 +1,5 @@
    # $Id: robots.txt,v magento-specific 2010/28/01 18:24:19 goba Exp $
   -# Edited: 2012/06/11 by Byte Internet
   +# Edited: 2014/10/30 by Byte Internet
    #
    # robots.txt
    #
  3. hans2103 revised this gist Oct 31, 2014. 1 changed file with 2 additions and 2 deletions.
    4 changes: 2 additions & 2 deletions robots.txt
    @@ -12,7 +12,7 @@
    # Used: http://example.com/robots.txt
    # Ignored: http://example.com/site/robots.txt
    #
   -# For more information abocut the robots.txt standard, see:
   +# For more informationsk abocut the robots.txt standard, see:
    # http://www.robotstxt.org/wc/robots.html
    #
    # For syntax checking, see:
    @@ -66,7 +66,7 @@ Disallow: /pkginfo/
    Disallow: /report/
    Disallow: /scripts/
    Disallow: /shell/
   -Disallow: /skin/
   +#Disallow: /skin/
    Disallow: /stats/
    Disallow: /var/

  4. hans2103 revised this gist Oct 30, 2014. 1 changed file with 6 additions and 6 deletions.
    12 changes: 6 additions & 6 deletions robots.txt
    @@ -12,7 +12,7 @@
    # Used: http://example.com/robots.txt
    # Ignored: http://example.com/site/robots.txt
    #
   -# For more information about the robots.txt standard, see:
   +# For more information abocut the robots.txt standard, see:
    # http://www.robotstxt.org/wc/robots.html
    #
    # For syntax checking, see:
    @@ -50,13 +50,13 @@ Disallow: /magento/
    # Disallow: /media/
    Disallow: /media/captcha/
    # Disallow: /media/catalog/
   -Disallow: /media/css/
   -Disallow: /media/css_secure/
   +#Disallow: /media/css/
   +#Disallow: /media/css_secure/
    Disallow: /media/customer/
    Disallow: /media/dhl/
    Disallow: /media/downloadable/
    Disallow: /media/import/
   -Disallow: /media/js/
   +#Disallow: /media/js/
    Disallow: /media/pdf/
    Disallow: /media/sales/
    Disallow: /media/tmp/
    @@ -101,8 +101,8 @@ Disallow: /STATUS.txt
    Disallow: /get.php # Magento 1.5+

    # Paths (no clean URLs)
   -Disallow: /*.js$
   -Disallow: /*.css$
   +#Disallow: /*.js$
   +#Disallow: /*.css$
    Disallow: /*.php$
    Disallow: /*?SID=
    Disallow: /rss*
  5. hans2103 revised this gist Dec 16, 2013. 1 changed file with 3 additions and 0 deletions.
    3 changes: 3 additions & 0 deletions robots.txt
    @@ -17,6 +17,9 @@
    #
    # For syntax checking, see:
    # http://www.sxw.org.uk/computing/robots/check.html
   +#
   +# Prevent blocking URL parameters with robots.txt
   +# Use Google Webmaster Tools > Crawl > Url parameters instead

    # Website Sitemap
    Sitemap: http://www.example.com/sitemap.xml
  6. hans2103 renamed this gist Jun 7, 2013. 1 changed file with 0 additions and 0 deletions.
    File renamed without changes.
  7. hans2103 created this gist Jun 7, 2013.
    106 changes: 106 additions & 0 deletions gistfile1.txt
    @@ -0,0 +1,106 @@
    # $Id: robots.txt,v magento-specific 2010/28/01 18:24:19 goba Exp $
    # Edited: 2012/06/11 by Byte Internet
    #
    # robots.txt
    #
    # This file is to prevent the crawling and indexing of certain parts
    # of your site by web crawlers and spiders run by sites like Yahoo!
    # and Google. By telling these “robots” where not to go on your site,
    # you save bandwidth and server resources.
    #
    # This file will be ignored unless it is at the root of your host:
    # Used: http://example.com/robots.txt
    # Ignored: http://example.com/site/robots.txt
    #
    # For more information about the robots.txt standard, see:
    # http://www.robotstxt.org/wc/robots.html
    #
    # For syntax checking, see:
    # http://www.sxw.org.uk/computing/robots/check.html

    # Website Sitemap
    Sitemap: http://www.example.com/sitemap.xml

    # Crawlers Setup
    User-agent: *
    Crawl-delay: 10

    # Allowable Index
    # Mind that Allow is not an official standard
    Allow: /*?p=
    Allow: /index.php/blog/
    Allow: /catalog/seo_sitemap/category/
    # Allow: /catalogsearch/result/
    Allow: /media/catalog/

    # Directories
    Disallow: /404/
    Disallow: /app/
    Disallow: /cgi-bin/
    Disallow: /downloader/
    Disallow: /errors/
    Disallow: /includes/
    Disallow: /js/
    Disallow: /lib/
    Disallow: /magento/

    # Disallow: /media/
    Disallow: /media/captcha/
    # Disallow: /media/catalog/
    Disallow: /media/css/
    Disallow: /media/css_secure/
    Disallow: /media/customer/
    Disallow: /media/dhl/
    Disallow: /media/downloadable/
    Disallow: /media/import/
    Disallow: /media/js/
    Disallow: /media/pdf/
    Disallow: /media/sales/
    Disallow: /media/tmp/
    Disallow: /media/wysiwyg/
    Disallow: /media/xmlconnect/
    Disallow: /pkginfo/
    Disallow: /report/
    Disallow: /scripts/
    Disallow: /shell/
    Disallow: /skin/
    Disallow: /stats/
    Disallow: /var/

    # Paths (clean URLs)
    Disallow: /index.php/
    Disallow: /catalog/product_compare/
    Disallow: /catalog/category/view/
    Disallow: /catalog/product/view/
    Disallow: /catalog/product/gallery/
    Disallow: /catalogsearch/
    Disallow: /checkout/
    Disallow: /control/
    Disallow: /contacts/
    Disallow: /customer/
    Disallow: /customize/
    Disallow: /newsletter/
    Disallow: /poll/
    Disallow: /review/
    Disallow: /sendfriend/
    Disallow: /tag/
    Disallow: /wishlist/

    # Files
    Disallow: /cron.php
    Disallow: /cron.sh
    Disallow: /error_log
    Disallow: /install.php
    Disallow: /LICENSE.html
    Disallow: /LICENSE.txt
    Disallow: /LICENSE_AFL.txt
    Disallow: /STATUS.txt
    Disallow: /get.php # Magento 1.5+

    # Paths (no clean URLs)
    Disallow: /*.js$
    Disallow: /*.css$
    Disallow: /*.php$
    Disallow: /*?SID=
    Disallow: /rss*
    Disallow: /*PHPSESSID
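
The file mixes plain prefix rules (`Disallow: /checkout/`) with wildcard rules (`Disallow: /*.js$`, `Allow: /*?p=`), and it notes that `Allow` was not an official standard when the file was written. The sketch below is one way to reason about how such rules interact. It assumes the matching behaviour later described in RFC 9309 and used by the major search engines: `*` matches any run of characters, a trailing `$` anchors the end of the URL, the longest matching pattern wins, and `Allow` wins a tie. Individual crawlers may behave differently. The names `rule_to_regex`, `is_allowed`, and `RULES` are illustrative and not part of any library, and `RULES` is only a small subset of the original file.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters and a trailing '$' anchors the
    end of the URL. Everything else is matched literally, as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Return True if `path` may be crawled under `rules`.

    `rules` holds (directive, pattern) pairs, where directive is
    "allow" or "disallow". Assumed precedence: the longest matching
    pattern wins, Allow wins a tie, and an unmatched path is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if rule_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


# A subset of the rules from the first revision of the file.
RULES = [
    ("allow", "/*?p="),
    ("allow", "/media/catalog/"),
    ("disallow", "/checkout/"),
    ("disallow", "/skin/"),
    ("disallow", "/*.js$"),
    ("disallow", "/*.css$"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?SID="),
]
```

Under these assumptions, `is_allowed("/checkout/cart/", RULES)` is false and `is_allowed("/media/catalog/product/a.jpg", RULES)` is true. A stylesheet under `/skin/` is blocked twice, by the directory rule and by `/*.css$`, which is consistent with the later revisions commenting out both kinds of rule so that crawlers can fetch page assets.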