For my latest Magento ecommerce project I just noticed that search engines indexed some URLs with annoying params like ?dir=asc&order=position or ?limit=9. Also some unnecessary URLs like catalog search or checkout page.
Usually we don’t want to Google to index these pages because it could be duplicated content which can be bad for our ecommerce shop’s ranking. One option would be to have nofollow links in your template but the easier way is to build a robots.txt file your Magento ecommerce shop. I also included some other important directories we usually don’t want to have indexed by search engines.
Here is my robots.txt example:
User-agent: * Disallow: /index.php/ Disallow: /*? Disallow: /*.js$ Disallow: /*.css$ Disallow: /404/ Disallow: /admin/ Disallow: /api/ Disallow: /app/ Disallow: /catalog/category/view/ Disallow: /catalog/product/view/ Disallow: /catalog/product_compare/ Disallow: /catalogsearch/ Disallow: /catalogsearch/advanced/ Disallow: /catalogsearch/term/popular/ Disallow: /cgi-bin/ Disallow: /checkout/ Disallow: /checkout/cart/ Disallow: /contacts/ Disallow: /contacts/index/ Disallow: /contacts/index/post/ Disallow: /customer/ Disallow: /customer/account/ Disallow: /customer/account/login/ Disallow: /downloader/ Disallow: /install/ Disallow: /images/ Disallow: /js/ Disallow: /lib/ Disallow: /magento/ Disallow: /media/ Disallow: /newsletter/ Disallow: /pkginfo/ Disallow: /private/ Disallow: /poll/ Disallow: /report/ Disallow: /review/ Disallow: /sendfriend/ Disallow: /skin/ Disallow: /tag/ Disallow: /var/ Disallow: /wishlist/ Disallow: /anyothercontentyouwouldliketodisallow/ Sitemap: http://www.your-amazing-magento-ecommerce-shop.com/sitemap.xml
One Comment on “How to avoid annoying Magento URL params in search engine index”
You can track this conversation through its atom feed.



Very helpful – thanks very much! Is also annoyed by loads of unnecessary URLs indexed by Google.
Posted on 06/08/2010 um 1:08 PM.