Friday, April 26, 2024

Google warns against using 403 or 404 status codes for Googlebot crawl rate limiting - Search Engine Land

Last updated Friday, February 17, 2023 18:05 ET, Source: NewsService

Google is warning against using 404 and other 4xx client error status codes, such as 403, to try to set a crawl rate limit for Googlebot. “Please don’t do that,” Gary Illyes from the Google Search Relations team wrote.

Why the notice. There has been a recent increase in the number of sites and CDNs using these techniques to try to limit Googlebot crawling. “Over the last few months we noticed an uptick in website owners and some content delivery networks (CDNs) attempting to use 404 and other 4xx client errors (but not 429) to attempt to reduce Googlebot’s crawl rate,” Gary Illyes wrote.

What to do instead. Google has a detailed help document dedicated to reducing Googlebot crawling on your site. The recommended approach is to adjust your crawl rate with the crawl rate settings in Google Search Console.

Google explained, “To quickly reduce the crawl rate, you can change the Googlebot crawl rate in Search Console. Changes made to this setting are generally reflected within days. To use this setting, first verify your site ownership. Make sure that you avoid setting the crawl rate to a value that’s too low for your site’s needs. Learn more about what crawl budget means for Googlebot. If the Crawl Rate Settings is unavailable for your site, file a special request to reduce the crawl rate. You cannot request an increase in crawl rate.”

If you can’t do that, Google then says “reduce the crawl rate for short period of time (for example, a couple of...
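For illustration, the exception Illyes carves out ("but not 429") hints at what a throttling response can look like in practice. The following is a minimal Python sketch, not Google's or any CDN's implementation: the function name `throttle_response`, the simple per-window request counter, and the 120-second `Retry-After` value are all hypothetical choices made for this example. It answers an over-limit crawler with 429 (Too Many Requests) rather than a 403 or 404.

```python
from http import HTTPStatus


def throttle_response(requests_in_window: int, limit: int,
                      retry_after_seconds: int = 120):
    """Pick a status code and headers for a crawler request.

    Hypothetical helper: `requests_in_window` is how many requests the
    crawler has made in the current time window, `limit` is the most
    the site is willing to serve in that window.
    """
    if requests_in_window <= limit:
        # Within budget: serve the page normally.
        return HTTPStatus.OK, {}

    # Over budget: say "slow down" with 429 instead of 403 or 404,
    # and suggest (assumed value) how long the client should wait.
    headers = {"Retry-After": str(retry_after_seconds)}
    return HTTPStatus.TOO_MANY_REQUESTS, headers


if __name__ == "__main__":
    status, headers = throttle_response(requests_in_window=11, limit=10)
    print(int(status), status.phrase, headers)
```

The design point is semantic: a 429 tells the client that the request was fine but arrived too fast, whereas a 403 or 404 describes the URL itself as forbidden or missing.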



Read Full Story: https://news.google.com/rss/articles/CBMieGh0dHBzOi8vc2VhcmNoZW5naW5lbGFuZC5jb20vZ29vZ2xlLXdhcm5zLWFnYWluc3QtdXNpbmctNDAzLW9yLTQwNC1zdGF0dXMtY29kZXMtZm9yLWdvb2dsZWJvdC1jcmF3bC1yYXRlLWxpbWl0aW5nLTM5MzMwM9IBAA?oc=5
