Under review
Trouble with bots and crawlers
Mike Stickler 11 years ago
in BLOX CMS
updated by Patrick O'Lone (Director of Software Development) 10 years ago
Is anyone else having bandwidth overages and, on tracing them back, finding that msnbot accounts for 24%-33% of total page views? I am loath to block it completely, but changing the crawl delay had no effect.
By way of comparison, Googlebot, Yahoo (Slurp), and bingbot all seem to account for about 6% of our page views.
How are you dealing with the issue?
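For anyone who wants to check their own numbers, here is a rough sketch of how the per-bot share could be measured from a web server access log. It assumes the log is in the common "combined" format, where the User-Agent is the last quoted field on each line. The function name `bot_share` and the list of substrings are my own choices for illustration, not anything BLOX provides.

```python
import re
from collections import Counter

# Substrings to look for in the User-Agent field (illustrative list).
# "msnbot" also catches variants such as msnbot-newsblogs.
BOTS = ["msnbot", "bingbot", "googlebot", "slurp"]

# Assumption: combined log format, so the User-Agent is the last quoted field.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')


def bot_share(lines):
    """Return each bot's fraction of all requests that carry a User-Agent."""
    counts = Counter()
    total = 0
    for line in lines:
        match = UA_PATTERN.search(line)
        if not match:
            continue  # skip lines that do not end in a quoted field
        total += 1
        agent = match.group(1).lower()
        for bot in BOTS:
            if bot in agent:
                counts[bot] += 1
                break  # count each request for one bot only
    if total == 0:
        return {}
    return {bot: counts[bot] / total for bot in BOTS}
```

Passing an open log file to `bot_share` gives a dictionary of fractions per bot. Note that this counts requests, which may not match page views as your analytics package defines them.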
Neither the webmaster tools nor the crawl delay seemed to have any impact on msnbot-newsblogs or msnbot-udiscovery. Those two bots were accounting for a third of our total pages served. Your staff did confirm that the IP addresses the activity was coming from were Microsoft's, but in the end I disallowed them. I have not seen any drop in our search results from msn.com/bing, but I did notice a lack of our content on news.msn.com. That concerns me a bit, but I had not seen any referrals from that domain in months anyway.
http://www.bing.com/blogs/site_blogs/b/webmaster/archive/2009/08/10/crawl-delay-and-the-bing-crawler-msnbot.aspx
I checked with Dustin at TownNews, and he confirmed the robots.txt was formatted correctly. His suspicion was that msnbot was not reading the robots.txt file on every crawl; instead it was caching that info and may have missed some of my changes. There is evidence to support this, as it was several days after I set the disallow before the crawl activity from msnbot actually ceased. I may go back and allow the crawl again but set the delay back to ten. I'll leave it like that until we start seeing activity.
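In case it helps anyone checking their own file, below is a small sketch that runs a robots.txt through Python's standard `urllib.robotparser` to see how one parser reads the rules. The file contents are an assumed example of the setup described above (the two news bots disallowed, plain msnbot allowed with a delay of ten), not a copy of our actual robots.txt, and `load_rules` is just a helper name I made up. It only shows how Python interprets the file; it says nothing certain about how Microsoft's crawlers interpret or cache it.

```python
from urllib.robotparser import RobotFileParser

# Assumed example file, not the real one from this thread.
# The more specific bot names come first because Python's parser uses the
# first group whose name appears inside the requesting user agent.
ROBOTS_TXT = """\
User-agent: msnbot-NewsBlogs
User-agent: msnbot-UDiscovery
Disallow: /

User-agent: msnbot
Crawl-delay: 10
"""


def load_rules(text):
    """Parse robots.txt text and return the populated parser."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)
for agent in ("msnbot-NewsBlogs", "msnbot-UDiscovery", "msnbot", "Googlebot"):
    allowed = rules.can_fetch(agent, "/news/")
    delay = rules.crawl_delay(agent)
    print(f"{agent}: allowed={allowed}, crawl_delay={delay}")
```

A check like this can catch formatting mistakes, but it cannot tell you when a crawler last re-read the file, which is the caching issue suspected here.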