Google-Bot fell in love with my 404-page

Posted by 32bitfloat on Server Fault See other posts from Server Fault or by 32bitfloat
Published on 2013-10-21T20:28:33Z Indexed on 2013/10/21 21:55 UTC
Read the original article Hit count: 422

Filed under:

googlebot

Every day my access-log looks kind of this:

66.249.78.140 - - [21/Oct/2013:14:37:00 +0200] "GET /robots.txt HTTP/1.1" 200 112 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
66.249.78.140 - - [21/Oct/2013:14:37:01 +0200] "GET /robots.txt HTTP/1.1" 200 112 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
66.249.78.140 - - [21/Oct/2013:14:37:01 +0200] "GET /vuqffxiyupdh.html HTTP/1.1" 404 1189 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

or this

66.249.78.140 - - [20/Oct/2013:09:25:29 +0200] "GET /robots.txt HTTP/1.1" 200 112 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
66.249.75.62 - - [20/Oct/2013:09:25:30 +0200] "GET /robots.txt HTTP/1.1" 200 112 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
66.249.78.140 - - [20/Oct/2013:09:25:30 +0200] "GET /zjtrtxnsh.html HTTP/1.1" 404 1186 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

The bot calls the robots.txt twice and after that tries to access a file (zjtrtxnsh.html, vuqffxiyupdh.html, ...) which cannot exist and must return a 404 error. The same procedure every day, just the unexisting html-filename changes.

The content of my robots.txt:

User-agent: *
Disallow: /backend
Sitemap: http://mysitesname.de/sitemap.xml

The sitemap.xml is readable and valid, so there seems to be no reason why the bot should want to force a 404-error.
How should I interpret this behaviour? Does it point to a mistake I've done or should I ignore it?

Developer IT

Google-Bot fell in love with my 404-page - Developer IT

Google-Bot fell in love with my 404-page

404

robots.txt

googlebot

Related posts about 404

ubuntu/apt-get update said "Failed to Fetch http:// .... 404 not found"

Cannot update, apt-get cannot fetch index files

Protecting Apache with Fail2Ban

Django flatpages raising 404 when DEBUG is False (404 and 500 templates exist)

x.gif in Apache logs

Related posts about robots.txt

Robots.txt practices with .htaccess redirections (inherits)

mod evasive not working properly on ubuntu 10.04

Cross-domain jQuery using YQL gives robots.txt error

Asterisk in robots.txt

SEO chaos from changing robots.txt file in Wordpress site

Categories cloud