Whitelisting the Search Engine Spiders

Avalon

Member
Apr 27, 2015
19
1
3
United States
cPanel Access Level
DataCenter Provider
Twitter
Today I noticed that mod_security blocked valid Google spiders from accessing some of my servers and while it was relatively easy to unblock them. Then I proceed to add the "big name" search engines to the CSF ignore list but I know this list is incomplete:

Code:
.googlebot.com
.crawl.yahoo.net
.search.msn.com
Does anyone have a listing of what domains are used for other popular search engines like DuckDuckGo, Yandex, Baidu, etc.

I haven't had false positives with any of the other engines yet but better to be prepared.

I didn't think it would be so hard to find a listing of crawler domains to be honest.