Randem Systems Support Board

BotBanish => BotBanish - General Questions, Report Bugs, Problems etc... => Topic started by: sonnyh on December 31, 2019, 05:20:45 AM

Title: Format Blacklisting of Spiders
Post by: sonnyh on December 31, 2019, 05:20:45 AM
Hi,
I must be doing something wrong.
I have blacklisted these spiders, but they keep showing up:
crawl-66-249-79-84.googlebot.com
crawl-66-249-79-82.googlebot.com

How should the entry be made to block these?
Title: Re: Format Blacklisting of Spiders
Post by: Randem on December 31, 2019, 10:00:41 AM
How have you blacklisted Google? Did you add gogglebot to the spider blacklist and removed it from the spider whitelist or added 66.249 to the ip blacklist?
More information is needed. Let me see your .htaccess file.
Title: Re: Format Blacklisting of Spiders
Post by: sonnyh on December 31, 2019, 02:27:04 PM
Attached is a list of the Google Spiders and htaccess from the SMF root

I had 66.249.

Title: Re: Format Blacklisting of Spiders
Post by: Randem on December 31, 2019, 03:14:45 PM
Aloha sonnyh,

I have attached a corrected file for you. The google domain (crawl-66-249-79-86.googlebot.com) was in the spider block list not the user-agent entry (googlebot). Also the google segment block was not in the ip block list (66.249). You can replace the contents of your htaccess file with the contents of this one.

I corrected these issues. Your lists should reflect these changes.

BTW: These changes need to be in both .htaccess files (root and smf) for it to work properly.

The htaccess file in the smf folder will over-ride the root htaccess file in most non-system cases.
Title: Re: Format Blacklisting of Spiders
Post by: sonnyh on December 31, 2019, 03:55:13 PM
Thank you,
Have a great and healthy new year