Format Blacklisting of Spiders

Started by sonnyh, December 31, 2019, 05:20:45 AM

Previous topic - Next topic

sonnyh

Thank you,
Have a great and healthy new year
Best,
Sonny

Randem

Aloha sonnyh,

I have attached a corrected file for you. The google domain (crawl-66-249-79-86.googlebot.com) was in the spider block list not the user-agent entry (googlebot). Also the google segment block was not in the ip block list (66.249). You can replace the contents of your htaccess file with the contents of this one.

I corrected these issues. Your lists should reflect these changes.

BTW: These changes need to be in both .htaccess files (root and smf) for it to work properly.

The htaccess file in the smf folder will over-ride the root htaccess file in most non-system cases.

sonnyh

Attached is a list of the Google Spiders and htaccess from the SMF root

I had 66.249.

Best,
Sonny

Randem

How have you blacklisted Google? Did you add gogglebot to the spider blacklist and removed it from the spider whitelist or added 66.249 to the ip blacklist?
More information is needed. Let me see your .htaccess file.

sonnyh

Hi,
I must be doing something wrong.
I have blacklisted these spiders, but they keep showing up:
crawl-66-249-79-84.googlebot.com
crawl-66-249-79-82.googlebot.com

How should the entry be made to block these?
Best,
Sonny