J 2.5.7 com_search vs Google - Joomla! Forum - community, help and support
hello.
i got website more 15 000 articles, , noticed, google indexed thousand of search results page using url
/component/search/?searchword=
each google querry generate 90% of server load php_cgi overload sometimes. , google gone crazy because looped on over same word.
i ordered google not index folders in robots.txt , @ google webmaster disabled querry strings searching... without effect. got lot of serp indexed pages "my website search results" , server overload. reduced number of request google 100 s.
the question how can prevent google querring search page on website. maybe htaccess trick ban ip specified string:
/component/search/
or code in php can hack joomla code check if ip on whitelist use com_search?
i tried of htaccess tricks none worked...
any appreciated.
best, c3n
p.s.
php 5.3, j. 2.5.7, t3 framework cache (only), gzip, htaccess rewrite, sh404sef (disabled com_search),
i got website more 15 000 articles, , noticed, google indexed thousand of search results page using url
/component/search/?searchword=
each google querry generate 90% of server load php_cgi overload sometimes. , google gone crazy because looped on over same word.
i ordered google not index folders in robots.txt , @ google webmaster disabled querry strings searching... without effect. got lot of serp indexed pages "my website search results" , server overload. reduced number of request google 100 s.
the question how can prevent google querring search page on website. maybe htaccess trick ban ip specified string:
/component/search/
or code in php can hack joomla code check if ip on whitelist use com_search?
i tried of htaccess tricks none worked...
any appreciated.
best, c3n
p.s.
php 5.3, j. 2.5.7, t3 framework cache (only), gzip, htaccess rewrite, sh404sef (disabled com_search),
robots doesn't work sadly.
you should have disallow: /components/ in robots precaution, far many people rely on robots.txt , shouldn't. best way stop things ever entering serps noindex them.
you want use "noindex, follow" meta on pages, doing tell google not index them follow links on page discover new pages pass juice.
also worth mentioning robots.txt doesn't remove pages have been indexed, or pending index. need remove manually said, or introduce noindex on next skim remove it.
noindex, robots = best method.
htaccess = more trouble worth since can fix that, , break thing , go crazy loop of problems. use metas advantage.
you should have disallow: /components/ in robots precaution, far many people rely on robots.txt , shouldn't. best way stop things ever entering serps noindex them.
you want use "noindex, follow" meta on pages, doing tell google not index them follow links on page discover new pages pass juice.
also worth mentioning robots.txt doesn't remove pages have been indexed, or pending index. need remove manually said, or introduce noindex on next skim remove it.
noindex, robots = best method.
htaccess = more trouble worth since can fix that, , break thing , go crazy loop of problems. use metas advantage.
Comments
Post a Comment