Casa > Como guias > Exclude All Hits from Known Bots and Spiders in Google Analytics

Excluir Todos os hits Bots Conhecidos e Aranhas no Google Analytics

Para cada usuário do Google Analytics, é importante distinguir o tráfego de internet de um site - o real do artificial. Existem programas de computador projetado para executar tarefas automáticas através da Internet, chamados Bots, which can generate a lot of the traffic on a website. assim, in this article you will learn what you can do, to exclude that generally unwanted traffic.


At the end of last year, there were official reports which state that only around “50% of web traffic” comes from real human beings. Isso é um problema, especially with so many “Bots” around that can hinder you in finding out what is the real traffic for any of your websites.

bot are designed to complete simple and repetitive tasks on an automatic basis, which are tedious or impossible for people to do. The largest use of such “Bots” is in Web Spidering. A “Aranha”, também conhecido como "Web crawler” is a script that fetches, analyzes and files specific information from web servers much faster than any human being can do. It is called that because it crawls over the Web.

In the past an efficient way to stop the influence of bots over traffic statistics was for Google Analytics usar JavaScript, as bots couldn’t. Nowadays there are even “smart” bots, which can successfully use JavaScript, and bypass that prevention technique in Google Analytics.

Besides such bots, there are malicious ones. Some bots are designed to cause “Negação de serviço (DoS)” attacks, while others spam unrelated messages on your sites with link that can contain malicious content, which are other valid reasons not to want bot-crawlers on your web pages.

Here are 5 métodos to exclude all of the bot generated traffic from Google Analytics:

Método #1 – Enable Bot Filtering from the Admin Panel


There is a checkbox inside Google Analytics that can be ticked to remove known bots. It is located inside the “administrador” panel, under “Visualizar” settings and the checkbox is named “Bot Filtering”. It is advised to first make a Test View, before leaving the box checked in the main reporting Visualizar. In that way you can spot what differences you will spot in your data collection results.


Método #2Filter Specific IP Addresses

IP addresses are not displayed in Google Analytics relatórios. Neither are they available to be seen through JavaScript por padrão, but you should be able to configure and turn on that feature on your site with considerable ease. After you have recovered all IP addresses that you want to exclude from being analyzed in internet traffic data, it is time to block those. You can do that by using the “View Filters” menu inside Google Analytics.

Método #3 – Adding a JavaScript Variable for User Agents

Even if you block specific IPs, some bots can use multiple IP addresses and switch between them. With the help of “Google Tag Manager” is possible to pass all “Agente de usuário” cordas para dentro Google Analytics as a custom dimension. Depois, you can exclude bots’ sessions.

Create a “Custom Dimension” in the “administrador” panel of Google Analytics. Make sure the name is “Agente de usuário” and make the scope “Sessão”. Leave “Index” untouched for now.

In Google Tag Manager set a new “JavaScript Variable” with navigator.userAgent valor.


Set a custom variable slot for your “Google Analytics Pageview Tag”, using the above-mentioned “Index”. Introduzir o {{Agente de usuário}} variable in “Dimension Value”.

In “Admin > View > Filters” you can eliminate “User Agents” which are known to be bots, due to their weird behavior, like users having repeat bounce rates or other users having hundreds of visits per day.

Método #4 – Add a CAPTCHA Requirement

To make things more difficult for bots that are still entering your site despite the previous precautionary measures taken, you should use some sort of a “CAPTCHA”. It is recommended you use Google’s own “reCAPTCHA” service and the newer variant called “noCAPTCHA”. It detects human behavior that is typical, including mouse usage, allowing for people not having the need to enter a captcha message at all.


Follow the instructions here to get “reCAPTCHA” and thus, get the function to not fire Google Analytics until somebody completes the captcha successfully. Each new site visitor will have to go through the captcha process, and this will set a session cookie after successful completion.

Método #5 – Require Users to Validate Their Emails

Users should be required to enter a valid email address, check their email, and click on the confirmation link with a message you said. There are very sophisticated Bots that can even do that, so one final thing you should do is to add a “reCAPTCHA” here too. Such one with image recognition:


This should sum up the most effective ways to exclude all hits from Internet Bots and Spiders in Google Analytics for your site(s). You should see a noticeable change in your data collection analysis results, after implementing each and every method described above. And you will also prevent ataques DDoS from malicious bots as well. assim, to sum up, all these methods are a viable way to fight bots and are necessary for every Google Analytics do utilizador.


Berta Bilbao

Berta é um pesquisador de malware dedicado, sonhando para um espaço cibernético mais seguro. Seu fascínio com a segurança de TI começou há alguns anos atrás, quando um malware bloqueado la fora de seu próprio computador.

mais Posts

Deixe um comentário

seu endereço de e-mail não será publicado. Campos obrigatórios são marcados *

Compartilhar no Facebook Compartilhar
Compartilhar no Twitter chilrear
Compartilhar no Google Plus Compartilhar
Partilhar no Linkedin Compartilhar
Compartilhar no Digg Compartilhar
Compartilhar no Reddit Compartilhar
Partilhar no StumbleUpon Compartilhar