Loading…
Websites that disallow specific crawlers in robots.txt, from AI trainers like GPTBot and ClaudeBot to SEO and search bots like AhrefsBot and Bingbot. Based on the 880,858 websites tracked by Site Stats Database.
883 blocked crawlers & bots are used by 100 or more of the 125,901 sites we track in this dimension. Pick one to see the top websites, ranked by traffic.
| # | Name | Websites |
|---|---|---|
| 1 | GPTBot | 52,109 |
| 2 | CCBot | 51,982 |
| 3 | Bytespider | 47,590 |
| 4 |
| Amazonbot |
| 45,564 |
| 5 | ClaudeBot | 45,207 |
| 6 | Google-Extended | 43,510 |
| 7 | meta-externalagent | 41,277 |
| 8 | Applebot-Extended | 39,412 |
| 9 | CloudflareBrowserRenderingCrawler | 30,902 |
| 10 | PetalBot | 27,074 |
| 11 | MJ12bot | 21,761 |
| 12 | Baiduspider | 18,271 |
| 13 | AhrefsBot | 17,305 |
| 14 | dotbot | 16,004 |
| 15 | BLEXBot | 15,945 |
| 16 | SemrushBot | 15,568 |
| 17 | anthropic-ai | 14,391 |
| 18 | Yandex | 14,313 |
| 19 | ChatGPT-User | 12,379 |
| 20 | FacebookBot | 11,126 |
| 21 | Claude-Web | 10,632 |
| 22 | cohere-ai | 9,470 |
| 23 | PerplexityBot | 9,276 |
| 24 | Diffbot | 9,231 |
| 25 | omgili | 8,964 |
| 26 | omgilibot | 8,895 |
| 27 | ImagesiftBot | 8,152 |
| 28 | DataForSeoBot | 7,231 |
| 29 | Barkrowler | 6,838 |
| 30 | TurnitinBot | 6,428 |
| 31 | YouBot | 6,288 |
| 32 | magpie-crawler | 6,007 |
| 33 | OAI-SearchBot | 5,586 |
| 34 | Scrapy | 5,403 |
| 35 | ia_archiver | 5,378 |
| 36 | YandexBot | 5,368 |
| 37 | Timpibot | 5,295 |
| 38 | Nutch | 5,256 |
| 39 | FriendlyCrawler | 5,163 |
| 40 | Exabot | 5,025 |
| 41 | AI2Bot | 5,013 |
| 42 | cohere-training-data-crawler | 4,831 |
| 43 | ICC-Crawler | 4,674 |
| 44 | AwarioSmartBot | 4,656 |
| 45 | SeznamBot | 4,514 |
| 46 | AwarioRssBot | 4,482 |
| 47 | MegaIndex.ru | 4,359 |
| 48 | OmniExplorer_Bot | 4,352 |
| 49 | spbot | 4,331 |
| 50 | img2dataset | 4,172 |