The numbers behind the database

Our Data

Last updated: July 1, 2026

We track 130+ data points on each of 880,858 websites, from traffic and authority to the full tech stack, monetization, commerce, and security posture. Below is everything we collect and, honestly, how many sites we have it for. Every figure is computed straight from the live database.

  • Sites tracked: 880.9K (880,858)
  • Data points / site: 130+ (fields collected)
  • Monthly visitors: 146.6B (combined · 2026-06)
  • Organic keywords: 6.3B (tracked across all sites)
  • Languages: 69 (distinct detected)

01

Overview

Aggregate scale of the dataset.

  • Sites: 880,858
  • Data points / site: 130+ (collected fields)
  • Organic keywords: 6.3B (tracked across all sites)
  • Pages indexed: 59B (summed sitemap counts)
  • Referring domains: 2B
  • Traffic records: 29.9M (34 months)
  • Topics covered: 3,112
  • Languages: 69

Where it comes from: traffic estimates, organic keywords, referring domains, and Authority (PageRank) come from DataForSEO (traffic reflects US organic search); Domain Rating comes from Ahrefs' free public endpoint; domain registration dates come from RDAP/WHOIS records; and the technology stack, monetization, niche, and on-page signals are detected by our own homepage crawler.

02

What we track

Every data point we collect, grouped by theme, with the number of sites we have each field for. Detection is best-effort from public signals, so a low share usually means the technology is rare, not that data is missing.

  • Metrics layer: 880.9K (traffic · authority · SEO, every site)
  • DNS & TLS layer: 842.1K (95.6% of sites resolved)
  • Homepage crawl layer: 662.2K (75.2% of sites crawled)
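
The share next to each count in the lists below is the count divided by the 880,858 tracked sites. A minimal sketch of that arithmetic, assuming shares are rounded to one decimal place and anything under 0.1% is shown as "<0.1%" (the function name and the exact rounding rule are illustrative assumptions, not the production code):

```python
TOTAL_SITES = 880_858  # sites tracked, from the overview figures


def coverage(count: int, total: int = TOTAL_SITES) -> str:
    """Format a field's site count as 'count · share%' of the catalog."""
    share = 100 * count / total
    if 0 < share < 0.1:
        # Assumed display rule for very rare fields.
        return f"{count:,} · <0.1%"
    return f"{count:,} · {share:.1f}%"


print(coverage(662_169))  # sites reached by the homepage crawl
print(coverage(721))      # a rare field, e.g. live shopping
```

A count just short of the full catalog, such as 880,836, still rounds to 100.0% under this rule.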

Traffic & search

Visitor estimates and search footprint, on every site.

  • Monthly traffic history: 880,858 · 100.0%
  • Organic keywords: 880,836 · 100.0%
  • Paid traffic: 880,836 · 100.0%
  • Pages indexed: 608,887 · 69.1%

Authority & links

How established and well-linked each domain is.

  • Authority (PageRank): 880,858 · 100.0%
  • Domain Rating (Ahrefs): 880,858 · 100.0%
  • Referring domains: 880,858 · 100.0%
  • Performance score: 880,858 · 100.0%
  • Traffic trend: 801,937 · 91.0%

Platform & CMS

What each site is built and published on.

Hosting & infrastructure

Where the site runs and how it's served.

Front-end & libraries

Client-side frameworks, UI tooling, and embeds.

Analytics & tracking

Measurement, experimentation, and monitoring.

Marketing & growth

How sites acquire, advertise to, and retain visitors.

Commerce & payments

Storefront, checkout, and fulfillment tech.

  • Payment processors: 42,046 · 4.8%
  • Reviews / social proof: 39,731 · 4.5%
  • Donations: 9,864 · 1.1%
  • Reservations / booking: 3,503 · 0.4%
  • Ticketing: 2,422 · 0.3%
  • Shipping / fulfillment: 1,859 · 0.2%
  • Subscriptions: 1,820 · 0.2%
  • Live shopping: 721 · <0.1%
  • Cart abandonment: 636 · <0.1%
  • Tax compliance: 561 · <0.1%
  • Cross-border / duties: 452 · <0.1%

Audience & engagement

How sites interact with and support visitors.

Security & trust

TLS, headers, consent, and abuse defenses.

Brand & structured data

Identity signals and machine-readable markup.

  • Logo: 511,078 · 58.0%
  • Company / brand name: 448,751 · 50.9%
  • Schema.org markup: 393,963 · 44.7%
  • Open Graph / share card: 341,866 · 38.8%
  • Brand color: 121,383 · 13.8%
  • Companion mobile apps: 25,268 · 2.9%
  • Founding year: 14,559 · 1.7%

Performance & page quality

Speed, weight, and SEO-hygiene signals from the crawl.

  • Liveness status: 878,665 · 99.8%
  • Time to first byte: 662,169 · 75.2%
  • Page weight: 662,169 · 75.2%
  • Third-party hosts: 662,169 · 75.2%
  • Indexable: 657,565 · 74.7%
  • Mobile-friendly: 625,740 · 71.0%
  • Canonical URL: 510,151 · 57.9%

Content, topics & meta

Editorial classification, freshness, and on-page metadata.

  • Registration date: 803,295 · 91.2%
  • Meta title: 722,582 · 82.0%
  • Primary language: 720,218 · 81.8%
  • Meta description: 581,727 · 66.0%
  • Topics covered: 578,597 · 65.7%
  • Word count: 535,926 · 60.8%
  • Key page links: 513,472 · 58.3%
  • Last-published date: 486,957 · 55.3%
  • Copyright year: 400,657 · 45.5%
  • RSS / Atom feed: 292,426 · 33.2%
  • Currencies: 225,918 · 25.6%
  • llms.txt: 138,269 · 15.7%
  • Meta keywords: 127,359 · 14.5%
  • hreflang targets: 68,026 · 7.7%
03

Traffic

Estimated monthly organic visits, summed across every site we track, for each of the 34 months of history we hold.

Combined monthly visits, all sites

2023-09 to 2026-06
Combined estimated monthly visits across all tracked sites, by month

Month    Estimated visits
2023-09  205,089,034,550
2023-10  203,383,054,118
2023-11  202,056,799,595
2023-12  191,484,293,315
2024-01  195,108,818,492
2024-02  196,060,785,936
2024-03  200,578,566,016
2024-04  203,540,976,125
2024-05  205,807,062,509
2024-06  217,431,161,704
2024-07  223,606,288,951
2024-08  189,460,143,172
2024-09  214,395,709,997
2024-10  217,526,604,090
2024-11  221,507,763,466
2024-12  177,740,633,306
2025-01  144,944,468,054
2025-02  147,374,699,965
2025-03  135,354,065,579
2025-04  134,917,126,264
2025-05  135,839,329,117
2025-06  141,431,413,500
2025-07  140,910,122,621
2025-08  138,209,879,394
2025-09  134,569,970,983
2025-10  135,343,664,939
2025-11  147,950,040,281
2025-12  146,807,704,201
2026-01  146,142,952,355
2026-02  139,102,381,712
2026-03  150,404,295,070
2026-04  149,157,296,920
2026-05  146,490,945,614
2026-06  146,556,841,249

Sites by monthly visits

2026-06
  • 1–1K: 198,326 · 22.5%
  • 1K–10K: 377,954 · 42.9%
  • 10K–100K: 233,339 · 26.5%
  • 100K–1M: 61,545 · 7.0%
  • 1M–10M: 8,573 · 1.0%
  • 10M+: 1,088 · 0.1%
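
The bucket labels above share their boundary values (1K ends one range and starts the next), so a site sitting exactly on an edge needs a tie-break rule. A small sketch of one way to assign buckets, assuming each range includes its lower bound and excludes its upper bound (the edge handling and the function name are assumptions; the page does not state them):

```python
from bisect import bisect_right

# Upper edges of every bucket except the open-ended last one.
EDGES = [1_000, 10_000, 100_000, 1_000_000, 10_000_000]
LABELS = ["1–1K", "1K–10K", "10K–100K", "100K–1M", "1M–10M", "10M+"]


def visit_bucket(monthly_visits: int) -> str:
    """Return the traffic bucket label for a monthly visit estimate."""
    # bisect_right counts how many edges are <= the value, which is
    # exactly the index of the bucket under the lower-inclusive rule.
    return LABELS[bisect_right(EDGES, monthly_visits)]


print(visit_bucket(999))         # just below the first edge
print(visit_bucket(1_000))       # exactly on an edge
print(visit_bucket(25_000_000))  # open-ended top bucket
```
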
04

Platforms & CMS

What each site is built on, detected from the homepage during crawl.

Top platforms

Showing top 18 of 2,885 distinct values

05

Frameworks & front-end

JavaScript frameworks the site is built with, and the front-end libraries on the page.

JavaScript frameworks

32 unique

Showing top 16 of 32 distinct values

JavaScript libraries

86 unique

Showing top 16 of 86 distinct values

06

Hosting & infrastructure

Origin hosts, CDN tiers, and web servers seen in front of each site.

Hosting & CDN providers

45 unique

Showing top 20 of 45 distinct values

Web servers

16 unique

Showing top 14 of 16 distinct values

07

Analytics & tracking

Measurement and tag-management tools detected in page markup.

Analytics tools

Showing top 20 of 80 distinct values

08

WordPress ecosystem

300,931 sites expose a WordPress theme. Here's what they run.

Top themes

75,230 unique

Showing top 20 of 75,230 distinct values

Top plugins

62,060 unique

Showing top 24 of 62,060 distinct values

09

Structured data

Schema.org @type values found in homepage JSON-LD.

Schema types

Showing top 20 of 2,167 distinct values
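
As an illustration of how @type values can be pulled from homepage JSON-LD, here is a minimal sketch using only the Python standard library. It assumes types are collected from every `<script type="application/ld+json">` block, including nested objects and `@graph` arrays; the class and function names are hypothetical, and the real crawler may work differently.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect the raw text of every JSON-LD script block in a page."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buf = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buf = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buf))
            self._in_jsonld = False


def schema_types(html: str) -> list[str]:
    """Return the sorted, de-duplicated @type values found in JSON-LD."""
    parser = JsonLdCollector()
    parser.feed(html)
    parser.close()
    found = set()

    def walk(node):
        # Recurse through dicts and lists so nested and @graph items count.
        if isinstance(node, dict):
            value = node.get("@type")
            if isinstance(value, str):
                found.add(value)
            elif isinstance(value, list):
                found.update(v for v in value if isinstance(v, str))
            for child in node.values():
                walk(child)
        elif isinstance(node, list):
            for child in node:
                walk(child)

    for raw in parser.blocks:
        try:
            walk(json.loads(raw))
        except json.JSONDecodeError:
            continue  # skip malformed blocks rather than fail the page
    return sorted(found)
```

Malformed JSON-LD is skipped rather than raised, in keeping with best-effort detection.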

10

Monetization

Ad networks and commerce platforms detected per site (a site can use several).

Monetization methods

Showing top 18 of 110 distinct values

11

Commerce, payments & email

Payment processors detected at checkout, and the email / newsletter platforms sites run.

Payment processors

112 unique

Showing top 16 of 112 distinct values

Email & newsletter platforms

65 unique

Showing top 16 of 65 distinct values

12

Security & trust

HTTP security-header grade, DMARC email policy, and cookie-consent platforms across the catalog.

Security grade

A+ to F, as a share of the 662,164 sites that received a grade
  • A+: 7,516 · 1.1%
  • A: 31,941 · 4.8%
  • B: 44,726 · 6.8%
  • C: 58,481 · 8.8%
  • D: 90,384 · 13.6%
  • E: 49,372 · 7.5%
  • F: 379,744 · 57.3%

DMARC policy
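
A domain publishes its DMARC policy in a DNS TXT record at `_dmarc.<domain>`, in the `p=` tag. A minimal sketch of reading the policy from a record's text, assuming the TXT value has already been fetched (the DNS lookup is left out, and the function name is hypothetical):

```python
VALID_POLICIES = {"none", "quarantine", "reject"}


def dmarc_policy(txt_record: str):
    """Return 'none', 'quarantine' or 'reject', or None if not valid DMARC."""
    tags = {}
    for part in txt_record.split(";"):
        if "=" in part:
            key, value = part.split("=", 1)
            tags[key.strip().lower()] = value.strip()
    # Records that are not DMARC (for example SPF) are ignored.
    if tags.get("v") != "DMARC1":
        return None
    policy = tags.get("p", "").lower()
    return policy if policy in VALID_POLICIES else None


print(dmarc_policy("v=DMARC1; p=reject; rua=mailto:dmarc@example.com"))
```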

Cookie-consent platforms

55 unique

Showing top 14 of 55 distinct values

13

AI & bot access

Crawlers disallowed in robots.txt, from AI trainers like GPTBot and ClaudeBot to SEO and search bots. A site can block several.

Most-blocked crawlers

Showing top 20 of 10,135 distinct values
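
To make "disallowed in robots.txt" concrete, here is a minimal sketch that lists the named user agents a robots.txt file blocks from the whole site. It assumes a bot counts as blocked only when its group contains `Disallow: /`, and it skips the `*` wildcard group; both rules and the function name are assumptions for illustration, not a description of the actual detector.

```python
def blocked_bots(robots_txt: str) -> list[str]:
    """Return named user agents that are disallowed from the entire site."""
    blocked = set()
    agents = []        # user agents in the group currently being read
    seen_rule = False  # True once the current group has an allow/disallow

    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if seen_rule:
                # A user-agent line after rules starts a new group.
                agents, seen_rule = [], False
            agents.append(value)
        elif field in ("allow", "disallow"):
            seen_rule = True
            if field == "disallow" and value == "/":
                blocked.update(a for a in agents if a != "*")
    return sorted(blocked)


sample = """
User-agent: GPTBot
User-agent: ClaudeBot
Disallow: /

User-agent: *
Disallow: /private/
"""
print(blocked_bots(sample))
```

Because several user-agent lines can share one group, a single `Disallow: /` can block more than one crawler, which is why a site can appear under several bots.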

14

Authority & links

How the catalog spreads across Authority, Domain Rating, and referring-domain counts.

Authority (PageRank)

0–100
  • 0–9: 55,447 · 6.3%
  • 10–19: 88,108 · 10.0%
  • 20–29: 163,380 · 18.5%
  • 30–39: 206,481 · 23.4%
  • 40–49: 176,684 · 20.1%
  • 50–59: 111,067 · 12.6%
  • 60–69: 53,430 · 6.1%
  • 70–79: 20,816 · 2.4%
  • 80+: 5,445 · 0.6%

Domain Rating (Ahrefs)

0–100
  • 0–9: 219,258 · 24.9%
  • 10–19: 121,779 · 13.8%
  • 20–29: 121,001 · 13.7%
  • 30–39: 117,329 · 13.3%
  • 40–49: 84,205 · 9.6%
  • 50–59: 75,125 · 8.5%
  • 60–69: 52,771 · 6.0%
  • 70–79: 62,027 · 7.0%
  • 80+: 27,363 · 3.1%

Referring domains

  • 0–10: 10,709 · 1.2%
  • 11–100: 180,013 · 20.4%
  • 101–1K: 450,121 · 51.1%
  • 1K–10K: 215,610 · 24.5%
  • 10K–100K: 23,002 · 2.6%
  • 100K+: 1,403 · 0.2%
15

Keywords & pages

Search footprint and crawled page counts per site.

Organic keywords

  • 0–100: 70,183 · 8.0%
  • 101–1K: 392,060 · 44.5%
  • 1K–10K: 353,020 · 40.1%
  • 10K–100K: 59,456 · 6.7%
  • 100K–1M: 5,669 · 0.6%
  • 1M+: 448 · <0.1%

Pages per site

  • 1–50: 82,062 · 15.1%
  • 51–500: 210,027 · 38.7%
  • 501–5K: 175,916 · 32.4%
  • 5K–50K: 56,071 · 10.3%
  • 50K+: 18,281 · 3.4%
16

Domain age

When each domain was first registered.

Domains by registration year

1985 to 2026
Number of tracked domains by registration year

Year  Domains
1985  254
1986  321
1987  319
1988  194
1989  308
1990  346
1991  486
1992  565
1993  882
1994  2,988
1995  10,262
1996  21,474
1997  23,473
1998  24,057
1999  28,355
2000  27,274
2001  18,651
2002  21,145
2003  20,823
2004  20,951
2005  20,619
2006  20,189
2007  20,607
2008  21,789
2009  24,177
2010  25,240
2011  26,739
2012  27,602
2013  29,709
2014  30,873
2015  32,518
2016  33,413
2017  34,733
2018  35,802
2019  35,404
2020  37,514
2021  30,541
2022  25,902
2023  24,531
2024  20,489
2025  17,002
2026  4,774
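
Registration dates come from RDAP/WHOIS records. In an RDAP domain response, the registration time appears in the `events` array as the entry whose `eventAction` is `registration`. A minimal sketch of turning such a response into a registration year, assuming the JSON has already been fetched and parsed (the function name and sample values are illustrative):

```python
from datetime import datetime


def registration_year(rdap_response: dict):
    """Return the year of the RDAP 'registration' event, or None if absent."""
    for event in rdap_response.get("events", []):
        if event.get("eventAction") == "registration":
            # Older Python versions do not parse a trailing 'Z', so
            # rewrite it as an explicit UTC offset first.
            stamp = event["eventDate"].replace("Z", "+00:00")
            return datetime.fromisoformat(stamp).year
    return None


sample = {
    "events": [
        {"eventAction": "last changed", "eventDate": "2024-01-02T03:04:05Z"},
        {"eventAction": "registration", "eventDate": "1997-09-15T04:00:00Z"},
    ]
}
print(registration_year(sample))
```
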
17

Content freshness

Publishing signals derived from sitemap timestamps and feeds.

  • Posted in last 90 days: 365,599 (75.1% of dated sites)
  • Posted in last year: 428,507 (88.0% of dated sites)
  • Publish RSS / Atom: 292,426 (33.2% of all sites)
  • Publish llms.txt: 138,269 (15.7% of all sites)
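
The two recency figures depend on how old a site's last-published date is on the day the numbers are computed. A small sketch of that classification, assuming fixed windows of 90 and 365 days counted back from a reference date (the window lengths in days, the inclusive comparison, and the function name are assumptions):

```python
from datetime import date, timedelta


def freshness(last_published: date, today: date) -> dict:
    """Flag whether a site published within the last 90 days and last year."""
    age = today - last_published
    return {
        "last_90_days": age <= timedelta(days=90),
        "last_year": age <= timedelta(days=365),
    }


reference = date(2026, 7, 1)  # the page's last-updated date
print(freshness(date(2026, 5, 1), reference))
print(freshness(date(2025, 12, 1), reference))
```

A site only enters these counts if it has a last-published date at all, which is why the shares are of dated sites rather than of the whole catalog.
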
18

Languages

Primary language detected from each site's content.

Top languages

Showing top 20 of 69 distinct values

19

Topics covered

The most common topical categories assigned across the catalog.

Top topics

Showing top 25 of 3,112 distinct values

Figures are computed from the live database; last updated July 1, 2026. Detection is best-effort from public homepage markup, DNS/TLS, and third-party metrics, so treat shares as indicative of the catalog, not the whole web.