crUX top lists — screenshot of github.com

crUX top lists

This project caches the top 1 million websites based on CrUX data from Google BigQuery. I find this a more accurate and robust dataset for web research compared to other top lists.

Visit github.com →

Questions & Answers

What is crUX top lists?
The crUX top lists repository provides cached CSV files of the top one million most popular websites, sourced directly from Google's public Chrome UX Report (CrUX) data in Google BigQuery. These lists are updated monthly and are freely available for download.
Who should use crUX top lists?
This resource is ideal for researchers, developers, or anyone needing an accurate list of the most popular websites on the internet. It is particularly useful for studies on web browsing behavior or website trends, especially when country-specific data is relevant.
How do CrUX top lists differ from other website ranking lists?
Unlike lists such as Alexa Top Million, CrUX top lists are based on actual user experience data from Chrome, making them significantly more accurate. They identify websites by origin, not FQDN, and rank them by completed page loads, categorizing them into rank magnitude buckets rather than specific numerical ranks.
When is it appropriate to use the CrUX top lists?
Use CrUX top lists when you need a highly representative dataset of the internet's most visited sites, especially for analyses where capturing over 95% of Chrome user traffic is sufficient. It is suitable for research on web popularity, performance, or trends.
What is the update frequency and data format of these lists?
The CrUX top lists are updated monthly, typically on the second Tuesday of each month. The data is provided in a gzipped CSV format, with each entry consisting of an origin URL and its rank magnitude.