Geziyor — screenshot of github.com

Geziyor

Geziyor is a Go-based web crawling and scraping framework. It's built for performance, supporting JS rendering and handling high request volumes for extracting structured data efficiently.

Visit github.com →

Questions & Answers

What is Geziyor?
Geziyor is a fast web crawling and web scraping framework written in Go. It enables users to crawl websites and extract structured data, supporting various functionalities like JS rendering and automatic data exporting.
Who should use Geziyor?
Geziyor is designed for developers and data engineers who need to perform data mining, monitoring, or automated testing requiring efficient web data extraction. Its Go-based implementation appeals to those working within the Go ecosystem.
What makes Geziyor stand out from other web scraping frameworks?
Geziyor distinguishes itself with its focus on speed, claiming over 5,000 requests/sec, and its built-in JavaScript rendering capabilities for dynamic content. It also offers comprehensive features like various caching options, metrics integration, and robust proxy management.
When is Geziyor a good choice for a project?
Geziyor is suitable for projects requiring high-performance web crawling and scraping, especially when dealing with large volumes of requests or websites that rely heavily on JavaScript for content rendering. It's also well-suited for Go-centric development environments.
How does Geziyor handle JavaScript-rendered content?
Geziyor supports JavaScript rendering through its GetRendered method, which by default uses a local Chrome application CLI. Users can also configure it to connect to a different Chrome instance via the BrowserEndpoint option for headless browser automation.