Alpine Chrome
Alpine Chrome is a lightweight Docker image, based on Alpine Linux, for running headless Chrome. I find it ideal for CI/CD pipelines and automated tasks that need a browser without the overhead of a GUI.
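To give a feel for how an image like this is typically driven from a CI script, here is a minimal sketch that only builds the `docker run` command line and prints it. The image name `zenika/alpine-chrome` is an assumption on my part (the text above does not name one), and the helper `chrome_dump_dom_command` is hypothetical. `--no-sandbox` and `--dump-dom` are standard Chrome command-line switches.

```python
import shlex
from typing import List


def chrome_dump_dom_command(url: str, image: str = "zenika/alpine-chrome") -> List[str]:
    """Build a docker command that prints the rendered DOM of `url`.

    The default image name is an assumption; swap in whichever
    Alpine-based headless Chrome image you actually use.
    """
    return [
        "docker", "run", "--rm",  # remove the container when it exits
        image,
        "--no-sandbox",           # commonly needed when Chrome runs inside a container
        "--dump-dom",             # print the serialized DOM to stdout
        url,
    ]


cmd = chrome_dump_dom_command("https://example.com")
print(shlex.join(cmd))
```

In a real pipeline the same list could be passed to `subprocess.run(cmd, capture_output=True)`. Building the command as a list rather than a string avoids shell-quoting problems with unusual URLs.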
This category collects tools and frameworks essential for web scraping and browser automation. It covers lightweight headless browser setups like Alpine Chrome for CI/CD, robust developer frameworks such as Colly for Go, and no-code solutions like Simplescraper. I've also included services like Browserless for scalable, bot-detection-resistant operations with headless browsers.
Simplescraper turns website content directly into structured data, or even a functional API, without any coding. It's an accessible tool for quick data retrieval.
Colly is my preferred Go scraping framework. It offers a clean, fast API with essential features like concurrency management, caching, and robots.txt support, making web data extraction straightforward.
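To make "robots.txt support" concrete, here is a small sketch of the check a scraper performs before visiting a URL. It is not Colly's API and not Go: it is a stand-in using Python's standard-library `urllib.robotparser`. The robots.txt content, the `demo-bot` user agent, and the helper `allowed_urls` are all made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, inlined so the example needs no network access.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def allowed_urls(robots_txt, user_agent, urls):
    """Return only the URLs that the given robots.txt lets `user_agent` fetch."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [u for u in urls if parser.can_fetch(user_agent, u)]


candidates = [
    "https://example.com/",
    "https://example.com/private/report",
    "https://example.com/blog/post",
]
print(allowed_urls(ROBOTS_TXT, "demo-bot", candidates))
```

This prints the first and third URLs and drops the one under `/private/`. A framework with built-in robots.txt handling does this filtering for you on every request.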
Browserless provides a robust "browsers as a service" platform for scalable web scraping and automation. Its BrowserQL technology is built to get past common bot detection, which is a significant advantage for keeping Puppeteer or Playwright workflows reliable.