Onion Peeler

A configuration-driven, concurrent web scraping system.

Documentation Structure¶

Our documentation uses the Diátaxis framework to organize information effectively. Choose the category that best matches your current needs:

Tutorials

Learning-oriented step-by-step guides. Best for your first time running the scraper.
How-To Guides

Problem-oriented instructions. Practical steps to achieve a specific goal, like building a new configuration.
Explanations

Understanding-oriented theory. High-level discussions on ethics, system architecture, and design decisions.
API Reference

Information-oriented technical specs, pipeline layouts, and auto-generated code documentation.

No-Code Configuration

Build robust Scrapy spiders using entirely dynamic .toml files. No need to repeatedly write Python classes for every new website added to the target list.
Page Object Models

Deep integration with scrapy-poet allows you to decouple extraction parsing from raw spider mechanics, making updates trivial when a site's layout changes.
Anti bot detection

Engineered to provide avoid account bans and adapt to request limits, providing comprehensive data extraction