How Scraping Works
Understanding the two-step scraping architecture
Two-Step Architecture
ET59 scraping is built around a two-step process:
Step 1: List Page Scraping
You provide a list/index URL (e.g., a blog archive page). ET59 uses your CSS selector to find all detail page links on that page.
Step 2: Detail Page Extraction
For each collected URL, ET59 visits the page and extracts fields using your defined CSS selectors.
Why Two Steps?
This approach works for any website with a list-detail pattern:
- Blog index → individual articles
- Product catalogs → product detail pages
- Directory listings → profile pages
- Search results → result detail pages
Fail-Fast Behavior
ET59 does not retry on errors. If a page returns an error or the selector doesn't match, the task stops immediately. This ensures:
- Clean, reliable data
- No wasted resources on broken pages
- Clear error messages for debugging
Pagination (Pro)
Pro plan users can enable pagination to automatically follow "next page" links and scrape multiple pages of results.