Step 1: List Page Scraping
Collect detail URLs from a list page
How It Works
In Step 1, you provide:
- A **source URL** — the list/index page
- A **link CSS selector** — identifies which links to collect
Configuring the Selector
The selector should match anchor tags (`<a>`) that link to detail pages.
Examples
- Blog posts: `article a.title-link`
- Product list: `.product-card a`
- Directory: `ul.listings li a`
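
To make the selector examples concrete, here is a minimal sketch of what collecting links with a selector involves. It is not the tool's actual implementation, which is not described here. It uses only the Python standard library and supports a deliberately simplified selector syntax: tag names and classes joined by descendant (whitespace) combinators. `LinkCollector`, `collect_links`, and `SAMPLE_PAGE` are hypothetical names introduced for illustration.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not stay on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


def parse_selector(selector):
    """Split 'ul.listings li a' into (tag, classes) steps, one per compound part."""
    steps = []
    for part in selector.split():
        tag, *classes = part.split(".")
        steps.append((tag or None, set(classes)))
    if not steps:
        raise ValueError("selector is empty")
    return steps


def step_matches(step, element):
    """True if the element has the step's tag (if any) and all of its classes."""
    tag, classes = step
    el_tag, el_classes = element
    return (tag is None or tag == el_tag) and classes <= el_classes


class LinkCollector(HTMLParser):
    """Collects href values from elements matched by a descendant selector."""

    def __init__(self, selector):
        super().__init__()
        self.steps = parse_selector(selector)
        self.stack = []   # currently open elements as (tag, set_of_classes)
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        element = (tag, set((attrs.get("class") or "").split()))
        if step_matches(self.steps[-1], element) and self._ancestors_match():
            if attrs.get("href"):
                self.hrefs.append(attrs["href"])
        if tag not in VOID_TAGS:
            self.stack.append(element)

    def handle_endtag(self, tag):
        # Pop back to the most recent open element with this tag, if any.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break

    def _ancestors_match(self):
        # The remaining steps must match ancestors in order, nearest first.
        remaining = self.steps[:-1]
        for element in reversed(self.stack):
            if remaining and step_matches(remaining[-1], element):
                remaining = remaining[:-1]
        return not remaining


def collect_links(html, selector):
    """Return the href of every element in `html` matched by `selector`."""
    collector = LinkCollector(selector)
    collector.feed(html)
    collector.close()
    return collector.hrefs


# Hypothetical list page with navigation and footer links around the listing.
SAMPLE_PAGE = """
<nav><a href="/about">About</a></nav>
<ul class="listings">
  <li><a href="/listing/1">First</a></li>
  <li><a href="/listing/2">Second</a></li>
</ul>
<footer><a href="/terms">Terms</a></footer>
"""

detail_urls = collect_links(SAMPLE_PAGE, "ul.listings li a")
print(detail_urls)  # ['/listing/1', '/listing/2']
```

With the broader selector `a`, the same sample page would also yield `/about` and `/terms`, which is why the examples scope the anchor to a container such as `ul.listings` or `.product-card`.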
Testing
Always click **Test Scrape** before running the full task. This will:
- Fetch the list page
- Apply your CSS selector
- Show the collected URLs in a preview
If the preview looks wrong, adjust your selector and test again.
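
What "looks wrong" means depends on the site, but a few checks are easy to automate if you copy the previewed URLs out of the tool. The sketch below is a hypothetical helper, not a feature of the tool. It assumes the preview gives you a plain list of URL strings and reports the total count, any duplicates, and any absolute URLs that point at a different host than the source URL.

```python
from collections import Counter
from urllib.parse import urlparse


def review_preview(urls, source_url):
    """Summarize a list of collected URLs so common problems are easy to spot."""
    source_host = urlparse(source_url).netloc
    counts = Counter(urls)
    return {
        "total": len(urls),
        # URLs that appear more than once in the preview.
        "duplicates": sorted(u for u, n in counts.items() if n > 1),
        # Absolute URLs on another host; relative URLs have an empty netloc.
        "off_site": sorted({u for u in urls
                            if urlparse(u).netloc not in ("", source_host)}),
    }


# Hypothetical preview output for a blog list page.
report = review_preview(
    [
        "https://example.com/posts/1",
        "https://example.com/posts/2",
        "https://example.com/posts/1",
        "https://twitter.com/share",
    ],
    source_url="https://example.com/blog",
)
print(report)
```

Duplicates or off-site entries in a report like this usually suggest the selector is matching more than the detail links, for example a share button or a thumbnail link that repeats the title link.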
Tips
- Use your browser's Developer Tools (F12) to inspect the page and find the right selector
- Be specific enough to avoid collecting unwanted links (navigation, footer, etc.)
- The `href` attribute is extracted automatically from each matched element, so the selector should match the `<a>` tag itself rather than a wrapper or child element
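
An `href` value in page markup can be absolute or relative. This page does not say whether the tool resolves relative values, so check the preview for that. For reference, the sketch below shows how relative `href` values resolve against a source URL using Python's standard `urljoin`. The URLs are hypothetical.

```python
from urllib.parse import urljoin

source_url = "https://example.com/blog/"  # hypothetical list page URL
hrefs = ["/posts/1", "posts/2", "https://example.com/posts/3"]

# Resolve each href against the page it was found on.
absolute_urls = [urljoin(source_url, href) for href in hrefs]
for url in absolute_urls:
    print(url)
# https://example.com/posts/1
# https://example.com/blog/posts/2
# https://example.com/posts/3
```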