---
name: scrape-web
description: Fetch a web page and extract clean text for indexing
task: research
tools:
  - scrape_web
---

## Workflow

1. Receive a full `https://` URL from the user or orchestrator.
2. Call `scrape_web` with the URL.
3. Return title, extracted text, and final URL metadata.
4. Pass the `ExtractedDocument` to `extract_and_index` for MemRAG storage.

See `references/html-cleanup.md` for extraction settings and `references/allowed-domains.md` for rate-limit notes.