
metadata
name: scrape-web
description: Fetch a web page and extract clean text for indexing
task: research
tools:
  - scrape_web

Workflow

  1. Receive a full https:// URL from the user or orchestrator.
  2. Call scrape_web with the URL.
  3. Return the page title, the extracted text, and the final URL as metadata.
  4. Pass the ExtractedDocument to extract_and_index for MemRAG storage.
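
The four steps above can be sketched in Python as follows. This is only an illustration under stated assumptions: the real signatures of scrape_web and extract_and_index are not given here, so both are passed in as plain callables, and the ExtractedDocument fields (title, text, final_url) are guessed from step 3. The helper names validate_url and run_scrape_skill are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable
from urllib.parse import urlparse


@dataclass
class ExtractedDocument:
    # Assumed shape: field names are inferred from step 3, not from the real code.
    title: str
    text: str
    final_url: str


def validate_url(url: str) -> str:
    """Step 1: accept only a full https:// URL that includes a host."""
    cleaned = url.strip()
    parsed = urlparse(cleaned)
    if parsed.scheme != "https" or not parsed.netloc:
        raise ValueError(f"expected a full https:// URL, got {url!r}")
    return cleaned


def run_scrape_skill(
    url: str,
    scrape_web: Callable[[str], ExtractedDocument],
    extract_and_index: Callable[[ExtractedDocument], object],
) -> ExtractedDocument:
    """Run steps 1-4 with the two tools injected as callables (an assumption)."""
    checked = validate_url(url)   # step 1: validate the incoming URL
    doc = scrape_web(checked)     # step 2: fetch the page and extract clean text
    extract_and_index(doc)        # step 4: hand the document off for MemRAG storage
    return doc                    # step 3: title, text, and final URL go back to the caller


if __name__ == "__main__":
    # Stand-in tools so the sketch runs without network access.
    stored = []

    def fake_scrape(u: str) -> ExtractedDocument:
        return ExtractedDocument(title="Example", text="Hello", final_url=u)

    result = run_scrape_skill("https://example.com/page", fake_scrape, stored.append)
    print(result.title, result.final_url, len(stored))
```

Injecting the tools as arguments keeps the sketch independent of however the orchestrator actually registers them, and rejecting non-https input up front means scrape_web is never called with a partial or relative URL.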

See references/html-cleanup.md for extraction settings and references/allowed-domains.md for rate-limit notes.