Web-Shepherd: Advancing PRMs for Reinforcing Web Agents - a LangAGI-Lab Collection

updated May 22, 2025

LangAGI-Lab/WebShepherd_8B

8B • Updated Sep 10, 2025 • 13 • 5
LangAGI-Lab/WebShepherd_3B

3B • Updated Sep 10, 2025 • 7 • 1
LangAGI-Lab/WebPRMCollection_preference_pair

Viewer • Updated May 22, 2025 • 9.46k • 620 • 1
LangAGI-Lab/WebRewardBench

Viewer • Updated May 22, 2025 • 776 • 182
LangAGI-Lab/WebPRMCollection_checklist_generation

Viewer • Updated May 19, 2025 • 3.63k • 181
LangAGI-Lab/WebShepherd_checklist_generation_only_8B

Feature Extraction • 8B • Updated May 19, 2025 • 3 • 1
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published May 21, 2025 • 105