Benchmarking Visual State Tracking in Multimodal Video Understanding • Paper • 2606.03920 • Published 16 days ago • 48
Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations • Paper • 2606.10614 • Published 9 days ago • 25
MARS: Modular Agent with Reflective Search for Automated AI Research • Paper • 2602.02660 • Published Feb 2 • 67
Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance • Paper • 2505.16348 • Published May 22, 2025 • 52
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents • Paper • 2505.15277 • Published May 21, 2025 • 105
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation • Paper • 2410.13232 • Published Oct 17, 2024 • 44