EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management
Abstract
EvoDS introduces a self-evolving autonomous data science agent that enhances its capabilities through skill acquisition and adaptive context management via reinforcement learning.
Recent progress in Large Language Model (LLM) agents has enabled promising advances in automated data science. However, existing approaches remain fundamentally limited by their static action sets and lack of principled long-horizon context management, hindering their ability to accumulate reusable experience across tasks and operate reliably in multi-stage, iterative data science pipelines. To address these challenges, we introduce EvoDS, a self-evolving autonomous data science agent that learns to expand its skills and adaptively manage long-term context through agentic reinforcement learning. Specifically, EvoDS introduces two key strategies: (1) an Autonomous Skill Acquisition (ASA) mechanism, which enables agents to synthesize, validate, and reuse executable skills; and (2) an Adaptive Context Compression (ACC) strategy, which treats context management as a learned control problem rather than passive truncation. These strategies are orchestrated within a two-stage multi-agent training scheme, enabling EvoDS to autonomously improve over time. Theoretically, we prove that EvoDS's hierarchical design reduces tool-selection error, and that its optimization objective aligns with an information bottleneck principle, ensuring efficient context use. Empirically, EvoDS outperforms state-of-the-art open-source data science agents by an average of 28.9% across four diverse benchmarks while eliminating out-of-token failures. Our code and data are available at https://github.com/usail-hkust/EvoDS.
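To make the two ideas in the abstract concrete, here is a minimal Python sketch. It is not the EvoDS implementation: `SkillLibrary`, `register`, `truncate`, and `compress` are hypothetical names chosen for illustration. The first part shows a skill being admitted to a library only after it passes validation checks, then being reused. The second part contrasts passive truncation with a compression step that folds older context into a summary. In the paper the decision of when and how to compress is learned with reinforcement learning; the fixed rule below is only a stand-in for that policy.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List, Sequence, Tuple


@dataclass
class SkillLibrary:
    """Hypothetical store of validated, reusable skills (not the paper's API)."""
    skills: Dict[str, Callable[..., Any]] = field(default_factory=dict)

    def register(self, name: str, fn: Callable[..., Any],
                 checks: Sequence[Tuple[tuple, Any]]) -> bool:
        """Admit a candidate skill only if it passes every (args, expected) check."""
        for args, expected in checks:
            try:
                if fn(*args) != expected:
                    return False
            except Exception:
                return False
        self.skills[name] = fn
        return True

    def call(self, name: str, *args: Any) -> Any:
        """Reuse a previously validated skill by name."""
        return self.skills[name](*args)


def truncate(history: List[str], budget: int) -> List[str]:
    """Passive baseline: keep only the most recent `budget` entries (budget >= 1)."""
    return history[-budget:]


def compress(history: List[str], budget: int,
             summarize: Callable[[List[str]], str]) -> List[str]:
    """Assumed stand-in for a compression action (budget >= 1).

    Older entries are folded into one summary entry so the result fits the
    budget. Here the trigger is a fixed length check, not a learned policy.
    """
    if len(history) <= budget:
        return list(history)
    split = len(history) - (budget - 1)  # entries before `split` get summarized
    return [summarize(history[:split])] + history[split:]
```

A candidate such as a column-mean function would be passed to `register` together with a few input/output checks; a candidate that fails any check, or raises, never enters the library. Unlike `truncate`, `compress` keeps a trace of the older steps, which is the difference the abstract draws between passive truncation and managed context.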
Community
Accepted by KDD2026
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- ANDES: Agent Native Data Evolving Synthesis Tool for Autonomous Instruction Alignment (2026)
- Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles (2026)
- EvoAgent: An Evolvable Agent Framework with Skill Learning and Multi-Agent Delegation (2026)
- Harnessing LLM Agents with Skill Programs (2026)
- Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks (2026)
- SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories (2026)
- From History to State: Constant-Context Skill Learning for LLM Agents (2026)