Steven's picture

Steven

yijunyang

·

stevenyangyj

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

authored a paper 2 months ago

System-2 Mathematical Reasoning via Enriched Instruction Tuning

authored a paper 2 months ago

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

View all activity

Organizations

Papers 12

arxiv:2603.24533

arxiv:2602.05327

arxiv:2512.13043

arxiv:2512.02631

models 1

yijunyang/instructblip-sft-alfworld

Updated Mar 20, 2024

datasets 1

yijunyang/alfworld-sft-dataset

Preview • Updated Mar 14, 2024 • 17