Mingjie Bi
mjb95m
AI & ML interests
None yet
Recent Activity
upvoted a paper 24 days ago
Less is More: Early Stopping Rollout for On-Policy Distillation liked a dataset 6 months ago
bigai/TongSIM-Asset upvoted a paper 11 months ago
On the Generalization of SFT: A Reinforcement Learning Perspective with
Reward Rectification