Jen Wei
bird-of-paradise
AI & ML interests
Training Dynamics & Model Systems | RL & Scaling
Recent Activity
liked a Space 13 days ago: HuggingFaceBio/carbon-tokenization
published an article about 1 month ago: DeepSeek Engram × OLMo-core: Distributed Implementation
published an article 3 months ago: The Three Horsemen of Numerical Divergence in Hybrid Models