In a Training Loop 🔄

1 82 34

Joel Wang

joelhenwang

joelhenwang

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation

upvoted a paper about 2 hours ago

MemTrain: Self-Supervised Context Memory Training

upvoted a paper about 2 hours ago

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

View all activity

Organizations

upvoted 3 papers about 2 hours ago

Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation

Paper • 2606.02684 • Published 8 days ago • 16

MemTrain: Self-Supervised Context Memory Training

Paper • 2606.03197 • Published 7 days ago • 17

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Paper • 2605.30039 • Published 11 days ago • 18

upvoted 11 papers about 4 hours ago

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

Paper • 2606.06473 • Published 5 days ago • 18

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search

Paper • 2605.29796 • Published 12 days ago • 25

Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation

Paper • 2605.26844 • Published 14 days ago • 26

Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories

Paper • 2606.03979 • Published 7 days ago • 28

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

Paper • 2606.04743 • Published 6 days ago • 40

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published 11 days ago • 65

Trust Region On-Policy Distillation

Paper • 2606.01249 • Published 9 days ago • 42

upvoted 4 papers 1 day ago

Mem-π: Adaptive Memory through Learning When and What to Generate

Paper • 2605.21463 • Published 20 days ago • 8

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Paper • 2605.30260 • Published 12 days ago • 42

Parameter Efficiency Is Not Memory Efficiency: Rethinking Fine-Tuning for On-Device LLM Adaptation

Paper • 2604.22783 • Published Apr 3 • 1

Self-Pruned Key-Value Attention: Learning When to Write by Predicting Future Utility

Paper • 2605.14037 • Published 27 days ago • 1

upvoted a paper 6 days ago

NITP: Next Implicit Token Prediction for LLM Pre-training

Paper • 2605.24956 • Published 16 days ago • 35

upvoted a paper 11 days ago

Rethinking Memory as Continuously Evolving Connectivity

Paper • 2605.28773 • Published 13 days ago • 34

Joel Wang

AI & ML interests

Recent Activity

Organizations

joelhenwang's activity