Gray Stanton's picture

Gray Stanton

GrStant

·

http://www.graystanton.com

gray-stanton

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

krea/Krea-2-Raw

liked a model 1 day ago

krea-community/Krea-2

liked a model 1 day ago

Boogu/Boogu-Image-0.1-Edit

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

Paper • 2606.16700 • Published 10 days ago • 12

upvoted a paper 19 days ago

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

Paper • 2606.05553 • Published 21 days ago • 48

upvoted a paper about 1 month ago

Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models

Paper • 2605.09241 • Published May 10 • 2

upvoted 3 papers 2 months ago

Strips as Tokens: Artist Mesh Generation with Native UV Segmentation

Paper • 2604.09132 • Published Apr 10 • 56

ELT: Elastic Looped Transformers for Visual Generation

Paper • 2604.09168 • Published Apr 10 • 24

Multi-User Large Language Model Agents

Paper • 2604.08567 • Published Mar 19 • 27

upvoted 9 papers 3 months ago

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published Apr 8 • 73

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published Mar 23 • 29

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published Mar 22 • 78

Teaching an Agent to Sketch One Part at a Time

Paper • 2603.19500 • Published Mar 19 • 5

Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

Paper • 2603.19987 • Published Mar 20 • 9

Effective Distillation to Hybrid xLSTM Architectures

Paper • 2603.15590 • Published Mar 16 • 34

Mixture-of-Depths Attention

Paper • 2603.15619 • Published Mar 16 • 81

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 189

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published Mar 16 • 155

upvoted 3 papers 4 months ago

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

Paper • 2602.16849 • Published Feb 18 • 7

Unified Latents (UL): How to train your latents

Paper • 2602.17270 • Published Feb 19 • 61

Arcee Trinity Large Technical Report

Paper • 2602.17004 • Published Feb 19 • 21

upvoted a collection over 1 year ago

QTIP Quantized Models

See https://github.com/Cornell-RelaxML/qtip • 27 items • Updated Mar 2 • 13