GoofyLM Lab

community

Verified

https://goofylm.site/

https://github.com/GoofyLM

Activity Feed Request to join this org

AI & ML interests

Making funny and goofy LM's and AI's

Recent Activity

FlameF0X submitted a paper 24 days ago

Triplet-Block Diffusion RWKV

aquiffoo updated a collection 11 months ago

FlameF0X updated a model 11 months ago

GoofyLM/N2.1-Eye-1.3B

View all activity

FlameF0X

posted an update 11 days ago

Post

197

My models on the Intel Low-Bit LLM Leaderboard

Figured I'd share where my quantized models landed on Intel/low_bit_open_llm_leaderboard since I hadn't posted about it yet.

FlameF0X/Qwen3-4B-Distilled-Claude-4.6 (NVFP4 and MXFP4) sit at ranks 23 and 24 with 62.68% and 61.18% average, right below the base Qwen3-4B. Not bad considering they were distilled from Claude 4.6 rather than trained from scratch.

FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6 and FlameF0X/LFM2.5-1.2B-Thinking-CodeX land around rank 47-49, competitive with MiniCPM5-1B and the Qwen3 sub-1B models despite being a larger base architecture.

The funny one is FlameF0X/Qwen2-0.2B-pt and FlameF0X/Qwen2-0.2B-it. They're not properly trained — genuinely undertrained, basically undefined — and they still beat openai/gpt-oss-20b at rank 66. The 20B model. Not sure what that says but it's something.

FlameF0X/LFM2-Research is at the bottom of my lineup but it's a research artifact, not meant to be competitive.

Chart below showing my models vs nearby competitors, with size vs performance on the left.

Chart made by Claude

1 reply

FlameF0X

posted an update 19 days ago

Post

7152

MiniMax-M3 coming soon.
https://github.com/MiniMax-AI/MiniMax-M3

FlameF0X

submitted a paper to Daily Papers 24 days ago

Triplet-Block Diffusion RWKV

Paper • 2605.25969 • Published 27 days ago • 25

FlameF0X

posted an update about 1 month ago

Post

285

I did some testing on the scalability of FWKV. It hits a speed bottleneck at 1B due to the T4’s bandwidth limitations. Theoretically, it should match RWKV’s inference speed if the GPU had more bandwidth. So the 1B size is not accurate.

FlameF0X

posted an update about 1 month ago

Post

278

Greetings Hugging Face!

I started a new project called **FWKV** (Feed-forward Weighted Key Value, or Floored Weighted Key Value), a RWKV-style LM that uses FFNNs (Feed-Forward Neural Networks) instead of RNN and floor(W·K·V). I'm hoping to make it much more efficient and scalable than RWKV.

So far I have:

- FlameF0X/FWKV-29M — this one is undertrained and doesn't have a Space yet. In the attached image you can see its speed on a T4 compared to models with the same configuration.

The only model that's fully working right now is:
- FlameF0X/FWKV-TinyStories — trained on TinyStories for one epoch. The demo Space is FlameF0X/FWKV-demo.

2 replies

FlameF0X

posted an update 10 months ago

Post

4377

I am very sad to say that the budget in creating of SnowflakeCore-G1 1b and 7b MoE models ran out and I can't pre-train them anymore.

7 replies

aquiffoo

updated a collection 11 months ago

Nx

Collection

Main series of models by GoofyLM. • 6 items • Updated Aug 9, 2025

FlameF0X

updated 3 models 11 months ago

published a model 11 months ago

GoofyLM/N2.3-Eye-1.3B-DEV

Image-Text-to-Text • 1B • Updated Aug 8, 2025 • 5 • 2

FlameF0X

posted an update 11 months ago

Post

821

the training for SnowflakeCore-G1-1B and 7B would be retaken because now I implemented DeepSpeed and management to use two gpus.

FlameF0X

published a model 11 months ago

GoofyLM/N2.2-Eye-1.3B

Image-Text-to-Text • 1B • Updated Aug 8, 2025 • 4 • 2

FlameF0X

updated a collection 11 months ago

Nx

Collection

Main series of models by GoofyLM. • 6 items • Updated Aug 9, 2025

aquiffoo

updated a model 11 months ago

GoofyLM/N2.1-Eye-1.3B

Image-Text-to-Text • 1B • Updated Aug 8, 2025 • 11 • 3

FlameF0X

updated a collection 11 months ago

Nx

Collection

Main series of models by GoofyLM. • 6 items • Updated Aug 9, 2025

FlameF0X

in GoofyLM/N2.1-Eye-1.3B 11 months ago

Adding `safetensors` variant of this model

#1 opened 11 months ago by

SFconvertbot

FlameF0X

published a model 11 months ago

GoofyLM/N2.1-Eye-1.3B

Image-Text-to-Text • 1B • Updated Aug 8, 2025 • 11 • 3

FlameF0X

posted an update 11 months ago

Post

278

The development of SnowflakeCore-G1-7B-MoE it getting delay. In the mean time I am working on SnowflakeCore-G1-1B-MoE witch would be a pre-train chatbot.

1 reply

FlameF0X

posted an update 11 months ago

Post

2959

The development of SnowflakeCore-G1-7B-MoE. I can't say when it would be publish yet because it's big and it requires a lot of computational power.

1 reply

AI & ML interests

Recent Activity

Team members 3

GoofyLM's activity

Adding `safetensors` variant of this model