Text Generation
Transformers.js
PyTorch
ONNX
Safetensors
English
gpt2
text-generation-inference
distillation
grpo
vae
agent
education
SLM
small
tiny
smol
distilled
micro
study
testing
blackbox
offline
localdb
Instructions to use webxos/microd_v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers.js
How to use webxos/microd_v1 with Transformers.js:
// npm i @huggingface/transformers import { pipeline } from '@huggingface/transformers'; // Allocate pipeline const pipe = await pipeline('text-generation', 'webxos/microd_v1');
Micro-Distilled GRPO+VAE Model
Model Description
This is a distilled language model trained using Group Relative Policy Optimization (GRPO) with VAE filtering.
Model Details
- Model type: micro-distill-grpo-vae
- Model size: 42M parameters
- Language: English
- License: Apache 2.0
Training Methodology
- GRPO (Group Relative Policy Optimization): 8 groups
- VAE Filtering: 32D latent space
- KV-Cache Reuse: 512 cache size
Architecture Details
- Hidden size: 512
- Number of layers: 8
- Attention heads: 8
- Vocabulary size: 50257
- Maximum sequence length: 1024
Usage
Using Transformers
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("micro-distill-grpo-vae")
tokenizer = AutoTokenizer.from_pretrained("micro-distill-grpo-vae")
inputs = tokenizer("Hello, world!", return_tensors="pt")
outputs = model.generate(**inputs, max_length=50)
print(tokenizer.decode(outputs[0]))