Text Generation
Transformers.js
PyTorch
ONNX
Safetensors
English
gpt2
text-generation-inference
distillation
grpo
vae
agent
education
SLM
small
tiny
smol
distilled
micro
study
testing
blackbox
offline
localdb
Instructions to use webxos/microd_v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers.js
How to use webxos/microd_v1 with Transformers.js:
// npm i @huggingface/transformers import { pipeline } from '@huggingface/transformers'; // Allocate pipeline const pipe = await pipeline('text-generation', 'webxos/microd_v1');
File size: 1,002 Bytes
6253d52 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 | # Micro-Distilled GRPO+VAE Model
## Model Description
This is a distilled language model trained using Group Relative Policy Optimization (GRPO) with VAE filtering.
## Model Details
- **Model type**: micro-distill-grpo-vae
- **Model size**: 42M parameters
- **Language**: English
- **License**: Apache 2.0
## Training Methodology
- **GRPO (Group Relative Policy Optimization)**: 8 groups
- **VAE Filtering**: 32D latent space
- **KV-Cache Reuse**: 512 cache size
## Architecture Details
- Hidden size: 512
- Number of layers: 8
- Attention heads: 8
- Vocabulary size: 50257
- Maximum sequence length: 1024
## Usage
### Using Transformers
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("micro-distill-grpo-vae")
tokenizer = AutoTokenizer.from_pretrained("micro-distill-grpo-vae")
inputs = tokenizer("Hello, world!", return_tensors="pt")
outputs = model.generate(**inputs, max_length=50)
print(tokenizer.decode(outputs[0]))
``` |