Reinforcement Learning
PEFT
Safetensors
Portuguese
English
lora
grpo
rlhf
fidc
portuguese
finance
code
qwen
Instructions to use sttjr/paganini-qwen35-27b-grpo-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use sttjr/paganini-qwen35-27b-grpo-lora with PEFT:
Base model is not found.
- Notebooks
- Google Colab
- Kaggle
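Since the PEFT entry above reports that the base model is not found, a loader cannot rely on the page to name the checkpoint. The following is a minimal sketch, not the author's documented usage: it assumes the adapter is a causal-LM LoRA loadable with `AutoModelForCausalLM`, that `peft`, `transformers`, and `accelerate` are installed, and that the base checkpoint is either recorded in the adapter's config or supplied by the caller. The helper names `resolve_base_model_id` and `load_adapter` are hypothetical, and the base model ID is deliberately left for the caller to provide.

```python
ADAPTER_ID = "sttjr/paganini-qwen35-27b-grpo-lora"


def resolve_base_model_id(config_base, override=None):
    """Choose the base checkpoint: an explicit override wins, else the adapter config value."""
    base = override or config_base
    if not base:
        raise ValueError(
            "No base model recorded in the adapter config; pass base_model_id explicitly."
        )
    return base


def load_adapter(adapter_id=ADAPTER_ID, base_model_id=None):
    """Load the base model and attach the LoRA adapter (assumes a causal-LM adapter)."""
    # Imported lazily so the helpers above can be used without the heavy dependencies.
    from peft import PeftConfig, PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # The adapter config normally records which checkpoint it was trained on.
    config = PeftConfig.from_pretrained(adapter_id)
    base = resolve_base_model_id(config.base_model_name_or_path, base_model_id)

    tokenizer = AutoTokenizer.from_pretrained(base)
    model = AutoModelForCausalLM.from_pretrained(
        base,
        torch_dtype="auto",  # keep the checkpoint's stored dtype
        device_map="auto",   # requires accelerate; spreads layers across devices
    )
    # Wrap the base model with the LoRA weights from the adapter repository.
    model = PeftModel.from_pretrained(model, adapter_id)
    return model, tokenizer
```

Calling `load_adapter(base_model_id=...)` with the checkpoint you have confirmed as the base bypasses the config lookup. A model of this size needs substantial GPU memory, so quantized loading may be necessary in practice.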