Instructions to use alvdansen/illustration-1.0-qwen-image with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use alvdansen/illustration-1.0-qwen-image with PEFT:
Task type is invalid.
- Inference
- Notebooks
- Google Colab
- Kaggle
Illustration 1.0 — Qwen-Image
A style LoRA for Qwen-Image trained on 244 curated illustration and anime reference images across diverse visual styles. Produces images spanning cute anime character design, European bande dessinée, indie risograph prints, storybook watercolor, retro shoujo manga, and graphic novel illustration.
Published alongside its straight-baseline twin as the paired comparison in Forgetting on Purpose: Generalization as the Quality Criterion for Small-Dataset LoRA Fine-Tuning — Alvdansen Labs, May 2026. Read the paper · Source on GitHub.
Style Influences
- Japanese anime and manga illustration — clean cel shading, expressive character design, shoujo and slice-of-life aesthetics
- European graphic novel and bande dessinée — ink crosshatching, atmospheric landscapes, ligne claire
- Indie illustration and zine culture — risograph print textures, limited palettes, grainy halftone
- Children's book and storybook illustration — watercolor washes, warm palettes, charming character proportions
- Retro anime aesthetics — 80s/90s anime film stills, VHS grain, bold color blocking
- Concept art and character design — flat color fills, turnaround sheets, fashion illustration
Usage
No trigger word — this is a style LoRA. Describe what you want and add style cues like "ink and watercolor", "clean cel shading", "risograph print" to steer the output.
Recommended Inference Settings
Sampler: euler
Scheduler: simple
CFG: 3.5
Steps: 45 (30–60 works well)
LoRA strength: 0.8–1.0
Sample Generations
Training Details
- Base model: Qwen-Image (FP8 quantized, text encoder FP8)
- Training steps: 59,000, across a 5-phase chained dataset schedule (four disjoint subsets trained sequentially, then the full combined dataset reintroduced for consolidation)
- Rank/Alpha: 42/42
- Learning rate: 5e-5
- Optimizer: AdamW 8-bit
- Caption dropout: 0.35
- EMA: enabled (decay 0.99)
- Noise scheduler: flowmatch
- Precision: bf16 with qfloat8 quantization
- Dataset: 244 curated images
- Trainer: ai-toolkit by Ostris
- Hardware: NVIDIA RTX 6000 Ada (A6000, 48 GB VRAM)
Full configuration in Appendix A of the paper.
- Downloads last month
- -
Model tree for alvdansen/illustration-1.0-qwen-image
Base model
Qwen/Qwen-Image











