Instructions to use neta-art/Neta-Lumina with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusion Single File
How to use neta-art/Neta-Lumina with Diffusion Single File:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
| base_model: | |
| - Alpha-VLLM/Lumina-Image-2.0 | |
| license: other | |
| license_name: fair-ai-public-license-1.0-sd | |
| license_link: https://freedevproject.org/faipl-1.0-sd/ | |
| [中文版模型说明](https://huggingface.co/neta-art/Neta-Lumina/blob/main/README-ZH.md) | |
|  | |
| # Introduction | |
| **Neta Lumina** is a high‑quality anime‑style image‑generation model developed by Neta.art Lab. | |
| Building on the open‑source **Lumina‑Image‑2.0** released by the Alpha‑VLLM team at Shanghai AI Laboratory, we fine‑tuned the model with a vast corpus of high‑quality anime images and multilingual tag data. The preliminary result is a compelling model with powerful comprehension and interpretation abilities (thanks to Gemma text encoder), ideal for illustration, posters, storyboards, character design, and more. | |
| ## Key Features | |
| - Optimized for diverse creative scenarios such as Furry, Guofeng (traditional‑Chinese aesthetics), pets, etc. | |
| - Wide coverage of characters and styles, from popular to niche concepts. (Still support danbooru tags!) | |
| - Accurate natural‑language understanding with excellent adherence to complex prompts. | |
| - Native multilingual support, with Chinese, English, and Japanese recommended first. | |
| ## Model Versions | |
| For models in alpha tests, requst access at https://huggingface.co/neta-art/NetaLumina_Alpha if you are interested. We will keep updating. | |
| ### neta-lumina-beta-0624-raw | |
| - **Primary Goal**: General knowledge and anime‑style optimization | |
| - **Data Set**: >13 million anime‑style images | |
| - **>46,000** A100 Hours | |
| - Higher upper limit, suitable for pro users. Check [**Neta Lumina Prompt Book**](https://nieta-art.feishu.cn/wiki/RVBgwvzBqiCvQ7kOMm1cM6NdnNc) for better results. | |
| ### neta-lumina-beta-0624-aes | |
| - First beta release candidate | |
| - **Primary Goal**: Enhanced aesthetics, pose accuracy, and scene detail | |
| - **Data Set**: Hundreds of thousands of handpicked high‑quality anime images (fine‑tuned on an older version of raw model) | |
| - User-friendly, suitable for most people. | |
| <br> | |
| # How to Use | |
| [Try it at Hugging Face playground](https://huggingface.co/spaces/neta-art/NetaLumina_T2I_Playground) | |
| ## ComfyUI | |
| Neta Lumina is built on the **Lumina2 Diffusion Transformer (DiT)** framework, please follow these steps precisely. | |
| ### Environment Requirements | |
| Currently Neta Lumina runs only on ComfyUI: | |
| - Latest ComfyUI installation | |
| - ≥ 8 GB VRAM | |
| ### Downloads & Installation | |
| **Original (component) release** | |
| 1. **Neta Lumina-Beta** | |
| - Download link: https://huggingface.co/neta-art/Neta-Lumina/blob/main/neta-lumina-beta-0624.pth | |
| - Save path: `ComfyUI/models/unet/` | |
| 2. **Text Encoder (Gemma-2B)** | |
| - Download link:https://huggingface.co/neta-art/Neta-Lumina/resolve/main/gemma_2_2b_fp16.safetensors | |
| - Save path: `ComfyUI/models/text_encoders/` | |
| 3. **VAE Model (16-Channel FLUX VAE)** | |
| - Download link: https://huggingface.co/neta-art/Neta-Lumina/resolve/main/ae.safetensors | |
| - Save path: `ComfyUI/models/vae/` | |
| **Workflow**: load [`lumina_workflow.json`](https://huggingface.co/neta-art/NetaLumina_Alpha/blob/main/lumina_workflow.json) in ComfyUI. | |
|  | |
| - `UNETLoader` – loads the `.pth` | |
| - `VAELoader` – loads `ae.safetensors` | |
| - `CLIPLoader` – loads `gemma_2_2b_fp16.safetensors` | |
| - `Text Encoder` – connects positive /negative prompts to K Sampler | |
| **Simple merged release** | |
| Download [`neta-lumina-beta-0624-all-in-one.safetensors`](https://huggingface.co/neta-art/Neta-Lumina/tree/main), | |
| `md5sum = dca54fef3c64e942c1a62a741c4f9d8a`, | |
| you may use ComfyUI’s simple checkpoint loader workflow. | |
| ### Recommended Settings | |
| - **Sampler**: `res_multistep` | |
| - **Scheduler**: `linear_quadratic` | |
| - **Steps**: 30 | |
| - **CFG (guidance)**: 4 – 5.5 | |
| - **EmptySD3LatentImage resolution**: 1024 × 1024, 768 × 1532, or 968 × 1322 | |
| <br> | |
| # Prompt Book | |
| Detailed prompt guidelines: [**Neta Lumina Prompt Book**](https://nieta-art.feishu.cn/wiki/RVBgwvzBqiCvQ7kOMm1cM6NdnNc) | |
| <br> | |
| # Community | |
| - Discord: https://discord.com/invite/TTTGccjbEa | |
| - QQ group: 785779037 | |
| <br> | |
| # Roadmap | |
| ## Model | |
| - Continous base‑model training to raise reasoning capability. | |
| - Aesthetic‑dataset iteration to improve anatomy, background richness, and overall appealness. | |
| - Smarter, more versatile tagging tools to lower the creative barrier. | |
| ## Ecosystem | |
| - LoRA training tutorials and components | |
| - Experienced users may already fine‑tune via Lumina‑Image‑2.0’s open code. | |
| - Development of advanced control / style‑consistency features (e.g., [Omini Control](https://arxiv.org/pdf/2411.15098)). [**Call for Collaboration!**](https://discord.com/invite/TTTGccjbEa) | |
| <br> | |
| # License & Disclaimer | |
| - Neta Lumina is released under the [**Fair AI Public License 1.0‑SD**](https://freedevproject.org/faipl-1.0-sd/) | |
| - Any modifications, merges, or derivative models must themselves be open‑sourced. | |
| <br> | |
| # Participants & Contributors | |
| - Special thanks to the **Alpha‑VLLM** team for open‑sourcing **Lumina‑Image‑2.0** | |
| - **Model development**: **Neta.art Lab (Civitai)** | |
| - Core Trainer: **li_li** [Civitai](https://civitai.com/user/li_li) ・ [Hugging Face](https://huggingface.co/heziiiii) | |
| <br> | |
| - **Partners** | |
| - **nebulae**: [Civitai](https://civitai.com/user/kitarz) ・ [Hugging Face](https://huggingface.co/NebulaeWis) | |
| - [**narugo1992**](https://github.com/narugo1992) & [**deepghs**](https://huggingface.co/deepghs): open datasets, processing tools, and models | |
| - [**Naifu**](https://github.com/Mikubill/naifu) trainer at [Mikubill](https://github.com/Mikubill) | |
| <br> | |
| # Community Contributors | |
| **Evaluators & developers**: 二小姐, spawner, Rnglg2 | |
| **Other contributors**: 沉迷摸鱼, poi氵, ashan, 十分无奈, GHOSTLXH, wenaka, iiiiii, 年糕特工队, 恩匹希, 奶冻美宣集, mumu, yizyin, smile | |
| <br> | |
| # Appendix & Resources | |
| - **TeaCache**: https://github.com/spawner1145/CUI-Lumina2-TeaCache | |
| - **Advanced samplers & TeaCache guide (by spawner)**: https://docs.qq.com/doc/DZEFKb1ZrZVZiUmxw?nlc=1 | |
| - **Neta Lumina ComfyUI Manual (in Chinese)**: https://docs.qq.com/doc/DZEVQZFdtaERPdXVh |