DeepBeepMeep commited on
Commit ·
2904e1f
1
Parent(s): aa32ae5
Added troubleshooting info
Browse files
README.md
CHANGED
|
@@ -136,6 +136,18 @@ To run the application while loading entirely the diffusion model in VRAM (sligh
|
|
| 136 |
python gradio_server.py --profile 3
|
| 137 |
```
|
| 138 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 139 |
|
| 140 |
### Loras support
|
| 141 |
|
|
|
|
| 136 |
python gradio_server.py --profile 3
|
| 137 |
```
|
| 138 |
|
| 139 |
+
**Trouble shooting**:\
|
| 140 |
+
If you have installed Sage attention, it may seem that it works because *pip install sageattention* didn't produce and error or because sage is offered as on option but in fact it doesnt work : in order to be fully operatioal Sage needs to compile its triton kernels the first time it is run (that is the first time you try to generate a video).
|
| 141 |
+
|
| 142 |
+
Sometime fixing Sage compilation is easy (clear the triton cache, check triton is properly installed) sometime it is simply not possible because Sage is not supported on some older GPUs
|
| 143 |
+
|
| 144 |
+
Therefore you may have no choice but to fallback to sdpa attention, to do so:
|
| 145 |
+
- In the configuration menu inside the application, switch "Attention mode" to "sdpa"
|
| 146 |
+
or
|
| 147 |
+
- Launch the application this way:
|
| 148 |
+
```bash
|
| 149 |
+
python gradio_server.py --attention sdpa
|
| 150 |
+
```
|
| 151 |
|
| 152 |
### Loras support
|
| 153 |
|