DeepBeepMeep commited on
Commit
2904e1f
·
1 Parent(s): aa32ae5

Added troubleshooting info

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -136,6 +136,18 @@ To run the application while loading entirely the diffusion model in VRAM (sligh
136
  python gradio_server.py --profile 3
137
  ```
138
 
 
 
 
 
 
 
 
 
 
 
 
 
139
 
140
  ### Loras support
141
 
 
136
  python gradio_server.py --profile 3
137
  ```
138
 
139
+ **Trouble shooting**:\
140
+ If you have installed Sage attention, it may seem that it works because *pip install sageattention* didn't produce and error or because sage is offered as on option but in fact it doesnt work : in order to be fully operatioal Sage needs to compile its triton kernels the first time it is run (that is the first time you try to generate a video).
141
+
142
+ Sometime fixing Sage compilation is easy (clear the triton cache, check triton is properly installed) sometime it is simply not possible because Sage is not supported on some older GPUs
143
+
144
+ Therefore you may have no choice but to fallback to sdpa attention, to do so:
145
+ - In the configuration menu inside the application, switch "Attention mode" to "sdpa"
146
+ or
147
+ - Launch the application this way:
148
+ ```bash
149
+ python gradio_server.py --attention sdpa
150
+ ```
151
 
152
  ### Loras support
153