Instructions to use Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix", filename="L3-TheSpice-8b-v0.1.3-F16.gguf", )
output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix with llama.cpp:
Install (macOS, Linux)
curl -LsSf https://llama.app/install.sh | sh # Start a local OpenAI-compatible server with a web UI: llama serve -hf Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M # Run inference directly in the terminal: llama cli -hf Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama serve -hf Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M # Run inference directly in the terminal: llama cli -hf Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M
Use Docker
docker model run hf.co/Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M
- LM Studio
- Jan
- Ollama
How to use Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix with Ollama:
ollama run hf.co/Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M
- Unsloth Studio
How to use Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix to start chatting
- Atomic Chat new
- Docker Model Runner
How to use Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix with Docker Model Runner:
docker model run hf.co/Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M
- Lemonade
How to use Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull Lewdiculous/L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix:Q4_K_M
Run and chat with the model
lemonade run user.L3-TheSpice-8b-v0.1.3-GGUF-IQ-Imatrix-Q4_K_M
List all available models
lemonade list
UPDATED QUANTS! Use KCPP 1.64+
Browse files- L3-TheSpice-8b-v0.1.3-F16.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-IQ3_M-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-IQ3_XXS-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-IQ4_NL-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-IQ4_XS-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-Q4_K_M-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-Q4_K_S-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-Q5_K_M-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-Q5_K_S-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-Q6_K-imat.gguf +3 -0
- L3-TheSpice-8b-v0.1.3-Q8_0-imat.gguf +3 -0
- imatrix.dat +3 -0
L3-TheSpice-8b-v0.1.3-F16.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4998724eb8419c66291dcc904e07fc66137f11fcdb58eb00cb47c228c002f552
|
| 3 |
+
size 16068890432
|
L3-TheSpice-8b-v0.1.3-IQ3_M-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bd511de6eed8c8f885d86c3aebe5a8e30bfc3b5c31892dde9cc8900b77516343
|
| 3 |
+
size 3784822944
|
L3-TheSpice-8b-v0.1.3-IQ3_XXS-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bcccacc7a506bbd247748dc76f0b1decdaf6bd6bf72555d7418da51fa6ba20ad
|
| 3 |
+
size 3274911904
|
L3-TheSpice-8b-v0.1.3-IQ4_NL-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0fbc1b9e331fadbd8c72d4d89f4ffae665a7db012e436b82c20056aa0276ad97
|
| 3 |
+
size 4677988512
|
L3-TheSpice-8b-v0.1.3-IQ4_XS-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3cd2b26753ced130d9b64add51d42c0022605fe63091d6fdbd06895617ca2921
|
| 3 |
+
size 4447662240
|
L3-TheSpice-8b-v0.1.3-Q4_K_M-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d4797db2c245281f17c93992bfb66b10d356675677a922cde9bb5874d8c678ee
|
| 3 |
+
size 4920733856
|
L3-TheSpice-8b-v0.1.3-Q4_K_S-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:39b0733324b1a8efb007dca15d0a57027312ab247c10a571d47080265b65e0f5
|
| 3 |
+
size 4692668576
|
L3-TheSpice-8b-v0.1.3-Q5_K_M-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f0b3c9cc5f5e45b198ad130ac08894ec034fc9bc8f76a67f07494224e65f5548
|
| 3 |
+
size 5732987040
|
L3-TheSpice-8b-v0.1.3-Q5_K_S-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:34a149ed3f2c0878f81694c95037fe8a7474dd435f6996c85a347423789fd100
|
| 3 |
+
size 5599293600
|
L3-TheSpice-8b-v0.1.3-Q6_K-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4cb2ee497ce3ee887fdc7bc449ebbe9cfb95e95c04d91bcc26ceed75ac33f443
|
| 3 |
+
size 6596006048
|
L3-TheSpice-8b-v0.1.3-Q8_0-imat.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:54661386dbab9dbf86f2bb66ab306a1715368cabb81cf5056cff5c5f91c67b19
|
| 3 |
+
size 8540770464
|
imatrix.dat
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:062d11cf913deec30cbe234d8dfb50b3f84eac73802436b4adc637f97a9a0618
|
| 3 |
+
size 4988193
|