Reinforcement Learning
Transformers
GGUF
Chinese
English
incremental-pretraining
sft
roleplay
cot
sex
conversational
Not-For-All-Audiences
Instructions to use ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4", dtype="auto") - llama-cpp-python
How to use ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4", filename="Tifa-Deepsex-14b-CoT-Chat-IQ4_NL.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL # Run inference directly in the terminal: llama-cli -hf ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL # Run inference directly in the terminal: llama-cli -hf ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL # Run inference directly in the terminal: ./llama-cli -hf ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL # Run inference directly in the terminal: ./build/bin/llama-cli -hf ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL
Use Docker
docker model run hf.co/ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL
- LM Studio
- Jan
- Ollama
How to use ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 with Ollama:
ollama run hf.co/ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL
- Unsloth Studio
How to use ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 to start chatting
- Atomic Chat new
- Docker Model Runner
How to use ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 with Docker Model Runner:
docker model run hf.co/ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL
- Lemonade
How to use ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4:IQ4_NL
Run and chat with the model
lemonade run user.Tifa-Deepsex-14b-CoT-GGUF-Q4-IQ4_NL
List all available models
lemonade list
Delete Tifa-Deepsex-14b-CoT-Q4_K_M.gguf
#37 opened over 1 year ago
by
shiqja
官网的大号Tifa模型实际上是Claude API,您为何要谎称您有大模型?
1
#35 opened over 1 year ago
by deleted
How do I download it?
#33 opened over 1 year ago
by
ramsey231rrr
请问本地部署要求是什么?8G显存可行吗
2
#31 opened over 1 year ago
by
xyFredi
Prompt for the bot
1
#30 opened over 1 year ago
by
Immo174
大佬用什么设备训练的,3090能自己微调吗?
1
#29 opened over 1 year ago
by
ycye
Can not Ollama pull the full files
👍 1
#27 opened over 1 year ago
by
chancenju
7B-GGUF-Q4 V2版是13号还是14号出?感觉等不及了,有没有合适的LM STUDIO设置?
1
#26 opened over 1 year ago
by
RavenRock
Is there a way to disable the <thinking> feature?
2
#25 opened over 1 year ago
by
Kuromasa
Create c
#24 opened over 1 year ago
by
sashamatveevsashamatveev
[Issue] Mac 下运行 Chat 模型遇到严重问题
➕ 4
2
#23 opened over 1 year ago
by
AndyZhang32767
7b model params
#22 opened over 1 year ago
by
TimohaS
How to launch this?
14
#21 opened over 1 year ago
by
Andrei321123
BRO API KEY CHAIYE
1
#20 opened over 1 year ago
by
brahamaandai
Add tag "not-for-all-audiences" (NSFW)
#19 opened over 1 year ago
by
adamm-hf
COULD YOU PLEASE PROVIDE A WAY TO WRITE ENTIRE BOOK OF 50 60 PAGES WITH THIS.
#16 opened over 1 year ago
by
parthwagh
Tifa_220B? 这是啥?
1
#15 opened over 1 year ago
by
Zambolin
Chat 和 Crazy 模型使用 Ollama 命令下载时无法区分Tag
4
#14 opened over 1 year ago
by
jie65535
关于discussions中提到问题以及最佳实践的思考及可能解决方案
2
#13 opened over 1 year ago
by
I-am-CJC
缺少常识, 生成的文字非常混乱, 是各种器官名字的堆砌. 而且不能遵守prompt,这是致命问题.
2
#12 opened over 1 year ago
by
Moodym
如果使用web端有啥好的开源项目推荐?不是说要设置单独的提示词么?
#11 opened over 1 year ago
by
ideacco
请问demo里的自定义API该如何设置
1
#9 opened over 1 year ago
by
akande1
[Issue]生成文字时<think>缺失
3
#8 opened over 1 year ago
by
AndyZhang32767
是不是uncensored
2
#7 opened over 1 year ago
by
bimmie
About RL
#6 opened over 1 year ago
by
Rhythmblue
System Prompt設置
#4 opened over 1 year ago
by
HideonSofa
通过ollama加载后,用这里提供的demo自定义api返回出错
1
#3 opened over 1 year ago
by
cscvscsg
如果使用在酒馆,需要怎样的preset?
1
#2 opened over 1 year ago
by
Mr9KL