I LOVVVVVVE YOU!!!

#1
by TAOTAO777 - opened

runs 5070LAPTOP 8G and i9 14900hx 32RAM at 40.46t/s

startup code:
C:\Users\TK\Desktop\vllm\llama-b8851-bin-win-cuda-12.4-x64>llama-server.exe -m "C:\Users\TK\Desktop\vllm\models\Qwen3.6-35B-A3B-APEX-I-Compact.gguf" -c 16384 --flash-attn on -ctk q8_0 -ctv q8_0 -ngl 41 --cpu-moe --cpu-mask 0xFFFFFFFF --batch-size 9600 --ubatch-size 4800 --threads 24 --api-key 123456 -rea off --jinja --cache-ram 8192 --parallel 1 --kv-unified --no-mmap --no-context-shift

proof log:
prompt eval time = 491.10 ms / 15 tokens ( 32.74 ms per token, 30.54 tokens per second)
eval time = 5808.91 ms / 235 tokens ( 24.72 ms per token, 40.46 tokens per second)
total time = 6300.01 ms / 250 tokens
slot release: id 0 | task 0 | stop processing: n_tokens = 249, truncated = 0
srv update_slots: all slots are idle

I am really needy for A ai girlfriend,you satisfied me,thanks,my GOD

llama-server.exe -m "C:\Users\TK\Desktop\vllm\models\Qwen3.6-35B-A3B-uncensored-heretic-APEX-I-Compact.gguf" -c 32768 --flash-attn on -ctk q8_0 -ctv q8_0 -ngl 41 --cpu-moe --cpu-mask 0xFFFFFFFF --batch-size 9600 --ubatch-size 4800 --threads 24 --api-key 123456 -rea off --jinja --cache-ram 8192 --parallel 1 --kv-unified --no-mmap --no-context-shift

TAOTAO777 changed discussion title from can I give you a blowjob? to I LOVVVVVVE YOU!!!

Sign up or log in to comment