64K variant

#8
by dfsafdsf - opened

Are you planning to train the model with a 64K context instead of 32K?
