Instructions to use elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
# Qwen2.5-0.5B-Instruct is a text-only causal language model, so use AutoModelForCausalLM
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan")
model = AutoModelForCausalLM.from_pretrained("elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Local Apps

vLLM

How to use elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
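
The curl call above can also be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_chat_request` and `chat` are hypothetical, and it assumes the server started above is listening on `http://localhost:8000` and returns an OpenAI-style chat completion response.

```python
import json
import urllib.request

MODEL_ID = "elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan"


def build_chat_request(prompt, base_url="http://localhost:8000", model=MODEL_ID):
    """Build (but do not send) a POST request for /v1/chat/completions."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def chat(prompt, **kwargs):
    """Send the request and return the assistant's reply text."""
    request = build_chat_request(prompt, **kwargs)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # OpenAI-style chat responses carry the reply under choices[0].message.content.
    return body["choices"][0]["message"]["content"]


# Usage (requires a running server):
# print(chat("What is the capital of France?"))
```

Passing `base_url="http://localhost:30000"` should target a server on port 30000 instead, since only the URL differs between the two curl examples on this page.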

Use Docker

docker model run hf.co/elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan

SGLang

How to use elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Unsloth Studio

How to use elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan to start chatting

Load model with FastModel

# Install Unsloth first (run in a shell, not in Python): pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
    model_name="elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan",
    max_seq_length=2048,
)

Docker Model Runner
How to use elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan with Docker Model Runner:
```
docker model run hf.co/elsvastika/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan
```

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan

File size: 12,100 Bytes

{
  "best_metric": null,
  "best_model_checkpoint": null,
  "epoch": 5.0,
  "eval_steps": 500,
  "global_step": 30,
  "is_hyper_param_search": false,
  "is_local_process_zero": true,
  "is_world_process_zero": true,
  "log_history": [
    {
      "completion_length": 333.78125,
      "epoch": 0.3333333333333333,
      "grad_norm": 2.23221492767334,
      "kl": 0.05727095284964889,
      "learning_rate": 4.985344892885899e-07,
      "loss": 0.0001,
      "reward": 1.9450934082269669,
      "reward_std": 1.730120688676834,
      "rewards/concensus_correctness_reward_func": 0.625,
      "rewards/consensus_reward_func": 0.0,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.1875,
      "rewards/question_recreation_reward_func": 0.5917184855788946,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.5408749803900719,
      "step": 2
    },
    {
      "completion_length": 312.71875,
      "epoch": 0.6666666666666666,
      "grad_norm": 2.403878688812256,
      "kl": 0.08554882137104869,
      "learning_rate": 4.869132927957006e-07,
      "loss": 0.0001,
      "reward": 2.863950189203024,
      "reward_std": 2.569031774997711,
      "rewards/concensus_correctness_reward_func": 1.369999997317791,
      "rewards/consensus_reward_func": 0.25,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.3125,
      "rewards/question_recreation_reward_func": 0.44188757985830307,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.48956248909235,
      "step": 4
    },
    {
      "completion_length": 343.625,
      "epoch": 1.0,
      "grad_norm": 1.2604225873947144,
      "kl": 0.028510307834949344,
      "learning_rate": 4.642142940418973e-07,
      "loss": 0.0,
      "reward": 1.9936382537707686,
      "reward_std": 1.554558202624321,
      "rewards/concensus_correctness_reward_func": 0.22356250323355198,
      "rewards/consensus_reward_func": 0.375,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.5625,
      "rewards/question_recreation_reward_func": 0.5116695226170123,
      "rewards/soft_format_reward_func": 0.015625,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.30528124468401074,
      "step": 6
    },
    {
      "completion_length": 269.96875,
      "epoch": 1.3333333333333333,
      "grad_norm": 1.9944454431533813,
      "kl": 0.07280526106478646,
      "learning_rate": 4.314988729807827e-07,
      "loss": 0.0001,
      "reward": 1.4901013299822807,
      "reward_std": 1.149802852421999,
      "rewards/concensus_correctness_reward_func": 0.018812499940395355,
      "rewards/consensus_reward_func": 0.1875,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.375,
      "rewards/question_recreation_reward_func": 0.441507613286376,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.4672812442295253,
      "step": 8
    },
    {
      "completion_length": 331.09375,
      "epoch": 1.6666666666666665,
      "grad_norm": 30.684772491455078,
      "kl": 0.07077017636038363,
      "learning_rate": 3.902967663405956e-07,
      "loss": 0.0001,
      "reward": 3.6839485419914126,
      "reward_std": 3.374626036733389,
      "rewards/concensus_correctness_reward_func": 2.042874988168478,
      "rewards/consensus_reward_func": 0.25,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.375,
      "rewards/question_recreation_reward_func": 0.40101105347275734,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.6150625091977417,
      "step": 10
    },
    {
      "completion_length": 288.0,
      "epoch": 2.0,
      "grad_norm": 2.637801170349121,
      "kl": 0.06282075913622975,
      "learning_rate": 3.4253453883497864e-07,
      "loss": 0.0001,
      "reward": 1.6097149848937988,
      "reward_std": 1.3047048449516296,
      "rewards/concensus_correctness_reward_func": 0.1197499968111515,
      "rewards/consensus_reward_func": 0.125,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.25,
      "rewards/question_recreation_reward_func": 0.5750275067985058,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.5399375148117542,
      "step": 12
    },
    {
      "completion_length": 333.0625,
      "epoch": 2.3333333333333335,
      "grad_norm": 8.6305513381958,
      "kl": 0.10169121820945293,
      "learning_rate": 2.9044549913819124e-07,
      "loss": 0.0001,
      "reward": 2.1766048222780228,
      "reward_std": 1.1718497909605503,
      "rewards/concensus_correctness_reward_func": 0.16387499682605267,
      "rewards/consensus_reward_func": 0.5,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.5,
      "rewards/question_recreation_reward_func": 0.49301106855273247,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.5197187401354313,
      "step": 14
    },
    {
      "completion_length": 300.0,
      "epoch": 2.6666666666666665,
      "grad_norm": 1.2753549814224243,
      "kl": 0.06518209201749414,
      "learning_rate": 2.3646527285364563e-07,
      "loss": 0.0001,
      "reward": 4.655350029468536,
      "reward_std": 5.421490907669067,
      "rewards/concensus_correctness_reward_func": 3.292874988168478,
      "rewards/consensus_reward_func": 0.25,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.375,
      "rewards/question_recreation_reward_func": 0.5220377082005143,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.2154375035315752,
      "step": 16
    },
    {
      "completion_length": 256.96875,
      "epoch": 3.0,
      "grad_norm": 2.370971441268921,
      "kl": 0.07046509714564309,
      "learning_rate": 1.8311791536769483e-07,
      "loss": 0.0001,
      "reward": 1.1566908285021782,
      "reward_std": 1.1407469268888235,
      "rewards/concensus_correctness_reward_func": 0.12006250023841858,
      "rewards/consensus_reward_func": 0.125,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.0625,
      "rewards/question_recreation_reward_func": 0.4378783367574215,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.41124999476596713,
      "step": 18
    },
    {
      "completion_length": 306.6875,
      "epoch": 3.3333333333333335,
      "grad_norm": 3.7701992988586426,
      "kl": 0.07338486751541495,
      "learning_rate": 1.328978898250525e-07,
      "loss": 0.0001,
      "reward": 2.597634121775627,
      "reward_std": 2.288734745234251,
      "rewards/concensus_correctness_reward_func": 0.7414374984800816,
      "rewards/consensus_reward_func": 0.5625,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.4375,
      "rewards/question_recreation_reward_func": 0.4301653988659382,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.4260312672704458,
      "step": 20
    },
    {
      "completion_length": 231.65625,
      "epoch": 3.6666666666666665,
      "grad_norm": 33.05220031738281,
      "kl": 0.06382088427199051,
      "learning_rate": 8.81534288045431e-08,
      "loss": 0.0001,
      "reward": 1.6381819306407124,
      "reward_std": 1.1167348898015916,
      "rewards/concensus_correctness_reward_func": 0.17299999669194221,
      "rewards/consensus_reward_func": 0.25,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.1875,
      "rewards/question_recreation_reward_func": 0.46974439593032,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.557937502861023,
      "step": 22
    },
    {
      "completion_length": 250.75,
      "epoch": 4.0,
      "grad_norm": 2.149594306945801,
      "kl": 0.065125553868711,
      "learning_rate": 5.097673357358906e-08,
      "loss": 0.0001,
      "reward": 3.1555401384830475,
      "reward_std": 2.317109949886799,
      "rewards/concensus_correctness_reward_func": 1.4325000010430813,
      "rewards/consensus_reward_func": 0.0625,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.375,
      "rewards/question_recreation_reward_func": 0.5613213405013084,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.015625,
      "rewards/xmlcount_reward_func": 0.7085937708616257,
      "step": 24
    },
    {
      "completion_length": 318.3125,
      "epoch": 4.333333333333333,
      "grad_norm": 2.0100913047790527,
      "kl": 0.06343707226915285,
      "learning_rate": 2.3106145082260774e-08,
      "loss": 0.0001,
      "reward": 1.0433421432971954,
      "reward_std": 1.362046491354704,
      "rewards/concensus_correctness_reward_func": 0.07881249859929085,
      "rewards/consensus_reward_func": 0.125,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.0,
      "rewards/question_recreation_reward_func": 0.5773108880966902,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.26221874030306935,
      "step": 26
    },
    {
      "completion_length": 325.21875,
      "epoch": 4.666666666666667,
      "grad_norm": 1.5065702199935913,
      "kl": 0.04980072972830385,
      "learning_rate": 5.844861072478335e-09,
      "loss": 0.0,
      "reward": 3.545183375477791,
      "reward_std": 3.6524936025962234,
      "rewards/concensus_correctness_reward_func": 1.9014999996870756,
      "rewards/consensus_reward_func": 0.375,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.5,
      "rewards/question_recreation_reward_func": 0.3512457935139537,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.41743750870227814,
      "step": 28
    },
    {
      "completion_length": 294.4375,
      "epoch": 5.0,
      "grad_norm": 1.806432843208313,
      "kl": 0.08484870253596455,
      "learning_rate": 0.0,
      "loss": 0.0001,
      "reward": 2.9638523161411285,
      "reward_std": 2.550200469791889,
      "rewards/concensus_correctness_reward_func": 0.9272499978542328,
      "rewards/consensus_reward_func": 0.25,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.6875,
      "rewards/question_recreation_reward_func": 0.42335233092308044,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.6757499873638153,
      "step": 30
    },
    {
      "epoch": 5.0,
      "step": 30,
      "total_flos": 0.0,
      "train_loss": 6.76935035395824e-05,
      "train_runtime": 380.3235,
      "train_samples_per_second": 1.262,
      "train_steps_per_second": 0.079
    }
  ],
  "logging_steps": 2,
  "max_steps": 30,
  "num_input_tokens_seen": 0,
  "num_train_epochs": 5,
  "save_steps": 25,
  "stateful_callbacks": {
    "TrainerControl": {
      "args": {
        "should_epoch_stop": false,
        "should_evaluate": false,
        "should_log": false,
        "should_save": true,
        "should_training_stop": true
      },
      "attributes": {}
    }
  },
  "total_flos": 0.0,
  "train_batch_size": 4,
  "trial_name": null,
  "trial_params": null
}
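
A log like the one above can be summarized with a few lines of Python. The sketch below embeds the step 2 entry (trimmed to its reward fields) so it runs on its own; the helper name `reward_rows` is hypothetical. In the entries checked by hand (steps 2 and 4), `reward` matches the sum of the `rewards/*` fields to within rounding. That is an observation about this log, not a documented guarantee of the trainer.

```python
import json

# The step 2 entry from the log above, trimmed to its reward fields,
# plus a stub of the final summary entry (which has no "reward" key).
SAMPLE_STATE = json.loads("""
{
  "log_history": [
    {
      "reward": 1.9450934082269669,
      "reward_std": 1.730120688676834,
      "rewards/concensus_correctness_reward_func": 0.625,
      "rewards/consensus_reward_func": 0.0,
      "rewards/cumulative_reward_2": 0.0,
      "rewards/final_correctness_reward_func": 0.1875,
      "rewards/question_recreation_reward_func": 0.5917184855788946,
      "rewards/soft_format_reward_func": 0.0,
      "rewards/strict_format_reward_func": 0.0,
      "rewards/xmlcount_reward_func": 0.5408749803900719,
      "step": 2
    },
    {"epoch": 5.0, "step": 30, "train_loss": 6.76935035395824e-05}
  ]
}
""")

REWARD_PREFIX = "rewards/"


def reward_rows(state):
    """Yield (step, total_reward, component_sum) for entries that log a reward."""
    for entry in state["log_history"]:
        if "reward" not in entry:
            continue  # skip the final summary entry
        component_sum = sum(
            value for key, value in entry.items() if key.startswith(REWARD_PREFIX)
        )
        yield entry["step"], entry["reward"], component_sum


for step, total, component_sum in reward_rows(SAMPLE_STATE):
    print(f"step {step}: reward={total:.4f} sum of components={component_sum:.4f}")
```

To run this on the full log, replace `SAMPLE_STATE` with the result of `json.load` on a local copy of the JSON file.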