Instructions to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with MLX:

# Make sure mlx-lm is installed
# pip install --upgrade mlx-lm

# Generate text with mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX")

prompt = "Write a story about Einstein"
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True
)

text = generate(model, tokenizer, prompt=prompt, verbose=True)

Notebooks
Google Colab
Kaggle
Local Apps Settings
LM Studio

How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with Pi:

Start the MLX server

# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "mlx-lm": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with Hermes Agent:

Start the MLX server

# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX

Run Hermes

hermes

OpenClaw new

How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with OpenClaw:

Start the MLX server

# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"

Configure OpenClaw

# Install OpenClaw:
npm install -g openclaw@latest
# Register the local server and set it as the default model:
openclaw onboard --non-interactive --mode local \
  --auth-choice custom-api-key \
  --custom-base-url http://127.0.0.1:8080/v1 \
  --custom-model-id "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX" \
  --custom-provider-id mlx-lm \
  --custom-compatibility openai \
  --custom-text-input \
  --accept-risk \
  --skip-health

Run OpenClaw

openclaw agent --local --agent main --message "Hello from Hugging Face"

MLX LM

How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with MLX LM:

Generate or start a chat session

# Install MLX LM
uv tool install mlx-lm
# Interactive chat REPL
mlx_lm.chat --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"

Run an OpenAI-compatible server

# Install MLX LM
uv tool install mlx-lm
# Start the server
mlx_lm.server --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"
# Calling the OpenAI-compatible server with curl
curl -X POST "http://localhost:8000/v1/chat/completions" \
   -H "Content-Type: application/json" \
   --data '{
     "model": "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX",
     "messages": [
       {"role": "user", "content": "Hello"}
     ]
   }'

GLM-5.2-Demolition-q4a4-soul-MLX / eval /benchmark_contamination.py

philipjohnbasile

Upload eval/benchmark_contamination.py with huggingface_hub

bf31e01 verified 16 days ago

Raw

History Blame

2.47 kB

	#!/usr/bin/env python3
	"""Benchmark contamination check #2 — are the EVAL benchmarks we report (HumanEval, GSM8K) present in our HEAL
	TRAINING data? The heal corpus is built from external datasets (open-r1/Mixture-of-Thoughts, OpenThoughts-114k,
	evol-codealpaca, ultrachat, ...) which CAN include benchmark problems → the card's HumanEval 19/20 + GSM8K 8/12
	could be inflated by memorization. This checks the HEADLINE numbers honestly (normalized-substring match of each
	benchmark prompt against the whole heal corpus). CPU-only.

	python scripts/82_heal_benchmark_contam.py
	"""
	import glob
	import json
	import os
	import re

	HERE = os.path.dirname(__file__)
	ROOT = os.path.join(HERE, "..")


	def heal_corpus():
	chunks = []
	for fp in glob.glob(os.path.join(ROOT, "heal", "*", "train.jsonl")):
	for line in open(fp, encoding="utf-8", errors="ignore"):
	try:
	for m in json.loads(line).get("messages", []):
	chunks.append(m.get("content", "") or "")
	except Exception: # noqa: BLE001
	continue
	return re.sub(r"\s+", " ", " ".join(chunks)).lower()


	def main():
	text = heal_corpus()
	print(f" heal corpus: {len(text) / 1e6:.1f}M chars (all heal/*/train.jsonl)")
	from datasets import load_dataset

	# HumanEval — the card's headline 19/20. Distinctive chunk = the normalized prompt's first ~90 chars (def + docstring start)
	try:
	he = load_dataset("openai/openai_humaneval", split="test")
	hits = sum(1 for r in he if (sig := re.sub(r"\s+", " ", r["prompt"]).strip().lower()[:90]) and sig in text)
	print(f" HumanEval (164): {hits} prompts in heal = {100 * hits / 164:.1f}% contaminated "
	f"{'⚠ the 19/20 is inflated' if hits else '✓ CLEAN — 19/20 is honest'}")
	except Exception as e: # noqa: BLE001
	print(f" HumanEval: load failed ({e})")

	# GSM8K test — the card's 8/12 (sample 300 for speed)
	try:
	gsm = list(load_dataset("openai/gsm8k", "main", split="test"))[:300]
	hits = sum(1 for r in gsm if (q := re.sub(r"\s+", " ", r["question"]).strip().lower()[:90]) and q in text)
	print(f" GSM8K-test (300 sampled): {hits} questions in heal = {100 * hits / 300:.1f}% contaminated "
	f"{'⚠ flag GSM8K' if hits else '✓ CLEAN'}")
	except Exception as e: # noqa: BLE001
	print(f" GSM8K: load failed ({e})")


	if __name__ == "__main__":
	main()