Text Generation
MLX
Safetensors
English
glm_moe_dsa
apple-silicon
Mixture of Experts
pruned
quantized
soul-targeted
agentic
local-agent
glm
conversational
Eval Results (legacy)
4-bit precision
Instructions to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX") prompt = "Write a story about Einstein" messages = [{"role": "user", "content": prompt}] prompt = tokenizer.apply_chat_template( messages, add_generation_prompt=True ) text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
- Pi
How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX
Run Hermes
hermes
- OpenClaw new
How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with OpenClaw:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"
Configure OpenClaw
# Install OpenClaw: npm install -g openclaw@latest # Register the local server and set it as the default model: openclaw onboard --non-interactive --mode local \ --auth-choice custom-api-key \ --custom-base-url http://127.0.0.1:8080/v1 \ --custom-model-id "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX" \ --custom-provider-id mlx-lm \ --custom-compatibility openai \ --custom-text-input \ --accept-risk \ --skip-health
Run OpenClaw
openclaw agent --local --agent main --message "Hello from Hugging Face"
- MLX LM
How to use philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Interactive chat REPL mlx_lm.chat --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX"
Run an OpenAI-compatible server
# Install MLX LM uv tool install mlx-lm # Start the server mlx_lm.server --model "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX" # Calling the OpenAI-compatible server with curl curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "philipjohnbasile/GLM-5.2-Demolition-q4a4-soul-MLX", "messages": [ {"role": "user", "content": "Hello"} ] }'
| #!/usr/bin/env python3 | |
| """Benchmark contamination check #2 β are the EVAL benchmarks we report (HumanEval, GSM8K) present in our HEAL | |
| TRAINING data? The heal corpus is built from external datasets (open-r1/Mixture-of-Thoughts, OpenThoughts-114k, | |
| evol-codealpaca, ultrachat, ...) which CAN include benchmark problems β the card's HumanEval 19/20 + GSM8K 8/12 | |
| could be inflated by memorization. This checks the HEADLINE numbers honestly (normalized-substring match of each | |
| benchmark prompt against the whole heal corpus). CPU-only. | |
| python scripts/82_heal_benchmark_contam.py | |
| """ | |
| import glob | |
| import json | |
| import os | |
| import re | |
| HERE = os.path.dirname(__file__) | |
| ROOT = os.path.join(HERE, "..") | |
| def heal_corpus(): | |
| chunks = [] | |
| for fp in glob.glob(os.path.join(ROOT, "heal", "*", "train.jsonl")): | |
| for line in open(fp, encoding="utf-8", errors="ignore"): | |
| try: | |
| for m in json.loads(line).get("messages", []): | |
| chunks.append(m.get("content", "") or "") | |
| except Exception: # noqa: BLE001 | |
| continue | |
| return re.sub(r"\s+", " ", " ".join(chunks)).lower() | |
| def main(): | |
| text = heal_corpus() | |
| print(f" heal corpus: {len(text) / 1e6:.1f}M chars (all heal/*/train.jsonl)") | |
| from datasets import load_dataset | |
| # HumanEval β the card's headline 19/20. Distinctive chunk = the normalized prompt's first ~90 chars (def + docstring start) | |
| try: | |
| he = load_dataset("openai/openai_humaneval", split="test") | |
| hits = sum(1 for r in he if (sig := re.sub(r"\s+", " ", r["prompt"]).strip().lower()[:90]) and sig in text) | |
| print(f" HumanEval (164): {hits} prompts in heal = {100 * hits / 164:.1f}% contaminated " | |
| f"{'β the 19/20 is inflated' if hits else 'β CLEAN β 19/20 is honest'}") | |
| except Exception as e: # noqa: BLE001 | |
| print(f" HumanEval: load failed ({e})") | |
| # GSM8K test β the card's 8/12 (sample 300 for speed) | |
| try: | |
| gsm = list(load_dataset("openai/gsm8k", "main", split="test"))[:300] | |
| hits = sum(1 for r in gsm if (q := re.sub(r"\s+", " ", r["question"]).strip().lower()[:90]) and q in text) | |
| print(f" GSM8K-test (300 sampled): {hits} questions in heal = {100 * hits / 300:.1f}% contaminated " | |
| f"{'β flag GSM8K' if hits else 'β CLEAN'}") | |
| except Exception as e: # noqa: BLE001 | |
| print(f" GSM8K: load failed ({e})") | |
| if __name__ == "__main__": | |
| main() | |