How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
Use Docker
docker model run hf.co/Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M:Q4_K_M
Quick Links

Qwen2.5-1.5B BuildEng GGUF Q4_K_M

Repository: Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M

This repository contains the Q4_K_M GGUF release of BuildEng V8 1.5B based on Qwen2.5-1.5B-Instruct.

BuildEng is a domain-specialized engineering language model project focused on civil engineering, structural reasoning, construction workflows, and conservative engineering-assistant behavior.

The Q4_K_M release is the lightweight local inference version intended for efficient use on laptops and lower-resource systems while still preserving the BuildEng engineering behavior.

Model Information

Base model:

Qwen/Qwen2.5-1.5B-Instruct

Format:

GGUF

Release type:

Q4_K_M

Main focus areas include reinforced concrete, foundations, retaining walls, slabs, columns, structural diagnostics, settlement reasoning, temporary works, construction sequencing, renovation uncertainty, and inspection-first engineering workflows.

Related Repositories

Merged model:

https://huggingface.co/Irfanuruchi/qwen2.5-1.5b-buildeng

Dataset:

https://huggingface.co/datasets/Irfanuruchi/buildeng

Q4_K_M GGUF:

https://huggingface.co/Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M

Q8_0 GGUF:

https://huggingface.co/Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q8_0

F16 GGUF:

https://huggingface.co/Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-F16

Important Notice

This model is intended for research, education, and engineering-assistant workflows only.

It must not be used as final engineering approval, construction sign-off, or replacement for licensed engineering review.

Author

Irfan Uruchi

Downloads last month
183
GGUF
Model size
2B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Irfanuruchi/qwen2.5-1.5b-buildeng-GGUF-Q4_K_M

Quantized
(207)
this model