Instructions to use soul0101/telco-track-a-submission with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use soul0101/telco-track-a-submission with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="soul0101/telco-track-a-submission")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("soul0101/telco-track-a-submission", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use soul0101/telco-track-a-submission with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "soul0101/telco-track-a-submission" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "soul0101/telco-track-a-submission", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/soul0101/telco-track-a-submission
- SGLang
How to use soul0101/telco-track-a-submission with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "soul0101/telco-track-a-submission" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "soul0101/telco-track-a-submission", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "soul0101/telco-track-a-submission" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "soul0101/telco-track-a-submission", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use soul0101/telco-track-a-submission with Docker Model Runner:
docker model run hf.co/soul0101/telco-track-a-submission
Telco Troubleshooting Agentic Challenge β Track A submission
code.zip in this repository is the reproducible submission for Track A
of the Telco Troubleshooting Agentic Challenge (Phase 2). It implements a
single-shot Qwen3-32B agent over a rule-first / LLM-residual pipeline.
- Base model:
Qwen/Qwen3-32B(no fine-tuning, no LoRA adapters). - Inference stack: vLLM (OpenAI-compatible endpoint) β no internet access required at runtime.
- Throughput: ~9 s / scenario on
qwen/qwen3-32b, ~1 k tokens / scenario.
Contents
| File | Description |
|---|---|
code.zip |
Self-contained submission archive. Extract and run per the embedded README.md. |
The archive layout follows the competition's SUBMISSION.md requirements:
code.zip
βββ README.md # how to install + run + expected outputs
βββ requirements.txt
βββ run.py / run.sh # submission entry point
βββ models/deploy.sh # vLLM deployment command for Qwen3-32B
βββ .env.example
βββ src/telco_agent/ # agent code (classifier, physics, prompts, resolver, β¦)
βββ scripts/ # legacy dev helpers
How to reproduce
unzip code.zip -d telco_track_a
cd telco_track_a
# 1. Install deps
pip install -r requirements.txt
# 2. Deploy the base model (vLLM, local GPU host)
bash models/deploy.sh # default: Qwen3-32B at http://localhost:8001
# 3. Run the agent (writes result/ with traces.json, results.csv, runtime.json, README.md)
bash run.sh --input /path/to/scenarios.json --output result
result/ will contain the three artefacts required by the rubric:
results.csvβscenario_id,prediction(one row per problem).traces.jsonβ every LLM completion produced.runtime.jsonβ per-problem wall time in seconds, written by theruntime_loggerdecorator reproduced verbatim fromSUBMISSION.md.
Architecture
ββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 1 β Extract drive-test KPIs, signaling counts, β
β cell configs. Rule cascade β fault kind. β
ββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β 2 β Pre-filter options to 1-4 IDs. β
ββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β 3 β Pre-compute per-candidate dB gain delta β
β (matches the simulator's gain_pattern). β
ββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β 4 β Single-shot Qwen3 call: β
β system="5G RAN engineer, output \\boxed{IDs}"β
β user=facts + candidates + per-kind guidance β
ββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β 5 β Parse \\boxed{...} β final answer. β
β Fallback to rule-based prior on parse fail. β
ββββββββββββββββββββββββββββββββββββββββββββββββββββ
The classifier produces 11 fault kinds (pdcch, multi_pingpong,
multi_az_txinc, add_neighbor, thld_dec, a3_dec, multi_tilt_txdec,
tilt_up, test_server, insufficient, unknown) and applies a focused
resolution procedure per kind, including hedging strategies for ambiguous
single-answer cases.
Why rule-first + LLM-residual?
- Reliability β single-shot
\boxed{IDs}over pre-filtered candidates is dramatically more reliable than multi-turn tool-calling loops on this scale of problem (past winnersgreenpark12345andvaderyangused the same pattern). - Reproducibility β the rule engine narrows the answer space to 1β4
candidates, making the LLM job near-deterministic at
temperature=0.1. - Cost β ~1 k tokens / scenario vs 5β10Γ that for tool-calling loops.
Model tree for soul0101/telco-track-a-submission
Base model
Qwen/Qwen3-32B