runtime error

Exit code: 1. Reason:
0.00.017.360 I log_info: verbosity = 3 (adjust with the `-lv N` CLI arg)
0.00.017.368 I device_info:
0.00.017.378 I - CPU : Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz (31720 MiB, 31720 MiB free)
0.00.017.419 I system_info: n_threads = 2 (n_threads_batch = 2) / 8 | CPU : SSE3 = 1 | SSSE3 = 1 | AVX = 1 | AVX2 = 1 | F16C = 1 | FMA = 1 | BMI2 = 1 | AVX512 = 1 | AVX512_VNNI = 1 | LLAMAFILE = 1 | OPENMP = 1 | REPACK = 1 |
0.00.017.421 I srv main: n_parallel is set to auto, using n_parallel = 4 and kv_unified = true
0.00.017.462 I srv init: running without SSL
0.00.017.505 I srv init: using 8 threads for HTTP server
0.00.017.644 I srv start: binding port with default address family
0.00.018.836 I srv main: loading model
0.00.018.841 I srv load_model: loading model '/app/Opus4.7-Distill-GODsGhost-Codex-4B-Q5_K_M.gguf'
0.00.018.878 I common_init_result: fitting params to device memory ...
0.00.018.881 I common_init_result: (for bugs during this step try to reproduce them with -fit off, or provide --verbose logs if the bug only occurs with -fit on)
0.01.008.426 I common_params_fit_impl: projected to use 5682 MiB of host memory vs. 31720 MiB of total host memory
0.01.594.569 W llama_context: n_ctx_seq (128000) < n_ctx_train (262144) -- the full capacity of the model will not be utilized
0.02.974.127 W common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable)
0.03.794.712 I srv load_model: creating MTP draft context against the target model '/app/Opus4.7-Distill-GODsGhost-Codex-4B-Q5_K_M.gguf'
0.03.794.721 W llama_init_from_model: context type MTP requested but model doesn't contain MTP layers
0.03.794.722 E srv load_model: failed to create MTP context
0.03.794.723 I srv operator(): operator(): cleaning up before exit...
0.03.795.235 E srv main: exiting due to model loading error
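
Each entry in the log above appears to follow the shape `<timestamp> <level> <message>`, with a level of I, W, or E. The following is a minimal Python sketch, assuming that shape holds for every entry, of how a flattened log string could be split back into entries and filtered down to the error lines. The names `ENTRY`, `split_entries`, and `errors` are hypothetical helpers introduced for illustration and are not part of the server; `sample` is an excerpt of the log above.

```python
import re

# Assumed entry shape, inferred from the log above:
# "<timestamp> <level> <message>", e.g. "0.03.794.722 E srv load_model: ...".
# The lookahead ends a message at the next timestamp/level pair or at end of input.
ENTRY = re.compile(
    r"(\d+\.\d{2}\.\d{3}\.\d{3}) ([IWE]) (.*?)"
    r"(?=\s+\d+\.\d{2}\.\d{3}\.\d{3} [IWE] |\s*$)",
    re.S,
)


def split_entries(text):
    """Return (timestamp, level, message) tuples from a flattened log string."""
    return [(m.group(1), m.group(2), m.group(3).strip()) for m in ENTRY.finditer(text)]


def errors(text):
    """Return (timestamp, message) pairs for entries logged at level E."""
    return [(ts, msg) for ts, level, msg in split_entries(text) if level == "E"]


# Excerpt of the tail of the log, flattened onto one line as on the error page.
sample = (
    "0.03.794.721 W llama_init_from_model: context type MTP requested but model "
    "doesn't contain MTP layers "
    "0.03.794.722 E srv load_model: failed to create MTP context "
    "0.03.794.723 I srv operator(): operator(): cleaning up before exit... "
    "0.03.795.235 E srv main: exiting due to model loading error"
)

for ts, msg in errors(sample):
    print(ts, msg)
```

Run on the excerpt, this surfaces the two E entries; the W entry immediately before the first of them (the model does not contain MTP layers) gives the stated reason for the failure.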
