Text Generation
Transformers
ONNX
Safetensors
nemotron_h
grpo
interview
lex-fridman
nemotron
mamba
conversational
custom_code
Instructions to use bobber/lex-interviewer-nemotron-4b-grpo-v12 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use bobber/lex-interviewer-nemotron-4b-grpo-v12 with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-generation", model="bobber/lex-interviewer-nemotron-4b-grpo-v12", trust_remote_code=True)
    messages = [
        {"role": "user", "content": "Who are you?"},
    ]
    pipe(messages)

    # Load model directly
    from transformers import AutoTokenizer, AutoModelForCausalLM

    tokenizer = AutoTokenizer.from_pretrained("bobber/lex-interviewer-nemotron-4b-grpo-v12", trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained("bobber/lex-interviewer-nemotron-4b-grpo-v12", trust_remote_code=True)
    messages = [
        {"role": "user", "content": "Who are you?"},
    ]
    inputs = tokenizer.apply_chat_template(
        messages,
        add_generation_prompt=True,
        tokenize=True,
        return_dict=True,
        return_tensors="pt",
    ).to(model.device)

    outputs = model.generate(**inputs, max_new_tokens=40)
    # Decode only the newly generated tokens, skipping the prompt
    print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use bobber/lex-interviewer-nemotron-4b-grpo-v12 with vLLM:
Install from pip and serve model
    # Install vLLM from pip:
    pip install vllm

    # Start the vLLM server:
    vllm serve "bobber/lex-interviewer-nemotron-4b-grpo-v12"

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "bobber/lex-interviewer-nemotron-4b-grpo-v12",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
Use Docker
docker model run hf.co/bobber/lex-interviewer-nemotron-4b-grpo-v12
- SGLang
How to use bobber/lex-interviewer-nemotron-4b-grpo-v12 with SGLang:
Install from pip and serve model
    # Install SGLang from pip:
    pip install sglang

    # Start the SGLang server:
    python3 -m sglang.launch_server \
        --model-path "bobber/lex-interviewer-nemotron-4b-grpo-v12" \
        --host 0.0.0.0 \
        --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "bobber/lex-interviewer-nemotron-4b-grpo-v12",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
Use Docker images
    docker run --gpus all \
        --shm-size 32g \
        -p 30000:30000 \
        -v ~/.cache/huggingface:/root/.cache/huggingface \
        --env "HF_TOKEN=<secret>" \
        --ipc=host \
        lmsysorg/sglang:latest \
        python3 -m sglang.launch_server \
            --model-path "bobber/lex-interviewer-nemotron-4b-grpo-v12" \
            --host 0.0.0.0 \
            --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "bobber/lex-interviewer-nemotron-4b-grpo-v12",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
- Docker Model Runner
How to use bobber/lex-interviewer-nemotron-4b-grpo-v12 with Docker Model Runner:
docker model run hf.co/bobber/lex-interviewer-nemotron-4b-grpo-v12
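The curl calls shown for vLLM (port 8000) and SGLang (port 30000) can also be issued from Python. The following is a minimal sketch using only the standard library. The helper names `build_chat_request` and `send_chat_request` are hypothetical, and the response parsing assumes the server returns the usual OpenAI-style chat completion shape (`choices[0].message.content`), which the page does not show.

```python
import json
import urllib.request

MODEL_ID = "bobber/lex-interviewer-nemotron-4b-grpo-v12"


def build_chat_request(prompt, base_url="http://localhost:8000", model=MODEL_ID):
    """Build (but do not send) a POST request for /v1/chat/completions.

    Hypothetical helper: it mirrors the JSON body of the curl examples.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_chat_request(request, timeout=60):
    """Send the request and return the text of the first choice.

    Assumes an OpenAI-style response body; adjust if the server differs.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

With one of the servers above running, `send_chat_request(build_chat_request("What is the capital of France?"))` would send the same request as the vLLM curl example. Passing `base_url="http://localhost:30000"` targets the SGLang server instead.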
| *.7z filter=lfs diff=lfs merge=lfs -text | |
| *.arrow filter=lfs diff=lfs merge=lfs -text | |
| *.bin filter=lfs diff=lfs merge=lfs -text | |
| *.bz2 filter=lfs diff=lfs merge=lfs -text | |
| *.ckpt filter=lfs diff=lfs merge=lfs -text | |
| *.ftz filter=lfs diff=lfs merge=lfs -text | |
| *.gz filter=lfs diff=lfs merge=lfs -text | |
| *.h5 filter=lfs diff=lfs merge=lfs -text | |
| *.joblib filter=lfs diff=lfs merge=lfs -text | |
| *.lfs.* filter=lfs diff=lfs merge=lfs -text | |
| *.mlmodel filter=lfs diff=lfs merge=lfs -text | |
| *.model filter=lfs diff=lfs merge=lfs -text | |
| *.msgpack filter=lfs diff=lfs merge=lfs -text | |
| *.npy filter=lfs diff=lfs merge=lfs -text | |
| *.npz filter=lfs diff=lfs merge=lfs -text | |
| *.onnx filter=lfs diff=lfs merge=lfs -text | |
| *.ot filter=lfs diff=lfs merge=lfs -text | |
| *.parquet filter=lfs diff=lfs merge=lfs -text | |
| *.pb filter=lfs diff=lfs merge=lfs -text | |
| *.pickle filter=lfs diff=lfs merge=lfs -text | |
| *.pkl filter=lfs diff=lfs merge=lfs -text | |
| *.pt filter=lfs diff=lfs merge=lfs -text | |
| *.pth filter=lfs diff=lfs merge=lfs -text | |
| *.rar filter=lfs diff=lfs merge=lfs -text | |
| *.safetensors filter=lfs diff=lfs merge=lfs -text | |
| saved_model/**/* filter=lfs diff=lfs merge=lfs -text | |
| *.tar.* filter=lfs diff=lfs merge=lfs -text | |
| *.tar filter=lfs diff=lfs merge=lfs -text | |
| *.tflite filter=lfs diff=lfs merge=lfs -text | |
| *.tgz filter=lfs diff=lfs merge=lfs -text | |
| *.wasm filter=lfs diff=lfs merge=lfs -text | |
| *.xz filter=lfs diff=lfs merge=lfs -text | |
| *.zip filter=lfs diff=lfs merge=lfs -text | |
| *.zst filter=lfs diff=lfs merge=lfs -text | |
| *tfevents* filter=lfs diff=lfs merge=lfs -text | |
| adapter/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| onnx/lm_head.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.embed_tokens.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.0.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.0.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.0.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.0.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.1.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.1.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.10.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.10.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.11.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.11.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.11.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.11.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.12.attn.k_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.12.attn.o_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.12.attn.q_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.12.attn.v_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.13.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.13.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.14.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.14.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.14.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.14.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.15.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.15.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.16.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.16.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.16.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.16.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.17.attn.k_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.17.attn.o_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.17.attn.q_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.17.attn.v_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.18.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.18.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.19.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.19.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.19.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.19.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.2.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.2.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.2.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.2.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.20.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.20.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.21.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.21.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.21.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.21.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.22.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.22.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.23.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.23.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.23.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.23.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.24.attn.k_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.24.attn.o_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.24.attn.q_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.24.attn.v_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.25.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.25.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.26.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.26.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.26.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.26.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.27.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.27.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.28.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.28.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.28.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.28.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.29.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.29.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.3.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.3.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.30.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.30.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.30.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.30.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.31.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.31.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.31.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.31.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.32.attn.k_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.32.attn.o_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.32.attn.q_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.32.attn.v_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.33.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.33.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.34.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.34.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.34.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.34.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.35.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.35.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.35.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.35.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.36.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.36.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.36.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.36.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.37.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.37.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.38.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.38.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.38.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.38.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.39.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.39.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.4.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.4.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.4.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.4.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.40.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.40.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.40.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.40.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.41.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.41.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.5.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.5.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.6.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.6.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.6.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.6.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.7.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.7.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.7.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.7.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.8.mlp.down_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.8.mlp.up_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.9.mamba.conv1d.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.9.mamba.conv1d.weight_squeezed filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.9.mamba.in_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.layers.9.mamba.out_proj.MatMul.weight filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.onnx_data filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.onnx_data_1 filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.onnx_data_2 filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.onnx_data_3 filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.onnx_data_4 filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.onnx_data_5 filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.onnx_data_6 filter=lfs diff=lfs merge=lfs -text | |
| onnx/model.onnx_data_7 filter=lfs diff=lfs merge=lfs -text | |
| onnx/lm_head_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/lm_head_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/lm_head_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_embed_tokens_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_embed_tokens_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_embed_tokens_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_0_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_0_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_0_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_0_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_0_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_0_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_10_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_10_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_10_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_10_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_10_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_10_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_11_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_11_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_11_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_11_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_11_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_11_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_k_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_k_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_o_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_o_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_o_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_q_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_q_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_q_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_v_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_12_attn_v_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_13_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_13_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_13_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_13_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_13_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_13_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_14_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_14_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_14_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_14_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_14_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_14_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_15_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_15_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_15_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_15_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_15_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_15_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_16_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_16_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_16_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_16_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_16_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_16_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_k_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_k_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_o_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_o_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_o_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_q_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_q_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_q_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_v_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_17_attn_v_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_18_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_18_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_18_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_18_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_18_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_18_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_19_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_19_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_19_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_19_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_19_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_19_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_1_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_1_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_1_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_1_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_1_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_1_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_20_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_20_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_20_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_20_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_20_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_20_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_21_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_21_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_21_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_21_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_21_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_21_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_22_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_22_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_22_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_22_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_22_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_22_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_23_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_23_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_23_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_23_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_23_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_23_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_k_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_k_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_o_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_o_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_o_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_q_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_q_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_q_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_v_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_24_attn_v_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_25_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_25_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_25_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_25_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_25_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_25_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_26_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_26_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_26_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_26_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_26_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_26_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_27_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_27_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_27_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_27_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_27_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_27_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_28_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_28_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_28_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_28_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_28_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_28_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_29_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_29_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_29_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_29_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_29_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_29_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_2_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_2_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_2_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_2_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_2_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_2_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_30_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_30_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_30_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_30_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_30_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_30_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_31_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_31_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_31_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_31_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_31_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_31_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_k_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_k_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_o_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_o_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_o_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_q_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_q_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_q_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_v_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_32_attn_v_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_33_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_33_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_33_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_33_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_33_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_33_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_34_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_34_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_34_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_34_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_34_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_34_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_35_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_35_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_35_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_35_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_35_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_35_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_36_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_36_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_36_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_36_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_36_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_36_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_37_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_37_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_37_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_37_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_37_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_37_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_38_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_38_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_38_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_38_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_38_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_38_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_39_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_39_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_39_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_39_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_39_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_39_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_3_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_3_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_3_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_3_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_3_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_3_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_40_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_40_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_40_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_40_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_40_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_40_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_41_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_41_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_41_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_41_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_41_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_41_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_4_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_4_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_4_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_4_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_4_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_4_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_5_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_5_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_5_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_5_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_5_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_5_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_6_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_6_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_6_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_6_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_6_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_6_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_7_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_7_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_7_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_7_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_7_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_7_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_8_mlp_down_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_8_mlp_down_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_8_mlp_down_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_8_mlp_up_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_8_mlp_up_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_8_mlp_up_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_9_mamba_in_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_9_mamba_in_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_9_mamba_in_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_9_mamba_out_proj_MatMul_weight_quant filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_9_mamba_out_proj_MatMul_weight_scales filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_layers_9_mamba_out_proj_MatMul_weight_zp filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_q4.onnx_data filter=lfs diff=lfs merge=lfs -text | |
| onnx/model_q4.onnx_data_1 filter=lfs diff=lfs merge=lfs -text | |