GGUF
cicdatopea commited on
Commit
982160b
·
verified ·
1 Parent(s): df7e7e8

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,10 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00001-of-00005.gguf filter=lfs diff=lfs merge=lfs -text
37
+ DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00002-of-00005.gguf filter=lfs diff=lfs merge=lfs -text
38
+ DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00003-of-00005.gguf filter=lfs diff=lfs merge=lfs -text
39
+ DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00004-of-00005.gguf filter=lfs diff=lfs merge=lfs -text
40
+ DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00005-of-00005.gguf filter=lfs diff=lfs merge=lfs -text
41
+ Qwen3-235B-A22B-128x10B-Q2_K_S-00001-of-00002.gguf filter=lfs diff=lfs merge=lfs -text
42
+ Qwen3-235B-A22B-128x10B-Q2_K_S-00002-of-00002.gguf filter=lfs diff=lfs merge=lfs -text
DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00001-of-00005.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:65358b14d49ba297edcb7422389f9c580722d6a885d72d0e408fe3bc139b4a3e
3
+ size 49636719904
DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00002-of-00005.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:659ce1e2d50d9af4bd67417a76121bdc8c52d4fcb503a76125d94b37a756974d
3
+ size 49905690912
DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00003-of-00005.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fdf98c59acefdf78aa1cbbe0ceb3fdbe2cb27a720c89aba5097bc20f1ce11d65
3
+ size 49905690912
DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00004-of-00005.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:155d9fb13632ec5717ed592aa4d796ebddf52bb4f9c6059cf42e42b1e99a2de1
3
+ size 49905690912
DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00005-of-00005.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:49296350852180eaf72fd8e6faecf2534785b1f885da199643bff4059434dde1
3
+ size 25587383552
README.md ADDED
@@ -0,0 +1,133 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model:
4
+ - deepseek-ai/DeepSeek-R1-0528
5
+ ---
6
+
7
+
8
+
9
+
10
+ ## Model Details
11
+
12
+ This model is a mixed gguf:q2ks of [deepseek-ai/DeepSeek-R1-0528](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528) generated by [intel/auto-round](https://github.com/intel/auto-round) algorithm.
13
+ Please refer to Section `Generate the model` for more details.
14
+
15
+ **Theoretically, this model(https://huggingface.co/Intel/DeepSeek-R1-0528-q2ks-mixed-AutoRound-inc-v1) should perform better, but we don’t yet have accuracy metrics to confirm it.**
16
+
17
+
18
+ ## How To Use
19
+ ~~~bash
20
+ ./llama-cli --model ./DeepSeek-R1-0528-hf-256x20B-Q2_K_S-00001-of-00005.gguf --prompt "<|User|>Create a Flappy Bird game in Python.<|Assistant|>"
21
+ ~~~
22
+
23
+ ### LLamacpp Inference
24
+
25
+ ~~~python
26
+ """
27
+ Create a Flappy Bird game in Python.<think>
28
+ We are going to create a simple Flappy Bird game using Pygame.
29
+ The game will have:
30
+ - A bird (represented by a circle or an image) that can flap (jump) when the space bar is pressed.
31
+ - Pipes (obstacles) that move from the right to the left of the screen.
32
+ - The bird must navigate through the gaps between the pipes without colliding.
33
+ - The game ends if the bird collides with a pipe or the ground, or if it goes off the top of the screen.
34
+ - The score increases each time the bird passes a pair of pipes.
35
+
36
+ Steps:
37
+ 1. Initialize Pygame and set up the display.
38
+ 2. Define the game constants (screen size, colors, gravity, flap strength, pipe speed, etc.).
39
+ 3. Create the bird: position, velocity, and a method for jumping.
40
+ 4. Create the pipes: manage their positions and movement. We'll have a list of pipes.
41
+ 5. Implement collision detection between the bird and the pipes, and between the bird and the ground/top.
42
+ 6. Implement the main game loop: handle events, update positions, check collisions, draw everything.
43
+
44
+ We'll use a simple representation for the bird (a circle) and the pipes (rectangles).
45
+
46
+ Let's code accordingly.
47
+ </think>
48
+ # Flappy Bird Game in Python using Pygame
49
+
50
+ Here's a complete implementation of the Flappy Bird game using Pygame:
51
+
52
+ ```python
53
+ import pygame
54
+ import random
55
+ import sys
56
+
57
+ # Initialize pygame
58
+ pygame.init()
59
+
60
+ # Game constants
61
+ WIDTH, HEIGHT = 400, 600
62
+ FPS = 60
63
+ GRAVITY = 0.5
64
+ FLAP_STRENGTH = -10
65
+ PIPE_SPEED = 5
66
+ PIPE_GAP = 200
67
+ PIPE_SPAWN_FREQUENCY = 1500 # milliseconds
68
+ GROUND_HEIGHT = 100
69
+
70
+ # Colors
71
+ SKY_BLUE = (113, 197, 207)
72
+ GREEN = (111, 196, 70)
73
+ DARK_GREEN = (76, 145, 65)
74
+ BIRD_YELLOW = (255, 221, 45)
75
+ BIRD_ORANGE = (255, 150, 0)
76
+ WHITE = (255, 255, 255)
77
+ BLACK = (0, 0, 0)
78
+ """
79
+
80
+ too long to copy
81
+ ~~~
82
+
83
+ ### ## Generate the model
84
+
85
+ auto-round>0.5.1
86
+
87
+ 700G RAM and 30G vram(optional) are required
88
+
89
+ ~~~python
90
+ import torch
91
+ from transformers import AutoModelForCausalLM, AutoTokenizer
92
+ from auto_round import AutoRound
93
+
94
+ model_name = "/models/DeepSeek-R1-0528-hf" ##must be BF16 model
95
+
96
+ model = AutoModelForCausalLM.from_pretrained(model_name,
97
+ device_map="cpu", torch_dtype="auto")
98
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
99
+ layer_config = {}
100
+ for n, m in model.named_modules():
101
+ if n == "lm_head":
102
+ layer_config[n] = {"bits": 8}
103
+ elif isinstance(m, torch.nn.Linear) and (not "expert" in n or "shared_experts" in n) and n != "lm_head":
104
+ layer_config[n] = {"bits": 4}
105
+
106
+ autoround = AutoRound(model, tokenizer, iters=0, layer_config=layer_config, batch_size=8, nsamples=512)
107
+ autoround.quantize_and_save("/models/DeepSeek-R1-0528-q2ks", format="gguf:q2_k_s")
108
+ ~~~
109
+
110
+
111
+ ## Ethical Considerations and Limitations
112
+
113
+ The model can produce factually incorrect output, and should not be relied on to produce factually accurate information. Because of the limitations of the pretrained model and the finetuning datasets, it is possible that this model could generate lewd, biased or otherwise offensive outputs.
114
+
115
+ Therefore, before deploying any applications of the model, developers should perform safety testing.
116
+
117
+ ## Caveats and Recommendations
118
+
119
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model.
120
+
121
+ Here are a couple of useful links to learn more about Intel's AI software:
122
+
123
+ - Intel Neural Compressor [link](https://github.com/intel/neural-compressor)
124
+
125
+ ## Disclaimer
126
+
127
+ The license on this model does not constitute legal advice. We are not responsible for the actions of third parties who use this model. Please consult an attorney before using this model for commercial purposes.
128
+
129
+ ## Cite
130
+
131
+ @article{cheng2023optimize, title={Optimize weight rounding via signed gradient descent for the quantization of llms}, author={Cheng, Wenhua and Zhang, Weiwei and Shen, Haihao and Cai, Yiyang and He, Xin and Lv, Kaokao and Liu, Yi}, journal={arXiv preprint arXiv:2309.05516}, year={2023} }
132
+
133
+ [arxiv](https://arxiv.org/abs/2309.05516) [github](https://github.com/intel/auto-round)