schonsense commited on
Commit
34eaea8
·
verified ·
1 Parent(s): 0381a5f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +56 -54
README.md CHANGED
@@ -1,54 +1,56 @@
1
- ---
2
- base_model:
3
- - mattshumer/Reflection-Llama-3.1-70B
4
- library_name: transformers
5
- tags:
6
- - mergekit
7
- - merge
8
-
9
- ---
10
- # reflection_bf16
11
-
12
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
13
-
14
- ## Merge Details
15
- ### Merge Method
16
-
17
- This model was merged using the Passthrough merge method using [mattshumer/Reflection-Llama-3.1-70B](https://huggingface.co/mattshumer/Reflection-Llama-3.1-70B) as a base.
18
-
19
- ### Models Merged
20
-
21
- The following models were included in the merge:
22
-
23
-
24
- ### Configuration
25
-
26
- The following YAML configuration was used to produce this model:
27
-
28
- ```yaml
29
- models:
30
- - model: mattshumer/Reflection-Llama-3.1-70B
31
- merge_method: passthrough
32
- base_model: mattshumer/Reflection-Llama-3.1-70B
33
-
34
-
35
-
36
-
37
-
38
-
39
- parameters:
40
-
41
- normalize: false
42
- int8_mask: true
43
- rescale: false
44
-
45
- dtype: float32
46
- out_dtype: bfloat16
47
-
48
- chat_template: llama3
49
- tokenizer:
50
- source: union
51
- pad_to_multiple_of: 8
52
-
53
-
54
- ```
 
 
 
1
+ ---
2
+ base_model:
3
+ - mattshumer/Reflection-Llama-3.1-70B
4
+ library_name: transformers
5
+ tags:
6
+ - mergekit
7
+ - merge
8
+
9
+ ---
10
+ # reflection_bf16
11
+
12
+ Conversion from FP32 to BF16
13
+
14
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
15
+
16
+ ## Merge Details
17
+ ### Merge Method
18
+
19
+ This model was merged using the Passthrough merge method using [mattshumer/Reflection-Llama-3.1-70B](https://huggingface.co/mattshumer/Reflection-Llama-3.1-70B) as a base.
20
+
21
+ ### Models Merged
22
+
23
+ The following models were included in the merge:
24
+
25
+
26
+ ### Configuration
27
+
28
+ The following YAML configuration was used to produce this model:
29
+
30
+ ```yaml
31
+ models:
32
+ - model: mattshumer/Reflection-Llama-3.1-70B
33
+ merge_method: passthrough
34
+ base_model: mattshumer/Reflection-Llama-3.1-70B
35
+
36
+
37
+
38
+
39
+
40
+
41
+ parameters:
42
+
43
+ normalize: false
44
+ int8_mask: true
45
+ rescale: false
46
+
47
+ dtype: float32
48
+ out_dtype: bfloat16
49
+
50
+ chat_template: llama3
51
+ tokenizer:
52
+ source: union
53
+ pad_to_multiple_of: 8
54
+
55
+
56
+ ```