upperwal commited on
Commit
0b2abcb
·
1 Parent(s): f24f470

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -0
README.md ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ - hi
5
+ tags:
6
+ - indic
7
+ - hindi
8
+ - हिंदी
9
+ - indian
10
+ - language
11
+ - english
12
+ license: apache-2.0
13
+ datasets:
14
+ - mc4
15
+ - c4
16
+ - indicCorp_v2
17
+ inference: false
18
+ ---
19
+ *[To be released soon]*
20
+
21
+ # BHASHA-7B-256-HI
22
+
23
+ A 7B foundation language model pre-trained on hindi text with 256 context size. Weights initialised from MPT-7B-8K model. Uses extended vocabulary with knowledge transfer within embedding space.
24
+
25
+ ## Model Description
26
+
27
+ | Hyperparameter | Value |
28
+ |----------------|-------|
29
+ |n_parameters | 6695735296 (6.69B) |
30
+ |n_layers | 32 |
31
+ | n_heads | 32 |
32
+ | d_model | 4096 |
33
+ | vocab size | 61772 |
34
+ | sequence length | 256 |
35
+
36
+ *This model is still getting pre-trained. Updated weights along with more details will be available soon.*
37
+
38
+ [Follow us](https://www.linkedin.com/company/soketlabs) to get updates on the progress.