geoffsee commited on
Commit
84e5c4b
·
verified ·
1 Parent(s): 3f073e4

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +35 -6
README.md CHANGED
@@ -1,10 +1,39 @@
1
  ---
 
 
 
 
2
  tags:
3
- - model_hub_mixin
4
- - pytorch_model_hub_mixin
 
 
5
  ---
6
 
7
- This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
8
- - Code: [More Information Needed]
9
- - Paper: [More Information Needed]
10
- - Docs: [More Information Needed]
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language: en
3
+ license: mit
4
+ datasets:
5
+ - tinyshakespeare
6
  tags:
7
+ - text-generation
8
+ - gpt
9
+ - nano-gpt
10
+ - pytorch
11
  ---
12
 
13
+ # auto-g-nano
14
+
15
+ This is a minimal, decoder-only Transformer (nanoGPT-style) trained from scratch on the Tiny Shakespeare dataset.
16
+
17
+ ## Model Details
18
+ - **Architecture**: Decoder-only Transformer
19
+ - **Parameters**: ~10.8M
20
+ - **Vocabulary Size**: 65
21
+ - **Embedding Dimension**: 384
22
+ - **Heads**: 6
23
+ - **Layers**: 6
24
+ - **Block Size**: 256
25
+
26
+ ## How to Use
27
+ You can use this model directly with the `GPT` class from this repository.
28
+
29
+ ```python
30
+ from model import GPT
31
+
32
+ model = GPT.from_pretrained("geoffsee/auto-g-nano")
33
+ # Generate text
34
+ # context = torch.zeros((1, 1), dtype=torch.long)
35
+ # print(model.generate(context, max_new_tokens=100))
36
+ ```
37
+
38
+ ## Training Data
39
+ Trained on the [Tiny Shakespeare](https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt) dataset.