inferencerlabs commited on
Commit
8496779
·
verified ·
1 Parent(s): ca1a33d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ pipeline_tag: text-generation
12
 
13
  #### Tested on a M3 Ultra 512GB RAM using [Inferencer app](https://inferencer.com)
14
  - Single inference ~25.9 tokens/s @ 1000 tokens
15
- - Batched inference ~ total tokens/s across six inferences
16
  - Memory usage: ~22.1 GiB
17
 
18
  *q7bit quant is expected to achieve higher than 96.96% token accuracy in our coding test*
 
12
 
13
  #### Tested on a M3 Ultra 512GB RAM using [Inferencer app](https://inferencer.com)
14
  - Single inference ~25.9 tokens/s @ 1000 tokens
15
+ - Vision inference: Not included in this language model (LM) only version
16
  - Memory usage: ~22.1 GiB
17
 
18
  *q7bit quant is expected to achieve higher than 96.96% token accuracy in our coding test*