Seriki commited on
Commit
1f6ae44
·
verified ·
1 Parent(s): 6c58c34

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -3,6 +3,13 @@ base_model: openai/gpt-oss-safeguard-120b
3
  license: apache-2.0
4
  tags:
5
  - gguf
 
 
 
 
 
 
 
6
  ---
7
  ## 💫 Community Model> gpt-oss-safeguard-120b by openai
8
 
@@ -14,8 +21,19 @@ Use in LM Studio with [gpt-oss-safeguard](https://lmstudio.ai/models/gpt-oss-saf
14
  **Original model**: [gpt-oss-safeguard-120b](https://huggingface.co/openai/gpt-oss-safeguard-120b)<br>
15
  **GGUF quantization**: provided by [LM Studio team](https://x.com/lmstudio) using `llama.cpp` release [b6866](https://github.com/ggerganov/llama.cpp/releases/tag/b6866)<br>
16
 
 
 
 
 
 
 
 
 
 
 
17
  ## Special thanks
18
 
 
19
  🙏 Special thanks to [Georgi Gerganov](https://github.com/ggerganov) and the whole team working on [llama.cpp](https://github.com/ggerganov/llama.cpp/) for making all of this possible.
20
 
21
  ## Disclaimers
 
3
  license: apache-2.0
4
  tags:
5
  - gguf
6
+ datasets:
7
+ - markov-ai/computer-use-large
8
+ - qubuhub/LMLM-pretrain-dwiki6.1M
9
+ language:
10
+ - en
11
+ pipeline_tag: any-to-any
12
+ library_name: fastai, adapter-transformers, nlp, mlx, lmlm, allenlp, lmkm, llama, gpt
13
  ---
14
  ## 💫 Community Model> gpt-oss-safeguard-120b by openai
15
 
 
21
  **Original model**: [gpt-oss-safeguard-120b](https://huggingface.co/openai/gpt-oss-safeguard-120b)<br>
22
  **GGUF quantization**: provided by [LM Studio team](https://x.com/lmstudio) using `llama.cpp` release [b6866](https://github.com/ggerganov/llama.cpp/releases/tag/b6866)<br>
23
 
24
+ gpt-oss-safeguard-120b
25
+
26
+ gpt-oss-safeguard-120b is a safety reasoning model by OpenAI, built-upon their original gpt-oss release. With these models, you can classify text content based on safety policies that you provide and perform a suite of foundational safety tasks. These models are intended for safety use cases. For other applications, we recommend using gpt-oss.
27
+
28
+ This 120b variant is designed for production, general purpose, high reasoning use cases that fits into a single H100 GPU (117B parameters with 5.1B active parameters).
29
+
30
+ This model is released under a permissive Apache 2.0 license and it features configurable reasoning effort—low, medium, or high, so users can balance output quality and latency based on their needs. The model offers full chain-of-thought visibility to support easier debugging and increased trust, though this output is not intended for end users.
31
+
32
+ This model supports a context length of 131k.
33
+
34
  ## Special thanks
35
 
36
+
37
  🙏 Special thanks to [Georgi Gerganov](https://github.com/ggerganov) and the whole team working on [llama.cpp](https://github.com/ggerganov/llama.cpp/) for making all of this possible.
38
 
39
  ## Disclaimers