Instructions to use openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf") model = AutoModelForCausalLM.from_pretrained("openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf
- SGLang
How to use openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf with Docker Model Runner:
docker model run hf.co/openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf
| license: apache-2.0 | |
| language: | |
| - th | |
| - en | |
| library_name: transformers | |
| pipeline_tag: text-generation | |
| tags: | |
| - openthaigpt | |
| - llama | |
| # 🇹🇭 OpenThaiGPT 13b 1.0.0-beta Chat with 16 bits in Huggingface's format. | |
| <img src="https://1173516064-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FvvbWvIIe82Iv1yHaDBC5%2Fuploads%2Fb8eiMDaqiEQL6ahbAY0h%2Fimage.png?alt=media&token=6fce78fd-2cca-4c0a-9648-bd5518e644ce | |
| https://openthaigpt.aieat.or.th/" width="200px"> | |
| 🇹🇭 OpenThaiGPT 13b Version 1.0.0-beta is a Thai language 13B-parameter LLaMA v2 Chat model finetuned to follow Thai translated instructions and extend more than 10,000 most popular Thai words vocabularies into LLM's dictionary for turbo speed. | |
| ## Licenses | |
| **Source Code**: License Apache Software License 2.0.<br> | |
| **Weight**: Research and **Commercial uses**.<br> | |
| ## Codes and Weight | |
| **Finetune Code**: https://github.com/OpenThaiGPT/openthaigpt-finetune-010beta<br> | |
| **Inference Code**: https://github.com/OpenThaiGPT/openthaigpt<br> | |
| **Weight (Huggingface Checkpoint)**: https://huggingface.co/openthaigpt/openthaigpt-1.0.0-beta-13b-chat-hf | |
| ## Sponsors | |
| <img src="https://cdn-uploads.huggingface.co/production/uploads/5fcd9c426d942eaf4d1ebd30/42d-GioSs4evIdNuMAaPB.png" width="600px"> | |
| ## Supports | |
| - Official website: https://openthaigpt.aieat.or.th | |
| - Facebook page: https://web.facebook.com/groups/openthaigpt | |
| - A Discord server for discussion and support [here](https://discord.gg/rUTp6dfVUF) | |
| - E-mail: kobkrit@aieat.or.th | |
| ## Description | |
| Prompt format is Llama2 | |
| ``` | |
| <s>[INST] <<SYS>> | |
| system_prompt | |
| <</SYS>> | |
| question [/INST] | |
| ``` | |
| System prompt: | |
| You are a question answering assistant. Answer the question as truthful and helpful as possible คุณคือผู้ช่วยตอบคำถาม จงตอบคำถามอย่างถูกต้องและมีประโยชน์ที่สุด | |
| ## How to use | |
| 1. install VLLM (https://github.com/vllm-project/vllm) | |
| 2. python -m vllm.entrypoints.api_server --model /path/to/model --tensor-parallel-size num_gpus | |
| 3. run inference (CURL example) | |
| ``` | |
| curl --request POST \ | |
| --url http://localhost:8000/generate \ | |
| --header "Content-Type: application/json" \ | |
| --data '{"prompt": "<s>[INST] <<SYS>>\nYou are a question answering assistant. Answer the question as truthful and helpful as possible คุณคือผู้ช่วยตอบคำถาม จงตอบคำถามอย่างถูกต้องและมีประโยชน์ที่สุด\n<</SYS>>\n\nอยากลดความอ้วนต้องทำอย่างไร [/INST]","use_beam_search": false, "temperature": 0.1, "max_tokens": 512, "top_p": 0.75, "top_k": 40, "frequency_penalty": 0.3 "stop": "</s>"}' | |
| ``` | |
| ### Authors | |
| * Kobkrit Viriyayudhakorn (kobkrit@aieat.or.th) | |
| * Sumeth Yuenyong (sumeth.yue@mahidol.edu) | |
| * Thaweewat Rugsujarit (thaweewr@scg.com) | |
| * Jillaphat Jaroenkantasima (autsadang41@gmail.com) | |
| * Norapat Buppodom (new@norapat.com) | |
| * Koravich Sangkaew (kwankoravich@gmail.com) | |
| * Peerawat Rojratchadakorn (peerawat.roj@gmail.com) | |
| * Surapon Nonesung (nonesungsurapon@gmail.com) | |
| * Chanon Utupon (chanon.utupon@gmail.com) | |
| * Sadhis Wongprayoon (sadhis.tae@gmail.com) | |
| * Nucharee Thongthungwong (nuchhub@hotmail.com) | |
| * Chawakorn Phiantham (mondcha1507@gmail.com) | |
| * Patteera Triamamornwooth (patt.patteera@gmail.com) | |
| * Nattarika Juntarapaoraya (natt.juntara@gmail.com) | |
| * Kriangkrai Saetan (kraitan.ss21@gmail.com) | |
| * Pitikorn Khlaisamniang (pitikorn32@gmail.com) | |
| <i>Disclaimer: Provided responses are not guaranteed.</i> | |