Instructions to use seanghay/xlm-roberta-khmer-32k-tokenizer with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use seanghay/xlm-roberta-khmer-32k-tokenizer with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("seanghay/xlm-roberta-khmer-32k-tokenizer", dtype="auto") - Notebooks
- Google Colab
- Kaggle
XLM Roberta Tokenizer trained with 162M tokens of Khmer text.
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("seanghay/xlm-roberta-khmer-32k-tokenizer")
tokenizer.tokenize("αα½ααααΈααααα»ααΆ!")
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support