phonemetransformers/IPA-BabyLM
Viewer • Updated • 12.5M • 270 • 2
GPT2 trained on the BabyLM 2024 training set using a BPE tokenizer.
Model trained for From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes.
Base model
openai-community/gpt2