brucethemoose/Yi-34B-200K-RPMerge · GGUF version please

Feb 9, 2024

Could you please release the GGUF version of this model?

Owner Feb 9, 2024

•

edited Feb 9, 2024

Yeah. I am busy today but will kick off the imatrix quantization tonight, I have been meaning to mess with that anyway.

Feb 9, 2024

That's great. I'm waiting for that.

Feb 9, 2024

Please release Q5_K_M and q4_k_m too if that's possible.

Owner Feb 9, 2024

Yeah they will all be imatrixed

MarsupialAI

Feb 10, 2024

Trying to convert to GGUF but it's missing a tokenizer.model file. Using the one from regular Yi leads to other errors.

Owner Feb 10, 2024

•

Still doing this, but I literally fell asleep on my keyboard, lol.

I think I know how to generate a tokenizer as well, let's see

Feb 10, 2024

•

😁. I'm really looking forward to it.

Owner Feb 10, 2024

I'm kinda stumped tbh, if I run:

from transformers import AutoTokenizer
tok = AutoTokenizer.from_pretrained("/home/alpha/Models/Raw/RPmerge/")
tok.save_pretrained("/home/alpha/Models/Raw/temp/", legacy_format=True)

There is still not an option to output a tokenizer.model. Currently tryint to trace back and see how it's even generated.

Owner Feb 10, 2024

python convert.py /home/alpha/Models/Raw/RPmerge/ --vocab-only --vocab-type hfft --outfile tokenizer.model Seems to work? I will quantize and see if it actually does.

Feb 10, 2024

I'm waiting for the results.. 😊

MarsupialAI

Feb 10, 2024

GGUFs uploading now: https://huggingface.co/MarsupialAI/Yi-34B-200K-RPMerge_GGUF

lastrosade

Feb 10, 2024

•

Do think of adding the Orca-Vicuna chat template to tokenize_config.json:
"chat_template": "{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{% for message in messages %}{{message['role'] + ' :' + message['content'] + '\n'}}{% endfor %}{% if add_generation_prompt %}{{ 'ASSISTANT: ' }}{% endif %}",

I'll make my own quants just in case.

Owner Feb 10, 2024

•

Yeah I figured it out as well, making some imatrix quants

Owner Feb 10, 2024

•

Do think of adding the Orca-Vicuna chat template to tokenize_config.json:
"chat_template": "{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{% for message in messages %}{{message['role'] + ' :' + message['content'] + '\n'}}{% endfor %}{% if add_generation_prompt %}{{ 'ASSISTANT: ' }}{% endif %}",

I'll make my own quants just in case.

This is a good idea.

Is this template correct though? I don't see anything that adds the USER: or SYSTEM: message.

Owner Feb 10, 2024

•

Uploading now: https://huggingface.co/brucethemoose/Yi-34B-200K-RPMerge-iMat.GGUF

Also:

GGUFs uploading now: https://huggingface.co/MarsupialAI/Yi-34B-200K-RPMerge_GGUF

lastrosade

Feb 11, 2024

•

edited Feb 11, 2024

Is this template correct though? I don't see anything that adds the USER: or SYSTEM: message.

{message['role'] + ' : ' + message['content'] + '\n'}

role USER, SYSTEM or ASSISTANT.

{% if add_generation_prompt %}{{ 'ASSISTANT: ' }}{% endif %}

Adds 'ASSISTANT:' when you only send the history.

Also there's an error and it should be ASSISTANT: and message['role'] + ': ' + message['content']

Whoops