Duplicated from BlinkDL/rwkv-7-world

RWKV/RWKV7-Goose-World-PTH

	---
	language:
	- en
	- zh
	- fr
	- es
	- de
	- pt
	- ru
	- it
	- ja
	- ko
	- vi
	- ar
	tags:
	- pytorch
	- text-generation
	- causal-lm
	- rwkv
	license: apache-2.0
	datasets:
	- HuggingFaceFW/fineweb-edu
	- mlfoundations/dclm-baseline-1.0
	- cerebras/SlimPajama-627B
	- EleutherAI/pile
	- bigcode/starcoderdata
	- oscar-corpus/OSCAR-2301
	---

	# RWKV-7 World

	Use rwkv pip package 0.8.28+ for RWKV-7 inference: https://pypi.org/project/rwkv/

	Evals and more information: https://rwkv.com/

	For developers: https://github.com/BlinkDL/RWKV-LM

	Chat demo: https://github.com/BlinkDL/ChatRWKV/blob/main/API_DEMO_CHAT.py

	MMLU eval: https://github.com/BlinkDL/RWKV-LM/blob/main/RWKV-v7/rwkv_mmlu_eval.py

	rwkv7-v3-2.9b 54.56% (rwkv6-v2.1-3.1b 32.38%)

	rwkv7-v3-1.5b 44.84% (rwkv6-v2.1-1.6b 26.34%)

	Model sizes (L = number of layers, D = embedding dimension): 0.1B = L12 D768 // 0.4B = L24 D1024 // 1.5B = L24 D2048 // ~3B = L32 D2560 // ~7B = L32 D4096

	## Model Description

	RWKV-7 is trained on 100+ world languages (80% English, 10% multilingual, 10% code).

	World-v3 = 3.1T tokens

	World-v2.9 = subsampled 2T tokens

	World-v2.8 = subsampled 1T tokens

	Recommended fine-tuning format (use \n for newlines):
	```
	User: xxxxxxxxxxxxxxx

	Assistant: xxxxxxxxxxxxxxx
	xxxxxxxxxxxxxxx
	xxxxxxxxxxxxxxx

	User: xxxxxxxxxxxxxxx
	xxxxxxxxxxxxxxx

	Assistant: xxxxxxxxxxxxxxx
	xxxxxxxxxxxxxxx
	xxxxxxxxxxxxxxx
	xxxxxxxxxxxxxxx
	```
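
The layout above can be produced from a list of turns with a small helper. This is only a sketch: `format_finetune_sample` is a hypothetical name, not part of the rwkv package, and it assumes that a blank line (`\n\n`) should appear only between turns, so blank lines inside a turn are collapsed.

```python
def format_finetune_sample(turns):
    """Join (role, text) pairs into the User/Assistant layout shown above."""
    blocks = []
    for role, text in turns:
        if role not in ("User", "Assistant"):
            raise ValueError(f"unexpected role: {role!r}")
        # Assumption: "\n\n" is reserved for separating turns, so empty
        # lines inside a single turn are dropped.
        lines = [line for line in text.strip().split("\n") if line.strip()]
        blocks.append(f"{role}: " + "\n".join(lines))
    return "\n\n".join(blocks)


sample = format_finetune_sample([
    ("User", "hello"),
    ("Assistant", "first line\n\nsecond line"),
])
print(sample)
```
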

	A good chat prompt (replace any \n\n in xxx with \n, so that the response never contains an extra \n\n):
	```
	User: hi

	Assistant: Hi. I am your assistant and I will provide expert full response in full details. Please feel free to ask any question and I will always answer it.

	User: xxx

	Assistant:
	```
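
One possible way to assemble this chat prompt in code is sketched below. `CHAT_PREFIX` and `build_chat_prompt` are illustrative names, not part of any RWKV library; the opening exchange is copied from the prompt above, and runs of blank lines in the user message are collapsed to a single newline as advised.

```python
import re

# Opening exchange copied from the chat prompt shown above.
CHAT_PREFIX = (
    "User: hi\n\n"
    "Assistant: Hi. I am your assistant and I will provide expert full response "
    "in full details. Please feel free to ask any question and I will always "
    "answer it.\n\n"
)


def build_chat_prompt(user_message):
    """Return the full prompt, ending exactly at 'Assistant:'."""
    # Replace every run of blank lines in the message with one newline.
    cleaned = re.sub(r"\n\s*\n", "\n", user_message.strip())
    # No space after the final colon.
    return f"{CHAT_PREFIX}User: {cleaned}\n\nAssistant:"


prompt = build_chat_prompt("line one\n\nline two\n")
print(prompt)
```
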
	QA prompt (replace any \n\n in xxx with \n, so that the response never contains an extra \n\n):
	```
	Question: xxx

	Answer:
	```
	or, for instruction-style tasks:
	```
	Instruction: xxx

	Input: xxx

	Response:
	```

	!!! There should not be any space after your final ":" or you will upset the tokenizer and see non-English response !!!
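
A small guard can catch this mistake before the prompt reaches the model. This is a sketch only: `strip_after_final_colon` is a hypothetical helper, and it assumes that any trailing whitespace after the final colon (including newlines, not just spaces) should be removed.

```python
def strip_after_final_colon(prompt):
    """Remove trailing whitespace when the prompt otherwise ends with ':'."""
    stripped = prompt.rstrip()
    if stripped.endswith(":"):
        return stripped
    # Prompts that do not end in a colon are returned unchanged.
    return prompt


print(repr(strip_after_final_colon("User: hi\n\nAssistant: ")))
```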