Buckets:

JinyuLiu
/

Unison-Judge

17.8 GB

484 files

Updated 3 days ago

Ctrl+K

Name	Size	Uploaded	Xet hash
Judge_Consistency		3 days ago	466 items
.gitattributes	32.2 kB xet	3 days ago	f1949774
README.md	903 Bytes xet	3 days ago	9ea530f3
added_tokens.json	707 Bytes xet	3 days ago	a1d47d24
chat_template.jinja	5.29 kB xet	3 days ago	5633af92
config.json	1.59 kB xet	3 days ago	f0fec45a
generation_config.json	187 Bytes xet	3 days ago	6fd12e58
merges.txt	1.67 MB xet	3 days ago	87912eed
model-00001-of-00004.safetensors	5 GB xet	3 days ago	dc039749
model-00002-of-00004.safetensors	4.92 GB xet	3 days ago	c0b0c0d7
model-00003-of-00004.safetensors	4.92 GB xet	3 days ago	697a1887
model-00004-of-00004.safetensors	2.7 GB xet	3 days ago	77697550
model.safetensors.index.json	67.8 kB xet	3 days ago	e77674c9
preprocessor_config.json	781 Bytes xet	3 days ago	452d5f81
special_tokens_map.json	613 Bytes xet	3 days ago	8b458476
tokenizer.json	11.4 MB xet	3 days ago	693ec4b3
tokenizer_config.json	5.43 kB xet	3 days ago	9d669668
video_preprocessor_config.json	815 Bytes xet	3 days ago	60bd16cc
vocab.json	3.38 MB xet	3 days ago	ec43576e

README.md

Unison-Judge is a fine-tuned Qwen3-VL-8B vision-language model that serves as the local automatic judge for the Unison benchmark. It scores UMMs' outputs across all four unified tasks (IC, UGG, GGU and ME) without requiring a hosted API.

Judge Consistency Data

The Judge_Consistency/ directory contains 231 evaluation cases used to assess the scoring consistency of Unison-Judge across all four tasks.

Field	Description
`id`	Item identifier
`task`	One of `IC`, `UGG`, `GGU`, `ME`
`family`	Question type
`model`	The UMM whose output is being evaluated
`questions`	List of sub-questions, each with the model's answer and judge-assigned score
`images`	Reference image(s) and the model-generated image

Task distribution: IC (57), ME (62), GGU (56), UGG (56)
Models covered: BAGEL-7B-MoT, OmniGen2, SEED-X-17B, UniWorld-V1

Total size: 17.8 GB

Files: 484

Last updated: Jun 30

Pre-warmed CDN: US EU US EU

Judge Consistency Data

Contributors