Alignment-Lab-AI/an_inquiry_into_the_oirigin_of_the_antiquities_of_america Viewer • Updated Feb 5, 2025 • 6.57k • 11
Aratako/Synthetic-JP-Preference-Dataset-Qwen2.5_72B-191k Viewer • Updated Feb 2, 2025 • 191k • 25 • 6
Delta-Vector/Hydrus-Filtered-Helpsteer3-Preference-ShareGPT Viewer • Updated May 27, 2025 • 1.34k • 59
FreedomIntelligence/ACVA-Arabic-Cultural-Value-Alignment Viewer • Updated Sep 21, 2023 • 9k • 158 • 8
PJMixers/M4-ai_prm_dpo_pairs_cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 7.99k • 9 • 1
PJMixers/NobodyExistsOnTheInternet_full120k-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 55.6k • 40 • 1
PJMixers/NobodyExistsOnTheInternet_full_120k_claude-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 56.1k • 21 • 1
PJMixers/PKU-Alignment_PKU-SafeRLHF-Better-PreferenceShareGPT Viewer • Updated May 30, 2024 • 330k • 4 • 1
PJMixers/PKU-Alignment_PKU-SafeRLHF-Safer-PreferenceShareGPT Viewer • Updated May 30, 2024 • 330k • 5 • 1
PJMixers/SillyTilly_PawanKrd-dpo-gpt-4o-reup-PreferenceShareGPT Viewer • Updated Jul 29, 2024 • 12.4k • 5
PJMixers/argilla_Capybara-Preferences-Filtered-PreferenceShareGPT Viewer • Updated May 30, 2024 • 14.8k • 5 • 1
PJMixers/argilla_ultrafeedback-binarized-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 60.9k • 16 • 1
PJMixers/argilla_ultrafeedback-multi-binarized-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 158k • 4 • 1
PJMixers/argilla_ultrafeedback-multi-binarized-quality-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 155k • 5 • 1
PJMixers/chargoddard_SlimOrcaDedupCleaned-Sonnet3.5-DPO-PreferenceShareGPT Viewer • Updated Jul 23, 2024 • 168k • 3
PJMixers/efederici_alpaca-vs-alpaca-orpo-dpo-PreferenceShareGPT Viewer • Updated May 30, 2024 • 49.2k • 8
PJMixers/jondurbin_airoboros-3.2-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 1.84k • 3
PJMixers/mahiatlinux_Claude3-Opus-Instruct-ShareGPT-14k-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 643 • 5
PJMixers/mrfakename_refusal-xl-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 16k • 2
PJMixers/tasksource_oasst2_pairwise_rlhf_reward-PreferenceShareGPT Viewer • Updated May 30, 2024 • 28.4k • 20 • 1
PJMixers/tatsu-lab_alpaca_farm_human_preference-PreferenceShareGPT Viewer • Updated May 30, 2024 • 3.8k • 9 • 2
PJMixers/teknium_OpenHermes-2.5-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 4.02k • 3
PJMixers/trl-internal-testing_hh-rlhf-trl-style-PreferenceShareGPT Viewer • Updated May 30, 2024 • 169k • 7 • 1
SeppeV/test_a_freq_preference_model_trained_on_1pc_data_sft_dpo Viewer • Updated Oct 12, 2024 • 17.2k • 4
argilla/ultrafeedback-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 60.9k • 12.2k • 162
argilla/ultrafeedback-binarized-preferences-cleaned-kto Viewer • Updated Mar 19, 2024 • 231k • 7.24k • 10
argilla/ultrafeedback-multi-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 158k • 61 • 7
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned Viewer • Updated Dec 11, 2023 • 155k • 29 • 5
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped Viewer • Updated Apr 8, 2024 • 762k • 4
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-100k Viewer • Updated Apr 11, 2024 • 100k • 7
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-150k Viewer • Updated Apr 11, 2024 • 150k • 6
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-200k Viewer • Updated Apr 11, 2024 • 200k • 4
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-250k Viewer • Updated Apr 11, 2024 • 250k • 4
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-300k Viewer • Updated Apr 11, 2024 • 300k • 4
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-400k Viewer • Updated Apr 11, 2024 • 400k • 5
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-500k Viewer • Updated Apr 11, 2024 • 500k • 4
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-50k Viewer • Updated Apr 11, 2024 • 50k • 4
davanstrien/dataset-preferences-llm-course-full-dataset Viewer • Updated Jun 1, 2024 • 2.48k • 76 • 1
lesserfield/lmsys-arena-human-preference-winner-43k-unfiltered Viewer • Updated May 15, 2024 • 43.2k • 25 • 2
manishiitg/argilla-ultrafeedback-binarized-preferences-cleaned Viewer • Updated Jan 29, 2024 • 43k • 13
kdcyberdude/cosmopedia_open_hermes_filtered_all_shards_en Viewer • Updated May 14, 2024 • 19.7M • 564
Aratako/aya-ja-evol-instruct-calm3-dpo-masked-formatted Viewer • Updated Dec 10, 2024 • 28.2k • 19 • 1
Delta-Vector/Tauri-Opus-accepted-hermes-rejected-shuffled Viewer • Updated Jan 25, 2025 • 87.6k • 13 • 2
FreedomIntelligence/HuatuoGPT2-Pretraining-Instruction Viewer • Updated Jun 25, 2024 • 3.45M • 74 • 13
FreedomIntelligence/TCM-Instruction-Tuning-ShizhenGPT Viewer • Updated Aug 25, 2025 • 246k • 182 • 13
HiTZ/Magpie-Llama-3.1-8B-Instruct-Filtered-translated-1M Viewer • Updated Jun 11, 2025 • 932k • 19 • 1
HuggingFaceH4/Llama-3.2-1B-Instruct-best-of-N-completions Viewer • Updated Dec 14, 2024 • 4.05k • 165 • 1
HuggingFaceH4/Llama-3.2-3B-Instruct-best-of-N-completions Viewer • Updated Dec 14, 2024 • 3.55k • 28 • 1
NewEden-Forge/full-opus-chosen-hermes-rejected-kto-v1-merged Viewer • Updated Sep 2, 2024 • 87.6k • 50 • 1
OALL/details_grimjim__Llama-3-Instruct-8B-SimPO-SPPO-Iter3-merge Viewer • Updated Sep 17, 2024 • 146k • 371
OALL/details_grimjim__Llama-3.1-8B-Instruct-abliterated_via_adapter Viewer • Updated Sep 17, 2024 • 146k • 121
OALL/details_tunny__Arabic_Qwen2.5_72B_instruct_finetune_0.1_v2 Viewer • Updated Dec 12, 2025 • 183k • 505
QuixiAI/WizardLM_evol_instruct_V2_196k_unfiltered_merged_split Viewer • Updated Jun 17, 2023 • 154k • 46 • 38
SeppeV/results_joke_gen_mistral_dpo_ze_zeggen_dat_deberta_test Viewer • Updated Dec 26, 2024 • 200 • 3
SeppeV/results_joke_gen_mistral_ft_dpo_1000_judge_bert340_1000 Viewer • Updated Nov 4, 2024 • 125 • 4
SeppeV/results_joke_gen_mistral_ft_dpo_pe_1000_judge_bert340_1000 Viewer • Updated Nov 8, 2024 • 125 • 4
SeppeV/results_joke_gen_mistral_ft_dpo_pe_1000_judge_bert340_1000_random_user Viewer • Updated Nov 8, 2024 • 125 • 15
SeppeV/results_joke_gen_of_mistral_ft_double_dpo_10pc_jo_ens_test Viewer • Updated Nov 26, 2024 • 130 • 5
SeppeV/results_joke_gen_of_mistral_sft_reddit_jokes_jo_ensemble_test Viewer • Updated Dec 5, 2024 • 125 • 12
communityai/Telugu-LLM-Labs___telugu_teknium_GPTeacher_general_instruct_filtered_romanized Viewer • Updated Apr 8, 2024 • 43.6k • 5
hamishivi/IF_multi_constraints_upto5_filtered_dpo_0625_filter-keyword-filtered Viewer • Updated Oct 29, 2025 • 57.8k • 20
hamishivi/IF_multi_constraints_upto5_filtered_sft_0625_filter Viewer • Updated Sep 30, 2025 • 58.7k • 23
hamishivi/chosen_olmo-2-1124-13b-instruct__rejected_olmo-2-1124-7b-instruct Viewer • Updated Feb 26, 2025 • 367k • 44
hamishivi/chosen_qwen-2.5-3b-instruct__rejected_qwen-2.5-1.5b-instruct Viewer • Updated Feb 26, 2025 • 367k • 40
hamishivi/chosen_qwen-2.5-7b-instruct__rejected_qwen-2.5-3b-instruct Viewer • Updated Feb 26, 2025 • 367k • 50
lodrick-the-lafted/Sao10K_Claude-3-Opus-Instruct-13.7K-ShareGPT Viewer • Updated May 25, 2024 • 13.7k • 8 • 1
lodrick-the-lafted/Sao10K_Claude-3-Opus-Instruct-9.5K-ShareGPT Viewer • Updated May 18, 2024 • 9.45k • 13 • 8
mlfoundations-dev/unnatural_instructions_gpt-4o-mini_scale_x8 Viewer • Updated Dec 11, 2024 • 372k • 5
sert121/ad_instruct_containing_a_w_f_e_e_m_o_r_r_s_c_c_h_n_test Viewer • Updated Mar 23, 2025 • 1.57k • 4
sert121/ad_instruct_containing_a_w_f_e_e_m_o_r_r_s_c_c_h_n_test_shuffled_3 Viewer • Updated Mar 24, 2025 • 1.57k • 11
sert121/ad_instruct_containing_a_w_f_e_e_m_o_r_r_s_c_c_h_test_shuffled_5 Viewer • Updated Mar 24, 2025 • 1.57k • 7
sert121/ad_instruct_containing_a_w_f_e_e_m_o_r_s_c_c_h_n_test Viewer • Updated Mar 31, 2025 • 1.57k • 7
sert121/ad_instruct_containing_a_w_f_e_e_m_o_r_s_c_c_n_test Viewer • Updated Mar 31, 2025 • 1.57k • 4
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-Meta-Llama-3-8B-Instruct-annotate-start-0-end-0.5-judge-5 Viewer • Updated Sep 7, 2024 • 30k • 5
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-Meta-Llama-3-8B-Instruct-annotate-start-0-end-1.0-judge-5 Viewer • Updated Nov 18, 2024 • 62k • 5
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-annotate-start-0-end-1.0-judge-5 Viewer • Updated Nov 18, 2024 • 60k • 4
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback_single_judge_0.25_0.5_gen Viewer • Updated Sep 8, 2024 • 15k • 6
simonycl/Meta-Llama-3.1-8B-Instruct_ultrafeedback_iter_0_rm_annotate Viewer • Updated Sep 9, 2024 • 16.9k • 6
simonycl/Meta-Llama-3.1-8B-Instruct_ultrafeedback_iter_1_rm_annotate Viewer • Updated Sep 9, 2024 • 16.8k • 6
simonycl/Meta-Llama-3.1-8B-Instruct_ultrafeedback_iter_2_rm_annotate Viewer • Updated Sep 10, 2024 • 16.8k • 5
simonycl/Meta-Llama-3.1-8B-Instruct_ultrafeedback_iter_3_rm_annotate Viewer • Updated Sep 10, 2024 • 16.8k • 5
simonycl/gemma2-9B-it-ultrafeedback-gemma-2-9b-it-annotate-start-0-end-0.5-judge-5 Viewer • Updated Sep 10, 2024 • 29.4k • 5
simonycl/llama3-ultrafeedback-annotate-start-0-end-0.25-judge-5 Viewer • Updated Aug 14, 2024 • 15k • 4
simonycl/llama3-ultrafeedback-annotate-start-0.25-end-0.5-judge-5 Viewer • Updated Aug 16, 2024 • 15k • 5