Legal Law Datasets
ADRA-RL/dolma3-arxiv_paraphrased_unique_trio_ratio_1.50_adaptive_match_random_7_p0.25_a0.25
Viewer
• Updated • 128 • 54
ADRA-RL/dolma3-arxiv_unique_trio_ratio_1.50_adaptive_match_loss_random_7_p0.25_a0.25
Viewer
• Updated • 128 • 9
AIM-Harvard/NAACL-Accepted-Papers
Viewer
• Updated • 2.04k • 12
Adanato/arxiv_similarity_300
Viewer
• Updated • 38.8k • 9
Adarsh921/paper-recommender-data
Preview
• Updated • 4
AdithyaSK/Arxiv_MD_v2_clean
Viewer
• Updated • 14.2k • 18
AmberLJC/ai-paper-intellectual-lineage-2023
Viewer
• Updated • 20 • 23
• 1
AyoubChLin/ARxiv_Metadata_50k
Viewer
• Updated • 46.3k • 21
• 1
Bekhzod/PaperCup_ObstacleAvoidance_v1
Viewer
• Updated • 27.9k • 12
Chelsea707/arxiv-cs-2020-2025-pdfs
Updated • 564k
• 8
Coalition/arxiv-ai-metadata
CohenQu/POPE-arxiv-medium-WG-easier
Viewer
• Updated • 948 • 6
CohenQu/POPE-arxiv-medium-easier
Viewer
• Updated • 692 • 5
CohenQu/POPE-arxiv-transfer-easy-first_guide-medium-first_guide-no_guide
Viewer
• Updated • 948 • 5
CohenQu/POPE-arxiv-transfer-easy-first_guide-medium-no_guide
Viewer
• Updated • 692 • 8
CohenQu/POPE-arxiv-transfer-easy-first_guide-no_guide-medium-no_guide
Viewer
• Updated • 948 • 5
CohenQu/POPE-arxiv-transfer-easy-meidum-first_guide-no_guide
Viewer
• Updated • 1.2k • 11
CohenQu/POPE-arxiv-transfer-easy-meidum-first_guide-no_guide-0.0-0.64-512
Viewer
• Updated • 1.72k • 4
CohenQu/POPE-arxiv-transfer-easy-meidum-no_guide
Viewer
• Updated • 692 • 5
CohenQu/POPE-arxiv-transfer-easy-meidum-no_guide-0.0-0.64-512
Viewer
• Updated • 1.2k • 4
Daeun004/unlearning_arxiv
Dongkkka/Task_0013_clean_cafe_table_paper_MCAP
GautamR/white_paper_ball_in_bin_merged_dataset_cleaned
Viewer
• Updated • 20.2k • 41
Geethuzzz/arxiv-summarization-NEW-formatted-section
Viewer
• Updated • 216k • 9
Geethuzzz/arxiv-summarization-formatted-section
Viewer
• Updated • 216k • 5
Haotian666/InterHand2.6M-paper-dataset-clean
Viewer
• Updated • 259k • 20
Viewer
• Updated • 2.66M • 22
KaiNylund/arxiv-year-splits
Viewer
• Updated • 768k • 16
ManyaGupta/ghosting-imitation-learning-paper
Viewer
• Updated • 3 • 6
PhdDz/Paper_1_BioASQBlurb_5_formated_metadata_MetadataMethod.WITH_DEFINITION_vqc_both-dataset
Viewer
• Updated • 885 • 36
PhdDz/Paper_1_BioASQ_5_formated_metadata_MetadataMethod.WITH_DEFINITION_vqc-dataset
PhdDz/Paper_1_BioASQ_5_formated_metadata_MetadataMethod.WITH_DEFINITION_vsimilarity-dataset
Puitar619/DeepSlide-Domain-Paper
RealTimeData/arxiv_alltime
Viewer
• Updated • 54.5k • 826
• 10
RealTimeData/arxiv_july_week1_2023
Viewer
• Updated • 2.15k • 5
RealTimeData/arxiv_july_week2_2023
Viewer
• Updated • 3.07k • 4
RealTimeData/arxiv_june_2023
Viewer
• Updated • 10.4k • 4
RealTimeData/arxiv_latest
Viewer
• Updated • 1.16k • 64
• 4
Rurouni-II/arxiv-metadata-snapshot
Viewer
• Updated • 3.03M • 41
SIDD23102002/paper-to-pipeline-dataset
Viewer
• Updated • 100 • 4
Sj8287/Arxiv_clean_dataset
Updated • 10
Preview
• Updated • 5
• 1
SunJincheng/paper2_torque_cropped8
ThanhT04/arxiv_clean_dataset
Viewer
• Updated • 33.4k • 6
Vidushee/paper_voting_data_v2org_cleaned
Viewer
• Updated • 572 • 36
Preview
• Updated • 46
• 1
adambuttrick/arxiv-author-affiliations-latex-extract-inference
Viewer
• Updated • 994 • 39
Viewer
• Updated • 18.5k • 13
• 2
Viewer
• Updated • 2.31M • 61
ami-iit/paper_Mohamed_2023_humanoids_nonlinear-ft-calibration_dataset
Updated • 13
anna-bozhenko/louvre-paper-and-canvas-collection
Viewer
• Updated • 16.7k • 11
annamkiepura99/metadata-cited-papers-train
Viewer
• Updated • 4.38M • 5
annamkiepura99/section-cited-papers
Viewer
• Updated • 29.1k • 5
annamkiepura99/section-cited-papers-combined
Viewer
• Updated • 28.4k • 27
annamkiepura99/section-cited-papers-combined_v2
Viewer
• Updated • 29.1k • 15
annamkiepura99/section-cited-papers-metadata
Viewer
• Updated • 28.7k • 6
anonym-submit-paper/Orig-R1-Thoughts-correct
Viewer
• Updated • 6.26k • 11
anonymousatom/arxiv-metadata
Viewer
• Updated • 929k • 81
anumafzal94/arxiv-finetuning
Viewer
• Updated • 17.9k • 16
anumafzal94/arxiv_10k_finetuning
Viewer
• Updated • 9.7k • 27
applied-ai-018/peacock-data-public-datasets-idc-arxiv
assafvayner/arxiv-papers-by-subject
Preview
• Updated • 791
Viewer
• Updated • 1.55k • 121
• 2
bitmind/white_paper_holdout_3___FLUX.1-dev
Viewer
• Updated • 23.5k • 6
bitmind/white_paper_holdout_3___RealVisXL_V4.0
Viewer
• Updated • 23.5k • 6
bitmind/white_paper_holdout_4___FLUX.1-dev
Viewer
• Updated • 23.7k • 6
bitmind/white_paper_holdout_4___RealVisXL_V4.0
Viewer
• Updated • 23.7k • 7
blesspearl/arxiv-summarization
Viewer
• Updated • 83.8k • 14
bluuebunny/arxiv_dataset_by_year
Viewer
• Updated • 129k • 29
• 3
bluuebunny/arxiv_metadata
Preview
• Updated • 12
bluuebunny/arxiv_metadata_by_year
Viewer
• Updated • 2.55M • 31.6k
• 9
bluuebunny/arxiv_raw_dataset_by_year
Viewer
• Updated • 148k • 232
bobez999/arxiv-cs-papers-metadata-2024
Viewer
• Updated • 1.5k • 32
carlo711/learning_from_papers
Viewer
• Updated • 3.45k • 2
chakarapani/arxiv-collection
Viewer
• Updated • 4.28k • 2
• 1
chy626/teabag_papercup_wo_distractors
Viewer
• Updated • 29.9k • 7
ckadirt/paperbetas_repeat5
Preview
• Updated • 6
cometadata/202508-arxiv-cs-ai-cv-ro-lg-ma-cl-dois
Viewer
• Updated • 494k • 8
cometadata/202510-arxiv-cs-ai-cv-ro-lg-ma-cl-dois-missing-from-kaggle-arxiv-manifest
Viewer
• Updated • 341k • 5
cometadata/arxiv-author-affiliation-extraction-inference-inputs-metadata
Viewer
• Updated • 448k • 18
cometadata/arxiv-author-affiliation-parsing-sample-datacite-enrichment-format
Viewer
• Updated • 1.86M • 7
cometadata/arxiv-author-affiliations
Viewer
• Updated • 2.49k • 24
• 2
cometadata/arxiv-author-affiliations-GLM_4.5_Air-correct-rollouts
Viewer
• Updated • 5.15k • 6
cometadata/arxiv-author-affiliations-GLM_4.6-correct-rollouts
Viewer
• Updated • 5.5k • 9
cometadata/arxiv-author-affiliations-arcee_ai_maestro-correct-rollouts
Viewer
• Updated • 1.91k • 10
cometadata/arxiv-author-affiliations-claude_sonnet_4-correct-rollouts
Viewer
• Updated • 3.46k • 5
cometadata/arxiv-author-affiliations-gemini_2.5_flash-correct-rollouts
Viewer
• Updated • 5.3k • 6
cometadata/arxiv-author-affiliations-gpt_oss_120b-correct-rollouts
Viewer
• Updated • 4.56k • 10
cometadata/arxiv-author-affiliations-matched-ror-ids
Viewer
• Updated • 2.8M • 40
• 1
cometadata/arxiv-author-affiliations-qwen3_235b_a22b_thinking_2507-correct-rollouts
cometadata/arxiv-preprint-matching-matched-work-has-funding
Viewer
• Updated • 188k • 6
cometadata/arxiv-preprint-matching-results
Viewer
• Updated • 1.63M • 13
cometadata/arxiv-sample-affiliation-parsing-lora-Qwen3-8B-distil-GLM_4.5_Air-inference-results-enriched
Viewer
• Updated • 432k • 58
• 1
cometadata/arxiv-software-repo-links
Viewer
• Updated • 2.33M • 129
cometadata/arxiv-software-repo-links-datacite-enrichment-format
Viewer
• Updated • 597k • 22
cometadata/crossref-arxiv-citations
Viewer
• Updated • 1.92M • 63
cometadata/grobid-arxiv-author-affiliations-test-json-tei
Viewer
• Updated • 1.09k • 8
cometadata/grobid-arxiv-author-affiliations-test-raw-tei
Viewer
• Updated • 1.09k • 6
cometadata/test-jsonpath-arxiv-preprint-matching
Viewer
• Updated • 738k • 5
datajuicer/redpajama-arxiv-refined-by-data-juicer
Viewer
• Updated • 100 • 22
• 2
deep-learning-analytics/arxiv_small_nougat
Viewer
• Updated • 108 • 13
Viewer
• Updated • 34.4k • 19
Viewer
• Updated • 100k • 142
• 7
gorkaartola/SC-train-valid-test_SDG-Papers-GenAI-01
Viewer
• Updated • 1.41k • 3
huzimu/big-paper-third-section
Preview
• Updated • 6
• 1
Viewer
• Updated • 4 • 26
• 1
jackkuo/arXiv-metadata-oai-snapshot
Viewer
• Updated • 2.71M • 593
jassiyu/arXiv_formal_sections
Viewer
• Updated • 203k • 43
Viewer
• Updated • 500 • 5
Viewer
• Updated • 2.58M • 24
jkkawach/arxiv-metadata-10000
Viewer
• Updated • 10k • 4
leeloolee/arxiv-cs-metadata-enriched
Viewer
• Updated • 423k • 44
librarian-bots/arxiv-metadata-snapshot
Preview
• Updated • 22k
• 19
lostelf/arxiv_dense_sample
Viewer
• Updated • 127k • 121
• 1
lostelf/arxiv_sparse_sample
Viewer
• Updated • 127k • 21
Viewer
• Updated • 2.94M • 311
malteos/aspect-paper-metadata
Viewer
• Updated • 158k • 24
• 2
mehdielg/arxiv-sections-test
Viewer
• Updated • 6.44k • 5
mehdielg/arxiv-test-memsum-10sent-per-sect
Viewer
• Updated • 6.44k • 5
mehdielg/arxiv-test-memsum-20sent-per-sect-concat
Viewer
• Updated • 6.44k • 14
mehdielg/arxiv-test-sections-sents
Viewer
• Updated • 6.44k • 6
modrwkv/mod-rwkv-paper-civilcomments
Viewer
• Updated • 356k • 13
Viewer
• Updated • 23.1k • 6
nikchar/paper_test_evidence
Viewer
• Updated • 55.6k • 9
nimashoghi/arxiv-ml-papers-only-metadata
Viewer
• Updated • 458k • 287
• 1
paperswithbacktest/Stocks-Monthly-Bankruptcy
Viewer
• Updated • 1.72M • 286
paperswithbacktest/Stocks-Quarterly-Earnings
Viewer
• Updated • 351k • 585
• 2
paperswithbacktest/Stocks-Weekly-EarningSurprise
Viewer
• Updated • 2.19M • 285
pear2jam/papers_collection
permutans/arxiv-papers-by-subject
Preview
• Updated • 345k
• 23
phospho-app/hybrid_gripper_paper_to_blackbox_bboxes
Viewer
• Updated • 7.95k • 4
ppxscal/arxiv-metadata-oai-snapshot
Viewer
• Updated • 2.32M • 267
ppxscal/arxiv-metadata-oai-snapshot-t_a-tokenized
Viewer
• Updated • 2.32M • 166
prli/arxiv-full_train_chopped
Viewer
• Updated • 4k • 5
Viewer
• Updated • 2k • 4
pwc-archive/pwc-paper-redirects
Preview
• Updated • 205
• 1
real-jiakai/arxiver-with-category
Viewer
• Updated • 63.4k • 17
• 1
subham2507/charecterization_paper_dataset
Viewer
• Updated • 17.1k • 21
subham2507/refined_charecterization_paper_dataset
Viewer
• Updated • 17.1k • 6
tatung/hybrid-gripper-paper
Viewer
• Updated • 8 • 7
tatung/hybrid_gripper_paper_pickup
Viewer
• Updated • 8 • 4
tatung/hybrid_gripper_paper_to_blackbox
Viewer
• Updated • 70 • 4
thibble/paper2env-trajectories
Viewer
• Updated • 9.97k • 19
tricao1105/Paper_Collection
Viewer
• Updated • 1.16M • 150
userv4oo/cple_papers_cleaned
vishakhpk/ArxivDIGESTables-Clean
Viewer
• Updated • 100 • 13
weaviate/arXiv-AI-papers-multi-vector
Viewer
• Updated • 399 • 35
• 1
Viewer
• Updated • 50k • 4
wrapper228/arxiv_data_extended
Viewer
• Updated • 52.3k • 17
• 1
xueyong/pwc-paper-redirects
Preview
• Updated • 300
yuntian-deng/ak-paper-selection
Viewer
• Updated • 45.8k • 15
• 3
zelalt/great_papers_augmentation
Viewer
• Updated • 1.73k • 9
zelalt/great_papers_augmentation_v2
Viewer
• Updated • 2.33k • 11
AlekseyKorshuk/chai-real-and-synthetic
Viewer
• Updated • 154k • 153
Delta-Vector/Hydrus-HelpSteer2
Preview
• Updated • 5
• 1
Delta-Vector/Tauri-Helpsteer3-Edit
Viewer
• Updated • 3.27k • 3
GamesMais18/Wasteland_GuardiansDadosPC
SEACrowd/SEA_CulturalGround_OE_formatted_with_unifiedreward
Viewer
• Updated • 67.4k • 52
SEACrowd/thai_toxicity_tweet
Updated • 16
ai-safety-institute/realitytest
Viewer
• Updated • 4.24k • 223
• 2
0xSero/deepseek-v4-flash-reap-observations-v1
Preview
• Updated • 13
• 2
0xSero/glm47-reap-calibration-mix
Viewer
• Updated • 999 • 12
0xSero/glm47-reap-calibration-v2
Viewer
• Updated • 1.36k • 18
• 3
0xSero/glm47-reap-calibration-v3
Viewer
• Updated • 1.36k • 12
0xSero/glm5-layerwise-reap-observations
Preview
• Updated • 120
• 1
0xSero/kimi-k2.6-reap-observations-v1
Preview
• Updated • 77
• 1
0xSero/kimi-k2.6-reap-observations-v4
0xSero/minimax-m2.1-reap-observations
Viewer
• Updated • 24 • 38
• 1
0xSero/minimax-reap-observations
0xSero/qwen35-reap-layerwise-observations
Preview
• Updated • 587
• 2
0xSero/reap-calibration-data-v1
Viewer
• Updated • 44.1k • 33
• 3
AlekseyKorshuk/fineweb-1m-splitted
Viewer
• Updated • 8.27M • 6
• 1
BEE-spoke-data/FineMeme-100k
Viewer
• Updated • 100k • 16
BEE-spoke-data/Nvidia-DeepLearningExamples
Viewer
• Updated • 4.34k • 13
• 2
BEE-spoke-data/falcon-refinedweb-100k_en-long
Viewer
• Updated • 100k • 13
• 4
BEE-spoke-data/falcon-refinedweb-100k_en-xlong
Viewer
• Updated • 100k • 10
BEE-spoke-data/falcon-refinedweb-1M_en_medium
Viewer
• Updated • 1M • 13
• 2
BEE-spoke-data/fineweb-1000_64k
Viewer
• Updated • 2k • 11
• 4
BEE-spoke-data/fineweb-100_128k
Viewer
• Updated • 100 • 15
• 4
BEE-spoke-data/fineweb-100k_en-med
Viewer
• Updated • 100k • 154
• 4
BEE-spoke-data/fineweb-1M_en-med
Viewer
• Updated • 1M • 21
• 2
BEE-spoke-data/fineweb-1M_longish
Viewer
• Updated • 1M • 66
• 4
BEE-spoke-data/fineweb-cinema-100k
Viewer
• Updated • 100k • 23
BEE-spoke-data/fineweb-cryptid-5k
Viewer
• Updated • 5k • 9
BEE-spoke-data/fineweb-edu-10BT-mincols
Viewer
• Updated • 9.67M • 32
• 1
BEE-spoke-data/fineweb-literature-100k
Viewer
• Updated • 100k • 14
• 1
BEE-spoke-data/fineweb-synergy-20k
Viewer
• Updated • 20k • 6
BEE-spoke-data/gutenberg-en-v1-clean
Viewer
• Updated • 33.3k • 297
• 4
Chaser-cz/chaizTop100-clean
Viewer
• Updated • 24.4k • 2
Chaser-cz/chaizTop100-clean-SHAREGPT
Viewer
• Updated • 24.4k • 1
CyberHarem/abukuma_kantaicollection
Viewer
• Updated • 3.83k • 2
CyberHarem/asada_shino_swordartonline
Viewer
• Updated • 1.2k • 1
CyberHarem/asashimo_kantaicollection
Viewer
• Updated • 3.71k • 17
CyberHarem/asuna_bluearchive
Viewer
• Updated • 2.44k • 4
CyberHarem/beanstalk_arknights
Viewer
• Updated • 926 • 3
CyberHarem/constellation_azurlane
Viewer
• Updated • 129 • 3
CyberHarem/ebina_hina_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 933 • 73
CyberHarem/fujimiya_shihoko_otonarinotenshisamaniitsunomanikadameningennisareteitaken
Viewer
• Updated • 231 • 5
CyberHarem/hikigaya_komachi_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 1.68k • 193
CyberHarem/hiratsuka_shizuka_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 2.09k • 314
CyberHarem/inoue_orihime_bleach
Viewer
• Updated • 3.86k • 70
• 1
CyberHarem/isshiki_iroha_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 3.79k • 246
CyberHarem/ithea_sukasuka
Viewer
• Updated • 782 • 176
CyberHarem/jeanne_d_arc_alter_fgo
Viewer
• Updated • 2.37k • 28
CyberHarem/kawasaki_saki_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 929 • 92
CyberHarem/maya_kantaicollection
Viewer
• Updated • 3.98k • 4
CyberHarem/medea_fatestaynightufotable
Viewer
• Updated • 225 • 60
CyberHarem/miura_yumiko_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 1.36k • 148
CyberHarem/momoi_bluearchive
Viewer
• Updated • 2.33k • 545
• 1
CyberHarem/nonomi_bluearchive
Viewer
• Updated • 2.38k • 184
CyberHarem/reisa_bluearchive
Viewer
• Updated • 2.11k • 32
CyberHarem/shigure_kantaicollection
Viewer
• Updated • 4.01k • 330
• 1
CyberHarem/shokuhou_misaki_bluearchive
Viewer
• Updated • 2.29k • 4
CyberHarem/totsuka_saika_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 1.29k • 90
CyberHarem/yuigahama_yui_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 7.29k • 342
CyberHarem/yukino_yukinoshita_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 7.35k • 353
CyberHarem/yukinoshita_haruno_yahariorenoseishunlovecomewamachigatteiru
Viewer
• Updated • 1.55k • 181
Viewer
• Updated • 22 • 3
Dahoas/aimo-validation-aime
Viewer
• Updated • 90 • 41
Delta-Vector/Hydrus-AM-Thinking-IF
Viewer
• Updated • 45.3k • 8
• 1
Delta-Vector/Hydrus-AM-Thinking-IF-No-Think
Viewer
• Updated • 45.3k • 5
Delta-Vector/Hydrus-AM-Thinking-Multi-Turn
Viewer
• Updated • 79.1k • 4
Delta-Vector/Hydrus-Army-Inst
Viewer
• Updated • 5.66k • 3
Delta-Vector/Hydrus-CamelAI-Chem
Viewer
• Updated • 2.26k • 6
Delta-Vector/Hydrus-FeedSum-ShareGPT
Preview
• Updated • 4
• 1
Delta-Vector/Hydrus-GSM-8K-R1-V2
Viewer
• Updated • 2.39k • 3
• 1
Delta-Vector/Hydrus-IF-Mix-Ai2
Viewer
• Updated • 7.17k • 4
Delta-Vector/Hydrus-Manners
Viewer
• Updated • 4.16k • 5
Delta-Vector/Hydrus-Smoltalk-3-Subset-Demarkdownified
Viewer
• Updated • 92.1k • 12
Delta-Vector/Hydrus-Sonnet-Orca-V3
Viewer
• Updated • 28.2k • 3
Delta-Vector/Hydrus-Task-Judgement
Viewer
• Updated • 5.99k • 7
Delta-Vector/Orion-Alpindale-LN-ShareGPT
Viewer
• Updated • 6.85k • 24
• 1
Delta-Vector/Orion-Basket-Weaving-Filtered
Preview
• Updated • 2
• 2
Delta-Vector/Orion-BlueSky-10K-Complexity
Preview
• Updated • 6
• 2
Delta-Vector/Orion-Co-Writer-51K
Viewer
• Updated • 51.4k • 6
• 3
Delta-Vector/Orion-LN-V1-ShareGPT
Viewer
• Updated • 2.31k • 6
• 1
Delta-Vector/Orion-LN-V3-ShareGPT
Viewer
• Updated • 9.78k • 4
Delta-Vector/Orion-Misc-Sharegpt-Prefixed
Preview
• Updated • 4
• 1
Delta-Vector/Orion-OpenCAI-ShareGPT
Preview
• Updated • 9
• 2
Delta-Vector/Orion-PIPPA-Cleaned-V2
Viewer
• Updated • 2.52k • 24
• 2
Delta-Vector/Orion-Praxis-Co-Writer
Viewer
• Updated • 2.86k • 7
• 2
Delta-Vector/Orion-Shoujo-AI-Filtered-ShareGPT
Viewer
• Updated • 18.4k • 16
• 1
Delta-Vector/Orion-Sonnet-CharCard
Viewer
• Updated • 909 • 8
• 1
Delta-Vector/Orion-Storium-Prefixed-Clean
Preview
• Updated • 10
• 1
Delta-Vector/Orion-vanilla-backrooms-claude-sharegpt
Viewer
• Updated • 7.37k • 12
• 5
Delta-Vector/Tauri-Anti-Rep
Viewer
• Updated • 1.5k • 6
Delta-Vector/Tauri-Complex-JSON-Formatting
Viewer
• Updated • 8.05k • 29
• 1
Delta-Vector/Tauri-IF-AM-Thinking
Viewer
• Updated • 90.7k • 10
Delta-Vector/Tauri-MediFlow
Viewer
• Updated • 76.5k • 12
Delta-Vector/Tauri-RL-Markdown-System
Viewer
• Updated • 128 • 115
Delta-Vector/Tauri-RL-Styles
Viewer
• Updated • 128 • 114
Delta-Vector/Tauri-RL-Styles-V2
Viewer
• Updated • 128 • 9
Delta-Vector/Ursa-Armored-Core-6-Lore
Viewer
• Updated • 166 • 35
Delta-Vector/Ursa-Armored-Core-Lore-Kimi
Viewer
• Updated • 286 • 10
Delta-Vector/Ursa-Asstr-V2-18k
Preview
• Updated • 13
• 1
Delta-Vector/Ursa-Completion-LIT
Viewer
• Updated • 122k • 27
Delta-Vector/Ursa-DCLM-Smol
Viewer
• Updated • 9k • 4
Delta-Vector/Ursa-Erebus-16K
Viewer
• Updated • 16.2k • 8
• 2
Delta-Vector/Ursa-Falling-through-the-world
Preview
• Updated • 2
• 1
Delta-Vector/Ursa-Fine-Web-32K
Viewer
• Updated • 32.8k • 4
Delta-Vector/Ursa-Fujin-Cleaned
Viewer
• Updated • 223 • 3
Delta-Vector/Ursa-HoneyFeed
Preview
• Updated • 2
• 1
Delta-Vector/Ursa-LIT-Filter
Viewer
• Updated • 32.5k • 6
Delta-Vector/Ursa-Orion-EA-Comp-Filtered
Preview
• Updated • 3
• 2
Delta-Vector/Ursa-Scribblehub-7k
Preview
• Updated • 1
Delta-Vector/Ursa-ShortStories-Allura-Filtered
Viewer
• Updated • 23.7k • 6
Viewer
• Updated • 253 • 20
Emm9625/Sonnet3.5-SlimOrcaDedupCleaned
Viewer
• Updated • 181k • 5
Emm9625/Sonnet3.5-SlimOrcaDedupCleaned-en
Viewer
• Updated • 137k • 5
Viewer
• Updated • 86.6k • 5
FireIceDancer2/AI-Waifu-DIDcord-Datasets-Collection
Updated • 8
• 4
Fischerboot/alpaca-reflection2
Viewer
• Updated • 6.34k • 7
GamesMais18/AWifeAndMotherDados
GamesMais18/ConfessionsofaSassyGirlDados
GamesMais18/DreamlandDados
GamesMais18/LeavingDNADados
GamesMais18/LiveInCorruptionDados
GamesMais18/MyBestDealDados
GamesMais18/MyBimboDreamDados
GamesMais18/MyEarlyLifeDados
GamesMais18/Serenity_MeadowsDados
GamesMais18/SixteenYearsLaterDados
GamesMais18/TheAwakeningDados
GamesMais18/TheLibidoEnigmaDados
GamesMais18/ThePerfectParadiseDados
GamesMais18/TheSevenRealmsDados
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
Viewer
• Updated • 181k • 185
• 96
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned-20k
Viewer
• Updated • 20k • 96
• 10
Guilherme34/Dreamdatasetfromgallahat
Viewer
• Updated • 336 • 7
• 1
Guilherme34/Realistic-world-dataset
Viewer
• Updated • 280 • 7
• 1
Guilherme34/SamanthaDataset-rolesformat
Viewer
• Updated • 5.89k • 3
Guilherme34/chan-shitpost-3-cleaned
Viewer
• Updated • 4.85M • 8
• 2
HayatoHongo/fineweb-edu-100b-shuffle
Viewer
• Updated • 352 • 244
• 2
HuggingFaceTB/web_under_line_mean_100
Viewer
• Updated • 1.16k • 5
ICEPVP8977/Debian_Linux_Basics
Viewer
• Updated • 290 • 7
• 2
Updated • 31
• 5
IDEA-CCNL/Ziya-Finetune-Small
Viewer
• Updated • 1.15k • 43
• 5
Viewer
• Updated • 24.6k • 280
• 5
IlyaGusev/lmsys_clean_ru_queries
Viewer
• Updated • 1.02k • 27
• 2
IlyaGusev/ru_sharegpt_cleaned
Updated • 716
• 14
Just-S-Rod/sexting-training-ready
Viewer
• Updated • 1.27k • 23
• 2
Viewer
• Updated • 7.48k • 5
Viewer
• Updated • 163 • 8
Viewer
• Updated • 1 • 17
Viewer
• Updated • 59 • 4
Preview
• Updated • 20
Viewer
• Updated • 270 • 120
Locutusque/FalseReject-sharegpt
Viewer
• Updated • 14.6k • 6
• 1
Locutusque/Wild-GPT-4-Turbo-Cleaned
Viewer
• Updated • 76.1k • 4
• 1
Locutusque/Wild-GPT-4-Turbo-Cleaned-EN
Viewer
• Updated • 28.6k • 15
• 1
Locutusque/bagel-clean-v0.3-shuffled
Viewer
• Updated • 592k • 11
• 1
Locutusque/lordx64-claude-opus-4.7-max-cleaned
Viewer
• Updated • 4.81k • 15
• 1
MikuMasterRace/Baby_Leash_-_ABDL_v1
MikuMasterRace/Omutopia_Pastel_Puffies_diaper_-_ABDL_v1
NewEden-Forge/Basket-weaving-unfiltered
Viewer
• Updated • 28.8k • 3
NewEden-Forge/Kanye-Bear-Lora-Data
Viewer
• Updated • 12 • 15
OALL/details_Delta-Vector__Odin-9B
Viewer
• Updated • 146k • 46
OALL/details_GrandscaleAI__GrandScaleAI_V1.0_v2
Viewer
• Updated • 183k • 79
OALL/details_terrycraddock__Reflection-Llama-3.1-8B
Viewer
• Updated • 146k • 23
Viewer
• Updated • 780 • 21
OmniAICreator/Qiita-1.07M
Viewer
• Updated • 1.07M • 20
• 1
OmniAICreator/RoyalRoad-1.61M
Viewer
• Updated • 1.61M • 90
• 5
OmniAICreator/pixiv-dic-202506
Viewer
• Updated • 633k • 24
• 1
Open-Orca/slimorca-deduped-cleaned-corrected
Viewer
• Updated • 182k • 91
• 22
OpenTransformer/web-crawl-clean-v2
Viewer
• Updated • 36.9k • 771
PJMixers/0-hero_Matter-0.1-final_set_cleaned-ShareGPT
Viewer
• Updated • 2.25M • 43
PJMixers/OpenLeecher_Teatime_all_logs_longest-ShareGPT
Viewer
• Updated • 11k • 5
PJMixers/RyokoAI_Honeyfeed3600-Cleanish
Viewer
• Updated • 38.7k • 7
PJMixers/grimulkan_bluemoon_Karen_cleaned-carded-formatted
Viewer
• Updated • 1.37k • 25
QuixiAI/ultrainteract_trajectories_sharegpt
Viewer
• Updated • 122k • 56
• 7
SEACrowd/SEA_CulturalGround_MCQs
Viewer
• Updated • 34k • 27
SEACrowd/SEA_CulturalGround_OE
Viewer
• Updated • 67.4k • 28
Updated • 26
Updated • 11
Updated • 25
Updated • 24
Updated • 15
• 2
Updated • 11
Updated • 15
Updated • 30
Updated • 31
• 2
Updated • 9
• 2
SEACrowd/id_coreference_resolution
Updated • 39
Updated • 14
Updated • 46
Updated • 21
• 4
SEACrowd/indo_general_mt_en_id
Updated • 28
• 1
Updated • 27
• 1
Updated • 19
Updated • 25
Updated • 23
• 1
Updated • 48
Updated • 34
SEACrowd/lio_and_central_flores
Updated • 13
Updated • 85
Updated • 71
Updated • 15
SEACrowd/mammoth_vl_sea_shard_3
Viewer
• Updated • 544k • 28.3k
SEACrowd/mammoth_vl_sea_shard_4
Viewer
• Updated • 548k • 197
SEACrowd/mammoth_vl_sea_shard_5
Viewer
• Updated • 235k • 27.1k
Updated • 16
Updated • 38
Updated • 15
Updated • 36
Updated • 10
Updated • 29
• 1
Updated • 29
Updated • 23
Updated • 10
Updated • 35
Viewer
• Updated • 1.27M • 818
• 5
SEACrowd/sea-vl_crowdsourcing
Viewer
• Updated • 7.01k • 170
• 2
Updated • 14
Updated • 19
Updated • 19
Updated • 21
Updated • 30
Updated • 11
Updated • 13
SEACrowd/worldcuisines_format
Viewer
• Updated • 792k • 19
Updated • 43
Updated • 19
• 1
Viewer
• Updated • 1.2k • 88
Salesforce/fineweb_deduplicated
Viewer
• Updated • 6.43B • 1.03k
• 39
Viewer
• Updated • 100 • 4
Viewer
• Updated • 2 • 179
• 7
Viewer
• Updated • 500 • 3.83k
• 17
Viewer
• Updated • 266 • 580
• 1
Viewer
• Updated • 1.06k • 62
• 3
ScaleAI/ROK-FORTRESS_public
Viewer
• Updated • 791 • 34
Viewer
• Updated • 405 • 64
• 2
Viewer
• Updated • 1.46k • 85
• 2
Viewer
• Updated • 16 • 45
Viewer
• Updated • 500 • 1.15k
• 5
Viewer
• Updated • 1.21k • 442
• 2
Viewer
• Updated • 285 • 647
• 6
Viewer
• Updated • 1 • 458
• 28
Viewer
• Updated • 43 • 18
SeppeV/I_want_knock_knock_jokes_training_dataset
Viewer
• Updated • 12.8k • 4
SeppeV/JokeTailor_big_set_annotated
Viewer
• Updated • 8.52k • 5
Viewer
• Updated • 1k • 9
SeppeV/filtered-1M-reddit-jokes
Viewer
• Updated • 579k • 14
SeppeV/jester_jokes_extracted
Viewer
• Updated • 140 • 4
SeppeV/joke_generation_of_mistral_base_t0p4
Viewer
• Updated • 125 • 9
SeppeV/joke_generation_of_mistral_bm
Viewer
• Updated • 125 • 6
SeppeV/joke_generation_of_mistral_bm_jo
Viewer
• Updated • 125 • 4
SeppeV/jokes_by_topic_from_scoutlife
Viewer
• Updated • 5.02k • 5
SeppeV/most_used_topics_scoutlife_jokes
Viewer
• Updated • 284 • 3
SeppeV/rated_jokes_dataset_from_jester
Viewer
• Updated • 1.76M • 251
• 2
SeppeV/results_joke_gen_mistral_base_judge_bert340_1000
Viewer
• Updated • 125 • 4
SeppeV/results_joke_gen_mistral_base_pe_judge_bert340_1000
Viewer
• Updated • 125 • 6
SeppeV/results_joke_gen_mistral_base_pe_judge_bert340_1000_random_user
Viewer
• Updated • 125 • 4
SeppeV/results_joke_gen_mistral_bm_jo_ensemble_test
Viewer
• Updated • 125 • 4
SeppeV/results_joke_generation_of_mistral_bm_jo_multiclass_test
Viewer
• Updated • 125 • 3
SeppeV/results_of_bert_340M_ft_1000_pref
Viewer
• Updated • 1k • 4
SeppeV/scoutlife_db_annotated
Viewer
• Updated • 5.02k • 2
SocialGrep/one-million-reddit-confessions
Viewer
• Updated • 1M • 201
• 13
SocialGrep/one-year-of-r-india
Viewer
• Updated • 1.56M • 166
• 2
SocialGrep/one-year-of-tsla-on-reddit
Updated • 7
• 2
Steveeeeeeen/argmax_earnings22
Viewer
• Updated • 2.74k • 5
Steveeeeeeen/earnings22_long_form
Steveeeeeeen/edacc_test_clean
Viewer
• Updated • 8.81k • 131
11-47/Ancient_Civilaztion_Historian_25k
Viewer
• Updated • 25k • 17
• 1
11-47/Ancient_Civilization_25k
Viewer
• Updated • 25k • 21
• 1
11-47/Genesis_FineTune_Core_25k
Preview
• Updated • 29
• 1
WithinUsAI/Grok4.4_heavy_max_distill_god_seed_25k
Viewer
• Updated • 25.7k • 130
• 5
11-47/TRM_Genesis_RealWorld_15k
Viewer
• Updated • 15k • 33
• 1
11-47/TRM_Genesis_RealWorld_1k
Viewer
• Updated • 1k • 14
11-47/TRM_Genesis_RealWorld_40k
Viewer
• Updated • 40k • 31
• 1
11-47/TRM_Genesis_RealWorld_5k
Viewer
• Updated • 5k • 37
• 1
11-47/hf_real_training_implementations_250.jsonl
Viewer
• Updated • 250 • 7
a-m-team/AM-DeepSeek-Distilled-40M
Viewer
• Updated • 11.5M • 1.66k
• 56
a-m-team/AM-DeepSeek-R1-0528-Distilled
Preview
• Updated • 350
• 102
a-m-team/AM-DeepSeek-R1-Distilled-1.4M
Preview
• Updated • 2.45k
• 181
a-m-team/AM-Qwen3-Distilled
Preview
• Updated • 415
• 25
a-m-team/AM-Thinking-v1-Distilled
Preview
• Updated • 905
• 60
a-m-team/AM-Thinking-v1-RL-Dataset
Viewer
• Updated • 54.8k • 92
• 19
adamo1139/4chan_archive_ShareGPT_fixed_newlines_unfiltered
Viewer
• Updated • 2.55M • 16
• 4
adamo1139/4chan_archive_ShareGPT_no_newlines_low_quality
Viewer
• Updated • 296k • 19
• 1
adamo1139/4chan_archive_ShareGPT_only5
Viewer
• Updated • 532k • 10
adamo1139/4chan_archive_ShareGPT_with_rating_and_comments
Viewer
• Updated • 1.82M • 7
Viewer
• Updated • 25.6k • 8
• 1
Viewer
• Updated • 25.4k • 17
• 5
adamo1139/AEZAKMI_v2_sharegpt
Viewer
• Updated • 25.4k • 7
Viewer
• Updated • 56.8k • 6
Viewer
• Updated • 53.4k • 13
Viewer
• Updated • 51.8k • 5
Viewer
• Updated • 57k • 7
Viewer
• Updated • 38.1k • 4
Viewer
• Updated • 33.7k • 12
Viewer
• Updated • 40.7k • 12
Viewer
• Updated • 47.2k • 13
adamo1139/AEZAKMI_v4_0_DRAFT
Viewer
• Updated • 14.2k • 6
adamo1139/Alpaca-DADA-ShareGPT
Viewer
• Updated • 29.7k • 3
Viewer
• Updated • 30k • 3
adamo1139/Fal7acy_4chan_archive_JSONL
Viewer
• Updated • 100k • 8
adamo1139/Fal7acy_4chan_archive_ShareGPT
Viewer
• Updated • 2.59M • 14
• 1
Viewer
• Updated • 121k • 13
Viewer
• Updated • 117k • 6
Viewer
• Updated • 107k • 10
Viewer
• Updated • 88.6k • 22
adamo1139/HESOYAM_v0.3_splits
Viewer
• Updated • 88.6k • 10
Viewer
• Updated • 54.9k • 14
Viewer
• Updated • 63.2M • 52
adamo1139/HPLT3_tokenized_split
Updated • 13
adamo1139/Magnum_ShareGPT_JSONL
Viewer
• Updated • 93.5k • 35
adamo1139/PS_AD_Office365_02
Viewer
• Updated • 2.28k • 7
adamo1139/PS_AD_Office365_03
Viewer
• Updated • 5.52k • 6
adamo1139/PS_AD_Office365_04_ShareGPT
Viewer
• Updated • 5.51k • 5
adamo1139/PS_AD_Office365_05_ShareGPT
Viewer
• Updated • 6.56k • 16
adamo1139/PS_AD_Office_01
Viewer
• Updated • 1.08k • 10
• 1
adamo1139/SlimPajama-6B-JSONL
Viewer
• Updated • 1.15M • 8
• 1
adamo1139/Sydney_LLaVA_0610
Preview
• Updated • 7
• 1
adamo1139/Sydney_LLaVA_1210
Preview
• Updated • 5
Viewer
• Updated • 5.69k • 6
Viewer
• Updated • 4.14k • 4
adamo1139/TURTLE_v2_rated
Viewer
• Updated • 4.14k • 4
Viewer
• Updated • 2.23k • 5
Viewer
• Updated • 1.84k • 3
adamo1139/apt4_tokenized_100k
Viewer
• Updated • 9.69M • 7
adamo1139/finepdfs_tokenized_split_apt4_v2
Updated • 45
Viewer
• Updated • 152M • 6
adamo1139/fineweb2-pol-filtered-out
adamo1139/flashinfer-0.1.6_cu124torch2.4-cp311-cp311-linux_x86_64
adamo1139/hesoyam_03_rated1_validjson3
Viewer
• Updated • 87.4k • 3
adamo1139/hesoyam_03_rated1_validjson_allclassified
Viewer
• Updated • 87.3k • 5
adamo1139/hesoyam_03_rated1_validjson_allclassified2
Viewer
• Updated • 87.3k • 6
adamo1139/magpie-ultra-v0.1-sharegpt-jsonl
Viewer
• Updated • 50k • 9
Updated • 276
• 16
Viewer
• Updated • 8.27k • 11
• 6
Viewer
• Updated • 14.7k • 29
• 1
adamo1139/rawrr_v2-1-stage1
Viewer
• Updated • 10.9k • 3
adamo1139/rawrr_v2-1-stage2
Viewer
• Updated • 4.91k • 13
adamo1139/rawrr_v2-2_stage1
Viewer
• Updated • 9.03k • 3
adamo1139/reddit_subreddits_sharegpt
Viewer
• Updated • 2.71M • 8
• 2
Viewer
• Updated • 7.36k • 8
adamo1139/szypulka_tokenized_apt4
adamo1139/szypulka_tokenized_apt4_merged
Updated • 11
adamo1139/szypulka_tokenized_eurollm
Updated • 143
adamo1139/tokenized_ds_stats_apt4
Viewer
• Updated • 56 • 4
adamo1139/tokenized_ds_stats_eurollm
Preview
• Updated • 5
ajibawa-2023/Children-Stories-Collection
Viewer
• Updated • 897k • 297
• 58
ajibawa-2023/General-Stories-Collection
Viewer
• Updated • 1.07M • 67
• 41
ajibawa-2023/Software-Architectural-Frameworks
Viewer
• Updated • 1.26k • 10
• 10
ajibawa-2023/Software-Architecture
Preview
• Updated • 377
• 32
alexandreteles/appellatio_fraternitatis_rosae_crucis_multiturn
Viewer
• Updated • 119 • 26
• 1
allura-forge/m2v_c4-featurized-granite-125m
Viewer
• Updated • 171k • 77
Viewer
• Updated • 274k • 5
Viewer
• Updated • 216k • 152
amaye15/amazon_berkeley_objects
Viewer
• Updated • 398k • 27
• 2
Viewer
• Updated • 12.8k • 4
amaye15/receipts-cropped-super-resolution
Viewer
• Updated • 1 • 3
ardauzunoglu/fineweb_random200m
Preview
• Updated • 196k • 156
ardauzunoglu/fineweb_random200m_subsample20m
Viewer
• Updated • 20.8k • 5
argilla/llama-2-banking-fine-tune
Viewer
• Updated • 100 • 30
• 13
Viewer
• Updated • 72.6k • 12
blythet/fineweb-edu-top1m
Viewer
• Updated • 1M • 38
breadlicker45/100k-websites
Updated • 25
• 2
breadlicker45/1m-YA-dataset
Viewer
• Updated • 1.46M • 14
breadlicker45/2Calorie-dataset
Viewer
• Updated • 562 • 5
breadlicker45/Calorie-dataset
Viewer
• Updated • 4.56k • 9
• 2
breadlicker45/ai-hottakes-v2
Viewer
• Updated • 1.44k • 10
Viewer
• Updated • 116k • 5
breadlicker45/bluesky-firehose
Viewer
• Updated • 720k • 15
Viewer
• Updated • 2.56k • 4
breadlicker45/bread-fan-fics
breadlicker45/bread-midi-dataset
Updated • 41
• 39
breadlicker45/eia-csv-data
Viewer
• Updated • 17.4k • 15
Updated • 2
• 1
breadlicker45/giga-chad-comments
Viewer
• Updated • 100 • 4
• 1
breadlicker45/hot-take-v2-en
Viewer
• Updated • 1.71k • 11
Viewer
• Updated • 98 • 10
Viewer
• Updated • 36.7k • 4
Viewer
• Updated • 127k • 8
Viewer
• Updated • 55.6k • 4
• 1
breadlicker45/midi-hex-data
Viewer
• Updated • 2.61k • 5
breadlicker45/musenet-chunk
Viewer
• Updated • 247k • 5
• 1
breadlicker45/muti-class-gender-bluesky-test
Viewer
• Updated • 47.3k • 4
Viewer
• Updated • 1.48M • 8
breadlicker45/this-is-just-a-file
Viewer
• Updated • 1 • 4
breadlicker45/toast-midi-dataset
Preview
• Updated • 21
• 11
breadlicker45/token-model-train
Preview
• Updated • 2
breadlicker45/token-train-vA
Viewer
• Updated • 69.8k • 12
breadlicker45/youtube-comments
Viewer
• Updated • 270k • 12
breadlicker45/youtube-comments-180k
Viewer
• Updated • 187k • 23
• 2
breadlicker45/youtube-comments-v2
Viewer
• Updated • 376k • 8
• 1
Viewer
• Updated • 120 • 45
• 1
Viewer
• Updated • 33.5k • 6
• 6
communityai/Tensoic___gpt-teacher_kn
Viewer
• Updated • 18.2k • 3
communityai/yahma___alpaca-cleaned
Viewer
• Updated • 51.8k • 3
Viewer
• Updated • 34.1k • 6
Viewer
• Updated • 18.1k • 10
• 1
Viewer
• Updated • 4.21k • 4
cudecanarim/beauty-angels
Viewer
• Updated • 48.6k • 5
cudecanarim/bikini-pleasure
Viewer
• Updated • 9.85k • 5
cudecanarim/club-sweethearts
Viewer
• Updated • 17.1k • 4
cudecanarim/creampie-angels
Viewer
• Updated • 10.1k • 7
• 1
Viewer
• Updated • 6.24k • 6
cudecanarim/erotic-beauty
cudecanarim/heavy-on-hotties
Viewer
• Updated • 21.3k • 6
daaxila/twitter-prince12575-2026.02.28-2027576258188742902-MhUXg7tAQs6XDMMH-part1
Viewer
• Updated • 1 • 18
• 1
daaxila/twitter-xuxianer22-2023.05.16-1658297133852938240-Cy32kf0iC7wyvXEc-part1
Viewer
• Updated • 1 • 6
daaxila/xdownloader.com__eac23-part1
Viewer
• Updated • 1 • 6
daaxila/xdownloader.com_ezhieeaw_d64a8-part1
Viewer
• Updated • 1 • 9
daaxila/xdownloader.com_ezhieeaw_d64a8-part2
darkknight25/KALI_LINUX_TOOLKIT_DATASET
Viewer
• Updated • 790 • 89
• 7
darkknight25/LOTL_APT_Red_Team_Dataset
Updated • 50
• 2
darkknight25/Linux_Terminal_Commands_Dataset
Updated • 48
• 3
darkknight25/RED_team_tactics_dataset
Viewer
• Updated • 1k • 141
• 4
darkknight25/linux_window_priv_esic_dataset
Viewer
• Updated • 400 • 37
darkknight25/nosql_injection_dataset
Viewer
• Updated • 660 • 61
• 2
darkknight25/software_vulnerabilities_dataset
Updated • 175
• 3
davanstrien/doab-metadata-extraction
Viewer
• Updated • 8.09k • 257
davanstrien/would-you-read-it
Viewer
• Updated • 268 • 8
• 4
derek-thomas/dataset-creator-askreddit
Viewer
• Updated • 9.85M • 852
• 1
derek-thomas/dataset-creator-reddit-amitheasshole
Viewer
• Updated • 2.57k • 61
derek-thomas/dataset-creator-reddit-bestofredditorupdates
Viewer
• Updated • 11.6k • 18
• 1
dim/linux_man_pages_tldr_summarized
Viewer
• Updated • 481 • 5
• 2
dim/nfs_pix2pix_1920_1080_v6_2x_flux_klein_4B_lora
Viewer
• Updated • 49.4k • 124
dim/openaccess-ai-collective-oo-gpt4-filtered
Viewer
• Updated • 819k • 26
dim/opensubtitles_clean_v1
Viewer
• Updated • 890k • 9
dinushiTJ/aerial_real_only
Viewer
• Updated • 13k • 22
dinushiTJ/aerial_real_plus_0010
Viewer
• Updated • 13.9k • 24
dinushiTJ/aerial_real_plus_0025
Viewer
• Updated • 15.2k • 25
dinushiTJ/aerial_real_plus_0050
Viewer
• Updated • 17.3k • 19
dinushiTJ/aerial_real_plus_0075
Viewer
• Updated • 19.5k • 32
dinushiTJ/aerial_real_plus_0100
Viewer
• Updated • 21.7k • 30
dinushiTJ/aerial_real_plus_0125
Viewer
• Updated • 23.8k • 47
dinushiTJ/aerial_real_plus_0150
Viewer
• Updated • 26k • 57
diwank/fineweb-edu-2024-10-1M_tokenized-qwen2_1024
Viewer
• Updated • 1.03M • 188
diwank/slimorca-autoj-corrected
Viewer
• Updated • 359k • 28
dmayhem93/self-critiquing-base-selected-900
Viewer
• Updated • 900 • 4
dmayhem93/self-critiquing-critique-and-refine
Viewer
• Updated • 39.2k • 9
• 2
dmayhem93/self-critiquing-critique-and-refine-test
Viewer
• Updated • 5.12k • 5
dmayhem93/self-critiquing-critique-and-refine-train
Viewer
• Updated • 34.1k • 2
dmayhem93/self-critiquing-refine
Viewer
• Updated • 39.2k • 14
• 1
dmayhem93/self-critiquing-refine-continuations
Viewer
• Updated • 5.12k • 4
dmayhem93/self-critiquing-refine-test
Viewer
• Updated • 5.12k • 8
dmayhem93/self-critiquing-refine-train
Viewer
• Updated • 34.1k • 2
ebowwa/bay-area-landscaping-blog-posts
Viewer
• Updated • 82 • 6
Viewer
• Updated • 32 • 3
• 1
emozilla/Long-Data-Collections-Fine-Tune
Viewer
• Updated • 98.6k • 359
• 4
emozilla/c4-validation.00000-of-00008
Viewer
• Updated • 45.6k • 5
emozilla/dolma-v1_7-cc_en_head
Viewer
• Updated • 475M • 401
• 1
emozilla/fineweb-10bt-tokenized-datatrove-llama2
Updated • 62
• 3
emozilla/fineweb-350bt-tokenized-datatrove-llama2
Viewer
• Updated • 375 • 24
• 3
Viewer
• Updated • 100k • 112
• 7
fishytorts/new_dataset_test
Viewer
• Updated • 6 • 3
freQuensy23/ru-alpaca-cleaned
Viewer
• Updated • 27k • 31
• 11
freQuensy23/turbo-alpaca-cleaned
Viewer
• Updated • 28.1k • 5
hac541309/open-lid-dataset
Viewer
• Updated • 121M • 1.28k
• 4
hamishivi/virtuoussy_multi_subject_rlvr
Viewer
• Updated • 579k • 14
hamishivi/virtuoussy_multi_subject_rlvr_llm_judge
Viewer
• Updated • 579k • 7
harpreetsahota/marvel-bobbleheads
Viewer
• Updated • 151 • 14
• 2
harpreetsahota/vectordb_trend_analysis
Viewer
• Updated • 26 • 2
ibaha786/chunk-1-50k-ready
Viewer
• Updated • 50k • 4
inclusionAI/AReaL-RL-Data
Preview
• Updated • 72
• 8
inclusionAI/AReaL-boba-Data
Preview
• Updated • 58
• 26
inclusionAI/AReaL-tau2-data
Preview
• Updated • 474
• 13
insanemyrr/mitochondria_cropped_with_markup
Viewer
• Updated • 2.05k • 4
insanemyrr/test-diploma-lucchi-cropped-new-mix-big
Viewer
• Updated • 4.1k • 4
insanemyrr/test-diploma-lucchi-cropped-new-mix-biggest
Viewer
• Updated • 7.92k • 4
interstellarninja/json-mode-singleturn
Viewer
• Updated • 1.24k • 186
• 1
interstellarninja/json-mode-verifiable
Viewer
• Updated • 7.64k • 104
• 2
interstellarninja/json-schema-store-rl
Viewer
• Updated • 950 • 11
Viewer
• Updated • 1.03k • 4
jayavibhav/flux_style_monogatari
jtatman/civil_comments_hatebert
Viewer
• Updated • 451k • 5
jtatman/godot_rl_HovercraftRacing
Viewer
• Updated • 1 • 19
Viewer
• Updated • 1.66M • 3
Viewer
• Updated • 2.33M • 4
jtatman/license-plate-finetuning
Viewer
• Updated • 7.94k • 21
justinphan3110/circuit_breakers_train
Viewer
• Updated • 4.99k • 132
• 2
Viewer
• Updated • 337k • 63
kreasof-ai/GLM-Kimi-OpenThoughts-HunterAlpha-Filtered
Viewer
• Updated • 1.23M • 319
kreasof-ai/KAC-QuantFin-1M
Viewer
• Updated • 1M • 82
• 1
Viewer
• Updated • 93.1M • 894
• 3
kreasof-ai/SEA-Dataset-Lite
Viewer
• Updated • 450k • 38
• 1
Viewer
• Updated • 2.1M • 94
Viewer
• Updated • 87.9k • 36
• 1
kreasof-ai/flores200-eng-bem
Viewer
• Updated • 2.01k • 6
Updated • 3
• 1
lianghsun/finepdf-filtered-zhtw
Viewer
• Updated • 8.37M • 7
luistakahashi/ts-classifier-pear-1
luistakahashi/ts-classifier-pear-2
Viewer
• Updated • 1.21k • 4
luistakahashi/ts-classifier-pear-3
Viewer
• Updated • 1.84k • 3
luistakahashi/ts-classifier-pear-4
Viewer
• Updated • 2.73k • 11
luistakahashi/ts-classifier-pear-5
Viewer
• Updated • 2.4k • 5
Viewer
• Updated • 4.89B • 1.28M
• 152
m-a-p/FineFineWeb-bert-seeddata
Viewer
• Updated • 8.8M • 239
• 2
Viewer
• Updated • 224M • 12.1k
• 4
Viewer
• Updated • 1.41M • 282
• 5
m-a-p/FineFineWeb-validation
Viewer
• Updated • 35.6k • 409
• 1
makda-tsegazeab/lfm2-1.2b-blindspots
Viewer
• Updated • 10 • 5
manishiitg/berkeley-nest-Nectar
Viewer
• Updated • 180k • 3
mateowilliam/deepseek-v4-flash-reap-observations-v1
Preview
• Updated • 21
mateowilliam/gemma-moe-reap
mateowilliam/glm5-layerwise-reap-observations
Preview
• Updated • 92
mateowilliam/kimi-k2.6-reap-observations-v1
Preview
• Updated • 32
mateowilliam/qwen3.6-35b-a3b-reap-observations
Preview
• Updated • 8
meandyou200175/alobacsi_1neg
Viewer
• Updated • 40.1k • 4
meandyou200175/alobacsi_mutil_neg
Viewer
• Updated • 40.1k • 4
Viewer
• Updated • 400 • 2
Viewer
• Updated • 955 • 4
meandyou200175/area_white
Viewer
• Updated • 468 • 3
meandyou200175/bert_vits2
Viewer
• Updated • 2 • 5
meandyou200175/data_16neg
Viewer
• Updated • 43.8k • 32
meandyou200175/data_doan_neg
Viewer
• Updated • 8.49k • 12
meandyou200175/data_split_csv
Viewer
• Updated • 54.8k • 6
meandyou200175/data_split_jsonl_csv
Viewer
• Updated • 54.8k • 5
meandyou200175/data_truyen
Viewer
• Updated • 5.57k • 8
meandyou200175/data_truyen_ACT
Viewer
• Updated • 11.6k • 3
meandyou200175/data_truyen_crop
Preview
• Updated • 2
meandyou200175/dataset_full_fixed
Viewer
• Updated • 54.8k • 10
meandyou200175/dataset_full_fixed_jsonl
Viewer
• Updated • 54.8k • 5
Viewer
• Updated • 8.49k • 3
Viewer
• Updated • 10k • 2
meandyou200175/full_data_15_neg
Viewer
• Updated • 54.8k • 4
Viewer
• Updated • 8.15k • 6
Viewer
• Updated • 1.71k • 10
meandyou200175/merde_data
Viewer
• Updated • 54.8k • 4
meandyou200175/model_face
Updated • 40
Preview
• Updated • 132
Updated • 456
meandyou200175/new1000dts
Preview
• Updated • 2
Preview
• Updated • 49
meandyou200175/omn_finetune
Viewer
• Updated • 2 • 16
meandyou200175/omn_finetune_v2
Viewer
• Updated • 2 • 5
meandyou200175/omn_finetune_v3
Viewer
• Updated • 1 • 10
Viewer
• Updated • 92 • 6
Updated • 133
meandyou200175/super_ngon
Viewer
• Updated • 64k • 6
Updated • 90
meandyou200175/the_last_ipa
meandyou200175/thuoc_multil_neg
Viewer
• Updated • 6.19k • 5
meandyou200175/topic_dataset
Preview
• Updated • 3
Preview
• Updated • 1
Viewer
• Updated • 109k • 9
meandyou200175/vn_topic_v2
Viewer
• Updated • 110k • 16
meandyou200175/vn_topic_v3
Viewer
• Updated • 109k • 17
meandyou200175/vn_topic_v4
Viewer
• Updated • 24.1k • 5
meandyou200175/vn_topic_v4_hashtag
Viewer
• Updated • 78.5k • 9
meandyou200175/vn_topic_v5
Viewer
• Updated • 43.3k • 9
meandyou200175/vn_topic_v5_user
Viewer
• Updated • 14.8k • 6
meandyou200175/yt_dataset
mehuldamani/lean-demos-v1
Viewer
• Updated • 6k • 100
mehuldamani/lean-latent-demos-trial
Viewer
• Updated • 200 • 5
mehuldamani/lean-latent-demos-trial-v2
Viewer
• Updated • 200 • 4
meoconxinhxan/Big-Thought-v0.1-clean
Viewer
• Updated • 1.45M • 2
meoconxinhxan/Intellect-v0.1-clean
Viewer
• Updated • 1.09M • 2
Viewer
• Updated • 7.73k • 419
• 21
microsoft/llmail-inject-challenge
Viewer
• Updated • 462k • 1.09k
• 31
model-metadata/hf_jobs_url
Viewer
• Updated • 76 • 5
model-metadata/trending_models
Viewer
• Updated • 58 • 14
• 1
model-metadata/trending_models_exp
Viewer
• Updated • 100 • 10
model-metadata/trending_models_metadata
Viewer
• Updated • 100 • 199
• 1
Viewer
• Updated • 99.5k • 826
• 29
rahul7star/vaani-snac-cleaned
Viewer
• Updated • 26.3k • 9
rcds/swiss_leading_decision_summarization
Viewer
• Updated • 18.2k • 85
• 5
rcds/swiss_leading_decisions
Viewer
• Updated • 21.2k • 39
• 2
rex099/Kimi-K2.6-Thinking-200x-Cleaned
Viewer
• Updated • 207 • 71
rohanbalkondekar/linux_commands
Viewer
• Updated • 150 • 15
• 3
sam-paech/prm800k-sonnet3.5-validator-test
Viewer
• Updated • 200 • 6
Viewer
• Updated • 871 • 2
Viewer
• Updated • 5.69k • 8
• 2
simonycl/aime_training_positive_direct
Viewer
• Updated • 1k • 9
simonycl/amc_aime_training_positive_direct
Viewer
• Updated • 1k • 21
Viewer
• Updated • 134k • 83
• 3
svjack/Cat_Style_Food_Flux_Krea_Gen
Viewer
• Updated • 92 • 10
• 1
svjack/Xiang_Real_Anime_Background_Relight_Qwen_Edit_2511_Tuned
Viewer
• Updated • 42 • 101
Viewer
• Updated • 54.6k • 780
• 165
theblackcat102/crossvalidated-posts
Viewer
• Updated • 411k • 75
theblackcat102/spell_backward
Viewer
• Updated • 1.33k • 6
theblackcat102/tabmwp-clean
Viewer
• Updated • 18k • 62
traltyaziking/FlirtationFeatureSet
Viewer
• Updated • 100 • 10
• 1
traltyaziking/Sexy-Cloth-Lingerie-Collection
Viewer
• Updated • 134 • 19
• 4
uniquealexx/Kimi-K2.6-Thinking-200x
Viewer
• Updated • 207 • 147
• 2
universalgamingfen1/job-dataset-cleaned
Viewer
• Updated • 376 • 4
Viewer
• Updated • 134 • 4
Viewer
• Updated • 30.7k • 2
Updated • 685
Viewer
• Updated • 135k • 6
Viewer
• Updated • 2.59M • 2
Viewer
• Updated • 313 • 3.84k
• 22
xinshuo/test-connection-dataset
Viewer
• Updated • 3 • 6
ChuGyouk/ApolloMoEDataset-korean
Viewer
• Updated • 55.4k • 18
FreedomIntelligence/alpaca-gpt4-korean
Viewer
• Updated • 50k • 104
• 13
FreedomIntelligence/sharegpt-korean
Viewer
• Updated • 6.01k • 24
• 6
IDEA-CCNL/laion2B-multi-chinese-subset
Viewer
• Updated • 22.5M • 217
• 42
SEACrowd/nusatranslation_emot
Updated • 21
Updated • 12
Updated • 12
Updated • 14
adamo1139/seed_translation_samples
Viewer
• Updated • 40 • 7
french-open-data/communes-et-villes-de-france-en-csv-excel-json-parquet-et-feather
Updated • 27
french-open-data/effectifs-dans-les-enseignements-de-specialites-en-terminale-generale
french-open-data/emissions-des-gaz-a-effet-de-serre-ges-tous-secteurs-d-activite-confondus-des-epci-de-la-region
french-open-data/etablissements-artistiques-ecoles-de-musique-de-danse-et-de-theatre-calvados
french-open-data/fichiers-des-locaux-et-des-parcelles-des-personnes-morales-version-unifiee
french-open-data/financement-et-cout-des-logements-sociaux-rehabilites
french-open-data/lieux-de-mediation-numerique-sur-le-territoire-national-fournis-par-france-services
french-open-data/limites-cadastrales-de-la-polynesie-francaise-en-rgpf-par-ile-et-commune-format-dxf
french-open-data/locaux-non-residentiels-autorises-et-commences-series-mensuelles
french-open-data/nature-et-eau-en-metropole-vegetalisation-evolution-2015-2020
french-open-data/principaux-diplomes-et-formations-prepares-dans-les-etablissements-publics-sous-tutelle-du-minis
french-open-data/revenus-pauvrete-et-niveau-de-vie-en-2019-donnees-carroyees-dispositif-fichier-localise-social-e
french-open-data/territoire-a-risque-inondation-tri-de-la-baie-de-laiguillon-enjeux-lies-a-une-installation-pollu
french-open-data/territoires-a-risque-d-inondation-tri-de-saint-nazaire-guerande-enjeux-lies-a-une-station-de-tra
french-open-data/velos-a-assistance-electrique-en-libre-service-velomoove-bassin-de-pompey
hac541309/basic_korean_dict
Viewer
• Updated • 74.9k • 47
• 6
harpreetsahota/modern-to-shakesperean-translation
Viewer
• Updated • 274 • 13
• 8
jtatman/yolo5_russianlicenseplates_detect
Viewer
• Updated • 783 • 7
kreasof-ai/tatoeba-eng-bem-backtranslation
Viewer
• Updated • 20.1k • 7
lightonai/nanobeir-multilingual
Viewer
• Updated • 522k • 543
• 11
omarkamali/fineweb-arabic
Viewer
• Updated • 311M • 15
• 2
sert121/statlog_german_clean
Viewer
• Updated • 1k • 61
sert121/statlog_german_partial_clean
Viewer
• Updated • 1k • 4
alwaysfurther/heart_failure_symptoms
Viewer
• Updated • 150 • 5
BEE-spoke-data/bigpatent-all
Viewer
• Updated • 2.43M • 781
Intuit-GenSRF/es_legal_advice_reddit
Viewer
• Updated • 98.9k • 7
Viewer
• Updated • 3 • 28
MongoDB/supply_chain_contracts_dataset_small
Viewer
• Updated • 200 • 311
• 2
kaushik-harsh-99/Indian-legal-data-v1
Viewer
• Updated • 33.1k • 28
• 3
kaushik-harsh-99/Indian-legal-data-v2
Viewer
• Updated • 172k • 32
• 3
lianghsun/tw-judgment-gist
Viewer
• Updated • 31 • 10
Viewer
• Updated • 236k • 360
• 3
lianghsun/tw-law-augmented-v0.1
Viewer
• Updated • 748k • 7
lianghsun/tw-legal-methodology
Viewer
• Updated • 8.02k • 10
Viewer
• Updated • 12.8k • 9
Updated • 59
• 2
Updated • 37
• 4
rcds/lower_court_insertion_swiss_judgment_prediction
Viewer
• Updated • 2.25k • 37
rcds/occlusion_swiss_judgment_prediction
Viewer
• Updated • 56.8k • 89
rcds/swiss_court_view_generation
Updated • 238
• 2
rcds/swiss_judgment_prediction
Updated • 236
• 16
rcds/swiss_judgment_prediction_xl
Updated • 35
rcds/swiss_law_area_prediction
Viewer
• Updated • 22.3k • 33
• 5
Viewer
• Updated • 35.7k • 127
• 6
thangvip/legal-multiple-choice
Viewer
• Updated • 1.78k • 4
thangvip/legal-splited-ds
Viewer
• Updated • 753k • 18
thangvip/tokenized-ds-qwen3-legal-mixed
Viewer
• Updated • 397k • 4
theblackcat102/zhtw_legal
Viewer
• Updated • 62 • 6