Vision and Language, Natural Language, Cultural Understanding, Reasoning
UnpredictaBench: A Benchmark for Evaluating Distributional Randomness in LLMs
Display competition details and manage submissions