Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Kyle Montgomery's picture
3 6 2

Kyle Montgomery

kylemontgomery
ncrispino's profile picture
·

AI & ML interests

None yet

Recent Activity

updated a dataset 10 days ago
kylemontgomery/swesmith-filtered
published a dataset 12 days ago
kylemontgomery/swesmith-filtered
upvoted a paper 12 days ago
Agents' Last Exam
View all activity

Organizations

WangLab's profile picture VMDT -- Video Trustworthiness Benchmark's profile picture ScalerLab's profile picture

upvoted a paper 12 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 19 days ago • 357
upvoted a paper 2 months ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published Feb 25 • 26
upvoted a paper 3 months ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 101
upvoted 2 papers 8 months ago

Budget-aware Test-time Scaling via Discriminative Verification

Paper • 2510.14913 • Published Oct 16, 2025 • 5

Predicting Task Performance with Context-aware Scaling Laws

Paper • 2510.14919 • Published Oct 16, 2025 • 4
upvoted a paper over 1 year ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 47
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs