Agentic data-analysis challenge

benchAnalyst

Build a tool-using agent that answers analytical questions about a large, deliberately messy dataset. Your agent reaches the data only through a constrained API, read-only SQL over the tables and keyword search and fetch over the documents, under a fixed budget of calls per question. Answers are graded against a held-out key.

Get your API key

Getting started

  1. Select your name to get your API key.
  2. Download the starter kit (API client, example agent, README).
  3. Build your agent; it queries, searches, and fetches through the API.
  4. Submit answers through the API and check the leaderboard.

Rules