OpenAI engineered a 'vanilla' data agent to navigate its massive 1.5 exabyte data platform, addressing the challenge of finding and understanding relevant data among 90,000 datasets. The agent's reliability stems from a simple architecture backed by a robust data infrastructure and a sophisticated context assembly layer, which feeds a single LLM with carefully curated information to generate accurate SQL queries and verified answers.
Read original on ByteByteGoOpenAI's data platform, storing 1.5 exabytes across 90,000 datasets, faced a significant challenge: analysts spent hours identifying and understanding the correct tables for queries, even before writing SQL. To solve this, OpenAI developed an in-house data agent, designed to be 'vanilla' in its core architecture but highly effective due to strong underlying data infrastructure and sophisticated context engineering.
The core of the data agent is intentionally simple: a single LLM (GPT-5.5) combined with a harness. The harness orchestrates the LLM's reasoning by providing tools, assembling relevant context, and running an iterative loop of reason-act-observe. This contrasts with more complex agent systems that might involve multiple LLMs, routers, or fine-tuning. The reliability isn't in a complex agent, but in the meticulous data acquisition and context provision.
The critical factor for the agent's success is the quality of the context provided to the LLM. A bare schema is insufficient; the agent uses six layers of context, prepared offline and retrieved at runtime, to ensure accuracy:
System Design Takeaway: Context over Complexity
This case study highlights that for LLM-powered agents, the *quality and relevance of the input context* often outweighs complex multi-LLM architectures or intricate routing. Investing in robust data infrastructure and intelligent context assembly can yield more reliable and scalable results than attempting to make the LLM itself 'smarter' through convoluted prompting or model layering. The 'vanilla' agent design proves that simplicity at the agent level can be achieved if the surrounding data platform handles the complexity of context provision.