Ensuring your prompt selects the correct tool call (function) is crucial for building reliable and efficient AI workflows. Maxim’s playground lets you attach your tools (API, code, or schema) and measure tool call accuracy for agentic systems.
Tools live in the library section of your workspace. They can be executable or schema-only, and you then attach them to your prompt for testing.
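To make the difference between a schema-only tool and an executable tool concrete, here is a minimal sketch in Python. The `get_weather` tool, its parameters, and the schema layout (modeled on the common JSON Schema function-calling convention) are illustrative assumptions, not Maxim's exact tool format.

```python
import json

# Hypothetical schema-only tool: it only describes the name and arguments
# the model may call. The layout follows the widely used JSON Schema
# function-calling convention; the fields Maxim expects may differ.
GET_WEATHER_SCHEMA = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return the current temperature for a city.",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
                "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
            },
            "required": ["city"],
        },
    },
}


def get_weather(city: str, unit: str = "celsius") -> dict:
    """Hypothetical executable counterpart; returns canned data, not a real API response."""
    temperature_c = 21.0
    if unit == "fahrenheit":
        value = temperature_c * 9 / 5 + 32
    else:
        value = temperature_c
    return {"city": city, "temperature": value, "unit": unit}


if __name__ == "__main__":
    print(json.dumps(GET_WEATHER_SCHEMA, indent=2))
    print(get_weather("Paris", unit="fahrenheit"))
```

A schema-only tool is enough to check whether the model picks the right function and arguments; an executable tool additionally lets you inspect the execution result.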
Create a new tool
Attach tools to your prompt
Send prompt with tool instructions
Review assistant's tool selection
Examine tool execution results
Manually test different scenarios
Prepare your dataset
Include input and expected tool calls columns.
Define expected tool calls
Initiate prompt testing
Select your test dataset
Choose the accuracy evaluator
Review accuracy scores
Analyze detailed message logs
Detailed logs appear in the messages tab.
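The dataset and scoring steps above can be sketched as follows. The row layout, the tool names, the `tool_call_accuracy` function, and its exact-match rule (same tool name and same arguments, in any order) are assumptions for illustration; Maxim's built-in accuracy evaluator may score differently.

```python
import json

# Hypothetical dataset rows: each pairs an input with the expected tool calls.
DATASET = [
    {
        "input": "What's the weather in Paris?",
        "expected_tool_calls": [
            {"name": "get_weather", "arguments": {"city": "Paris"}}
        ],
    },
    {
        "input": "Convert 10 USD to EUR.",
        "expected_tool_calls": [
            {
                "name": "convert_currency",
                "arguments": {"amount": 10, "from": "USD", "to": "EUR"},
            }
        ],
    },
]


def _normalize(call: dict) -> str:
    """Serialize a tool call with sorted keys so argument order does not matter."""
    payload = {"name": call["name"], "arguments": call.get("arguments", {})}
    return json.dumps(payload, sort_keys=True)


def tool_call_accuracy(expected: list, actual: list) -> float:
    """Return 1.0 if the actual calls match the expected calls exactly (any order), else 0.0."""
    same = sorted(map(_normalize, expected)) == sorted(map(_normalize, actual))
    return 1.0 if same else 0.0


def mean_accuracy(rows: list, outputs: list) -> float:
    """Average the per-row scores across the dataset."""
    scores = [
        tool_call_accuracy(row["expected_tool_calls"], out)
        for row, out in zip(rows, outputs)
    ]
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    # Simulated model outputs: the first row is correct, the second picks the wrong tool.
    outputs = [
        [{"name": "get_weather", "arguments": {"city": "Paris"}}],
        [{"name": "get_weather", "arguments": {"city": "EUR"}}],
    ]
    print(mean_accuracy(DATASET, outputs))  # prints 0.5
```

Exact matching is the strictest choice; a looser rule (for example, matching only the tool name) would give partial credit when the model selects the right function with imperfect arguments.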