After testing in the playground, use test runs to evaluate your Agents across multiple test cases and ensure consistent performance.
Create a Dataset
Add test cases by creating a Dataset. For this example, we’ll use a Dataset of product images to generate descriptions.

Build your Agent
Create an Agent that processes your test examples. In this case, the agent generates product descriptions, translates them to multiple languages, and formats them to match specific requirements.

Configure your test
Select Evaluators to measure the quality of outputs, and map the evaluator variables to the dataset columns. For details, see the documentation on mapping evaluator variables.
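
Conceptually, a mapping tells each Evaluator which dataset column (or the Agent's output) supplies each of its variables. The sketch below illustrates that idea in plain Python. It is not the Maxim SDK: the column names (`product_name`, `expected_description`), the variable names (`output`, `expected`), the `@agent_output` marker, and the `resolve_variables` helper are all hypothetical.

```python
# Illustrative only: plain Python, not the Maxim SDK or its API.
# Column names, variable names, and the "@agent_output" marker are hypothetical.

def resolve_variables(mapping, row, agent_output):
    """Build one evaluator's inputs from a dataset row and the agent's output."""
    resolved = {}
    for variable, source in mapping.items():
        if source == "@agent_output":
            # This variable is filled from what the agent produced.
            resolved[variable] = agent_output
        else:
            # This variable is filled from the named dataset column.
            resolved[variable] = row[source]
    return resolved


# One test case (row) from a hypothetical dataset.
row = {
    "product_name": "Trail Runner 2",
    "expected_description": "Lightweight trail running shoe.",
}

# Evaluator variable -> where its value comes from.
mapping = {
    "output": "@agent_output",
    "expected": "expected_description",
}

inputs = resolve_variables(mapping, row, agent_output="A lightweight shoe for trails.")
print(inputs)
```

In this sketch, a mapping that points at a column missing from the row raises a `KeyError`, which is the kind of misconfiguration worth catching before a run starts.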

You can create and use Presets for your test runs to save time and avoid repeating the same configuration.
Review results
Monitor the test run to analyze the performance of your Agent across all inputs.
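
To make "performance across all inputs" concrete, the sketch below shows one way per-evaluator scores could be summarized once a run finishes. It is plain Python under stated assumptions, not Maxim's report format: the result records, the evaluator names (`clarity`, `format_check`), the `summarize` helper, and the 0.8 pass threshold are all hypothetical.

```python
from collections import defaultdict

# Illustrative only: hypothetical results, one record per (test case, evaluator).
results = [
    {"case": 1, "evaluator": "clarity", "score": 0.9},
    {"case": 2, "evaluator": "clarity", "score": 0.5},
    {"case": 1, "evaluator": "format_check", "score": 1.0},
    {"case": 2, "evaluator": "format_check", "score": 1.0},
]


def summarize(results, threshold=0.8):
    """Return the mean score and pass rate for each evaluator."""
    scores = defaultdict(list)
    for record in results:
        scores[record["evaluator"]].append(record["score"])
    return {
        name: {
            "mean": sum(values) / len(values),
            # Fraction of test cases scoring at or above the threshold.
            "pass_rate": sum(v >= threshold for v in values) / len(values),
        }
        for name, values in scores.items()
    }


summary = summarize(results)
for name, stats in summary.items():
    print(name, stats)
```

Looking at pass rate alongside the mean helps separate an Agent that is uniformly mediocre from one that does well on most inputs but fails on a few.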

