No-Code Agent Evals

How to evaluate your no-code agent?

How to evaluate your no-code agent?

After testing in the playground, evaluate your Agents across multiple test cases to ensure consistent performance using the test runs.

Create a Dataset

Add test cases by creating a Dataset. For this example, we’ll use a Dataset of product images to generate descriptions.

Build your Agent

Create an Agent that processes your test examples. In this case, the agent generates product descriptions, translates them to multiple languages, and formats them to match specific requirements.

Agent for product description generation

Start a test run

Open the test configuration by clicking the Test button on the top right corner.

Configure your test

Select your dataset and add Evaluators to measure the quality of outputs.

Test configuration with dataset and evaluator options

Review results

Monitor the test run to analyze the performance of your agent across all inputs.

Test run results showing performance metrics

⌘I

Introduction

Prompt Engineering

Offline Evals

Online Evals

Tracing

Simulations

Library

Dashboards

Integrations

Settings

CI/CD

How to evaluate your no-code agent?

Introduction

Prompt Engineering

Offline Evals

Online Evals

Tracing

Simulations

Library

Dashboards

Integrations

Settings

CI/CD

​How to evaluate your no-code agent?

How to evaluate your no-code agent?