
Overview

Glean provides a platform to build powerful AI agents that can interact with your enterprise data and tools. With Maxim, you can test these agents at scale using automated simulations to ensure they handle various scenarios correctly before deploying to production. This guide shows you how to:
  1. Create an agent in Glean
  2. Connect the agent endpoint to Maxim
  3. Run automated simulations to test agent behavior
  4. Evaluate performance across multiple scenarios

Prerequisites

  • A Glean workspace with agent creation access
  • Glean API credentials
  • A Maxim account

Step 1: Create Your Agent in Glean

First, create and configure your agent in Glean:
  1. Navigate to your Glean workspace
  2. Create a new agent with your desired capabilities:
    • Define the agent’s purpose and instructions
    • Configure any tools or data sources the agent needs
    • Set up the agent’s behavior and guardrails
  3. Once created, note your agent’s ID - you’ll need this for the API endpoint

Step 2: Get Your Glean API Credentials

To connect Maxim to your Glean agent, you’ll need:
  • Instance Name: Your Glean instance name (e.g., instance-name-be)
  • API Token: Your Glean API Bearer token for authentication
  • Agent ID: The unique identifier for your agent
You can find these in your Glean workspace settings and agent configuration.
The API token will be used in the Authorization: Bearer <token> header when making requests to your Glean agent.
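
For local testing outside Maxim, it can help to keep these values in environment variables. Below is a minimal Python sketch; the variable names are illustrative, not required by Glean or Maxim, and the URL and headers mirror the configuration used later in this guide.

import os

# Illustrative environment variable names -- use whatever convention you prefer
GLEAN_INSTANCE = os.environ["GLEAN_INSTANCE"]    # e.g. "instance-name-be"
GLEAN_API_TOKEN = os.environ["GLEAN_API_TOKEN"]  # Glean API Bearer token
GLEAN_AGENT_ID = os.environ["GLEAN_AGENT_ID"]    # your agent's ID

BASE_URL = f"https://{GLEAN_INSTANCE}.glean.com/rest/api/v1"
HEADERS = {
    "Content-Type": "application/json",
    "Accept": "application/json",
    "Authorization": f"Bearer {GLEAN_API_TOKEN}",
}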

Step 3: Configure HTTP Endpoint in Maxim

Now, set up the Glean agent as an HTTP endpoint in Maxim:
1. Create a new HTTP endpoint workflow

  1. In Maxim, navigate to the Evaluate section
  2. Click on Agents via HTTP Endpoint or create a new workflow
  3. Name your workflow (e.g., “Customer Support Agent”, “Travel Agent”)
2. Configure the endpoint

Set up your Glean agent endpoint with the following configuration:

Endpoint URL:
https://instance-name-be.glean.com/rest/api/v1/agents/runs/wait
Replace instance-name-be with your Glean instance name.

Method: POST

Headers: Add the required headers:

  Key             Value
  Content-Type    application/json
  Accept          application/json
  Authorization   Bearer <your-glean-api-token>
Replace <your-glean-api-token> with your actual Glean API token.
3. Configure the request body

Set up the message structure that Glean expects:
{
  "agent_id": "string",
  "input": {},
  "messages": [
    {
      "role": "USER",
      "content": [
        {
          "text": "string",
          "type": "text"
        }
      ]
    }
  ],
  "metadata": {}
}
Map your variables:
  • agent_id: Your Glean agent ID (string)
  • input: Additional input parameters (object, can be empty {})
  • messages[0].content[0].text: The user message from your dataset
  • metadata: Optional metadata (object, can be empty {})
Example with actual values:
{
  "agent_id": "my-customer-support-agent",
  "input": {},
  "messages": [
    {
      "role": "USER",
      "content": [
        {
          "text": "{{user_message}}",
          "type": "text"
        }
      ]
    }
  ],
  "metadata": {}
}
In this example, {{user_message}} is a variable that will be replaced with values from your dataset.

Complete curl example for reference:
curl -L 'https://instance-name-be.glean.com/rest/api/v1/agents/runs/wait' \
  -H 'Content-Type: application/json' \
  -H 'Accept: application/json' \
  -H 'Authorization: Bearer <token>' \
  -d '{
    "agent_id": "my-customer-support-agent",
    "input": {},
    "messages": [
      {
        "role": "USER",
        "content": [
          {
            "text": "I need a refund for order #12345",
            "type": "text"
          }
        ]
      }
    ],
    "metadata": {}
  }'
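
If you want to sanity-check the endpoint from a script before wiring it into Maxim, the same request can be sent from Python. This is a sketch assuming the requests library is installed; replace the placeholder instance name, token, and agent ID with your own values.

import requests

INSTANCE = "instance-name-be"            # your Glean instance name
API_TOKEN = "<your-glean-api-token>"     # your Glean API token
AGENT_ID = "my-customer-support-agent"   # your agent ID

url = f"https://{INSTANCE}.glean.com/rest/api/v1/agents/runs/wait"
headers = {
    "Content-Type": "application/json",
    "Accept": "application/json",
    "Authorization": f"Bearer {API_TOKEN}",
}
payload = {
    "agent_id": AGENT_ID,
    "input": {},
    "messages": [
        {
            "role": "USER",
            "content": [{"text": "I need a refund for order #12345", "type": "text"}],
        }
    ],
    "metadata": {},
}

# Send the request and print the raw response body
response = requests.post(url, headers=headers, json=payload, timeout=120)
response.raise_for_status()
print(response.json())

Printing the raw response is also a convenient way to see which field holds the agent's final answer before you select the Output field in Step 5.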

Step 4: Create a Simulation Dataset

Create a dataset with scenarios to test your agent:
1. Navigate to Datasets

Go to the Library section and select Datasets
2. Create or import dataset

When creating your dataset, select the Agent simulation template. This will create a dataset with the following columns:
  • user_message: The initial message from the user to start the conversation
  • agent_scenario: Description of what the user is trying to accomplish
  • expected_steps: Step-by-step description of how you expect the agent to handle the scenario
Example dataset for a Customer Support bot:
  • user_message: "I need a refund for order #12345"
    agent_scenario: Customer requesting refund for defective product
    expected_steps: 1) Agent greets customer and acknowledges refund request; 2) Agent verifies order details and purchase history; 3) Agent confirms defect reason; 4) Agent processes refund and provides confirmation
  • user_message: "What’s the warranty on the iPhone 15?"
    agent_scenario: User inquiring about product warranty information
    expected_steps: 1) Agent provides accurate warranty period (typically 1 year); 2) Agent explains what’s covered under warranty; 3) Agent offers extended warranty options if available
  • user_message: "I need to change my billing address"
    agent_scenario: Customer wanting to update account information
    expected_steps: 1) Agent verifies customer identity; 2) Agent guides customer through address update process; 3) Agent confirms new address and updates account
  • user_message: "My order hasn’t arrived yet"
    agent_scenario: Customer checking on delayed order status
    expected_steps: 1) Agent asks for order number; 2) Agent looks up tracking information; 3) Agent explains current status and expected delivery; 4) Agent offers solution if delayed
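
If you prefer to prepare the dataset as a file and import it (assuming CSV import is available in your Maxim workspace), a minimal Python sketch that writes the three columns above might look like this; the file name is arbitrary.

import csv

rows = [
    {
        "user_message": "I need a refund for order #12345",
        "agent_scenario": "Customer requesting refund for defective product",
        "expected_steps": "1) Agent greets customer and acknowledges refund request; "
                          "2) Agent verifies order details and purchase history; "
                          "3) Agent confirms defect reason; "
                          "4) Agent processes refund and provides confirmation",
    },
    # ...add one dict per scenario
]

# Write the scenarios to a CSV with the Agent simulation template columns
with open("agent_simulation_dataset.csv", "w", newline="", encoding="utf-8") as f:
    writer = csv.DictWriter(f, fieldnames=["user_message", "agent_scenario", "expected_steps"])
    writer.writeheader()
    writer.writerows(rows)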

Step 5: Run Simulated Sessions

Now you can test your agent with automated simulations:
1. Set up the test run

  1. Navigate to your HTTP endpoint workflow
  2. Click Test in the top right
  3. Select Simulated session mode
2. Configure simulation parameters

Dataset: Select the dataset you created with test scenarios.

Environment (optional): Choose an environment if you have different environments configured.

Response fields to evaluate:
  • Output field: Select the field from the response that contains the agent’s final answer
  • Context field (optional): Select fields containing retrieved context or tool outputs
Select evaluators: Choose evaluators to assess your agent:
  • Task Completion
  • Agent Trajectory
You can also browse the Evaluator Store to add more evaluators.
3. Trigger the test run

Click Trigger test run to start the simulation.

The system will:
  1. Iterate through each scenario in your dataset
  2. Simulate multi-turn conversations based on the scenario
  3. Evaluate the agent’s performance using your selected evaluators
  4. Generate a comprehensive report
4. Review results

Once complete, you’ll see a comprehensive test run report with:

Test Summary (Left Panel)
  • HTTP endpoint and dataset information
  • Status overview (e.g., “10 Completed”)
  • Test run duration
  • Simulation cost and evaluation cost
  • Timestamp and author information
Summary by Evaluator
  • Pass/Fail status for each evaluator
  • Mean scores (e.g., 0.4/1)
  • Pass rates (e.g., 40%)
  • Visual indicators for performance
Latency Metrics
  • Minimum, maximum, and p50 latency
  • Helps identify performance bottlenecks
Detailed Results Table

Each row shows:
  • Status: Whether the simulation completed successfully
  • Agent scenario: The test scenario from your dataset
  • Expected steps: The expected agent behavior you defined
  • sessionId: Unique identifier for each simulated session
  • Avg. latency (ms): Response time for that scenario
  • Cost ($): Individual test cost
  • Tokens: Token usage per scenario
You can:
  • Click on any row to view the full conversation transcript
  • Sort by any column to identify patterns
  • Filter results to focus on specific scenarios
  • Export the report for further analysis

Advanced: Single Turn Testing

For quick validation, you can also run single-turn tests:
  1. In the test setup modal, select Single turn instead of Simulated session
  2. This will send one message per dataset row without multi-turn conversation
  3. Useful for:
    • Quick validation of changes
    • Testing specific inputs
    • Regression testing with known inputs/outputs

Resources