Skip to main content
Maxim provides three flexible ways to build and maintain evaluation datasets:
  • Curate dataset from production: Filter real user interactions and human feedback to capture edge cases, failure modes, and high-value scenarios that reflect actual usage patterns.
  • Generate synthetically: Create test datasets automatically with custom configurations for your use case including inputs, expected outputs, scenarios, personas, and expected steps. You can generate from scratch or use existing datasets as reference context.
  • Import existing datasets: Bring in datasets from CSV files, external sources, or other evaluation platforms.