Evaluate captured logs automatically from the UI based on filters and sampling
Navigate to repository
Access evaluation configuration
Configure evaluation
in the top right corner of the page and choose the Setup evaluation configuration
option. This will open up the evaluation configuration sheet.Configure auto evaluation settings
Auto Evaluation
section has 3 parts:Select evaluators
: Choose the evaluators you want to use for your evaluation.Filters
: Setup filters to only evaluate logs that meet a certain criteria.Sampling
: Choose a sampling rate, this will help you control the amount of logs that are evaluated and prevent evaluating every log; which could potentially lead to very high costs.Human Evaluation
section below is explained in the Set up human evaluation on logs sectionSave configuration
Evaluation
tab, wherein you can see the evaluation in detail.
Filter logs with specific evaluation scores (e.g., bias score greater than 0)
Select all filtered logs using the top-left selector
Click the `Add to dataset` button that appears
Choose to add logs to an existing dataset or create a new dataset. Map the columns and click `Add entries`