Evaluates the usefulness and contribution of each step in an agent’s session towards achieving the overall task goal.
session
: Complete interaction log showing all stepsResult
: Value in the continuous range [0, 1]Reasoning
: Detailed explanation of utility assessment