Assesses whether an agent has completed all required steps to achieve a task, evaluating the logical progression and completeness of steps taken during a session.
session
: Complete interaction log between user and agent showing all steps takenResult
: Binary Value (0 or 1)Reasoning
: Detailed explanation of the evaluation