Input

  • Required Inputs:
    • session: Complete interaction log between user and agent showing all steps taken

Output

  • Result: Binary Value (0 or 1)
  • Reasoning: Detailed explanation of the evaluation

Interpretation

  • 1: All required steps have been completed
  • 0: Missing steps to complete the task