Evaluates whether an agent selected and used appropriate tools for each step in a task, including parameter configuration.
session
: Complete interaction log showing tool usage

Result
: Value in the continuous range [0, 1]

Reasoning
: Detailed explanation of tool selection assessment
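
As a rough illustration of the input and output shapes described above, here is a minimal Python sketch. The class name `ToolSelectionEvaluation`, its field names, and the layout of the `session` log (one dict per step with `tool` and `params` keys) are assumptions made for this example, not part of any documented API. The score and reasoning text are placeholder values, not real evaluator output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolSelectionEvaluation:
    """Hypothetical container for the two documented outputs."""

    result: float   # score in the continuous range [0, 1]
    reasoning: str  # explanation of the tool selection assessment

    def __post_init__(self) -> None:
        # Enforce the documented range so out-of-bounds scores fail early.
        if not 0.0 <= self.result <= 1.0:
            raise ValueError(f"result must be in [0, 1], got {self.result}")


# Assumed session format: one entry per step, recording the tool called
# and the parameters it was configured with.
session = [
    {"step": 1, "tool": "search", "params": {"query": "weather in Paris"}},
    {"step": 2, "tool": "calculator", "params": {"expression": "18 * 9 / 5 + 32"}},
]

# Placeholder evaluation; a real evaluator would derive these from `session`.
evaluation = ToolSelectionEvaluation(
    result=0.5,
    reasoning="Placeholder: one of two steps judged to use an appropriate tool.",
)
```

Validating the range in the constructor keeps downstream code from having to re-check that a score is a valid value in [0, 1].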