Input

  • Required Inputs:
    • input: The original input/prompt given to the model
    • actual_output: Array of multiple outputs generated by the model

Output

  • Result: Score is one among the discrete values : 0.2, 0.4, 0.6, 0.8, 1
  • Reasoning: Detailed explanation of consistency assessment

Interpretation

  • 1.0: Fully consistent, effectively compatible responses
  • 0.8: Mostly consistent, with only minor inconsistencies
  • 0.6: Somewhat consistent but with room for improvement
  • 0.4: Mostly inconsistent with occasional consistency
  • 0.2: Highly inconsistent; all outputs incompatible