Evaluates whether multiple outputs generated by a language model for the same input are consistent with each other.
input
: The original input/prompt given to the modelactual_output
: Array of multiple outputs generated by the modelResult
: Score is one among the discrete values : 0.2, 0.4, 0.6, 0.8, 1Reasoning
: Detailed explanation of consistency assessment