History
Total running cost: $0.0000
PromptRowsTypeModelTargetStatusRuntimeRunByTokensCost
Run
Compare the two answers and respond with true if the reasoning and answers are the same and false if not. Respond with a single word lower case. Answer 1: {reasoning} {answer} Answer 2: {prediction}
12texttextOpenAIOpenAI/GPT 4o6eff404c976dee97 completed 00:00:071 year agoox4072 tokens
Sample
Compare the two answers and respond with true if the reasoning and answers are the same and false if not. Respond with a single word lower case. Answer 1: {reasoning} {answer} Answer 2: {prediction}
10texttextOpenAIOpenAI/GPT 4oSample - N/A completed 00:00:041 year agoox3463 tokens