History
Total running cost: $0.0000
Prompt | Rows | Type | Model | Target | Status | Runtime | Run | By | Tokens | Cost
Run
Compare the two answers and respond with true if the reasoning and answers are the same and false if not. Respond with a single word lower case. Answer 1: {reasoning} {answer} Answer 2: {prediction} | 12 | text text | OpenAI OpenAI/GPT 4o | 6eff404c976dee97 | completed | 00:00:07 | 11 months ago | ox | 4072 tokens |
Sample
Compare the two answers and respond with true if the reasoning and answers are the same and false if not. Respond with a single word lower case. Answer 1: {reasoning} {answer} Answer 2: {prediction} | 10 | text text | OpenAI OpenAI/GPT 4o | Sample - N/A | completed | 00:00:04 | 11 months ago | ox | 3463 tokens |
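
The prompt recorded in the history above is a template with three placeholders: {reasoning}, {answer} and {prediction}. As a minimal sketch of how such a template might be filled per row and how the one-word reply might be read back, the following Python can serve as an illustration. The helper names (render_prompt, parse_verdict) and the example row are hypothetical and are not taken from the tool that produced this history; no model call is made.

```python
# Template text copied from the history entry above.
TEMPLATE = (
    "Compare the two answers and respond with true if the reasoning and "
    "answers are the same and false if not. Respond with a single word "
    "lower case. Answer 1: {reasoning} {answer} Answer 2: {prediction}"
)


def render_prompt(row: dict) -> str:
    """Fill the three placeholders from one dataset row (hypothetical helper).

    Raises KeyError if a placeholder has no matching key in the row.
    """
    return TEMPLATE.format(**row)


def parse_verdict(reply: str) -> bool:
    """Map the model's one-word reply to a bool (hypothetical helper).

    Surrounding whitespace and letter case are tolerated; anything other
    than "true" or "false" is rejected rather than guessed at.
    """
    word = reply.strip().lower()
    if word == "true":
        return True
    if word == "false":
        return False
    raise ValueError(f"unexpected verdict: {reply!r}")


# Made-up example row, for illustration only.
example_row = {
    "reasoning": "2 + 2 equals 4.",
    "answer": "4",
    "prediction": "4",
}

print(render_prompt(example_row))
```

Rejecting unexpected replies instead of defaulting to false keeps malformed model output from being silently counted as a mismatch.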