Evaluations/4o mini evals/Iteration history
History
Total running cost: $0.0005
PromptRowsTypeModelTargetStatusRuntimeRunByTokensCost
Sample
You are an expert at evaluating coding question responses. You will receive a coding question and a response and your only job is to answer if the answer is "Bad", "Good", "Great". You MUST only answer with one word. Here is your prompt: {prompt} Here is your response: {response}
5texttextOpenAIOpenAI/GPT 4o miniSample - N/A completed 00:00:041 month agomathi3191 tokens$ 0.0005