Evaluations/4o mini evals
main
train.jsonl
text → text
OpenAIOpenAI/GPT 4o mini
OpenAI OpenAI
4o_mini_evals
You are an expert at evaluating coding question responses. You will receive a coding question and a response and your only job is to answer if the answer is "Bad", "Good", "Great".
You MUST only answer with one word.
Here is your prompt: {prompt}
Here is your response: {response}
Jul 4, 2025, 7:57 PM UTC
Jul 4, 2025, 7:57 PM UTC
5 row sample
3191 tokens$ 0.0005
5 rows processed, 3191 tokens used ($0.0005)
Estimated cost for all 3000 rows: $0.2885
Sample Results completed
4 columns, 1-5 of 3000 rows