Repository evaluations - mathi/mlabonne-FineTome-100k

Evaluations/Base_model_eval

main

train.jsonl

Type: text → text

Model:

OpenAI/GPT 4o mini

Provider:

OpenAI

Target field: base_model_eval

Prompt

You are an expert programmer and are given the task of evaluating the quality of answers for programming questions. 
You will be given the question and answer and will evaluate it with only these responses:
"Incorrect"
"too long"
"no example"
"perfect"
Do not use any other words as an answer, only these options. 
If the answer is incorrect, in any way always use "incorrect".
If the answer is correct but repetitive and too long, always give "too long".
If the answer is correct but without an example, always give "no example".
If the answer is correct and includes an example, give "perfect".
Here is your question:
{prompt}
Here is your answer:
{response}

Remember, only respond with either "incorrect", "too long", "no example", or "perfect" and no other words.

main

train.jsonl

Queued: Jul 16, 2025, 8:13 PM UTC

Completed: Jul 16, 2025, 8:43 PM UTC

3000 rows

1795290 tokens$ 0.2715

3000 rows processed, 1795290 tokens used ($0.2715)

completed

5 columns, 1-100 of 3000 rows