Repository evaluations - ox/eminem-character-lora

Evaluations/Json Timestamp Decoder.

main

full_videos.parquet

Type: video → text

Model:

Google/Gemini 3.1 Pro Preview

Provider: Google

Target field: eminem_speaking

Prompt

Analyze the following video and give me a list of timestamps where it is just the rapper Eminem in the frame, and he is the only person speaking. Then transcribe the speech in that section of the clip. Format the response in a json array structure of start_time, end_time, only_eminem, and speech as two timestamps, a boolean, and a string. 

For example:

[
  {
    "start_time": 0.0,
    "end_time": 1.0,
    "only_eminem": false,
    "speech": "And how did that make you feel"
  },
  {
    "start_time": 1.0,
    "end_time": 3.2,
    "only_eminem": true,
    "speech": "I mean, man, it felt really good!"
  }
]

Only respond with the json structureand nothing else. Make sure to include all the timestamps from 0.0 to the end of the video, and break up the clips on when Eminem starts and stops talking, with the camera fully on him.

{file_path}

Queued: Apr 3, 2026, 1:33 AM UTC

Completed: Apr 3, 2026, 1:37 AM UTC

5 row sample

71745 tokens$ 0.1835

5 rows processed, 71745 tokens used ($0.1835)

Estimated cost for all 25 rows: $0.9173

Sample Results completed

2 columns, 1-5 of 25 rows