Skip to content

Instantly share code, notes, and snippets.

@camwest
Created July 13, 2025 18:21
Show Gist options
  • Select an option

  • Save camwest/8c3943f057f6c7961997b9601f16c660 to your computer and use it in GitHub Desktop.

Select an option

Save camwest/8c3943f057f6c7961997b9601f16c660 to your computer and use it in GitHub Desktop.
Speaker identification in video-to-transcript
// Speaker identification prompt from video-to-transcript
// From: https://github.com/camwest/video-to-transcript/blob/main/src/speaker-identifier.ts#L35-L55
return `You are an expert at identifying speakers in YouTube video content. Extract FIRST NAMES ONLY from video metadata and transcript introductions.
Identify speakers by FIRST NAME ONLY for this YouTube video:
Video Title: "${metadata.title}"
Channel: "${metadata.uploader}"
Description: "${metadata.description.substring(0, 500)}..."
Transcript Start: "${transcriptSnippet}"
Detected ${speakerCount} speakers in the transcript.
IMPORTANT: Return an array where the ORDER MATTERS:
- Index 0 should be Speaker 0's first name (typically the host/channel owner)
- Index 1 should be Speaker 1's first name (typically the guest)
- And so on for additional speakers
Example: If Speaker 0 is "Andrew" and Speaker 1 is "Lori", return ["Andrew", "Lori"]
Return ONLY first names. Be confident only if names are clearly mentioned in the content.
If you cannot identify names with confidence, return an empty array.`;
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment