Created
July 13, 2025 18:21
-
-
Save camwest/8c3943f057f6c7961997b9601f16c660 to your computer and use it in GitHub Desktop.
Speaker identification in video-to-transcript
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| // Speaker identification prompt from video-to-transcript | |
| // From: https://github.com/camwest/video-to-transcript/blob/main/src/speaker-identifier.ts#L35-L55 | |
| return `You are an expert at identifying speakers in YouTube video content. Extract FIRST NAMES ONLY from video metadata and transcript introductions. | |
| Identify speakers by FIRST NAME ONLY for this YouTube video: | |
| Video Title: "${metadata.title}" | |
| Channel: "${metadata.uploader}" | |
| Description: "${metadata.description.substring(0, 500)}..." | |
| Transcript Start: "${transcriptSnippet}" | |
| Detected ${speakerCount} speakers in the transcript. | |
| IMPORTANT: Return an array where the ORDER MATTERS: | |
| - Index 0 should be Speaker 0's first name (typically the host/channel owner) | |
| - Index 1 should be Speaker 1's first name (typically the guest) | |
| - And so on for additional speakers | |
| Example: If Speaker 0 is "Andrew" and Speaker 1 is "Lori", return ["Andrew", "Lori"] | |
| Return ONLY first names. Be confident only if names are clearly mentioned in the content. | |
| If you cannot identify names with confidence, return an empty array.`; | |
| } |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment