Kwindla Hultman Kramer kwindla

@kwindla
kwindla / sonic-3-launch-day.py
Created October 28, 2025 18:53
Cartesia Sonic-3 Pipecat Example
import asyncio
import datetime
import os
import wave
from pathlib import Path
from dotenv import load_dotenv
from loguru import logger
from pipecat.audio.turn.smart_turn.base_smart_turn import SmartTurnParams
@kwindla
kwindla / bot.py
Created September 5, 2025 01:00
local macos transport bot.py
import asyncio
import os
from datetime import datetime
from dotenv import load_dotenv
from loguru import logger
from pipecat.audio.vad.silero import SileroVADAnalyzer
from pipecat.pipeline.pipeline import Pipeline
from pipecat.pipeline.runner import PipelineRunner
@kwindla
kwindla / gpt-5-ttft.py
Created August 8, 2025 17:19
GPT-5 TTFT demo
# export OPENAI_API_KEY=sk_proj-...
# uv run test-gpt5.py "Write a four-line poem about LLMs."
# For a complete, single-file conversational voice agent
# example using GPT-5, see this gist:
# https://gist.github.com/kwindla/678ea297d12e24b928b636db928226fb
# /// script
# dependencies = ["openai"]
# ///
@kwindla
kwindla / gpt-5-voice.agent.py
Last active August 24, 2025 03:59
GPT-5 voice agent single-file example
# export OPENAI_API_KEY=sk_proj-...
# uv run gpt-5-voice-agent.py
# /// script
# dependencies = [
# "numba==0.61.2",
# "openai",
# "python-dotenv",
# "fastapi[all]",
# "uvicorn",
@kwindla
kwindla / voice-agents.md
Created June 23, 2025 23:43
Advice on Voice Agents - June 2025

Advice on Voice AI, June 2025

My top three pieces of advice for people getting started with voice agents.

  1. Spend time up front understanding why latency and instruction-following accuracy drive voice AI tech choices.

  2. You will need to add significant tooling complexity as you go from proof of concept to production. Prepare for that. Especially important: build lightweight evals as early as you can.

  3. The right path is: start with a proven, "best practices" tech stack -> get everything working one piece at a time -> deploy to real-world users and collect data -> then think about optimizing cost/latency/etc.
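
As one way to picture the "lightweight evals" mentioned in point 2, here is a minimal sketch that scores a streaming agent on the two quantities from point 1: time to first token and instruction following. Everything in it is an assumption made for illustration: `fake_agent` is a hypothetical stand-in for a real streaming LLM call, and `EvalCase`, `EvalResult`, and `run_evals` are invented names, not part of any library.

```python
import time
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator


@dataclass
class EvalCase:
    prompt: str
    check: Callable[[str], bool]  # True if the reply follows the instruction


@dataclass
class EvalResult:
    prompt: str
    passed: bool
    ttft_seconds: float  # time until the first streamed chunk arrived


def run_evals(
    agent: Callable[[str], Iterator[str]], cases: Iterable[EvalCase]
) -> list[EvalResult]:
    """Run each case once, recording pass/fail and time to first token."""
    results = []
    for case in cases:
        start = time.perf_counter()
        ttft = None
        chunks = []
        for chunk in agent(case.prompt):
            if ttft is None:
                ttft = time.perf_counter() - start
            chunks.append(chunk)
        reply = "".join(chunks)
        results.append(
            EvalResult(
                prompt=case.prompt,
                passed=case.check(reply),
                # An agent that yields nothing never produced a first token.
                ttft_seconds=ttft if ttft is not None else float("inf"),
            )
        )
    return results


def fake_agent(prompt: str) -> Iterator[str]:
    """Hypothetical stand-in for a streaming LLM call; replace with a real one."""
    for word in ["Sure,", " your", " order", " number", " is", " 42."]:
        yield word


cases = [
    EvalCase("Tell me my order number.", lambda reply: "42" in reply),
    EvalCase("Reply in one sentence.", lambda reply: reply.count(".") <= 1),
]

results = run_evals(fake_agent, cases)
pass_rate = sum(r.passed for r in results) / len(results)
for r in results:
    print(f"{r.prompt!r}: passed={r.passed} ttft={r.ttft_seconds:.4f}s")
print(f"pass rate: {pass_rate:.0%}")
```

A harness this small can run on every prompt or model change, which keeps regressions in latency or instruction following visible well before production tooling is in place.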

@kwindla
kwindla / video-inference-result.md
Created June 13, 2025 16:40
Gemini Pro video understanding
@kwindla
kwindla / inference-note.md
Created June 11, 2025 17:30
Funny inference result captured while recording demo traces

Funny GPT-4o inference result.

Audible in this video: https://youtu.be/PgyJs0jfp_o?si=43CJgmk954kulmgl&t=863

Output

It sounds like we're on an intriguing mission! I'm going to scan through the grand chandeliers and ornate carpets of the hotel for traces. Be right back with the results!
@kwindla
kwindla / gemini-talk-transcript.py
Created May 6, 2025 15:43
Cleaned up talk transcript matched to onscreen slides
from google import genai
import os
client = genai.Client(api_key=os.getenv("GOOGLE_API_KEY"))
# filename_for_upload = "/Users/khkramer/Downloads/maven-lightning-trimmed.mp4"
# myfile = client.files.upload(file=filename_for_upload)
#
# print("My files:")
@kwindla
kwindla / daily-transport-double-transcription.py
Created April 28, 2025 01:36
Double transcription events test
# double transcription events
# pip install 'pipecat-ai[daily,silero,openai,cartesia]'==0.0.59 dotenv
#
# transcription events as expected
# pip install 'pipecat-ai[daily,silero,openai,cartesia]'==0.0.58 dotenv
import asyncio
import sys
import os
@kwindla
kwindla / gladia-tagalog-mixed.py
Created April 26, 2025 02:16
DailyTransport, Gladia, Tagalog / English mixed
#
# Copyright (c) 2024–2025, Daily
#
# SPDX-License-Identifier: BSD 2-Clause License
#
import argparse
import asyncio
import os