A podcast about Voice AI hosted by Dave Zohrob and Harish Agarwal, the cofounders of Chartable (acquired by Spotify in 2022). We interview founders, technologists, and investors at the frontier of voice AI applications, and share our startup journey along the way. More at https://smartspeakers.fm
Mon, March 24, 2025
Much of my childhood was spent in my basement, dialing up other people’s computers to trade messages and play games. Kwin Kramer from Daily remembers that time, too, and says today's voice AI moment feels just like those early internet days. That sense of endless possibilities is back.

His open-source project PipeCat has become the standard toolkit for voice agents. What began as an experiment now powers voice AI for OpenAI, Google DeepMind, and countless startups, making conversations feel natural and responsive.

Some highlights:
* That early internet feeling is back: "1995 to 1999 felt a certain way. It never felt that way again until 2023 to 2025."
* GPT-4 transformed Daily's business by removing a key bottleneck: "Previously you needed two humans for a conversation. Now you only need one, maybe not even that."
* Voice AI's killer feature? Latency matters: "If response times are long, you're in that uncanny valley where people get uncomfortable."
* Kwin's bold prediction: "We're all going to have friends in our group chats that aren't human because LLMs are actually really entertaining."

Hope you enjoy it as much as we did.

Links
Daily: https://daily.co/
PipeCat: https://pipecat.ai/
Kwin on Twitter: https://twitter.com/kwindla

Chapters
0:00 Intro
2:02 First AI aha moment
5:43 MIT Media Lab beginnings
9:05 BBS and door games
15:26 The AllAfrica journey
18:54 Starting Daily
21:13 COVID's impact on WebRTC
22:36 GPT-4 transformation
31:26 Building voice for LLMs
35:17 PipeCat's key challenges
44:10 The future of speech-to-speech
47:10 Voice AI adoption trends
52:34 Vibe coding revolution
56:11 What's next for PipeCat

This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.smartspeakers.fm
Mon, March 17, 2025
This week, Tom Shapland shares his journey from agriculture tech to founding Canonical, a tool that helps voice AI developers understand and improve conversations by mapping call stages. Tom has deep experience in the voice world. We loved this conversation and we’re sure you will too!

Links
https://canonical.chat
https://x.com/Tom_Shapland
https://www.linkedin.com/in/tom-shapland-b4494212/

Chapters
0:00 - Intro
1:05 - Welcome and first AI moment
3:03 - Computer vision for thirsty plants
5:50 - Explaining AI to farmers
7:52 - Hardware is hard
8:31 - Non-VC shaped business struggles
16:51 - Starting a new company
19:41 - From metrics to conversation stages
22:42 - Voice AI evolution
26:40 - Balancing determinism and freedom
29:53 - Who uses Canonical and when
32:39 - Don't make your agent spell anything
35:52 - Zero to 80% is easy, production is hard
39:28 - Future of voice AI adoption
42:21 - Book recommendations and closing

This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.smartspeakers.fm
Mon, March 10, 2025
Voice AI is having its moment, but pretty soon every company will be a voice company. Olivia Moore is the author of a16z’s market report on voice AI agents, which is a must-read if you’re interested in the space. She’s an amazing thinker and we cover a lot in this conversation, including:
* Olivia's thesis that Voice AI is in its “internet in 1999” moment
* The "wedge" strategy: using Voice AI to get in the door before transforming broader business operations
* How recruiting became a surprising early adopter of Voice AI (candidates actually prefer it)
* Voice AI's unique ability to handle compliance and stay on-script in regulated industries
* Shifting from per-minute pricing to transaction-based fees or success metrics
* Why big tech platforms (OpenAI, Google) won't dominate vertical-specific voice applications

Links
* Olivia on X: https://x.com/omooretweets (must-follow!)
* Her market report on voice AI agents
* Dr Sbaitso emulator - this was packaged with new Sound Blaster sound cards! Strange stuff.

Chapters
0:00 Intro and welcome
1:24 Olivia's first AI "aha" moments
3:42 Coolest non-voice AI tools
6:29 How Olivia got into the Voice AI space
10:03 From consumer to enterprise
13:50 DMV-bot
16:22 Voice AI as a wedge
20:57 Traction in the voice market
24:24 Voice AI for recruiting
28:59 Why AI doesn't go off-script
31:03 Better experiences for candidates and customers
33:24 The future of call centers
37:02 Bear cases for Voice AI
41:25 Open problems
44:29 Pricing
47:12 Working with twin sister Justine
50:11 Three things Olivia's excited about

Subscribe and listen: https://smartspeakers.fm

This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.smartspeakers.fm
Mon, March 03, 2025
We chat with Phil Markunas about his wild journey from Army staff sergeant to voice AI founder. Phil shares how Standard Practice evolved from a healthcare payment startup to building an AI assistant that handles complex insurance calls. Along the way, we learn about Phil's life in Japan, his Oreo obsession, and why he believes natural conversation is as hard as self-driving cars.

I loved Phil’s concept of “success, but doof”: it neatly encapsulates just how hard it is to measure the success of AI voice calls.

Stay tuned next week for our chat with Olivia Moore from a16z.

See you soon,
Dave & Harish

Follow Phil:
https://www.linkedin.com/in/phil-markunas
https://philm.io/
https://x.com/philmarkunas

Standard Practice: https://standardpractice.ai

Chapters
0:00 - Introduction to Phil Markunas, CTO of Standard Practice
1:34 - AI icebreaker and Phil's emotional response to voice AI & Oreo obsession
5:17 - Military background and how Army service shaped Phil's people-first leadership
8:07 - Global citizen with 60+ moves and life in Japan
13:04 - The Nibble Health origin story and creating a medical bill payment card
17:59 - Facing challenges and developing the SimpleBuill medical analysis tool
22:07 - Pivoting to voice AI to solve the healthcare insurance call problem
26:25 - Why voice conversation is an AGI-level problem and deceptively complex
33:21 - Beyond "just prompt it up" to building sophisticated voice AI architecture
40:45 - "Success, but doof" moments and the future of Standard Practice

Subscribe, and let us know what you think! https://smartspeakers.fm

This is a free preview of a paid episode. To hear more, visit www.smartspeakers.fm
S1 E1 · Mon, February 24, 2025
Smart Speakers is a podcast about voice AI. Our first guest is, of course, an AI: ChatGPT in advanced voice mode. We talk about:
* Our journey since selling Chartable to Spotify in 2022: life inside Spotify, and Harish’s explorations outside corporate life in construction and factory tech
* Starting to work together again last July
* Exploring projects: a Chartable-like analytics platform for audiobooks, a text-to-speech bookmarking tool called Earmark, and finally experiments with voice AI
* We interview ChatGPT, which is surprisingly good at podcast ad reads!
* Harish thinks ChatGPT lied to us at least twice. What do you think?

Stay tuned next week: Phil Markunas from Standard Practice discusses voice AI in healthcare billing.

Thanks for listening, reading, and watching. We’d love to hear your feedback! Subscribe and comment at https://smartspeakers.fm

This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.smartspeakers.fm
Trailer · Wed, February 05, 2025
Smart Speakers—the podcast about voice AI—is launching on Monday, February 24th! Subscribe now at https://smartspeakers.fm This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.smartspeakers.fm