14 subscribers
התחל במצב לא מקוון עם האפליקציה Player FM !
Voice-to-Voice Foundation Models
Manage episode 447645838 series 3370867
Alan Cowen is the cofounder and CEO of Hume, a company building voice-to-voice foundation models. They recently raised their $50M Series B from Union Square Ventures, Nat Friedman, Daniel Gross, and others.
Alan's favorite book: 1984 (Author: George Orwell)
(00:01) Introduction
(00:06) Defining Voice-to-Voice Foundation Models
(01:26) Historical Context: Handling Voice and Speech Understanding
(03:54) Emotion Detection in Voice AI Models
(04:33) Training Models to Recognize Human Emotion in Speech
(07:19) Cultural Variations in Emotional Expressions
(09:00) Semantic Space Theory in Emotion Recognition
(12:11) Limitations of Basic Emotion Categories
(15:50) Recognizing Blended Emotional States
(20:15) Objectivity in Emotion Science
(24:37) Practical Aspects of Deploying Voice AI Systems
(28:17) Real-Time System Constraints and Latency
(31:30) Advancements in Voice AI Models
(32:54) Rapid-Fire Round
--------
Where to find Prateek Joshi:
Newsletter: https://prateekjoshi.substack.com
Website: https://prateekj.com
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19
Twitter: https://twitter.com/prateekvjoshi
171 פרקים
Manage episode 447645838 series 3370867
Alan Cowen is the cofounder and CEO of Hume, a company building voice-to-voice foundation models. They recently raised their $50M Series B from Union Square Ventures, Nat Friedman, Daniel Gross, and others.
Alan's favorite book: 1984 (Author: George Orwell)
(00:01) Introduction
(00:06) Defining Voice-to-Voice Foundation Models
(01:26) Historical Context: Handling Voice and Speech Understanding
(03:54) Emotion Detection in Voice AI Models
(04:33) Training Models to Recognize Human Emotion in Speech
(07:19) Cultural Variations in Emotional Expressions
(09:00) Semantic Space Theory in Emotion Recognition
(12:11) Limitations of Basic Emotion Categories
(15:50) Recognizing Blended Emotional States
(20:15) Objectivity in Emotion Science
(24:37) Practical Aspects of Deploying Voice AI Systems
(28:17) Real-Time System Constraints and Latency
(31:30) Advancements in Voice AI Models
(32:54) Rapid-Fire Round
--------
Where to find Prateek Joshi:
Newsletter: https://prateekjoshi.substack.com
Website: https://prateekj.com
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19
Twitter: https://twitter.com/prateekvjoshi
171 פרקים
כל הפרקים
×
1 Converting Cameras into Autonomous AI Agents | Rish Gupta, CEO of Spot AI 38:50

1 Are AI Phone Agents Ready for Prime Time? | Alex Levin, CEO of Regal 45:21

1 What it Takes to Build a BI Platform | Colin Zima, CEO of Omni 40:07

1 Building Billing Infrastructure for AI Companies | Alvaro Morales, CEO of Orb 38:21

1 Turning Legal Services to APIs | Jay Madheswaran, CEO of Eve 41:02

1 Is LLM the New Operating System? | Anant Bhardwaj, CEO of Instabase 45:37

1 Building AI Agents That Actually Work | Malte Kosub, CEO of Parloa 33:54

1 3000 Customers, One Bold Pivot: Building the First Generative AI Copilot for Lawyers | Scott Stevenson, CEO of Spellbook 44:07

1 The Outer Loop of AI-Powered Coding | Merrill Lutsky, CEO of Graphite 41:26

1 Behind the Scenes of AI Video | Amit Jain, founder of Luma AI 48:19

1 Building an AI-Powered Terminal | Zach Lloyd 38:06

1 When Robots Go Haywire, Who Picks Up The Tab? | Amias Gerety 48:54

1 Building MotherDuck to a $400M Company 49:18

1 AI Agents Have Brains, But Where Are Their Wallets? 47:27


1 Building Autonomous Greenhouses with AI and Robotics 37:45

1 Developing Battery Materials with AI 33:27

1 Voice-to-Voice Foundation Models 39:08

1 Digital Replicas That Can Have Real Conversations 37:40




1 Breaking New Ground With Collaborative Robots 49:22


1 How to extract intelligence from speech data with AI 44:56


1 The Long Tail of AI: Understanding and Resolving Edge Cases 37:53

1 How Symbolic AI is Transforming Critical Infrastructure 38:08

1 AI Disruption: Startups vs Incumbents in the Tech Stack 46:57

1 Unpacking AI Startups: Metrics, Playbooks, and the Future 33:09
ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.