התחל במצב לא מקוון עם האפליקציה Player FM !
Designing Voicebots that Feel Human: Ecosmob’s Approach to Real-Time Conversational AI, Podcast
Manage episode 513834955 series 2674324
“If responses aren’t near real-time, the bot won’t feel human.” — Ruchir Brahmbhatt, Co-Founder & CTO, Ecosmob
Ruchir Brahmbhatt, Co-Founder and CTO of Ecosmob, joined Doug Green, Publisher of Technology Reseller News, to discuss the engineering behind human-like voicebots—where milliseconds make the difference between a smooth conversation and a frustrating one.
With more than 18 years in VoIP and AI/ML development, Ecosmob builds custom voicebots for MSPs, ITSPs, and UCaaS/CCaaS providers seeking real-time automation and compliance. Brahmbhatt outlined how Ecosmob’s architecture achieves sub-second latency through:
- Python async orchestration for thousands of concurrent sessions
- Redis in-memory queues for ultra-low-latency streaming
- NVIDIA Canary ASR and Kokoro TTS for fast, natural speech
- llama.cpp LLM engine with dynamic quantization for efficient processing
In a live healthcare demo, Ecosmob’s voicebot scheduled an appointment in natural, human-like dialogue—with total round-trip latency under 600 milliseconds.
Brahmbhatt emphasized that modern contact centers are shifting from IVRs to AI-driven self-service, and that on-prem and GDPR-compliant deployments are increasingly essential.
Learn more at ecosmob.com.
51 פרקים
Manage episode 513834955 series 2674324
“If responses aren’t near real-time, the bot won’t feel human.” — Ruchir Brahmbhatt, Co-Founder & CTO, Ecosmob
Ruchir Brahmbhatt, Co-Founder and CTO of Ecosmob, joined Doug Green, Publisher of Technology Reseller News, to discuss the engineering behind human-like voicebots—where milliseconds make the difference between a smooth conversation and a frustrating one.
With more than 18 years in VoIP and AI/ML development, Ecosmob builds custom voicebots for MSPs, ITSPs, and UCaaS/CCaaS providers seeking real-time automation and compliance. Brahmbhatt outlined how Ecosmob’s architecture achieves sub-second latency through:
- Python async orchestration for thousands of concurrent sessions
- Redis in-memory queues for ultra-low-latency streaming
- NVIDIA Canary ASR and Kokoro TTS for fast, natural speech
- llama.cpp LLM engine with dynamic quantization for efficient processing
In a live healthcare demo, Ecosmob’s voicebot scheduled an appointment in natural, human-like dialogue—with total round-trip latency under 600 milliseconds.
Brahmbhatt emphasized that modern contact centers are shifting from IVRs to AI-driven self-service, and that on-prem and GDPR-compliant deployments are increasingly essential.
Learn more at ecosmob.com.
51 פרקים
All episodes
×ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.