Building More Efficient AI With VLLM Ft. Nick Hill Technically Speaking With Chris Wright podcast

Artwork

Tech Business Red Hat Emerging Technologies Machine Learning Chris Wright Hybrid Clouds Data Development Security Linux Software Cloud Computing Enterprise Technology Artificial Intelligence Programming Coding Careers Technology

תוכן מסופק על ידי Red Hat. כל תוכן הפודקאסטים כולל פרקים, גרפיקה ותיאורי פודקאסטים מועלים ומסופקים ישירות על ידי Red Hat או שותף פלטפורמת הפודקאסט שלהם. אם אתה מאמין שמישהו משתמש ביצירה שלך המוגנת בזכויות יוצרים ללא רשותך, אתה יכול לעקוב אחר התהליך המתואר כאן https://he.player.fm/legal.

Technically Speaking with Chris Wright « »
Building more efficient AI with vLLM ft. Nick Hill

3M ago 20:52

שתפו

MP3•בית הפרקים

תוכן מסופק על ידי Red Hat. כל תוכן הפודקאסטים כולל פרקים, גרפיקה ותיאורי פודקאסטים מועלים ומסופקים ישירות על ידי Red Hat או שותף פלטפורמת הפודקאסט שלהם. אם אתה מאמין שמישהו משתמש ביצירה שלך המוגנת בזכויות יוצרים ללא רשותך, אתה יכול לעקוב אחר התהליך המתואר כאן https://he.player.fm/legal.

Explore what it takes to run massive language models efficiently with Red Hat's Senior Principal Software Engineer in AI Engineering, Nick Hill. In this episode, we go behind the headlines to uncover the systems-level engineering making AI practical, focusing on the pivotal challenge of inference optimization and the transformative power of the vLLM open-source project. Nick Hill shares his experiences working in AI including: • The evolution of AI optimization, from early handcrafted systems like IBM Watson to the complex demands of today's generative AI. • The critical role of open-source projects like vLLM in creating a common, efficient inference stack for diverse hardware platforms. • Key innovations like PagedAttention that solve GPU memory fragmentation and manage the KV cache for scalable, high-throughput performance. • How the open-source community is rapidly translating academic research into real-world, production-ready solutions for AI. Join us to explore the infrastructure and optimization strategies making large-scale AI a reality. This conversation is essential for any technologist, engineer, or leader who wants to understand the how and why of AI performance. You’ll come away with a new appreciation for the clever, systems-level work required to build a truly scalable and open AI future.

… continue reading

5 פרקים

#Tech #Business #Red Hat #Emerging Technologies #Machine Learning #Chris Wright #Hybrid Clouds #Data #Development #Security #Linux #Software #Cloud Computing #Enterprise Technology #Artificial Intelligence #Programming #Coding Careers #Technology

Artwork

Building more efficient AI with vLLM ft. Nick Hill

Technically Speaking with Chris Wright

published 3M ago

שתפו

MP3•בית הפרקים

תוכן מסופק על ידי Red Hat. כל תוכן הפודקאסטים כולל פרקים, גרפיקה ותיאורי פודקאסטים מועלים ומסופקים ישירות על ידי Red Hat או שותף פלטפורמת הפודקאסט שלהם. אם אתה מאמין שמישהו משתמש ביצירה שלך המוגנת בזכויות יוצרים ללא רשותך, אתה יכול לעקוב אחר התהליך המתואר כאן https://he.player.fm/legal.

Explore what it takes to run massive language models efficiently with Red Hat's Senior Principal Software Engineer in AI Engineering, Nick Hill. In this episode, we go behind the headlines to uncover the systems-level engineering making AI practical, focusing on the pivotal challenge of inference optimization and the transformative power of the vLLM open-source project. Nick Hill shares his experiences working in AI including: • The evolution of AI optimization, from early handcrafted systems like IBM Watson to the complex demands of today's generative AI. • The critical role of open-source projects like vLLM in creating a common, efficient inference stack for diverse hardware platforms. • Key innovations like PagedAttention that solve GPU memory fragmentation and manage the KV cache for scalable, high-throughput performance. • How the open-source community is rapidly translating academic research into real-world, production-ready solutions for AI. Join us to explore the infrastructure and optimization strategies making large-scale AI a reality. This conversation is essential for any technologist, engineer, or leader who wants to understand the how and why of AI performance. You’ll come away with a new appreciation for the clever, systems-level work required to build a truly scalable and open AI future.

… continue reading

5 פרקים

#Tech #Business #Red Hat #Emerging Technologies #Machine Learning #Chris Wright #Hybrid Clouds #Data #Development #Security #Linux #Software #Cloud Computing #Enterprise Technology #Artificial Intelligence #Programming #Coding Careers #Technology

כל הפרקים

×

ברוכים הבאים אל Player FM!

Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.

תקשיבו ל-500+ נושאים

מדריך עזר מהיר

פודקאסטים מובילים

מכביבול ברדיו תל אביב

הזווית - פודקאסט

בכל יום נתון - אוריאל דסקל וראם שרמן

עושים חשבון Osim Heshbon

טייכר וזרחוביץ

טייכר וזרחוביץ' - מערכונים

גיבור תרבות Culture Hero

האחיות גרים

עושים תוכנה Osim Tochna

השבוע - פודקאסט הארץ

המשחק הגדול

חושבים טוב

המעבדה The Lab

חזית המדע

סליחה על השאלה - ההסכת You Can't Ask That Podcast

שעת מלחמה

עבר פלילי

Lets Talk Murder | בואי נדבר רצח

Lets Talk Murder | בואי נדבר רצח

האזן לתוכנית הזו בזמן שאתה חוקר