Training Machine Learning (ML) models on Kubernetes
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Bernie Wu, VP Strategic Partnerships and AI/CXL/Kubernetes Initiatives at Memverge. They discuss how Kubernetes is the most popular platform for running AI model training and model inference jobs. The conversation dives into model training, covering the different phases of a DAG, and then turns to how Memverge can help users with efficient and cost-effective model checkpoints. Other topics include saving costs by using spot instances, hot restarts of training jobs, and reclaiming unused GPU resources.
Check out our website at https://kubernetesbytes.com/
Episode Sponsor: Nethopper
- Learn more about KAOPS: @nethopper.io
- For a supported demo: info@nethopper.io
- Try the free version of KAOPS now! https://mynethopper.com/auth
Cloud Native News:
- https://www.aquasec.com/blog/linguistic-lumberjack-understanding-cve-2024-4323-in-fluent-bit/
- https://kubernetes.io/blog/2024/05/20/completing-cloud-provider-migration/
- https://thenewstack.io/introducing-aks-automatic-managed-kubernetes-for-developers/
- https://www.harness.io/blog/harness-to-acquire-split
Show Links:
- https://www.linkedin.com/in/berniewu/
- https://criu.org/Main_Page
- https://memverge.com/
- https://youtu.be/tY8YOMRuqWI?si=yB3hHqLUpYPZ-KWN
- https://youtu.be/ND4seSKpJHI?si=shh0iuA9qC-dO6eb
Timestamps:
All episodes
- Database as a service with Percona Everest (1:02:44)
- Increasing AI adoption using Kubernetes (52:03)
- Monolith to Microservices using Kubernetes at Guidewire (1:06:28)
- Inference in Action: Scaling AI Smarter with Inferless (55:17)
- Container security with Wiz (1:02:33)
- Dagger.io Deep Dive with Co-Founder Sam Alba (1:06:24)
- Running Ray on Kubernetes with KubeRay (53:06)
- Building scalable data platforms using Data on EKS (1:02:20)
- Deploy and fine-tune LLM models on Kubernetes using KAITO (44:17)
- The business case for cloud-native and Kubernetes (54:24)
- Building the AI Hyperscaler with Kubernetes (54:56)
- Shifting Minds: Exploring OpenShift's AI Landscape (1:05:07)
- Training Machine Learning (ML) models on Kubernetes (55:29)
- The evolution of service mesh technologies (1:08:00)
- Open Policy Agent (OPA) 101 (1:07:20)
- Ops Ops Hooray! Navigating IDPs from an Ops perspective (58:17)
- Generative AI on Kubernetes (1:15:56)
- IDPs Unveiled: Accelerating Deployment on Kubernetes (59:52)
- Running Kubernetes at the Edge using K3s (53:51)
- Running multi-tenant Kubernetes clusters using vCluster (57:58)
- Byte-sized: Exploring the Basics of AI in Plain English (1:00:18)
- Kubecon North America 2023: Highlights, Themes and Key Takeaways (57:41)
- Universal Control Planes for Kubernetes and Beyond (59:36)
- DevOpsDays Boston - Helping developers be more productive in a multi-cloud world (35:02)
- DevOpsDays Boston - Platform Engineering and Internal Developer Platforms (31:12)
- DevOpsDays Boston - Real value of community (42:41)
- How Chick-fil-A adopts GitOps and K3s at the Edge (1:19:16)
- Nodeless Kubernetes - Optimizing costs with just in time compute (1:02:20)
- Solving Multicloud with Seamless Connectivity and AI - with Rob Croteau (58:04)
- Generative AI: The New Frontier in Kubernetes Problem-Solving (1:04:58)
- From Manual to Automatic: Revolutionizing Cloud Native Stack Deployment with Argonaut (1:08:16)
- Accelerating Kubernetes Adoption: Unleashing the Power of GitOps using Kubefirst (1:00:47)
- Continuous Security: Keeping Pace in the DevOps Lifecycle w/ ARMO (1:01:44)
- Unleashing the power of KubeVirt - Running Containers and VMs on Kubernetes (1:14:39)
- Breaking Down the Diamond: A Look at MLB's Kubernetes-Powered Analytics (53:36)
- Kubecon Europe 2023: Highlights and Key Takeaways (46:35)
- Kubernetes Community Corner with Michael O'Leary: Exploring the Intersection of Learning and Collaboration (48:33)
- Kubernetes in Cloud Native Healthcare (56:31)
- What is Platform Engineering with Luca Galante (1:07:11)
- Cloud Native WebAssembly with Nigel Poulton (1:06:07)
- Kubernetes Security Posture Management with Mondoo (53:22)
- Unified application deployment platform for Kubernetes with Plural.sh (54:38)
- GitOps, DevSecOps & Kubernetes w/ GitLab (1:00:15)
- Kubernetes Alternatives - when NOT to use Kubernetes! (57:02)
- Understanding the cost of Kubernetes w/ Kubecost (54:48)
- Part 2 - Live from Kubecon North America 2022 - Interviews with Redis, Teleport, Instruqt, and Pulumi (41:28)
- Part 1 - Live from Kubecon North America 2022 - Interviews with Percona, EDB, Dell, and Akamai (41:29)
- Powering Decentralized Cloud with Kubernetes (58:23)
- Kubernetes Security 101 - 4C's of Cloud Native Security (59:44)
- Community, Opensource and Kubernetes with Brendan Burns and Ganesh Ashokavardhanan (54:35)