40 subscribers
התחל במצב לא מקוון עם האפליקציה Player FM !
Scaling Large ML Models to Small Devices with Atila Orhon
Manage episode 416934750 series 2455731
The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops.
Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting larger, but the smallest models that are commercially relevant are getting smaller. The company was started in 2023 and has raised money from General Catalyst and other industry leaders.
Atila Orhon is the founder of Argmax and he previously worked at Apple and NVIDIA. He joins the show to talk about working in computer vision, building ML tooling at Apple, optimizing ML models, and more.
2102 פרקים
Manage episode 416934750 series 2455731
The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops.
Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting larger, but the smallest models that are commercially relevant are getting smaller. The company was started in 2023 and has raised money from General Catalyst and other industry leaders.
Atila Orhon is the founder of Argmax and he previously worked at Apple and NVIDIA. He joins the show to talk about working in computer vision, building ML tooling at Apple, optimizing ML models, and more.
2102 פרקים
כל הפרקים
×
1 Emulating Retro Games on Modern Consoles with Robin Lavallée and Bill Litshauer 1:01:34

1 SED News: Corporate Spies, Postgres, and the Weird Life of Devs Right Now 44:38

1 TanStack and the Future of Frontend with Tanner Linsley 55:13

1 The Challenge of AI Model Evaluations with Ankur Goyal 45:22

1 Modern Distributed Applications with Stephan Ewen 41:20


1 Chip Design in the AI Era with Thomas Andersen 50:33

1 OpenTofu with Cory O’Daniel and Malcolm Matalka 48:58

1 Mojo and Building a CUDA Replacement with Chris Lattner 56:14

1 Building PostgreSQL for the Future with Heikki Linnakangas 42:12

1 Security at Coinbase with Philip Martin 47:58

1 Anthropic and the Model Context Protocol with David Soria Parra 51:30

1 Grand Theft Auto III on the Dreamcast with Falco Girgis and Stef Kornilios Mitsis Poiitidis 47:39


1 LiveKit and OpenAI with Russ d’Sa 47:56
ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.