התחל במצב לא מקוון עם האפליקציה Player FM !
פודקאסטים ששווה להאזין
בחסות


Beyond GPUs: Cerebras’ Wafer-Scale Engine for Lightning-Fast AI Inference
Manage episode 469982662 series 2570898
Hagay Lupesko is the SVP for AI Inference at Cerebras Systems.
Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/
Support our work by leaving a small tip 💰 https://buymeacoffee.com/gradientflow
Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.
Detailed show notes - with links to many references - can be found on The Data Exchange web site.
282 פרקים
Manage episode 469982662 series 2570898
Hagay Lupesko is the SVP for AI Inference at Cerebras Systems.
Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/
Support our work by leaving a small tip 💰 https://buymeacoffee.com/gradientflow
Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.
Detailed show notes - with links to many references - can be found on The Data Exchange web site.
282 פרקים
כל הפרקים
×
1 Beyond the Demo: Building AI Systems That Actually Work 27:36

1 Vibe Coding and the Rise of AI Agents: The Future of Software Development is Here 36:35

1 2025 Artificial Intelligence Index 51:44

1 How AI is Transforming Talent Development 28:30

1 Prompts as Functions: The BAML Revolution in AI Engineering 38:49

1 Building the Operating System for AI Agents 45:42

1 Bridging the AI Agent Prototype-to-Production Chasm 38:40

1 The Evolution of Reinforcement Fine-Tuning in AI 45:45

1 Beyond GPUs: Cerebras’ Wafer-Scale Engine for Lightning-Fast AI Inference 39:26

1 The Future of AI: Regulation, Foundation Models & User Experience 47:42

1 The AI Agent Rundown: 10 Things to Know Now 24:01

1 Why ‘Structure’ Is All You Need: A Deep Dive into Next-Gen AI Retrieval 32:27

1 Why Legal Hurdles Are the Biggest Barrier to AI Adoption 38:57

1 Unlocking Spreadsheet Intelligence with AI 47:54

1 Monthly Roundup: Deregulation, Hardware, and Inference Scaling 45:41

1 What AI Teams Need to Know for 2025 32:26

1 AI Unlocked: The Data Bottleneck 25:11

1 The Data-Centric Shift in AI: Challenges, Opportunities, and Tools 27:43

1 Monthly Roundup: Semiconductors, Frontier Models, and Practical Innovations 48:51

1 Breaking the Cloud Barrier: How DBOS Transforms Application Development 36:02

1 The Essential Guide to AI Guardrails 47:27

1 Beyond ETL: How Snow Leopard Connects AI, Agents, and Live Data 43:33

1 2024 Generative AI in Healthcare Survey Results 37:35

1 Monthly Roundup: BAML, Tencent’s Hunyuan Model, AI & Kubernetes, and the Future of Voice AI 49:22

1 Building the Future of Finance: Inside AI Valuation Bots 37:45

1 Unleashing the Power of BAML in LLM Applications 45:45

1 Cracking the Code: How Enterprises Are Adopting Generative AI 30:09

1 Monthly Roundup: Ray Compiled Graphs, Llama 3.2 and Multimodal AI, and Structured Data for RAG 52:57

1 Reimagining Code: The AI-Driven Transformation of Programming and Data Analytics 41:05

1 The Security Debate: How Safe is Open-Source Software? 51:06

1 Monthly Roundup: SB 1047, GraphRAG, and AI Avatars in the Workplace 36:59

1 Fine-tuning and Preference Alignment in a Single Streamlined Process 35:32

1 TinyML, Sensor-Driven AI, and Advances in Large Language Models 25:23

1 Machine Unlearning: Techniques, Challenges, and Future Directions 49:36

1 Unleashing the Power of AI Agents 38:47

1 Monthly Roundup: Llama 3, Agents, Evaluation Metrics, Cyc, TikTok, and more 41:58

1 LLMs for Data Access: Unlocking Insights with Text-to-SQL 43:22

1 2024 Artificial Intelligence Index 53:50

1 DBRX and the Future of Open LLMs 45:45

1 Monthly Roundup: New LLMs, GTC 2024, Constraint-Driven Innovation, Model Safety, and GraphRAG 37:01

1 Automating Software Upgrades: How to Combine AI and Expert Developers 36:27

1 Generative AI in the Industrial Sphere 44:04

1 The Intersection of LLMs, Knowledge Graphs, and Query Generation 57:45

1 Unlocking the Potential of Private Data Collaboration 36:14

1 Frontiers of AI: From Text-to-Video Models to Knowledge Graphs 33:35

1 Generative AI in Voice Technology 59:41

1 Building An Experiment Tracker for Foundation Model Training 37:56

1 Monthly Roundup: AI Regulations, GenAI for Analysts, Inference Services, and Military Applications 45:49

1 Unlocking the Power of LLMs with Data Prep Kit 38:15

1 Advancing AI: Scaling, Data, Agents, Testing, and Ethical Considerations 24:37

1 Bridging the Hardware-Software Divide in AI 48:27

1 Monthly Roundup: The Economic Realities of Large Language Models 43:31

1 From Hype to Reality: The Current State of Enterprise Generative AI Adoption 44:53

1 Automating Unstructured Data Extraction with LLMs 35:29

1 Generative AI in Context: Hybrid Intelligence and Responsible Development 36:04

1 Monthly Roundup: Navigating the Peaks and Valleys of Generative AI Technology 46:18

1 From Preparation to Recovery: Mastering AI Incident Response 34:38

1 Unlocking the Power of Unstructured Data 49:32

1 Postgres: The Swiss Army Knife of Databases 50:50


1 Adaptive, Specialized, and Accessible: Where AI Systems Are Heading Next 43:01


1 The AI Infrastructure Revolution: From Cloud Computing to Data Center Design 42:44

1 AI in Depth: Transforming Transportation, Enterprise, and Policy 41:12

1 Software Meets Hardware: Enabling AMD for Large Language Models 38:37

1 Incentives are Superpowers: Mastering Motivation in the AI Era 31:09

1 Synthetic Futures: The Convergence of Biology and AI 32:22

1 AI Co-Pilots in Action: Transforming Function Calling in Cybersecurity 45:00

1 Leveling Up: Tools and Techniques to Make AI Development More Accessible 45:16


1 Democratizing Wealth Management With AI 47:27

1 Knowledge Graphs: Contextualizing Enterprise Data for More Accurate LLMs 41:36

1 TimeGPT: Machine Learning for Time Series, Made Accessible 44:07

1 Best Practices for Building LLM-Backed Applications 53:50

1 The Evolution of Crypto, Blockchain, and Web3 49:15
ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.