Machine Learning Street Talk (MLST)

Weekly
 
Welcome! We engage in fascinating discussions with pre-eminent figures in the AI field. Our flagship show covers current affairs in AI, cognitive science, neuroscience and philosophy of mind with in-depth analysis. Our approach is unrivalled in terms of scope and rigour – we believe in intellectual diversity in AI, and we touch on all of the main ideas in the field with the hype surgically removed. MLST is run by Tim Scarfe, PhD (https://www.linkedin.com/in/ecsquizor/) and features regular ...
 
Neel Nanda, a senior research scientist at Google DeepMind, leads their mechanistic interpretability team. In this extensive interview, he discusses his work trying to understand how neural networks function internally. At just 25 years old, Nanda has quickly become a prominent voice in AI research after completing his pure mathematics degree at Ca…
 
Jonas Hübotter, PhD student at ETH Zurich's Institute for Machine Learning, discusses his groundbreaking research on test-time computation and local learning. He demonstrates how smaller models can outperform larger ones by 30x through strategic test-time computation and introduces a novel paradigm combining inductive and transductive learning appr…
 
Professor Swarat Chaudhuri from the University of Texas at Austin and visiting researcher at Google DeepMind discusses breakthroughs in AI reasoning, theorem proving, and mathematical discovery. Chaudhuri explains his groundbreaking work on COPRA (a GPT-based prover agent) and shares insights on neurosymbolic approaches to AI. Professor Swarat Chaudhu…
 
Nora Belrose, Head of Interpretability Research at EleutherAI, discusses critical challenges in AI safety and development. The conversation begins with her technical work on concept erasure in neural networks through LEACE (LEAst-squares Concept Erasure), while highlighting how neural networks' progression from simple to complex learning patterns c…
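For readers unfamiliar with concept erasure, here is a minimal sketch of the underlying idea: removing the linear direction that best separates a binary concept from a set of hidden representations. This is a deliberately simplified mean-difference projection, not the least-squares-optimal affine eraser that LEACE itself fits; the arrays and the binary label below are illustrative assumptions.

```python
import numpy as np

def erase_mean_difference(X: np.ndarray, z: np.ndarray) -> np.ndarray:
    """Crude linear concept erasure: project hidden states X (n, d) onto the
    subspace orthogonal to the difference of class means for a binary label z (n,).
    A simplified stand-in for LEACE, which fits the least-squares-optimal affine eraser."""
    v = X[z == 1].mean(axis=0) - X[z == 0].mean(axis=0)
    v /= np.linalg.norm(v)                 # unit vector along the concept direction
    return X - np.outer(X @ v, v)          # remove the component along v

# Toy usage with random data standing in for model activations.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 64))
z = rng.integers(0, 2, size=1000)
X_erased = erase_mean_difference(X, z)
```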
 
Prof. Gennady Pekhimenko (CEO of CentML, UofT) joins us in this *sponsored episode* to dive deep into AI system optimization and enterprise implementation. From NVIDIA's technical leadership model to the rise of open-source AI, Pekhimenko shares insights on bridging the gap between academic research and industrial applications. Learn about "dark si…
 
Eliezer Yudkowsky and Stephen Wolfram discuss artificial intelligence and its potential existential risks. They traverse fundamental questions about AI safety, consciousness, computational irreducibility, and the nature of intelligence. The discourse centers on Yudkowsky’s argument that advanced AI systems pose an existential threat to humanity…
 
Francois Chollet, a prominent AI expert and creator of ARC-AGI, discusses intelligence, consciousness, and artificial intelligence. Chollet explains that real intelligence isn't about memorizing information or having lots of knowledge - it's about being able to handle new situations effectively. This is why he believes current large language models…
 
Anil Ananthaswamy is an award-winning science writer and former staff writer and deputy news editor for the London-based New Scientist magazine. Machine learning systems are making life-altering decisions for us: approving mortgage loans, determining whether a tumor is cancerous, or deciding if someone gets bail. They now influence developments and…
 
Professor Michael Levin explores the revolutionary concept of diverse intelligence, demonstrating how cognitive capabilities extend far beyond traditional brain-based intelligence. Drawing from his groundbreaking research, he explains how even simple biological systems like gene regulatory networks exhibit learning, memory, and problem-solving abil…
 
Will Williams is CTO of Speechmatics in Cambridge. In this sponsored episode, he shares deep technical insights into modern speech recognition technology and system architecture. The episode covers several key technical areas: * Speechmatics' hybrid approach to ASR, which focusses on unsupervised learning methods, achieving comparable results with…
 
Dr. Sanjeev Namjoshi, a machine learning engineer who recently submitted a book on Active Inference to MIT Press, discusses the theoretical foundations and practical applications of Active Inference, the Free Energy Principle (FEP), and Bayesian mechanics. He explains how these frameworks describe how biological and artificial systems maintain stab…
 
Dr. Joscha Bach discusses advanced AI, consciousness, and cognitive modeling. He presents consciousness as a virtual property emerging from self-organizing software patterns, challenging panpsychism and materialism. Bach introduces "Cyberanima," reinterpreting animism through information processing, viewing spirits as self-organizing software agent…
 
Alessandro Palmarini is a post-baccalaureate researcher at the Santa Fe Institute working under the supervision of Melanie Mitchell. He completed his undergraduate degree in Artificial Intelligence and Computer Science at the University of Edinburgh. Palmarini's current research focuses on developing AI systems that can efficiently acquire new skil…
 
François Chollet discusses the limitations of Large Language Models (LLMs) and proposes a new approach to advancing artificial intelligence. He argues that current AI systems excel at pattern recognition but struggle with logical reasoning and true generalization. This was Chollet's keynote talk at AGI-24, filmed in high quality. We will be releasi…
 
Ivan Zhang, co-founder of Cohere, discusses the company's enterprise-focused AI solutions. He explains Cohere's early emphasis on embedding technology and training models for secure environments. Zhang highlights their implementation of Retrieval-Augmented Generation in healthcare, significantly reducing doctor preparation time. He explores the shi…
 
Prof. Tim Rocktäschel, AI researcher at UCL and Google DeepMind, talks about open-ended AI systems. These systems aim to keep learning and improving on their own, like evolution does in nature. Ad: Are you a hardcore ML engineer who wants to work for Daniel Cahn at SlingshotAI building AI for mental health? Give him an email! - danielc@slingshot.xy…
 
Ben Goertzel discusses AGI development, transhumanism, and the potential societal impacts of superintelligent AI. He predicts human-level AGI by 2029 and argues that the transition to superintelligence could happen within a few years after. Goertzel explores the challenges of AI regulation, the limitations of current language models, and the need f…
 
AI expert Prof. Gary Marcus doesn't mince words about today's artificial intelligence. He argues that despite the buzz, chatbots like ChatGPT aren't as smart as they seem and could cause real problems if we're not careful. Marcus is worried about tech companies putting profits before people. He thinks AI could make fake news and privacy issues even…
 
Prof. Mark Solms, a neuroscientist and psychoanalyst, discusses his groundbreaking work on consciousness, challenging conventional cortex-centric views and emphasizing the role of brainstem structures in generating consciousness and affect. MLST is sponsored by Brave: The Brave Search API covers over 20 billion webpages, built from scratch without …
 
Dr. Patrick Lewis, who coined the term RAG (Retrieval Augmented Generation) and now works at Cohere, discusses the evolution of language models, RAG systems, and challenges in AI evaluation. MLST is sponsored by Brave: The Brave Search API covers over 20 billion webpages, built from scratch without Big Tech biases or the recent extortionate price h…
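Since RAG comes up repeatedly in these episodes, a minimal sketch of the pattern may help: retrieve the most relevant documents by embedding similarity, then condition the model's answer on them. Here `embed` and `generate` are placeholder callables standing in for an embedding model and an LLM, not any specific Cohere API.

```python
import numpy as np

def retrieve(query_vec, doc_vecs, docs, k=3):
    """Return the k documents whose embeddings are most similar to the query vector."""
    sims = doc_vecs @ query_vec / (
        np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(query_vec) + 1e-9)
    return [docs[i] for i in np.argsort(-sims)[:k]]

def rag_answer(question, docs, doc_vecs, embed, generate, k=3):
    """Retrieval-augmented generation: retrieve relevant text, then ground the model on it."""
    context = "\n".join(retrieve(embed(question), doc_vecs, docs, k))
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
    return generate(prompt)
```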
 
Ashley Edwards, who was working at DeepMind when she co-authored the Genie paper and is now at Runway, covered several key aspects of the Genie AI system and its applications in video generation, robotics, and game creation. MLST is sponsored by Brave: The Brave Search API covers over 20 billion webpages, built from scratch without Big Tech biases …
 
Saurabh Baji discusses Cohere's approach to developing and deploying large language models (LLMs) for enterprise use. * Cohere focuses on pragmatic, efficient models tailored for business applications rather than pursuing the largest possible models. * They offer flexible deployment options, from cloud services to on-premises installations, to meet…
 
David Hanson, CEO of Hanson Robotics and creator of the humanoid robot Sophia, explores the intersection of artificial intelligence, ethics, and human potential. In this thought-provoking interview, Hanson discusses his vision for developing AI systems that embody the best aspects of humanity while pushing beyond our current limitations, aiming to a…
 
David Spivak, a mathematician known for his work in category theory, discusses a wide range of topics related to intelligence, creativity, and the nature of knowledge. He explains category theory in simple terms and explores how it relates to understanding complex systems and relationships. MLST is sponsored by Brave: The Brave Search API covers ov…
 
Jürgen Schmidhuber, the father of generative AI, shares his groundbreaking work in deep learning and artificial intelligence. In this exclusive interview, he discusses the history of AI, some of his contributions to the field, and his vision for the future of intelligent machines. Schmidhuber offers unique insights into the exponential growth of tec…
 
Professor Pedro Domingos is an AI researcher and professor of computer science. He expresses skepticism about current AI regulation efforts and argues for faster AI development rather than slowing it down. He also discusses the need for new innovations to fulfil the promises of current AI techniques. MLST is sponsored by Brave: The Brave Search AP…
 
Andrew Ilyas is a PhD student at MIT who is about to start as a professor at CMU. We discuss data modeling and understanding how datasets influence model predictions, adversarial examples in machine learning and why they occur, robustness in machine learning models, black box attacks on machine learning systems, biases in data collection and dataset …
 
Dr. Joscha Bach introduces a surprising idea called "cyber animism" in his AGI-24 talk - the notion that nature might be full of self-organizing software agents, similar to the spirits in ancient belief systems. Bach suggests that consciousness could be a kind of software running on our brains, and wonders if similar "programs" might exist in plant…
 
Prof Gary Marcus revisited his keynote from AGI-21, noting that many of the issues he highlighted then are still relevant today despite significant advances in AI. MLST is sponsored by Brave: The Brave Search API covers over 20 billion webpages, built from scratch without Big Tech biases or the recent extortionate price hikes on search API access. …
 
DeepMind Research Scientist / MIT scholar Dr. Timothy Nguyen discusses his recent paper on understanding transformers through n-gram statistics. Nguyen explains his approach to analyzing transformer behavior using a kind of "template matching" (N-grams), providing insights into how these models process and predict language. MLST is sponsored by Bra…
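As a loose illustration of what n-gram "template matching" means here (this is not Nguyen's exact method, and the toy corpus is an assumption), the snippet below tabulates bigram continuation counts; a transformer's next-token distribution on the same prefix could then be compared against these statistics.

```python
from collections import Counter, defaultdict

def bigram_stats(tokens):
    """For each one-token context, count how often each next token follows it."""
    table = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        table[prev][nxt] += 1
    return table

corpus = "the cat sat on the mat and the cat slept".split()
stats = bigram_stats(corpus)
print(stats["the"].most_common(1))   # [('cat', 2)]: "cat" follows "the" in 2 of 3 cases
```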
 
Jay Alammar, renowned AI educator and researcher at Cohere, discusses the latest developments in large language models (LLMs) and their applications in industry. Jay shares his expertise on retrieval augmented generation (RAG), semantic search, and the future of AI architectures. MLST is sponsored by Brave: The Brave Search API covers over 20 billi…
 
Daniel Cahn, co-founder of Slingshot AI, on the potential of AI in therapy. Why are anxiety and depression affecting a large population? To what extent are these real categories? Why is mental health getting worse? How often do you want an AI to agree with you? What are the ethics of persuasive AI? You will discover all in this conversation. MLS…
 
Prof. Subbarao Kambhampati argues that while LLMs are impressive and useful tools, especially for creative tasks, they have fundamental limitations in logical reasoning and cannot provide guarantees about the correctness of their outputs. He advocates for hybrid approaches that combine LLMs with external verification systems. MLST is sponsored by B…
 
How seriously should governments take the threat of existential risk from AI, given the lack of consensus among researchers? On the one hand, existential risks (x-risks) are necessarily somewhat speculative: by the time there is concrete evidence, it may be too late. On the other hand, governments must prioritize — after all, they don’t worry too m…
 
Sara Hooker is VP of Research at Cohere and leader of Cohere for AI. We discuss her recent paper critiquing the use of compute thresholds, measured in FLOPs (floating point operations), as an AI governance strategy. We explore why this approach, recently adopted in both US and EU AI policies, may be problematic and oversimplified. Sara explains the…
 
Murray Shanahan is a professor of Cognitive Robotics at Imperial College London and a senior research scientist at DeepMind. He challenges our assumptions about AI consciousness and urges us to rethink how we talk about machine intelligence. We explore the dangers of anthropomorphizing AI, the limitations of current language in describing AI capabi…
 
In the coming decades, the technology that enables virtual and augmented reality will improve beyond recognition. Within a century, world-renowned philosopher David J. Chalmers predicts, we will have virtual worlds that are impossible to distinguish from non-virtual worlds. But is virtual reality just escapism? In a highly original work of 'technop…
 
Ryan Greenblatt from Redwood Research recently published "Getting 50% on ARC-AGI with GPT-4o," where he used GPT-4o to reach state-of-the-art accuracy on Francois Chollet's ARC Challenge by generating many Python programs. Sponsor: Sign up to Kalshi here https://kalshi.onelink.me/1r91/mlst -- the first 500 traders who deposit $100 will get a free…
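The overall shape of the approach described in that post (sample many candidate Python programs, keep those that reproduce every training pair, apply a survivor to the test input) can be sketched roughly as below; `sample_program` is a placeholder for the LLM call, and the whole function is an illustrative assumption rather than Greenblatt's code.

```python
def solve_arc_task(train_pairs, test_input, sample_program, n_samples=1000):
    """Generate-and-check sketch: `sample_program` (a stand-in for an LLM) returns a
    candidate function grid -> grid; keep the first one consistent with all training
    pairs and run it on the test input."""
    for _ in range(n_samples):
        candidate = sample_program(train_pairs)
        try:
            if all(candidate(inp) == out for inp, out in train_pairs):
                return candidate(test_input)       # first consistent program wins
        except Exception:
            continue                               # ill-formed candidates are skipped
    return None                                    # no consistent program found
```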
 
Aidan Gomez, CEO of Cohere, reveals how they're tackling AI hallucinations and improving reasoning abilities. He also explains why Cohere doesn't use any output from GPT-4 for training their models. Aidan shares his personal insights into the world of AI and LLMs and Cohere's unique approach to solving real-world business problems, and how their mo…
 
The ARC Challenge, created by Francois Chollet, tests how well AI systems can generalize from a few examples in a grid-based intelligence test. We interview the current winners of the ARC Challenge—Jack Cole, Mohammed Osman and their collaborator Michael Hodel. They discuss how they tackled ARC (Abstraction and Reasoning Corpus) using language mode…
 
Nick Frosst, co-founder of Cohere, on the future of LLMs and AGI. Learn how Cohere is solving real problems for business with their new AI models. This is the first podcast from our new Cohere partnership! Nick talks about his journey at Google Brain, working with AI legends like Geoff Hinton, and the amazing things his company, Cohere, is doing. …
 
These two scientists have mapped out the insides, or “reachable space”, of a language model using control theory; what they discovered was extremely surprising. Please support us on Patreon to get access to the private Discord server, bi-weekly calls, early access and ad-free listening. https://patreon.com/mlst YT version: https://youtu.be/Bpgloy1dDn…
 
Maria Santacaterina, with her background in the humanities, brings a critical perspective on the current state and future implications of AI technology, its impact on society, and the nature of human intelligence and creativity. She emphasizes that despite technological advancements, AI lacks fundamental human traits such as consciousness, empathy,…
 
Thomas Parr and his collaborators wrote a book titled "Active Inference: The Free Energy Principle in Mind, Brain and Behavior", which introduces Active Inference from both a high-level conceptual perspective and a low-level mechanistic, mathematical perspective. Active inference, developed by the legendary neuroscientist Prof. Karl Friston, is a u…
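For reference, the variational free energy at the heart of this framework is usually written as follows (standard textbook notation with hidden states s and observations o; this is the generic definition rather than anything specific to the book):

```latex
F[q] = \mathbb{E}_{q(s)}\big[\ln q(s) - \ln p(o, s)\big]
     = D_{\mathrm{KL}}\big[q(s)\,\Vert\,p(s \mid o)\big] - \ln p(o)
```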
 
Connor is the CEO of Conjecture and one of the most famous names in the AI alignment movement. This is the "behind the scenes footage" and bonus Patreon interviews from the day of the Beff Jezos debate, including an interview with Daniel Clothiaux. It's a great insight into Connor's philosophy. At the end there is an unreleased additional interview…
 
Professor Chris Bishop is a Technical Fellow and Director at Microsoft Research AI4Science, in Cambridge. He is also Honorary Professor of Computer Science at the University of Edinburgh, and a Fellow of Darwin College, Cambridge. In 2004, he was elected Fellow of the Royal Academy of Engineering, in 2007 he was elected Fellow of the Royal Society …
 
Dr. Philip Ball is a freelance science writer. He just wrote a book called "How Life Works", discussing how the science of biology has advanced in the last 20 years. We focus on the concept of agency in particular. He trained as a chemist at the University of Oxford, and as a physicist at the University of Bristol. He worked previously at Natur…
 
Dr. Paul Lessard and his collaborators have written a paper on "Categorical Deep Learning and Algebraic Theory of Architectures". They aim to make neural networks more interpretable, composable and amenable to formal reasoning. The key is mathematical abstraction, as exemplified by category theory - using monads to develop a more principled, algebr…
 
Dr. Minqi Jiang and Dr. Marc Rigter explain an innovative new method to make the intelligence of agents more general-purpose by training them to learn many worlds before their usual goal-directed training, which we call "reinforcement learning". Their new paper is called "Reward-free curricula for training robust world models" https://arxiv.org/pdf…
 
Nick Chater is Professor of Behavioural Science at Warwick Business School, who works on rationality and language using a range of theoretical and experimental approaches. We discuss his books The Mind is Flat, and the Language Game. Please support me on Patreon (this is now my main job!) - https://patreon.com/mlst - Access the private Discord, net…