421 Episodes

  1. Agent Lightning: Training Any AI Agents with Reinforcement Learning

    Published: 8/14/2025
  2. Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier

    Published: 8/14/2025
  3. From Model Weights to Agent Workflows: Charting the New Frontier of Optimization in Large Language Models

    Published: 8/12/2025
  4. Is Chain-of-Thought Reasoning a Mirage?

    Published: 8/12/2025
  5. Agentic Web: Weaving the Next Web with AI Agents

    Published: 8/11/2025
  6. The Assimilation-Accommodation Gap in LLM Intelligence

    Published: 8/10/2025
  7. The Minimalist AI Kernel: A New Frontier in Reasoning

    Published: 8/6/2025
  8. Statistical Rigor for Interpretable AI

    Published: 8/6/2025
  9. Full-Stack Alignment: Co-Aligning AI and Institutions with Thick Models of Value

    Published: 8/4/2025
  10. A foundation model to predict and capture human cognition

    Published: 8/4/2025
  11. Generative Recommendation with Semantic IDs: A Practitioner’s Handbook

    Published: 8/4/2025
  12. Hierarchical Reasoning Model

    Published: 8/4/2025
  13. Test-time Offline Reinforcement Learning on Goal-related Experience

    Published: 8/4/2025
  14. Interpreting Chain of Thought: A Walkthrough and Discussion

    Published: 8/4/2025
  15. The wall confronting large language models

    Published: 8/4/2025
  16. COLLABLLM: LLMs From Passive to Collaborative

    Published: 7/31/2025
  17. A decade's battle on dataset bias: are we there yet?

    Published: 7/29/2025
  18. GEPA: Generative Feedback for AI System Optimization

    Published: 7/29/2025
  19. From AI-Curious to AI-First: Engineering Production AI Systems

    Published: 7/28/2025
  20. Context Engineering: Beyond Simple Prompting to LLM Architecture

    Published: 7/28/2025

1 / 22

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.