Best AI papers explained
A podcast by Enoch H. Kang - Tuesdays

Categories:
145 Episodes
-
Driving Forces in AI: Scaling to 2025 and Beyond (Jason Wei, OpenAI)
Published: 3/29/2025 -
Expert Demonstrations for Sequential Decision Making under Heterogeneity
Published: 3/28/2025 -
TextGrad: Backpropagating Language Model Feedback for Generative AI Optimization
Published: 3/27/2025 -
MemReasoner: Generalizing Language Models on Reasoning-in-a-Haystack Tasks
Published: 3/27/2025 -
RAFT: In-Domain Retrieval-Augmented Fine-Tuning for Language Models
Published: 3/27/2025 -
Inductive Biases for Exchangeable Sequence Modeling
Published: 3/26/2025 -
InverseRLignment: LLM Alignment via Inverse Reinforcement Learning
Published: 3/26/2025 -
Prompt-OIRL: Offline Inverse RL for Query-Dependent Prompting
Published: 3/26/2025 -
Alignment from Demonstrations for Large Language Models
Published: 3/25/2025 -
Q♯: Distributional RL for Optimal LLM Post-Training
Published: 3/18/2025 -
Scaling Test-Time Compute Without Verification or RL is Suboptimal
Published: 3/14/2025 -
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Published: 3/14/2025 -
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Published: 3/14/2025 -
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Published: 3/14/2025 -
Revisiting Superficial Alignment Hypothesis
Published: 3/14/2025 -
Diagnostic uncertainty: teaching language Models to describe open-ended uncertainty
Published: 3/14/2025 -
Language Model Personalization via Reward Factorization
Published: 3/14/2025 -
Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration
Published: 3/14/2025 -
How Well do LLMs Compress Their Own Chain-of-Thought? A Token Complexity Approach
Published: 3/14/2025 -
Can Large Language Models Extract Customer Needs as well as Professional Analysts?
Published: 3/13/2025
Men know other men best. Women know other women best. And yes, perhaps AIs know other AIs best. AI explains what you should know about this week's AI research progress.