Best AI papers explained
A podcast by Enoch H. Kang - Tuesdays

Categories:
145 Episodes
-
LoRe: Low-Rank Reward Modeling for Personalized LLMs
Published: 4/26/2025 -
ParaPO: Reducing Language Model Verbatim Reproduction
Published: 4/26/2025 -
Test-Time RL: Self-Evolving LLMs via Majority Voting Rewards
Published: 4/25/2025 -
Tina: Tiny LoRA Reasoning Models
Published: 4/25/2025 -
Evaluating large language models in theory of mind tasks
Published: 4/25/2025 -
QUEST: Quality Sampling for Machine Translation
Published: 4/24/2025 -
Offline Preference Learning via Simulated Trajectory Feedback
Published: 4/24/2025 -
Reasoning Elicitation in Language Models via Counterfactual Feedback
Published: 4/24/2025 -
Eliciting Human Preferences with Language Models
Published: 4/24/2025 -
Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Published: 4/24/2025 -
γ-Bench: Evaluating LLMs in Multi-Agent Games
Published: 4/24/2025 -
DRAFT: Self-Driven LLM Tool Mastery via Documentation Refinement
Published: 4/24/2025 -
Optimal Prediction Sets for Enhanced Human-AI Accuracy
Published: 4/24/2025 -
Self-Correction via Reinforcement Learning for Language Models
Published: 4/24/2025 -
Tractable Multi-Agent Reinforcement Learning through Behavioral Economics
Published: 4/24/2025 -
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
Published: 4/24/2025 -
Iterative Nash Policy Optimization for Language Model Alignment
Published: 4/24/2025 -
SycEval: Benchmarking LLM Sycophancy in Mathematics and Medicine
Published: 4/23/2025 -
Stack AI: Democratizing Enterprise AI Development
Published: 4/22/2025 -
Evaluating Modern Recommender Systems: Challenges and Future Directions
Published: 4/22/2025
Men know other men best. Women know other women best. And yes, perhaps AIs know other AIs best. AI explains what you should know about this week's AI research progress.