ChatArena with Yuxiang Wu - Weaviate Podcast #47!

Weaviate Podcast - A podcast by Weaviate

Categories:

Hey everyone, thank you so much for watching the Weaviate podcast! I am so excited about this episode! ChatArena is a software framework for multi-agent chat games. There are quite a few interesting applications of this, firstly we can use this kind of system to evaluate the intelligence of an LLM based on how intelligent it sounds in conversation with another LLM! Another interesting idea is to have the LLM impersonate people such as Lex Fridman or Sam Altman and simulate conversations between these people -- retrieving from their digital content to facilitate the impersonation. I thought there was so many interesting ideas in this podcast, please let us know what you think! Links: ChatArena on GitHub (please give it a star!) - https://github.com/chatarena/chatarena Twitter thread from Yuxiang describing the launch of ChatArena - https://twitter.com/YuxiangJWu/status/1643633046208249856 Chapters 0:00 Welcome Yuxiang! 0:38 What is ChatArena? 2:38 Impersonating People with LLMs 4:58 Weaviate and ChatArena 8:14 Generative Feedback Loops 11:10 Chat Games 16:30 Scientific Peer Review Discussions 20:05 Code Repos and Multi-Agent LLMs 23:05 Scaling Multi-Agent LLMs 25:16 Role Evolution in Startups 26:00 Evolution of Multi-Agent RL Research 29:22 AlphaGo and MCTS Text Generation 36:55 Hallucination in Role Maintenance 41:15 Evaluating LLMs with ChatArena 45:40 ChatGPT Marketplace and Tool Use 50:30 Upcoming work from Yuxiang and ChatArena!