Building the Foundations of Self-Improving LLM Agents

Abstract: Large language model (LLM) agents are powerful tools for completing complex tasks, but their ability to self-improve through feedback, adaptation, and exploration remains underexplored. In this talk, I present key advances in these three directions. First, I show that by drawing an analogy between optimization and interactive learning, feedback emerges as a powerful driver of iterative improvement for LLM agents. In particular, I highlight how directional feedback enables stable and efficient performance across a wide range of optimization tasks. Second, I introduce a novel optimization framework, "Optimization with Trace Oracle" (OPTO), that leverages execution traces and rich feedback to optimize LLM agents with complex workflows, akin to how AutoDiff enables differentiable optimization. Finally, I investigate LLMs' exploration capabilities in uncertain decision-making scenarios, proposing algorithm-guided methods that enable smaller models (Gemini-1.5 Flash) to outperform larger ones (Gemini-1.5 Pro) in exploratory efficiency. Together, these insights outline a foundation for LLM agents that can learn, adapt, and explore autonomously, paving the way for the next generation of interactive AI systems.

Bio: https://anie.me/about
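
To make the first direction concrete, here is a minimal, self-contained sketch of optimization driven purely by directional feedback. The oracle, the step-halving rule, and the target value are illustrative assumptions, not the method from the talk; in the talk's setting an LLM would play the role of the proposer.

```python
# Minimal sketch: optimization from directional feedback alone.
# Hypothetical setup (not from the talk): the oracle reveals only the
# direction of improvement, never loss values or gradients. A simple
# step-halving rule stands in for the LLM proposer so the example runs.

def directional_oracle(x, target=3.7):
    """Return only the sign of the needed change, not its magnitude."""
    return "increase" if x < target else "decrease"

def optimize_with_directional_feedback(x0=0.0, step=1.0, iters=30):
    x, last_direction = x0, None
    for _ in range(iters):
        direction = directional_oracle(x)
        # A direction reversal means we overshot; halve the step so the
        # iterates settle down instead of oscillating.
        if last_direction is not None and direction != last_direction:
            step *= 0.5
        x += step if direction == "increase" else -step
        last_direction = direction
    return x

print(optimize_with_directional_feedback())  # ~3.7 after 30 iterations
```

Even this toy version shows why directional feedback can be stable: the learner never needs loss magnitudes, only a consistent signal about which way to move.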
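The abstract describes OPTO only at a high level, so the following is a hedged sketch of the general idea under stated assumptions: an execution trace is recorded as a small graph, and the (trace, feedback) pair is assembled into context for an optimizer, much as AutoDiff hands a computation graph to a gradient routine. `Node`, `backward_trace`, and `propose_update` are hypothetical names for illustration, not the actual OPTO API.

```python
# Hedged sketch of the OPTO idea: record an execution trace of an agent
# workflow, then hand (trace, feedback) to an optimizer, analogous to
# backpropagating through a computation graph. All names here are
# illustrative assumptions, not the framework's real interface.

class Node:
    def __init__(self, name, value, parents=()):
        self.name, self.value, self.parents = name, value, list(parents)

    def describe(self):
        inputs = ", ".join(p.name for p in self.parents) or "none"
        return f"{self.name} = {self.value!r} (inputs: {inputs})"

def backward_trace(output):
    """Collect the subgraph that produced `output`, leaves last."""
    seen, order = set(), []
    def visit(node):
        if id(node) in seen:
            return
        seen.add(id(node))
        order.append(node)
        for parent in node.parents:
            visit(parent)
    visit(output)
    return order

def propose_update(trace, feedback):
    """Stand-in for an LLM optimizer: in OPTO, a prompt like this would
    ask the model to revise upstream parameters given trace + feedback."""
    lines = "\n".join(n.describe() for n in trace)
    return f"Execution trace:\n{lines}\nFeedback: {feedback}\nRevise the parameters."

# A toy two-step agent workflow, recorded as a trace.
prompt = Node("prompt", "Summarize the report in one sentence.")
draft = Node("draft", "<model output>", parents=[prompt])
answer = Node("answer", "<post-processed output>", parents=[draft])

print(propose_update(backward_trace(answer), "The summary is too vague."))
```

The design point mirrors AutoDiff: once every intermediate step is recorded with its inputs, feedback at the output can be routed back to whichever upstream component should change.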
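For the third direction, one plausible reading of "algorithm-guided" exploration is that a classical bandit algorithm supplies summary statistics that are placed in the model's prompt, so the model need not track uncertainty itself. The sketch below is an assumption-laden illustration: `choose_arm_with_llm` is a stub standing in for a guided model call (e.g., to Gemini-1.5 Flash), approximated here by the UCB argmax so the example runs offline.

```python
import math, random

# Hedged sketch of algorithm-guided exploration on a 3-armed bandit.
# Assumption: a UCB-style algorithm computes per-arm statistics that a
# guided LLM would see in its prompt; the stub below approximates the
# model's guided choice with the UCB argmax to keep the example offline.

def ucb_summaries(counts, means, t, c=2.0):
    """Per-arm UCB index: running mean plus an exploration bonus."""
    return [m + math.sqrt(c * math.log(t) / n) if n > 0 else float("inf")
            for n, m in zip(counts, means)]

def choose_arm_with_llm(summaries):
    # Stub for the guided model call: in the talk's setting these
    # statistics would be serialized into the prompt of a small model.
    return max(range(len(summaries)), key=lambda a: summaries[a])

true_means = [0.2, 0.5, 0.8]          # hidden reward probabilities
counts, means = [0] * 3, [0.0] * 3
for t in range(1, 501):
    arm = choose_arm_with_llm(ucb_summaries(counts, means, t))
    reward = 1.0 if random.random() < true_means[arm] else 0.0
    counts[arm] += 1
    means[arm] += (reward - means[arm]) / counts[arm]  # running mean

print("pulls per arm:", counts)       # pulls concentrate on the best arm
```

The intuition for why a smaller guided model can beat a larger unguided one: exploration bookkeeping is delegated to the algorithm, so the model only has to follow well-formed statistics rather than reason about uncertainty from raw interaction history.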