Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

32 Lượt nghe

00:00

Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.

Tải MP3

MÔ TẢ MP3TIẾP THEO

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

The paper introduces SAGE, an evaluation framework for assessing LLMs' social cognition through simulated emotional responses, revealing significant performance gaps among models in empathetic dialogue.

https://arxiv.org/abs//2505.02847

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Nhạc Theo Chủ Đề

Liên kết website