Evaluating LLM-based Applications

Evaluating LLM-based Applications

31.561 Lượt nghe
Evaluating LLM-based Applications
Evaluating LLM-based applications can feel like more of an art than a science. In this workshop, we'll give a hands-on introduction to evaluating language models. You'll come away with knowledge and tools you can use to evaluate your own applications, and answers to questions like: - Where do I get evaluation data from, anyway? - Is it possible to evaluate generative models in an automated way? - What metrics can I use? - What's the role of human evaluation? Talk by: Josh Tobin Here’s more to explore: LLM Compact Guide: https://dbricks.co/43WuQyb Big Book of MLOps: https://dbricks.co/3r0Pqiz Connect with us: Website: https://databricks.com Twitter: https://twitter.com/databricks LinkedIn: https://www.linkedin.com/company/databricks Instagram: https://www.instagram.com/databricksinc Facebook: https://www.facebook.com/databricksinc