Let's Talk VDML Podcast Episode 8 - LLM Testing: Tools, Skill Sets and Automation Strategies

Let's Talk VDML Podcast Episode 8 - LLM Testing: Tools, Skill Sets and Automation Strategies

2.712 Lượt nghe
Let's Talk VDML Podcast Episode 8 - LLM Testing: Tools, Skill Sets and Automation Strategies
In this episode, KJR ACT General Manager Andrew Hammond and KJR Consultant Ji Yu explore the world of Large Language Model (LLM) testing. Discover the essential frameworks for evaluating LLM responses and explore various tools like DeepEval, TruLens, and others. This episode delves into the critical skill sets required for effective LLM testing and explores automated testing strategies to keep up with the ever-evolving nature of LLMs. Resources: • Langchain homepage: https://python.langchain.com/v0.1/docs/get_started/introduction/ • RAG page: https://python.langchain.com/v0.1/docs/use_cases/question_answering/ VDML (Validation Driven Machine Learning) is a methodology developed by KJR to guide the development of robust and reliable Machine Learning (ML) models. VDML is a practical method of undertaking responsible AI. • KJR: https://kjr.com.au/ • VDML: https://kjr.com.au/services/vdml/ • Contact Us: [email protected]