Let's Talk VDML Podcast Episode 8 - LLM Testing: Tools, Skill Sets and Automation Strategies

2.712 Lượt nghe

00:00

Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.

Tải MP3

MÔ TẢ MP3TIẾP THEO

Let's Talk VDML Podcast Episode 8 - LLM Testing: Tools, Skill Sets and Automation Strategies

In this episode, KJR ACT General Manager Andrew Hammond and KJR Consultant Ji Yu explore the world of Large Language Model (LLM) testing. 

Discover the essential frameworks for evaluating LLM responses and explore various tools like DeepEval, TruLens, and others. This episode delves into the critical skill sets required for effective LLM testing and explores automated testing strategies to keep up with the ever-evolving nature of LLMs. 

Resources: 
• Langchain homepage: https://python.langchain.com/v0.1/docs/get_started/introduction/
• RAG page: https://python.langchain.com/v0.1/docs/use_cases/question_answering/

VDML (Validation Driven Machine Learning) is a methodology developed by KJR to guide the development of robust and reliable Machine Learning (ML) models. VDML is a practical method of undertaking responsible AI.

• KJR: https://kjr.com.au/
• VDML: https://kjr.com.au/services/vdml/
• Contact Us: [email protected]					

Let's Talk VDML Podcast Episode 8 - LLM Testing: Tools, Skill Sets and Automation Strategies

Nhạc Theo Chủ Đề

Liên kết website