Ensuring Privacy for Any LLM with Patricia Thaine - 716

Ensuring Privacy for Any LLM with Patricia Thaine - 716

479 Lượt nghe
Ensuring Privacy for Any LLM with Patricia Thaine - 716
Today, we're joined by Patricia Thaine, co-founder and CEO of Private AI to discuss techniques for ensuring privacy, data minimization, and compliance when using 3rd-party large language models (LLMs) and other AI services. We explore the risks of data leakage from LLMs and embeddings, the complexities of identifying and redacting personal information across various data flows, and the approach Private AI has taken to mitigate these risks. We also dig into the challenges of entity recognition in multimodal systems including OCR files, documents, images, and audio, and the importance of data quality and model accuracy. Additionally, Patricia shares insights on the limitations of data anonymization, the benefits of balancing real-world and synthetic data in model training and development, and the relationship between privacy and bias in AI. Finally, we touch on the evolving landscape of AI regulations like GDPR, CPRA, and the EU AI Act, and the future of privacy in artificial intelligence. 🎧 / 🎥 Listen or watch the full episode on our page: https://twimlai.com/go/716. 🔔 Subscribe to our channel for more great content just like this: https://youtube.com/twimlai?sub_confirmation=1 🗣️ CONNECT WITH US! =============================== Subscribe to the TWIML AI Podcast: https://twimlai.com/podcast/twimlai/ Follow us on Twitter: https://twitter.com/twimlai Follow us on LinkedIn: https://www.linkedin.com/company/twimlai/ Join our Slack Community: https://twimlai.com/community/ Subscribe to our newsletter: https://twimlai.com/newsletter/ Want to get in touch? Send us a message: https://twimlai.com/contact/ 📖 CHAPTERS =============================== 00:00 - Introduction 2:06 - Private AI 3:32 - Data flows 13:13 - User requirements 14:06 - Entity recognition 18:39 - Warranties 19:43 - Generalized model 20:42 - Multilingual challenges 22:13 - OCR 25:31 - Synthetic data 28:06 - Multimodality 37:44 - Managing system and product scope 39:30 - Data catalogs and MDM systems 41:43 - The connection between privacy and ethical responsible AI 45:23 - AI regulations 48:07 - Future directions 🔗 LINKS & RESOURCES =============================== Private AI - https://private-ai.com/en/redact/ Forum Ventures - https://hubs.ly/Q0346ccs0 📸 Camera: https://amzn.to/3TQ3zsg 🎙️Microphone: https://amzn.to/3t5zXeV 🚦Lights: https://amzn.to/3TQlX49 🎛️ Audio Interface: https://amzn.to/3TVFAIq 🎚️ Stream Deck: https://amzn.to/3zzm7F5