Multi-modal RAG: Chat with Docs containing Images

Multi-modal RAG: Chat with Docs containing Images

39.234 Lượt nghe
Multi-modal RAG: Chat with Docs containing Images
Learn how to build a multimodal RAG system using CLIP mdoel. LINKS: Notebook: https://tinyurl.com/pfc64874 Flow charts in the paper: https://tinyurl.com/4pp78xuf https://tinyurl.com/5yeww5py https://tinyurl.com/4un6y6x5 https://tinyurl.com/2jkbb3ma 💻 RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Let's Connect: 🦾 Discord: https://discord.com/invite/t4eYQRUcXB ☕ Buy me a Coffee: https://ko-fi.com/promptengineering |🔴 Patreon: https://www.patreon.com/PromptEngineering 💼Consulting: https://calendly.com/engineerprompt/consulting-call 📧 Business Contact: [email protected] Become Member: http://tinyurl.com/y5h28s6h 💻 Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off). Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0 00:00 Introduction to Multimodal RAC Systems 01:24 First Approach: Unified Vector Space 02:23 Second Approach: Grounding Modalities to Text 03:57 Third Approach: Separate Vector Stores 06:26 Code Implementation: Setting Up 09:05 Code Implementation: Downloading Data 11:13 Code Implementation: Creating Vector Stores 14:00 Querying the Vector Store All Interesting Videos: Everything LangChain: https://www.youtube.com/playlist?list=PLVEEucA9MYhOu89CX8H3MBZqayTbcCTMr Everything LLM: https://youtube.com/playlist?list=PLVEEucA9MYhNF5-zeb4Iw2Nl1OKTH-Txw Everything Midjourney: https://youtube.com/playlist?list=PLVEEucA9MYhMdrdHZtFeEebl20LPkaSmw AI Image Generation: https://youtube.com/playlist?list=PLVEEucA9MYhPVgYazU5hx6emMXtargd4z