Timestamps:
00:00 - Intro
00:46 - Bug Report
01:54 - Jetson Container Setup
04:12 - RAG Container Setup
06:58 - Ollama Issue
08:30 - Ollama Fix
12:16 - RAG Container
13:27 - Compatibility Issue
14:27 - Compatibility Fix
20:35 - RAG Document Setup
22:12 - Ollama Manual Pull
23:44 - RAG Container Start
26:16 - RAG Data Tweak
29:54 - RAG Demo
34:50 - RAG Settings
36:23 - RAG vs No RAG
37:39 - Closing Thoughts
In this video, we set up a Retrieval-Augmented Generation (RAG) workflow on the NVIDIA Jetson Orin Nano Super, using Ollama, LlamaIndex, and a Streamlit web app inside a Jetson container for fully local RAG processing.
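For readers who want the core idea before watching: RAG retrieves the most relevant chunks of a document and feeds them to the model alongside the question. The sketch below shows that retrieve-then-augment flow in plain Python. It is a simplified stand-in, not the video's actual code: the real setup uses LlamaIndex for embedding-based retrieval and an Ollama-served model, while here retrieval is naive word overlap purely for illustration.

```python
# Minimal sketch of the retrieve-then-generate pattern behind RAG.
# Hypothetical stand-in: the video's setup uses LlamaIndex embeddings
# and Ollama; here retrieval is simple word overlap for clarity.

def chunk(text, size=40):
    """Split a document into fixed-size word chunks."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def retrieve(chunks, query, k=1):
    """Rank chunks by word overlap with the query (toy scoring,
    in place of a real embedding similarity search)."""
    q = set(query.lower().split())
    return sorted(chunks,
                  key=lambda c: len(q & set(c.lower().split())),
                  reverse=True)[:k]

def build_prompt(query, context_chunks):
    """Prepend retrieved context so the LLM answers from the document."""
    context = "\n".join(context_chunks)
    return f"Use the context to answer.\nContext:\n{context}\n\nQuestion: {query}"

doc = ("The Orin Nano Super runs local LLMs. Ollama "
       "serves Llama 3.2 on device.")
chunks = chunk(doc, size=8)
query = "What serves Llama 3.2?"
prompt = build_prompt(query, retrieve(chunks, query))
```

In the real pipeline, `prompt` would then be sent to the Llama 3.2 model via Ollama; the only conceptual difference is that production retrieval uses vector embeddings instead of word overlap.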
Along the way, we tackle unexpected roadblocks, including Ollama breaking inside Jetson containers and various compatibility issues that made setup more challenging than expected. Step by step, we debug and fix these issues, ensuring the system runs smoothly on the Jetson Orin Nano Super.
Once everything is configured, we demonstrate RAG in action, running a Llama 3.2 model on a proprietary document and comparing responses with and without RAG to showcase the real-world impact of document retrieval on LLM answers.
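The with/without-RAG comparison boils down to sending the same question twice: once bare, and once with retrieved document text injected ahead of it. A hedged sketch of that comparison is below. It assumes an Ollama server on its default port (localhost:11434) serving `llama3.2`; the helper name and the memo text are illustrative, not taken from the video's app.

```python
# Hypothetical sketch of the with/without-RAG comparison from the demo.
# Assumes an Ollama server at localhost:11434 with llama3.2 pulled;
# the helper and document text here are illustrative.
import json
import urllib.request

def ask(prompt, model="llama3.2", host="http://localhost:11434"):
    """Send one non-streaming request to Ollama's /api/generate endpoint."""
    body = json.dumps({"model": model, "prompt": prompt,
                       "stream": False}).encode()
    req = urllib.request.Request(f"{host}/api/generate", data=body,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["response"]

question = "What is our internal release codename?"
# Text a retriever would pull from the proprietary document (made up here):
context = "Internal memo: the Q3 release codename is Falcon."

# Without RAG: the model can only fall back on its training data.
plain_prompt = question
# With RAG: the retrieved text is prepended, so the answer is grounded.
rag_prompt = f"Context:\n{context}\n\nQuestion: {question}"

# ask(plain_prompt) vs ask(rag_prompt) shows the difference:
# only the RAG prompt gives the model access to the document's facts.
```

Because the proprietary fact exists only in the document, the bare prompt cannot be answered correctly from training data alone, which is exactly what the side-by-side demo in the video illustrates.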
If you've ever wanted to implement RAG locally on a Jetson device, this guide will walk you through every challenge, fix, and optimization along the way.