A better Hugging Face model search with OpenAI, RAG, pgvector

1.696 Lượt nghe

00:00

Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.

Tải MP3

MÔ TẢ MP3TIẾP THEO

A better Hugging Face model search with OpenAI, RAG, pgvector

Try Voice Writer - speak your thoughts and let AI handle the grammar: https://voicewriter.io

In this tutorial, learn how to build a chatbot to recommend HuggingFace models, using the RAG (retrieval augmented generation) pattern. We use OpenAI embeddings and chat models. Learn how to combine pgvector, response reranking, streaming protocols, and overcome resource constraints in deployment as well.

This tool is no longer available for use and has been discontinued. Sorry for the inconvenience.

00 - Introduction
25 - Designing the model chat tool
44 - Retrieval Augmented Generation (RAG)
17 - Scraping HuggingFace models and readmes
01 - Trying out Llamaindex
28 - Which model to use?
51 - Generating embeddings
40 - Implementing the bot in Python
08 - Popularity reranking
34 - Results so far
09 - Deploying the app
55 - Optimizing memory usage
52 - Comparing vector databases and pgvector
55 - Streaming protocol and server-sent events
34 - Final demo					

A better Hugging Face model search with OpenAI, RAG, pgvector

Nhạc Theo Chủ Đề

Liên kết website