
RAG
/ræɡ/
Retrieval-Augmented Generation; enhancing models with external knowledge
RAG in a sentence
“RAG enables the chatbot to answer questions about private company data.”
Origin of RAG
Acronym: Retrieval-Augmented Generation (Lewis et al., 2020)
Related Words
vector database
a database optimized for storing and querying high-dimensional embeddings
semantic search
searching by meaning rather than exact keyword matching
hybrid search
combining keyword search and vector search for better accuracy
reranking
re-ordering search results using a more precise model
chunking
splitting text into smaller segments for embedding
context injection
inserting retrieved information dynamically into the prompt