AI / LLM

RAG.

Retrieval-Augmented Generation. Modele cevap vermeden önce ilgili belgeleri çekip context'e koyma tekniği. Hallucination'ı azaltır.

RAG (Retrieval-Augmented Generation), bir LLM'in cevap üretmeden önce harici bir bilgi tabanından ilgili belgeleri çekip context'ine eklediği mimaridir. Modeli yeniden eğitmeden taze veya özel bilgiyle çalışmasını sağlar. Klasik RAG: kullanıcı sorusu → embedding → vector DB araması → top-k chunk → LLM prompt'una enjekte → cevap. Modern varyantlar (agentic RAG, hybrid retrieval, reranker) basit pipeline'ı çok adımlı ve daha akıllı hale getirir. Pratikte kalitenin %80'i retrieval kalitesinden, %20'si generation'dan gelir.

Retrieval-Augmented Generation. Fetching relevant docs into the context before the model answers. Reduces hallucinations.

RAG (Retrieval-Augmented Generation) is the architecture where an LLM pulls relevant documents from an external knowledge base and adds them to its context before generating a response. It lets the model work with fresh or proprietary information without retraining. The classic RAG pipeline is: user question → embedding → vector DB search → top-k chunks → injected into the LLM prompt → answer. Modern variants (agentic RAG, hybrid retrieval, rerankers) turn the simple pipeline into something multi-step and smarter. In practice, 80% of quality comes from retrieval, only 20% from generation.

Örnekler

embed docs → store in pgvector → top-k cosine on query → stuff into prompt
use Cohere reranker on retrieved chunks for higher precision

İlgili terimler.

001

Embedding

AI / LLM

Metnin (veya görselin) anlamını sayısal vektöre çevirme. Benzerlik aramanın ve RAG'in temeli.

002

Context Window

AI / LLM

Bir LLM'in tek seferde "görebildiği" maksimum token sayısı. Pencere dolduğunda model unutmaya başlar veya konuşma sıkıştırılır.

003

Hallucination

AI / LLM

Modelin kendinden emin bir tonla uydurma bilgi üretmesi. Var olmayan kütüphane, sahte API, hayali fonksiyon imzası.