Brand LogoBrand Logo (Dark)
HomeAI AgentsToolkitsGitHub PicksSubmit AgentBlog

Categories

  • Art Generators
  • Audio Generators
  • Automation Tools
  • Chatbots & AI Agents
  • Code Tools
  • Financial Tools

Categories

  • Large Language Models
  • Marketing Tools
  • No-Code & Low-Code
  • Research & Search
  • Video & Animation
  • Video Editing

GitHub Picks

  • DeerFlow — ByteDance Open-Source SuperAgent Harness

Latest Blogs

  • OpenClaw vs Composer 2 Which AI Assistant Delivers More Value
  • Google AI Studio vs Anthropic Console
  • Stitch 2.0 vs Lovable Which AI Design Tool Wins in 2026
  • Monetizing AI for Solopreneurs and Small Teams in 2026
  • OpenClaw vs MiniMax Which AI Assistant Wins in 2026

Latest Blogs

  • OpenClaw vs KiloClaw Is Self-Hosting Still Better
  • OpenClaw vs Kimi Claw
  • GPT-5.4 vs Gemini 3.1 Pro
  • Farewell to Bloomberg Terminal as Perplexity Computer AI Redefines Finance
  • Best Practices for OpenClaw
LinkStartAI© 2026 LinkstartAI. All rights reserved.
Contact UsAbout
  1. Home
  2. GitHub Picks
  3. RAG_Techniques
RAG_Techniques logo

RAG_Techniques

A hands-on collection of advanced RAG tutorials and runnable notebooks, organized by technique with practical evaluation patterns.
25.5kJupyter NotebookCustom Non-Commercial License
#rag#vector-search#jupyter-notebook#chunking#query-rewriting
#hybrid-retrieval
#reranking
#rag-evaluation
#rag-cookbook
#rag-playbook
#alternative-to-langchain-cookbook
#alternative-to-llamaindex-examples

What is it?

RAG_Techniques turns RAG from a concept checklist into a reproducible engineering lab. Each technique lives in its own folder with runnable notebooks and explanations, so you can tweak variables across chunking, query transforms, hybrid retrieval, reranking, and evaluation, then compare results and regress safely. The core value is not another framework wrapper, but making the controllable levers explicit: you can run A/B-style comparisons on the same corpus and metrics, then standardize what works for your team. It fits as a practical design ledger for RAG systems, optimized for iteration speed and clarity.

Pain Points vs Innovation

✕Traditional Pain Points✓Innovative Solutions
RAG projects often devolve into tool-stacking: swapping vector DBs or models without a regression-ready variable breakdown, so results are hard to reproduce.RAG_Techniques decomposes core levers (chunking, query transforms, retrieval mixes, reranking, evaluation) into foldered runnable notebooks, ideal for A/B comparisons and regressions.
Team knowledge lives as scattered notes and snippets, making it difficult to turn learnings into repeatable experiments.Runnable examples connect intent → implementation → metrics, helping teams standardize RAG experimentation and reuse templates.

Architecture Deep Dive

Notebooks as an experiment protocol
The paradigm treats each RAG technique as a runnable experiment: inputs, steps, and metrics live together, with knobs exposed for reproducibility and regression.
Decompose controllable levers in the pipeline
Flow centers on data → splitting → indexing → retrieval → reranking → generation → evaluation. Each stage is a swap point with comparison patterns, turning tuning into explainable pipeline experiments.
A tutorial hub, not framework lock-in
The stack rides on Jupyter and Python to highlight methods and baselines. You can port patterns into LangChain or LlamaIndex, or keep it minimal and custom.

Deployment Guide

1. Clone the repo and enter the folder

bash
1git clone https://github.com/NirDiamant/RAG_Techniques.git && cd RAG_Techniques

2. Create a Python venv and install the notebook toolchain

bash
1python -m venv .venv && . .venv/bin/activate && pip install -U pip jupyterlab

3. Start Jupyter and open the notebook you want

bash
1jupyter lab

4. Install extra dependencies referenced by a notebook (as needed)

bash
1pip install -U langchain llama-index

5. Set required API keys and run the comparison experiments (as needed)

bash
1export OPENAI_API_KEY='your_key_here'

Use Cases

Core SceneTarget AudienceSolutionOutcome
Enterprise RAG design review and selectionproduct/architecture leadsrun multiple chunking/retrieval/rerank variants on the same corpus and metricsturn opinions into reproducible evidence and reduce decision churn
A baseline library for RAG engineering teamsML/backend teamsstandardize runnable notebook templates and pin regression setssafer iteration with traceable performance deltas
Education and internal enablementAI enablement ownersuse technique folders as labs and walkthroughsalign teams on RAG levers and evaluation standards quickly

Limitations & Gotchas

Limitations & Gotchas
  • This repo is notebook-first and great for learning and experiments; production requires engineering data pipelines, auth, caching, and observability.
  • Some techniques depend on external models or API keys; plan costs, rate limits, and compliance upfront.
  • The license is non-commercial oriented; confirm permissions before using the materials directly in a commercial product.

Frequently Asked Questions

Is this repo for learning RAG or shipping a product?▾
RAG_Techniques is best viewed as an experiment protocol and comparison baseline. It’s great for learning and design reviews; for products, port conclusions into service code and add governance, caching, and observability.
I already use LangChain/LlamaIndex—do I still need it?▾
Yes. Treat LangChain or LlamaIndex as your implementation layer, and this repo as a lever checklist plus controlled experiments to pinpoint what actually moves metrics.
How do I make these techniques regression-friendly?▾
Pin a corpus and metrics, keep switches for each stage, change one variable at a time, and diff outputs on a regression set to build an auditable experiment log.
View on GitHub

Project Metrics

Stars25.5 k
LanguageJupyter Notebook
LicenseCustom Non-Commercial License
Deploy DifficultyMedium

Table of Contents

  1. 01What is it?
  2. 02Pain Points vs Innovation
  3. 03Architecture Deep Dive
  4. 04Deployment Guide
  5. 05Use Cases
  6. 06Limitations & Gotchas
  7. 07Frequently Asked Questions

Related Projects

Awesome LLM Apps
Awesome LLM Apps
96.4 k·Python
DeerFlow — ByteDance Open-Source SuperAgent Harness
DeerFlow — ByteDance Open-Source SuperAgent Harness
26.1 k·Python
gstack
gstack
0·TypeScript
Marketing for Founders
Marketing for Founders
2.2 k·Markdown