RAGKit

A lightweight RAG framework for building document Q&A applications locally.

RAGKit lets you build "chat with your documents" apps without needing cloud services or complex infrastructure. It's designed to run on regular hardware and get you started quickly.

Features

Simple API - basic usage is around 5 lines of code
Runs locally, no API keys needed for the default setup
Supports PDF, text, and Markdown files
Can use local models (HuggingFace, Ollama) or cloud APIs (OpenAI)
Uses FAISS for vector search
High-level API for quick prototyping, lower-level components if you need more control

Installation

pip install ragkit

# or with PDF support
pip install ragkit[pdf]

# or everything
pip install ragkit[all]

Basic Usage

from ragkit import RAGKit

# Initialize - this will download the embedding model on first run
rag = RAGKit()

# Add documents
rag.add_document("research_paper.pdf")
rag.add_document("notes.txt")

# Ask questions
answer = rag.query("What are the main findings?")
print(answer.text)
print(answer.sources)

RAGKit handles chunking, embedding, retrieval, and generation automatically.

How it works

RAGKit implements Retrieval-Augmented Generation:

Query -> Embed -> Search vector store -> Get relevant chunks -> LLM generates answer

The basic flow:

Documents get split into chunks and embedded as vectors
When you ask a question, it's embedded and compared against stored vectors
Most similar chunks are retrieved and passed to an LLM
The LLM generates an answer based on the retrieved context

Configuration

LLM backends

# Local HuggingFace model (default)
rag = RAGKit(llm_backend="huggingface")

# Ollama - needs Ollama running locally
rag = RAGKit(llm_backend="ollama", llm="llama3.2")

# OpenAI - needs OPENAI_API_KEY environment variable
rag = RAGKit(llm_backend="openai", llm="gpt-4")

Other settings

rag = RAGKit(
    embedding_model="all-mpnet-base-v2",  # different embedding model
    chunk_size=1000,
    chunk_overlap=100,
    top_k=5,  # number of chunks to retrieve
)

Adding documents

# single file
rag.add_document("paper.pdf")

# multiple
rag.add_documents(["doc1.pdf", "doc2.txt", "notes.md"])

# whole directory
rag.add_directory("./documents/", glob="**/*.pdf")

# or just raw text
rag.add_text("Some important information...", metadata={"source": "manual entry"})

Saving and loading

rag.save("my_index")

# later
rag = RAGKit.load("my_index")

Advanced usage

If you need more control, you can use the components directly:

from ragkit import (
    PDFLoader,
    RecursiveCharacterSplitter,
    SentenceTransformerEmbeddings,
    FAISSStore,
    SimilarityRetriever,
    HuggingFaceLLM,
    QAChain,
)

# Load and split
loader = PDFLoader()
documents = loader.load("paper.pdf")

splitter = RecursiveCharacterSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split(documents)

# Embed and store
embeddings = SentenceTransformerEmbeddings(model_name="all-mpnet-base-v2")
vectorstore = FAISSStore(embeddings)
vectorstore.add(chunks)

# Set up retrieval and generation
retriever = SimilarityRetriever(vectorstore, top_k=5)
llm = HuggingFaceLLM(model_name="HuggingFaceTB/SmolLM-360M-Instruct")

chain = QAChain(retriever=retriever, llm=llm)
answer = chain.run("What methodology did they use?")

Supported formats

Format	Extension	Loader
PDF	.pdf	PDFLoader
Plain text	.txt	TextLoader
Markdown	.md	MarkdownLoader

Project structure

ragkit/
├── loaders/          # document loading
├── splitters/        # text chunking
├── embeddings/       # vector embeddings
├── vectorstores/     # FAISS and simple numpy store
├── retrievers/       # similarity search, MMR
├── llms/             # HuggingFace, Ollama, OpenAI backends
└── chains/           # QA and conversational chains

Comparison with other frameworks

RAGKit is smaller and simpler than LangChain or LlamaIndex. It has fewer features and less flexibility, but it's easier to get started with and has fewer dependencies.

If you need production features, extensive integrations, or enterprise support, those frameworks are probably better choices. RAGKit is more suited for prototyping, learning, or simple internal tools where you don't want to deal with a lot of complexity.

Use cases

Asking questions about PDFs (research papers, reports, etc)
Searching through documentation
Building a Q&A system over personal notes
Understanding unfamiliar codebases

Requirements

Python 3.9 or higher
Around 8GB RAM for the default models
About 2GB disk space for model downloads

License

MIT - see LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
examples		examples
ragkit		ragkit
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAGKit

Features

Installation

Basic Usage

How it works

Configuration

LLM backends

Other settings

Adding documents

Saving and loading

Advanced usage

Supported formats

Project structure

Comparison with other frameworks

Use cases

Requirements

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAGKit

Features

Installation

Basic Usage

How it works

Configuration

LLM backends

Other settings

Adding documents

Saving and loading

Advanced usage

Supported formats

Project structure

Comparison with other frameworks

Use cases

Requirements

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages