Document RAG

A backend-only Spring Boot application powered by Spring AI that processes uploaded PDFs. It stores each PDF in local storage, extracts its text using Apache PDFBox, splits the content into semantic chunks, and generates vector embeddings with OpenAI. These embeddings are stored locally, enabling efficient retrieval-based question answering over the document's content.

Built with:

Java 21
Spring Boot 3.x**
Spring AI
Apache PDfBox

Features

PDF Upload: Accepts only .pdf files via REST API.
Text Extraction: Uses Loader from PDFBox.
Chunking: Configurable chunk size & overlap.
Embeddings: Uses OpenAI text-embedding-3-small (1536 dims).
Storage: PDF stored in /storage, embeddings stored locally (e.g., H2/pgvector).
Q&A Endpoint: Ask questions and get AI-generated answers from uploaded PDFs.

Running the application

mvn spring-boot:run

API Endpoints

Upload PDF

curl -F "file=@sample.pdf" https://localhost:8080/api/documents/upload

Ask a question

curl -X POST "https://localhost:8080/api/documents/query" \
-H "Content-Type: application/json" \
-d '{"question": "What is this document about?"}'

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.mvn/wrapper		.mvn/wrapper
data		data
src		src
storage		storage
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml
swagger-screenshot.png		swagger-screenshot.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document RAG

Built with:

Features

Running the application

API Endpoints

Upload PDF

Ask a question

Swagger Screenshot

About

Uh oh!

Releases

Packages

Languages

niyiment/document-rag

Folders and files

Latest commit

History

Repository files navigation

Document RAG

Built with:

Features

Running the application

API Endpoints

Upload PDF

Ask a question

Swagger Screenshot

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages