spring-ai-rag

A Retrieval-Augmented Generation (RAG) Spring Boot application built using Spring AI and integrated with the OpenAI API.

It serves as a virtual real estate assistant capable of answering frequently asked questions (FAQs) based on company domain knowledge, searching real estate listings by filtering criteria such as location, price range, and number of bedrooms, and maintaining conversation history to provide context-aware follow-up responses.

Features

Retrieval-Augmented Generation (RAG): Enriches AI-generated responses by dynamically retrieving context from structured data stored in PostgreSQL with pgvector for embeddings.
Chat History Memory: Maintains a conversation history to provide context-aware replies in multi-turn conversations. The application uses in-memory chat memory for the sake of simplicity—for production-ready systems, it should be changed to a persistent memory store (Spring AI currently supports only Apache Cassandra)
OpenAI Integration: Leverages the Spring AI library to connect with OpenAI GPT models for text generation and embedding creation.
Function Calling Support: Demonstrates OpenAI function calling to trigger SQL queries on a relational database and return structured responses.
RESTful API Design: Provides chat and conversation history retrieval endpoints, ready for frontend chatbot integration.
Hexagonal Architecture: Adopts ports and adapters for modularity, extensibility, and testability.

Flow

User Query – The user submits a question or request.
Vector Search – The query is vectorized and matched against stored embeddings in PostgreSQL (pgvector) to find relevant context.
Context Enrichment – Retrieved context is appended to the query and sent to OpenAI GPT for response generation.
Function Calling (Optional) – GPT can trigger database queries (e.g., property searches) via function calls to fetch real-time data, which is added to the final response.
Response – GPT combines the query, context, and function results to produce a context-aware answer.

Local Run

Use JDK 21.

Export your OpenAI API secret key as an environment variable:

export SPRING_AI_OPENAI_API_KEY=<INSERT KEY HERE>

Run a local PostgreSQL instance using Docker Compose, serving as both a vector store and a relational database:

docker compose up -d

Build and run the application (defaults to port 8080):

./gradlew run

Access the app at: http://localhost:8080/.

Frontend production build will be built automatically and served directly by Spring Boot from the frontend JAR on the classpath. Spring Boot resolves these files automatically using its default behavior for classpath:/static/. API calls in production use the same origin (/), eliminating proxy configurations.

Hot-Reload Development

For frontend development with hot-reload, start the Vite development server (defaults to port 5173):

cd frontend && yarn && yarn dev

Vite automatically proxies API calls to localhost:8080 via vite.config.ts—no additional setup required.

REST API

Chat Endpoint

POST /api/chat Generates a response based on user input.

Request Body

{
  "conversationId": "1447eadc-6c64-481b-aee3-b90ab5c1ef2f",
  "message": "I'm looking for an apartment in Warsaw with two bedrooms under 1 million PLN."
}

Response Body

200 OK

{
  "message": "Here are some apartments in Warsaw with two bedrooms under 1 million PLN:\n\n1. Apartment in Mokotów – 850,000 PLN, 2 bedrooms, 70 m².\n2. Apartment in Wola – 950,000 PLN, 2 bedrooms, 80 m².\n\nWould you like more details about any of these listings?"
}

400 Bad Request – Invalid input or missing fields.
503 Service Unavailable – Chat service temporarily unavailable.

Conversation History Endpoint

GET /api/conversation/{conversationId}?lastN=10 Retrieve recent chat messages.

Response Body

200 OK

{
  "conversationId": "1447eadc-6c64-481b-aee3-b90ab5c1ef2f",
  "messages": [
    {
      "content": "I'm looking for an apartment in Warsaw with two bedrooms under 1 million PLN.",
      "type": "USER"
    },
    {
      "content": "Here are some apartments in Warsaw with two bedrooms under 1 million PLN:\n\n1. Apartment in Mokotów – 850,000 PLN, 2 bedrooms, 70 m².\n2. Apartment in Wola – 950,000 PLN, 2 bedrooms, 80 m².\n\nWould you like more details about any of these listings?",
      "type": "ASSISTANT"
    }
  ]
}

400 Bad Request – Invalid lastN parameter.
404 Not Found – conversationId does not exist.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.github/workflows		.github/workflows
.vscode		.vscode
backend		backend
frontend		frontend
gradle		gradle
http		http
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
app.png		app.png
build.gradle.kts		build.gradle.kts
docker-compose.yml		docker-compose.yml
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle.kts		settings.gradle.kts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

spring-ai-rag

Features

Flow

Local Run

Hot-Reload Development

REST API

Chat Endpoint

Request Body

Response Body

Conversation History Endpoint

Response Body

About

Releases

Packages

Contributors 2

Languages

kszapsza/spring-ai-rag

Folders and files

Latest commit

History

Repository files navigation

spring-ai-rag

Features

Flow

Local Run

Hot-Reload Development

REST API

Chat Endpoint

Request Body

Response Body

Conversation History Endpoint

Response Body

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages