Skip to content

This repository contains the code for all the demo based on OCI Generative AI Service and AI Vector Search

License

Notifications You must be signed in to change notification settings

luigisaetta/llamaindex10_oracle

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Integrate Oracle AI Vector DB and OCI GenAI with Llama-index (v. 0.10+)

Code style: black

The UI of the Knowledge Assistant you can build using following examples.

screenshot

This repository contains all the work done on the development of RAG applications using:

In the Video demos section of the Wiki you'll find some video of the demo.

What is RAG?

A very good introduction to what Retrieval Augmented Generation (RAG) is can be found here

Features

  • basic (12/2023) integration between Oracle DB Vector Store (23c) and llama-index
  • All documents stored in an Oracle AI Vector Search
  • Oracle AI Vector Search used for Semantic Search
  • Reranking to improve retrieval
  • How to show references (documents used for the response generation)
  • (30/12/2023) Added reranker implemented as OCI Model Deployment
  • (20/01/2024) Added implementation of Vector Store for LangChain and demo
  • Using vector to find duplicates in the documentation
  • (2/03/2024) Added Phoenix Traces for observability
  • (25/3/2024) This is the code for LlamaIndex 0.10+

Demos

Setup

See the wiki pages.

Loading data

  • You can use the create_save_embeddings Python program to load all the data in the Oracle DB schema.
  • You can launch it using the script load_books.
  • Files to be loaded are contained in the dir specified in the file config.py

You need to have pdf files in the same directory.

Releases used for the demo

  • OCI 2.126.1
  • OCI ADS 2.11.3
  • LangChain 0.1.12
  • LangChain Community 0.0.28
  • Llama-index 0.1.19
  • Oracle Database 23c (23.4) Enterprise Edition with AI Vector Search

You can install a complete Python environment using the instructions in the *Setup section of the Wiki.

Libraries

  • OCI Python SDK
  • OCI ADS
  • oracledb
  • Streamlit
  • Llama-index
  • LangChain
  • Arize-AI/phoenix for Observability and Tracing

Documentation

Embeddings

One of the key pieces in a RAG solution is the Retrieval module. To use the AI DB Vector Store you need an Embeddings Model: a model that does the magic of transforming text in vectors, capturing the content and the semantic of the text. The Embeddings Model used in these demos is Cohere Embeds V3, provided from OCI GenAI service.

With few changes, you can switch to use any Open Source model. But you need to have the computational power (GPU) to run it.

Observability

(02/03/2024) Added integration with Arize Phoenix (Phoenix traces).

To enable tracing you must set ADD_PHX_TRACING = True, in config.py

In case of troubles with installing Phoenix a quick solution is to disable it.

Factory Methods

In the module prepare_chain_4_chat are defined the factory methods to create: embeddings, llm, reranker...

The philosophy is to make things simpler. So all the configuration are taken from config.py.

If you want to change the llm, (or anything else) go to config.py. No params in the NBs

About

This repository contains the code for all the demo based on OCI Generative AI Service and AI Vector Search

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published