PharmaQuery is an advanced Pharmaceutical Insight Retrieval System designed to help users gain meaningful insights from research papers and documents in the pharmaceutical domain.
PharmaQuery-Demo.mp4
- Natural Language Querying: Ask complex questions about the pharmaceutical industry and get concise, accurate answers.
- Custom Database: Upload your own research documents to enhance the retrieval system's knowledge base.
- Similarity Search: Retrieves the most relevant documents for your query using AI embeddings.
- Streamlit Interface: User-friendly interface for queries and document uploads.
- Programming Language: Python 3.10+
- Framework: LangChain
- Database: ChromaDB
- Models:
- Embeddings: Google Gemini API (embedding-001)
- Chat: Google Gemini API (gemini-1.5-pro)
- PDF Processing: PyPDFLoader
- Document Splitter: SentenceTransformersTokenTextSplitter
-
Clone the Repository:
git clone https://github.com/CodeWithCharan/PharmaQuery.git cd PharmaQuery
-
Install Dependencies:
pip install -r requirements.txt
-
Set Up Environment Variables: Create a
.env
file in the project root directory with the following variables:GOOGLE_API_KEY="your_google_gemini_api_key"
Note:
Replaceyour_google_gemini_api_key
with actual key. -
Run the Application:
streamlit run app.py
-
Use the Application:
- Enter your query in the main interface.
- Optionally, upload research papers in the sidebar to enhance the database.