Skip to content

Latest commit

 

History

History
45 lines (28 loc) · 1.24 KB

README.md

File metadata and controls

45 lines (28 loc) · 1.24 KB

LLM Project (Summarize And Retrieve Papers)

Setup

  1. Install poetry: https://python-poetry.org/docs/#installation
  2. Install dependencies: poetry install
  3. Install LLAMA2 Chat weights

wget https://huggingface.co/TheBloke/Llama-2-7b-Chat-GGUF/resolve/main/llama-2-7b-chat.Q5_0.gguf

Summarization

How to Run the summarization script

python3 llm-project/summarization.py --file_path <path_to_pdf> --model_path <path_to_model> --verbose <whether to see complete logs or not>

image

Retrieve Top 3 Papers

How to Run the retrieve top 3 papers script

python3 llm-project/retrieve.py --query <query> --model_name <sentence_embedding_model> --verbose <whether to see complete logs or not> --top_k <number of papers to retrieve> --data_path <path to the data>

image

Evaluate Results

Testing the summarization and retrieval scripts can be done using Rouge or BERT score to see if the summary or the retrieved papers are relevant to the query.

To evaluate results for both summarization and retrieval, run the following command:

python3 llm-project/evaluate.py --model_result <model's output> --ground_truth <ground truth>

image