
[Draft] Add Multimodal RAG notebook #2497

Merged: 29 commits into openvinotoolkit:latest on Dec 23, 2024

Conversation

@openvino-dev-samples (Collaborator) commented Nov 1, 2024

[Two screenshots attached in the original description.]


@openvino-dev-samples changed the title from "[Draft] Add Multimodal RAG" to "[Draft] Add Multimodal RAG notebook" on Nov 4, 2024
transfer to optimum-intel (2 commits)
@eaidova (Collaborator) commented Nov 18, 2024

@openvino-dev-samples everything looks good to me, thanks.

A couple of comments:
It may be better to switch to accuracy-aware quantization for the VLM using optimum-cli; the --weight-format int4 --dataset contextual options need to be provided for that (fyi @nikita-savelyevv).

Are there any plans to integrate OV visual language models directly into llama-index?

@openvino-dev-samples (Collaborator, Author) commented Nov 18, 2024

> @openvino-dev-samples everything looks good to me, thanks.
>
> A couple of comments: It may be better to switch to accuracy-aware quantization for the VLM using optimum-cli; the --weight-format int4 --dataset contextual options need to be provided for that (fyi @nikita-savelyevv).
>
> Are there any plans to integrate OV visual language models directly into llama-index?

Thanks for your review. The integration is already done in llama-index:

https://docs.llamaindex.ai/en/stable/examples/multi_modal/openvino_multimodal/

BTW, is there an example of accuracy-aware quantization for phi3-vision?
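
For readers following the link above, a minimal sketch of that integration (the package is llama-index-multi-modal-llms-openvino; class and argument names below follow the linked docs page and should be verified against it, as the API may have changed):

from llama_index.core.schema import ImageDocument
from llama_index.multi_modal_llms.openvino import OpenVINOMultiModal

# Load an exported OpenVINO VLM; the model path is a hypothetical placeholder.
vlm = OpenVINOMultiModal(
    model_id_or_path="phi-3-vision-ov-int4",
    device="CPU",
)

# Standard llama-index multimodal interface: a text prompt plus image documents.
response = vlm.complete(
    prompt="Describe this image.",
    image_documents=[ImageDocument(image_path="sample.jpg")],
)
print(response.text)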

@nikita-savelyevv (Collaborator)

> A couple of comments: It may be better to switch to accuracy-aware quantization for the VLM using optimum-cli; the --weight-format int4 --dataset contextual options need to be provided for that (fyi @nikita-savelyevv).

I'll add that the algorithm itself needs to be specified, e.g. --weight-format int4 --dataset contextual --awq.

Also, the default of 128 calibration samples might be too large; it can be reduced with --num-samples 32.

@openvino-dev-samples (Collaborator, Author)

> I'll add that the algorithm itself needs to be specified, e.g. --weight-format int4 --dataset contextual --awq.
>
> Also, the default of 128 calibration samples might be too large; it can be reduced with --num-samples 32.

Hi, in my tests the accuracy with this configuration is not satisfactory:
optimum-cli export openvino --model {vlm_model_id} {vlm_model_path} --trust-remote-code --weight-format int4 --dataset contextual --awq --num-samples 32
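
For readability, the same command with the data-aware options spelled out ({vlm_model_id} and {vlm_model_path} remain placeholders, as above):

# Data-aware INT4 export with AWQ, flags as discussed in this thread:
#   --weight-format int4  : 4-bit weight compression
#   --dataset contextual  : calibration dataset for data-aware compression
#   --awq                 : enable the AWQ accuracy-aware algorithm
#   --num-samples 32      : fewer calibration samples than the default of 128
optimum-cli export openvino --model {vlm_model_id} {vlm_model_path} \
  --trust-remote-code --weight-format int4 --dataset contextual \
  --awq --num-samples 32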

add load image function
@nikita-savelyevv (Collaborator) commented Nov 19, 2024

> Hi, in my tests the accuracy with this configuration is not satisfactory: optimum-cli export openvino --model {vlm_model_id} {vlm_model_path} --trust-remote-code --weight-format int4 --dataset contextual --awq --num-samples 32

Thanks for the information! Have you compared it against the configuration below?

compression_config = {
    "mode": nncf.CompressWeightsMode.INT4_SYM,
    "group_size": 64,
    "ratio": 0.6,
}

@openvino-dev-samples replied: Yes, this configuration brings more reasonable responses compared to optimum-cli.
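
For context, a minimal sketch of how a config like the one above is applied with NNCF's weight-compression API (model paths are placeholders; the exact wiring in the notebook may differ):

import nncf
import openvino as ov

# Read an exported OpenVINO model (path is a placeholder).
core = ov.Core()
model = core.read_model("openvino_language_model.xml")

# Symmetric INT4 weights, group size 64; ratio=0.6 compresses roughly 60% of
# the weight layers to INT4 and leaves the rest in the INT8 fallback precision.
compressed = nncf.compress_weights(
    model,
    mode=nncf.CompressWeightsMode.INT4_SYM,
    group_size=64,
    ratio=0.6,
)
ov.save_model(compressed, "openvino_language_model_int4.xml")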

update the method of audio extraction
update with accuracy aware quantization
@openvino-dev-samples (Collaborator, Author)

> --trust-remote-code --weight-format int4 --dataset contextual --awq --num-samples 32

Sorry, I made a mistake before. The result looks good with this accuracy-aware config now.

However, I find it impossible to run this quantization method on a client PC with 32GB of RAM.

@nikita-savelyevv (Collaborator)

> Sorry, I made a mistake before. The result looks good with this accuracy-aware config now.
>
> However, I find it impossible to run this quantization method on a client PC with 32GB of RAM.

Could you please verify that NNCF 2.14 is installed? It was released recently and contains significant improvements in peak RAM usage during data-aware compression.
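
A quick check in a pip-managed environment (the version pin mirrors the release mentioned above):

# Print the installed NNCF version, then upgrade if it is older than 2.14.
python -c "import nncf; print(nncf.__version__)"
pip install --upgrade "nncf>=2.14"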

@brmarkus

> Could you please verify that NNCF 2.14 is installed? It was released recently and contains significant improvements in peak RAM usage during data-aware compression.

At least this pull request doesn't touch requirements.txt and doesn't check that a recent version of NNCF is present. On my 64GB laptop, system memory briefly reaches full utilization.
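
A minimal way to encode that constraint in the notebook's requirements (a hypothetical addition suggested by this comment, not part of the PR):

# requirements.txt
nncf>=2.14.0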

@openvino-dev-samples force-pushed the multimodal-rag branch 6 times, most recently from 1508f3c to e590031 on December 18, 2024 at 01:18
change the ASR model id (8 commits)
@eaidova merged commit 41280d6 into openvinotoolkit:latest on Dec 23, 2024. 16 checks passed.