-
Notifications
You must be signed in to change notification settings - Fork 839
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Draft]Add Multimodal RAG notebook #2497
[Draft]Add Multimodal RAG notebook #2497
Conversation
openvino-dev-samples
commented
Nov 1, 2024
•
edited
Loading
edited
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
5f24b3f
to
e09b544
Compare
2050dc7
to
f72dab2
Compare
transfer to optimum-intel transfer to optimum-intel
f72dab2
to
669ad71
Compare
@openvino-dev-samples for me everything looks good, thanks. Couple of comments: Is there any plans to integrate OV Visual Language models directly in llama-index? |
Thanks for your review, the integration is already done in llama-index https://docs.llamaindex.ai/en/stable/examples/multi_modal/openvino_multimodal/ BTW is there an example for phi3-vision's accuracy aware quantization ? |
I'll add that an algorithm itself needs to be specified, e.g. Also, the default number of samples of 128 might be too large, so it can be reduced with |
Hi, as my test, the accuracy with this configuration is not satisfied: |
add load image function
ccede65
to
655ab9c
Compare
Thanks for the information! Have you compared it against the configuration below?
Yes, this configuration brings more reasonable responses compared to optimum-cli |
update the method of audio extraction
cce7f97
to
7dffa6a
Compare
update the screenshot display method
update with accruaracy aware quantization
c4dcb16
to
89f9ec5
Compare
solve conflict
2e02d4f
to
a868ab5
Compare
However I find it impossible to run this quantization method in a client PC with 32GB RAM |
Could you please verify that NNCF 2.14 is installed? It was released recently and it contains significant improvements in terms of peak RAM during data-aware compression. |
At least this pull-request doesn't touch "requirements.txt" and doesn't check to make sure a recent version of NNCF is present. On my 64GB laptop the max. system memory is shortly fully used. |
1508f3c
to
e590031
Compare
change the ASR model id change the ASR model id change the ASR model id change the ASR model id change the ASR model id change the ASR model id change the ASR model id change the ASR model id
e590031
to
7290856
Compare