[Bugfix] Embedding model pooling_type equals ALL and multi input's bug #10494
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can do one of these:
I changed it to avoid an error when calling normalize and softmax later on:
Didn't realize that the shapes per element can be different though, thanks for fixing! I guess we need to handle
Ok, I will handle the case of a list of tensors later.
I have reverted the pooling_type=step case to return a list of tensors too. Additionally, I have added checks to separately process a list of tensors during normalization and softmax.
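To illustrate the idea of processing a list of per-prompt results separately, here is a minimal sketch. It is not the code from this PR: plain nested Python lists stand in for tensors so the example runs without dependencies, and the helper names (`l2_normalize_rows`, `softmax_rows`, `apply_per_prompt`) are hypothetical.

```python
import math

# Stand-in for a 2-D tensor of shape (num_tokens, hidden_size).
Matrix = list[list[float]]


def l2_normalize_rows(m: Matrix) -> Matrix:
    """Scale every row to unit L2 norm (all-zero rows are left unchanged)."""
    out = []
    for row in m:
        norm = math.sqrt(sum(x * x for x in row)) or 1.0
        out.append([x / norm for x in row])
    return out


def softmax_rows(m: Matrix) -> Matrix:
    """Numerically stable softmax over the last dimension of each row."""
    out = []
    for row in m:
        row_max = max(row)
        exps = [math.exp(x - row_max) for x in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def apply_per_prompt(outputs: list[Matrix], fn) -> list[Matrix]:
    """Apply fn to each prompt's result on its own, so the per-prompt
    results never need to share a shape."""
    return [fn(m) for m in outputs]


# Two prompts of different lengths: 3 tokens and 1 token, hidden size 2.
pooled = [
    [[3.0, 4.0], [0.0, 2.0], [1.0, 0.0]],
    [[6.0, 8.0]],
]
normalized = apply_per_prompt(pooled, l2_normalize_rows)
probs = apply_per_prompt(pooled, softmax_rows)
```

Each element keeps its own number of rows after the operation, which is the property a single stacked batch cannot provide when prompt lengths differ.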
Looks good, but please fix the lint errors.
Thanks for fixing!
Let me retry the test
vllm-project#10494) Signed-off-by: Tyler Michael Smith <[email protected]>
vllm-project#10494) Signed-off-by: Maxime Fournioux <[email protected]>
When I run the following example code, it triggers an error:
I found that the bug was introduced in this commit. Reverting this PR's pooling_type='ALL' change to the original logic allows the embedding model to correctly infer multiple prompts of different lengths.
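The reason prompts of different lengths matter for pooling_type='ALL' can be sketched as follows. This is an illustration, not vLLM's actual implementation: it assumes the hidden states of all prompts arrive concatenated along the token dimension together with a list of prompt lengths, and the name `pool_all` is hypothetical. Nested lists again stand in for tensors.

```python
def pool_all(hidden_states, prompt_lens):
    """Split concatenated per-token hidden states back into one chunk per
    prompt. With ALL pooling every token's hidden state is kept, so each
    chunk has as many rows as its prompt has tokens."""
    if sum(prompt_lens) != len(hidden_states):
        raise ValueError("prompt_lens must sum to the number of token rows")
    pooled = []
    offset = 0
    for n in prompt_lens:
        pooled.append(hidden_states[offset:offset + n])
        offset += n
    return pooled


# Five token rows in total (hidden size 2), belonging to prompts of
# length 2 and 3.
hidden = [[float(i), float(i) + 0.5] for i in range(5)]
per_prompt = pool_all(hidden, [2, 3])

# The chunks have different row counts, so they cannot form one
# rectangular batch and have to stay a list of separate results.
is_ragged = len({len(chunk) for chunk in per_prompt}) > 1
```

Under these assumptions, any code path that tries to combine the per-prompt results into a single array works only when all prompts happen to have the same length.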