Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CPU] Fixed first token latency for compressed models #25506

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

dmitry-gorokhov
Copy link
Contributor

@dmitry-gorokhov dmitry-gorokhov commented Jul 11, 2024

Details:

Tickets:

@dmitry-gorokhov dmitry-gorokhov added the category: CPU OpenVINO CPU plugin label Jul 11, 2024
@dmitry-gorokhov dmitry-gorokhov requested review from a team as code owners July 11, 2024 08:59
@dmitry-gorokhov dmitry-gorokhov self-assigned this Jul 11, 2024
@dmitry-gorokhov dmitry-gorokhov force-pushed the fix/first_token_latency branch from 98cc377 to 5074add Compare July 11, 2024 09:06
@dmitry-gorokhov dmitry-gorokhov added this to the 2024.4 milestone Jul 17, 2024
Copy link
Contributor

This PR will be closed in a week because of 2 weeks of no activity.

@github-actions github-actions bot added the Stale label Aug 11, 2024
@wenjiew wenjiew added no_stale Do not mark as stale and removed Stale labels Aug 22, 2024
@ilya-lavrenov ilya-lavrenov removed this from the 2024.4 milestone Oct 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: CPU OpenVINO CPU plugin no_stale Do not mark as stale under_perf_check
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants