Skip to content

[CPU] Fixed first token latency for compressed models #40130

[CPU] Fixed first token latency for compressed models

[CPU] Fixed first token latency for compressed models #40130

Triggered via pull request July 11, 2024 09:06
Status Success
Total duration 52m 22s
Billable time 49m
Artifacts 15

linux.yml

on: pull_request
OpenVINO tokenizers extension  /  OpenVINO tokenizers extension
3m 14s
OpenVINO tokenizers extension / OpenVINO tokenizers extension
ONNX Runtime Integration  /  ONNX Runtime Integration
5m 22s
ONNX Runtime Integration / ONNX Runtime Integration
Matrix: Conformance
Matrix: iGPU Tests
Waiting for pending jobs
Debian Packages  /  Debian Packages
2m 34s
Debian Packages / Debian Packages
Samples  /  Samples
6m 32s
Samples / Samples
C++ unit tests  /  C++ unit tests
2m 20s
C++ unit tests / C++ unit tests
Python unit tests  /  Python unit tests
29m 52s
Python unit tests / Python unit tests
CPU functional tests  /  CPU functional tests
20m 30s
CPU functional tests / CPU functional tests
NVIDIA plugin
0s
NVIDIA plugin
Matrix: dGPU Tests
Waiting for pending jobs
ONNX Models Tests  /  ONNX Models tests
18m 56s
ONNX Models Tests / ONNX Models tests
OpenVINO JS API  /  OpenVINO JS API
OpenVINO JS API / OpenVINO JS API
PyTorch Models tests  /  PyTorch Models tests
26m 56s
PyTorch Models tests / PyTorch Models tests
TensorFlow Layer Tests  /  TensorFlow Layer Tests
11m 7s
TensorFlow Layer Tests / TensorFlow Layer Tests
TensorFlow Models tests  /  TensorFlow Models tests
21m 51s
TensorFlow Models tests / TensorFlow Models tests
TensorFlow Hugging Face Models tests  /  TensorFlow Models tests
TensorFlow Hugging Face Models tests / TensorFlow Models tests
TensorFlow TF Hub Models tests  /  TensorFlow Models tests
TensorFlow TF Hub Models tests / TensorFlow Models tests
ci/gha_overall_status
0s
ci/gha_overall_status
Fit to window
Zoom out
Zoom in

Annotations

1 warning
Smart_CI
The following actions uses Node.js version which is deprecated and will be forced to run on node20: actions/cache@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

Artifacts

Produced during runtime
Name Size
build_logs Expired
2.8 KB
conformance_artifacts_API-CPU Expired
125 KB
conformance_artifacts_API-TEMPLATE Expired
60.4 KB
conformance_artifacts_OP-CPU Expired
4.74 MB
openvino_debian_packages Expired
49.4 MB
openvino_developer_package Expired
27.5 MB
openvino_package Expired
101 MB
openvino_tests Expired
162 MB
openvino_tokenizers_wheel Expired
13.1 MB
test-results-cpp Expired
71.7 KB
test-results-functional-cpu Expired
9.75 MB
test-results-python Expired
110 KB
test-results-python-tf-layers Expired
39.2 KB
test-results-tensorflow-models-precommit Expired
201 KB
test-results-torch-models Expired
547 KB