[GPU] Updated GPU cache size retrieval and refined closest_pow_of_2 #28059

arshadlab · 2024-12-13T10:07:23Z

Details:
Existing method for cache size calculation was static and need continious updates to the sku table which was already being missed for latest skus e.g DG2.
This update introduces a new member variable, max_global_cache_size, to store the GPU's global cache size, obtained via the OpenCL property CL_DEVICE_GLOBAL_MEM_CACHE_SIZE. The existing hard coded cache calculations are removed. Additionally, the closest_pow_of_2 function has been enhanced to return the nearest power of 2, favoring the upper value if the input is within 30% of the range for the upper bound. These changes improve memory management and ensure better utilization of GPU resources towards bottle neck situations.

Tickets:
CVS-159076

Details: Existing method for cache size calculation was static and need continious updates to the sku table which was already being missed for latest skus e.g DG2. This update introduces a new member variable, max_global_cache_size, to store the GPU's global cache size, obtained via the OpenCL property CL_DEVICE_GLOBAL_MEM_CACHE_SIZE. The existing hard coded cache calculations are removed. Additionally, the closest_pow_of_2 function has been enhanced to return the nearest power of 2, favoring the upper value if the input is within 30% of the range for the upper bound. These changes improve memory management and ensure better utilization of GPU resources towards bottle neck situations. Tickets: CVS-159076 Signed-off-by: Arshad Mehmood <[email protected]>

txlim96 · 2024-12-16T07:01:01Z

Verified with ADLS+A310E
auto batching.txt
BS32.txt

p-durandin · 2024-12-16T15:59:04Z

build_jenkins

…penvinotoolkit#28059) Details: Existing method for cache size calculation was static and need continious updates to the sku table which was already being missed for latest skus e.g DG2. This update introduces a new member variable, max_global_cache_size, to store the GPU's global cache size, obtained via the OpenCL property CL_DEVICE_GLOBAL_MEM_CACHE_SIZE. The existing hard coded cache calculations are removed. Additionally, the closest_pow_of_2 function has been enhanced to return the nearest power of 2, favoring the upper value if the input is within 30% of the range for the upper bound. These changes improve memory management and ensure better utilization of GPU resources towards bottle neck situations. Tickets: CVS-159076 Signed-off-by: Arshad Mehmood <[email protected]>

arshadlab requested review from a team as code owners December 13, 2024 10:07

github-actions bot added the category: GPU OpenVINO GPU plugin label Dec 13, 2024

sys-openvino-ci added the ExternalIntelPR External contributor from Intel label Dec 13, 2024

arshadlab force-pushed the auto_batch_update branch from 564fd28 to f651a9b Compare December 13, 2024 10:36

peterchen-intel assigned vladimir-paramuzov Dec 15, 2024

peterchen-intel self-requested a review December 15, 2024 12:36

vladimir-paramuzov approved these changes Dec 17, 2024

View reviewed changes

vladimir-paramuzov added this to the 2025.0 milestone Dec 17, 2024

vladimir-paramuzov added this pull request to the merge queue Dec 17, 2024

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 17, 2024

p-durandin added this pull request to the merge queue Dec 17, 2024

Merged via the queue into openvinotoolkit:master with commit b0a8c14 Dec 17, 2024
160 checks passed

peterchen-intel mentioned this pull request Dec 29, 2024

[Batch Plugin] fix the issue about optimal batch size in batch plugin #28045

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GPU] Updated GPU cache size retrieval and refined closest_pow_of_2 #28059

[GPU] Updated GPU cache size retrieval and refined closest_pow_of_2 #28059

arshadlab commented Dec 13, 2024 •

edited

Loading

txlim96 commented Dec 16, 2024

p-durandin commented Dec 16, 2024

[GPU] Updated GPU cache size retrieval and refined closest_pow_of_2 #28059

[GPU] Updated GPU cache size retrieval and refined closest_pow_of_2 #28059

Conversation

arshadlab commented Dec 13, 2024 • edited Loading

txlim96 commented Dec 16, 2024

p-durandin commented Dec 16, 2024

arshadlab commented Dec 13, 2024 •

edited

Loading