Refactors topdown service to support multiple types of calculations and adds support for Sapphire Rapids #576

ilumsden · 2024-07-21T00:25:51Z

This PR refactors the topdown service to allow it to be easily extended to support top-down calculations for processors besides Haswell/Broadwell. To do this, I've made the following changes:

Creates a new TopdownCalculator base class to represent calculations for different processors
Moves the existing calculations to a new HaswellTopdown class, which inherits from TopdownCalculator
Refactors IntelTopdown to offload all the computation and associated tracking to a pointer to a TopdownCalculator object

This PR also uses this new infrastructure to add support for Sapphire Rapids and Emerald Rapids CPUs. The calculations for these CPUs were obtained from Intel's perfmon repo, specifically the following file: https://github.com/intel/perfmon/blob/main/SPR/metrics/perf/sapphirerapids_metrics_perf.json. The support added by this PR only covers the first two levels of the top-down hierarchy. In the future, this could be expanded to cover as much of the 6 levels for Sapphire Rapids as desired.

This PR is still work-in-progress. The following tasks need to be completed before this is ready for review:

Add a CMake mechanism to specify an architecture (currently planning on supporting architecture names from archspec)
Use that mechanism to select (at build time) the correct TopdownCalculator child class to use in the topdown service
Examine if any changes need to be made to the topdown built-in option

ilumsden · 2024-07-21T00:57:45Z

In the future, it might be useful to consider embedding archspec-json if more architecture-specific features are added to Caliper.

ilumsden · 2024-09-09T16:56:05Z

Outstanding work on this PR:

~~Find a way to use PAPI multiplexing with the SPR counters needed for topdown~~ Deferring as future work since it is not needed at this time
Add testing
Update documentation (if it exists) about the topdown service

ilumsden · 2024-10-09T21:02:32Z

This PR is fully tested on Poodle and is now ready for review. There are only a few outstanding things left, namely adding documentation.

daboehme · 2024-10-25T17:00:27Z

Hi @ilumsden, there's a new change in Caliper this week that breaks up the giant option spec string in controllers.cpp. That was mainly because we ran into MSVC's string literal size limit, but it also makes it so we can pick options based on the build configuration. I think that's a better approach for selecting the topdown options than adding a new ConfigManager function. Can you try and adapt the PR to the new approach? Essentially you can define separate option spec strings for each configuration in controllers.cpp and then add whichever one is appropriate to the builtin_option_specs_list in ConfigManager.cpp. Should be pretty self-explanatory. Thanks!

…ll calculations and Sapphire Rapids/Emerald Rapids calculations

…like architecture support

… based on architecture specified at configure time

…plementation

…se that configuration in the topdown service

…of raw counter values

… making new topdown calculators

…or not

… in TopdownCalculator

ilumsden · 2024-10-25T20:41:40Z

@daboehme I've worked that change into the PR. I did have to make one small change to that mechanism to handle architecture detection. I had to move where builtin_option_specs_list gets populated from global scope to the constructor of ConfigManagerImpl due to having to do string comparison (which is extremely difficult to do at compile time in C++11).

ilumsden marked this pull request as draft July 21, 2024 00:25

ilumsden force-pushed the topdown-spr branch from eddd34c to 18c14d4 Compare October 8, 2024 15:17

ilumsden marked this pull request as ready for review October 9, 2024 21:01

ilumsden force-pushed the topdown-spr branch 2 times, most recently from d97ca7b to 20f37a0 Compare October 21, 2024 17:51

ilumsden added 11 commits October 25, 2024 14:53

Reimplements the IntelTopdown service to support both Haswell/Broadwe…

01bf472

…ll calculations and Sapphire Rapids/Emerald Rapids calculations

Adds infrastructure to update builtin_option_specs based on features …

cabc803

…like architecture support

Adds conditional behavior to topdown service and builtin option specs…

c367be3

… based on architecture specified at configure time

Splits TopdownCalculator and subclasses into own files to simplify im…

c09c473

…plementation

Adds a 'disable_multiplexing' configuration to the Papi service and u…

ea6d73e

…se that configuration in the topdown service

Checks whether PAPI uses rdpmc on SPR in the topdown service

26f46a3

Reworks SPR topdown implementation to use rdpmc-style values instead …

0d98d08

…of raw counter values

Updates option spec for SPR topdown and adds instruction comments for…

28ff918

… making new topdown calculators

Adds a CMake flag to let users tell us if PAPI is built to use rdpmc …

eec5ea9

…or not

Disables multiplexing in topdown-counters

7775590

Adds comments describing the expected behavior of the virtual methods…

f72b2cf

… in TopdownCalculator

ilumsden force-pushed the topdown-spr branch from 20f37a0 to f72b2cf Compare October 25, 2024 20:37

daboehme merged commit b7139cc into LLNL:master Oct 25, 2024
2 checks passed

ilumsden mentioned this pull request Oct 26, 2024

caliper: set CMake variables used by refactored topdown service spack/spack#47231

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactors topdown service to support multiple types of calculations and adds support for Sapphire Rapids #576

Refactors topdown service to support multiple types of calculations and adds support for Sapphire Rapids #576

ilumsden commented Jul 21, 2024

ilumsden commented Jul 21, 2024

ilumsden commented Sep 9, 2024 •

edited

Loading

ilumsden commented Oct 9, 2024

daboehme commented Oct 25, 2024 •

edited

Loading

ilumsden commented Oct 25, 2024

Refactors topdown service to support multiple types of calculations and adds support for Sapphire Rapids #576

Refactors topdown service to support multiple types of calculations and adds support for Sapphire Rapids #576

Conversation

ilumsden commented Jul 21, 2024

ilumsden commented Jul 21, 2024

ilumsden commented Sep 9, 2024 • edited Loading

ilumsden commented Oct 9, 2024

daboehme commented Oct 25, 2024 • edited Loading

ilumsden commented Oct 25, 2024

ilumsden commented Sep 9, 2024 •

edited

Loading

daboehme commented Oct 25, 2024 •

edited

Loading