Update AMD Operator and NFD install scripts #2139

Merged 8 commits on Jan 10, 2025

@bdattoma (Contributor) commented Dec 18, 2024

This PR addresses two issues:

  1. The certified AMD GPU operator is available only on OCP versions >= 4.16.
  2. The fixed NFD CR image we use is not compatible with OCP versions > 4.16.

Proposed solutions:

  1. Create a custom catalog source pointing to the OCP 4.16 certified operators catalog source.
  2. Use the NFD CR image corresponding to the cluster's OCP version.
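The first proposed solution can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the helper names, the catalog source name `certified-operators-416`, and the index image reference are all assumptions made for the example. It decides whether a cluster needs the custom catalog source and builds a `CatalogSource` manifest for it.

```python
# Sketch only: helper names, the catalog source name, and the index image
# are illustrative assumptions, not values taken from the PR's scripts.

def parse_major_minor(ocp_version: str) -> tuple:
    """Return (major, minor) as integers from a string such as '4.15.12'."""
    major, minor = ocp_version.split(".")[:2]
    return int(major), int(minor)


def needs_custom_catalog(ocp_version: str, first_supported: tuple = (4, 16)) -> bool:
    """True when the cluster is older than the first OCP release that
    offers the certified operator in its default catalog."""
    return parse_major_minor(ocp_version) < first_supported


def build_catalog_source(name: str = "certified-operators-416") -> dict:
    """Build a CatalogSource manifest that points at the OCP 4.16
    certified operators index."""
    return {
        "apiVersion": "operators.coreos.com/v1alpha1",
        "kind": "CatalogSource",
        "metadata": {"name": name, "namespace": "openshift-marketplace"},
        "spec": {
            "sourceType": "grpc",
            # Assumed image reference for the 4.16 certified operators index.
            "image": "registry.redhat.io/redhat/certified-operator-index:v4.16",
            "displayName": "Certified Operators (OCP 4.16)",
        },
    }


if __name__ == "__main__":
    cluster_version = "4.15.3"  # hypothetical cluster version
    if needs_custom_catalog(cluster_version):
        print(build_catalog_source())
```

On a 4.16 or newer cluster `needs_custom_catalog` returns `False`, so the default catalog would be used and no extra manifest would be created.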

A note about 2: we could fetch the image dynamically using the vx.y label, but this wouldn't be compatible with our disconnected cluster automation. Hence, for now, we are going with fixed image digests.
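A version-to-digest lookup along these lines could implement the fixed-digest approach. It is only a sketch under stated assumptions: the repository name and the digest strings are placeholders (the real pinned values are not shown in this description), and `nfd_image_for` is a hypothetical helper name.

```python
# Placeholder values: the repository and digests below are hypothetical and
# stand in for whatever the install scripts actually pin.
NFD_IMAGE_REPO = "registry.example.com/nfd/node-feature-discovery"
NFD_IMAGE_DIGESTS = {
    "4.15": "sha256:<digest-for-4.15>",
    "4.16": "sha256:<digest-for-4.16>",
    "4.17": "sha256:<digest-for-4.17>",
}


def nfd_image_for(ocp_version: str) -> str:
    """Return the pinned NFD image reference for the cluster's major.minor version."""
    major_minor = ".".join(ocp_version.split(".")[:2])
    try:
        digest = NFD_IMAGE_DIGESTS[major_minor]
    except KeyError:
        # Fail loudly instead of silently falling back to an incompatible image.
        raise ValueError(f"no pinned NFD image for OCP {major_minor}") from None
    return f"{NFD_IMAGE_REPO}@{digest}"
```

The trade-off in this sketch is that each new OCP minor release needs a new table entry, which is the price of not resolving a vx.y tag at install time.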

PR validation:

  • OCP 4.17 with AMD GPU: rhoai-test-flow/2157 PASS
  • OCP 4.15 with NVIDIA GPU: rhoai-test-flow/2158 PASS

github-actions bot (Contributor) commented Dec 18, 2024

Robot Results

| ✅ Passed | ❌ Failed | ⏭️ Skipped | Total | Pass % |
| --- | --- | --- | --- | --- |
| 593 | 0 | 0 | 593 | 100 |

@bdattoma self-assigned this Dec 18, 2024
@bdattoma added the enhancements label (Bugfixes, enhancements, refactoring, ... in tests or libraries; PR will be listed in release-notes) Dec 18, 2024
@bdattoma added the needs testing (Needs to be tested in Jenkins), do not merge (Do not merge this yet please), and verified (This PR has been tested with Jenkins) labels, and removed the do not merge and needs testing labels Dec 18, 2024
@jstourac (Member) previously approved these changes Jan 8, 2025 and left a comment:

one comment, but LGTM in general

@openshift-ci openshift-ci bot added the lgtm label Jan 8, 2025
@openshift-ci openshift-ci bot removed the lgtm label Jan 8, 2025
openshift-ci bot commented Jan 10, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: apodhrad, bdattoma, jstourac

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot removed the lgtm label Jan 10, 2025
openshift-ci bot commented Jan 10, 2025

New changes are detected. LGTM label has been removed.

@bdattoma bdattoma merged commit a005254 into red-hat-data-services:master Jan 10, 2025
11 of 12 checks passed
bdattoma added a commit to bdattoma/ods-ci that referenced this pull request Jan 10, 2025
* workaround for amd certified operator in ocp < 4.16

* add NFD installation script and use it in AMD script

* use NFD install script in NVIDIA script

* minor change

* update warn msg

* rm unused function