Skip to content

Commit

Permalink
Add KFTO_MNIST training operator tests
Browse files Browse the repository at this point in the history
  • Loading branch information
ChughShilpa committed Jan 7, 2025
1 parent 0b7a4c2 commit 364b75d
Showing 1 changed file with 33 additions and 4 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ Resource ../../../../tests/Resources/Page/DistributedWorkloads/Distribu

*** Test Cases ***
Run Training operator KFTO test with NVIDIA CUDA image
[Documentation] Run Go KFTO tests for Training operator using PyTorch job with NVIDIA CUDA image
[Documentation] Run Go KFTO test for Training operator using PyTorch job with NVIDIA CUDA image
[Tags] Resources-GPU NVIDIA-GPUs
... RHOAIENG-16035
... Tier1
Expand All @@ -20,7 +20,7 @@ Run Training operator KFTO test with NVIDIA CUDA image
Run Training Operator KFTO Test TestPyTorchJobWithCuda ${CUDA_TRAINING_IMAGE}

Run Training operator KFTO test with AMD ROCm image
[Documentation] Run Go KFTO tests for Training operator using PyTorch job with AMD ROCm image
[Documentation] Run Go KFTO test for Training operator using PyTorch job with AMD ROCm image
[Tags] Resources-GPU AMD-GPUs ROCm
... RHOAIENG-16035
... Tier1
Expand All @@ -30,7 +30,7 @@ Run Training operator KFTO test with AMD ROCm image
Run Training Operator KFTO Test TestPyTorchJobWithROCm ${ROCM_TRAINING_IMAGE}

Run Training operator KFTO error handling test with NVIDIA CUDA image
[Documentation] Run Go KFTO error handling tests for Training operator using PyTorch job with NVIDIA CUDA image
[Documentation] Run Go KFTO error handling test for Training operator using PyTorch job with NVIDIA CUDA image
[Tags] RHOAIENG-14542
... Tier1
... DistributedWorkloads
Expand All @@ -39,10 +39,39 @@ Run Training operator KFTO error handling test with NVIDIA CUDA image
Run Training Operator KFTO Test TestPyTorchJobFailureWithCuda ${CUDA_TRAINING_IMAGE}

Run Training operator KFTO error handling test with AMD ROCm image
[Documentation] Run Go KFTO error handling tests for Training operator using PyTorch job with AMD ROCm image
[Documentation] Run Go KFTO error handling test for Training operator using PyTorch job with AMD ROCm image
[Tags] RHOAIENG-14542
... Tier1
... DistributedWorkloads
... Training
... TrainingOperator
Run Training Operator KFTO Test TestPyTorchJobFailureWithROCm ${ROCM_TRAINING_IMAGE}

Run Training operator KFTO_MNIST multi-node CPU test with NVIDIA CUDA image
[Documentation] Run Go KFTO_MNIST multi-node CPU test for Training operator using PyTorch job with NVIDIA CUDA image

Check warning

Code scanning / Robocop

Line is too long ({{ line_length }}/{{ allowed_length }}) Warning test

Line is too long (123/120)
[Tags] RHOAIENG-1655612
... Sanity
... DistributedWorkloads
... Training
... TrainingOperator
Run Training Operator KFTO Test TestPyTorchJobMnistCpu ${CUDA_TRAINING_IMAGE}

Run Training operator KFTO_MNIST multi-node test with NVIDIA CUDA image
[Documentation] Run Go KFTO_MNIST multi-node test for Training operator using PyTorch job with NVIDIA CUDA image
[Tags] Resources-GPU NVIDIA-GPUsss
... RHOAIENG-16556
... Tier1
... DistributedWorkloads
... Training
... TrainingOperator
Run Training Operator KFTO Test TestPyTorchJobMnistWithCuda ${CUDA_TRAINING_IMAGE}

Run Training operator KFTO_MNIST multi-node test with AMD ROCm image
[Documentation] Run Go KFTO_MNIST multi-node test for Training operator using PyTorch job with AMD ROCm image
[Tags] Resources-GPU AMD-GPUs ROCmss
... RHOAIENG-16556
... Tier1
... DistributedWorkloads
... Training
... TrainingOperator
Run Training Operator KFTO Test TestPyTorchJobMnistWithROCm ${ROCM_TRAINING_IMAGE}

0 comments on commit 364b75d

Please sign in to comment.