Update llama tests for block size 32 #696

aviator19941 · 2024-12-14T00:08:42Z

The block_seq_stride default is changing to 32 instead of 16, so this PR updates the tests to use the block_seq_stride flag and the new numpy inputs for block size 32 to benchmark correctly. This PR also removes the decomposed fp16 tests that are not needed anymore.

Signed-off-by: aviator19941 <[email protected]>

The block_seq_stride default is changing to 32 instead of 16, so this PR updates the tests to use the block_seq_stride flag and the new numpy inputs for block size 32 to benchmark correctly. This PR also removes the decomposed fp16 tests that are not needed anymore. --------- Signed-off-by: aviator19941 <[email protected]>

aviator19941 requested review from archana-ramalingam and saienduri December 14, 2024 00:08

archana-ramalingam approved these changes Dec 14, 2024

View reviewed changes

aviator19941 force-pushed the update_llama_test_block_sizes branch 2 times, most recently from 89d137f to 4d18c0f Compare December 16, 2024 17:32

aviator19941 added 8 commits December 16, 2024 12:31

Inital update of tests

552f0bf

Signed-off-by: aviator19941 <[email protected]>

Fix compile command and input file name

44f511e

Signed-off-by: aviator19941 <[email protected]>

Fix 8b tests

26e1142

Signed-off-by: aviator19941 <[email protected]>

Update tests

b820dbe

Signed-off-by: aviator19941 <[email protected]>

Fix 70b f16 benchmark test

5c4e96e

Signed-off-by: aviator19941 <[email protected]>

Make block_seq_stride in ExportArtifacts optional

c0c4e62

Signed-off-by: aviator19941 <[email protected]>

Add on pull request to check large llama tests

14acfb5

Signed-off-by: aviator19941 <[email protected]>

Remove on pull request test

f73651f

Signed-off-by: aviator19941 <[email protected]>

aviator19941 force-pushed the update_llama_test_block_sizes branch from 4d18c0f to f73651f Compare December 16, 2024 18:32

aviator19941 merged commit ba78824 into main Dec 16, 2024
13 checks passed

aviator19941 deleted the update_llama_test_block_sizes branch December 16, 2024 20:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update llama tests for block size 32 #696

Update llama tests for block size 32 #696

aviator19941 commented Dec 14, 2024

Update llama tests for block size 32 #696

Update llama tests for block size 32 #696

Conversation

aviator19941 commented Dec 14, 2024