style+perf: clean-up and optimize remove_empty_byte_from_padded_bytes_unchecked fn #41

samlaf · 2025-01-13T04:03:06Z

This was a fun weekend. Got to learn a crap ton about rust iterators, assembly output, godbolt, llvm, etc.

I was just trying to make this function cleaner by adopting a functional iterator, but in doing so realized the code was then much slower (up to 7x depending on input size). With 2 small modifications, managed to get the output to use pre-allocated output vector and use simd instructions for copying, which made the code 2-7x FASTER depending on input size.

Benchmarks are available in master...perf--remove-empty-byte-from-padded-bytes-fn-benchmark. Here are the results (function_fast is the function implemented in this PR):

for 32B inputs
for 32KiB inputs
for 32MiB inputs

Note: I decided to implement the functional_fast function instead of the fast function (which contains the same logic but written without iterators), because I personally find it cleaner to read. I do have to note however that the version with iterators (the one in this PR) is faster on 32KiB inputs but (slightly) slower on 32MiB. If we ever have teams sending huge bytes in the future, we might want to implement both approaches and let them pick and choose? Or perhaps have a wrapper that dispatches to the correct implementation based on input size?

…_unchecked function

There were a bunch of warnings that some of our set fmt properties were not being run: Warning: can't set `wrap_comments = true`, unstable features are only available in nightly channel. Warning: can't set `normalize_comments = true`, unstable features are only available in nightly channel.

Getting "error: toolchain 'nightly-x86_64-unknown-linux-gnu' is not installed" on github, and don't feel like debugging. Not even sure how cargo/rust are installed. Do they come preloaded by default? This reverts commit 6e87e0a.

samlaf · 2025-01-13T05:21:00Z

Note: Apologies about the large number of edits that are just formatting.... applied cargo +nightly fmt. Realized our ci is not using nightly version, which is actually needed for some of the formatting options we use (if you look at CI output you'll see a bunch of warnings). I tried changing the github workflow in 6e87e0a but the nightly toolchain wasn't available so just reverted that change.. but we should fix that.

bxue-l2 · 2025-01-13T23:11:11Z

src/kzg.rs

-    /// Precompute the primitive roots of unity for binary powers that divide r - 1
-    /// TODO(anupsv): Move this to the constants file. Ref: https://github.com/Layr-Labs/rust-kzg-bn254/issues/31
+    /// Precompute the primitive roots of unity for binary powers that divide r
+    /// - 1 TODO(anupsv): Move this to the constants file. Ref: https://github.com/Layr-Labs/rust-kzg-bn254/issues/31


why break a line at -1

argh that's what our linter when ran with nightly version does.... think I should just revert that commit?
@anupsv we'll need to look at that linter config at some point. It seems not that great.

Reverted the cargo +nightly fmt commit and formatted with stable rust instead. PTAL

nah

bxue-l2 · 2025-01-13T23:17:05Z

wait, exactly which approach you implemented? confusing to read " I do have to not however that the version with iterators (the one in this PR) is faster on 32KiB inputs but (slightly) slower on 32MiB."

Code wise it looks right to mee

anupsv

lgtm

samlaf · 2025-01-14T01:04:07Z

wait, exactly which approach you implemented? confusing to read " I do have to not however that the version with iterators (the one in this PR) is faster on 32KiB inputs but (slightly) slower on 32MiB."

Code wise it looks right to mee

Updated PR description, should have read "I do have to NOTE however"

This reverts commit ae70bf5.

style+perf: clean-up and optimize remove_empty_byte_from_padded_bytes…

a6c424e

…_unchecked function

samlaf requested review from anupsv and bxue-l2 January 13, 2025 04:03

samlaf added 3 commits January 12, 2025 23:29

style: cargo fmt

ae70bf5

Revert "ci: make cargo fmt use nightly"

1f86132

Getting "error: toolchain 'nightly-x86_64-unknown-linux-gnu' is not installed" on github, and don't feel like debugging. Not even sure how cargo/rust are installed. Do they come preloaded by default? This reverts commit 6e87e0a.

bxue-l2 previously approved these changes Jan 13, 2025

View reviewed changes

anupsv approved these changes Jan 14, 2025

View reviewed changes

samlaf added 2 commits January 13, 2025 20:06

Revert "style: cargo fmt"

d41baa8

This reverts commit ae70bf5.

style: cargo fmt

759ebee

bxue-l2 approved these changes Jan 14, 2025

View reviewed changes

samlaf merged commit b83fc92 into master Jan 14, 2025
1 check passed

samlaf deleted the perf--optimize-fn-remote-empty-byte-from-padded-bytes-unchecked branch January 14, 2025 01:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

style+perf: clean-up and optimize remove_empty_byte_from_padded_bytes_unchecked fn #41

style+perf: clean-up and optimize remove_empty_byte_from_padded_bytes_unchecked fn #41

samlaf commented Jan 13, 2025 •

edited

Loading

samlaf commented Jan 13, 2025

bxue-l2 Jan 13, 2025

samlaf Jan 14, 2025

samlaf Jan 14, 2025

bxue-l2 commented Jan 13, 2025 •

edited

Loading

anupsv left a comment

samlaf commented Jan 14, 2025

style+perf: clean-up and optimize remove_empty_byte_from_padded_bytes_unchecked fn #41

style+perf: clean-up and optimize remove_empty_byte_from_padded_bytes_unchecked fn #41

Conversation

samlaf commented Jan 13, 2025 • edited Loading

samlaf commented Jan 13, 2025

bxue-l2 Jan 13, 2025

Choose a reason for hiding this comment

samlaf Jan 14, 2025

Choose a reason for hiding this comment

samlaf Jan 14, 2025

Choose a reason for hiding this comment

bxue-l2 commented Jan 13, 2025 • edited Loading

anupsv left a comment

Choose a reason for hiding this comment

samlaf commented Jan 14, 2025

samlaf commented Jan 13, 2025 •

edited

Loading

bxue-l2 commented Jan 13, 2025 •

edited

Loading