speed improvement using SSE4 crc32 cpu instruction? #15

ThomasWaldmann · 2016-05-21T14:05:03Z

Intel/AMD CPUs have had special support for CRC computation for quite some years now:

http://www.drdobbs.com/parallel/fast-parallelized-crc-computation-using/229401411

https://en.wikipedia.org/wiki/SSE4#Supporting_CPUs

The drdobbs article says that this yields a performance of about 1.17 cycles per 64-bit word (measured in a loop that repeatedly computes over a small amount of data, so I guess one can assume the data sits in the L1 or L2 cache of the CPU).

At 2.4GHz, this could mean up to 16GB/s (or whatever your RAM bandwidth is limiting this value to).
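For reference, the arithmetic behind that estimate can be written out as a short sketch. The variable names are made up for illustration; the 1.17 cycles per word and the 2.4 GHz clock are the figures quoted above, and the result is an upper bound for data that is already in cache, not a measurement.

```python
# Back-of-the-envelope throughput estimate (illustrative only).
cycles_per_word = 1.17   # figure quoted from the drdobbs article
word_bytes = 8           # one 64-bit word
clock_hz = 2.4e9         # assumed 2.4 GHz core clock

words_per_second = clock_hz / cycles_per_word
bytes_per_second = words_per_second * word_bytes
gb_per_second = bytes_per_second / 1e9

print(f"{gb_per_second:.1f} GB/s")  # roughly 16.4 GB/s
```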

tpircher-zz · 2016-05-21T14:55:06Z

Hmm, this is architecture-specific and only works for one specific polynomial (0x1EDC6F41). I think it's unlikely to be implemented in pycrc any time soon.
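To make the polynomial restriction concrete: 0x1EDC6F41 is the CRC-32C (Castagnoli) polynomial. Below is a minimal bit-by-bit sketch in plain Python, not pycrc output and not the hardware path. The function name `crc32c_bitwise` is hypothetical, and the sketch assumes the usual CRC-32C conventions (reflected bit order, initial value and final XOR of 0xFFFFFFFF).

```python
def crc32c_bitwise(data: bytes) -> int:
    """Slow reference CRC-32C, processing one bit at a time."""
    # 0x82F63B78 is 0x1EDC6F41 with its 32 bits reversed, as needed
    # for the reflected (LSB-first) form of the algorithm.
    poly_reflected = 0x82F63B78
    crc = 0xFFFFFFFF  # assumed initial value
    for byte in data:
        crc ^= byte
        for _ in range(8):
            # Shift out one bit; XOR in the polynomial if that bit was set.
            crc = (crc >> 1) ^ (poly_reflected if crc & 1 else 0)
    return crc ^ 0xFFFFFFFF  # assumed final XOR


# Standard check input for CRC catalogues.
print(hex(crc32c_bitwise(b"123456789")))  # 0xe3069283
```

Any other polynomial, such as the 0x04C11DB7 one used by zlib's crc32, gives different checksums, so the hardware instruction cannot be used as a drop-in replacement for those.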

ThomasWaldmann · 2016-05-21T15:13:48Z

Pity.

Considering that intel/amd cpus less than 5 years old are quite common and many people just need some crc (not a specific one), I can imagine a lot of people could use this.

I ran test/performance.sh and the maximum I got from that was 0.806 GB/s (crc32, table-driven sb4) on a Core i5-4200u.

ThomasWaldmann changed the title ~~speed improvement using intel crc32 cpu instruction?~~ speed improvement using SSE4 crc32 cpu instruction? May 21, 2016

ThomasWaldmann mentioned this issue May 21, 2016

interesting hashes / macs / ciphers / checksums borgbackup/borg#45

Open
