
dragontamer · 4y ago

No.

Which is why I'm sure add / adc will still win at 128 bits or 256 bits.

The main issue is that the vector-add instructions have no carry-out at all, so recreating the carry is expensive. But carry propagation across n limbs can be parallelized into roughly log2(n) steps, so for a big enough bignum (maybe 1024 bits) SIMD will probably end up more efficient.
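To make the log2(n) point concrete, here is a rough sketch in plain Python, not real SIMD code. It assumes 64-bit little-endian limbs and at least one limb, and it uses a Kogge-Stone style prefix scan over per-limb "generate" and "propagate" flags to rebuild the carries that a lane-wise vector add throws away. The names `add_limbs_ripple`, `add_limbs_parallel`, `to_limbs` and `from_limbs` are made up for illustration; the ripple version stands in for an add / adc chain.

```python
MASK = (1 << 64) - 1  # assumed limb width: 64 bits


def to_limbs(x, n):
    """Split a non-negative int into n little-endian 64-bit limbs."""
    return [(x >> (64 * i)) & MASK for i in range(n)]


def from_limbs(limbs):
    """Inverse of to_limbs."""
    return sum(limb << (64 * i) for i, limb in enumerate(limbs))


def add_limbs_ripple(a, b):
    """Sequential reference: one carry passed limb to limb, like add / adc."""
    out, carry = [], 0
    for x, y in zip(a, b):
        t = x + y + carry
        out.append(t & MASK)
        carry = t >> 64
    return out, carry


def add_limbs_parallel(a, b):
    """Lane-wise add, then rebuild carries in about log2(n) scan rounds."""
    n = len(a)
    # Step 1: independent per-limb adds with the carry-out discarded,
    # which is what a vector add gives you.
    s = [(x + y) & MASK for x, y in zip(a, b)]
    # generate: this limb overflowed on its own.
    g = [si < x for si, x in zip(s, a)]
    # propagate: this limb is all ones, so a carry-in would pass through.
    p = [si == MASK for si in s]
    # Step 2: Kogge-Stone scan. Every index within a round is independent,
    # so each round could be done in parallel; the distance doubles each time.
    d = 1
    while d < n:
        g_next = [(g[i] or (p[i] and g[i - d])) if i >= d else g[i] for i in range(n)]
        p_next = [(p[i] and p[i - d]) if i >= d else p[i] for i in range(n)]
        g, p = g_next, p_next
        d *= 2
    # After the scan, g[i] is the true carry out of limb i.
    carry_in = [False] + g[:-1]
    out = [(si + int(ci)) & MASK for si, ci in zip(s, carry_in)]
    return out, int(g[-1])
```

For 16 limbs (1024 bits) the scan takes 4 rounds instead of a 16-step dependent chain, but every round is extra work on top of the adds, which fits the guess that add / adc stays ahead at 2 or 4 limbs.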


expnkx · 4y ago

even AVX512 dies

