[Libre-soc-dev] RV SIMD

Luke Kenneth Casson Leighton lkcl at lkcl.net
Fri Feb 12 19:12:25 GMT 2021


https://github.com/riscv/riscv-p-spec/blob/master/P-ext-proposal.adoc

so it begins - illustrating very starkly the O(N^6) opcode proliferation.

most significant word multiply, in 32 16 and 8 bit selection.

saturation and rounding, 10 opcodes per width (8, 16) per operation (add,
mul, shift)

byte packing 40 opcodes.

signed crossed multiply (HI-LO LO-HI words)

the list goes on and on and on, a total nightmare.

SV *literally* covers the entire lot with swizzle, elwidth overrides,
saturation contexts and VL, with the addition of something like 5 or 6
instructions: average-add, min/max, abs-diff, i think that's it.

one good operation comes out of this: bitrev.  this is an extract operation
except including reversal of the bits extracted.


l.









-- 
---
crowd-funded eco-conscious hardware: https://www.crowdsupply.com/eoma68


More information about the Libre-soc-dev mailing list