[Libre-soc-dev] SVP64 parallel map-reduce idea

Luke Kenneth Casson Leighton lkcl at lkcl.net
Sun Jun 13 18:37:51 BST 2021


ah, then we need a prefix sum algorithm.  just a basic one would do.  i
see this is an active area of research: Raph Levien created a
memcpy-speed vulkan prefix-sum implementation, for example.
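
as a rough sketch of what "just a basic one" could look like (plain
serial python, not SVP64 and not Raph Levien's approach; the function
names and the input list are made up purely for illustration), here is
an inclusive prefix sum next to a plain reduction, to show how the two
differ:

```python
def prefix_sum(values):
    """Inclusive prefix sum: out[i] = values[0] + ... + values[i]."""
    out = []
    running = 0
    for v in values:
        running += v          # carry the running total forward
        out.append(running)   # keep every intermediate total
    return out


def reduce_sum(values):
    """Plain reduction: only the final total is kept."""
    total = 0
    for v in values:
        total += v
    return total


# hypothetical input, not the numbers from the example in this thread
data = [1, 2, 3, 4]
print(prefix_sum(data))   # [1, 3, 6, 10]
print(reduce_sum(data))   # 10
```

the last element of the inclusive prefix sum equals the reduction
result; a prefix sum simply keeps all the intermediate totals as well.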

the answer when running the example came out to 377.  adding up the
numbers manually, i got 307.  i therefore assumed (incorrectly) it
was doing a prefix sum.

l.


