[Libre-soc-dev] SVP64 parallel map-reduce idea
Luke Kenneth Casson Leighton
lkcl at lkcl.net
Sun Jun 13 18:37:51 BST 2021
ah then we need a prefix sum algorithm. just a basic one would do. i
see this is an active area of research, Raph Levien created a
memcpy-speed vulkan prefix-sum implementation for example.
the answer when running the example came out to 377. adding up the
numbers manually they came out to 307. i therefore assumed
(incorrectly) it was doing a prefix sum.
More information about the Libre-soc-dev