[Libre-soc-dev] SVP64 parallel map-reduce idea

Jacob Lifshay programmerjake at gmail.com
Sun Jun 13 18:54:21 BST 2021


On Sun, Jun 13, 2021, 10:38 Luke Kenneth Casson Leighton <lkcl at lkcl.net>
wrote:

> ah then we need a prefix sum algorithm.  just a basic one would do.  i
> see this is an active area of research, Raph Levien created a
> memcpy-speed vulkan prefix-sum implementation for example.
>
> the answer when running the example came out to 377.  adding up the
> numbers manually they came out to 307.  i therefore assumed
> (incorrectly) it was doing a prefix sum.
>

The example actually runs the reduce twice, once with a non-trivial
predicate, followed by once with an all-true predicate. you probably
confused the two outputs with each other.

Jacob


More information about the Libre-soc-dev mailing list