[Libre-soc-dev] SVP64 parallel map-reduce idea
    Luke Kenneth Casson Leighton 
    lkcl at lkcl.net
       
    Sun Jun 13 18:37:51 BST 2021
    
    
  
ah then we need a prefix sum algorithm.  just a basic one would do.  i
see this is an active area of research, Raph Levien created a
memcpy-speed vulkan prefix-sum implementation for example.
the answer when running the example came out to 377.  adding up the
numbers manually they came out to 307.  i therefore assumed
(incorrectly) it was doing a prefix sum.
l.
    
    
More information about the Libre-soc-dev
mailing list