[Libre-soc-isa] [Bug 1071] add parallel prefix sum remap mode
bugzilla-daemon at libre-soc.org
bugzilla-daemon at libre-soc.org
Mon May 1 09:30:00 BST 2023
https://bugs.libre-soc.org/show_bug.cgi?id=1071
--- Comment #20 from Jacob Lifshay <programmerjake at gmail.com> ---
(In reply to Luke Kenneth Casson Leighton from comment #19)
> (In reply to Luke Kenneth Casson Leighton from comment #15)
> > (In reply to Jacob Lifshay from comment #14)
>
> > > for i in range(0, length, chunk_size):
> > > sum += reduce_chunk(VL=chunk_size)
> > > sum += reduce_tail(VL=length % chunk_size) # dynamic VL here
> >
> > drat, you're right.
> >
> > of course. okaaay sigh so this involves creating a new Form, with Parallel
> > Reduction similar to svshape2 "carving out" its own niche... urrr...
>
> ah. realised there's a complication. VL and MAXVL determines the number
> of operations carried out, not the width of the reduction.
>
> for now the tail will have to be done by recursive macros using
> immediates manually.
that's why I'm proposing a svshape with GPR/VL input, so it can calculate the
proper VL (which is a little less than 2x bigger but doesn't follow any simple
formula I could find).
MAXVL would be adjusted following the same algorithm but using MAXVL as the
input rather than VL or a GPR.
--
You are receiving this mail because:
You are on the CC list for the bug.
More information about the Libre-SOC-ISA
mailing list