[Libre-soc-dev] Audio and Video Codec Algorithmic analysis for instruction creation was from Re: clamping/saturation semantics

Cole Poirier colepoirier at gmail.com
Sat Dec 12 19:51:36 GMT 2020

On Saturday, December 12, 2020, Lauri Kasanen <cand at gmx.com> wrote:
> I mean: sw profiling is such a basic technique it's not useful to
> google papers. Likewise for how much power an instr takes.
> The "accelerating the C version of an algo, instr for instr, leads to a
> worse outcome than a proper transformation" is completely unrelated.
> Again not useful to google papers for, that just requires understanding
> of the algo and of the target.
> If you need an example for your own understanding, compare a naive C
> strcmp and the C strcmp you find in an optimized C library. One is a
> simple byte loop, the other a completely different paradigm. No matter
> how much faster you make the simple loop, it's still bound by the
> number of operations it does and how they depend on each other.
> Where googling papers would be useful is the specific hw design of a
> specific hw operation. The time for that is once we have hw operations
> determined. E.g. "what is the best way to implement sqrt in hw".

I see. Thanks for taking the time to explain further Lauri.

Luke, do you have any additional input about this?


More information about the Libre-soc-dev mailing list