[Libre-soc-dev] Vector Supercomputing ISA and 3D GPU resources
Hendrik Boom
hendrik at topoi.pooq.com
Wed Sep 15 13:39:50 BST 2021
On Tue, Sep 14, 2021 at 06:18:34PM -0700, Jacob Lifshay wrote:
> On Tue, Sep 14, 2021, 18:12 Hendrik Boom <hendrik at topoi.pooq.com> wrote:
>
> > I remember hearing of one performance analysis that determined that some
> > of the most
> > used instrucions were conditional branches. So they werked very hard
> > optimising the
> > hardware for the next version of their machine for conditional branches.
> > Once they
> > built it there was NO improvement in speed. Investigating, they
> > discovered they had
> > optimized the wait loop.
> >
>
> Oops...though realistically, optimizing the wait loop to go into lower
> power states and not spam the memory bus and resume faster does help,
> that's why x86 has the pause and monitor/mwait instructions and Arm has the
> yield instruction.
Optimizing the wait state makes sense. But they optimized the conditional
branch instead.
-- hendrik
>
> Jacob
> _______________________________________________
> Libre-soc-dev mailing list
> Libre-soc-dev at lists.libre-soc.org
> http://lists.libre-soc.org/mailman/listinfo/libre-soc-dev
More information about the Libre-soc-dev
mailing list