[Libre-soc-misc] Vectorization Article

Tue Jun 8 20:34:22 BST 2021

"Since the Xeon Phi is bottlenecked on the instruction decoder it’s
actually faster to load from memory than it is to load an immediate into a
GPR, move this into a XMM register, and then broadcast it out"

oh - this is both hilarious, shocking, and depressing...

https://godbolt.org/z/55Kax4j9f

On Tue, Jun 8, 2021 at 8:15 PM Jacob Lifshay <programmerjake at gmail.com>
wrote:

> Found this interesting article on using vectorization to run 4096 32-bit
> VMs on a 64-core Xeon Phi using AVX512:
> https://gamozolabs.github.io/fuzzing/2018/10/14/vectorized_emulation.html
>
> This is more or less what GPUs do.
>
> Jacob
> _______________________________________________
> Libre-soc-misc mailing list
> Libre-soc-misc at libre-soc.org
> http://lists.libre-soc.org/mailman/listinfo/libre-soc-misc
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.libre-soc.org/pipermail/libre-soc-misc/attachments/20210608/1ff5a62c/attachment.html>