[Libre-soc-misc] Vectorization Article
Luke Kenneth Casson Leighton
lkcl at lkcl.net
Tue Jun 8 20:34:22 BST 2021
"Since the Xeon Phi is bottlenecked on the instruction decoder it’s
actually faster to load from memory than it is to load an immediate into a
GPR, move this into a XMM register, and then broadcast it out"
oh - this is both hilarious, shocking, and depressing...
On Tue, Jun 8, 2021 at 8:15 PM Jacob Lifshay <programmerjake at gmail.com>
> Found this interesting article on using vectorization to run 4096 32-bit
> VMs on a 64-core Xeon Phi using AVX512:
> This is more or less what GPUs do.
> Libre-soc-misc mailing list
> Libre-soc-misc at libre-soc.org
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Libre-soc-misc