[Libre-soc-dev] PLRU algorithms
Luke Kenneth Casson Leighton
lkcl at lkcl.net
Sun Sep 13 12:44:45 BST 2020
paul, mikey, benjamin,
we've been converting dcache.vhdl for the past 2 weeks (i know it
sounds odd to do that, however we need things to be in nmigen before
being able to make significant augmentations).
in the process of that i encountered the plru.vhdl code which i
noticed uses arrays (pmux). there's also a note in it to look at
converting it to a ROM. i thought you might appreciate that almost 2
years ago we found a really decent implementation of a PLRU in
this used to be called "ariane" - it got renamed.
there's some notes on the algorithm, here:
and - ah yes! - that file indeed mentions a "ROM". we converted it to
nmigen 2 years ago:
after running yosys on each of these i can confirm that it is a *lot*
smaller (a hell of a lot less gates). i can't say it's a "ROM" - it's
more that the decision-tree is static, that bits are dropped into
multiple locations (a pascal's triangle of locations) and then ORed
and NOTed together elegantly.
the "shifts" that you see in both the python and the systemverilog
code are *not* HDL: they're variables (python variables in the case of
the nmigen code), that are used to statically compute which bits get
ORed and NOTed together.
anyway, the reason i mention this is for a number of reasons: firstly,
there's quite a lot of PLRUs used in microwatt (over 70? 64+8, 64 for
TLB, 8 for cache ways?) so there would be quite a bit of a saving,
there. secondly: i'm not actually that good at algorithms, whereas i
know that you are. consequently if you tackle this, i can then look
at what you've done and sort out libre-soc afterwards :)
crowd-funded eco-conscious hardware: https://www.crowdsupply.com/eoma68
More information about the Libre-soc-dev