This miner mangles Qubit, a form of chained hashing: Luffa, CubeHash, Shavite, SIMD, Echo. The ending _X is the amount of kernel parallelism used.
SIMD stands for "SIMD is a message digest". Silly jokes.
Legacy kernels build a big function out of all those algorithms. In my implementation instead, the phases are clearly separated. This allows me to reconfigure GPU ALU more appropriately. Most of the benefit comes from the improved SIMD and Echo implementation.
Note: same algorithms but different implementations.
To your project, the value is mostly in the new codebase which is hopefully easier to understand. I do not claim it to be complete nor efficient but it is surely more compact.