The question is simple. My answer – 2.9W/EOS – is correct and based on real measurements. What about yours?
When it comes to BSGS and "space covering", a decent CPU + shitloads of RAM will leave any GPU far away. Including your setup, or any other setup that involves running BSGS on a GPU (any GPU).
BSGS does not run well on a GPU, its speed is scales of magnitude LOWER than maximum possible because IT NEEDS TO READ MEMORY AT EVERY STEP.
You are simply wasting energy running BSGS on a GPU.
Buy a CPU and shitloads of RAM if you want to run BSGS. Do not run BSGS on any GPU. Those exas/s do not exist, in reality in runs magnitudes of order SLOWER than any Kangaroo ever will. As in: the solution will be found orders of magnitudes later than Kangaroo.
It is a dumb idea to run BSGS on a GPU. GPUs are meant to compute, not to access DRAM. Reading and writing DRAM is a GPU's "kill switch" bottleneck.
And this is ofcourse futile anyway since BSGS cannot solve anything above 80 or so bits unless you go the supercomputer way.
If you want real comparisons: RC stated he reached around 14.8 Gk/s on a RTX 4090 stock clock card. First, transform your "EOS" into Gk/s (the real speed), compare the expected number of ops (which is similar) and you will see why BSGS runs orders of magnitudes slower.
I simply wanted to know how much power others are drawing on their machines at their speeds – nothing more, kTimesG. I don't feel like explaining myself anymore; I've already said a lot. If you remember, I've been writing my own algorithm for almost a year now, and I know what I have and what I'm talking about. I'm not going to argue. I also understand that I'm not providing proof, but I'm not trying to convince anyone of anything – I just wanted to know how much power you're using on your machines, calculated against your speed. I gave mine.
Regardless of the puzzle difficulty, my speed is constant. As I mentioned, a $1500 computer. From what I can see, everyone else is behind, except for Kangaroo, which rules. But as I mentioned, I have a trick – though it's based on luck, it shortens my normal search time relative to my speed. My speed is constant – every puzzle takes the same amount of time to solve. Combined with the trick, the maximum time stays the same, but I gain the ability to find the key faster, in a range from 10 seconds up to my standard time.
My question was about energy efficiency, but you took it in other directions! That's why I'm not writing more in this thread. I also asked you something before, and you avoided the topic then too. I'm not asking you for your code – I'm asking normal questions, and this was only my second question here on the forum.
I'll take this opportunity – I'm curious if your puzzle is in the 110-bit range. You put it together in an interesting way.
As for my algorithm – it finds keys above 80 bits in a reasonable time. My goal is to keep developing the algorithm and to solve Puzzle 135.
Can you tell me the time it took to find a key and for which puzzle in the configuration you mentioned here? Also, is the time always constant?: If you want real comparisons: RC stated he reached around 14.8 Gk/s on a RTX 4090 stock clock card. First, transform your "EOS" into Gk/s (the real speed), compare the expected number of ops (which is similar) and you will see why BSGS runs orders of magnitudes slower. You're forgetting that I don't use standard BSGS. I have my own implementation, my own methods, and various optimizations. What you're suggesting – doing the same as RC – would reduce me to exactly what he does. I already know I wouldn't reach that speed because I have an older GPU, but he doesn't have the same effective speed that I do – unless you're talking about Kangaroo, which you didn't specify.
If you look through my posts from the last few months, you'll see I've made progress. I'm still thinking about how to decompose problems. I also wrote kernels in assembly, plus lots of testing – because CUDA compiles nicely, but not perfectly, if you know what I mean. I focused on every aspect, and now I'm focusing on tricks that I come up with and apply.
Right now, I want to learn about power consumption relative to speed – as you know, there's practically no information about this. I know my own stats and I know it's a very good result relative to speed, but I'd like to confirm it by hearing how much power others are using.