sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 28, 2015, 01:15:13 PM Last edit: December 28, 2015, 01:59:30 PM by sp_ |
|
No. Here is an example of a 16bit precalc approach: _device__ static void aes_round( const uint32_t *sharedMemory, uint32_t x0, uint32_t x1, uint32_t x2, uint32_t x3, uint32_t k0, uint32_t &y0, uint32_t &y1, uint32_t &y2, uint32_t &y3) { y0 =sharedMemory[__byte_perm(x0, x1, 0x4441)&0xffff]^ sharedMemory[(__byte_perm(x2, x3, 0x4452)&0xffff) + 256]; y1 = ...
Your table will need to be 2^16 of 32bit integers big(256KB). The precalculated 32bit value is 2 numbers from the old table xor'ed together. now the difficult part is to reduce this to eigher: 12 bits (16KB table) 13 bits (32Kb table) 14 bits (64Kb table) the slow approach would be something like: if((x0&0xff)<63 && ((x2>>8)&0xff)<63) { //14bit aes y0 =sharedMemory64[__byte_perm(x0, x1, 0x4441)&0xffff]^ sharedMemory64[(__byte_perm(x2, x3, 0x4441)&0xffff) + 256]; y1 = ... } else { //do 8bit aes y0 = xor4_32( sharedMemory[x0&0xff], sharedMemory[__byte_perm(x1, 0, 0x4441) + 256], sharedMemory[__byte_perm(x2, 0, 0x4442) + 512], sharedMemory[__byte_perm(x3, 0, 0x4443) + 768]);
... }
With this code you have a 75% probabillity of executing the fast method. (14bit lookups in shared mem 1cycle)
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 28, 2015, 02:04:43 PM |
|
No, no, I mean register pressure with the state itself all in regs. You can't really get good occupancy this way, right?
You need to remove the conditional code. the pseudo code is just an example.
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 28, 2015, 03:05:42 PM |
|
Release 74 (cuda 6.5) is using 64 regs. But the cuda 7.5 compiler is trashing the code
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 28, 2015, 03:12:43 PM |
|
If you compile for cuda 6.5 you should use the release74 version for all sourcecode files. X11 is also faster
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 28, 2015, 03:44:37 PM |
|
7.5 not 7.
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 28, 2015, 03:55:53 PM |
|
7.5 not 7.
I was using the major version number. According to my package manager, it's 7.5.18 Good. I will take a look at echo later. I see some code that can be removed
|
|
|
|
bensam1231
Legendary
Offline
Activity: 1764
Merit: 1024
|
|
December 28, 2015, 07:40:58 PM |
|
No one has released anything new in close to six months or so it seems. Everything that has been talked about here is old hat or the same epeen wars. Where you talk about how fast your super optimized miner is you only sell to a handful of really rich individuals.
Biggest improvement has come from Nicehash returning dividends with their paid bounty program, but that only applies to AMD.
I joined the Nvidia mining community because it was more open and forthright then AMD mining. That seems to have died out and turned back into the same shit as the AMD community. Nothing improves or changes. I still end up going back and mining Neoscrypt with ancient miners (that are definitely outdated). Donating doesn't do anything.
How would you know ? you never donated for neoscrypt, you just started mining it after I decided to release the source publicly. Such an hypocrite You were paid by the feathercoin community. Why would I pay you for something that is already been paid for? You worked for commission. You're not in the same shoes as SP. Conversely, I can't pay for a commission when you never offer your services besides immature and unprofessional money grabs you decide to do on a whim. Notice how when Nicehash came in here they offered real concrete bounties for set goals and they were willing to work on all of the above (and still are)? Notice how you don't do that any expect people to just throw money at you when you aren't even offering anything? even at 10 mBTC/GH/day neoscrypt was just barely profitable for maxwell. and it hasn't seen that rate in a while And I just vomited a bit. Hopefully they have some Nvidia devs working for them. Good for Nicehash, shit for Nvidia miners till they get other developers working for them. \NEOSCRYPT--
I haven't switched to dot 77+ yet because of Neoscrypt. I haven't mined any Blake algo coins either, but apparently BlakeCoin was mineable with CudaMiner back in 2013-2014.
My GTX 960 gets 340kh/s mining Neoscrypt with the latest NiceHash Miner, that makes my GTX 960 as fast as an AMD 280X with the latest NiceHash Miner. They are still using your release dot 74. If you did a hybrid compile (Pallas), NiceHash may incorporate your release dot 77+. Neoscrypt is still a top-paying algo at NiceHash, and AMD code is really being optimized there.
I may do some benchmark tests. --scryptr
Same, Neoscrypt is the goto right now. Quark and Lyra are both in the shitter at the moment and have been for a few months.
|
I buy private Nvidia miners. Send information and/or inquiries to my PM box.
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 28, 2015, 10:32:07 PM |
|
Nice findings. The compiler is different in x86 (32bit) than linux 64bit.
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 28, 2015, 11:22:55 PM |
|
The compiler engineers at NVIDIA is probobly using EESL. Altcoin miners make their own compilers from scratach.
|
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
December 29, 2015, 02:12:20 AM |
|
No one has released anything new in close to six months or so it seems. Everything that has been talked about here is old hat or the same epeen wars. Where you talk about how fast your super optimized miner is you only sell to a handful of really rich individuals.
Biggest improvement has come from Nicehash returning dividends with their paid bounty program, but that only applies to AMD.
I joined the Nvidia mining community because it was more open and forthright then AMD mining. That seems to have died out and turned back into the same shit as the AMD community. Nothing improves or changes. I still end up going back and mining Neoscrypt with ancient miners (that are definitely outdated). Donating doesn't do anything.
How would you know ? you never donated for neoscrypt, you just started mining it after I decided to release the source publicly. Such an hypocrite You were paid by the feathercoin community. Why would I pay you for something that is already been paid for? You worked for commission. You're not in the same shoes as SP. I never been paid by the feathercoin community not sure why you are always making things up, I opened an ANN for a crowdfunding campaign... , but your excuse proves, I was right not dealing with you...
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
December 29, 2015, 02:16:27 AM |
|
The compiler engineers at NVIDIA is probobly using EESL. Altcoin miners make their own compilers from scratach.
lolwut ? (are the same guy as Bensam ) ?
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
bensam1231
Legendary
Offline
Activity: 1764
Merit: 1024
|
|
December 29, 2015, 09:04:09 AM |
|
No one has released anything new in close to six months or so it seems. Everything that has been talked about here is old hat or the same epeen wars. Where you talk about how fast your super optimized miner is you only sell to a handful of really rich individuals.
Biggest improvement has come from Nicehash returning dividends with their paid bounty program, but that only applies to AMD.
I joined the Nvidia mining community because it was more open and forthright then AMD mining. That seems to have died out and turned back into the same shit as the AMD community. Nothing improves or changes. I still end up going back and mining Neoscrypt with ancient miners (that are definitely outdated). Donating doesn't do anything.
How would you know ? you never donated for neoscrypt, you just started mining it after I decided to release the source publicly. Such an hypocrite You were paid by the feathercoin community. Why would I pay you for something that is already been paid for? You worked for commission. You're not in the same shoes as SP. I never been paid by the feathercoin community not sure why you are always making things up, I opened an ANN for a crowdfunding campaign... , but your excuse proves, I was right not dealing with you... Yuh and who do you think is funding you? People NOT mining with Neoscrypt? What do you mine with Neoscrypt, oh yeah Feathercoin. Just shows you how inept you are at anything other then toodling around with kernel development... and that definitely includes economics, coin development, and, of course, business. Sure you can drudge up a few other Neoscrypt coins with almost 0 volume, community, and buy support like UFOcoin that multipools hammer all the time, but as we both know (or maybe not) those don't matter.
|
I buy private Nvidia miners. Send information and/or inquiries to my PM box.
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
December 29, 2015, 09:43:25 AM |
|
No one has released anything new in close to six months or so it seems. Everything that has been talked about here is old hat or the same epeen wars. Where you talk about how fast your super optimized miner is you only sell to a handful of really rich individuals.
Biggest improvement has come from Nicehash returning dividends with their paid bounty program, but that only applies to AMD.
I joined the Nvidia mining community because it was more open and forthright then AMD mining. That seems to have died out and turned back into the same shit as the AMD community. Nothing improves or changes. I still end up going back and mining Neoscrypt with ancient miners (that are definitely outdated). Donating doesn't do anything.
How would you know ? you never donated for neoscrypt, you just started mining it after I decided to release the source publicly. Such an hypocrite You were paid by the feathercoin community. Why would I pay you for something that is already been paid for? You worked for commission. You're not in the same shoes as SP. I never been paid by the feathercoin community not sure why you are always making things up, I opened an ANN for a crowdfunding campaign... , but your excuse proves, I was right not dealing with you... Yuh and who do you think is funding you? People NOT mining with Neoscrypt? What do you mine with Neoscrypt, oh yeah Feathercoin. Just shows you how inept you are at anything other then toodling around with kernel development... and that definitely includes economics, coin development, and, of course, business. Sure you can drudge up a few other Neoscrypt coins with almost 0 volume, community, and buy support like UFOcoin that multipools hammer all the time, but as we both know (or maybe not) those don't matter. each time I discuss with you, I have the feeling I am discussing with a complete moron, well so you won't be surprised to learn that my opinion didn't change with that message. I meant paid by the ftc devs (they didn't contribute in any way to the crowdfunding... ) only people wanting to mine ftc did pay for it and apparently even 0.02btc (around 5euros at that time) was too much for you. Hence you are probably the most irrelevant person on that whole thread when it comes to business, donation and else. Also since you mined ftc for quite some time that makes you a member of that community, hence as part of that community you didn't pay it either...
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
sambiohazard
|
|
December 29, 2015, 01:10:34 PM |
|
can any devs answer this? Decred is going to use 14 round Blake-256 algo, i am assuming this cannot be mined using current ccminer. SO will it need a new miner or little tweaks and again if anyone is interested in coding a new miner please do contact decred devs. thanks
|
|
|
|
pallas
Legendary
Offline
Activity: 2716
Merit: 1094
Black Belt Developer
|
|
December 29, 2015, 01:15:00 PM |
|
can any devs answer this? Decred is going to use 14 round Blake-256 algo, i am assuming this cannot be mined using current ccminer. SO will it need a new miner or little tweaks and again if anyone is interested in coding a new miner please do contact decred devs. thanks
I've read that it's just blakecoin with 14 rounds instead of 8, so the changes should be trivial. Then some optimising will need to be done, but it should work. That is, assuming the coin doesn't have some new weird protocol, like Sia or ether ;-)
|
|
|
|
sambiohazard
|
|
December 29, 2015, 01:17:33 PM |
|
can any devs answer this? Decred is going to use 14 round Blake-256 algo, i am assuming this cannot be mined using current ccminer. SO will it need a new miner or little tweaks and again if anyone is interested in coding a new miner please do contact decred devs. thanks
I've read that it's just blakecoin with 14 rounds instead of 8, so the changes should be trivial. Then some optimising will need to be done, but it should work. That is, assuming the coin doesn't have some new weird protocol, like Sia or ether ;-) AFAIK its completely comatible with whatever bitcoin does, one of there aim is to make sure that any development they do can adopted in bitcoin & vice versa, so i dont think there will be a problem.
|
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
December 29, 2015, 01:43:28 PM |
|
can any devs answer this? Decred is going to use 14 round Blake-256 algo, i am assuming this cannot be mined using current ccminer. SO will it need a new miner or little tweaks and again if anyone is interested in coding a new miner please do contact decred devs. thanks
I've read that it's just blakecoin with 14 rounds instead of 8, so the changes should be trivial. Then some optimising will need to be done, but it should work. That is, assuming the coin doesn't have some new weird protocol, like Sia or ether ;-) I think the original blake code is 14 rounds (but was change to 8 for some coin) I think epsylon3 implementation takes already care of this...
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 29, 2015, 01:54:39 PM |
|
14 rounds is already implemented in sp-mod release 77
ccminer -a blake (14 rounds) ccminer -a blakecoin ... (8 rounds)
14 rounds:
750ti: 495MHASH 970: 1,5GHASH 960: 952MHASH
8 rounds:
750ti: 902MHASH 970: 2,55GHASH 960: 1,6GHASH
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
December 29, 2015, 02:11:50 PM Last edit: December 29, 2015, 02:22:15 PM by sp_ |
|
I submitted a 70MHASH speedup in the blake-256 8 and 14 rounds. (@github)
750ti: 902MHASH 912MHASH
970: 2,55GHASH 2,62GHASH
960: 1,6GHASH 1,68GHASH
|
|
|
|
sambiohazard
|
|
December 29, 2015, 02:12:16 PM |
|
Thanks for all your answers. @sp_ if i make some profit from this i would be looking for an optimized version
|
|
|
|
|