lucasan123
|
|
December 23, 2016, 07:31:26 PM Last edit: December 23, 2016, 09:58:13 PM by lucasan123 |
|
can't you reach so much sols also with the cpu? i have a i7 6700k why is there so much problems to code a miner which is good.we need also a powerful equihash cpu miner sorry ... .i think it was possible to reach higher sols also with cpu.... is it possible? good work ............
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 24, 2016, 05:40:04 PM |
|
I am currently working on a new sorting algorithm. It is faster than the previous algo with a substantially reduced LDS footprint. Now the name of the game is, "How to take the fullest advantage of the fast shared memory?" If I am to identify one major weakness in the original design of SA, it is inefficient use of shared memory. For example, it relies too heavily on the L1 and L2 caches for memory reads. It would be interesting to write up on my experience of the current rewrite once it is complete.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 24, 2016, 05:41:19 PM |
|
can't you reach so much sols also with the cpu? i have a i7 6700k why is there so much problems to code a miner which is good.we need also a powerful equihash cpu miner sorry ... .i think it was possible to reach higher sols also with cpu.... is it possible? good work ............
That would be an interesting project. I will consider it.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 24, 2016, 07:54:26 PM |
|
I just updated the repository. Although GTX 1060 3GB is already doing better with the new sorting algorithm at 155 sol/s, more work is needed for AMD cards to take advantage of it.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
|
|
December 24, 2016, 08:15:50 PM |
|
Good work. Keep it on. Merry Christmas Zawawa. Thanks in the name of the community for all your efforts also on Christmas eve. Now enjoy the time with your wife and don't mention you bought a GTx1060 :-D
|
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 24, 2016, 09:14:57 PM |
|
Good work. Keep it on. Merry Christmas Zawawa. Thanks in the name of the community for all your efforts also on Christmas eve. Now enjoy the time with your wife and don't mention you bought a GTx1060 :-D
That's very nice of you. Merry Christmas to you, too!
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 24, 2016, 09:16:34 PM |
|
Note to self before taking a Christmas break. NR_ROWS_LOG=12 still suffers from a low occupancy at Rounds 1 through 4 due to high shared memory usage: It would be interesting to see what would happen when I reach the top of the occupancy hill:
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
nerdralph
|
|
December 25, 2016, 02:17:38 AM |
|
Note to self before taking a Christmas break. NR_ROWS_LOG=12 still suffers from a low occupancy at Rounds 1 through 4 due to high shared memory usage: Claymore's 9.2 kernel seems to have found an optimization to the equihash algorithm by using 11 rounds. Up to round 3 uses 32K LDS, up to round 6 uses 21K and 68 VGPRs, rounds 7-9 use 16K and 56 VGPRs, and round 10 uses 7K.
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 25, 2016, 06:10:33 AM |
|
Note to self before taking a Christmas break. NR_ROWS_LOG=12 still suffers from a low occupancy at Rounds 1 through 4 due to high shared memory usage: Claymore's 9.2 kernel seems to have found an optimization to the equihash algorithm by using 11 rounds. Up to round 3 uses 32K LDS, up to round 6 uses 21K and 68 VGPRs, rounds 7-9 use 16K and 56 VGPRs, and round 10 uses 7K. Ah, that's how he overcame the limitations of the small amount of shared memory! He must be splitting up some of the rounds of Wagner's so that all the slots in each row would fit into LDS. That's very clever of him, I must say.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 25, 2016, 03:48:09 PM |
|
The whole idea of splitting rounds into optimal sizes is fascinating... I guess my Christmas break from programming is over already
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
|
|
December 25, 2016, 05:48:59 PM |
|
Good luck...Christmas time can be more stressing then coding. Ahje
|
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
|
|
|
laik2
|
|
December 25, 2016, 07:30:49 PM |
|
Good luck...Christmas time can be more stressing then coding. Ahje
Indeed it is...my stomach is about to blow out. Instead of harvesting zcash, try to harvest some decent christmas meals Merry Christmas to you all!
|
|
|
|
krnlx
|
|
December 25, 2016, 11:42:13 PM |
|
6x1070 Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, dev3 263.0, dev4 260.2, dev5 267.8] 104 shares Total 1585.4 sol/s [dev0 265.6, dev1 268.9, dev2 270.9, dev3 264.2, dev4 260.4, dev5 268.6] 104 shares Total 1585.2 sol/s [dev0 264.9, dev1 269.8, dev2 270.2, dev3 263.2, dev4 260.0, dev5 267.9] 105 shares Total 1585.1 sol/s [dev0 264.5, dev1 270.4, dev2 269.6, dev3 263.6, dev4 260.2, dev5 267.4] 105 shares Total 1584.6 sol/s [dev0 265.5, dev1 270.4, dev2 268.8, dev3 261.8, dev4 259.0, dev5 267.9] 105 shares Total 1584.9 sol/s [dev0 264.3, dev1 270.7, dev2 269.1, dev3 262.1, dev4 259.8, dev5 268.2] 106 shares Total 1585.1 sol/s [dev0 262.9, dev1 271.3, dev2 268.6, dev3 262.4, dev4 261.0, dev5 269.3] 107 shares Total 1584.8 sol/s [dev0 262.9, dev1 270.8, dev2 267.2, dev3 261.5, dev4 260.6, dev5 269.2] 107 shares Note: the cpu fix for linux seems to be broken, I tested with dirty fix(LD_PRELOAD libtime.so), with cpu fix it will be faster 1-2%.
|
|
|
|
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
|
|
December 26, 2016, 12:26:18 AM |
|
6x1070 Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, dev3 263.0, dev4 260.2, dev5 267.8] 104 shares Total 1585.4 sol/s [dev0 265.6, dev1 268.9, dev2 270.9, dev3 264.2, dev4 260.4, dev5 268.6] 104 shares Total 1585.2 sol/s [dev0 264.9, dev1 269.8, dev2 270.2, dev3 263.2, dev4 260.0, dev5 267.9] 105 shares Total 1585.1 sol/s [dev0 264.5, dev1 270.4, dev2 269.6, dev3 263.6, dev4 260.2, dev5 267.4] 105 shares Total 1584.6 sol/s [dev0 265.5, dev1 270.4, dev2 268.8, dev3 261.8, dev4 259.0, dev5 267.9] 105 shares Total 1584.9 sol/s [dev0 264.3, dev1 270.7, dev2 269.1, dev3 262.1, dev4 259.8, dev5 268.2] 106 shares Total 1585.1 sol/s [dev0 262.9, dev1 271.3, dev2 268.6, dev3 262.4, dev4 261.0, dev5 269.3] 107 shares Total 1584.8 sol/s [dev0 262.9, dev1 270.8, dev2 267.2, dev3 261.5, dev4 260.6, dev5 269.2] 107 shares Note: the cpu fix for linux seems to be broken, I tested with dirty fix(LD_PRELOAD libtime.so), with cpu fix it will be faster 1-2%. That the current Windows build?
|
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
|
|
|
krnlx
|
|
December 26, 2016, 12:42:18 AM |
|
6x1070 Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, dev3 263.0, dev4 260.2, dev5 267.8] 104 shares Total 1585.4 sol/s [dev0 265.6, dev1 268.9, dev2 270.9, dev3 264.2, dev4 260.4, dev5 268.6] 104 shares Total 1585.2 sol/s [dev0 264.9, dev1 269.8, dev2 270.2, dev3 263.2, dev4 260.0, dev5 267.9] 105 shares Total 1585.1 sol/s [dev0 264.5, dev1 270.4, dev2 269.6, dev3 263.6, dev4 260.2, dev5 267.4] 105 shares Total 1584.6 sol/s [dev0 265.5, dev1 270.4, dev2 268.8, dev3 261.8, dev4 259.0, dev5 267.9] 105 shares Total 1584.9 sol/s [dev0 264.3, dev1 270.7, dev2 269.1, dev3 262.1, dev4 259.8, dev5 268.2] 106 shares Total 1585.1 sol/s [dev0 262.9, dev1 271.3, dev2 268.6, dev3 262.4, dev4 261.0, dev5 269.3] 107 shares Total 1584.8 sol/s [dev0 262.9, dev1 270.8, dev2 267.2, dev3 261.5, dev4 260.6, dev5 269.2] 107 shares Note: the cpu fix for linux seems to be broken, I tested with dirty fix(LD_PRELOAD libtime.so), with cpu fix it will be faster 1-2%. That the current Windows build? last git, linux
|
|
|
|
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
|
|
December 26, 2016, 12:43:32 AM |
|
6x1070 Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, dev3 263.0, dev4 260.2, dev5 267.8] 104 shares Total 1585.4 sol/s [dev0 265.6, dev1 268.9, dev2 270.9, dev3 264.2, dev4 260.4, dev5 268.6] 104 shares Total 1585.2 sol/s [dev0 264.9, dev1 269.8, dev2 270.2, dev3 263.2, dev4 260.0, dev5 267.9] 105 shares Total 1585.1 sol/s [dev0 264.5, dev1 270.4, dev2 269.6, dev3 263.6, dev4 260.2, dev5 267.4] 105 shares Total 1584.6 sol/s [dev0 265.5, dev1 270.4, dev2 268.8, dev3 261.8, dev4 259.0, dev5 267.9] 105 shares Total 1584.9 sol/s [dev0 264.3, dev1 270.7, dev2 269.1, dev3 262.1, dev4 259.8, dev5 268.2] 106 shares Total 1585.1 sol/s [dev0 262.9, dev1 271.3, dev2 268.6, dev3 262.4, dev4 261.0, dev5 269.3] 107 shares Total 1584.8 sol/s [dev0 262.9, dev1 270.8, dev2 267.2, dev3 261.5, dev4 260.6, dev5 269.2] 107 shares Note: the cpu fix for linux seems to be broken, I tested with dirty fix(LD_PRELOAD libtime.so), with cpu fix it will be faster 1-2%. That the current Windows build? last git, linux Speeds looking okay given the fact that its double from old SA5. Zawawa is on the right way I think.
|
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
|
|
|
Vetal_inside
Member
Offline
Activity: 78
Merit: 10
|
|
December 26, 2016, 03:20:59 AM |
|
The slow speed is probably due either to the modded BIOS or to the driver. Mods for Claymore's do not necessarily work with Gateless Gate/SILENTARMY. I would try the stock BIOS first. Also, I only tested the miner with Crimson drivers. I suppose I need to be more clear about requirements...
This is memory timings patch. Not sure that it can be a reason for this low solrate. But, on next few days I will try install latest crimson drivers and reflash stock bios. Will see what will change. I just tried to install latest drivers (16.11.2) for 280x and test GG v0.0.2 but...miner doesn't show hashrate, here is screenshot what I got. On GG v0.0.1 all works well and program shows hashrate.
|
|
|
|
nerdralph
|
|
December 26, 2016, 04:29:36 AM |
|
The whole idea of splitting rounds into optimal sizes is fascinating... I guess my Christmas break from programming is over already I can send you Claymore's kernel .il files. email me if you want them.
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 26, 2016, 05:39:03 AM Last edit: December 26, 2016, 06:03:33 AM by zawawa |
|
The whole idea of splitting rounds into optimal sizes is fascinating... I guess my Christmas break from programming is over already I can send you Claymore's kernel .il files. email me if you want them. Thanks for the offer, but I am pretty sure I can figure things out on my own at this point. Besides, I greatly admire Claymore's coding skills. Peeking into his code against his will would be disrespectful, and I don't want to do that. Now I'm 100% confident that I can catch up with Claymore's, Optiminer, and Eqminer. It's just a matter of time and effort.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
QuintLeo
Legendary
Offline
Activity: 1498
Merit: 1030
|
|
December 26, 2016, 06:21:19 AM |
|
The slow speed is probably due either to the modded BIOS or to the driver. Mods for Claymore's do not necessarily work with Gateless Gate/SILENTARMY. I would try the stock BIOS first. Also, I only tested the miner with Crimson drivers. I suppose I need to be more clear about requirements...
This is memory timings patch. Not sure that it can be a reason for this low solrate. But, on next few days I will try install latest crimson drivers and reflash stock bios. Will see what will change. I just tried to install latest drivers (16.11.2) for 280x and test GG v0.0.2 but...miner doesn't show hashrate, here is screenshot what I got. On GG v0.0.1 all works well and program shows hashrate. Try 15.12 drivers. The "Relive" junk does NOT work well for mining in particular, and is a major step backwards for pre-RX series cards in general while being a SERIOUSLY bad case of massive bloatware. It, like many of the Crimson 16.x series drivers, also refuses to work with modded BIOS on many cards. The Crimson stuff past 15.12 offered NO improvement from mining on pre-RX series cards.
|
I'm no longer legendary just in my own mind! Like something I said? Donations gratefully accepted. LYLnTKvLefz9izJFUvEGQEZzSkz34b3N6U (Litecoin) 1GYbjMTPdCuV7dci3iCUiaRrcNuaiQrVYY (Bitcoin)
|
|
|
|