Bitcoin Forum
May 08, 2024, 12:17:18 AM *
Poll
Question: Do you want to see improvements in Ethash dual-mining with GGS?
I desperately need it. - 8 (15.1%)
It would be nice. - 12 (22.6%)
It's not worth it anymore. - 33 (62.3%)
Total Voters: 53

Author Topic: Gateless Gate Sharp 1.3.8: 30Mh/s (Ethash) on RX 480!  (Read 214342 times)
lucasan123
Full Member
***
Offline Offline

Activity: 219
Merit: 100



View Profile
December 23, 2016, 07:31:26 PM
Last edit: December 23, 2016, 09:58:13 PM by lucasan123
 #121

Can't you reach that many sols with the CPU as well? I have an i7 6700K.
Why is it so hard to code a good miner? We also need a powerful Equihash CPU miner.
Sorry... I thought it was possible to reach higher sols with a CPU too. Is it possible? Good work.

zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 24, 2016, 05:40:04 PM
 #122

I am currently working on a new sorting algorithm. It is faster than the previous algo with a substantially reduced LDS footprint.
Now the name of the game is, "How to take the fullest advantage of the fast shared memory?"
If I am to identify one major weakness in the original design of SA, it is inefficient use of shared memory.
For example, it relies too heavily on the L1 and L2 caches for memory reads.
It would be interesting to write up my experience with the current rewrite once it is complete.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 24, 2016, 05:41:19 PM
 #123

Quote from: lucasan123
Can't you reach that many sols with the CPU as well? I have an i7 6700K.
Why is it so hard to code a good miner? We also need a powerful Equihash CPU miner.
Sorry... I thought it was possible to reach higher sols with a CPU too. Is it possible? Good work.


That would be an interesting project. I will consider it.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 24, 2016, 07:54:26 PM
 #124

I just updated the repository. Although the GTX 1060 3GB is already doing better with the new sorting algorithm at 155 sol/s, more work is needed for AMD cards to take advantage of it.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
ioglnx
Sr. Member
****
Offline Offline

Activity: 574
Merit: 250

Fighting mob law and inquisition in this forum


View Profile
December 24, 2016, 08:15:50 PM
 #125

Good work. Keep it up. Merry Christmas, Zawawa. Thanks on behalf of the community for all your efforts, even on Christmas Eve.
Now enjoy the time with your wife and don't mention you bought a GTX 1060 :-D

GTX 1080Ti rocks da house... seriously... this card is a beast³
Owning by now 18x GTX1080Ti :-D @serious love of efficiency
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 24, 2016, 09:14:57 PM
 #126

Quote from: ioglnx
Good work. Keep it up. Merry Christmas, Zawawa. Thanks on behalf of the community for all your efforts, even on Christmas Eve.
Now enjoy the time with your wife and don't mention you bought a GTX 1060 :-D

That's very nice of you. Merry Christmas to you, too!

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 24, 2016, 09:16:34 PM
 #127

Note to self before taking a Christmas break. NR_ROWS_LOG=12 still suffers from a low occupancy at Rounds 1 through 4 due to high shared memory usage:



It would be interesting to see what would happen when I reach the top of the occupancy hill:


Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
December 25, 2016, 02:17:38 AM
 #128

Quote from: zawawa
Note to self before taking a Christmas break. NR_ROWS_LOG=12 still suffers from a low occupancy at Rounds 1 through 4 due to high shared memory usage:




Claymore's 9.2 kernel seems to have found an optimization to the Equihash algorithm by using 11 rounds. Up to round 3 it uses 32K of LDS, up to round 6 it uses 21K and 68 VGPRs, rounds 7-9 use 16K and 56 VGPRs, and round 10 uses 7K.
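
One rough way to read these figures is to estimate how many wavefronts can stay resident on a compute unit at each stage. The sketch below is only an illustration: it assumes a GCN-style compute unit (64 KB of LDS, 256 VGPRs per lane, 4 SIMDs, at most 10 wavefronts per SIMD, 64 work-items per wavefront) and a hypothetical work-group size of 256. The work-group size and the helper name `resident_waves_per_cu` are assumptions, not something taken from Claymore's kernel.

```python
def resident_waves_per_cu(lds_bytes, vgprs=None, group_size=256,
                          lds_per_cu=64 * 1024, vgprs_per_lane=256,
                          simds_per_cu=4, max_waves_per_simd=10,
                          wave_size=64):
    """Estimate how many wavefronts fit on one compute unit at once."""
    waves_per_group = -(-group_size // wave_size)  # ceiling division
    limit = simds_per_cu * max_waves_per_simd      # hardware ceiling
    if lds_bytes:
        # Only whole work-groups can be resident, so LDS is divided per group.
        limit = min(limit, (lds_per_cu // lds_bytes) * waves_per_group)
    if vgprs:
        # Each SIMD can hold as many waves as its register file allows.
        limit = min(limit, (vgprs_per_lane // vgprs) * simds_per_cu)
    return limit


# (label, LDS bytes, VGPRs) as reported in the post; None = not reported.
stages = [
    ("up to round 3", 32 * 1024, None),
    ("up to round 6", 21 * 1024, 68),
    ("rounds 7-9", 16 * 1024, 56),
    ("round 10", 7 * 1024, None),
]
for label, lds, vgprs in stages:
    waves = resident_waves_per_cu(lds, vgprs)
    print(f"{label}: {waves} of 40 wavefronts per CU")
```

Under these assumptions the LDS limit and the VGPR limit land on the same number for the two stages where both are reported, so neither resource is wasted at that work-group size. A different work-group size would change the result.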
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 25, 2016, 06:10:33 AM
 #129

Quote from: zawawa
Note to self before taking a Christmas break. NR_ROWS_LOG=12 still suffers from a low occupancy at Rounds 1 through 4 due to high shared memory usage:

Quote from: nerdralph
Claymore's 9.2 kernel seems to have found an optimization to the Equihash algorithm by using 11 rounds. Up to round 3 it uses 32K of LDS, up to round 6 it uses 21K and 68 VGPRs, rounds 7-9 use 16K and 56 VGPRs, and round 10 uses 7K.

Ah, that's how he overcame the limitations of the small amount of shared memory!
He must be splitting up some of the rounds of Wagner's so that all the slots in each row would fit into LDS.
That's very clever of him, I must say.
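
To make the "fit a whole row into LDS" idea concrete, here is a back-of-the-envelope sketch. It relies on the fact that Equihash (n=200, k=9) starts from 2^21 entries, so a table with 2^NR_ROWS_LOG rows holds about 2^21 / 2^NR_ROWS_LOG entries per row. The 32-byte slot size, the overhead factor of 2, and the 32 KB per-work-group LDS limit are assumptions chosen for illustration, and the function names are hypothetical.

```python
INITIAL_ENTRIES = 1 << 21  # Equihash (n=200, k=9) starts from 2^21 hashes


def row_footprint_bytes(nr_rows_log, slot_bytes=32, overhead=2):
    """Approximate bytes needed to keep every slot of one row in LDS.

    slot_bytes and overhead are illustrative assumptions, not values
    taken from any particular miner.
    """
    expected_slots = INITIAL_ENTRIES >> nr_rows_log
    return expected_slots * overhead * slot_bytes


def fits_in_lds(nr_rows_log, lds_limit=32 * 1024):
    """True if one row fits in the assumed per-work-group LDS budget."""
    return row_footprint_bytes(nr_rows_log) <= lds_limit


for nr_rows_log in (12, 14, 16):
    size = row_footprint_bytes(nr_rows_log)
    print(f"NR_ROWS_LOG={nr_rows_log}: {size} bytes per row, "
          f"fits in 32 KB: {fits_in_lds(nr_rows_log)}")
```

With these numbers a row at NR_ROWS_LOG=12 consumes the entire 32 KB budget. Each extra bit of row index halves the footprint, which is why splitting a round into smaller steps can trade more passes for less LDS per work-group.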

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 25, 2016, 03:48:09 PM
 #130

The whole idea of splitting rounds into optimal sizes is fascinating...
I guess my Christmas break from programming is over already :)

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
ioglnx
Sr. Member
****
Offline Offline

Activity: 574
Merit: 250

Fighting mob law and inquisition in this forum


View Profile
December 25, 2016, 05:48:59 PM
 #131

Good luck... Christmas time can be more stressful than coding. Ahje

GTX 1080Ti rocks da house... seriously... this card is a beast³
Owning by now 18x GTX1080Ti :-D @serious love of efficiency
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
December 25, 2016, 07:30:49 PM
 #132

Quote from: ioglnx
Good luck... Christmas time can be more stressful than coding. Ahje

Indeed it is... my stomach is about to burst. Instead of harvesting Zcash, try to harvest some decent Christmas meals :D
Merry Christmas to you all!

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
krnlx
Full Member
***
Offline Offline

Activity: 243
Merit: 105


View Profile
December 25, 2016, 11:42:13 PM
 #133

6x1070

Code:
Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, dev3 263.0, dev4 260.2, dev5 267.8] 104 shares
Total 1585.4 sol/s [dev0 265.6, dev1 268.9, dev2 270.9, dev3 264.2, dev4 260.4, dev5 268.6] 104 shares
Total 1585.2 sol/s [dev0 264.9, dev1 269.8, dev2 270.2, dev3 263.2, dev4 260.0, dev5 267.9] 105 shares
Total 1585.1 sol/s [dev0 264.5, dev1 270.4, dev2 269.6, dev3 263.6, dev4 260.2, dev5 267.4] 105 shares
Total 1584.6 sol/s [dev0 265.5, dev1 270.4, dev2 268.8, dev3 261.8, dev4 259.0, dev5 267.9] 105 shares
Total 1584.9 sol/s [dev0 264.3, dev1 270.7, dev2 269.1, dev3 262.1, dev4 259.8, dev5 268.2] 106 shares
Total 1585.1 sol/s [dev0 262.9, dev1 271.3, dev2 268.6, dev3 262.4, dev4 261.0, dev5 269.3] 107 shares
Total 1584.8 sol/s [dev0 262.9, dev1 270.8, dev2 267.2, dev3 261.5, dev4 260.6, dev5 269.2] 107 shares

Note: the CPU fix for Linux seems to be broken, so I tested with a dirty fix (LD_PRELOAD libtime.so). With a working CPU fix it should be 1-2% faster.
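
For anyone who wants to track these status lines over time, a minimal parsing sketch follows. It assumes the line layout shown in the log above; the regular expression and the name `parse_status_line` are illustrative and are not part of the miner.

```python
import re

# Matches lines such as: Total 1585.0 sol/s [dev0 265.9, dev1 268.7] 104 shares
LINE_RE = re.compile(r"Total\s+([\d.]+)\s+sol/s\s+\[(.*?)\]\s+(\d+)\s+shares")


def parse_status_line(line):
    """Return (total, {device: rate}, shares), or None if the line does not match."""
    match = LINE_RE.search(line)
    if match is None:
        return None
    devices = {}
    for part in match.group(2).split(","):
        name, rate = part.split()
        devices[name] = float(rate)
    return float(match.group(1)), devices, int(match.group(3))


sample = ("Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, "
          "dev3 263.0, dev4 260.2, dev5 267.8] 104 shares")
total, devices, shares = parse_status_line(sample)
device_sum = sum(devices.values())
print(f"reported total {total:.1f}, per-device sum {device_sum:.1f}, shares {shares}")
```

On the first line of the log the per-device figures add up to about 1596.4 sol/s against a reported total of 1585.0, so the two are presumably averaged differently. The sketch reports both rather than assuming they match.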
ioglnx
Sr. Member
****
Offline Offline

Activity: 574
Merit: 250

Fighting mob law and inquisition in this forum


View Profile
December 26, 2016, 12:26:18 AM
 #134

Quote from: krnlx
6x1070

Code:
Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, dev3 263.0, dev4 260.2, dev5 267.8] 104 shares
Total 1585.4 sol/s [dev0 265.6, dev1 268.9, dev2 270.9, dev3 264.2, dev4 260.4, dev5 268.6] 104 shares
Total 1585.2 sol/s [dev0 264.9, dev1 269.8, dev2 270.2, dev3 263.2, dev4 260.0, dev5 267.9] 105 shares
Total 1585.1 sol/s [dev0 264.5, dev1 270.4, dev2 269.6, dev3 263.6, dev4 260.2, dev5 267.4] 105 shares
Total 1584.6 sol/s [dev0 265.5, dev1 270.4, dev2 268.8, dev3 261.8, dev4 259.0, dev5 267.9] 105 shares
Total 1584.9 sol/s [dev0 264.3, dev1 270.7, dev2 269.1, dev3 262.1, dev4 259.8, dev5 268.2] 106 shares
Total 1585.1 sol/s [dev0 262.9, dev1 271.3, dev2 268.6, dev3 262.4, dev4 261.0, dev5 269.3] 107 shares
Total 1584.8 sol/s [dev0 262.9, dev1 270.8, dev2 267.2, dev3 261.5, dev4 260.6, dev5 269.2] 107 shares

Note: the CPU fix for Linux seems to be broken, so I tested with a dirty fix (LD_PRELOAD libtime.so). With a working CPU fix it should be 1-2% faster.

Is that the current Windows build?

GTX 1080Ti rocks da house... seriously... this card is a beast³
Owning by now 18x GTX1080Ti :-D @serious love of efficiency
krnlx
Full Member
***
Offline Offline

Activity: 243
Merit: 105


View Profile
December 26, 2016, 12:42:18 AM
 #135

Quote from: krnlx
6x1070

Code:
Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, dev3 263.0, dev4 260.2, dev5 267.8] 104 shares
Total 1585.4 sol/s [dev0 265.6, dev1 268.9, dev2 270.9, dev3 264.2, dev4 260.4, dev5 268.6] 104 shares
Total 1585.2 sol/s [dev0 264.9, dev1 269.8, dev2 270.2, dev3 263.2, dev4 260.0, dev5 267.9] 105 shares
Total 1585.1 sol/s [dev0 264.5, dev1 270.4, dev2 269.6, dev3 263.6, dev4 260.2, dev5 267.4] 105 shares
Total 1584.6 sol/s [dev0 265.5, dev1 270.4, dev2 268.8, dev3 261.8, dev4 259.0, dev5 267.9] 105 shares
Total 1584.9 sol/s [dev0 264.3, dev1 270.7, dev2 269.1, dev3 262.1, dev4 259.8, dev5 268.2] 106 shares
Total 1585.1 sol/s [dev0 262.9, dev1 271.3, dev2 268.6, dev3 262.4, dev4 261.0, dev5 269.3] 107 shares
Total 1584.8 sol/s [dev0 262.9, dev1 270.8, dev2 267.2, dev3 261.5, dev4 260.6, dev5 269.2] 107 shares

Note: the CPU fix for Linux seems to be broken, so I tested with a dirty fix (LD_PRELOAD libtime.so). With a working CPU fix it should be 1-2% faster.

Quote from: ioglnx
Is that the current Windows build?

Latest git, Linux.
ioglnx
Sr. Member
****
Offline Offline

Activity: 574
Merit: 250

Fighting mob law and inquisition in this forum


View Profile
December 26, 2016, 12:43:32 AM
 #136

Quote from: krnlx
6x1070

Code:
Total 1585.0 sol/s [dev0 265.9, dev1 268.7, dev2 270.8, dev3 263.0, dev4 260.2, dev5 267.8] 104 shares
Total 1585.4 sol/s [dev0 265.6, dev1 268.9, dev2 270.9, dev3 264.2, dev4 260.4, dev5 268.6] 104 shares
Total 1585.2 sol/s [dev0 264.9, dev1 269.8, dev2 270.2, dev3 263.2, dev4 260.0, dev5 267.9] 105 shares
Total 1585.1 sol/s [dev0 264.5, dev1 270.4, dev2 269.6, dev3 263.6, dev4 260.2, dev5 267.4] 105 shares
Total 1584.6 sol/s [dev0 265.5, dev1 270.4, dev2 268.8, dev3 261.8, dev4 259.0, dev5 267.9] 105 shares
Total 1584.9 sol/s [dev0 264.3, dev1 270.7, dev2 269.1, dev3 262.1, dev4 259.8, dev5 268.2] 106 shares
Total 1585.1 sol/s [dev0 262.9, dev1 271.3, dev2 268.6, dev3 262.4, dev4 261.0, dev5 269.3] 107 shares
Total 1584.8 sol/s [dev0 262.9, dev1 270.8, dev2 267.2, dev3 261.5, dev4 260.6, dev5 269.2] 107 shares

Note: the CPU fix for Linux seems to be broken, so I tested with a dirty fix (LD_PRELOAD libtime.so). With a working CPU fix it should be 1-2% faster.

Quote from: ioglnx
Is that the current Windows build?

Quote from: krnlx
Latest git, Linux.

Speeds are looking okay, given that it's double the old SA5. Zawawa is on the right track, I think.

GTX 1080Ti rocks da house... seriously... this card is a beast³
Owning by now 18x GTX1080Ti :-D @serious love of efficiency
Vetal_inside
Member
**
Offline Offline

Activity: 78
Merit: 10


View Profile
December 26, 2016, 03:20:59 AM
 #137

Quote from: zawawa
The slow speed is probably due either to the modded BIOS or to the driver. Mods for Claymore's do not necessarily work with Gateless Gate/SILENTARMY. I would try the stock BIOS first. Also, I only tested the miner with Crimson drivers. I suppose I need to be more clear about requirements...

This is a memory timings patch. I'm not sure it can be the reason for such a low sol rate.
But in the next few days I will try installing the latest Crimson drivers and reflashing the stock BIOS. We'll see what changes.
I just tried installing the latest drivers (16.11.2) for the 280X and testing GG v0.0.2, but the miner doesn't show a hashrate; here is a screenshot of what I got.
On GG v0.0.1 everything works well and the program shows the hashrate.
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
December 26, 2016, 04:29:36 AM
 #138

Quote from: zawawa
The whole idea of splitting rounds into optimal sizes is fascinating...
I guess my Christmas break from programming is over already :)

I can send you Claymore's kernel .il files. Email me if you want them.
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 26, 2016, 05:39:03 AM
Last edit: December 26, 2016, 06:03:33 AM by zawawa
 #139

Quote from: zawawa
The whole idea of splitting rounds into optimal sizes is fascinating...
I guess my Christmas break from programming is over already :)

Quote from: nerdralph
I can send you Claymore's kernel .il files. Email me if you want them.


Thanks for the offer, but I am pretty sure I can figure things out on my own at this point.
Besides, I greatly admire Claymore's coding skills.
Peeking into his code against his will would be disrespectful, and I don't want to do that.
Now I'm 100% confident that I can catch up with Claymore's, Optiminer, and Eqminer.
It's just a matter of time and effort.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
QuintLeo
Legendary
*
Offline Offline

Activity: 1498
Merit: 1030


View Profile
December 26, 2016, 06:21:19 AM
 #140

Quote from: zawawa
The slow speed is probably due either to the modded BIOS or to the driver. Mods for Claymore's do not necessarily work with Gateless Gate/SILENTARMY. I would try the stock BIOS first. Also, I only tested the miner with Crimson drivers. I suppose I need to be more clear about requirements...

Quote from: Vetal_inside
This is a memory timings patch. I'm not sure it can be the reason for such a low sol rate.
But in the next few days I will try installing the latest Crimson drivers and reflashing the stock BIOS. We'll see what changes.
I just tried installing the latest drivers (16.11.2) for the 280X and testing GG v0.0.2, but the miner doesn't show a hashrate; here is a screenshot of what I got.
On GG v0.0.1 everything works well and the program shows the hashrate.

Try the 15.12 drivers.

The "ReLive" junk does NOT work well for mining in particular, and is a major step backwards for pre-RX series cards in general, while being a SERIOUSLY bad case of massive bloatware.
Like many of the Crimson 16.x series drivers, it also refuses to work with a modded BIOS on many cards.

The Crimson drivers past 15.12 offered NO improvement for mining on pre-RX series cards.

I'm no longer legendary just in my own mind!
Like something I said? Donations gratefully accepted. LYLnTKvLefz9izJFUvEGQEZzSkz34b3N6U (Litecoin)
1GYbjMTPdCuV7dci3iCUiaRrcNuaiQrVYY (Bitcoin)