Bitcoin Forum
Poll
Question: Do you want to see improvements in Ethash dual-mining with GGS?
I desperately need it. - 8 (16%)
It would be nice. - 10 (20%)
It's not worth it anymore. - 32 (64%)
Total Voters: 50

Topic: Gateless Gate Sharp 1.3.8: 30 Mh/s (Ethash) on RX 480!
djeZo
March 24, 2017, 10:22:20 AM
 #921

GG is finally running faster with the parallelized Round 0.
I fused Round 0 with Rounds 7 and 8 to alleviate cache contamination and to improve the cache hit ratio for the next Round 1. I could even merge it with the solution-searching kernel for better results.
Good stuff.

How much speed did you gain that way?

sp_
Ccminer developer
March 24, 2017, 10:35:21 AM
 #922

His changes are open source. You can compile and check.


https://github.com/zawawawa/gatelessgate/network

djeZo
March 24, 2017, 10:50:07 AM
 #923

His changes are open source. You can compile and check.


https://github.com/zawawawa/gatelessgate/network



Don't have AMD cards so...

zawawa
Miner Developer
March 24, 2017, 01:17:39 PM
 #924

GG is finally running faster with the parallelized Round 0.
I fused Round 0 with Rounds 7 and 8 to alleviate cache contamination and to improve the cache hit ratio for the next Round 1. I could even merge it with the solution-searching kernel for better results.
Good stuff.

So how much faster?

Around 5% increase in speed at this point.
I am thinking about dynamic compilation of the kernel to simplify the blake2b calculations as nerdralph suggested.
Probably Optiminer is doing this already anyway.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
nerdralph
March 24, 2017, 01:46:30 PM
 #925

GG is finally running faster with the parallelized Round 0.
I fused Round 0 with Rounds 7 and 8 to alleviate cache contamination and to improve the cache hit ratio for the next Round 1. I could even merge it with the solution-searching kernel for better results.
Good stuff.
You also could try SLC writes to bypass the L2 cache.
zawawa
March 25, 2017, 01:05:08 PM
 #926

GG is finally running faster with the parallelized Round 0.
I fused Round 0 with Rounds 7 and 8 to alleviate cache contamination and to improve the cache hit ratio for the next Round 1. I could even merge it with the solution-searching kernel for better results.
Good stuff.
You also could try SLC writes to bypass the L2 cache.


I tried various combinations of the SLC/GLC bits, but they didn't work very well.
I think I figured out how to implement dual mining properly, though. We will see in a bit...

zawawa
March 25, 2017, 04:19:45 PM
 #927

I found that, for an efficient implementation of dual mining with a memory-bound foreground kernel and a compute-intensive background kernel, you really need to be careful with the number of wavefronts/warps and the timing of kernel launches. The whole point is to keep the foreground and background kernels resident on the GPU together for as long as possible so that execution can switch back and forth between them without a performance penalty, and that is impossible if there are too many concurrent wavefronts and/or kernel executions are not synchronized properly. Another potential issue to consider is cache pollution by the background kernel, as that could also severely degrade the performance of the foreground tasks. I wish I had known all this from the very beginning, but I suppose we all live and learn.
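
To make the point about "too many concurrent wavefronts" a bit more concrete, here is a toy check, not code from GG. It assumes a GCN-style SIMD with at most 10 resident wavefronts and 256 VGPRs per lane to share between them; the function name and the example figures are made up for illustration, and LDS and SGPR limits are ignored.

```python
# Toy residency check for two kernels sharing one GCN SIMD.
# Assumed limits (GCN-generation AMD GPUs): at most 10 wavefronts per SIMD
# and 256 vector registers (VGPRs) per lane to divide among them.
MAX_WAVES_PER_SIMD = 10
VGPRS_PER_SIMD = 256


def can_coreside(fg_waves, fg_vgprs, bg_waves, bg_vgprs):
    """Return True if both kernels' wavefronts fit on one SIMD at once.

    fg_* describes the memory-bound foreground kernel, bg_* the
    compute-intensive background kernel. *_waves is wavefronts per SIMD,
    *_vgprs is VGPRs used per work-item. LDS and SGPR limits are ignored.
    """
    waves_ok = fg_waves + bg_waves <= MAX_WAVES_PER_SIMD
    regs_ok = fg_waves * fg_vgprs + bg_waves * bg_vgprs <= VGPRS_PER_SIMD
    return waves_ok and regs_ok
```

With these hypothetical numbers, can_coreside(4, 32, 2, 64) fits exactly (6 wavefronts, 256 VGPRs), while raising the foreground kernel to 40 VGPRs per work-item no longer fits, so one kernel would have to wait for the other to drain.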

djeZo
March 25, 2017, 04:23:29 PM
 #928

I found that, for an efficient implementation of dual mining with a memory-bound foreground kernel and a compute-intensive background kernel, you really need to be careful with the number of wavefronts/warps and the timing of kernel launches. The whole point is to keep the foreground and background kernels resident on the GPU together for as long as possible so that execution can switch back and forth between them without a performance penalty, and that is impossible if there are too many concurrent wavefronts and/or kernel executions are not synchronized properly. Another potential issue to consider is cache pollution by the background kernel, as that could also severely degrade the performance of the foreground tasks. I wish I had known all this from the very beginning, but I suppose we all live and learn.

So... what you are saying is that it's not worth it, or...? Because you said you gained 5% speed with it...

zawawa
March 25, 2017, 04:26:56 PM
 #929

I found that, for an efficient implementation of dual mining with a memory-bound foreground kernel and a compute-intensive background kernel, you really need to be careful with the number of wavefronts/warps and the timing of kernel launches. The whole point is to keep the foreground and background kernels resident on the GPU together for as long as possible so that execution can switch back and forth between them without a performance penalty, and that is impossible if there are too many concurrent wavefronts and/or kernel executions are not synchronized properly. Another potential issue to consider is cache pollution by the background kernel, as that could also severely degrade the performance of the foreground tasks. I wish I had known all this from the very beginning, but I suppose we all live and learn.

So... what you are saying is that it's not worth it, or...? Because you said you gained 5% speed with it...

Oh, I think it's totally worth it. That number is with my naive initial implementation. I will give you an update shortly with a new number.

sp_
March 25, 2017, 05:33:04 PM
 #930

Another potential issue to consider is cache pollution by the background kernel, as that could also severely degrade the performance of the foreground tasks. I wish I had known all this from the very beginning, but I suppose we all live and learn.

On NVIDIA it's better to move the precalc tables into the instruction cache. That works better for dual mining/kernels.
sp_
March 25, 2017, 07:19:39 PM
 #931

And since everything but the nonce is constant for ~2.5 minutes, you can probably move some of the calculations to compile time and generate a new kernel for each new block.  Since you're already building a custom llvm, you can probably get the kernel compile and dispatch time down to a few ms.

You don't need a compiler. You can inject the blake2s precalc into the instructions. Dual mine Round 0 with self-modifying code. Should be faster than Claymore and Optiminer.
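
For readers following along, the "precalc" idea can be illustrated on the host side: absorb the bytes that stay fixed once, then reuse that saved state for every varying suffix. This is only a sketch under assumptions, not GG's kernel code and not the instruction-patching trick described above. It uses BLAKE2b (the function named earlier in the thread) via Python's hashlib, and the function names, the 50-byte digest size, and the 4-byte little-endian counter are hypothetical choices for illustration.

```python
import hashlib
import struct


def make_base_state(constant_prefix, digest_size=50):
    """Absorb the bytes that stay fixed for the whole block, once."""
    state = hashlib.blake2b(digest_size=digest_size)
    state.update(constant_prefix)
    return state


def hash_with_suffix(base_state, counter):
    """Finish the hash for one counter value without re-reading the prefix."""
    state = base_state.copy()                  # cheap copy of the saved state
    state.update(struct.pack("<I", counter))   # only the varying bytes
    return state.digest()
```

Typical use would be base = make_base_state(header_bytes) followed by hash_with_suffix(base, i) in a loop. Note that BLAKE2b works on 128-byte blocks, so only whole blocks of the prefix can be compressed ahead of time; the remainder is still processed for every suffix.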
zawawa
March 26, 2017, 05:46:58 AM
 #932

And since everything but the nonce is constant for ~2.5 minutes, you can probably move some of the calculations to compile time and generate a new kernel for each new block.  Since you're already building a custom llvm, you can probably get the kernel compile and dispatch time down to a few ms.

You don't need a compiler. You can inject the blake2s precalc into the instructions. Dual mine Round 0 with self-modifying code. Should be faster than Claymore and Optiminer.


Excellent! I am revising the data structure of the hash table right now.
Let me get to that when I'm done.

WBF1
March 26, 2017, 09:42:55 PM
 #933

Have been following this thread for a few weeks. Most of this is above my head. I do understand that there are improvements being made. 2 really basic questions though...

1) Do these improvements only help Equihash, or would they boost all algorithms? I'm specifically interested in Ethash/DaggerHashimoto.

2) Are these improvements only available on linux or would they translate to windows as well?

zawawa
March 26, 2017, 10:39:18 PM
 #934

Have been following this thread for a few weeks. Most of this is above my head. I do understand that there are improvements being made. 2 really basic questions though...

1) Do these improvements only help Equihash, or would they boost all algorithms? I'm specifically interested in Ethash/DaggerHashimoto.

2) Are these improvements only available on linux or would they translate to windows as well?

1) Yes. I am particularly interested in dual-mining.

2) Eventually. I just need to figure out how to enable one crucial feature of the graphics card on Windows. Everything else is in place.

joaocha
March 26, 2017, 11:50:12 PM
 #935

An ETH miner at the same speed as Claymore's would be really nice, since we don't have anything as good as his right now =/
zawawa
March 27, 2017, 04:50:08 AM
 #936

An ETH miner at the same speed as Claymore's would be really nice, since we don't have anything as good as his right now =/

SGMiner-GM is about as fast if you tune it, and with a new kernel, way better at low core clocks.

Great work indeed. I think an auto-tune feature would definitely help people see how fast your kernels are.

Ursul0
March 28, 2017, 09:51:44 AM
 #937

An ETH miner at the same speed as Claymore's would be really nice, since we don't have anything as good as his right now =/

SGMiner-GM is about as fast if you tune it, and with a new kernel, way better at low core clocks.

Great work indeed. I think an auto-tune feature would definitely help people see how fast your kernels are.

I actually didn't write ethash-new - dunno who did.

Hmmm... I tried but never managed to run it properly: the "ethash-new" kernel bundled with sgminer-5.5.5-gm-nicehash-8.


zawawa
March 29, 2017, 09:36:36 PM
 #938

An ETH miner at the same speed as Claymore's would be really nice, since we don't have anything as good as his right now =/

SGMiner-GM is about as fast if you tune it, and with a new kernel, way better at low core clocks.

Great work indeed. I think an auto-tune feature would definitely help people see how fast your kernels are.

I actually didn't write ethash-new - dunno who did.

Hmmm... I tried but never managed to run it properly: the "ethash-new" kernel bundled with sgminer-5.5.5-gm-nicehash-8.

I just added ethash-new.cl to GG and got 26.5 Mh/s with stock RX 480 on Windows after I tweaked the settings a little bit.
I already updated the repository, and I will upload a new version either today or tomorrow.

UnclWish
March 29, 2017, 10:21:47 PM
 #939

An ETH miner at the same speed as Claymore's would be really nice, since we don't have anything as good as his right now =/

SGMiner-GM is about as fast if you tune it, and with a new kernel, way better at low core clocks.

Great work indeed. I think an auto-tune feature would definitely help people see how fast your kernels are.

I actually didn't write ethash-new - dunno who did.

Hmmm... I tried but never managed to run it properly: the "ethash-new" kernel bundled with sgminer-5.5.5-gm-nicehash-8.

I just added ethash-new.cl to GG and got 26.5 Mh/s with stock RX 480 on Windows after I tweaked the settings a little bit.
I already updated the repository, and I will upload a new version either today or tomorrow.
Can you say which settings you tweaked, and how?
zawawa
March 30, 2017, 01:34:27 AM
 #940

Intensity, worksize, and the number of threads. You can see sample configurations in .bat files in the repository.
