Bitcoin Forum
December 18, 2017, 05:06:37 AM *
News: Latest stable version of Bitcoin Core: 0.15.1  [Torrent].
 
   Home   Help Search Donate Login Register  
Pages: « 1 2 3 4 5 6 7 8 9 10 [11] 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 ... 88 »
  Print  
Author Topic: Gateless Gate Sharp 1.1.5: zawawa's open-source dual ETH/XMR/PASC/LBC/FTC miner  (Read 164604 times)
zawawa
Sr. Member
****
Online Online

Activity: 420


Miner Developer


View Profile
January 01, 2017, 02:38:43 AM
 #201

I was able to split Wagner's rounds, but it turned out that I need to reduce LDS usage further to take full advantage of the splitting. I will probably stick with the GCN assembly instead of AMD IL because I would rather not deal with another abstraction layer. It's New Year Eve my time in California. Happy mining, and I will see you guys next year!

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
1513573597
Hero Member
*
Offline Offline

Posts: 1513573597

View Profile Personal Message (Offline)

Ignore
1513573597
Reply with quote  #2

1513573597
Report to moderator
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction. Advertise here.
1513573597
Hero Member
*
Offline Offline

Posts: 1513573597

View Profile Personal Message (Offline)

Ignore
1513573597
Reply with quote  #2

1513573597
Report to moderator
hughson
Full Member
***
Online Online

Activity: 125



View Profile
January 01, 2017, 04:30:27 AM
 #202

I tried 16.11.2 drivers and still no luck. but I'll keep following this thread maybe some one will come up with a fix.Happy New Year!
zawawa
Sr. Member
****
Online Online

Activity: 420


Miner Developer


View Profile
January 02, 2017, 05:26:55 AM
 #203

So, after my wife and I watched New Year fireworks in San Francisco, I came back home at 1:30 a.m., squeezed my poor brain, and then realized that the size of the slot cache in LDS can be significantly reduced by recycling it. This should be a better way to increase occupancy than splitting rounds. We shall see.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
m1n1ngP4d4w4n
Full Member
***
Offline Offline

Activity: 154

CryptoLearner


View Profile
January 02, 2017, 06:39:02 AM
 #204

So, after my wife and I watched New Year fireworks in San Francisco, I came back home at 1:30 a.m., squeezed my poor brain, and then realized that the size of the slot cache in LDS can be significantly reduced by recycling it. This should be a better way to increase occupancy than splitting rounds. We shall see.

Firework in your brain too ?  Cool  Grin

BTC - 1B1RBYkzxiTmrbnFe2vj8EaNPSYftW8186 for tips Wink
zawawa
Sr. Member
****
Online Online

Activity: 420


Miner Developer


View Profile
January 02, 2017, 06:56:19 AM
 #205

So, after my wife and I watched New Year fireworks in San Francisco, I came back home at 1:30 a.m., squeezed my poor brain, and then realized that the size of the slot cache in LDS can be significantly reduced by recycling it. This should be a better way to increase occupancy than splitting rounds. We shall see.

Firework in your brain too ?  Cool  Grin

Yes, I would like to think I got brilliant sparks of ideas  Wink

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
ioglnx
Sr. Member
****
Offline Offline

Activity: 434

Fighting mob law and inquisition in this forum


View Profile
January 02, 2017, 02:47:24 PM
 #206

Zawawa I wish you a happy new year and all the best luck and health.
Thanks for your efforts.

GTX 1080Ti rocks da house... seriously... this card is a beast³
Owning by now 18x GTX1080Ti :-D @serious love of efficiency
m1n1ngP4d4w4n
Full Member
***
Offline Offline

Activity: 154

CryptoLearner


View Profile
January 02, 2017, 04:38:52 PM
 #207

So, after my wife and I watched New Year fireworks in San Francisco, I came back home at 1:30 a.m., squeezed my poor brain, and then realized that the size of the slot cache in LDS can be significantly reduced by recycling it. This should be a better way to increase occupancy than splitting rounds. We shall see.

Firework in your brain too ?  Cool  Grin

Yes, I would like to think I got brilliant sparks of ideas  Wink

Nice one  Grin Happy new year, and keep up the good work  Cool

BTC - 1B1RBYkzxiTmrbnFe2vj8EaNPSYftW8186 for tips Wink
zawawa
Sr. Member
****
Online Online

Activity: 420


Miner Developer


View Profile
January 03, 2017, 01:59:18 AM
 #208

Zawawa I wish you a happy new year and all the best luck and health.
Thanks for your efforts.
Thank you! The efforts do not mean much without the results, though.
AMD drivers are so flaky that I am thinking about switching to the GCN assembly sooner than later.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
ioglnx
Sr. Member
****
Offline Offline

Activity: 434

Fighting mob law and inquisition in this forum


View Profile
January 03, 2017, 09:37:05 AM
 #209

Zawawa I wish you a happy new year and all the best luck and health.
Thanks for your efforts.
Thank you! The efforts do not mean much without the results, though.
AMD drivers are so flaky that I am thinking about switching to the GCN assembly sooner than later.

Haha that took long you realized that :-D

GTX 1080Ti rocks da house... seriously... this card is a beast³
Owning by now 18x GTX1080Ti :-D @serious love of efficiency
zawawa
Sr. Member
****
Online Online

Activity: 420


Miner Developer


View Profile
January 03, 2017, 12:02:44 PM
 #210

Zawawa I wish you a happy new year and all the best luck and health.
Thanks for your efforts.
Thank you! The efforts do not mean much without the results, though.
AMD drivers are so flaky that I am thinking about switching to the GCN assembly sooner than later.

Haha that took long you realized that :-D

Oh, I knew that from the get go. I just wanted to make sure I have the fastest OpenCL kernel before getting my hands dirty with the GCN assembly.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
nerdralph
Sr. Member
****
Offline Offline

Activity: 406


View Profile
January 03, 2017, 04:48:33 PM
 #211

I will probably stick with the GCN assembly instead of AMD IL because I would rather not deal with another abstraction layer.
It looks like AMDIL is a dead-end anyway.
http://lists.llvm.org/pipermail/llvm-dev/2015-May/085684.html

HSAIL will probably short-lived since most of the work is now focused on the llvm amdgpu back-end.  It even supports inline asm, but I'm not sure if it will generate a kernel binary that conforms to AMD's CL2.0 ABI.   With clang/llvm-3.9, I've only got as far as getting it to output gcn assembler from the OpenCL + inline asm code.


laik2
Sr. Member
****
Offline Offline

Activity: 392


View Profile
January 03, 2017, 05:06:22 PM
 #212

What is the current speed based per cards?
If someone is using gg miner, please provide some feedback.

EDIT:
Quote
Total 760.7 sol/s [dev0 193.8, dev1 188.2, dev2 190.5, dev3 190.8] 15 shares
Total 760.9 sol/s [dev0 194.9, dev1 188.1, dev2 190.2, dev3 190.7] 15 shares
Total 761.2 sol/s [dev0 196.8, dev1 188.4, dev2 189.6, dev3 190.8] 15 shares
Total 761.0 sol/s [dev0 195.8, dev1 188.8, dev2 188.7, dev3 190.9] 16 shares
Total 761.9 sol/s [dev0 196.1, dev1 189.2, dev2 189.2, dev3 191.5] 17 shares
Total 761.5 sol/s [dev0 196.6, dev1 189.1, dev2 189.1, dev3 193.1] 18 shares
Total 761.1 sol/s [dev0 194.7, dev1 193.1, dev2 189.2, dev3 194.8] 18 shares
Total 761.5 sol/s [dev0 194.2, dev1 193.1, dev2 188.0, dev3 195.7] 18 shares
4x RX480 Nitros 8GB
Niice, Niice ( 840 with Claymore's and -i 2)

ZEC: t1KbbHtXqzSS6qHBaPZDKyWnzxhRjr9oCtW
m0niker
Jr. Member
*
Offline Offline

Activity: 39


View Profile
January 04, 2017, 11:05:26 PM
 #213

What is the current speed based per cards?
If someone is using gg miner, please provide some feedback.

EDIT:
Quote
Total 760.7 sol/s [dev0 193.8, dev1 188.2, dev2 190.5, dev3 190.8] 15 shares
Total 760.9 sol/s [dev0 194.9, dev1 188.1, dev2 190.2, dev3 190.7] 15 shares
Total 761.2 sol/s [dev0 196.8, dev1 188.4, dev2 189.6, dev3 190.8] 15 shares
Total 761.0 sol/s [dev0 195.8, dev1 188.8, dev2 188.7, dev3 190.9] 16 shares
Total 761.9 sol/s [dev0 196.1, dev1 189.2, dev2 189.2, dev3 191.5] 17 shares
Total 761.5 sol/s [dev0 196.6, dev1 189.1, dev2 189.1, dev3 193.1] 18 shares
Total 761.1 sol/s [dev0 194.7, dev1 193.1, dev2 189.2, dev3 194.8] 18 shares
Total 761.5 sol/s [dev0 194.2, dev1 193.1, dev2 188.0, dev3 195.7] 18 shares
4x RX480 Nitros 8GB
Niice, Niice ( 840 with Claymore's and -i 2)

Is that with the latest build, or did you compile it yourself?
laik2
Sr. Member
****
Offline Offline

Activity: 392


View Profile
January 04, 2017, 11:06:53 PM
 #214

What is the current speed based per cards?
If someone is using gg miner, please provide some feedback.

EDIT:
Quote
Total 760.7 sol/s [dev0 193.8, dev1 188.2, dev2 190.5, dev3 190.8] 15 shares
Total 760.9 sol/s [dev0 194.9, dev1 188.1, dev2 190.2, dev3 190.7] 15 shares
Total 761.2 sol/s [dev0 196.8, dev1 188.4, dev2 189.6, dev3 190.8] 15 shares
Total 761.0 sol/s [dev0 195.8, dev1 188.8, dev2 188.7, dev3 190.9] 16 shares
Total 761.9 sol/s [dev0 196.1, dev1 189.2, dev2 189.2, dev3 191.5] 17 shares
Total 761.5 sol/s [dev0 196.6, dev1 189.1, dev2 189.1, dev3 193.1] 18 shares
Total 761.1 sol/s [dev0 194.7, dev1 193.1, dev2 189.2, dev3 194.8] 18 shares
Total 761.5 sol/s [dev0 194.2, dev1 193.1, dev2 188.0, dev3 195.7] 18 shares
4x RX480 Nitros 8GB
Niice, Niice ( 840 with Claymore's and -i 2)

Is that with the latest build, or did you compile it yourself?
It's under Ubuntu 16.04 with latest amdgpu-pro drivers.

ZEC: t1KbbHtXqzSS6qHBaPZDKyWnzxhRjr9oCtW
zawawa
Sr. Member
****
Online Online

Activity: 420


Miner Developer


View Profile
January 05, 2017, 04:32:03 AM
 #215

I will probably stick with the GCN assembly instead of AMD IL because I would rather not deal with another abstraction layer.
It looks like AMDIL is a dead-end anyway.
http://lists.llvm.org/pipermail/llvm-dev/2015-May/085684.html

HSAIL will probably short-lived since most of the work is now focused on the llvm amdgpu back-end.  It even supports inline asm, but I'm not sure if it will generate a kernel binary that conforms to AMD's CL2.0 ABI.   With clang/llvm-3.9, I've only got as far as getting it to output gcn assembler from the OpenCL + inline asm code.




Like Wolf said, CLRX is the way to go if you haven't looked into it. I used it in my previous project with a great success. I am trying to figure out how to enable GDS on Ellesmere, which turned out to be rather tricky. It seems that there is no way to enable GDS with the CL2.0 ABI and you have to resort back to CL1.2 ABI with the "-legacy" build option. This totally sucks as I need to redo optimizations all over again. I have no idea as to what engineers at AMD had in mind when they decided to make this design change.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
kilo17
Legendary
*
Offline Offline

Activity: 910

aka "whocares"


View Profile
January 05, 2017, 08:03:10 AM
 #216

I appreciate what you are doing and look forward to switching my farm to your miner when it is a bit faster.  A moderate difference in hashrate is to costly with a bunch of miners running but I will accept a small loss in hashrate just to stop using the closed source stuff.

Bitcoin Will Only Succeed If The Community That Supports It Gets Support - Support Home Miners & Mining
Casalania
Full Member
***
Offline Offline

Activity: 157



View Profile
January 05, 2017, 08:39:13 AM
 #217

What is the current speed based per cards?
If someone is using gg miner, please provide some feedback.

EDIT:
Quote
Total 760.7 sol/s [dev0 193.8, dev1 188.2, dev2 190.5, dev3 190.8] 15 shares
Total 760.9 sol/s [dev0 194.9, dev1 188.1, dev2 190.2, dev3 190.7] 15 shares
Total 761.2 sol/s [dev0 196.8, dev1 188.4, dev2 189.6, dev3 190.8] 15 shares
Total 761.0 sol/s [dev0 195.8, dev1 188.8, dev2 188.7, dev3 190.9] 16 shares
Total 761.9 sol/s [dev0 196.1, dev1 189.2, dev2 189.2, dev3 191.5] 17 shares
Total 761.5 sol/s [dev0 196.6, dev1 189.1, dev2 189.1, dev3 193.1] 18 shares
Total 761.1 sol/s [dev0 194.7, dev1 193.1, dev2 189.2, dev3 194.8] 18 shares
Total 761.5 sol/s [dev0 194.2, dev1 193.1, dev2 188.0, dev3 195.7] 18 shares
4x RX480 Nitros 8GB
Niice, Niice ( 840 with Claymore's and -i 2)

That is a good speed. It is gradually catching up with Claymore miner now.

m0niker
Jr. Member
*
Offline Offline

Activity: 39


View Profile
January 05, 2017, 09:56:30 AM
 #218

Well guess I did something wrong, with the latest amdgpu-pro drivers I built with make, then ran gatelessgate.py, getting 10/sec on each 480, anyone know where I messed up?  Cry Thanks!
zawawa
Sr. Member
****
Online Online

Activity: 420


Miner Developer


View Profile
January 05, 2017, 11:47:25 AM
 #219

It turned out that the "legacy" AMD drivers require a totally different set of optimizations. This must be the reason why GG was running rather slow on older (GCN1/2) cards. I suppose optimizations for legacy drivers are worth the effort after all...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
m1n1ngP4d4w4n
Full Member
***
Offline Offline

Activity: 154

CryptoLearner


View Profile
January 05, 2017, 12:05:49 PM
 #220

It turned out that the "legacy" AMD drivers require a totally different set of optimizations. This must be the reason why GG was running rather slow on older (GCN1/2) cards. I suppose optimizations for legacy drivers are worth the effort after all...

or people could update Smiley

BTC - 1B1RBYkzxiTmrbnFe2vj8EaNPSYftW8186 for tips Wink
Pages: « 1 2 3 4 5 6 7 8 9 10 [11] 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 ... 88 »
  Print  
 
Jump to:  

Sponsored by , a Bitcoin-accepting VPN.
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!