hughson
|
|
January 01, 2017, 04:30:27 AM |
|
I tried 16.11.2 drivers and still no luck. but I'll keep following this thread maybe some one will come up with a fix.Happy New Year!
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 02, 2017, 05:26:55 AM |
|
So, after my wife and I watched New Year fireworks in San Francisco, I came back home at 1:30 a.m., squeezed my poor brain, and then realized that the size of the slot cache in LDS can be significantly reduced by recycling it. This should be a better way to increase occupancy than splitting rounds. We shall see.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
m1n1ngP4d4w4n
Full Member
Offline
Activity: 224
Merit: 100
CryptoLearner
|
|
January 02, 2017, 06:39:02 AM |
|
So, after my wife and I watched New Year fireworks in San Francisco, I came back home at 1:30 a.m., squeezed my poor brain, and then realized that the size of the slot cache in LDS can be significantly reduced by recycling it. This should be a better way to increase occupancy than splitting rounds. We shall see.
Firework in your brain too ?
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 02, 2017, 06:56:19 AM |
|
So, after my wife and I watched New Year fireworks in San Francisco, I came back home at 1:30 a.m., squeezed my poor brain, and then realized that the size of the slot cache in LDS can be significantly reduced by recycling it. This should be a better way to increase occupancy than splitting rounds. We shall see.
Firework in your brain too ? Yes, I would like to think I got brilliant sparks of ideas
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
|
|
January 02, 2017, 02:47:24 PM |
|
Zawawa I wish you a happy new year and all the best luck and health. Thanks for your efforts.
|
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
|
|
|
m1n1ngP4d4w4n
Full Member
Offline
Activity: 224
Merit: 100
CryptoLearner
|
|
January 02, 2017, 04:38:52 PM |
|
So, after my wife and I watched New Year fireworks in San Francisco, I came back home at 1:30 a.m., squeezed my poor brain, and then realized that the size of the slot cache in LDS can be significantly reduced by recycling it. This should be a better way to increase occupancy than splitting rounds. We shall see.
Firework in your brain too ? Yes, I would like to think I got brilliant sparks of ideas Nice one Happy new year, and keep up the good work
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 03, 2017, 01:59:18 AM |
|
Zawawa I wish you a happy new year and all the best luck and health. Thanks for your efforts.
Thank you! The efforts do not mean much without the results, though. AMD drivers are so flaky that I am thinking about switching to the GCN assembly sooner than later.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
|
|
January 03, 2017, 09:37:05 AM |
|
Zawawa I wish you a happy new year and all the best luck and health. Thanks for your efforts.
Thank you! The efforts do not mean much without the results, though. AMD drivers are so flaky that I am thinking about switching to the GCN assembly sooner than later. Haha that took long you realized that :-D
|
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 03, 2017, 12:02:44 PM |
|
Zawawa I wish you a happy new year and all the best luck and health. Thanks for your efforts.
Thank you! The efforts do not mean much without the results, though. AMD drivers are so flaky that I am thinking about switching to the GCN assembly sooner than later. Haha that took long you realized that :-D Oh, I knew that from the get go. I just wanted to make sure I have the fastest OpenCL kernel before getting my hands dirty with the GCN assembly.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
nerdralph
|
|
January 03, 2017, 04:48:33 PM |
|
I will probably stick with the GCN assembly instead of AMD IL because I would rather not deal with another abstraction layer.
It looks like AMDIL is a dead-end anyway. http://lists.llvm.org/pipermail/llvm-dev/2015-May/085684.htmlHSAIL will probably short-lived since most of the work is now focused on the llvm amdgpu back-end. It even supports inline asm, but I'm not sure if it will generate a kernel binary that conforms to AMD's CL2.0 ABI. With clang/llvm-3.9, I've only got as far as getting it to output gcn assembler from the OpenCL + inline asm code.
|
|
|
|
laik2
|
|
January 03, 2017, 05:06:22 PM Last edit: January 04, 2017, 10:09:56 PM by laik2 |
|
What is the current speed based per cards? If someone is using gg miner, please provide some feedback. EDIT: Total 760.7 sol/s [dev0 193.8, dev1 188.2, dev2 190.5, dev3 190.8] 15 shares Total 760.9 sol/s [dev0 194.9, dev1 188.1, dev2 190.2, dev3 190.7] 15 shares Total 761.2 sol/s [dev0 196.8, dev1 188.4, dev2 189.6, dev3 190.8] 15 shares Total 761.0 sol/s [dev0 195.8, dev1 188.8, dev2 188.7, dev3 190.9] 16 shares Total 761.9 sol/s [dev0 196.1, dev1 189.2, dev2 189.2, dev3 191.5] 17 shares Total 761.5 sol/s [dev0 196.6, dev1 189.1, dev2 189.1, dev3 193.1] 18 shares Total 761.1 sol/s [dev0 194.7, dev1 193.1, dev2 189.2, dev3 194.8] 18 shares Total 761.5 sol/s [dev0 194.2, dev1 193.1, dev2 188.0, dev3 195.7] 18 shares
4x RX480 Nitros 8GB Niice, Niice ( 840 with Claymore's and -i 2)
|
|
|
|
m0niker
Newbie
Offline
Activity: 39
Merit: 0
|
|
January 04, 2017, 11:05:26 PM |
|
What is the current speed based per cards? If someone is using gg miner, please provide some feedback. EDIT: Total 760.7 sol/s [dev0 193.8, dev1 188.2, dev2 190.5, dev3 190.8] 15 shares Total 760.9 sol/s [dev0 194.9, dev1 188.1, dev2 190.2, dev3 190.7] 15 shares Total 761.2 sol/s [dev0 196.8, dev1 188.4, dev2 189.6, dev3 190.8] 15 shares Total 761.0 sol/s [dev0 195.8, dev1 188.8, dev2 188.7, dev3 190.9] 16 shares Total 761.9 sol/s [dev0 196.1, dev1 189.2, dev2 189.2, dev3 191.5] 17 shares Total 761.5 sol/s [dev0 196.6, dev1 189.1, dev2 189.1, dev3 193.1] 18 shares Total 761.1 sol/s [dev0 194.7, dev1 193.1, dev2 189.2, dev3 194.8] 18 shares Total 761.5 sol/s [dev0 194.2, dev1 193.1, dev2 188.0, dev3 195.7] 18 shares
4x RX480 Nitros 8GB Niice, Niice ( 840 with Claymore's and -i 2) Is that with the latest build, or did you compile it yourself?
|
|
|
|
laik2
|
|
January 04, 2017, 11:06:53 PM |
|
What is the current speed based per cards? If someone is using gg miner, please provide some feedback. EDIT: Total 760.7 sol/s [dev0 193.8, dev1 188.2, dev2 190.5, dev3 190.8] 15 shares Total 760.9 sol/s [dev0 194.9, dev1 188.1, dev2 190.2, dev3 190.7] 15 shares Total 761.2 sol/s [dev0 196.8, dev1 188.4, dev2 189.6, dev3 190.8] 15 shares Total 761.0 sol/s [dev0 195.8, dev1 188.8, dev2 188.7, dev3 190.9] 16 shares Total 761.9 sol/s [dev0 196.1, dev1 189.2, dev2 189.2, dev3 191.5] 17 shares Total 761.5 sol/s [dev0 196.6, dev1 189.1, dev2 189.1, dev3 193.1] 18 shares Total 761.1 sol/s [dev0 194.7, dev1 193.1, dev2 189.2, dev3 194.8] 18 shares Total 761.5 sol/s [dev0 194.2, dev1 193.1, dev2 188.0, dev3 195.7] 18 shares
4x RX480 Nitros 8GB Niice, Niice ( 840 with Claymore's and -i 2) Is that with the latest build, or did you compile it yourself? It's under Ubuntu 16.04 with latest amdgpu-pro drivers.
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 05, 2017, 04:32:03 AM |
|
I will probably stick with the GCN assembly instead of AMD IL because I would rather not deal with another abstraction layer.
It looks like AMDIL is a dead-end anyway. http://lists.llvm.org/pipermail/llvm-dev/2015-May/085684.htmlHSAIL will probably short-lived since most of the work is now focused on the llvm amdgpu back-end. It even supports inline asm, but I'm not sure if it will generate a kernel binary that conforms to AMD's CL2.0 ABI. With clang/llvm-3.9, I've only got as far as getting it to output gcn assembler from the OpenCL + inline asm code. Like Wolf said, CLRX is the way to go if you haven't looked into it. I used it in my previous project with a great success. I am trying to figure out how to enable GDS on Ellesmere, which turned out to be rather tricky. It seems that there is no way to enable GDS with the CL2.0 ABI and you have to resort back to CL1.2 ABI with the "-legacy" build option. This totally sucks as I need to redo optimizations all over again. I have no idea as to what engineers at AMD had in mind when they decided to make this design change.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
kilo17
Legendary
Offline
Activity: 980
Merit: 1001
aka "whocares"
|
|
January 05, 2017, 08:03:10 AM |
|
I appreciate what you are doing and look forward to switching my farm to your miner when it is a bit faster. A moderate difference in hashrate is to costly with a bunch of miners running but I will accept a small loss in hashrate just to stop using the closed source stuff.
|
Bitcoin Will Only Succeed If The Community That Supports It Gets Support - Support Home Miners & Mining
|
|
|
Casalania
Full Member
Offline
Activity: 491
Merit: 100
#SWGT PRE-SALE IS LIVE
|
|
January 05, 2017, 08:39:13 AM |
|
What is the current speed based per cards? If someone is using gg miner, please provide some feedback. EDIT: Total 760.7 sol/s [dev0 193.8, dev1 188.2, dev2 190.5, dev3 190.8] 15 shares Total 760.9 sol/s [dev0 194.9, dev1 188.1, dev2 190.2, dev3 190.7] 15 shares Total 761.2 sol/s [dev0 196.8, dev1 188.4, dev2 189.6, dev3 190.8] 15 shares Total 761.0 sol/s [dev0 195.8, dev1 188.8, dev2 188.7, dev3 190.9] 16 shares Total 761.9 sol/s [dev0 196.1, dev1 189.2, dev2 189.2, dev3 191.5] 17 shares Total 761.5 sol/s [dev0 196.6, dev1 189.1, dev2 189.1, dev3 193.1] 18 shares Total 761.1 sol/s [dev0 194.7, dev1 193.1, dev2 189.2, dev3 194.8] 18 shares Total 761.5 sol/s [dev0 194.2, dev1 193.1, dev2 188.0, dev3 195.7] 18 shares
4x RX480 Nitros 8GB Niice, Niice ( 840 with Claymore's and -i 2) That is a good speed. It is gradually catching up with Claymore miner now.
|
|
|
|
m0niker
Newbie
Offline
Activity: 39
Merit: 0
|
|
January 05, 2017, 09:56:30 AM |
|
Well guess I did something wrong, with the latest amdgpu-pro drivers I built with make, then ran gatelessgate.py, getting 10/sec on each 480, anyone know where I messed up? Thanks!
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 05, 2017, 11:47:25 AM |
|
It turned out that the "legacy" AMD drivers require a totally different set of optimizations. This must be the reason why GG was running rather slow on older (GCN1/2) cards. I suppose optimizations for legacy drivers are worth the effort after all...
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
m1n1ngP4d4w4n
Full Member
Offline
Activity: 224
Merit: 100
CryptoLearner
|
|
January 05, 2017, 12:05:49 PM |
|
It turned out that the "legacy" AMD drivers require a totally different set of optimizations. This must be the reason why GG was running rather slow on older (GCN1/2) cards. I suppose optimizations for legacy drivers are worth the effort after all...
or people could update
|
|
|
|
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
|
|
January 05, 2017, 12:07:34 PM |
|
It turned out that the "legacy" AMD drivers require a totally different set of optimizations. This must be the reason why GG was running rather slow on older (GCN1/2) cards. I suppose optimizations for legacy drivers are worth the effort after all...
or people could update Smart move :-D
|
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
|
|
|
|