Hotmetal
|
|
December 12, 2016, 02:39:33 PM |
|
Here are some stats for my fork.
What is the url to your fork?
|
|
|
|
cryptomined
|
|
December 12, 2016, 03:00:17 PM |
|
By the way, I found out the root cause of a low number of shares. It turned out that there is a bug in AMD's drivers, so you need to so specify the "-O1" build option to disable most of optimizations when you call clBuildProgram if the miner is running on AMD cards. The optimizer must have incorrectly eliminated s_waitcnt. What a wonderful world.
is your fork ready for release? thanks
|
|
|
|
kay5ive
Member
Offline
Activity: 67
Merit: 10
|
|
December 12, 2016, 05:38:11 PM |
|
By the way, I found out the root cause of a low number of shares. It turned out that there is a bug in AMD's drivers, so you need to so specify the "-O1" build option to disable most of optimizations when you call clBuildProgram if the miner is running on AMD cards. The optimizer must have incorrectly eliminated s_waitcnt. What a wonderful world.
How to fork this? Do you have a method? If you want to share with us?
|
|
|
|
nerdralph
|
|
December 12, 2016, 07:02:23 PM |
|
I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS. I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.
I think 14 will be close to ideal. I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters. That's 16KB * 2 for the row counters, or only 32KB.
|
|
|
|
Golku
|
|
December 12, 2016, 07:32:50 PM |
|
I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS. I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.
I think 14 will be close to ideal. I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters. That's 16KB * 2 for the row counters, or only 32KB. I got a feeling that this miner will beat up claymores
|
|
|
|
laik2
|
|
December 12, 2016, 07:43:36 PM |
|
I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS. I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.
I think 14 will be close to ideal. I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters. That's 16KB * 2 for the row counters, or only 32KB. What happened to your closed source miner?
|
|
|
|
Tatalk
|
|
December 12, 2016, 08:26:35 PM |
|
I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS. I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.
I think 14 will be close to ideal. I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters. That's 16KB * 2 for the row counters, or only 32KB. I got a feeling that this miner will beat up claymores I do not think the developers has much incentive to beat the Claymore. They are not rewarded properly.
|
|
|
|
issie81
|
|
December 12, 2016, 09:05:52 PM |
|
I do not think the developers has much incentive to beat the Claymore. They are not rewarded properly.
why not implement 1% dev fee? the linux miners are more stable
|
|
|
|
doktor83
|
|
December 12, 2016, 09:25:42 PM |
|
I do not think the developers has much incentive to beat the Claymore. They are not rewarded properly.
why not implement 1% dev fee? the linux miners are more stable i don't have any stability problems with claymore's, and its not a linux miner.
|
|
|
|
nerdralph
|
|
December 13, 2016, 03:03:07 AM |
|
I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS. I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.
I think 14 will be close to ideal. I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters. That's 16KB * 2 for the row counters, or only 32KB. What happened to your closed source miner? Perfection takes time. :-) If you want to see the results of some of my research into memory performance, check this out: https://github.com/nerdralph/cl-mem
|
|
|
|
eXtremal
|
|
December 13, 2016, 10:04:48 AM |
|
Did anybody implement NR_ROWS less than 16 support already ?
|
|
|
|
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
|
|
December 13, 2016, 10:08:04 AM |
|
Did anybody implement NR_ROWS less than 16 support already ?
Nope not yet..and welcome back :-D
|
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
|
|
|
zawawa
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 13, 2016, 12:49:52 PM |
|
Did anybody implement NR_ROWS less than 16 support already ?
I was wondering about the same thing... I'm almost done, but it's a major PITA to code.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
zawawa
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 13, 2016, 12:54:10 PM |
|
I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS. I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.
I think 14 will be close to ideal. I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters. That's 16KB * 2 for the row counters, or only 32KB. I agree. This is a perfect reason to use a GCN assembler and squeeze two sets of 32KB rowCounters[] into 64KB GDS. It is almost meant to be
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
Hotmetal
|
|
December 13, 2016, 01:06:00 PM |
|
I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS. I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.
I think 14 will be close to ideal. I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters. That's 16KB * 2 for the row counters, or only 32KB. I agree. This is a perfect reason to use a GCN assembler and squeeze two sets of 32KB rowCounters[] into 64KB GDS. It is almost meant to be Make it so
|
|
|
|
zawawa
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
December 14, 2016, 12:56:53 AM |
|
I will, I will. No worries. I will release my fork in a day or two. Although it is not as fast as Claymore's yet, it should be much faster than SA v5 on AMD cards. I already sold most of my NVIDIA cards, so I have no idea how it would compare on NVIDIA cards. If I had received a lot of donations, I could have justified purchasing GTX 1060 to my wife, but I could only dream...
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4VBTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
zzzzzzzzzz
|
|
December 14, 2016, 02:24:42 AM |
|
I will, I will. No worries. I will release my fork in a day or two. Although it is not as fast as Claymore's yet, it should be much faster than SA v5 on AMD cards. I already sold most of my NVIDIA cards, so I have no idea how it would compare on NVIDIA cards. If I had received a lot of donations, I could have justified purchasing GTX 1060 to my wife, but I could only dream...
Thanks for your work! Just to clarify (for myself), your fork will be Windows-only, correct? I'm sure looking forward to looking at both your new code and Marc's, when they become available. It will be very interesting to see what approaches you each took on the algo/implementation. Don't let it interfere with your Holiday with the family, though! That's much higher priority! Best wishes ...
|
|
|
|
Subw
|
|
December 14, 2016, 02:34:46 AM |
|
Although it is not as fast as Claymore's yet, it should be much faster than SA v5 on AMD cards.
how much fast? I already sold most of my NVIDIA cards, so I have no idea how it would compare on NVIDIA cards. If I had received a lot of donations, I could have justified purchasing GTX 1060 to my wife, but I could only dream...
will donate $50 for your GTX1060 fund
|
|
|
|
Subw
|
|
December 14, 2016, 02:41:56 AM |
|
your fork will be Windows-only, correct?
i hope not at least i'm not gonna donate anything for windows only software
|
|
|
|
zzzzzzzzzz
|
|
December 14, 2016, 03:04:10 AM |
|
your fork will be Windows-only, correct?
i hope not at least i'm not gonna donate anything for windows only software Marc (alias mrb) is working on the linux SA v6 miner separately, so there will be an OSS linux miner update. Also, zawawa's will be OSS, so anyone can port it to linux, if they want.
|
|
|
|
|