zawawa
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
November 26, 2016, 06:20:08 PM
The murderous 8-hour Black Friday shopping is finally over... It's time for coding!
Great! I switched to Windows, so I guess I can provide tests if necessary.
That would be wonderful. As I have become more familiar with SA's code, I can now see a rather glaring problem in the current implementation. This is a huge bottleneck for both AMD and NVIDIA, guys. I don't think it's impossible to catch up with Claymore once it's fixed. My wife is working today, so hopefully I can get some results in today.
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
qwep1
November 26, 2016, 06:23:41 PM
+1
laik2
November 26, 2016, 06:32:50 PM
Quote from zawawa: "My wife is working today, so hopefully I can get some results in today."
Depending on the time zone, "today" isn't a constant value (9 p.m. here).
zawawa
November 26, 2016, 06:58:03 PM
It's 11 a.m. here in California, so you never know. The actual work doesn't look that easy, though. We will see.
ioglnx
Sr. Member
Offline
Activity: 574
Merit: 250
Fighting mob law and inquisition in this forum
November 26, 2016, 07:26:08 PM
Good, zawawa. I finally managed to bring your code into my TFS for autobuilds, so it syncs with every new commit and builds. I have split it into AMD and NVIDIA builds in the meantime because of the different OpenCL libs.
While you figure out the bottleneck (my guess is the vector calculations :-D), I will do my workout and pump. Good luck.
GTX 1080Ti rocks da house... seriously... this card is a beast³ Owning by now 18x GTX1080Ti :-D @serious love of efficiency
zawawa
November 26, 2016, 07:34:59 PM
Good stuff, good stuff.
Quote from ioglnx: "Meanwhile you figure out the bottleneck..guess the vector calculations :-D"
I would have thought so too, but it was actually algorithmic. What a surprise after days of head-scratching...
laik2
November 26, 2016, 08:16:02 PM
Quote from zawawa: "I would think so too, but it was actually algorithmic. What a surprise after days of head-scratching..."
Anxious to see what your genius has come up with.
nerdralph
November 26, 2016, 08:23:56 PM
This is indeed a major rewrite. Now I'm convinced I can do this, but it's very time-consuming. We will see.
I think I can modify ht_store to get performance on par with Optiminer. This would be straight OpenCL. If I have enough time I should have something ready to test tomorrow. The OpenCL compiler isn't behaving as I would like, so no luck with the relatively simple optimization. https://bitcointalk.org/index.php?topic=1679855.msg17000600#msg17000600
nerdralph
November 26, 2016, 08:28:13 PM
Quote from zawawa: "I would think so too, but it was actually algorithmic. What a surprise after days of head-scratching..."
In my most recent discussions with Marc, he says the atomic_add in ht_store is still a problem. Although the counter table is small enough to fit in the L2 cache, he says he's seeing a 60% miss rate. The L2 cache write-back must be lazier than we want, so if the cache line for the slot can be flushed at the end of ht_store, the hit rate should improve.
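For anyone following along who hasn't read the kernel: below is a rough, CPU-side Python model of the pattern being discussed, where each store bumps a per-row counter with an atomic add and uses the old value as the slot index. It is only a sketch under assumptions, not SA's actual OpenCL code. The class name, `NR_ROWS`, `NR_SLOTS`, and the lock standing in for the GPU atomic are all made up for illustration, and it says nothing about cache behaviour.

```python
import threading

NR_ROWS = 8    # hypothetical row count; a real table is far larger
NR_SLOTS = 4   # hypothetical number of slots per row


class HashTable:
    """Toy model: one counter per row, bumped atomically to claim a slot."""

    def __init__(self, nr_rows=NR_ROWS, nr_slots=NR_SLOTS):
        self.nr_slots = nr_slots
        self.counters = [0] * nr_rows
        self.rows = [[None] * nr_slots for _ in range(nr_rows)]
        self._lock = threading.Lock()  # stands in for the hardware atomic

    def atomic_add(self, row, value):
        """Add `value` to the row counter and return the previous count."""
        with self._lock:
            old = self.counters[row]
            self.counters[row] = old + value
            return old

    def ht_store(self, row, item):
        """Claim the next slot in `row`; return False if the row is full."""
        slot = self.atomic_add(row, 1)
        if slot >= self.nr_slots:
            return False  # overflow: the item is dropped
        self.rows[row][slot] = item
        return True
```

Every store touches the counter for its row before it touches the slot, so every store pays for at least one access to the counter table. That is why the miss rate on that table matters so much in the discussion above.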
Biodom
Legendary
Offline
Activity: 3752
Merit: 3850
November 26, 2016, 08:32:45 PM
Thanks, guys... ZEC miners at the Linux fort are anxiously waiting for the cavalry to arrive just in time to save the fort from the v8 "hordes".
giagge
Legendary
Offline
Activity: 1134
Merit: 1001
November 26, 2016, 08:55:42 PM
Quote from zawawa: "The actual work doesn't look that easy, though. We will see."
Great, zawawa, boost my Nvidia GTX 1070 pls.
zawawa
November 26, 2016, 11:28:43 PM
Quote from nerdralph: "In my most recent discussions with Marc, he says the atomic_add in ht_store is still a problem. Although the counter table is small enough to fit in L2 cache, he says he's seeing a 60% miss rate. The L2 cache write-back must be lazier than we want, so if the cache line for the slot can be flushed at the end of ht_store the hit rate should be improved."
Yeah, that function is a total b*tch... I also noticed those counters are slowing things down considerably. I don't know if we can have that kind of precise control over the L2 cache, though. I suspect the root cause is at a higher level. Luckily, I still have 8 hours my time before the end of the day. We shall see.
xeridea
November 27, 2016, 12:11:01 AM
Optiminer builds are no longer available due to people circumventing the absurd devfee for a subpar miner, lolz.
Profitability over time charts for many GPUs - http://xeridea.us/charts BTC: bc1qr2xwjwfmjn43zhrlp6pn7vwdjrjnv5z0anhjhn LTC: LXDm6sR4dkyqtEWfUbPumMnVEiUFQvxSbZ Eth: 0x44cCe2cf90C8FEE4C9e4338Ae7049913D4F6fC24
laik2
November 27, 2016, 12:26:37 AM
xeridea
November 27, 2016, 12:41:23 AM
Nah, I just happened to check on progress and saw the notice on the GitHub page. I would have no interest in removing a devfee. I just find it interesting that Optiminer attempted an absurd devfee on a miner that is not the fastest, which incentivised people to remove it to get speeds similar to the fastest.
laik2
November 27, 2016, 12:47:50 AM
Quote from xeridea: "I just find it interesting Optiminer attempted an absurd devfee, on a miner that is not the fastest, which incentivised people to remove it, to get speeds similar to the fastest."
Well... I would have used it even with a 20% fee IF the owner had guaranteed stability. There was no such guarantee, so even 1% was too much. Stupidity is hardcoded into human nature. I actually couldn't believe that someone was trying to sell a stupid proxy and wasted numerous hours securing something so stupid that it could be bypassed with simple iptables rules... oh, what a day...
zawawa
November 27, 2016, 05:35:06 AM
Alright, coding is done. I need to tweak parameters quite a bit to get optimal performance, though. I will keep you guys updated.
Amph
Legendary
Offline
Activity: 3206
Merit: 1069
November 27, 2016, 06:51:56 AM
Quote from xeridea: "Optiminer builds no longer available due to people circumventing the absurd devfee for subpar miner, lolz."
I guess this is not possible with Claymore, right? So it was something that Optiminer did wrong with the fee in his build.
Quote from zawawa: "Alright, coding is done. I need to tweak parameters quite a bit to get optimal performance, though. I will keep you guys updated."
200 sol/s per 1070 coming?
reb0rn21
Legendary
Offline
Activity: 1897
Merit: 1024
November 27, 2016, 07:00:42 AM
Sadly, a crappy 280X can do 235 sol/s at 1200/1600.
xeridea
November 27, 2016, 07:10:25 AM
Quote from Amph: "i guess this is not possible with clymore right? so it was something that optiminer did wrong with his fee in his build"
Not sure, though he does have countermeasures, and it slows down by ~5% (the same as with the -nofee option) if you try to avoid the fee. But 2.5% for a top-notch miner is a lot more reasonable than 10-15% on a subpar miner.