zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 25, 2017, 07:38:03 AM |
|
Yeah, I thought I could at least prepare the next alpha version today, PST, but I encountered all sorts of weird bugs in the driver. I already sorted them out, so it should be OK. I will keep you guys updated.
|
Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
|
|
|
ghostfaceuk
|
|
January 25, 2017, 08:23:08 AM |
|
Yeah, I thought I could at least prepare the next alpha version today, PST, but I encountered all sorts of weird bugs in the driver. I already sorted them out, so it should be OK. I will keep you guys updated.
Take your time, man; it's a marathon, not a sprint. No need to push yourself too hard and cause burnout. Take a day's break. It can do wonders for your concentration and creative juices, not to mention your relationship, lol.
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 26, 2017, 03:50:37 AM |
|
Thanks, but I'd rather be done with it sooner rather than later. Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on a stock 7990. I already confirmed that GDS counters do make the miner run faster, and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scenes, but I realized I still need to work on the OpenCL kernel.
|
|
|
|
zzzzzzzzzz
|
|
January 26, 2017, 03:58:34 AM |
|
Thanks, but I'd rather be done with it sooner rather than later. Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on a stock 7990. I already confirmed that GDS counters do make the miner run faster, and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scenes, but I realized I still need to work on the OpenCL kernel.
Can't wait for an optimized Fiji kernel! Thanks for all your effort!
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 26, 2017, 07:05:49 AM |
|
Alright, I'm done with the OpenCL kernel. All I have to do now is rewrite the disassembled code of the kernel, which should be straightforward. I will work on the assembly version for the 7990 first, then I will do the same for the RX 480 on Linux. I will support both platforms, so no worries.
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 26, 2017, 07:08:35 AM |
|
Thanks, but I'd rather be done with it sooner rather than later. Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on a stock 7990. I already confirmed that GDS counters do make the miner run faster, and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scenes, but I realized I still need to work on the OpenCL kernel.
Can't wait for an optimized Fiji kernel! Thanks for all your effort!
Fiji is GCN3, so the assembly version for RX 480 should work. We will see.
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 26, 2017, 09:34:43 AM |
|
Hmm... GDS counters are not as fast as they should be. I probably need to unpack them and use uint counters.
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 26, 2017, 06:21:28 PM Last edit: January 26, 2017, 08:08:39 PM by zawawa |
|
There are 32 banks mapped to the lowest bits of the dw offset. https://community.amd.com/thread/167167 Does this mean access to GDS always goes through Bank 0 as long as the offset of the DS instruction is not specified? This cannot be right... It seems like GDS is as horrendously designed and difficult to use as VGPR indexing...
|
|
|
|
laik2
|
|
January 27, 2017, 12:23:59 AM |
|
There are 32 banks mapped to the lowest bits of the dw offset. https://community.amd.com/thread/167167 Does this mean access to GDS always goes through Bank 0 as long as the offset of the DS instruction is not specified? This cannot be right... It seems like GDS is as horrendously designed and difficult to use as VGPR indexing...
Good work. The new kernels do segfault on Ubuntu, though...
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 27, 2017, 01:25:47 AM |
|
Could be ncurses as I added colors to the UI. I will check it later.
|
|
|
|
laik2
|
|
January 27, 2017, 01:55:11 AM |
|
Could be ncurses as I added colors to the UI. I will check it later.
I mean equihash.working1/2
|
|
|
|
reb0rn21
Legendary
Offline
Activity: 1901
Merit: 1024
|
|
January 27, 2017, 02:10:37 AM |
|
@zawawa
If you know SHA-256, you might look at Pascal Coin. There is no pool miner yet, and SHA-256 needs a slight modification for it. I think many people would like to see one if the Pascal price holds. The NV solo miner is flawed and the AMD one is solo only. ocminer has worked on a pool, but nobody is writing a miner because it needs a new protocol for the pool.
|
|
|
|
xeridea
|
|
January 27, 2017, 08:12:36 AM |
|
@zawawa
If you know SHA-256, you might look at Pascal Coin. There is no pool miner yet, and SHA-256 needs a slight modification for it. I think many people would like to see one if the Pascal price holds. The NV solo miner is flawed and the AMD one is solo only. ocminer has worked on a pool, but nobody is writing a miner because it needs a new protocol for the pool.
There are no pools yet... and mining profit on it is in the toilet anyway.
|
Profitability over time charts for many GPUs - http://xeridea.us/charts BTC: bc1qr2xwjwfmjn43zhrlp6pn7vwdjrjnv5z0anhjhn LTC: LXDm6sR4dkyqtEWfUbPumMnVEiUFQvxSbZ Eth: 0x44cCe2cf90C8FEE4C9e4338Ae7049913D4F6fC24
|
|
|
m1n1ngP4d4w4n
Full Member
Offline
Activity: 224
Merit: 100
CryptoLearner
|
|
January 27, 2017, 08:27:58 AM |
|
ocminer (suprnova) is working on a pool that he should release soon.
|
|
|
|
ghostfaceuk
|
|
January 27, 2017, 09:17:19 AM Last edit: January 27, 2017, 02:26:19 PM by ghostfaceuk |
|
ocminer (suprnova) is working on a pool that he should release soon.
In that case we will need a pool miner, as I think all miners out there are currently solo based. I also hear there are problems with the CUDA version of the miner.
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 27, 2017, 03:26:01 PM |
|
I will look into that. The assembly version for 7990 is almost ready...
|
|
|
|
ghostfaceuk
|
|
January 27, 2017, 03:31:31 PM |
|
I will look into that. The assembly version for 7990 is almost ready...
Have people looked at the returns for Pascal Coin right now? I would forget mining it unless you have almost free electricity and a super high hashrate. http://whattomine.com/coins/172-pasc-pascal
|
|
|
|
nerdralph
|
|
January 27, 2017, 03:35:31 PM |
|
There are 32 banks mapped to the lowest bits of the dw offset. https://community.amd.com/thread/167167 Does this mean access to GDS always goes through Bank 0 as long as the offset of the DS instruction is not specified? This cannot be right... It seems like GDS is as horrendously designed and difficult to use as VGPR indexing...
I think "offset" is not the correct term. The GDS address (including the offset) is what matters. So dword addresses 0x0020 and 0x0021 are on different banks, but addresses 0x0020 and 0x0060 are on the same bank. Quote from GCN ISA: "The GDS is configured with 32 banks, each with 512 entries of 4 bytes each."
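To make the bank arithmetic concrete, here is a minimal Python sketch of the mapping described above. It assumes the bank is simply the dword address modulo 32 (the "lowest bits" wording); the function names `gds_bank` and `gds_size_bytes` are made up for illustration and are not part of any AMD tool or of the miner.

```python
# Figures taken from the GCN ISA quote: 32 banks, 512 entries each, 4 bytes per entry.
GDS_BANKS = 32
ENTRIES_PER_BANK = 512
BYTES_PER_ENTRY = 4


def gds_bank(dword_addr: int) -> int:
    """Bank hit by a dword address, assuming bank = lowest 5 bits of the address.

    dword_addr is the final address (base plus instruction offset), in dwords.
    """
    return dword_addr % GDS_BANKS


def gds_size_bytes() -> int:
    """Total GDS capacity implied by the quoted configuration."""
    return GDS_BANKS * ENTRIES_PER_BANK * BYTES_PER_ENTRY


if __name__ == "__main__":
    for addr in (0x0020, 0x0021, 0x0060):
        print(hex(addr), "-> bank", gds_bank(addr))
    print(gds_size_bytes(), "bytes of GDS in total")
```

Under that assumption, 0x0020 and 0x0060 both land on bank 0 while 0x0021 lands on bank 1, and the quoted configuration works out to 64KB.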
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 27, 2017, 08:41:42 PM |
|
There are 32 banks mapped to the lowest bits of the dw offset. https://community.amd.com/thread/167167 Does this mean access to GDS always goes through Bank 0 as long as the offset of the DS instruction is not specified? This cannot be right... It seems like GDS is as horrendously designed and difficult to use as VGPR indexing...
I think "offset" is not the correct term. The GDS address (including the offset) is what matters. So dword addresses 0x0020 and 0x0021 are on different banks, but addresses 0x0020 and 0x0060 are on the same bank. Quote from GCN ISA: "The GDS is configured with 32 banks, each with 512 entries of 4 bytes each."
That makes perfect sense. I did read that description of GDS, but I got really confused because realhet is usually right about almost anything related to GCN. I must be pretty tired... In any case, the fact remains that GDS bank conflicts are a serious problem and row counters must be unpacked. A single-thread assembly version is already running stably with a major speed boost. Multithreading is rather tricky because I now have to use the entire 64KB of GDS, which required modifications to the code. I am expecting the upper 400s on the 7990 with this optimization alone.
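For anyone following along, a rough Python sketch of why unpacking the counters spreads the load across banks. The packing factor of 8 counters per dword is purely an assumption for illustration, not the miner's actual layout, and all the names here are hypothetical.

```python
GDS_BANKS = 32           # from the GCN ISA quote
GDS_BYTES = 64 * 1024    # the 64KB of GDS mentioned above


def packed_bank(row: int, counters_per_dword: int = 8) -> int:
    """Bank for a row counter when several counters share one dword (assumed layout)."""
    return (row // counters_per_dword) % GDS_BANKS


def unpacked_bank(row: int) -> int:
    """Bank for a row counter when each counter gets its own uint (one dword)."""
    return row % GDS_BANKS


def banks_touched(rows, bank_of) -> int:
    """Number of distinct banks hit by a set of rows under a given layout."""
    return len({bank_of(r) for r in rows})


def max_unpacked_counters() -> int:
    """How many 4-byte counters fit in the whole GDS."""
    return GDS_BYTES // 4


if __name__ == "__main__":
    rows = range(32)  # 32 consecutive rows
    print("packed:  ", banks_touched(rows, packed_bank), "banks")
    print("unpacked:", banks_touched(rows, unpacked_bank), "banks")
    print("unpacked counters that fit:", max_unpacked_counters())
```

With these assumed numbers, 32 consecutive rows squeeze into 4 banks when packed but spread over all 32 when unpacked, at the cost of needing one dword per counter, which is where the full 64KB goes.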
|
|
|
|
zawawa (OP)
Sr. Member
Offline
Activity: 728
Merit: 304
Miner Developer
|
|
January 27, 2017, 08:44:38 PM |
|
I will look into that. The assembly version for 7990 is almost ready...
Have people looked at the returns for Pascal Coin right now? I would forget mining it unless you have almost free electricity and a super high hashrate. http://whattomine.com/coins/172-pasc-pascal
No wonder... If it uses a variant of SHA-256, it wouldn't be within the reach of GPUs anyway.
|
|
|
|
|