Bitcoin Forum
December 12, 2018, 05:43:05 PM *
News: Latest Bitcoin Core release: 0.17.0 [Torrent].
 
   Home   Help Search Login Register More  
Poll
Question: Do you want to see improvements in Ethash dual-mining with GGS?
I desperately need it. - 8 (15.7%)
It would be nice. - 11 (21.6%)
It's not worth it anymore. - 32 (62.7%)
Total Voters: 51

Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 [23] 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 ... 198 »
  Print  
Author Topic: Gateless Gate Sharp 1.3.8: 30Mh/s (Ethash) on RX 480!  (Read 204369 times)
zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 25, 2017, 07:38:03 AM
 #441

Yeah, I thought I could at least prepare the next alpha version today, PST, but I encountered all sorts of weird bugs in the driver. I already sorted them out, so it should be OK. I will keep you guys updated.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
No Gods or Kings. Only Bitcoin
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction. Advertise here.
1544636585
Hero Member
*
Offline Offline

Posts: 1544636585

View Profile Personal Message (Offline)

Ignore
1544636585
Reply with quote  #2

1544636585
Report to moderator
1544636585
Hero Member
*
Offline Offline

Posts: 1544636585

View Profile Personal Message (Offline)

Ignore
1544636585
Reply with quote  #2

1544636585
Report to moderator
ghostfaceuk
Sr. Member
****
Offline Offline

Activity: 392
Merit: 250


View Profile
January 25, 2017, 08:23:08 AM
 #442

Yeah, I thought I could at least prepare the next alpha version today, PST, but I encountered all sorts of weird bugs in the driver. I already sorted them out, so it should be OK. I will keep you guys updated.

Take your time man, it a marathon not a sprint.


No need to push your self too hard and cause burnout. Take a days break. it can do wonders for your concentration and creative juices, not to mention your relationship lol.

zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 26, 2017, 03:50:37 AM
 #443

Thanks, but I'd rather be done with it sooner than later, though.
Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on stock 7990.
I already confirmed that GDS counters do make the miner run faster and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scene, but I realized I still need to work on the OpenCL kernel.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zzzzzzzzzz
Full Member
***
Offline Offline

Activity: 150
Merit: 100


View Profile
January 26, 2017, 03:58:34 AM
 #444

Thanks, but I'd rather be done with it sooner than later, though.
Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on stock 7990.
I already confirmed that GDS counters do make the miner run faster and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scene, but I realized I still need to work on the OpenCL kernel.

Can't wait for an optimized Fiji kernel! Thanks for all your effort!
zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 26, 2017, 07:05:49 AM
 #445

Alright, I'm done with the OpenCL kernel.
All I have to do now is to rewrite a disassembled code of the kernel, which should be straight forward.
I will work on the assembly version for 7990 first, then I will do the same for RX 480 on Linux.
I will support both platforms, so no worries.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 26, 2017, 07:08:35 AM
 #446

Thanks, but I'd rather be done with it sooner than later, though.
Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on stock 7990.
I already confirmed that GDS counters do make the miner run faster and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scene, but I realized I still need to work on the OpenCL kernel.

Can't wait for an optimized Fiji kernel! Thanks for all your effort!

Fiji is GCN3, so the assembly version for RX 480 should work.
We will  see.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 26, 2017, 09:34:43 AM
 #447

Hmm... GDS counters are not as fast as they should be.
I probably need to unpack them and use uint counters.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 26, 2017, 06:21:28 PM
 #448

Quote
There are 32 banks mapped to the lowest bits of the dw offset.
https://community.amd.com/thread/167167

Does this mean access to GDS always go through Bank 0 as long as the offset of the DS instruction is not specified?
This cannot be right... It seems like GDS is as horrendously designed difficult to use as VGPR indexing...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
laik2
Sr. Member
****
Offline Offline

Activity: 471
Merit: 260



View Profile
January 27, 2017, 12:23:59 AM
 #449

Quote
There are 32 banks mapped to the lowest bits of the dw offset.
https://community.amd.com/thread/167167

Does this mean access to GDS always go through Bank 0 as long as the offset of the DS instruction is not specified?
This cannot be right... It seems like GDS is as horrendously designed difficult to use as VGPR indexing...
Good work, new kernels do segfaults on ubuntu, though...

ZEC: t1KbbHtXqzSS6qHBaPZDKyWnzxhRjr9oCtW
zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 27, 2017, 01:25:47 AM
 #450

Could be ncurses as I added colors to the UI. I will check it later.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
laik2
Sr. Member
****
Offline Offline

Activity: 471
Merit: 260



View Profile
January 27, 2017, 01:55:11 AM
 #451

Could be ncurses as I added colors to the UI. I will check it later.
I mean equihash.working1/2 Smiley

ZEC: t1KbbHtXqzSS6qHBaPZDKyWnzxhRjr9oCtW
reb0rn21
Legendary
*
Offline Offline

Activity: 1526
Merit: 1000


View Profile
January 27, 2017, 02:10:37 AM
 #452

@zawawa

If you know sha256  you might look at pascal coin, no pool miner yet and sha256 need slight modification for it, i think many ppl would like to see it, if pascal price hold
NV solo miner is flowed and AMD is just solo
ocminer worked at pool but none is doing miner because it need new protocol for pool

Bitrated user: reb0rn.
xeridea
Sr. Member
****
Offline Offline

Activity: 419
Merit: 251


View Profile WWW
January 27, 2017, 08:12:36 AM
 #453

@zawawa

If you know sha256  you might look at pascal coin, no pool miner yet and sha256 need slight modification for it, i think many ppl would like to see it, if pascal price hold
NV solo miner is flowed and AMD is just solo
ocminer worked at pool but none is doing miner because it need new protocol for pool
There are no pools yet... and mining profit on it is in the toilet anyway.

Profitability over time charts for many GPUs - http://xeridea.us/charts

BTC:  16wzGLYLh1ximotu3Ln7htKnbUUcwWvQUv   LTC:  LdPvSJoAwgH87TXSMBuxDefBvp2bweXApY   Eth:  0xb508131ca5d983ebe72f8af61ecfb7d1b61f6d18
m1n1ngP4d4w4n
Full Member
***
Offline Offline

Activity: 224
Merit: 100

CryptoLearner


View Profile
January 27, 2017, 08:27:58 AM
 #454

ocminer (suprnova) is working on a pool he should release soon.

BTC - 1B1RBYkzxiTmrbnFe2vj8EaNPSYftW8186 for tips Wink
ghostfaceuk
Sr. Member
****
Offline Offline

Activity: 392
Merit: 250


View Profile
January 27, 2017, 09:17:19 AM
 #455

ocminer (suprnova) is working on a pool he should release soon.

In that case we will need a pool miner as I think all miners out there are currently solo based, I also hear there is problems with the Cuda version of the miner

zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 27, 2017, 03:26:01 PM
 #456

I will look into that. The assembly version for 7990 is almost ready...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
ghostfaceuk
Sr. Member
****
Offline Offline

Activity: 392
Merit: 250


View Profile
January 27, 2017, 03:31:31 PM
 #457

I will look into that. The assembly version for 7990 is almost ready...

have people looked at the returns for pascal coin right now? I would forget mining unless you have almost free electric and super high amounts of hashrate. http://whattomine.com/coins/172-pasc-pascal


nerdralph
Sr. Member
****
Offline Offline

Activity: 574
Merit: 251


View Profile
January 27, 2017, 03:35:31 PM
 #458

Quote
There are 32 banks mapped to the lowest bits of the dw offset.
https://community.amd.com/thread/167167

Does this mean access to GDS always go through Bank 0 as long as the offset of the DS instruction is not specified?
This cannot be right... It seems like GDS is as horrendously designed difficult to use as VGPR indexing...

I think "offset" is not the correct term.  The GDS address (including the offset) is what matters.  So dword addresses 0x0020 and 0x0021 are on different banks, but addresses 0x0020 and 0x0060 are on the same bank.
Quote from GCN ISA: "The GDS is configured with 32 banks, each with 512 entries of 4 bytes each."

zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 27, 2017, 08:41:42 PM
 #459

Quote
There are 32 banks mapped to the lowest bits of the dw offset.
https://community.amd.com/thread/167167

Does this mean access to GDS always go through Bank 0 as long as the offset of the DS instruction is not specified?
This cannot be right... It seems like GDS is as horrendously designed difficult to use as VGPR indexing...

I think "offset" is not the correct term.  The GDS address (including the offset) is what matters.  So dword addresses 0x0020 and 0x0021 are on different banks, but addresses 0x0020 and 0x0060 are on the same bank.
Quote from GCN ISA: "The GDS is configured with 32 banks, each with 512 entries of 4 bytes each."



That makes a perfect sense. I did read that description about GDS, but I got really confused because realhet is usually right about almost anything related to GCN. I must be pretty tired...

In any case, the fact remains that GDS bank conflicts are a serious problem and row counters must be unpacked. A single thread assembly version is already running stably with a major speed boost. Multithreading is rather tricky because I now have to use the entire 64KB of GDS, which is rather tricky and required modifications of the code. I am expecting an upper 400's on 7990 with this optimization alone.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa
Sr. Member
****
Offline Offline

Activity: 714
Merit: 300


Miner Developer


View Profile
January 27, 2017, 08:44:38 PM
 #460

I will look into that. The assembly version for 7990 is almost ready...

have people looked at the returns for pascal coin right now? I would forget mining unless you have almost free electric and super high amounts of hashrate. http://whattomine.com/coins/172-pasc-pascal



No wonder... If it uses a variant of SHA-256, it wouldn't be within the reach of GPU's anyway.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 [23] 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 ... 198 »
  Print  
 
Jump to:  

Sponsored by , a Bitcoin-accepting VPN.
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!