Bitcoin Forum
April 26, 2024, 05:45:23 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Poll
Question: Do you want to see improvements in Ethash dual-mining with GGS?
I desperately need it. - 8 (15.1%)
It would be nice. - 12 (22.6%)
It's not worth it anymore. - 33 (62.3%)
Total Voters: 53

Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 [23] 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 ... 197 »
  Print  
Author Topic: Gateless Gate Sharp 1.3.8: 30Mh/s (Ethash) on RX 480!  (Read 214337 times)
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 25, 2017, 07:38:03 AM
 #441

Yeah, I thought I could at least prepare the next alpha version today, PST, but I encountered all sorts of weird bugs in the driver. I already sorted them out, so it should be OK. I will keep you guys updated.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
1714153523
Hero Member
*
Offline Offline

Posts: 1714153523

View Profile Personal Message (Offline)

Ignore
1714153523
Reply with quote  #2

1714153523
Report to moderator
"If you don't want people to know you're a scumbag then don't be a scumbag." -- margaritahuyan
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1714153523
Hero Member
*
Offline Offline

Posts: 1714153523

View Profile Personal Message (Offline)

Ignore
1714153523
Reply with quote  #2

1714153523
Report to moderator
1714153523
Hero Member
*
Offline Offline

Posts: 1714153523

View Profile Personal Message (Offline)

Ignore
1714153523
Reply with quote  #2

1714153523
Report to moderator
ghostfaceuk
Sr. Member
****
Offline Offline

Activity: 410
Merit: 250


View Profile
January 25, 2017, 08:23:08 AM
 #442

Yeah, I thought I could at least prepare the next alpha version today, PST, but I encountered all sorts of weird bugs in the driver. I already sorted them out, so it should be OK. I will keep you guys updated.

Take your time man, it a marathon not a sprint.


No need to push your self too hard and cause burnout. Take a days break. it can do wonders for your concentration and creative juices, not to mention your relationship lol.
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 26, 2017, 03:50:37 AM
 #443

Thanks, but I'd rather be done with it sooner than later, though.
Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on stock 7990.
I already confirmed that GDS counters do make the miner run faster and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scene, but I realized I still need to work on the OpenCL kernel.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zzzzzzzzzz
Full Member
***
Offline Offline

Activity: 150
Merit: 100


View Profile
January 26, 2017, 03:58:34 AM
 #444

Thanks, but I'd rather be done with it sooner than later, though.
Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on stock 7990.
I already confirmed that GDS counters do make the miner run faster and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scene, but I realized I still need to work on the OpenCL kernel.

Can't wait for an optimized Fiji kernel! Thanks for all your effort!
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 26, 2017, 07:05:49 AM
 #445

Alright, I'm done with the OpenCL kernel.
All I have to do now is to rewrite a disassembled code of the kernel, which should be straight forward.
I will work on the assembly version for 7990 first, then I will do the same for RX 480 on Linux.
I will support both platforms, so no worries.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 26, 2017, 07:08:35 AM
 #446

Thanks, but I'd rather be done with it sooner than later, though.
Now I'm optimizing the OpenCL kernel again and getting 434 sol/s on stock 7990.
I already confirmed that GDS counters do make the miner run faster and the current assembly version is surprisingly stable given all the crazy stuff that is going on behind the scene, but I realized I still need to work on the OpenCL kernel.

Can't wait for an optimized Fiji kernel! Thanks for all your effort!

Fiji is GCN3, so the assembly version for RX 480 should work.
We will  see.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 26, 2017, 09:34:43 AM
 #447

Hmm... GDS counters are not as fast as they should be.
I probably need to unpack them and use uint counters.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 26, 2017, 06:21:28 PM
Last edit: January 26, 2017, 08:08:39 PM by zawawa
 #448

Quote
There are 32 banks mapped to the lowest bits of the dw offset.
https://community.amd.com/thread/167167

Does this mean access to GDS always go through Bank 0 as long as the offset of the DS instruction is not specified?
This cannot be right... It seems like GDS is as horrendously designed difficult to use as VGPR indexing...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
January 27, 2017, 12:23:59 AM
 #449

Quote
There are 32 banks mapped to the lowest bits of the dw offset.
https://community.amd.com/thread/167167

Does this mean access to GDS always go through Bank 0 as long as the offset of the DS instruction is not specified?
This cannot be right... It seems like GDS is as horrendously designed difficult to use as VGPR indexing...
Good work, new kernels do segfaults on ubuntu, though...

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 27, 2017, 01:25:47 AM
 #450

Could be ncurses as I added colors to the UI. I will check it later.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
January 27, 2017, 01:55:11 AM
 #451

Could be ncurses as I added colors to the UI. I will check it later.
I mean equihash.working1/2 Smiley

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
reb0rn21
Legendary
*
Offline Offline

Activity: 1896
Merit: 1024


View Profile
January 27, 2017, 02:10:37 AM
 #452

@zawawa

If you know sha256  you might look at pascal coin, no pool miner yet and sha256 need slight modification for it, i think many ppl would like to see it, if pascal price hold
NV solo miner is flowed and AMD is just solo
ocminer worked at pool but none is doing miner because it need new protocol for pool

              ▄▄▄ ▀▀▀▀▀▀▀▀▀ ▄▄▄
           ▄▀▀    ▄▄▄▄▄▄▄▄▄    ▀▀▄
        ▄▀▀  ▄▄▀█          ▀█▀▄▄  ▀▀▄
      ▄▀▀ ▄▄▀    ▀▀▄▄▄▄▄▄▄▀▀    ▀▄▄ ▀▀▄
     █   █            ▀            █   █
   ▄▀ █  ▀▄▄                     ▄█▀  █ ▀▄
  ▄▀ ▄▀ █▄ ▀▀▀██▄▄▄       ▄▄▄██▀▀  ██ ▀▄ ▀▄
  ▀▄▀▀▄ ██ ▄▄▄▄▄▄  ▀▄   ▄▀  ▄▄▄▄▄▄ ██ ▄▀▀▄▀
 ██   █ ██ ▀▄    ▀▄ █   █ ▄▀    ▄▀ ██ █  ▀██
 █  ▄█  ▀█  ▀▀▀▀▀▀▀ █   █ ▀▀▀▀▀▀▀  █   █▄  █
█▀ █  █  █          █   █          █  █  █ ▀▀
 █▀  ▄▀  █▀▄        █   █        ▄▀█  ▀▄  ▀█
 ▄  █▀   █ ▀█▄      ▀   ▀      ▄█▀ █  ▄▀█  ▄
 █▄▀  █  █                         █  █  ▀▄█
 ▀▄  █   ▀█        ▄▄▀▄▀▄▄        █▀   █  ▄
  ▀▄▀▀  █▄ █     ▀█  ▀▀▀  █▀     █ ▄█ ▄▀▀▄▀
   ▀ ▄  ██ █▀▄     ▀▀▄▄▄▀▀     ▄▀█ ██ ▀▄ ▀
    ▀█  ██ █ █▀▄    ▄▄▄▄▄    ▄▀█ █ ██  █▀
      ▀▄ ▀ █ █ ██▄         ▄██ █ █ ▀ ▄▀
        ▀▄ █ █ █ ▀█▄     ▄█▀ █ █ █ ▄▀
          ▀▀▄█ █    ▀▀▀▀▀    █ █▄▀▀
              ▀▀ ▄▄▄▄▄▄▄▄▄▄▄ ▀▀
   
..I  D  E  N  A..
   
Proof-of-Person Blockchain

Join the mining of the first human-centric
cryptocurrency
 



 
▲    2 3 2 2

..N  O  D  E  S..
   
                ██
                ██
                ██
                ██
                ██
         ▄      ██      ▄
         ███▄   ██   ▄███
          ▀███▄ ██ ▄███▀
            ▀████████▀
              ▀████▀
                ▀▀
██▄                            ▄██
███                            ███
███                            ███
███                            ███
 ███▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄███
  ▀▀██████████████████████████▀▀
   
D O W N L O A D

Idena node

   
   
▄▄▄██████▄▄▄
▄▄████████████████▄▄
▄█████▀▀        ▀▀█████▄
████▀                ▀████
███▀    ▄▄▄▄▄▄▄▄▄       ▀███
███      █   ▄▄ █▀▄        ███
██▀      █  ███ █  ▀▄      ▀██
███       █   ▀▀ ▀▀▀▀█       ███
███       █  ▄▄▄▄▄▄  █       ███
███       █  ▄▄▄▄▄▄  █       ███
██▄      █  ▄▄▄▄▄▄  █      ▄██
███      █          █      ███
███▄    ▀▀▀▀▀▀▀▀▀▀▀▀    ▄███
████▄                ▄████
▀█████▄▄        ▄▄█████▀
▀▀████████████████▀▀
▀▀▀██████▀▀▀
   
    .REQUEST INVITATION.
xeridea
Sr. Member
****
Offline Offline

Activity: 449
Merit: 251


View Profile WWW
January 27, 2017, 08:12:36 AM
 #453

@zawawa

If you know sha256  you might look at pascal coin, no pool miner yet and sha256 need slight modification for it, i think many ppl would like to see it, if pascal price hold
NV solo miner is flowed and AMD is just solo
ocminer worked at pool but none is doing miner because it need new protocol for pool
There are no pools yet... and mining profit on it is in the toilet anyway.

Profitability over time charts for many GPUs - http://xeridea.us/charts

BTC:  bc1qr2xwjwfmjn43zhrlp6pn7vwdjrjnv5z0anhjhn LTC:  LXDm6sR4dkyqtEWfUbPumMnVEiUFQvxSbZ Eth:  0x44cCe2cf90C8FEE4C9e4338Ae7049913D4F6fC24
m1n1ngP4d4w4n
Full Member
***
Offline Offline

Activity: 224
Merit: 100

CryptoLearner


View Profile
January 27, 2017, 08:27:58 AM
 #454

ocminer (suprnova) is working on a pool he should release soon.
ghostfaceuk
Sr. Member
****
Offline Offline

Activity: 410
Merit: 250


View Profile
January 27, 2017, 09:17:19 AM
Last edit: January 27, 2017, 02:26:19 PM by ghostfaceuk
 #455

ocminer (suprnova) is working on a pool he should release soon.

In that case we will need a pool miner as I think all miners out there are currently solo based, I also hear there is problems with the Cuda version of the miner
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 27, 2017, 03:26:01 PM
 #456

I will look into that. The assembly version for 7990 is almost ready...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
ghostfaceuk
Sr. Member
****
Offline Offline

Activity: 410
Merit: 250


View Profile
January 27, 2017, 03:31:31 PM
 #457

I will look into that. The assembly version for 7990 is almost ready...

have people looked at the returns for pascal coin right now? I would forget mining unless you have almost free electric and super high amounts of hashrate. http://whattomine.com/coins/172-pasc-pascal

nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
January 27, 2017, 03:35:31 PM
 #458

Quote
There are 32 banks mapped to the lowest bits of the dw offset.
https://community.amd.com/thread/167167

Does this mean access to GDS always go through Bank 0 as long as the offset of the DS instruction is not specified?
This cannot be right... It seems like GDS is as horrendously designed difficult to use as VGPR indexing...

I think "offset" is not the correct term.  The GDS address (including the offset) is what matters.  So dword addresses 0x0020 and 0x0021 are on different banks, but addresses 0x0020 and 0x0060 are on the same bank.
Quote from GCN ISA: "The GDS is configured with 32 banks, each with 512 entries of 4 bytes each."

zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 27, 2017, 08:41:42 PM
 #459

Quote
There are 32 banks mapped to the lowest bits of the dw offset.
https://community.amd.com/thread/167167

Does this mean access to GDS always go through Bank 0 as long as the offset of the DS instruction is not specified?
This cannot be right... It seems like GDS is as horrendously designed difficult to use as VGPR indexing...

I think "offset" is not the correct term.  The GDS address (including the offset) is what matters.  So dword addresses 0x0020 and 0x0021 are on different banks, but addresses 0x0020 and 0x0060 are on the same bank.
Quote from GCN ISA: "The GDS is configured with 32 banks, each with 512 entries of 4 bytes each."



That makes a perfect sense. I did read that description about GDS, but I got really confused because realhet is usually right about almost anything related to GCN. I must be pretty tired...

In any case, the fact remains that GDS bank conflicts are a serious problem and row counters must be unpacked. A single thread assembly version is already running stably with a major speed boost. Multithreading is rather tricky because I now have to use the entire 64KB of GDS, which is rather tricky and required modifications of the code. I am expecting an upper 400's on 7990 with this optimization alone.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 27, 2017, 08:44:38 PM
 #460

I will look into that. The assembly version for 7990 is almost ready...

have people looked at the returns for pascal coin right now? I would forget mining unless you have almost free electric and super high amounts of hashrate. http://whattomine.com/coins/172-pasc-pascal



No wonder... If it uses a variant of SHA-256, it wouldn't be within the reach of GPU's anyway.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 [23] 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 ... 197 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!