Bitcoin Forum
April 23, 2024, 06:08:16 AM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Poll
Question: Do you want to see improvements in Ethash dual-mining with GGS?
I desperately need it. - 8 (15.1%)
It would be nice. - 12 (22.6%)
It's not worth it anymore. - 33 (62.3%)
Total Voters: 53

Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 [20] 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 ... 197 »
  Print  
Author Topic: Gateless Gate Sharp 1.3.8: 30Mh/s (Ethash) on RX 480!  (Read 214334 times)
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 21, 2017, 08:20:33 PM
 #381

It seems like basic tools are available on Windows:

https://github.com/HSAFoundation/HSAIL-Tools

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
1713852496
Hero Member
*
Offline Offline

Posts: 1713852496

View Profile Personal Message (Offline)

Ignore
1713852496
Reply with quote  #2

1713852496
Report to moderator
1713852496
Hero Member
*
Offline Offline

Posts: 1713852496

View Profile Personal Message (Offline)

Ignore
1713852496
Reply with quote  #2

1713852496
Report to moderator
1713852496
Hero Member
*
Offline Offline

Posts: 1713852496

View Profile Personal Message (Offline)

Ignore
1713852496
Reply with quote  #2

1713852496
Report to moderator
The grue lurks in the darkest places of the earth. Its favorite diet is adventurers, but its insatiable appetite is tempered by its fear of light. No grue has ever been seen by the light of day, and few have survived its fearsome jaws to tell the tale.
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1713852496
Hero Member
*
Offline Offline

Posts: 1713852496

View Profile Personal Message (Offline)

Ignore
1713852496
Reply with quote  #2

1713852496
Report to moderator
1713852496
Hero Member
*
Offline Offline

Posts: 1713852496

View Profile Personal Message (Offline)

Ignore
1713852496
Reply with quote  #2

1713852496
Report to moderator
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 21, 2017, 09:55:20 PM
 #382



Ah, the joy... I'm pretty much over the hump.
I didn't need the OpenCL 1.2 ABI or HSAIL after all.
This most likely means I should be able to catch up with Optiminer.
Good stuff, good stuff.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
January 22, 2017, 01:36:58 AM
 #383


I didn't need the OpenCL 1.2 ABI or HSAIL after all.
This most likely means I should be able to catch up with Optiminer.
Good stuff, good stuff.

You wouldn't have had much luck with HSAIL anyway; I'm pretty sure I already mentioned there's no GDS instructions in HSAIL.
zzzzzzzzzz
Full Member
***
Offline Offline

Activity: 150
Merit: 100


View Profile
January 22, 2017, 02:03:00 AM
 #384

@zawawa Just in case, I'll ask: Are you working on GPUs other than RX4xx? I ask because that's the only GPU that anyone has even mentioned in this thread. How about R9 Fury/Nano, for instance? 290x? Etc..? In any case, thank you for all the effort you've given to this!
manotroll
Sr. Member
****
Offline Offline

Activity: 305
Merit: 250


View Profile
January 22, 2017, 02:06:39 AM
 #385

@zawawa Just in case, I'll ask: Are you working on GPUs other than RX4xx? I ask because that's the only GPU that anyone has even mentioned in this thread. How about R9 Fury/Nano, for instance? 290x? Etc..? In any case, thank you for all the effort you've given to this!


390x use for eth
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 22, 2017, 02:24:07 AM
 #386

@zawawa Just in case, I'll ask: Are you working on GPUs other than RX4xx? I ask because that's the only GPU that anyone has even mentioned in this thread. How about R9 Fury/Nano, for instance? 290x? Etc..? In any case, thank you for all the effort you've given to this!


I am currently focusing on RX 480, but I am planning to work on other cards once I'm done with it.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 22, 2017, 03:55:32 AM
 #387

The miner is running stably with 2 threads with a 32KB GDS segment each. Very cool...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 22, 2017, 04:14:43 AM
 #388

I added a new pseudo-op for Global Data Share (GDS) to CLRadeonExtender:

https://github.com/CLRX/CLRX-mirror/pull/11

It will be so much fun if we can freely exploit this killer feature at last...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 22, 2017, 04:19:20 AM
 #389


I didn't need the OpenCL 1.2 ABI or HSAIL after all.
This most likely means I should be able to catch up with Optiminer.
Good stuff, good stuff.

You wouldn't have had much luck with HSAIL anyway; I'm pretty sure I already mentioned there's no GDS instructions in HSAIL.


Really? I don't recall that... The ROCm ABI does expose GDS, though. I will doublecheck.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
cryptominer420
Sr. Member
****
Offline Offline

Activity: 450
Merit: 255


View Profile
January 22, 2017, 05:50:38 PM
 #390

Sounds interesting, I'm anxious to see what you find out.

   ╖   ╓╖╖                         ╖╖╖ ,
  ▒   ╢▒,@▒▒▒║ ╓╣╝║║*╢  ╢▒╣ ],`]░╢▒▒╖ ▒ ╥╢▒▒▒╢  @╝╢▒
  Ñ▒▒]▒▒` ]`╢║▒╣▒╢▒▒  ╢▒╝▒▒▒  ╢▒╜║▒▒▒╢▒╜  ╢╢║N
 ║╢   ▒▒╜ ║▒▒╢▒▒@@╢▒║  ╢▒╜ ▒ ╙▒▒,║░▒╣ ▒║ ╢▒▒╢▒▒▒»@╢@@╢╜



.















▬▬  A Miner Built Mining Platform  ▬▬[/url]
Powered by Our Mining Community













nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
January 22, 2017, 06:39:32 PM
 #391

I added a new pseudo-op for Global Data Share (GDS) to CLRadeonExtender:

https://github.com/CLRX/CLRX-mirror/pull/11

It will be so much fun if we can freely exploit this killer feature at last...

Nice.  With this change there should be no more need to explicitly initialize M0 (except maybe for GCN1 devices since they only have OpenCL1.2 driver support).
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
January 22, 2017, 06:48:27 PM
 #392


I didn't need the OpenCL 1.2 ABI or HSAIL after all.
This most likely means I should be able to catch up with Optiminer.
Good stuff, good stuff.

You wouldn't have had much luck with HSAIL anyway; I'm pretty sure I already mentioned there's no GDS instructions in HSAIL.


Really? I don't recall that... The ROCm ABI does expose GDS, though. I will doublecheck.

I confirmed it with one of the AMD devs working on llvm.  He said there was plans for a GCN extension that never got implemented in the HSAIL llvm backend since they are now focused on the AMDGPU backend.
ROCm also now supports OpenCL kernels.
https://www.khronos.org/news/permalink/rocm-1.4-has-support-for-opencl-1.2-host-code-and-2.0-kernels

The possibility of using inline asm for GDS access with the rest of the kernel in straight OpenCL looks promising to me...
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 23, 2017, 03:00:12 AM
 #393


The possibility of using inline asm for GDS access with the rest of the kernel in straight OpenCL looks promising to me...


That would be really nice, but I need a solution that works right now.
I had to go through another hoop and turn on the "enable_ordered_append_gds" bit, but I finally located where the GDS base is stored. I am getting really close!

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 23, 2017, 05:33:28 AM
 #394

Do I need to initialize GDS before actually using it?
These instructions are documented nowhere.

Code:
DS_CONSUME
DS_APPEND
DS_ORDERED_COUNT

nerdralph, do you have any ideas?

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 23, 2017, 08:54:04 AM
 #395

Hmm... It seems that GDS is not activated for some reasons.
What to do, what to do...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
chronek
Sr. Member
****
Offline Offline

Activity: 273
Merit: 250


BD People Are Legend


View Profile
January 23, 2017, 12:13:14 PM
 #396

i heard that rx480 have opencl 2.0, would be any benefits when using abi 2.0?


                 ▄▄▄██████████████████▄▄▄
            ▄▄██▀▀▀▀▀███████████████████████▄▄
        ▄▄███▀   ▄▄▄   ▀████████████████████████▄▄
     ▄██████  ▄███████▄  ▀██████████████████▀▀▀█████▄
   ▄███████    ███▀▀ ██    ███████████  ███▀  ▄███████▄
  █████████▄  ▄█▀  ▄███    ██████████  ▄██  ▄███████████
 ██████████████▀  ████▀   ▄██▀▀▀████  ▄████▀  ███████████
██████████████▀  ▄███▀   ▄█▀  ▄▄ ██▀ ▄█  ██  █████████████
██████████████   ▀     ▄██▀  ▄█  █▀  █   █▀▄  ▀███████████
█████████████▀  ▄███▄  ▀██   ██ ▄▄▄  █▀ ▄▄▀█▄  ▀▄█████████
█████████████   █████   ▀██▄ ▀▄████▄  ▄███▄▀  ▄███████████
 █████   ▀██▀  ▄█████    █▀▀█████████████████▀███████████
  ████   ▄█▀   █████    ██ ▀▄█▀▄▄▀█ ▀▄▀█▀▄▀ █▀█ ▀▄██████
   ▀███▄▄   ▄█▄ ▀▀     ███ █ █ ▄▄▄█  █ █ █  █ █ ██████▀
     ▀██████████▄▄▄▄▄█████▄█▄██▄▄██ █▄███▄█▄█▄█▄████▀
        ▀▀██████████████████████████████████████▀▀
            ▀▀██████████████████████████████▀▀
                 ▀▀▀██████████████████▀▀▀
      Supported by
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
John McAfee     
|
|
|
|

   ▀██▄       ▀████▄
▄    ▀██          ▀██▄
██▄ ▄███         ▄█████▄▄
 ▀███████▄     ▄██▀ ▀▀████
      ▀████▄ ▄██▀     ▀█▀
        ▀████▄
        ▄█▀████▄
      ▄███▄▀▀████▄
    ▄████▀    ▀████▄
  ▄████▀        ▀██▀█▄
  ▀██▀            ▀██▀
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 23, 2017, 01:58:06 PM
 #397

i heard that rx480 have opencl 2.0, would be any benefits when using abi 2.0?

The OpenCL 2.0 ABI does not make any differences. I might have to bypass the driver and send raw packets directly to the GPU to enable GDS. This is crazy.

http://amd-dev.wpengine.netdna-cdn.com/wordpress/media/2013/10/si_programming_guide_v2.pdf
https://github.com/fail0verflow/radeon-tools/blob/master/f32/f32dis.py

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 23, 2017, 02:07:57 PM
 #398

There you go!

Quote
2.9 Misc/Data Transfer Packets
2.9.1 ALLOC_GDS
The packet will allocate a new segment within its corresponding GDS partition. The corresponding partition is
determined from the Ring to which the packet is submitted. The microcode will first wait until the active partition
count equals zero before continuing. This guarantees that the entire contents of the previous allocated segment have
been dumped to memory before allocating the new segment within the current partition. It will also check if the
segment size is less than partition size and interrupt if the current segment does not fit into its specified partition

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
January 23, 2017, 02:11:57 PM
Last edit: January 23, 2017, 02:31:36 PM by nerdralph
 #399

Do I need to initialize GDS before actually using it?
These instructions are documented nowhere.

Code:
DS_CONSUME
DS_APPEND
DS_ORDERED_COUNT

nerdralph, do you have any ideas?

I suspect the driver initializes M0 when gds_segment_byte_size is set in the kernel configuration.  If you look in the GCN ISA docs, it says M0 has 16 bits for offset and 16 bits for size.  M0 is also used for LDS, so when you use both in your code you'll need to save it to another register.

I hadn't looked at the DS_ instructions you refer to, and a quick look at the ISA confirms your observation about them having no documentation.  The llvm source would at least have the instruction encoding.

I'm not sure why you want to use those instructions though.  For the global row counters I'd use ds_add_u32 with the GDS bit set.

p.s. the M0 description is in s. 3.7 of the GCN ISA docs.

zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
January 23, 2017, 02:38:57 PM
 #400

I suspect the driver initializes M0 when gds_segment_byte_size is set in the kernel configuration.

I assumed that the GDS base/size combination would be stored in one of SGPR's just like the OpenCL 1.2 ABI, but you may be right. I will check it right now.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 [20] 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 ... 197 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!