Bitcoin Forum
November 15, 2024, 02:46:42 PM *
News: Check out the artwork 1Dq created to commemorate this forum's 15th anniversary
 
   Home   Help Search Login Register More  
Poll
Question: Do you want to see improvements in Ethash dual-mining with GGS?
I desperately need it. - 8 (15.1%)
It would be nice. - 12 (22.6%)
It's not worth it anymore. - 33 (62.3%)
Total Voters: 53

Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 [49] 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 ... 197 »
  Print  
Author Topic: Gateless Gate Sharp 1.3.8: 30Mh/s (Ethash) on RX 480!  (Read 214420 times)
lexele
Full Member
***
Offline Offline

Activity: 190
Merit: 100


View Profile
March 30, 2017, 06:42:49 PM
 #961

With default setting from your .bat, on polaris cards I'm very close to claymore on eth, but my 290x is almost 3mh behind.
WBF1
Sr. Member
****
Offline Offline

Activity: 419
Merit: 250


View Profile
March 30, 2017, 07:28:14 PM
 #962

some additional info:

R9 390 with gateless gate & ethash-new = 31.9-32 mhs. xI of 1920, worksize 192, 2 threads

This is the same or better performance I get from Claymore. I am getting slowly increasing HW errors, but no rejected shares or issues shown at the pool side.

This card has +6.5% core clock and 1550 mhz memclock. Drivers are 17.3.2 on Windows 8.1.
toptek
Legendary
*
Offline Offline

Activity: 1274
Merit: 1000


View Profile
March 30, 2017, 09:10:04 PM
Last edit: March 31, 2017, 12:05:03 AM by toptek
 #963

zawawa

Does this do what CM 12.4 does yet for ZEC if so , could you give me some tips on how to hit 306 Sol on a 480 8gb at stock setting ? ... CM does that on my 480 8gb at stock no molded bios ..

but

This matches CM's eth miner  Smiley at stock setting nice work ..... so I'm gonna start using it when I ETH mine ... and ask to have it added to http://ethosdistro.com/

For security, your account has been locked. Email acctcomp15@theymos.e4ward.com
joaocha
Full Member
***
Offline Offline

Activity: 254
Merit: 100


View Profile
March 30, 2017, 11:01:08 PM
 #964

On latest drivers, the eth-new works far better them 16.x or lower
cryptominer420
Sr. Member
****
Offline Offline

Activity: 450
Merit: 255


View Profile
March 30, 2017, 11:28:41 PM
 #965

I just uploaded a new pre-release:

https://github.com/zawawawa/gatelessgate/releases/tag/v0.1.3-pre4

There are some performance improvements, and ethash-new.cl was added as an experimental feature.
Let me know how that works.

This one works on ubiqpool.io using my old 7870XT

   ╖   ╓╖╖                         ╖╖╖ ,
  ▒   ╢▒,@▒▒▒║ ╓╣╝║║*╢  ╢▒╣ ],`]░╢▒▒╖ ▒ ╥╢▒▒▒╢  @╝╢▒
  Ñ▒▒]▒▒` ]`╢║▒╣▒╢▒▒  ╢▒╝▒▒▒  ╢▒╜║▒▒▒╢▒╜  ╢╢║N
 ║╢   ▒▒╜ ║▒▒╢▒▒@@╢▒║  ╢▒╜ ▒ ╙▒▒,║░▒╣ ▒║ ╢▒▒╢▒▒▒»@╢@@╢╜



.















▬▬  A Miner Built Mining Platform  ▬▬[/url]
Powered by Our Mining Community













joaocha
Full Member
***
Offline Offline

Activity: 254
Merit: 100


View Profile
March 30, 2017, 11:38:33 PM
 #966

2 threads dont change too much thing at all...only hardware errors

Just use Xi - 512
and gpu threads  - 1

avoid a lot of hw errors
cryptominer420
Sr. Member
****
Offline Offline

Activity: 450
Merit: 255


View Profile
March 30, 2017, 11:56:21 PM
 #967

on 7870XT I am using XI 1512 and -g 2 and it is doing 23MH mining UBIQ 1 HW error in the last hr.

   ╖   ╓╖╖                         ╖╖╖ ,
  ▒   ╢▒,@▒▒▒║ ╓╣╝║║*╢  ╢▒╣ ],`]░╢▒▒╖ ▒ ╥╢▒▒▒╢  @╝╢▒
  Ñ▒▒]▒▒` ]`╢║▒╣▒╢▒▒  ╢▒╝▒▒▒  ╢▒╜║▒▒▒╢▒╜  ╢╢║N
 ║╢   ▒▒╜ ║▒▒╢▒▒@@╢▒║  ╢▒╜ ▒ ╙▒▒,║░▒╣ ▒║ ╢▒▒╢▒▒▒»@╢@@╢╜



.















▬▬  A Miner Built Mining Platform  ▬▬[/url]
Powered by Our Mining Community













nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 31, 2017, 12:10:58 AM
 #968

some additional info:

R9 390 with gateless gate & ethash-new = 31.9-32 mhs. xI of 1920, worksize 192, 2 threads

This is the same or better performance I get from Claymore. I am getting slowly increasing HW errors, but no rejected shares or issues shown at the pool side.

This card has +6.5% core clock and 1550 mhz memclock. Drivers are 17.3.2 on Windows 8.1.

Sgminer has always been faster than claymore on my 290x on Linux/fglrx.  It's on Polaris where it is slightly behind.
joaocha
Full Member
***
Offline Offline

Activity: 254
Merit: 100


View Profile
March 31, 2017, 12:36:58 AM
 #969

Zawawa didnt even touch the kernel,and it is fast as claymore on my 390, im sure if he give a week on it, it ill beat claymore
Superdawg
Newbie
*
Offline Offline

Activity: 1
Merit: 0


View Profile
March 31, 2017, 01:36:41 AM
 #970

i'm getting about 2 mh/s less than genoil miner over 4 cards.

Genoil's gives me about 90 mh/s on 4 rx 480 stock no bios mods
this one says about 88, maybe a bit less.
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 31, 2017, 03:32:22 AM
 #971

I just pushed to the repo an optimized GCN assembly version of ethash-new.cl for RX 470/480.
Each card should get a 1Mh/s boost with it. If this actually works, then I will extend its support to GCN1/GCN3 devices.
(I sold all of my GCN2 cards a while back...)

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
March 31, 2017, 06:52:28 AM
 #972

I just pushed to the repo an optimized GCN assembly version of ethash-new.cl for RX 470/480.
Each card should get a 1Mh/s boost with it. If this actually works, then I will extend its support to GCN1/GCN3 devices.
(I sold all of my GCN2 cards a while back...)
Excellent, but what settings are best to achieve those speeds?
I have tried 1 gpu thread WS=192, xI: 1024 but speed is lower

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 31, 2017, 07:11:18 AM
 #973

I just pushed to the repo an optimized GCN assembly version of ethash-new.cl for RX 470/480.
Each card should get a 1Mh/s boost with it. If this actually works, then I will extend its support to GCN1/GCN3 devices.
(I sold all of my GCN2 cards a while back...)

That's not optimized - you flipped the SLC and GLC bits, which will likely make it a tad SLOWER; it did when I tried that.

That's what I thought as well, but the miner was actually running faster, though.
Let me double check...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 31, 2017, 07:56:02 AM
 #974

I run the same experiment one more time, and I confirmed that ethash-new.cl actually runs faster with the GLC and SLC bits on.

With the GLC and SLC bits on:



With the GLC and SLC bits off:



The .bat file:

Code:
@echo off
set GPU_FORCE_64BIT_PTR 0
set GPU_MAX_HEAP_SIZE 100
set GPU_USE_SYNC_OBJECTS 1
set GPU_MAX_ALLOC_PERCENT 100
set GPU_SINGLE_ALLOC_PERCENT 100
gatelessgate.exe --gpu-platform 1 -k ethash-new -o stratum+tcp://eu1.ethermine.org:4444 -u 0x91fa32e00b0f365d629fb625182a83fed61f0642.gatelessgate -p x --xintensity 4620 --worksize 192 --gpu-threads 2 --no-extranonce
pause

I ran this experiment on Windows 10 with stock RX 480 and AMD  Crimson Software 16.9.2 as usual.
@laik2 It would be great if you could try the above settings as well. I am puzzled by the results myself...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
March 31, 2017, 08:25:51 AM
 #975

Tried default config, xI 2048, xI 1024, gpu-threads 2,1...every time the windows crashed, I'm sorry but I can't test due to unknown reason...I don't like windows environment at all.
On linux it's doing ~ claymore.

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
SunStruck
Sr. Member
****
Offline Offline

Activity: 676
Merit: 250



View Profile
March 31, 2017, 12:52:05 PM
 #976


The .bat file:

Code:
@echo off
set GPU_FORCE_64BIT_PTR 0
set GPU_MAX_HEAP_SIZE 100
set GPU_USE_SYNC_OBJECTS 1
set GPU_MAX_ALLOC_PERCENT 100
set GPU_SINGLE_ALLOC_PERCENT 100
gatelessgate.exe --gpu-platform 1 -k ethash-new -o stratum+tcp://eu1.ethermine.org:4444 -u 0x91fa32e00b0f365d629fb625182a83fed61f0642.gatelessgate -p x --xintensity 4620 --worksize 192 --gpu-threads 2 --no-extranonce
pause

I ran this experiment on Windows 10 with stock RX 480 and AMD  Crimson Software 16.9.2 as usual.
@laik2 It would be great if you could try the above settings as well. I am puzzled by the results myself...

--gpu-platform 0 , no ?

still cant connect to suprnova eth tho.. very odd.
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 31, 2017, 01:31:20 PM
Last edit: March 31, 2017, 01:46:36 PM by nerdralph
 #977

I just pushed to the repo an optimized GCN assembly version of ethash-new.cl for RX 470/480.
Each card should get a 1Mh/s boost with it. If this actually works, then I will extend its support to GCN1/GCN3 devices.
(I sold all of my GCN2 cards a while back...)

That's not optimized - you flipped the SLC and GLC bits, which will likely make it a tad SLOWER; it did when I tried that.

I was expecting just SLC (bypass L2) to help, though I recall Wolf's comments about GLC (bypass L1) actually helping.  I'd even expect GLC to hurt performance if you weren't very careful to ensure data was read in 64-byte chunks.

p.s.  There's also some easy optimizations to do with instruction reordering (though they might not make much difference in performance).  For example:
Code:
/*d11c6a3e 01a9013c*/ v_addc_u32      v62, vcc, v60, 0, vcc
/*2a7e62b2         */ v_xor_b32       v63, 50, v49
/*dc5c0000 4000003d*/ flat_load_dwordx4 v[64:67], v[61:62] slc glc
/*dc5c0000 3b00003b*/ flat_load_dwordx4 v[59:62], v[59:60] slc glc
/*bf8c0171         */ s_waitcnt       vmcnt(1) & lgkmcnt(1)

The v_xor_b32 can be moved to after the flat_load_dwordx4.
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 31, 2017, 01:51:41 PM
 #978

I just pushed to the repo an optimized GCN assembly version of ethash-new.cl for RX 470/480.
Each card should get a 1Mh/s boost with it. If this actually works, then I will extend its support to GCN1/GCN3 devices.
(I sold all of my GCN2 cards a while back...)

I probably wouldn't even bother with Southern Islands; no flat_ instructions.  It should be reasonably easy to write a single kernel for Sea Islands and later, with the main differences being for the ABI changes for kernel parameter passing.
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 31, 2017, 02:08:58 PM
 #979

Just cloned the repo to do a linux build then noticed the autotools requirement.  Ugh.  I much prefer it when developers run autoreconf, and check the configure script into the repo.  Then to build it's just the usual ./configure;make
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 31, 2017, 02:14:45 PM
 #980

Just cloned the repo to do a linux build then noticed the autotools requirement.  Ugh.  I much prefer it when developers run autoreconf, and check the configure script into the repo.  Then to build it's just the usual ./configure;make


I will probably switch to CMake + ninja sooner than later. This whole autotools thing is too archaic to my taste.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 [49] 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 ... 197 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!