Bitcoin Forum
April 26, 2024, 02:24:16 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Poll
Question: Do you want to see improvements in Ethash dual-mining with GGS?
I desperately need it. - 8 (15.1%)
It would be nice. - 12 (22.6%)
It's not worth it anymore. - 33 (62.3%)
Total Voters: 53

Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 [44] 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 ... 197 »
  Print  
Author Topic: Gateless Gate Sharp 1.3.8: 30Mh/s (Ethash) on RX 480!  (Read 214337 times)
citronick
Legendary
*
Offline Offline

Activity: 1834
Merit: 1080


---- winter*juvia -----


View Profile
March 15, 2017, 03:44:51 AM
 #861

Great work! Any improvements about XMR in pre3?

Hi, it's allready the fastest XMR miner (at least that I found out) and it's open source, so what's the point being even faster? As everybody will be faster and you will not get more XMR.

While on ZEC, the faster miners are closed source with devfee, that's why Zawawa's miner is awaited.
You're not right. Fastest XMR miner at this moment is Claymore CryptoNote 9.7.
Even with 2% fee it gives about 5-10% more speed on 280X.

on my 290x GG is 15% faster, on my RX470 not much faster but it's faster.
But on 280X Claymore is faster. Thats mean that GG is not fastest miner at this moment. Maybe author optimizes something else and it became the best )

Claymore v9.7 is best for non-RX cards (290s/390s/R9 cards). He needs to further optimise the miner for RX/Polaris (long time overdue).

For RX cards, Wolf's XMR miner and sgminer-gm, sgminer-gg will be a better and more stable miner.

If I provided you good and useful info or just a smile to your day, consider sending me merit points to further validate this Bitcointalk account ~ useful for future account recovery...
1714141456
Hero Member
*
Offline Offline

Posts: 1714141456

View Profile Personal Message (Offline)

Ignore
1714141456
Reply with quote  #2

1714141456
Report to moderator
"Bitcoin: the cutting edge of begging technology." -- Giraffe.BTC
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1714141456
Hero Member
*
Offline Offline

Posts: 1714141456

View Profile Personal Message (Offline)

Ignore
1714141456
Reply with quote  #2

1714141456
Report to moderator
1714141456
Hero Member
*
Offline Offline

Posts: 1714141456

View Profile Personal Message (Offline)

Ignore
1714141456
Reply with quote  #2

1714141456
Report to moderator
Sam123
Hero Member
*****
Offline Offline

Activity: 980
Merit: 502


View Profile
March 15, 2017, 04:44:02 AM
 #862

Great work! Any improvements about XMR in pre3?

Hi, it's allready the fastest XMR miner (at least that I found out) and it's open source, so what's the point being even faster? As everybody will be faster and you will not get more XMR.

While on ZEC, the faster miners are closed source with devfee, that's why Zawawa's miner is awaited.
You're not right. Fastest XMR miner at this moment is Claymore CryptoNote 9.7.
Even with 2% fee it gives about 5-10% more speed on 280X.

on my 290x GG is 15% faster, on my RX470 not much faster but it's faster.
But on 280X Claymore is faster. Thats mean that GG is not fastest miner at this moment. Maybe author optimizes something else and it became the best )

Claymore v9.7 is best for non-RX cards (290s/390s/R9 cards). He needs to further optimise the miner for RX/Polaris (long time overdue).

For RX cards, Wolf's XMR miner and sgminer-gm, sgminer-gg will be a better and more stable miner.


Can you please share the link for the Wolf XMR miner (RX480 cards)
Thanks
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
March 15, 2017, 06:28:35 AM
 #863

Great work! Any improvements about XMR in pre3?

Hi, it's allready the fastest XMR miner (at least that I found out) and it's open source, so what's the point being even faster? As everybody will be faster and you will not get more XMR.

While on ZEC, the faster miners are closed source with devfee, that's why Zawawa's miner is awaited.
You're not right. Fastest XMR miner at this moment is Claymore CryptoNote 9.7.
Even with 2% fee it gives about 5-10% more speed on 280X.

on my 290x GG is 15% faster, on my RX470 not much faster but it's faster.
But on 280X Claymore is faster. Thats mean that GG is not fastest miner at this moment. Maybe author optimizes something else and it became the best )
It's the fastest open source miner, I repeat - OPEN SOURCE !
As of your statement that Claymore's CryptoNote miner is the fastest - your are absolutely wrong, sgminer-gm/gg are still faster Smiley

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 15, 2017, 06:36:40 AM
 #864

I am not going to join this "Who got the fastest miner?" discussion.
Project GG's slogan is: "The best miner should be free."
I mean, Bitcoin founder Satoshi Nakamoto was so generous.
Why shouldn't we?

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
Ursul0
Sr. Member
****
Offline Offline

Activity: 857
Merit: 262


View Profile
March 15, 2017, 08:37:37 AM
 #865

getting closer...


EDIT: actually two of the cards just went SICK after 5 minutes, while Claymore works for days with no issues
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 15, 2017, 01:49:54 PM
 #866

Using SLC or GLC memory read/write may also give a small performance boost.

Could you elaborate on this? I recall you said Wolf was using them for his private miner, but I am not entirely sure how to use SLC/GLC bits for performance enhancements.

The SLC (System Level Coherence) bit forces bypassing the L1 cache, and GLC (Global Level Coherence) forces bypassing the L2.  For ETH mining, which is 100% memory reads, I think SLC gave a performance improvement, but GLC did not.  The results weren't completely intuitive, so you'll probably have to do some experimenting.  I also suspect you may get different results from different GCN versions.  Pitcairn and Tahiti seem to have a brain-dead cache controller that gets slower as the working set gets much over 1GB.  Therefore I think GLC read/write may have a more significant impact for them vs Tonga (or even Hawaii).
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 15, 2017, 05:01:36 PM
 #867

EDIT: actually two of the cards just went SICK after 5 minutes, while Claymore works for days with no issues

That must be a hardware issue. Different miners tend to expose different hardware problems.
Also, for optimal performance with Ellesmere, you need to run the miner on Linux for now.
I think the only real technical advantage Claymore has over me is that he figured out how to access the entire GDS both on Windows and Linux.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 15, 2017, 05:07:36 PM
 #868

Using SLC or GLC memory read/write may also give a small performance boost.

Could you elaborate on this? I recall you said Wolf was using them for his private miner, but I am not entirely sure how to use SLC/GLC bits for performance enhancements.

The SLC (System Level Coherence) bit forces bypassing the L1 cache, and GLC (Global Level Coherence) forces bypassing the L2.  For ETH mining, which is 100% memory reads, I think SLC gave a performance improvement, but GLC did not.  The results weren't completely intuitive, so you'll probably have to do some experimenting.  I also suspect you may get different results from different GCN versions.  Pitcairn and Tahiti seem to have a brain-dead cache controller that gets slower as the working set gets much over 1GB.  Therefore I think GLC read/write may have a more significant impact for them vs Tonga (or even Hawaii).


Thanks for the clarification. Yeah, these features definitely make more sense for access to ETH's huge DAG. Let's see what I can do with them for ZEC...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 15, 2017, 05:25:24 PM
Last edit: March 16, 2017, 03:49:28 AM by zawawa
 #869

It seems like it is possible to access PCIe devices directly in the user space on Windows, too:

https://msdn.microsoft.com/windows/hardware/drivers/wdf/comparing-umdf-2-0-functionality-to-kmdf

Well, there is something to learn every day...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 15, 2017, 06:23:14 PM
 #870

Let me see if this works...

https://github.com/rumpkernel/wiki/wiki/Howto:-Accessing-PCI-devices-from-userspace

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 16, 2017, 01:29:55 AM
 #871

This is the portion of the Linux kernel responsible for GDS-related parameters for compute kernels:

Code:
		if (gds) {
p->job->gds_base = amdgpu_bo_gpu_offset(gds);
p->job->gds_size = amdgpu_bo_size(gds);
}
https://github.com/torvalds/linux/blob/ef96152e6a36e0510387cb174178b7982c1ae879/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c

I should be able to change them by modifying the kernel source code.
I love free software!

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 16, 2017, 07:10:23 AM
Last edit: March 16, 2017, 09:50:33 PM by zawawa
 #872

I just patched the Linux kernel as an experiment:

Code:
		if (1 /*gds*/) {
p->job->gds_base = 0; // amdgpu_bo_gpu_offset(gds);
p->job->gds_size = 65536; // amdgpu_bo_size(gds);
}

This may actually work...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
john1010
Hero Member
*****
Offline Offline

Activity: 2072
Merit: 562


View Profile WWW
March 16, 2017, 10:02:52 AM
 #873

How many hash it will produce on ethereum usage? thanks
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 16, 2017, 09:44:57 PM
 #874

How many hash it will produce on ethereum usage? thanks

About 10% lower than Claymore's. You should try it yourself.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 16, 2017, 09:52:19 PM
 #875

THE KERNEL PATCH IS WORKING!! WHOO HOO!!!!

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 17, 2017, 04:20:23 AM
 #876

Well, it turned out that multithreading wasn't working, so I'm still working on the kernel patch.
The miner is running about 15% faster with a single thread, so it's very promising.
I should be able to get rid of the patch entirely by hooking system calls to the driver later.
This is no easy stuff!

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa (OP)
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
March 17, 2017, 09:07:54 AM
 #877

It seems that the real maximum size of GDS segments for RX 480 is 16KB.
It's a little disappointing, but still much better than 4KB without the kernel patch.
This number is also consistent with nertralph's report that Optiminer runs four CPU threads per GPU as GDS utilization can be maximized this way.
Now let me fix the kernel patch one more time...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
doktor83
Hero Member
*****
Offline Offline

Activity: 2520
Merit: 626


View Profile WWW
March 17, 2017, 09:41:04 AM
 #878

so good to see how much you love doing this zawawa Smiley

SRBMiner-MULTI thread - HERE
http://www.srbminer.com
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 17, 2017, 01:03:04 PM
 #879

It seems that the real maximum size of GDS segments for RX 480 is 16KB.
It's a little disappointing, but still much better than 4KB without the kernel patch.
This number is also consistent with nertralph's report that Optiminer runs four CPU threads per GPU as GDS utilization can be maximized this way.
Now let me fix the kernel patch one more time...

Or maybe Optiminer runs just 2 instances of the kernel and uses only half of the GDS.  It's possible (I'd even say probable) the benefits of using a full 64KB is offset by slower GDS access caused by contention with 4 instances of the kernel running.
jstefanop
Legendary
*
Offline Offline

Activity: 2090
Merit: 1396


View Profile
March 17, 2017, 04:49:04 PM
 #880

It seems that the real maximum size of GDS segments for RX 480 is 16KB.
It's a little disappointing, but still much better than 4KB without the kernel patch.
This number is also consistent with nertralph's report that Optiminer runs four CPU threads per GPU as GDS utilization can be maximized this way.
Now let me fix the kernel patch one more time...

Or maybe Optiminer runs just 2 instances of the kernel and uses only half of the GDS.  It's possible (I'd even say probable) the benefits of using a full 64KB is offset by slower GDS access caused by contention with 4 instances of the kernel running.


Pretty sure both optiminer and claymore running two kernel threads.

Project Apollo: A Pod Miner Designed for the Home https://bitcointalk.org/index.php?topic=4974036
FutureBit Moonlander 2 USB Scrypt Stick Miner: https://bitcointalk.org/index.php?topic=2125643.0
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 [44] 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 ... 197 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!