Bitcoin Forum
November 18, 2024, 01:56:31 AM *
News: Latest Bitcoin Core release: 28.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 ... 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 [86] 87 88 89 90 91 »
  Print  
Author Topic: SILENTARMY v5: Zcash miner, 115 sol/s on R9 Nano, 70 sol/s on GTX 1070  (Read 209309 times)
Hotmetal
Sr. Member
****
Offline Offline

Activity: 378
Merit: 250


View Profile
December 12, 2016, 02:39:33 PM
 #1701

Here are some stats for my fork.

What is the url to your fork?
cryptomined
Full Member
***
Offline Offline

Activity: 168
Merit: 104



View Profile WWW
December 12, 2016, 03:00:17 PM
 #1702

By the way, I found out the root cause of a low number of shares.
It turned out that there is a bug in AMD's drivers, so you need to so specify the "-O1" build option to disable most of optimizations when you call clBuildProgram if the miner is running on AMD cards. The optimizer must have incorrectly eliminated s_waitcnt. What a wonderful world.

is your fork ready for release?  thanks

kay5ive
Member
**
Offline Offline

Activity: 67
Merit: 10


View Profile
December 12, 2016, 05:38:11 PM
 #1703

By the way, I found out the root cause of a low number of shares.
It turned out that there is a bug in AMD's drivers, so you need to so specify the "-O1" build option to disable most of optimizations when you call clBuildProgram if the miner is running on AMD cards. The optimizer must have incorrectly eliminated s_waitcnt. What a wonderful world.
How to fork this? Do you have a method?
If you want to share with us?
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
December 12, 2016, 07:02:23 PM
 #1704

I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS.
I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.

I think 14 will be close to ideal.  I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters.  That's 16KB * 2 for the row counters, or only 32KB.
Golku
Full Member
***
Offline Offline

Activity: 263
Merit: 100


View Profile
December 12, 2016, 07:32:50 PM
 #1705

I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS.
I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.

I think 14 will be close to ideal.  I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters.  That's 16KB * 2 for the row counters, or only 32KB.


I got a feeling that this miner will beat up claymores
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
December 12, 2016, 07:43:36 PM
 #1706

I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS.
I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.

I think 14 will be close to ideal.  I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters.  That's 16KB * 2 for the row counters, or only 32KB.

What happened to your closed source miner?

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
Tatalk
Full Member
***
Offline Offline

Activity: 146
Merit: 100



View Profile
December 12, 2016, 08:26:35 PM
 #1707

I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS.
I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.

I think 14 will be close to ideal.  I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters.  That's 16KB * 2 for the row counters, or only 32KB.


I got a feeling that this miner will beat up claymores

I do not think the developers has much incentive to beat the Claymore. They are not rewarded properly.

issie81
Hero Member
*****
Offline Offline

Activity: 760
Merit: 500

CryptoZilla


View Profile
December 12, 2016, 09:05:52 PM
 #1708

Quote

I do not think the developers has much incentive to beat the Claymore. They are not rewarded properly.

why not implement 1% dev fee?

the linux miners are more stable
doktor83
Hero Member
*****
Offline Offline

Activity: 2716
Merit: 626


View Profile WWW
December 12, 2016, 09:25:42 PM
 #1709

Quote

I do not think the developers has much incentive to beat the Claymore. They are not rewarded properly.

why not implement 1% dev fee?

the linux miners are more stable

i don't have any stability problems with claymore's, and its not a linux miner.

SRBMiner-MULTI thread - HERE
http://www.srbminer.com
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
December 13, 2016, 03:03:07 AM
 #1710

I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS.
I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.

I think 14 will be close to ideal.  I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters.  That's 16KB * 2 for the row counters, or only 32KB.

What happened to your closed source miner?

Perfection takes time. :-)  If you want to see the results of some of my research into memory performance, check this out:
https://github.com/nerdralph/cl-mem
eXtremal
Sr. Member
****
Offline Offline

Activity: 2106
Merit: 282


👉bit.ly/3QXp3oh | 🔥 Ultimate Launc


View Profile WWW
December 13, 2016, 10:04:48 AM
 #1711

Did anybody implement NR_ROWS less than 16 support already ?

TONUP██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
██
▄▄███████▄▄
▄▄███████████████▄▄
▄███████████████████▄
▄█████▄░▄▄▀█████▀▄████▄
▄███████▄▀█▄▀██▀▄███████▄
█████████▄▀█▄▀▄██████████
██████████▄▀█▄▀██████████
██████████▀▄▀█▄▀█████████
▀███████▀▄██▄▀█▄▀███████▀
▀████▀▄█████▄▀▀░▀█████▀
▀███████████████████▀
▀▀███████████████▀▀
▀▀███████▀▀
▄▄▄███████▄▄▄
▄▄███████████████▄▄
▄███████████████████▄
▄██████████████▀▀█████▄
▄██████████▀▀█████▐████▄
██████▀▀████▄▄▀▀█████████
████▄▄███▄██▀█████▐██████
█████████▀██████████████
▀███████▌▐██████▐██████▀
▀███████▄▄███▄████████▀
▀███████████████████▀
▀▀███████████████▀▀
▀▀▀███████▀▀▀
▄▄▄███████▄▄▄
▄▄███████████████▄▄
▄███████████████████▄
▄█████████████████████▄
▄████▀▀███▀▀███▀▀██▀███▄
████▀███████▀█▀███▀█████
██████████████████████
████▄███████▄█▄███▄█████
▀████▄▄███▄▄███▄▄██▄███▀
▀█████████████████████▀
▀███████████████████▀
▀▀███████████████▀▀
▀▀▀███████▀▀▀
████████
██
██
██
██
██
██
██
██
██
██
██
████████
████████████████████████████████████████████████████████████████████████████████
.
JOIN NOW
.
████████████████████████████████████████████████████████████████████████████████
████████
██
██
██
██
██
██
██
██
██
██
██
████████
ioglnx
Sr. Member
****
Offline Offline

Activity: 574
Merit: 250

Fighting mob law and inquisition in this forum


View Profile
December 13, 2016, 10:08:04 AM
 #1712

Did anybody implement NR_ROWS less than 16 support already ?

Nope not yet..and welcome back :-D

GTX 1080Ti rocks da house... seriously... this card is a beast³
Owning by now 18x GTX1080Ti :-D @serious love of efficiency
zawawa
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 13, 2016, 12:49:52 PM
 #1713

Did anybody implement NR_ROWS less than 16 support already ?

I was wondering about the same thing...
I'm almost done, but it's a major PITA to code.

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zawawa
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 13, 2016, 12:54:10 PM
 #1714

I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS.
I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.

I think 14 will be close to ideal.  I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters.  That's 16KB * 2 for the row counters, or only 32KB.


I agree. This is a perfect reason to use a GCN assembler and squeeze two sets of 32KB rowCounters[] into 64KB GDS.
It is almost meant to be  Grin

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
Hotmetal
Sr. Member
****
Offline Offline

Activity: 378
Merit: 250


View Profile
December 13, 2016, 01:06:00 PM
 #1715

I'm now using NR_ROWS_LOG=16 as the new sorting algorithm I implemented seems to prefer a larger NR_SLOTS.
I'm trying to set up NR_ROWS_LOG=14 and 15 to see how that would affect performance.

I think 14 will be close to ideal.  I expect 256 for NR_SLOTS will be enough to avoid row overflows, and that still allow using 1 byte for the row counters.  That's 16KB * 2 for the row counters, or only 32KB.


I agree. This is a perfect reason to use a GCN assembler and squeeze two sets of 32KB rowCounters[] into 64KB GDS.
It is almost meant to be  Grin

Make it so Cheesy
zawawa
Sr. Member
****
Offline Offline

Activity: 728
Merit: 304


Miner Developer


View Profile
December 14, 2016, 12:56:53 AM
 #1716

I will, I will. No worries. I will release my fork in a day or two.
Although it is not as fast as Claymore's yet, it should be much faster than SA v5 on AMD cards.
I already sold most of my NVIDIA cards, so I have no idea how it would compare on NVIDIA cards.
If I had received a lot of donations, I could have justified purchasing GTX 1060 to my wife, but I could only dream...

Gateless Gate Sharp, an open-source ETH/XMR miner: http://bit.ly/2rJ2x4V
BTC: 1BHwDWVerUTiKxhHPf2ubqKKiBMiKQGomZ
zzzzzzzzzz
Full Member
***
Offline Offline

Activity: 150
Merit: 100


View Profile
December 14, 2016, 02:24:42 AM
 #1717

I will, I will. No worries. I will release my fork in a day or two.
Although it is not as fast as Claymore's yet, it should be much faster than SA v5 on AMD cards.
I already sold most of my NVIDIA cards, so I have no idea how it would compare on NVIDIA cards.
If I had received a lot of donations, I could have justified purchasing GTX 1060 to my wife, but I could only dream...
Thanks for your work! Just to clarify (for myself), your fork will be Windows-only, correct? I'm sure looking forward to looking at both your new code and Marc's, when they become available. It will be very interesting to see what approaches you each took on the algo/implementation. Don't let it interfere with your Holiday with the family, though! That's much higher priority! Best wishes ...
Subw
Hero Member
*****
Offline Offline

Activity: 672
Merit: 500


View Profile
December 14, 2016, 02:34:46 AM
 #1718

Although it is not as fast as Claymore's yet, it should be much faster than SA v5 on AMD cards.

how much fast?

I already sold most of my NVIDIA cards, so I have no idea how it would compare on NVIDIA cards.
If I had received a lot of donations, I could have justified purchasing GTX 1060 to my wife, but I could only dream...

will donate $50 for your GTX1060 fund
Subw
Hero Member
*****
Offline Offline

Activity: 672
Merit: 500


View Profile
December 14, 2016, 02:41:56 AM
 #1719

your fork will be Windows-only, correct?

i hope not at least i'm not gonna donate anything for windows only software
zzzzzzzzzz
Full Member
***
Offline Offline

Activity: 150
Merit: 100


View Profile
December 14, 2016, 03:04:10 AM
 #1720

your fork will be Windows-only, correct?

i hope not at least i'm not gonna donate anything for windows only software

Marc (alias mrb) is working on the linux SA v6 miner separately, so there will be an OSS linux miner update. Also, zawawa's will be OSS, so anyone can port it to linux, if they want.
Pages: « 1 ... 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 [86] 87 88 89 90 91 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!