Bitcoin Forum
Author Topic: [ANN][GRS][DMD][DGB] Pallas optimized groestl opencl kernels  (Read 61242 times)
This is a self-moderated topic. If you do not want to be moderated by the person who started this topic, create a new topic.
pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 20, 2014, 09:49:22 AM
 #41


Congratulations Pallas.

Do you have it on Github already?

Thanks.
No git yet. I was thinking it's not necessary for a single file, but now there are bins and maybe multiple versions so I may end up doing it.

qwep
Legendary
*
Offline Offline

Activity: 1008
Merit: 1000



View Profile
September 20, 2014, 06:58:13 PM
 #42


Congratulations, excellent optimization. But what about the other algorithms?
pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 20, 2014, 08:02:24 PM
 #43


I did work on other algorithms too (x11 components, m7 and others) but nothing ready for publication.
I'd need more time and lower kilowatt hour cost in order to go ahead :-)

qwep
Legendary
*
Offline Offline

Activity: 1008
Merit: 1000



View Profile
September 20, 2014, 08:49:54 PM
 #44


I understand that all miners are written in C++. Why not in C#? It is a bit faster than C++.
pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 20, 2014, 08:57:27 PM
 #45


Most miners are written in plain C, but it doesn't matter that much unless you are mining with the CPU (and even there, some algorithms use a good deal of assembly). GPU code is written in OpenCL or CUDA instead.

smolen
Hero Member
*****
Offline Offline

Activity: 524
Merit: 500


View Profile
September 24, 2014, 12:09:06 AM
 #46

Take a look at my groestl implementation ;)

Of course I gave you bad advice. Good one is way out of your price range.
pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 24, 2014, 07:06:02 AM
 #47


Thanks. From a first look, I don't see anything I haven't tried yet :-)
Do you have some hashrate figures?

smolen
Hero Member
*****
Offline Offline

Activity: 524
Merit: 500


View Profile
September 24, 2014, 05:13:54 PM
 #48


Sorry, no test results. I'm away from all crypto stuff; that's a rather abandoned project, collecting virtual dust on my HDD...
I didn't quite grok all your tricks :) I only use 3 arrays of 32 integers for intermediate results, so memory usage should be almost minimal, and that kind of buffer reuse could be an independent optimization. I'm quite sure you have tried the rest :)
pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 24, 2014, 05:58:15 PM
 #49

I did the best I could to reduce register usage, but in the end using more 64-bit values turned out to be faster. Probably private memory is not fully used...

smolen
Hero Member
*****
Offline Offline

Activity: 524
Merit: 500


View Profile
September 24, 2014, 06:29:22 PM
 #50

Maybe the 64-bit math tricked the AMD OpenCL compiler away from useless 'optimizations' :) I once got a strange effect where inserting absolutely unrelated operations (well, it was copy protection) in the middle of heavy number crunching resulted in a ~5% speed increase.
utahjohn
Hero Member
*****
Offline Offline

Activity: 630
Merit: 500


View Profile
September 24, 2014, 06:58:55 PM
 #51

@pallas
Any chance of you integrating your groestl kernel into the optimised X11 and X13 kernels on my BCT thread (in my sig)? :)
pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 24, 2014, 08:18:21 PM
 #52

I'll have a look asap. Actually I don't have much free time now, so it may take a bit.

pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 24, 2014, 08:21:07 PM
 #53

Sometimes it looks random indeed! :-D
And sometimes compiling the same .cl file leads to different hashrates O_o

istvandv
Sr. Member
****
Offline Offline

Activity: 342
Merit: 250


View Profile
September 25, 2014, 04:58:45 AM
 #54

While you are at it, how about myr-groestl? :D

pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 25, 2014, 07:28:17 AM
 #55

I had a quick look some time ago: some of the tricks that work on this kernel do not make myr-groestl any faster, thus I'd need to re-tune it from scratch... :-/

Spider07
Full Member
***
Offline Offline

Activity: 144
Merit: 100


View Profile
September 28, 2014, 07:08:43 AM
 #56

Hello,
How do you get such a result? I only get 6 Mh/s.
Are my settings wrong?

sgminer.exe -k groestlcoin -o localhost:17772 -u XXXX -p XXXXXXXX -I 22 -w 256 -g 1 --thread-concurrency 24000 --gpu-engine 1100 --gpu-memclock 1250


Thanks

pallas (OP)
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 28, 2014, 07:26:05 AM
 #57

See the troubleshooting section in the OP.
Lower your memory clock and try the compiled binary.
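
A minimal sketch of how one might try several lower memory clocks in turn, reusing only the sgminer flags from the command line in the question. The helper name `build_sgminer_args`, the candidate clock values, and the placeholder credentials are assumptions for illustration; the script only prints the command lines and does not start the miner.

```python
import shlex


def build_sgminer_args(memclock, engine=1100, user="XXXX", password="XXXXXXXX"):
    """Return the sgminer argument list for one memory clock setting."""
    return [
        "sgminer.exe",
        "-k", "groestlcoin",
        "-o", "localhost:17772",
        "-u", user,
        "-p", password,
        "-I", "22",
        "-w", "256",
        "-g", "1",
        "--thread-concurrency", "24000",
        "--gpu-engine", str(engine),
        "--gpu-memclock", str(memclock),
    ]


# Hypothetical candidate clocks, stepping down from the 1250 used in the question.
CANDIDATE_MEMCLOCKS = [1250, 1000, 750, 500]

if __name__ == "__main__":
    for clock in CANDIDATE_MEMCLOCKS:
        # Print one command line per candidate so each can be pasted into a .bat file.
        print(shlex.join(build_sgminer_args(clock)))
```

Each printed line can be run for a few minutes and the reported hashrate compared by hand; which clock works best depends on the card.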

Spider07
Full Member
***
Offline Offline

Activity: 144
Merit: 100


View Profile
September 28, 2014, 07:33:57 AM
Last edit: September 28, 2014, 08:01:10 AM by Spider07
 #58

Oh.
I tried the compiled binary, with no more success.
I pasted your binary into my folder, copied the name of my generated .bin, deleted my .bin, renamed your .bin to that name, and ran my .bat again.
Is that the right way to proceed?

I also changed to --gpu-memclock 350, without any success.
Thanks
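
The swap procedure described above could be scripted roughly as follows. This is only a sketch under stated assumptions: the function name `swap_kernel_binary` and all paths are hypothetical, and instead of deleting the locally generated .bin it keeps it as a `.bak` backup so the change can be undone.

```python
import shutil
from pathlib import Path


def swap_kernel_binary(miner_dir, generated_bin_name, downloaded_bin):
    """Put a downloaded kernel .bin in place of the locally generated one.

    The downloaded file is copied under the exact name the miner generated.
    The original file, if present, is renamed to <name>.bak rather than deleted.
    """
    miner_dir = Path(miner_dir)
    target = miner_dir / generated_bin_name
    if target.exists():
        backup = target.with_name(target.name + ".bak")
        # Path.replace overwrites an older backup on every platform.
        target.replace(backup)
    shutil.copyfile(str(downloaded_bin), str(target))
    return target
```

Restoring the original is then a matter of renaming the `.bak` file back to the generated name.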

losk22
Legendary
*
Offline Offline

Activity: 1354
Merit: 1000


View Profile
September 28, 2014, 09:24:09 AM
 #59

"Replace groestlcoin.cl, diamond.cl and/or the kernel you want to use with this one (it's inside the "kernel" folder)." Did you read this part?
Spider07
Full Member
***
Offline Offline

Activity: 144
Merit: 100


View Profile
September 28, 2014, 10:52:27 AM
Last edit: September 28, 2014, 12:01:32 PM by Spider07
 #60

Thanks for your help.

Sorry, I forgot to mention that I have already done that.
To be sure, I deleted all the files in the kernel folder.
I now have only one file (groestlcoin-v1.cl renamed to groestlcoin.cl).

I can't reach 16 Mh/s... :'( only 6... :'(
