Bitcoin Forum
March 29, 2017, 03:25:52 AM *
News: Latest stable version of Bitcoin Core: 0.14.0  [Torrent]. (New!)
 
   Home   Help Search Donate Login Register  
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 [18] 19 20 21 »  All
  Print  
Author Topic: [ANN][GRS][DMD] Pallas optimized groestlcoin / diamond etc. opencl kernel  (Read 48984 times)
This is a self-moderated topic. If you do not want to be moderated by the person who started this topic, create a new topic.
Wolf0
Legendary
*
Offline Offline

Activity: 1512


Miner Developer


View Profile
April 16, 2015, 03:37:30 PM
 #341

Just wanted to say I've tried applying some of the tricks I learnt working on whirlpoolx to the groestl kernel, but it's not so simple.
This kernel is much bigger in size so you can't just copy some good lines of code and it runs faster. Furthermore some of the optimizations I made in the past, make it more time consuming to apply some apparently simple hacks. Wolf0 I'm sure you know what I mean ;-)
Still there is room for improvement, I have some ideas, but the question is: when the profit is gone, and the fun is gone, is it still worth?

It is indeed, as long as all mining profit isn't gone. It's a challenge - learn from it, and use that knowledge elsewhere.

Code:
Donations: BTC: 1WoLFdwcfNEg64fTYsX1P25KUzzSjtEZC -- XMR: 45SLUTzk7UXYHmzJ7bFN6FPfzTusdUVAZjPRgmEDw7G3SeimWM2kCdnDQXwDBYGUWaBtZNgjYtEYA22aMQT4t8KfU3vHLHG
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction. Advertise here.
1490757952
Hero Member
*
Offline Offline

Posts: 1490757952

View Profile Personal Message (Offline)

Ignore
1490757952
Reply with quote  #2

1490757952
Report to moderator
Wolf0
Legendary
*
Offline Offline

Activity: 1512


Miner Developer


View Profile
April 16, 2015, 06:33:24 PM
 #342

@wolf0 do you have anything better than the neocrypt kernel u leaked on feathercoin thread?  I am getting 278KHs on 7950 and 295 on 280x

I didn't leak that, I released it. Checking my records...

EDIT: Okay, most recent record of Neoscrypt I have is 12/23/2014 (NSFW): https://ottrbutt.com/miner/neoscryptwolf-12232014.png
Needless to say but I will, I appreciate your work, I have no conception of wavefronts and such, I have tried but I'm just too old to embrace new concepts.  If you have something better for me please do put on Mega Smiley  Same goes for groestl Pallas Smiley  U are my heroes Smiley
And realhet who understands AMD GPU coding better than all of us Smiley  realhet hetpas assembly kernel still best for 280x and other Tahiti cards AFAIK Smiley

Nope, I have 21MH/s out of a 7950 at 1125/1250, IIRC, using OpenCL.

Code:
Donations: BTC: 1WoLFdwcfNEg64fTYsX1P25KUzzSjtEZC -- XMR: 45SLUTzk7UXYHmzJ7bFN6FPfzTusdUVAZjPRgmEDw7G3SeimWM2kCdnDQXwDBYGUWaBtZNgjYtEYA22aMQT4t8KfU3vHLHG
utahjohn
Hero Member
*****
Offline Offline

Activity: 616


View Profile WWW
April 16, 2015, 11:31:53 PM
 #343

@wolf0 do you have anything better than the neocrypt kernel u leaked on feathercoin thread?  I am getting 278KHs on 7950 and 295 on 280x

I didn't leak that, I released it. Checking my records...

EDIT: Okay, most recent record of Neoscrypt I have is 12/23/2014 (NSFW): https://ottrbutt.com/miner/neoscryptwolf-12232014.png
Needless to say but I will, I appreciate your work, I have no conception of wavefronts and such, I have tried but I'm just too old to embrace new concepts.  If you have something better for me please do put on Mega Smiley  Same goes for groestl Pallas Smiley  U are my heroes Smiley
And realhet who understands AMD GPU coding better than all of us Smiley  realhet hetpas assembly kernel still best for 280x and other Tahiti cards AFAIK Smiley

Nope, I have 21MH/s out of a 7950 at 1125/1250, IIRC, using OpenCL.
Wow! may I have new Neoscrypt kernel, 7950 working hard just doing 278KHs with your older kernel!

Looking ... 1160/1500 Smiley I have modded card a bit for better cooling Smiley

DMD: dUTjohnrXHGYkh7jELWrZkGJbMnE6mdsuh (Staking)
BTC: 1HANJQygp3jHuzutceBgMT7wfCgEug6h4L (Donation)
ETH: 0xba90d7c1ab2bb9d5c07d843476153d1722637250 Mine ETH for 0.5% http://donkeypool.com
Wolf0
Legendary
*
Offline Offline

Activity: 1512


Miner Developer


View Profile
April 16, 2015, 11:47:51 PM
 #344

@wolf0 do you have anything better than the neocrypt kernel u leaked on feathercoin thread?  I am getting 278KHs on 7950 and 295 on 280x

I didn't leak that, I released it. Checking my records...

EDIT: Okay, most recent record of Neoscrypt I have is 12/23/2014 (NSFW): https://ottrbutt.com/miner/neoscryptwolf-12232014.png
Needless to say but I will, I appreciate your work, I have no conception of wavefronts and such, I have tried but I'm just too old to embrace new concepts.  If you have something better for me please do put on Mega Smiley  Same goes for groestl Pallas Smiley  U are my heroes Smiley
And realhet who understands AMD GPU coding better than all of us Smiley  realhet hetpas assembly kernel still best for 280x and other Tahiti cards AFAIK Smiley

Nope, I have 21MH/s out of a 7950 at 1125/1250, IIRC, using OpenCL.
Wow! may I have new Neoscrypt kernel, 7950 working hard just doing 278KHs with your older kernel!

The 21MH/s is Groestl, I meant. And... idk, I suppose Neoscrypt isn't very useful at the moment...

Code:
Donations: BTC: 1WoLFdwcfNEg64fTYsX1P25KUzzSjtEZC -- XMR: 45SLUTzk7UXYHmzJ7bFN6FPfzTusdUVAZjPRgmEDw7G3SeimWM2kCdnDQXwDBYGUWaBtZNgjYtEYA22aMQT4t8KfU3vHLHG
utahjohn
Hero Member
*****
Offline Offline

Activity: 616


View Profile WWW
April 17, 2015, 12:08:03 AM
 #345

I get 26MHs on 280x mining groestl however I have quit groestl mining of DMD for the moment till diff drops back into the teens.  For some reason ASM kernel crashes 7950 within a few minutes ...  I am mining nneoscrypt on yaamp at present and also selling neo on westhash Smiley
Buying more DMD than I used to mine direct Huh  Will see what happens in next week or so as miners drop like flies on DMD ...

DMD: dUTjohnrXHGYkh7jELWrZkGJbMnE6mdsuh (Staking)
BTC: 1HANJQygp3jHuzutceBgMT7wfCgEug6h4L (Donation)
ETH: 0xba90d7c1ab2bb9d5c07d843476153d1722637250 Mine ETH for 0.5% http://donkeypool.com
smolen
Hero Member
*****
Offline Offline

Activity: 525


View Profile
April 17, 2015, 04:02:13 AM
 #346

Just wanted to say I've tried applying some of the tricks I learnt working on whirlpoolx to the groestl kernel, but it's not so simple.
This kernel is much bigger in size so you can't just copy some good lines of code and it runs faster. Furthermore some of the optimizations I made in the past, make it more time consuming to apply some apparently simple hacks. Wolf0 I'm sure you know what I mean ;-)
Still there is room for improvement, I have some ideas, but the question is: when the profit is gone, and the fun is gone, is it still worth?
Another trick, not for speed, but for cleaning the code - when you want to postpone sboxing of byte, put preimage of zero (0x81 in Whirlpool) there.

Of course I gave you bad advice. Good one is way out of your price range.
realhet
Jr. Member
*
Offline Offline

Activity: 32


View Profile WWW
May 17, 2015, 10:21:24 AM
 #347

Hi,

Have you checked the new GCN3 ISA manual? http://amd-dev.wpengine.netdna-cdn.com/wordpress/media/2013/07/AMD_GCN3_Instruction_Set_Architecture.pdf

It has some really useful things like:

- Bytepermute (no more shifts and masks)
- VOP_DPP: It actually does 2 ds_swizzle in the instruction in no time, so optimizing a single thread for 4 lanes costs no more cycles.
- VOP_SDWA: access a word or a byte in the 32bit inputs and in the output too. (again: no more shifts and masks)
- S alu can write memory

No 3 op add, and 3 op bitwise, though.

And they altered some instruction encodings, so I guess my asm will crash on GCN3 immediately. Cheesy

MaxDZ8
Hero Member
*****
Offline Offline

Activity: 672



View Profile
May 18, 2015, 03:44:15 PM
 #348

That's some truly slick updates!

I was indeed planning to do full AES round without t-tables as the amount of masks are nonsensical.
I had the impression the SALU was immensely updated for Tonga given it takes much more VGPRs on the analyzer.

I wonder how to trick the CL compiler in emitting this code.

But most importantly, what are they waiting to just make an AMD_GCN_swizzle extension!?
realhet
Jr. Member
*
Offline Offline

Activity: 32


View Profile WWW
May 21, 2015, 12:31:48 PM
 #349

It doesn't seems like they are implementing gcn specific goodies on the current compiler stack. It's kinda bloated, and AMD_IL awaits for it's replacement since 7970 came out. I'm sure in the upcoming HSA language there will be much more GCN things implemented (except the separated V and S programming).
pallas
Legendary
*
Offline Offline

Activity: 1232


Black Belt Developer


View Profile
July 01, 2015, 09:45:05 PM
 #350

I'm interested in knowing the hashrate of R9 285 and R9 Fury X cards, anybody?

pallas
Legendary
*
Offline Offline

Activity: 1232


Black Belt Developer


View Profile
July 03, 2015, 08:31:30 AM
 #351

Wolf0 created a faster Tahiti binary and posted about it in the groestlcoin thread:

I have a faster Tahiti binary than Pallas' for Groestlcoin - works on DMD, too. The usage is the same as his binary; I should have more info later.

Get it here: https://ottrbutt.com/miner/wolf-groestlcoinTahitigw256l4.bin

it is indeed faster and works flawlessly.
usage: just rename it over the old one and make sure you set worksize 256 for that card; you can get a bit more hashrate by using 2 or 4 threads.

pallas
Legendary
*
Offline Offline

Activity: 1232


Black Belt Developer


View Profile
October 13, 2015, 11:32:00 AM
 #352

Nothing new in the groestl+groestl area, but I've worked a bit on the groestl+sha variant (myr-groestl for myriad, digibyte, saffron, etc.).
Tahiti is a mess, but I could easily push hawaii over 60 Mh/s, keeping the kernel compatible with the old miners.

pallas
Legendary
*
Offline Offline

Activity: 1232


Black Belt Developer


View Profile
October 16, 2015, 12:59:30 PM
 #353

Nothing new in the groestl+groestl area, but I've worked a bit on the groestl+sha variant (myr-groestl for myriad, digibyte, saffron, etc.).
Tahiti is a mess, but I could easily push hawaii over 60 Mh/s, keeping the kernel compatible with the old miners.

I could finally get rid of scratch registers on Tahiti: now the 280x is doing 35 Mh/s with moderate overclock :-)

carlo_0000
Hero Member
*****
Offline Offline

Activity: 588


View Profile
November 02, 2015, 01:30:27 AM
 #354

the diamond.cl  is missing on the download
i only see groestlcoin-v1.cl

or must we just rename to diamond ?

so i rename to diamond.cl
but no change in my speed i have 4.7 mh on r9 270  sgminer 4.1.0

i guest groestlcoin-v1.cl is not for diamond, i ve got a lot rejected shares

my BTC  3Lt6aNAsetewmMsgnjHtx659qcVchhS2nK
my HBN  F1DMYtUYMV7fSVXphFLnAub7ACfLkunaQz
pallas
Legendary
*
Offline Offline

Activity: 1232


Black Belt Developer


View Profile
November 02, 2015, 09:01:07 AM
 #355

the diamond.cl  is missing on the download
i only see groestlcoin-v1.cl

or must we just rename to diamond ?

so i rename to diamond.cl
but no change in my speed i have 4.7 mh on r9 270  sgminer 4.1.0

i guest groestlcoin-v1.cl is not for diamond, i ve got a lot rejected shares

groestlcoin and diamond use the same block hashing algo so the same opencl kernel applies.
but you must configure the miner to mine for the specific coin because there are differences!
that's why there are two kernels even thought the two kernel files are the same.

please posto your conf file and commandline so I can help you debug it.

carlo_0000
Hero Member
*****
Offline Offline

Activity: 588


View Profile
November 03, 2015, 12:20:37 AM
 #356

so i have r9 270  with driver 15.10

sgminer_diamond_v4.1.0

my batch

setx GPU_MAX_ALLOC_PERCENT 100
setx GPU_USE_SYNC_OBJECTS 1
"E:\myriadcoin\cgminer skein\sgminer_diamond_v4.1.0\sgminer.exe" -k diamond -o stratum+tcp://eu.miningfield.com:3377 -u carlo0000.r9a -p 0 --difficulty-multiplier 0.0039062500 -w 256 -I 22 -T

so i can only use the kernel
the bin are for 290 and 280, i try to use and rename but sgminer crash

my bin file name is  diamondPitcairnglg2tc10688w256l4.bin


i try again it s working now with wolf-groestlcoinTahitigw256l4
the other one crash

i notice i had diamondPitcairnglg2tc10688w256l4.bin in my user folder C:\Users\carlo  , i delete

i have 8.7 mh  Grin @1025mhz

thanks for help

so i run it on my other computer, that one does 2x  9.3 mh @1040mhz  with driver 15.7 with display is at 800*600 with no screen,

so u put this computer to 1040mhz too but i does only 8.8 , but i have lot of stuff running on it a the display it at 1080p
so i put back to 1025 and gonna mine mry skein again on this one, it s less intensive, screen is really slow with mining diamond @ I22
but on skein it s more faster with I8 (max) 140mh

even with double hashrate now on diamond, i still have more incomes with myr with skein on my r9 270
but i don't know how much it s gona make with POS on diamond  so maybe not a big difference in long term

or not i just made new calculation MYR is still droping a lot last days
difficulty is higher and price get down a lot   , last week i was at 120000 satochi day,

today it s only 78000 sat  Huh  it s less than mining diamond
so i m going on diamond for now

my BTC  3Lt6aNAsetewmMsgnjHtx659qcVchhS2nK
my HBN  F1DMYtUYMV7fSVXphFLnAub7ACfLkunaQz
pallas
Legendary
*
Offline Offline

Activity: 1232


Black Belt Developer


View Profile
November 03, 2015, 09:12:13 AM
 #357

so i have r9 270  with driver 15.10

...snip...

Instead of using binaries made for other chips, why not simply compiling your own for pitcairn? just overwrite diamond.cl, remove the bin files and run.
Let me know how it goes :-)

carlo_0000
Hero Member
*****
Offline Offline

Activity: 588


View Profile
November 04, 2015, 01:40:16 AM
 #358

so i have r9 270  with driver 15.10

...snip...

Instead of using binaries made for other chips, why not simply compiling your own for pitcairn? just overwrite diamond.cl, remove the bin files and run.
Let me know how it goes :-)

it s what i did first   not using the bin file  but speed was 4.6mh

but i think maybe because there was the old bin file in C:\Users\carlo   (sgminer use that one  sometimes) strange but not always

when i see it don't create a bin file in sgminer  directory  than i lnow it s using the one is  C:\Users\carlo

i gonna tri again  remove them both

so i delete, start sgmine  it create a new but  now working 4.6mhz only

so the bin make the difference   not the kernel


my BTC  3Lt6aNAsetewmMsgnjHtx659qcVchhS2nK
my HBN  F1DMYtUYMV7fSVXphFLnAub7ACfLkunaQz
Heavyiron
Newbie
*
Offline Offline

Activity: 18


View Profile
November 05, 2015, 09:53:08 PM
 #359

Hm, bin from Wolf0 works fine and faster a little bit on 7850 (8 MH/s) and even 5770 (3,6 MH/s). On kernel v1 compiled on 14.7 RC3 driver speeds were 7,2 and 3,2 at the same clocks. Nice work.
pallas
Legendary
*
Offline Offline

Activity: 1232


Black Belt Developer


View Profile
January 18, 2016, 07:51:14 PM
 #360

Myriad-groestl: I tried splitting the kernel into two parts, groestl and sha. I was almost sure it would be an improvement but it is a little slower instead (and it requires a custom miner). It could fix Tahiti slowness, though. I don't know yet because I didn't have such a card ready on the rig to test.

Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 [18] 19 20 21 »  All
  Print  
 
Jump to:  

Sponsored by , a Bitcoin-accepting VPN.
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!