Bitcoin Forum
December 02, 2025, 06:42:16 AM *
News: Latest Bitcoin Core release: 30.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 [4] 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 ... 1240 »
  Print  
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2348054 times)
SS2006
Sr. Member
****
Offline Offline

Activity: 285
Merit: 250


View Profile
October 20, 2014, 12:43:42 AM
 #61

Very nice! I'm now seeing 6600-6700 KH/S on my GTX 970 OC'd by 200 on clock on X11. (Original was 6100-6200), so an extra 500 KH/S
Def the best ccminer I've seen for X11 (i only mine X11)
Thanks! I hope there is more improvement, donations coming soon

I should add the hash rate is not too stable at times, it does bounce around a lot from 6300 to 6700
jpouza
Legendary
*
Offline Offline

Activity: 3106
Merit: 1142


View Profile
October 20, 2014, 02:04:25 AM
Last edit: October 20, 2014, 07:39:18 AM by jpouza
 #62

I have emailed some of you a version of the miner with compute 5.2

Please test and report in the thread.

If I forgot anyone please resend your email on pm.


JESUS!!!!WTF!!!
Almost 8MH/s@X11 on my 980 cards. WOW!!!
Keep up the awesome job man!

jpouza
Legendary
*
Offline Offline

Activity: 3106
Merit: 1142


View Profile
October 20, 2014, 02:37:02 AM
 #63

Monster!


share image
jjjordan
Sr. Member
****
Offline Offline

Activity: 271
Merit: 251


View Profile
October 20, 2014, 05:48:36 AM
 #64

EVGA GTX 970 SC ACX1 (stock)
X11 5900-6000
X13 4800-4900
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 20, 2014, 06:12:12 AM
 #65

Very nice! I'm now seeing 6600-6700 KH/S on my GTX 970 OC'd by 200 on clock on X11. (Original was 6100-6200), so an extra 500 KH/S
Def the best ccminer I've seen for X11 (i only mine X11)
Thanks! I hope there is more improvement, donations coming soon

I should add the hash rate is not too stable at times, it does bounce around a lot from 6300 to 6700

Try to mine with a higher diff. Each time the miner is finding a nounce the hashrate is dropping. You can alse see this in GPU-Z. I think the author of the fork tried to fix this. Thread issue... I am focusing on the kernals right now. I beleive I can push the 980 above 10MHASH. NVIDIA will sell a shitload of cards with my improved miner, ask them to donate a card for me. Wink

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
SS2006
Sr. Member
****
Offline Offline

Activity: 285
Merit: 250


View Profile
October 20, 2014, 06:38:13 AM
 #66

heck if you can push the 970 that far relatively might get you card myself Tongue
SS2006
Sr. Member
****
Offline Offline

Activity: 285
Merit: 250


View Profile
October 20, 2014, 06:54:04 AM
 #67

Very nice! I'm now seeing 6600-6700 KH/S on my GTX 970 OC'd by 200 on clock on X11. (Original was 6100-6200), so an extra 500 KH/S
Def the best ccminer I've seen for X11 (i only mine X11)
Thanks! I hope there is more improvement, donations coming soon

I should add the hash rate is not too stable at times, it does bounce around a lot from 6300 to 6700

Try to mine with a higher diff. Each time the miner is finding a nounce the hashrate is dropping. You can alse see this in GPU-Z. I think the author of the fork tried to fix this. Thread issue... I am focusing on the kernals right now. I beleive I can push the 980 above 10MHASH. NVIDIA will sell a shitload of cards with my improved miner, ask them to donate a card for me. Wink

the difficulty fixed it. I always thought the faster i saw yay the better. Even tho now I see consistent high rates (6700), the accepts are much slower, is that OK, am I still getting paid the same/more on the pool side?
Thanks man
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 20, 2014, 10:48:57 AM
 #68

the difficulty fixed it. I always thought the faster i saw yay the better. Even tho now I see consistent high rates (6700), the accepts are much slower, is that OK, am I still getting paid the same/more on the pool side?
Thanks man

Yes, you will be payed the same. Probobly abit more because higher stable hashrates.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 20, 2014, 10:58:13 AM
 #69

I don't know if the 980 will ever beat a 290X in raw hashrate; power consumption, certainly, but not plain performance. I've got 8.2MH/s on low clocks now - getting a better card in a couple of days.

Still alot than can be done. Half of the Hashfunctions are still not modified in my mod. My compute 5.2 build is close to 8 MHASH with heavy overclock. 10MHASH is only 25% faster.

25% faster but 100% more profit for the miners.

happy hashing

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
go6ooo1212
Legendary
*
Offline Offline

Activity: 1512
Merit: 1000


quarkchain.io


View Profile
October 20, 2014, 05:34:25 PM
 #70

@sp_
I've tested the new reliese all day on x13.
970 went up to 5050 kH/s ; 980 reaches 5960 kH/s
The losses are about 2.4-2.5%
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 20, 2014, 07:48:02 PM
 #71

Today I have integrated the changes from Schleicher that he posted on the ccminer thread into my mod:

Quote from: Schleicher on October 19, 2014, 05:07:14 PM
I managed to increase the quark and nist5 speed a little bit.
Source code:
https://github.com/KlausT/ccminer

simd512 seems to be 10% faster. 
My stock 750TI is approaching 2700 without overclock. and 3000 with oclock.

This is without the faster Keccak (created by nvidia) in cudaminer.

These are the optimized numbers so far.

Blake         xxx (not done)
skein    1.5%
BMW       60%
jh512    4.5%
keccac    1%
cubehash: 7.5%
shavite: 3.6%
simd512: 9,2%
fuge:   4,70%      
hamsi:  6.97% 
shabal: 22%   
wirlpool:1.87%   
echo: 5.5% 
luffa: 0.4%

3 coders have contributed to the new speedup.

The sourcecode will be checked into the blakecoin fork by Epsylon3

https://bitcointalk.org/index.php?topic=770064.0

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
Amph
Legendary
*
Offline Offline

Activity: 3276
Merit: 1072



View Profile
October 20, 2014, 08:01:04 PM
 #72

Very nice! I'm now seeing 6600-6700 KH/S on my GTX 970 OC'd by 200 on clock on X11. (Original was 6100-6200), so an extra 500 KH/S
Def the best ccminer I've seen for X11 (i only mine X11)
Thanks! I hope there is more improvement, donations coming soon

I should add the hash rate is not too stable at times, it does bounce around a lot from 6300 to 6700

hey that's good, can you tell me the consumption?
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
October 21, 2014, 12:20:42 AM
 #73

Today I have integrated the changes from Schleicher that he posted on the ccminer thread into my mod:

Quote from: Schleicher on October 19, 2014, 05:07:14 PM
I managed to increase the quark and nist5 speed a little bit.
Source code:
https://github.com/KlausT/ccminer

simd512 seems to be 10% faster.  
My stock 750TI is approaching 2700 without overclock. and 3000 with oclock.

This is without the faster Keccak (created by nvidia) in cudaminer.

These are the optimized numbers so far.

Blake         xxx (not done)
skein    1.5%
BMW       60%
jh512    4.5%
keccac    1%
cubehash: 7.5%
shavite: 3.6%
simd512: 9,2%
fuge:   4,70%      
hamsi:  6.97%  
shabal: 22%   
wirlpool:1.87%   
echo: 5.5%  
luffa: 0.4%

3 coders have contributed to the new speedup.

The sourcecode will be checked into the blakecoin fork by Epsylon3

https://bitcointalk.org/index.php?topic=770064.0


That uint2 keccak was done by mtrlt, if I'm not mistaken.
actually using it makes no difference on x11 it is already the fastest kernel of the bunch by far, also for some strange reason it does not work with compute 5.2 (registers gets confused  Grin had some weird issues when testing where some variables weren't updated at all...)

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 21, 2014, 05:32:43 AM
 #74

Have you investigated whatever is going on with jackpotcoin?  I emailed you what was going on when I tried mining it.  It seemed like a nice improvement, but the pools only saw about 1/8 of the reported hash in the cmd prompt.

In the first version the blake implementation was commented out for jackpoint coin. 1/8 of the reported hash. What are the odds for that? The chained cryptohash is missing one of the algorithms and still reports found nounces. Is this the killer Blake?

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 21, 2014, 07:38:59 AM
 #75

Today I have integrated the changes from Schleicher that he posted on the ccminer thread into my mod:

Quote from: Schleicher on October 19, 2014, 05:07:14 PM
I managed to increase the quark and nist5 speed a little bit.
Source code:
https://github.com/KlausT/ccminer

simd512 seems to be 10% faster. 
My stock 750TI is approaching 2700 without overclock. and 3000 with oclock.

This is without the faster Keccak (created by nvidia) in cudaminer.

These are the optimized numbers so far.

Blake         xxx (not done)
skein    1.5%
BMW       60%
jh512    4.5%
keccac    1%
cubehash: 7.5%
shavite: 3.6%
simd512: 9,2%
fuge:   4,70%      
hamsi:  6.97% 
shabal: 22%   
wirlpool:1.87%   
echo: 5.5% 
luffa: 0.4%

3 coders have contributed to the new speedup.

The sourcecode will be checked into the blakecoin fork by Epsylon3

https://bitcointalk.org/index.php?topic=770064.0


That uint2 keccak was done by mtrlt, if I'm not mistaken.
// Experimental Kernel for Kepler (Compute 3.5) devices
// code submitted by nVidia performance engineer Alexey Panteleev

https://github.com/cbuchner1/CudaMiner/blob/35984c723eb786d614158daca9b07ac20de8645d/nv_kernel2.cu


Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 21, 2014, 07:14:45 PM
 #76

Today I managed a few percent in groestl!

All the kernals in x11 have now been optimized, but there is more potential.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
October 21, 2014, 07:37:46 PM
 #77

Today I have integrated the changes from Schleicher that he posted on the ccminer thread into my mod:

Quote from: Schleicher on October 19, 2014, 05:07:14 PM
I managed to increase the quark and nist5 speed a little bit.
Source code:
https://github.com/KlausT/ccminer

simd512 seems to be 10% faster.  
My stock 750TI is approaching 2700 without overclock. and 3000 with oclock.

This is without the faster Keccak (created by nvidia) in cudaminer.

These are the optimized numbers so far.

Blake         xxx (not done)
skein    1.5%
BMW       60%
jh512    4.5%
keccac    1%
cubehash: 7.5%
shavite: 3.6%
simd512: 9,2%
fuge:   4,70%      
hamsi:  6.97%  
shabal: 22%   
wirlpool:1.87%   
echo: 5.5%  
luffa: 0.4%

3 coders have contributed to the new speedup.

The sourcecode will be checked into the blakecoin fork by Epsylon3

https://bitcointalk.org/index.php?topic=770064.0


That uint2 keccak was done by mtrlt, if I'm not mistaken.
// Experimental Kernel for Kepler (Compute 3.5) devices
// code submitted by nVidia performance engineer Alexey Panteleev

https://github.com/cbuchner1/CudaMiner/blob/35984c723eb786d614158daca9b07ac20de8645d/nv_kernel2.cu



It's been done before, in OpenCL. See Maxcoin's CL file.
hmm, except that nvidia miner was there first...  Grin
(but, I think Reorder used it a lot)
And actually it makes no difference on compute 3.0 and 3.5 (might explain why it wasn't used anymore)

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 21, 2014, 07:38:26 PM
 #78

// Experimental Kernel for Kepler (Compute 3.5) devices
// code submitted by nVidia performance engineer Alexey Panteleev
https://github.com/cbuchner1/CudaMiner/blob/35984c723eb786d614158daca9b07ac20de8645d/nv_kernel2.cu
It's been done before, in OpenCL. See Maxcoin's CL file.

(Compute 3.5) is outdated. All kernals needs to be rewritten for maxwell for optimal performance. preferably in Assembly language. This work takes months.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 21, 2014, 07:41:53 PM
 #79

My groestl fixes seems to increase x11 with around 20 khash on the 750ti. around 60khash on the 980. The groestl is now only the 3rd slowest of the x11 hashes.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
October 21, 2014, 09:58:21 PM
 #80

Managed to fix quark and Nist5 now. I had to rollback my assembly blake and the optimalisations I merged from Schleicher.

750ti standard clock
quark/nist5
4950/ 8000

750ti oc.
quark/nist5
5600/9000

But all algorithms seems to work now.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
Pages: « 1 2 3 [4] 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 ... 1240 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!