Bitcoin Forum
April 24, 2026, 10:31:02 PM *
News: Latest Bitcoin Core release: 30.2 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 [31] 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 ... 1240 »
  Print  
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2348146 times)
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2996
Merit: 1089

Team Black developer


View Profile
December 13, 2014, 10:18:06 AM
 #601

I fixed it now. I am currently working with implementing uint2 into some of the hashingfunctions. (idea from DJM34 lyra implementation)

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW QUAI-KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2996
Merit: 1089

Team Black developer


View Profile
December 13, 2014, 11:36:07 AM
 #602

The uint2 keccak made x11 20-25 KHASH faster.  Checked in now.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW QUAI-KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining.. https://github.com/sp-hash/TeamBlackMiner
Amph
Legendary
*
Offline Offline

Activity: 3290
Merit: 1072



View Profile
December 14, 2014, 01:32:31 PM
 #603

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 14, 2014, 01:53:59 PM
 #604

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
Amph
Legendary
*
Offline Offline

Activity: 3290
Merit: 1072



View Profile
December 14, 2014, 03:57:41 PM
 #605

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

what number your version is doing?

with sp version i'm currently at 7800 hash with one 970(1400 core)
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 14, 2014, 04:11:10 PM
 #606

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

what number your version is doing?

with sp version i'm currently at 7800 hash with one 970(1400 core)
on x11 ? I don't know, but sp_ version should be better on x11, I haven't looked so far on x11 (the point of this version was Lyra2RE)

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
Amph
Legendary
*
Offline Offline

Activity: 3290
Merit: 1072



View Profile
December 14, 2014, 10:00:36 PM
 #607

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

what number your version is doing?

with sp version i'm currently at 7800 hash with one 970(1400 core)
on x11 ? I don't know, but sp_ version should be better on x11, I haven't looked so far on x11 (the point of this version was Lyra2RE)

oh that's a new algo, a fast google search tells me that is vertcoin releated, it's something anti-asic?
K1773R
Legendary
*
Offline Offline

Activity: 1792
Merit: 1008


/dev/null


View Profile
December 15, 2014, 06:05:15 PM
 #608

HEAD@windows never submits any shares.

[GPG Public Key]
BTC/DVC/TRC/FRC: 1K1773RbXRZVRQSSXe9N6N2MUFERvrdu6y ANC/XPM AK1773RTmRKtvbKBCrUu95UQg5iegrqyeA NMC: NK1773Rzv8b4ugmCgX789PbjewA9fL9Dy1 LTC: LKi773RBuPepQH8E6Zb1ponoCvgbU7hHmd EMC: EK1773RxUes1HX1YAGMZ1xVYBBRUCqfDoF BQC: bK1773R1APJz4yTgRkmdKQhjhiMyQpJgfN
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 15, 2014, 06:35:19 PM
 #609

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 15, 2014, 06:36:13 PM
 #610

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

what number your version is doing?

with sp version i'm currently at 7800 hash with one 970(1400 core)
on x11 ? I don't know, but sp_ version should be better on x11, I haven't looked so far on x11 (the point of this version was Lyra2RE)

oh that's a new algo, a fast google search tells me that is vertcoin releated, it's something anti-asic?
it is anti-asic for now...

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
K1773R
Legendary
*
Offline Offline

Activity: 1792
Merit: 1008


/dev/null


View Profile
December 15, 2014, 08:34:30 PM
 #611

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)
tested with x11 and x13. i see a big hashrate increase, tough never any submitted shares.

[GPG Public Key]
BTC/DVC/TRC/FRC: 1K1773RbXRZVRQSSXe9N6N2MUFERvrdu6y ANC/XPM AK1773RTmRKtvbKBCrUu95UQg5iegrqyeA NMC: NK1773Rzv8b4ugmCgX789PbjewA9fL9Dy1 LTC: LKi773RBuPepQH8E6Zb1ponoCvgbU7hHmd EMC: EK1773RxUes1HX1YAGMZ1xVYBBRUCqfDoF BQC: bK1773R1APJz4yTgRkmdKQhjhiMyQpJgfN
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2996
Merit: 1089

Team Black developer


View Profile
December 15, 2014, 09:45:44 PM
 #612

I just submitted an improved BMW rewrite.

32% faster. Using UINT2 instead of Uint64_t. Added new operators for uint2 operations. SHL2 and SHR2, and minus. (in cuda_helper.h)

Around +30-40 KHASH  on x11, x13 and x15

Now BMW is spilling 0 bytes to slow memory...

I also tried to use uint2 in blake, but it was slower.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW QUAI-KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2996
Merit: 1089

Team Black developer


View Profile
December 15, 2014, 09:50:40 PM
 #613

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)
tested with x11 and x13. i see a big hashrate increase, tough never any submitted shares.

Works here. Is this on linux? wich card?

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW QUAI-KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining.. https://github.com/sp-hash/TeamBlackMiner
Epsylon3
Legendary
*
Offline Offline

Activity: 1484
Merit: 1122


ccminer/cpuminer developer


View Profile WWW
December 15, 2014, 10:05:44 PM
 #614

I had this while testing keccak uint2 changes with a sm30 only binary on the 750 ti... No errors but no found nonces neither.. So i commited the optimisation differently

BTC: 1FhDPLPpw18X4srecguG3MxJYe4a1JsZnd - My Projects: ccminer - cpuminer-multi - yiimp - Forum threads : ccminer - cpuminer-multi - yiimp
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2996
Merit: 1089

Team Black developer


View Profile
December 15, 2014, 10:07:08 PM
 #615

Rewrote skein to use uint2. 10-20 KHASH on x11,x13,x15.  Added to github.

https://github.com/sp-hash/ccminer/commit/f5c7cc861fb4ed62e473437813f4f1a637e672e1

I see that this has also been done in the skein256 (lyra) but the code is different. Is the new implementation faster DJM34?

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW QUAI-KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2996
Merit: 1089

Team Black developer


View Profile
December 15, 2014, 10:08:17 PM
 #616

My version is probobly failing on sm30 cards.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW QUAI-KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining.. https://github.com/sp-hash/TeamBlackMiner
K1773R
Legendary
*
Offline Offline

Activity: 1792
Merit: 1008


/dev/null


View Profile
December 16, 2014, 07:30:12 AM
 #617

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)
tested with x11 and x13. i see a big hashrate increase, tough never any submitted shares.

Works here. Is this on linux? wich card?
yes on linux.
GTX 680 SOC which is unfortunately not compute 5

[GPG Public Key]
BTC/DVC/TRC/FRC: 1K1773RbXRZVRQSSXe9N6N2MUFERvrdu6y ANC/XPM AK1773RTmRKtvbKBCrUu95UQg5iegrqyeA NMC: NK1773Rzv8b4ugmCgX789PbjewA9fL9Dy1 LTC: LKi773RBuPepQH8E6Zb1ponoCvgbU7hHmd EMC: EK1773RxUes1HX1YAGMZ1xVYBBRUCqfDoF BQC: bK1773R1APJz4yTgRkmdKQhjhiMyQpJgfN
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2996
Merit: 1089

Team Black developer


View Profile
December 16, 2014, 08:49:12 AM
Last edit: December 17, 2014, 11:11:26 AM by sp_
 #618

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)
tested with x11 and x13. i see a big hashrate increase, tough never any submitted shares.
Works here. Is this on linux? wich card?
yes on linux.
GTX 680 SOC which is unfortunately not compute 5

The head on Gtihub only works for compute 3.5 cards or newer.

http://en.wikipedia.org/wiki/CUDA

If you have 3.0 you can use the tpruvot version:

https://bitcointalk.org/index.php?topic=770064.0


Team Black Miner (ETHB3 ETH ETC VTC KAWPOW QUAI-KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining.. https://github.com/sp-hash/TeamBlackMiner
K1773R
Legendary
*
Offline Offline

Activity: 1792
Merit: 1008


/dev/null


View Profile
December 16, 2014, 08:59:34 AM
 #619

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)
tested with x11 and x13. i see a big hashrate increase, tough never any submitted shares.
Works here. Is this on linux? wich card?
yes on linux.
GTX 680 SOC which is unfortunately not compute 5

The head on Gtihub only works for compute 3.5 cards or newer.

http://en.wikipedia.org/wiki/CUDA

If you have 3.0 you can use the tvpovet version:

https://bitcointalk.org/index.php?topic=770064.0


thanks.

[GPG Public Key]
BTC/DVC/TRC/FRC: 1K1773RbXRZVRQSSXe9N6N2MUFERvrdu6y ANC/XPM AK1773RTmRKtvbKBCrUu95UQg5iegrqyeA NMC: NK1773Rzv8b4ugmCgX789PbjewA9fL9Dy1 LTC: LKi773RBuPepQH8E6Zb1ponoCvgbU7hHmd EMC: EK1773RxUes1HX1YAGMZ1xVYBBRUCqfDoF BQC: bK1773R1APJz4yTgRkmdKQhjhiMyQpJgfN
Epsylon3
Legendary
*
Offline Offline

Activity: 1484
Merit: 1122


ccminer/cpuminer developer


View Profile WWW
December 17, 2014, 11:08:04 AM
 #620


If you have 3.0 you can use the tvpovet version:

https://bitcointalk.org/index.php?topic=770064.0


Could you try to write once my name correctly ? Wink tx

BTC: 1FhDPLPpw18X4srecguG3MxJYe4a1JsZnd - My Projects: ccminer - cpuminer-multi - yiimp - Forum threads : ccminer - cpuminer-multi - yiimp
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 [31] 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 ... 1240 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!