Bitcoin Forum
November 11, 2024, 12:52:06 PM *
News: Latest Bitcoin Core release: 28.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 [31] 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 ... 1240 »
  Print  
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2347573 times)
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2954
Merit: 1087

Team Black developer


View Profile
December 12, 2014, 07:52:52 PM
 #601

Nice, you finally updated your fork Smiley



Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 12, 2014, 08:03:36 PM
 #602

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2954
Merit: 1087

Team Black developer


View Profile
December 12, 2014, 09:25:30 PM
 #603

I have merged all the files from tvpouvet (1.5.1git version)

New beta from github: Should support old cards as well. Compute 3.0 or newer.

http://www.filedropper.com/release19

x11 is down a bit, but hope that it is more stable

source

https://github.com/sp-hash/ccminer/

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
Schleicher
Hero Member
*****
Offline Offline

Activity: 675
Merit: 514



View Profile
December 13, 2014, 12:00:38 AM
 #604

I have merged all the files from tvpouvet (1.5.1git version)
New beta from github: Should support old cards as well. Compute 3.0 or newer.

http://www.filedropper.com/release19

x11 is down a bit, but hope that it is more stable
source

https://github.com/sp-hash/ccminer/
I think you should check your files on github. There's some stuff missing.

sp_ (OP)
Legendary
*
Offline Offline

Activity: 2954
Merit: 1087

Team Black developer


View Profile
December 13, 2014, 10:18:06 AM
 #605

I fixed it now. I am currently working with implementing uint2 into some of the hashingfunctions. (idea from DJM34 lyra implementation)

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2954
Merit: 1087

Team Black developer


View Profile
December 13, 2014, 11:36:07 AM
 #606

The uint2 keccak made x11 20-25 KHASH faster.  Checked in now.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
Amph
Legendary
*
Offline Offline

Activity: 3248
Merit: 1070



View Profile
December 14, 2014, 01:32:31 PM
 #607

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 14, 2014, 01:53:59 PM
 #608

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
Amph
Legendary
*
Offline Offline

Activity: 3248
Merit: 1070



View Profile
December 14, 2014, 03:57:41 PM
 #609

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

what number your version is doing?

with sp version i'm currently at 7800 hash with one 970(1400 core)
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 14, 2014, 04:11:10 PM
 #610

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

what number your version is doing?

with sp version i'm currently at 7800 hash with one 970(1400 core)
on x11 ? I don't know, but sp_ version should be better on x11, I haven't looked so far on x11 (the point of this version was Lyra2RE)

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
Amph
Legendary
*
Offline Offline

Activity: 3248
Merit: 1070



View Profile
December 14, 2014, 10:00:36 PM
 #611

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

what number your version is doing?

with sp version i'm currently at 7800 hash with one 970(1400 core)
on x11 ? I don't know, but sp_ version should be better on x11, I haven't looked so far on x11 (the point of this version was Lyra2RE)

oh that's a new algo, a fast google search tells me that is vertcoin releated, it's something anti-asic?
K1773R
Legendary
*
Offline Offline

Activity: 1792
Merit: 1008


/dev/null


View Profile
December 15, 2014, 06:05:15 PM
 #612

HEAD@windows never submits any shares.

[GPG Public Key]
BTC/DVC/TRC/FRC: 1K1773RbXRZVRQSSXe9N6N2MUFERvrdu6y ANC/XPM AK1773RTmRKtvbKBCrUu95UQg5iegrqyeA NMC: NK1773Rzv8b4ugmCgX789PbjewA9fL9Dy1 LTC: LKi773RBuPepQH8E6Zb1ponoCvgbU7hHmd EMC: EK1773RxUes1HX1YAGMZ1xVYBBRUCqfDoF BQC: bK1773R1APJz4yTgRkmdKQhjhiMyQpJgfN
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 15, 2014, 06:35:19 PM
 #613

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
December 15, 2014, 06:36:13 PM
 #614

Nice, you finally updated your fork Smiley

the good thing with my complicated relationship with github is that I update only when necessary

djm your version is the optimized one with 970-980?
actually, I think the main difference is in the compute 3.0 kernel which weren't there in that prelim. version.
It also add an adjustable throughput, which might be able to increase a little the hashrate over that version (in the limitation of memory bandwidt... ) or decrease it if it is plugged into a monitor (my gtx980 were causing some lag when they were plugged into a monitor)

The current implementation is already optimized for 9xx serie (that's the interest of the calculation using uint2 type actually) however because of the memory bandwidth limitation in the 9xx serie, the 780ti is doing better...

what number your version is doing?

with sp version i'm currently at 7800 hash with one 970(1400 core)
on x11 ? I don't know, but sp_ version should be better on x11, I haven't looked so far on x11 (the point of this version was Lyra2RE)

oh that's a new algo, a fast google search tells me that is vertcoin releated, it's something anti-asic?
it is anti-asic for now...

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
K1773R
Legendary
*
Offline Offline

Activity: 1792
Merit: 1008


/dev/null


View Profile
December 15, 2014, 08:34:30 PM
 #615

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)
tested with x11 and x13. i see a big hashrate increase, tough never any submitted shares.

[GPG Public Key]
BTC/DVC/TRC/FRC: 1K1773RbXRZVRQSSXe9N6N2MUFERvrdu6y ANC/XPM AK1773RTmRKtvbKBCrUu95UQg5iegrqyeA NMC: NK1773Rzv8b4ugmCgX789PbjewA9fL9Dy1 LTC: LKi773RBuPepQH8E6Zb1ponoCvgbU7hHmd EMC: EK1773RxUes1HX1YAGMZ1xVYBBRUCqfDoF BQC: bK1773R1APJz4yTgRkmdKQhjhiMyQpJgfN
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2954
Merit: 1087

Team Black developer


View Profile
December 15, 2014, 09:45:44 PM
 #616

I just submitted an improved BMW rewrite.

32% faster. Using UINT2 instead of Uint64_t. Added new operators for uint2 operations. SHL2 and SHR2, and minus. (in cuda_helper.h)

Around +30-40 KHASH  on x11, x13 and x15

Now BMW is spilling 0 bytes to slow memory...

I also tried to use uint2 in blake, but it was slower.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2954
Merit: 1087

Team Black developer


View Profile
December 15, 2014, 09:50:40 PM
 #617

HEAD@windows never submits any shares.
you need to use --diff 128 (if it is related to lyra2re)
tested with x11 and x13. i see a big hashrate increase, tough never any submitted shares.

Works here. Is this on linux? wich card?

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
Epsylon3
Legendary
*
Offline Offline

Activity: 1484
Merit: 1082


ccminer/cpuminer developer


View Profile WWW
December 15, 2014, 10:05:44 PM
 #618

I had this while testing keccak uint2 changes with a sm30 only binary on the 750 ti... No errors but no found nonces neither.. So i commited the optimisation differently

BTC: 1FhDPLPpw18X4srecguG3MxJYe4a1JsZnd - My Projects: ccminer - cpuminer-multi - yiimp - Forum threads : ccminer - cpuminer-multi - yiimp
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2954
Merit: 1087

Team Black developer


View Profile
December 15, 2014, 10:07:08 PM
 #619

Rewrote skein to use uint2. 10-20 KHASH on x11,x13,x15.  Added to github.

https://github.com/sp-hash/ccminer/commit/f5c7cc861fb4ed62e473437813f4f1a637e672e1

I see that this has also been done in the skein256 (lyra) but the code is different. Is the new implementation faster DJM34?

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2954
Merit: 1087

Team Black developer


View Profile
December 15, 2014, 10:08:17 PM
 #620

My version is probobly failing on sm30 cards.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 [31] 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 ... 1240 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!