CCminer(SP-MOD) Modded GPU kernels.

sp_ (OP)

Legendary

Offline

Activity: 2926
Merit: 1087

Team Black developer

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 09:05:06 AM

#1641

yes there is:

https://github.com/KlausT/ccminer/commit/fa44f730b875489e7f1c5f7e72179dc3c960e86f

But I think the code is made by DJM34

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + triple mining): https://github.com/sp-hash/TeamBlackMiner

chrysophylax

Legendary

Offline

Activity: 3136
Merit: 1093

--- ChainWorks Industries ---

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 09:07:38 AM

#1642

Quote from: bathrobehero on February 23, 2015, 09:00:35 AM

Quote from: sp_ on February 23, 2015, 08:07:26 AM

KlausT has added support in his fork:

https://github.com/KlausT/ccminer

There's no pluck there. DJM34 made a fork which runs at around 2.3kh/s per 750 Ti while sgminer does ~3.7kh/s.

kool ...

will get it all sorted tomorrow ...

i think the last time i tried to compile i was getting errors - and djm34 pointed out that i needed to use the latest cuda 6.5 ...

if that's the case for the sgminer / ccminer compiles - i will have a bit of work to do to build another linux machine that's more up to date than the fedora 19 x64 that i have ...

:|

#crysx

CWI-Thread (theFORUM) - https://bitcointalk.org/index.php?topic=1563601 . CWI-WebSite (theSITE) - https://chainworksindustries.com/ . CWI-Shop (theSHOP) - https://chainworksindustries.com/theSHOP.html .

bathrobehero

Legendary

Offline

Activity: 2002
Merit: 1051

ICO? Not even once.

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 09:09:12 AM

#1643

Quote from: sp_ on February 23, 2015, 09:05:06 AM

yes there is:

https://github.com/KlausT/ccminer/commit/fa44f730b875489e7f1c5f7e72179dc3c960e86f

But I think the code is made by DJM34

Oh right, branches.

Not your keys, not your coins!

djm34

Legendary

Offline

Activity: 1400
Merit: 1050

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 10:36:35 AM

#1644

Quote from: chrysophylax on February 23, 2015, 09:07:38 AM

Quote from: bathrobehero on February 23, 2015, 09:00:35 AM

Quote from: sp_ on February 23, 2015, 08:07:26 AM

KlausT has added support in his fork:

https://github.com/KlausT/ccminer

There's no pluck there. DJM34 made a fork which runs at around 2.3kh/s per 750 Ti while sgminer does ~3.7kh/s.

you just need to update cuda...

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw

djm34

Legendary

Offline

Activity: 1400
Merit: 1050

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 12:50:15 PM

#1645

Quote from: bathrobehero on February 23, 2015, 07:52:45 AM

Quote from: chrysophylax on February 22, 2015, 05:56:56 AM

sp - is pluck part of this fork? ...

if not - will it be? ...

#crysx

Not yet but I hope soon. For the time being sgminer is much faster than ccminer for nvidia cards which pretty much screams for optimizations.

That's not that obvious... The program was optimized for memory access, then ported to opencl, and it seems that opencl does a better job with memory access than cuda. (I found a small difference in the code between the two, and now the cuda version runs at 2.8 kH/s on the 750 Ti and 9.2 kH/s on the 980, but this is still below the performance of opencl.)

To be honest, I would be curious to look at the ptx generated by opencl (if there is a command to obtain it)...

Actually, the main difference between the two is that with the cuda version the card runs at 40% TDP, while with sgminer it runs at 100% TDP...

ps: I think it would be interesting (but lengthy) to port the cuda neoscrypt to opencl and check how it does on nvidia :D


djm34

Legendary

Offline

Activity: 1400
Merit: 1050

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 12:56:37 PM

#1646

Has nobody been looking into that vanilla coin? :D

Because there is almost nothing to code: you just need to xor the first 256 bits of the whirlpool hash with the last 256 bits :D

I can't believe it stayed cpu only for 2 months :D

(it takes roughly 2 min, and 10 if you create a new algo in the ccminer framework)
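
The fold described in this post could be sketched in plain C roughly as below. This is only an illustration of "xor the first 256 bits with the last 256 bits": the helper name `fold_whirlpool_digest` is made up for this example, the 64-byte Whirlpool digest is assumed to be computed elsewhere, and nothing here is taken from the coin's actual source.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not from ccminer or the coin's source):
 * fold a 64-byte Whirlpool digest into 32 bytes by XORing the
 * first 256 bits with the last 256 bits. The digest itself is
 * assumed to have been computed by an existing Whirlpool routine. */
static void fold_whirlpool_digest(const uint8_t digest[64], uint8_t out[32])
{
    for (size_t i = 0; i < 32; i++) {
        out[i] = digest[i] ^ digest[i + 32];
    }
}
```

On a GPU the same fold would be a handful of 64-bit XORs on the final state words, which is why the extra pass adds almost no cost on top of the Whirlpool hash itself.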


PVmining

Sr. Member

Offline

Activity: 330
Merit: 252

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 04:27:10 PM

#1647

Quote

I can't believe it stayed cpu only for 2 months...

*lol* ...but too late... amd is already there.

edit: and whirlpool seems not that slow on amd - 50 MH/s per R9 270

tbearhere

Legendary

Offline

Activity: 3360
Merit: 1003

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 06:24:40 PM
Last edit: February 23, 2015, 06:41:27 PM by tbearhere

#1648

What multiplier do we need for Qubit?
I'm only getting half my hashrate reported at the pool (750 Ti). Thx

sp_ (OP)

Legendary

Offline

Activity: 2926
Merit: 1087

Team Black developer

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 23, 2015, 07:54:19 PM

#1649

Use release 39 without a multiplier; that should work. Mine at yaamp.com (Hamsterpool is broken).


chrysophylax

Legendary

Offline

Activity: 3136
Merit: 1093

--- ChainWorks Industries ---

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 06:46:26 AM

#1650

Quote from: Wolf0 on February 24, 2015, 06:44:27 AM

Quote from: PVmining on February 23, 2015, 04:27:10 PM

Quote

I can't believe it stayed cpu only for 2 months...

*lol* ...but too late... amd is already there.

edit: and whirlpool seems not that slow on amd - 50 MH/s per R9 270

100MH/s+ on 270X.

wolf - we can never know whether you are quoting YOUR miner / optimizations - or the standard that is available to the public ...

so this figure you have quoted mate - yours or public? ...

#crysx

chrysophylax

Legendary

Offline

Activity: 3136
Merit: 1093

--- ChainWorks Industries ---

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 06:51:49 AM

#1651

Quote from: djm34 on February 23, 2015, 10:36:35 AM

Quote from: chrysophylax on February 23, 2015, 09:07:38 AM

Quote from: bathrobehero on February 23, 2015, 09:00:35 AM

Quote from: sp_ on February 23, 2015, 08:07:26 AM

KlausT has added support in his fork:

https://github.com/KlausT/ccminer

There's no pluck there. DJM34 made a fork which runs at around 2.3kh/s per 750 Ti while sgminer does ~3.7kh/s.

you just need to update cuda...

tanx mate ...

the cuda repo doesn't allow that update or upgrade ...

doing it manually means a crapload of work on our part for the farm - as the 'standardization' of the farm is incomplete ...

different motherboards - cpus and the like ...

i'm looking at an easier way of upgrading the whole farm to the latest cuda without a 'one by one' approach ...

i'll build a fedora 20 x64 test machine - which will allow all this to happen ( and also finally allow testing of your neoscrypt miner ) which will make it easier to roll out when the hardware changes happen too ...

tanx again ...

#crysx

chrysophylax

Legendary

Offline

Activity: 3136
Merit: 1093

--- ChainWorks Industries ---

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 06:53:00 AM

#1652

Quote from: Wolf0 on February 24, 2015, 06:48:59 AM

Quote from: chrysophylax on February 24, 2015, 06:46:26 AM

Quote from: Wolf0 on February 24, 2015, 06:44:27 AM

Quote from: PVmining on February 23, 2015, 04:27:10 PM

Quote

I can't believe it stayed cpu only for 2 months...

*lol* ...but too late... amd is already there.

edit: and whirlpool seems not that slow on amd - 50 MH/s per R9 270

100MH/s+ on 270X.

wolf - we can never know whether you are quoting YOUR miner / optimizations - or the standard that is available to the public ...

so this figure you have quoted mate - yours or public? ...

#crysx

Mine - he pointed out it wasn't slow on AMD; he's right.

damn ...

and how to get hold of your one? with the appropriate settings? ...

;)

#crysx

chrysophylax

Legendary

Offline

Activity: 3136
Merit: 1093

--- ChainWorks Industries ---

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 07:14:19 AM

#1653

Quote from: Wolf0 on February 24, 2015, 07:03:31 AM

Quote from: chrysophylax on February 24, 2015, 06:53:00 AM

Quote from: Wolf0 on February 24, 2015, 06:48:59 AM

Quote from: chrysophylax on February 24, 2015, 06:46:26 AM

Quote from: Wolf0 on February 24, 2015, 06:44:27 AM

Quote from: PVmining on February 23, 2015, 04:27:10 PM

Quote

I can't believe it stayed cpu only for 2 months...

*lol* ...but too late... amd is already there.

edit: and whirlpool seems not that slow on amd - 50 MH/s per R9 270

100MH/s+ on 270X.

wolf - we can never know whether you are quoting YOUR miner / optimizations - or the standard that is available to the public ...

so this figure you have quoted mate - yours or public? ...

#crysx

Mine - he pointed out it wasn't slow on AMD; he's right.

damn ...

and how to get hold of your one? with the appropriate settings? ...

;)

#crysx

You know the answer to that. But, anyway, I'm working on something more epic.

The CUDA and OpenCL code for Whirlpool consists of lookups into huge tables - which sucks for the GPU; that's CPU code. Even with my current code, I've noticed beyond a certain point, it doesn't matter how high I clock, because it's stalling on memory accesses. Those tables have so got to go away.

I have gotten the reference implementation down in C - surprisingly hard, seeing as it appears there's no code anywhere for it. This consists of mostly the block cipher W that was created with Whirlpool, which is based on AES - and I know AES backwards and forwards. Small issue - it's got a 2048 byte table for the multiplication, then a 256 byte Sbox.

I took the 2048 byte table used for the multiplies and reduced it to one 8-byte table by doing them manually - then I got rid of that by inlining them as constants. The S-box I split into its parts - three S-boxes containing 16 entries of 4 bits each, and bitsliced them. Does valid hashes so far, but I have a bit further to go before it's really GPU-ready.
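
As a rough illustration of the "do the multiplies manually" idea in the quote above, here is a minimal C sketch of table-free multiplication in GF(2^8). It is not Wolf0's code. It assumes the Whirlpool reduction polynomial x^8 + x^4 + x^3 + x^2 + 1 (0x11D); the function names `gf_xtime` and `gf_mul` are invented for this example, and the bitsliced S-box is not shown.

```c
#include <stdint.h>

/* Multiply by x (i.e. by 2) in GF(2^8), reducing modulo 0x11D.
 * If the top bit is set, the shifted-out x^8 term is replaced
 * by x^4 + x^3 + x^2 + 1 (0x1D). */
static uint8_t gf_xtime(uint8_t a)
{
    return (uint8_t)((a << 1) ^ ((a & 0x80) ? 0x1D : 0x00));
}

/* Shift-and-add multiply in GF(2^8). With a small constant b
 * (as in a diffusion matrix with small entries) the loop runs
 * only a few iterations and can be fully unrolled into XORs
 * and gf_xtime calls, so no lookup table is needed. */
static uint8_t gf_mul(uint8_t a, uint8_t b)
{
    uint8_t r = 0;
    while (b) {
        if (b & 1) {
            r ^= a;
        }
        a = gf_xtime(a);
        b >>= 1;
    }
    return r;
}
```

For a fixed constant such as 9, `gf_mul(a, 9)` reduces to `gf_xtime(gf_xtime(gf_xtime(a))) ^ a`, which is the kind of inlined constant arithmetic that trades memory lookups for a few ALU instructions.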

wow - so you have been a VERY busy lil wolfie then ... damn ...

so when do you expect the final implementation to come? ...

btw - pm me for the 'you know the answer to that' situation with your idea of how that can be done ...

just trying to make the farm work THAT MUCH better - and that requires optimizations ... sooo - pm me please with what needs to be done on my end to get it organized ...

btw - the exchange from amd to nvidia is almost complete across the farm - so i can still run / test the optimizations with the gigabyte 280x oc cards left ( 16 of them currently ) ... once those are gone - the farm will be nothing but gigabyte 750ti oc lp cards ...

hence the reason for my interest in what / when / where / how / and how much ... ;)

#crysx

sp_ (OP)

Legendary

Offline

Activity: 2926
Merit: 1087

Team Black developer

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 07:16:20 AM

#1654

Looks like the cuda implementation of whirlpool could use 8 times fewer memory accesses after a small rewrite.

If Whirlpoolx is just whirlpool with an extra xor pass, I think a lot of work is needed to get close to the Wolf0 speed.

The 750 Ti only does 4.4 MH/s on whirlpool. (This overview is a bit old; the latest miner is faster.)


chrysophylax

Legendary

Offline

Activity: 3136
Merit: 1093

--- ChainWorks Industries ---

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 07:17:52 AM

#1655

Quote from: sp_ on February 24, 2015, 07:16:20 AM

would it be worth it though sp? ...

#crysx

sp_ (OP)

Legendary

Offline

Activity: 2926
Merit: 1087

Team Black developer

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 07:19:26 AM

#1656

Quote from: chrysophylax on February 24, 2015, 07:17:52 AM

would it be worth it though sp? ...
#crysx

Yes, because the same algo is used in x15 (it is the last of the hashing functions), so rewriting it will improve the x15 speed a lot.


sp_ (OP)

Legendary

Offline

Activity: 2926
Merit: 1087

Team Black developer

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 07:35:42 AM

#1657

Quote from: Wolf0 on February 24, 2015, 07:03:31 AM

The CUDA and OpenCL code for Whirlpool consists of lookups into huge tables - which sucks for the GPU;

The lookup is done in shared memory and takes 1 cycle, but the internal RISC cpu needs 4 instructions to do the lookup (byteperm/add/shift/move).
With the BFINS instruction and aligned memory buffers this can be reduced to 2 instructions, although I failed to implement it in my first attempt (AES).
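
To make the cost being discussed concrete, here is a plain C sketch of what a single byte-indexed table lookup has to do: isolate one byte of a state word, turn it into an index, and load the entry. The helper name `lookup_byte` and the 256-entry 64-bit table layout are assumptions for illustration only; how the compiler maps this onto actual Maxwell instructions is not shown here.

```c
#include <stdint.h>

/* Hypothetical helper: one lookup of the kind used in table-based
 * Whirlpool/AES rounds. Byte k (0 = least significant) of `word`
 * selects one of 256 table entries. */
static uint64_t lookup_byte(const uint64_t table[256], uint64_t word, unsigned k)
{
    /* shift the wanted byte down and mask it off */
    unsigned index = (unsigned)(word >> (8u * k)) & 0xFFu;
    /* index scaling and address add are implicit in the array access */
    return table[index];
}
```

Every round of a table-based implementation repeats this per state byte, so saving even one or two instructions per lookup adds up quickly.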


tbearhere

Legendary

Offline

Activity: 3360
Merit: 1003

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 09:08:12 AM

#1658

Need help with DGB coin (Qubit algo). I think Theblocksfactory needs a setting that neither I nor the pool owner understands; all other pools on this algorithm work fine.
On the Qubit algo, before #33 we needed -f 236 and now we don't. On this pool I have not gotten ccminer to work properly in the last 6 months. With the older versions I needed to restart the program every 60 seconds to get the pool to show my true hashrate. With #39 (no -f 236 needed) it works fine, except the pool only accepts exactly 1/2 of my hashrate. I think it's a setting the pool owner needs to change. Again, I tried this on other pools and it works fine. Any thoughts on this, please? ps: The other pools have so little hashrate that they only hit a block once in a while.
Thx

bathrobehero

Legendary

Offline

Activity: 2002
Merit: 1051

ICO? Not even once.

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 12:06:41 PM

#1659

Quote from: tbearhere on February 24, 2015, 09:08:12 AM

If a pool is showing half your hashrate, chances are you're doing twice the expected work, so doubling your difficulty divide factor (--diff or -f) is probably what's missing. The default is 1, so you should try 2. Conversely, if the pool only accepts half the shares, then you're sending smaller chunks of work than the pool expects, in which case halving the factor helps (-f 0.5). If there are still rejected shares, try lowering the value to something like -f 0.0078125 (1/128) or -f 0.00390625 (1/256) to offset the default 128/256 multipliers, while checking the pool's reported hashrate.
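
The arithmetic behind those factors can be sketched in a few lines of C. Both helper names are hypothetical and only encode the rule of thumb from this post (scale the factor by the ratio of local to pool-reported hashrate, or use the reciprocal of a fixed multiplier); whether a given pool and miner build actually behave this way is an assumption to check against the pool's reported hashrate.

```c
/* Hypothetical helper: the -f value that cancels a fixed
 * share-difficulty multiplier is its reciprocal,
 * e.g. 128 -> 0.0078125 and 256 -> 0.00390625. */
static double offset_factor(double multiplier)
{
    return 1.0 / multiplier;
}

/* Hypothetical helper: next -f value to try, following the rule of
 * thumb above. If the pool reports half of the local hashrate, the
 * factor is doubled; if it reports double, the factor is halved. */
static double next_factor(double current_factor,
                          double local_hashrate,
                          double pool_hashrate)
{
    return current_factor * (local_hashrate / pool_hashrate);
}
```

For example, with the default factor of 1, a local rate of 3.0 MH/s and a pool-reported 1.5 MH/s, `next_factor` suggests trying -f 2.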


sp_ (OP)

Legendary

Offline

Activity: 2926
Merit: 1087

Team Black developer

Re: 10MHASH CCminer modded NVIDIA Maxwell kernals by SP.

February 24, 2015, 12:26:32 PM

#1660

Quote from: Wolf0 on February 24, 2015, 07:37:54 AM

Quote from: sp_ on February 24, 2015, 07:35:42 AM

Quote from: Wolf0 on February 24, 2015, 07:03:31 AM

The CUDA and OpenCL code for Whirlpool consists of lookups into huge tables - which sucks for the GPU;

I haven't done CUDA in quite a while, but here's a tip about AMD - using fucktons of LDS is bad for you. It reduces the waves in flight - more waves in flight usually mean more performance, up to a point.

Maxwell can issue 2 instructions per clock cycle, but only one per cycle when the instruction uses shared/const memory. Normal superscalar design. That's why I normally move constants into the instruction cache. I just need to make sure that the code size fits the cache.

