Bitcoin Forum
May 03, 2024, 02:22:34 AM *
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2347498 times)
chrysophylax
Legendary
*
Offline Offline

Activity: 2814
Merit: 1091


--- ChainWorks Industries ---


View Profile WWW
February 22, 2015, 05:56:56 AM
 #1641

sp - is pluck part of this fork? ...

if not - will it be? ...

#crysx

bathrobehero
Legendary
*
Offline Offline

Activity: 2002
Merit: 1051


ICO? Not even once.


View Profile
February 23, 2015, 07:52:45 AM
 #1642

sp - is pluck part of this fork? ...

if not - will it be? ...

#crysx

Not yet but I hope soon. For the time being sgminer is much faster than ccminer for nvidia cards which pretty much screams for optimizations.

Not your keys, not your coins!
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2898
Merit: 1087

Team Black developer


View Profile
February 23, 2015, 08:07:26 AM
 #1643

KlausT has added support in his fork:

https://github.com/KlausT/ccminer

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
bathrobehero
Legendary
*
Offline Offline

Activity: 2002
Merit: 1051


ICO? Not even once.


View Profile
February 23, 2015, 09:00:35 AM
 #1644

KlausT has added support in his fork:

https://github.com/KlausT/ccminer


There's no pluck there. DJM34 made a fork which runs at around 2.3kh/s per 750 Ti while sgminer does ~3.7kh/s.

sp_ (OP)
Legendary
*
Offline Offline

Activity: 2898
Merit: 1087

Team Black developer


View Profile
February 23, 2015, 09:05:06 AM
 #1645

yes there is:

https://github.com/KlausT/ccminer/commit/fa44f730b875489e7f1c5f7e72179dc3c960e86f

But I think the code was written by DJM34

chrysophylax
Legendary
*
Offline Offline

Activity: 2814
Merit: 1091


--- ChainWorks Industries ---


View Profile WWW
February 23, 2015, 09:07:38 AM
 #1646

KlausT has added support in his fork:

https://github.com/KlausT/ccminer


There's no pluck there. DJM34 made a fork which runs at around 2.3kh/s per 750 Ti while sgminer does ~3.7kh/s.

kool ...

will get it all sorted tomorrow ...

i think the last time i tried to compile i was getting errors - and djm34 pointed out that i needed to use the latest cuda 6.5 ...

if thats the case for the sgminer / ccminer compiles - i will have a bit of work to do to build another linux machine thats more up to date than the fedora 19 x64 that i have ...

:|

#crysx

bathrobehero
Legendary
*
Offline Offline

Activity: 2002
Merit: 1051


ICO? Not even once.


View Profile
February 23, 2015, 09:09:12 AM
 #1647


Oh right, branches.

djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
February 23, 2015, 10:36:35 AM
 #1648

KlausT has added support in his fork:

https://github.com/KlausT/ccminer


There's no pluck there. DJM34 made a fork which runs at around 2.3kh/s per 750 Ti while sgminer does ~3.7kh/s.

kool ...

will get it all sorted tomorrow ...

i think the last time i tried to compile i was getting errors - and djm34 pointed out that i needed to use the latest cuda 6.5 ...

if thats the case for the sgminer / ccminer compiles - i will have a bit of work to do to build another linux machine thats more up to date than the fedora 19 x64 that i have ...

:|

#crysx
you just need to update cuda...

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
February 23, 2015, 12:50:15 PM
 #1649

sp - is pluck part of this fork? ...

if not - will it be? ...

#crysx

Not yet but I hope soon. For the time being sgminer is much faster than ccminer for nvidia cards which pretty much screams for optimizations.
That's not that obvious... The program was optimized for memory access, then transposed to OpenCL, and it seems that OpenCL does a better job with memory access than CUDA. (I found a small difference in the code between the two, and now the CUDA version runs at 2.8kh/s on the 750 Ti and 9.2kh/s on the 980, but this is still below the performance of OpenCL.)

To be honest, I would be curious to look at the PTX generated by OpenCL (if there is a command to obtain it)...

Actually, the main difference between the two is that the card runs at 40% TDP under ccminer while under sgminer it runs at 100% TDP...

PS: I think it would be interesting (but lengthy) to transpose the CUDA neoscrypt to OpenCL and check how it does on nvidia  Grin

djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
February 23, 2015, 12:56:37 PM
 #1650

Has nobody been looking into that vanilla coin?  Grin

Because there is almost nothing to code: you just need to XOR the first 256 bits of the whirlpool hash with the last 256 bits  Grin
I can't believe it stayed CPU-only for 2 months  Grin Grin (it takes roughly 2 min, or 10 if you create a new algo in the ccminer framework)
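
For readers who want to see the fold spelled out, here is a minimal sketch that follows the description in the post literally: take a 64-byte (512-bit) digest and XOR its first 32 bytes with its last 32 bytes. The function name `fold_512_to_256` and the placeholder input are made up for illustration, the Whirlpool hashing itself is not shown, and the coin's real implementation may differ in detail, so check its reference source before relying on this.

```python
def fold_512_to_256(digest: bytes) -> bytes:
    """XOR the first 32 bytes of a 64-byte digest with the last 32 bytes.

    Hypothetical helper: it only illustrates the fold described in the post.
    """
    if len(digest) != 64:
        raise ValueError("expected a 64-byte (512-bit) digest")
    head, tail = digest[:32], digest[32:]
    return bytes(a ^ b for a, b in zip(head, tail))


# Placeholder bytes standing in for a real Whirlpool digest (assumption).
example = bytes(range(64))
folded = fold_512_to_256(example)
print(folded.hex())
```

On a GPU the same step is just four 64-bit XORs per hash, which is presumably why it is described as almost no work on top of an existing whirlpool kernel.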

PVmining
Sr. Member
****
Offline Offline

Activity: 330
Merit: 252



View Profile
February 23, 2015, 04:27:10 PM
 #1651

Quote
I can't believe it stayed cpu only for 2 months...
*lol* ...but too late... amd is already there.

edit: and whirlpool seems not that slow on amd - 50mhash per r9 270
tbearhere
Legendary
*
Offline Offline

Activity: 3136
Merit: 1003



View Profile
February 23, 2015, 06:24:40 PM
Last edit: February 23, 2015, 06:41:27 PM by tbearhere
 #1652

What multiplier do we need for Qubit?
I'm only getting half of my hashrate reported at the pool... 750ti.  Thx
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2898
Merit: 1087

Team Black developer


View Profile
February 23, 2015, 07:54:19 PM
 #1653

Use release 39 without a multiplier; that should work. Mine at yaamp.com (Hamsterpool is broken).

chrysophylax
Legendary
*
Offline Offline

Activity: 2814
Merit: 1091


--- ChainWorks Industries ---


View Profile WWW
February 24, 2015, 06:46:26 AM
 #1654

Quote
I can't believe it stayed cpu only for 2 months...
*lol* ...but too late... amd is already there.

edit: and whirlpool seems not that slow on amd - 50mhash per r9 270

100MH/s+ on 270X.

wolf - we can never know whether you are quoting YOUR miner / optimizations - or the standard that is available to the public ...

so this figure you have quoted mate - yours or public? ...

#crysx

chrysophylax
Legendary
*
Offline Offline

Activity: 2814
Merit: 1091


--- ChainWorks Industries ---


View Profile WWW
February 24, 2015, 06:51:49 AM
 #1655

KlausT has added support in his fork:

https://github.com/KlausT/ccminer


There's no pluck there. DJM34 made a fork which runs at around 2.3kh/s per 750 Ti while sgminer does ~3.7kh/s.

kool ...

will get it all sorted tomorrow ...

i think the last time i tried to compile i was getting errors - and djm34 pointed out that i needed to use the latest cuda 6.5 ...

if thats the case for the sgminer / ccminer compiles - i will have a bit of work to do to build another linux machine thats more up to date than the fedora 19 x64 that i have ...

:|

#crysx
you just need to update cuda...

tanx mate ...

the cuda repo doesnt allow that update or upgrade ...

doing it manually means a crapload of work on our part for the farm - as the 'standardization' of the farm is incomplete ...

different motherboards - cpus and the like ...

im looking at an easier way of upgrading the whole farm to the latest cuda without a 'one by one' approach ...

ill build a fedora 20 x64 test machine - which will allow all this to happen ( and also finally allow testing of your neoscrypt miner ) which will make it easier to roll out when the hardware changes happen too ...

tanx again ...

#crysx

chrysophylax
Legendary
*
Offline Offline

Activity: 2814
Merit: 1091


--- ChainWorks Industries ---


View Profile WWW
February 24, 2015, 06:53:00 AM
 #1656

Quote
I can't believe it stayed cpu only for 2 months...
*lol* ...but too late... amd is already there.

edit: and whirlpool seems not that slow on amd - 50mhash per r9 270

100MH/s+ on 270X.

wolf - we can never know whether you are quoting YOUR miner / optimizations - or the standard that is available to the public ...

so this figure you have quoted mate - yours or public? ...

#crysx

Mine - he pointed out it wasn't slow on AMD; he's right.

damn ...

and how to get hold of your one? with the appropriate settings? ...

Wink

#crysx

chrysophylax
Legendary
*
Offline Offline

Activity: 2814
Merit: 1091


--- ChainWorks Industries ---


View Profile WWW
February 24, 2015, 07:14:19 AM
 #1657

Quote
I can't believe it stayed cpu only for 2 months...
*lol* ...but too late... amd is already there.

edit: and whirlpool seems not that slow on amd - 50mhash per r9 270

100MH/s+ on 270X.

wolf - we can never know whether you are quoting YOUR miner / optimizations - or the standard that is available to the public ...

so this figure you have quoted mate - yours or public? ...

#crysx

Mine - he pointed out it wasn't slow on AMD; he's right.

damn ...

and how to get hold of your one? with the appropriate settings? ...

Wink

#crysx

You know the answer to that. But, anyway, I'm working on something more epic.

The CUDA and OpenCL code for Whirlpool consists of lookups into huge tables - which sucks for the GPU; that's CPU code. Even with my current code, I've noticed beyond a certain point, it doesn't matter how high I clock, because it's stalling on memory accesses. Those tables have so got to go away.

I have gotten the reference implementation down in C - surprisingly hard, seeing as it appears there's no code anywhere for it. This consists of mostly the block cipher W that was created with Whirlpool, which is based on AES - and I know AES backwards and forwards. Small issue - it's got a 2048 byte table for the multiplication, then a 256 byte Sbox.

I took the 2048 byte table used for the multiplies and reduced it to one 8-byte table by doing them manually - then I got rid of that by inlining them as constants. The S-box I split into its parts - three S-boxes containing 16 entries of 4 bits each, and bitsliced them. Does valid hashes so far, but I have a bit further to go before it's really GPU-ready.
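
The two reductions described above can be sketched roughly as follows. This is not Wolf0's code: the reduction polynomial 0x11D, the multipliers 1, 1, 4, 1, 8, 5, 2, 9 and the three 4-bit mini-boxes (called E, its inverse, and R here) are the values given in the published Whirlpool specification as far as can be recalled, and should be double-checked against it. The names `xtime`, `mds_row` and `sbox` are invented for this sketch, and the bitslicing step is not shown.

```python
POLY = 0x11D  # assumed reduction polynomial x^8 + x^4 + x^3 + x^2 + 1


def xtime(a: int) -> int:
    """Multiply a byte by 2 in GF(2^8), reducing by POLY on overflow."""
    a <<= 1
    return a ^ POLY if a & 0x100 else a


def mds_row(s: int) -> bytes:
    """Products of s with 1, 1, 4, 1, 8, 5, 2, 9 computed by shifts and XORs.

    This replaces one 8-byte row of the 256 x 8 = 2048 byte lookup table.
    """
    s2 = xtime(s)
    s4 = xtime(s2)
    s8 = xtime(s4)
    return bytes([s, s, s4, s, s8, s4 ^ s, s2, s8 ^ s])


# 4-bit mini-boxes (assumed values from the Whirlpool specification).
E = [0x1, 0xB, 0x9, 0xC, 0xD, 0x6, 0xF, 0x3,
     0xE, 0x8, 0x7, 0x4, 0xA, 0x2, 0x5, 0x0]
R = [0x7, 0xC, 0xB, 0xD, 0xE, 0x4, 0x9, 0xF,
     0x6, 0x3, 0x8, 0xA, 0x2, 0x5, 0x1, 0x0]
E_INV = [E.index(v) for v in range(16)]  # inverse permutation of E


def sbox(x: int) -> int:
    """Rebuild the 8-bit S-box output from the three 16-entry mini-boxes."""
    hi = E[x >> 4]
    lo = E_INV[x & 0xF]
    r = R[hi ^ lo]
    return (E[hi ^ r] << 4) | E_INV[lo ^ r]


# One full table row rebuilt without any 256-entry lookup.
print(mds_row(sbox(0x00)).hex())
```

The design point is that 48 nibbles of mini-box data plus a handful of shifts and XORs replace both large tables, which trades memory lookups for arithmetic that a GPU handles much better.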

wow - so you have been a VERY busy lil wolfie then ... damn ...

so when will your expected final implementation come? ...

btw - pm for the 'you know the answer to that' situation with your idea of how that can be done ...

just trying to make the farm work THAT MUCH better - and that requires optimizations ... sooo - pm me please with what needs to be done on my end to get it organized ...

btw - the completion of the exchange from amd to nvidia is almost complete with the farm - so i can still run / test the optimizations with the gigabyte 280x oc cards left ( 16 of them currently ) ... once those are gone - the farm will be nothing but gigabyte 750ti oc lp cards ...

hence the reason for my interest in what / when / where / how / and how much ... Wink

#crysx

sp_ (OP)
Legendary
*
Offline Offline

Activity: 2898
Merit: 1087

Team Black developer


View Profile
February 24, 2015, 07:16:20 AM
 #1658

Looks like the cuda implementation of whirlpool can use 8 times less memory access by a small rewrite.


If Whirlpoolx is just whirlpool with an extra xor pass I think a lot of work is needed to get close to the Wolf0 speed.


the 750ti only does 4.4 MHASH on whirlpool. (this overview is a bit old, the latest miner is faster)
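
The post does not say what the small rewrite is. One well-known option in table-driven Whirlpool code, offered here only as a guess at the kind of change meant, is to keep a single 256-entry table of 64-bit words and derive the other seven by byte rotation, instead of storing eight rotated copies. The sketch below uses stand-in table contents (not the real Whirlpool table) and hypothetical names (`rotr64`, `row_eight_tables`, `row_one_table`) to show that both layouts give the same result.

```python
MASK64 = (1 << 64) - 1


def rotr64(v: int, n: int) -> int:
    """Rotate a 64-bit value right by n bits."""
    n %= 64
    return ((v >> n) | (v << (64 - n))) & MASK64


# Stand-in contents (assumption); a real miner would hold the Whirlpool table.
T0 = [(x * 0x9E3779B97F4A7C15) & MASK64 for x in range(256)]

# Classic layout: eight tables, each a byte-rotated copy of T0 (8 x 2048 bytes).
TABLES = [[rotr64(v, 8 * k) for v in T0] for k in range(8)]


def row_eight_tables(state_bytes: bytes) -> int:
    """Combine eight lookups, one per dedicated table."""
    acc = 0
    for k, b in enumerate(state_bytes):
        acc ^= TABLES[k][b]
    return acc


def row_one_table(state_bytes: bytes) -> int:
    """Same result from a single table plus a rotation per lookup."""
    acc = 0
    for k, b in enumerate(state_bytes):
        acc ^= rotr64(T0[b], 8 * k)
    return acc


sample = bytes([0x00, 0x18, 0x23, 0xC6, 0xE8, 0x87, 0xB8, 0x01])
print(hex(row_one_table(sample)))
```

The single-table form shrinks the lookup data from 16 KiB to 2 KiB, which makes it far easier to keep in fast on-chip memory; whether that matches the factor of 8 quoted in the post is an assumption.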


chrysophylax
Legendary
*
Offline Offline

Activity: 2814
Merit: 1091


--- ChainWorks Industries ---


View Profile WWW
February 24, 2015, 07:17:52 AM
 #1659

Looks like the cuda implementation of whirlpool can use 8 times less memory access by a small rewrite.


If Whirlpoolx is just whirlpool with an extra xor pass I think a lot of work is needed to get close to the Wolf0 speed.


the 750ti only does 4.4 MHASH on whirlpool.



would it be worth it though sp? ...

#crysx

sp_ (OP)
Legendary
*
Offline Offline

Activity: 2898
Merit: 1087

Team Black developer


View Profile
February 24, 2015, 07:19:26 AM
 #1660

would it be worth it though sp? ...
#crysx

Yes, because the same algo is used in x15 (it is the last of the hashing functions), so rewriting it will improve the x15 speed a lot.
