Bitcoin Forum
August 22, 2025, 08:02:32 PM *
News: Latest Bitcoin Core release: 29.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 ... 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 [1216] 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 »
  Print  
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2347924 times)
pallas
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 14, 2018, 12:37:04 PM
 #24301

Pushed a 20% faster shavite_final implementation to github. Another #include "cuda_x11_aes_sp.cuh" modification...

The sp-modded Optimized AES has been applied to the following hashing functions.

echo  (done)
fugue (done)
shavite (only the final function / if shavite is the last algo in the chain)
whirlpool(not done)
streborg(not done)



Streebog can only fit 2 (3) copies of the tables in shared mem, unless you can compute them on fly. Did you manage to apply the "trick" to it?

sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
September 14, 2018, 01:25:23 PM
 #24302

Streebog can only fit 2 (3) copies of the tables in shared mem, unless you can compute them on fly. Did you manage to apply the "trick" to it?

The streebog opensource implementation use 8kb of shared memory. and the pascal chip has 96kb of shared memory. But you have some limitations.  I see that the cryptodredge 0.9 is using 48kb shared memory. You can use more shared memory if you disable the level1 cache..

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
pallas
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
September 15, 2018, 06:26:12 PM
 #24303

Streebog can only fit 2 (3) copies of the tables in shared mem, unless you can compute them on fly. Did you manage to apply the "trick" to it?

The streebog opensource implementation use 8kb of shared memory. and the pascal chip has 96kb of shared memory. But you have some limitations.  I see that the cryptodredge 0.9 is using 48kb shared memory. You can use more shared memory if you disable the level1 cache..


256 X 8 X 8 = 16K
You can use up to 48k of shared mem so 2 or 3 copies.
As for pascal, "The maximum shared memory per block remains limited at 48KB as with prior architectures".

sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
September 15, 2018, 06:26:30 PM
Last edit: September 15, 2018, 06:47:45 PM by sp_
 #24304

Thanks - your contributions have made it easier for even a relative novice to make further modifications to the code and come out (slightly) ahead of enemy and trex, even before their fees.

Convert simd to use shared memory instead of a big memory buffer  (d_temp4[thr_id]) and then you get a good speedup. this has already been done in the other private miners. I might opensource later on..

here from enemy  1.20 SIMD implementation disassembly converted to ptx

enemy 1,20 is using 16kb of sharedmem

Code:
..
st.shared.u32 [%r1671], %r1666;
shfl.sync.up.b32 %r1672|%p370, %r162, %r304, %r1653, %r302;
selp.b32 %r1673, %r144, %r1672, %p92;
shfl.sync.up.b32 %r1674|%p371, %r154, %r304, %r1653, %r302;
selp.b32 %r1675, %r136, %r1674, %p92;
mul.lo.s32 %r1676, %r1673, 185;
mul.lo.s32 %r1677, %r1675, 185;
prmt.b32 %r1678, %r1676, %r1677, %r1661;
shfl.sync.idx.b32 %r1679|%p372, %r1678, %r1665, %r301, %r302;
st.shared.u32 [%r1671+4], %r1679;
shfl.sync.up.b32 %r1680|%p373, %r126, %r304, %r1653, %r302;
selp.b32 %r1681, %r108, %r1680, %p92;
shfl.sync.up.b32 %r1682|%p374, %r118, %r304, %r1653, %r302;
selp.b32 %r1683, %r100, %r1682, %p92;
mul.lo.s32 %r1684, %r1681, 185;
mul.lo.s32 %r1685, %r1683, 185;
prmt.b32 %r1686, %r1684, %r1685, %r1661;
shfl.sync.idx.b32 %r1687|%p375, %r1686, %r1665, %r301, %r302;
st.shared.u32 [%r1671+8], %r1687;
shfl.sync.up.b32 %r1688|%p376, %r1613, %r304, %r1653, %r302;
selp.b32 %r1689, %r180, %r1688, %p92;
shfl.sync.up.b32 %r1690|%p377, %r190, %r304, %r1653, %r302;
selp.b32 %r1691, %r172, %r1690, %p92;
mul.lo.s32 %r1692, %r1689, 185;
mul.lo.s32 %r1693, %r1691, 185;
prmt.b32 %r1694, %r1692, %r1693, %r1661;
shfl.sync.idx.b32 %r1695|%p378, %r1694, %r1665, %r301, %r302;
st.shared.u32 [%r1671+12], %r1695;
shfl.sync.up.b32 %r1696|%p379, %r91, %r304, %r1653, %r302;
selp.b32 %r1697, %r73, %r1696, %p92;
shfl.sync.up.b32 %r1698|%p380, %r3433, %r304, %r1653, %r302;
selp.b32 %r1699, %r3432, %r1698, %p92;
mul.lo.s32 %r1700, %r1697, 185;
mul.lo.s32 %r1701, %r1699, 185;
prmt.b32 %r1702, %r1700, %r1701, %r1661;
ld.const.u8 %r1703, [%r1664+8];
shfl.sync.idx.b32 %r1704|%p381, %r1702, %r1703, %r301, %r302;
st.shared.u32 [%r1671+128], %r1704;
...

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
September 15, 2018, 06:34:47 PM
 #24305

"The maximum shared memory per block remains limited at 48KB as with prior architectures".

So you need 2 blocks.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
bensam1231
Legendary
*
Offline Offline

Activity: 1848
Merit: 1024


View Profile
September 16, 2018, 02:35:45 AM
 #24306

If coin devs give two shits about the actual mining community (IE gpu miners), they really need to be more proactive with protecting their coins against ASICs and especially FPGAs. Changing things up on a regular schedule including forking at the first signs of ASICs to a new algo. There are more then enough GPU developers to keep up with evolving algos, especially after this year. More advanced ASICs are functioning more and more like GPUs (and FPGAs) to fill in the gap with evolving algos as well. Keep it fresh if you want to keep your community around. Not everyone is a BTC or ETH.


X16r/X16s/C11/x17 Spmod-git #11 has been released

- Added support for X17 (+10-15% faster than the ccminer alexis 1.0 opensource fork) (24-25MHASH on the 1080ti)
- Faster x16r/x16s +3-5% more than sp-mod #10 (Optimalizations added to Fugue, whirlpool)

this miner is free, opensource, and have no fee

cuda 9.2 builds:

https://github.com/sp-hash/suprminer/releases

sourcecode:

https://github.com/sp-hash/suprminer/commits/master

Your free product will not allow t-rex, Cryptodredge and Enemy sleep. Definitely we will see their new versions in a few days.

From what I've seen based on SPs updates, their miners (especially enemy) is updated about a day later with changes SP makes. Hence why open software doesn't go anywhere.

I buy private Nvidia miners. Send information and/or inquiries to my PM box.
Mr.Spider703
Member
**
Offline Offline

Activity: 392
Merit: 27

http://radio.r41.ru


View Profile WWW
September 16, 2018, 02:52:50 AM
 #24307

with changes SP makes
if others took work SP_ and used it - the income on coins at SP_ would be higher. at the moment, SP_ mainers in the console draws well, but the income of coins loses to everyone else
abudfv2008
Full Member
***
Offline Offline

Activity: 728
Merit: 106


View Profile
September 16, 2018, 10:32:17 AM
 #24308

From what I've seen based on SPs updates, their miners (especially enemy) is updated about a day later with changes SP makes. Hence why open software doesn't go anywhere.
Enemy adopt sp_ "improvements" and release his miner that is faster than sp_ slow miner with all those "improvements".
Nice logic.  Shocked

For a person with normal logic it is evident that sp_ can't compete with z-enemy/t-rex miners and he make his "improvements" based on what z/t already implemented. That's why this miner is not sold as others - noone will by slow miner. That's why in this case sp_ decided to be a "knight in white clothes" and release slow but opensource miner.  Grin

P.S. But anyway - it is good work from some point of view.
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
September 16, 2018, 10:41:25 AM
Last edit: September 16, 2018, 10:53:06 AM by sp_
 #24309

I don't earn any money from my opensource work, but my private bins are getting faster and faster.
X16R/X17 has improved to become +30% faster in a few months thanks to optimalizations. I made the 27MHASH x17 miner first..

Here is a post from August 8th. When enemy 1.14 and t-rex 5.7 was "the fastest"

Still 20% slower than Enemy?
My private is around 20% faster than (enemy 1.14)/(t-rex 5.7) in x16r/x16s/x17  27 MHASH x17 1080ti, (2BTC)


Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
abudfv2008
Full Member
***
Offline Offline

Activity: 728
Merit: 106


View Profile
September 16, 2018, 11:03:17 AM
 #24310

I don't earn any money from my opensource work, but my private bins are getting faster and faster.
X16R/X17 has improved to become +30% faster in a few months thanks to optimalizations. I made the 27MHASH x17 miner first..

Here is a post from August 8th. When enemy 1.14 and t-rex 5.7 was "the fastest"

Still 20% slower than Enemy?
My private is around 20% faster than (enemy 1.14)/(t-rex 5.7) in x16r/x16s/x17  27 MHASH x17 1080ti, (2BTC)
We all know how you "measure".  Grin 27Mh was "on paper".  In reality maybe 23-24 was achieved.
Mr.Spider703
Member
**
Offline Offline

Activity: 392
Merit: 27

http://radio.r41.ru


View Profile WWW
September 16, 2018, 11:15:33 AM
 #24311

I don't earn any money from my opensource work, but my private bins are getting faster and faster.
X16R/X17 has improved to become +30% faster in a few months thanks to optimalizations. I made the 27MHASH x17 miner first..

Here is a post from August 8th. When enemy 1.14 and t-rex 5.7 was "the fastest"

Still 20% slower than Enemy?
My private is around 20% faster than (enemy 1.14)/(t-rex 5.7) in x16r/x16s/x17  27 MHASH x17 1080ti, (2BTC)
We all know how you "measure".  Grin 27Mh was "on paper".  In reality maybe 23-24 was achieved.
but for free
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
September 16, 2018, 11:38:29 AM
Last edit: September 16, 2018, 12:02:42 PM by sp_
 #24312

but for free

I have extracted all the kernels from the other devs and can soon publish a ptx opensource miner.. 30MHASH x17(asus 1080ti 90% tdp, +130 core +600 mem). 0.8% dev fee. Kernels made by t-rex/enemy/sp-mod. Thanks for the free work guys..

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
abudfv2008
Full Member
***
Offline Offline

Activity: 728
Merit: 106


View Profile
September 16, 2018, 11:46:17 AM
 #24313

I don't earn any money from my opensource work, but my private bins are getting faster and faster.
X16R/X17 has improved to become +30% faster in a few months thanks to optimalizations. I made the 27MHASH x17 miner first..

Here is a post from August 8th. When enemy 1.14 and t-rex 5.7 was "the fastest"

Still 20% slower than Enemy?
My private is around 20% faster than (enemy 1.14)/(t-rex 5.7) in x16r/x16s/x17  27 MHASH x17 1080ti, (2BTC)
We all know how you "measure".  Grin 27Mh was "on paper".  In reality maybe 23-24 was achieved.
but for free
What for free? Sp_ is talking about private miner. So 0.05BTC + dev fee that is mined at every start. So after each crash another devfee that is exactly in sp_ manner.
@enemy
Member
**
Offline Offline

Activity: 81
Merit: 16


View Profile
September 16, 2018, 12:03:13 PM
 #24314

but for free

I have extracted all the kernels from the other devs and can soon publish a ptx opensource miner.. 30MHASH x17(asus 1080ti 90% tdp, +130 core +600 mem). 0.8% dev fee. Kernels made by t-rex/enemy/sp-mod. Thanks for the free work guys..

Do not forget to rename everything - _sp and in a month write that we steal your code from you  Grin

RVN: RFQXKVKpHMwyJ86YqQUJSZK1S8m8oRbC5h
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
September 16, 2018, 12:17:07 PM
 #24315

_sp and in a month write that we steal your code from you  Grin

I don't steal code. I plan use your free code in you "free" miner and release the sourcecode with it. (the Ptx assembly code). Then  I won't violate the GPL licence you are violating. My miner will be faster,opensource, free and with a smaller devfee of 0.8%

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
COBOK
Jr. Member
*
Offline Offline

Activity: 39
Merit: 1


View Profile
September 17, 2018, 11:33:10 AM
 #24316

but for free

I have extracted all the kernels from the other devs and can soon publish a ptx opensource miner.. 30MHASH x17(asus 1080ti 90% tdp, +130 core +600 mem). 0.8% dev fee. Kernels made by t-rex/enemy/sp-mod. Thanks for the free work guys..

 Shocked Shocked Shocked you now confessed that you are stealing code from other developers, it's very bad
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
September 17, 2018, 03:46:48 PM
 #24317

I have found evidence of my gpl code in their work. The sourcecode is provided for free in the public exefiles released by team cryptodregde/team enemy/team-t-rex (assembly language). My miner will be opensource and under GPL. The ptx kernels will be compiled runtime and executed by my program. If they release a faster miner, I will extract the new ptx code to make sure that my opensource miner always is the fastest.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
abudfv2008
Full Member
***
Offline Offline

Activity: 728
Merit: 106


View Profile
September 17, 2018, 04:03:07 PM
 #24318

but for free

I have extracted all the kernels from the other devs and can soon publish a ptx opensource miner.. 30MHASH x17(asus 1080ti 90% tdp, +130 core +600 mem). 0.8% dev fee. Kernels made by t-rex/enemy/sp-mod. Thanks for the free work guys..

 Shocked Shocked Shocked you now confessed that you are stealing code from other developers, it's very bad
More than that - he confessed that he is incapable to make faster miner due to lack of skills. And the only option he has left is to steal kernels from cd/t-rex/enemy.
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
September 17, 2018, 05:18:16 PM
 #24319

kernels from cd/t-rex/enemy.

The best from cd/t-rex/enemy kernels is only giving 28MHASH on the 1080ti (x17). So I add the profitable percent. The +10% the other guys can't copy..

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
abudfv2008
Full Member
***
Offline Offline

Activity: 728
Merit: 106


View Profile
September 17, 2018, 05:25:26 PM
 #24320

kernels from cd/t-rex/enemy.
The best from cd/t-rex/enemy kernels is only giving 28MHASH on the 1080ti (x17). So I add the profitable percent. The +10% the other guys can't copy..
blablabla. Do first than talk. Who knows - maybe when you will accomplish adadtation of stolen kernels then z/t will do 35Mh. Whats the problem to write code? Lack of knowledge and abilities, isn't it?
For the moment - your miner is the slowest, far behind z/t.
Pity for you, but your time has passed. There is no alexis78 to steal code from Grin Stealing kernels won't help you - it's a dead end.  Grin
Pages: « 1 ... 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187 1188 1189 1190 1191 1192 1193 1194 1195 1196 1197 1198 1199 1200 1201 1202 1203 1204 1205 1206 1207 1208 1209 1210 1211 1212 1213 1214 1215 [1216] 1217 1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!