Bitcoin Forum
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2347498 times)
sp_ (OP)
September 05, 2015, 12:04:43 PM
 #5621

I reverted the jh512 change and the rotr change.

That is suspect: quark jh and x11 bmw were redefining rotr32 with the original sph code, while all the rest use the cuda_helper version, which has CUDA-specific code.
If the CUDA-specific code is slower, then we should replace the rotr32 definition in cuda_helper, because all the other files will benefit from it.
But I highly doubt that's the case: cuda_helper has been worked on for months and it can't be slower than the sph code, unless the compiler is so smart that it can detect it's a rotate and replace it with even more optimized code.

In brief, I wouldn't put the double definitions back (it's a warning...) but would adjust the cuda_helper version instead, so we have the best rotr32 on all algos.

The compiler is smart. It converts

#define ROTL32c(x, n) (((x) << (n)) | ((x) >> (32 - (n))))

into a single assembly instruction, the funnel shift. Take a look at the generated .ptx code in the release folder.


But in the lyra2v2 algo I now get +50KHASH on the gtx 970, so sometimes it's faster to use the method in cuda_helper.
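
For readers following the rotate discussion, here is a minimal host-side C sketch of the shift-and-OR idiom. It is not the ccminer code: the names rotl32 and rotr32 are illustrative, and masking the shift counts is an addition so that a count of 0 does not shift by 32, which C leaves undefined. Mainstream compilers generally recognise this pattern and emit a single rotate; on CUDA targets that support it, the counterpart is the funnel shift (also exposed as the __funnelshift_l intrinsic), which is presumably what appears in the .ptx.

```c
#include <stdint.h>

/* Rotate x left by n bits (n is taken modulo 32).
   Masking both shift counts keeps n == 0 well defined. */
static inline uint32_t rotl32(uint32_t x, unsigned n)
{
    return (x << (n & 31u)) | (x >> ((32u - n) & 31u));
}

/* Rotate x right by n bits (n is taken modulo 32). */
static inline uint32_t rotr32(uint32_t x, unsigned n)
{
    return (x >> (n & 31u)) | (x << ((32u - n) & 31u));
}
```

For example, rotl32(0x80000001u, 1) yields 0x00000003u, and rotr32 undoes it.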

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
September 05, 2015, 12:10:42 PM
 #5622

I think it's because the compiler is unable to precalculate and remove instructions that operate on constant data when we use inline asm, e.g. rotl(const, x).
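
A small host-side C illustration of the point about constants, under the assumption that nvcc treats plain C expressions much as other compilers do. When the rotate is written as ordinary shifts and both operands are constants, the whole expression is an integer constant expression and can be evaluated at compile time. An inline asm statement is opaque to the constant folder, so the same fold is generally not available there. The macro name ROTL32C, the constant K0 and the function mix are placeholders invented for this sketch.

```c
#include <stdint.h>

/* Plain C rotate; only valid for 0 < n < 32. */
#define ROTL32C(x, n) (((x) << (n)) | ((x) >> (32 - (n))))

/* Hypothetical round constant, chosen only for the example. */
#define K0 0x80000001u

/* Both operands are constants, so this is checked at compile time
   and the assertion itself generates no code. */
_Static_assert(ROTL32C(K0, 1) == 0x00000003u,
               "constant rotate is folded at compile time");

/* Mixes a rotated constant into a state word.
   ROTL32C(K0, 1) is a constant expression here. */
static inline uint32_t mix(uint32_t state)
{
    return state ^ ROTL32C(K0, 1);
}
```

If the rotate were hidden inside an asm statement instead, the compiler would have to emit the instruction and run it for every call, even though the result never changes.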

pallas
September 05, 2015, 12:16:12 PM
 #5623

with the current commit (reverting quark_jh and quark_skein but not groestl) quark is a bit faster... that's puzzling :-)

"in the lyra2v2 algo I now get +50KHASH on the gtx 970"
but lyra2v2 is not using quark/, Algo256/ or groestl... still puzzled ;-)

sp_ (OP)
September 05, 2015, 12:21:21 PM
 #5624

Yes, but you also changed the rotl32 in blake256, and it seems to be a little better there, so I kept it.

pallas
September 05, 2015, 12:42:25 PM
 #5625

I see. But that means that x11 (and others based on the same file) should also be faster :-)

GingerAle
September 05, 2015, 12:44:45 PM
 #5626

For those that follow this thread in the hopes that some improvements will happen to Monero (i.e. cryptonight) nvidia mining, the Monero Forum Funding system for djm34's proposed work is up and running:

https://bitcointalk.org/index.php?topic=656841.msg12334860#msg12334860

Well, that takes you to a link to the link; here's a direct one:

https://forum.getmonero.org/8/funding-required/2355/improved-nvidia-mining-software

Sorry for the off topic. This thread has become the hotspot for this stuff though.

sp_ (OP)
September 05, 2015, 12:50:49 PM
 #5627

"I see. But that means that x11 (and others based on the same file) should also be faster :-)"

x11 uses blake512.

The blake256 algos are neoscrypt, lyra, and lyra2.

sp_ (OP)
September 05, 2015, 12:51:30 PM
 #5628

Submitted a 50KHASH speedup in x11 on the 980 Ti, using code from the klaus_t fork and constant rotates.

t-nelson
September 05, 2015, 05:19:55 PM
 #5629


-- SNIP --

needs to include cuda_vector.h instead of cuda_helper.h

I didn't modify the includes.
Furthermore, it compiles fine on Linux...
I know the build files on windows have been enhanced recently, maybe it's related to that.
I can't help because I never built on win.

The header deps are a mess.  Odds are it was pulled in by some other header on Linux and you got (un?)lucky.

BTC:   1K4yxRwZB8DpFfCgeJnFinSqeU23dQFEMu
DASH: XcRSCstQpLn8rgEyS6yH4Kcma4PfcGSJxe
t-nelson
September 05, 2015, 06:08:06 PM
 #5630

"Submitted a 50KHASH speedup in x11 on the 980 Ti, using code from the klaus_t fork and constant rotates."

Seems on par with release 65 on the 750 Ti and 960. At least no noticeable regression.

scryptr
September 05, 2015, 10:51:39 PM
 #5631

GIMP ON YIIMP--

For the last couple of hours I mined MUE coin (Monetary Unit) on YiiMP (yiimp.ccminer.org). MUE is a quark algorithm coin and, according to CoinWarz, currently capable of producing ~200-400 dollars a day in mining revenue. There must be a decimal place or two missing in the calculation! :-) I mined over 1000 MUE in the time that I spent on YiiMP, but they are currently worth about 15 satoshis each. I will be earning about 0.00015 BTC on the exchange when my MUE coins mature, I think. Not so sure! :-)

Every block that I hit, I got all the coins save the pool share. Somehow, CoinWarz led me to believe that I'd be earning 100X more, at least.  I solved ~40 blocks at 40 MUE coins per block.  Because YiiMP is more or less a hobby/experimental pool, it was as if I was solomining.  There was no other miner to share the blocks with me, just the 1% pool share to deduct.
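
As a rough check of the figures in this post (my arithmetic, not the poster's): ~40 blocks at 40 MUE each is about 1600 MUE before the 1% pool share, and at about 15 satoshis per coin the "over 1000 MUE" mentioned above comes to roughly 0.00015 BTC. The function names below are made up for the illustration, and all inputs are the approximate values quoted in the post.

```c
#include <stdint.h>

/* 1 BTC = 100,000,000 satoshis. */
#define SATOSHI_PER_BTC 100000000ULL

/* Coins earned from `blocks` blocks of `reward` coins each,
   after deducting a pool fee given in whole percent. */
static uint64_t coins_after_fee(uint64_t blocks, uint64_t reward,
                                uint64_t fee_percent)
{
    uint64_t gross = blocks * reward;
    return gross - (gross * fee_percent) / 100;
}

/* Value in satoshis of `coins` coins priced at `sat_per_coin`. */
static uint64_t value_in_satoshi(uint64_t coins, uint64_t sat_per_coin)
{
    return coins * sat_per_coin;
}

/* Converts satoshis to BTC as a double, for display only. */
static double satoshi_to_btc(uint64_t sat)
{
    return (double)sat / (double)SATOSHI_PER_BTC;
}
```

With the post's numbers, coins_after_fee(40, 40, 1) gives 1584 MUE, and value_in_satoshi(1000, 15) gives 15000 satoshis, which is 0.00015 BTC.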

If I could only find the right coin...

--scryptr

TIPS:  BTC - 1Fs4uZ6a9ABYBTaHGUfqcwCQmeBRxkKRQT    DASH - XrK81tW31SLsVvZ2WX9VhTjpT6GXJPLdbQ
          SCRYPTR'S NOTEBOOK: https://bitcointalk.org/index.php?topic=5035515.msg46035530#msg46035530
          GITHUB: "github.com/scryptr"  MERIT is appreciated, also.  Thanks!
bathrobehero
September 05, 2015, 11:06:44 PM
 #5632

MonetaryUnit (MUE)
Exchange volume (24h): 0.02 BTC.

No offense, but I still chuckle every time I see a multipool picking up coins with 0 volume.

If I ever launch a coin it will have coin maturity of about a week and it will change the algo every few weeks just to throw multipools off.

Not your keys, not your coins!
chrysophylax
September 06, 2015, 02:08:45 AM
 #5633

I'd be in on that ;-) ...

#crysx

bensam1231
September 06, 2015, 06:23:03 AM
 #5634

"If I could only find the right coin..." --scryptr

Volume bro... Mining a dead coin. The more hashrate you have, the bigger the coins you have to mine. Putting aside volume, large amounts of hashrate instantly destroy the difficulty on small coins, making them pointless to mine.

I buy private Nvidia miners. Send information and/or inquiries to my PM box.
ldp5500
September 06, 2015, 07:48:54 AM
 #5635

Any optimizations for the GTX 950? I have three cards waiting. :D :D
kama
September 06, 2015, 08:17:54 AM
 #5636

Optimizations are specific to the compute capability (I mean compute 5.2 or 5.0), not to the card.
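
To make the distinction concrete: kernels are built per compute capability, so cards that share one run the same compiled code. Below is a small illustrative lookup in C for the Maxwell cards mentioned in this thread. The struct and function names are hypothetical, and the list is not exhaustive.

```c
#include <stddef.h>
#include <string.h>

/* Compute capability as major.minor, e.g. 5.2. */
struct gpu_arch {
    const char *card;
    int major;
    int minor;
};

/* Maxwell cards mentioned in the thread and their compute capability. */
static const struct gpu_arch known_cards[] = {
    { "GTX 750 Ti", 5, 0 },
    { "GTX 950",    5, 2 },
    { "GTX 960",    5, 2 },
    { "GTX 970",    5, 2 },
    { "GTX 980 Ti", 5, 2 },
};

/* Returns the table entry for `card`, or NULL if it is not listed. */
static const struct gpu_arch *lookup_arch(const char *card)
{
    size_t n = sizeof known_cards / sizeof known_cards[0];
    for (size_t i = 0; i < n; i++) {
        if (strcmp(known_cards[i].card, card) == 0)
            return &known_cards[i];
    }
    return NULL;
}

/* True when two listed cards share a compute capability, i.e. the
   same architecture-specific build applies to both. */
static int same_arch(const char *a, const char *b)
{
    const struct gpu_arch *x = lookup_arch(a);
    const struct gpu_arch *y = lookup_arch(b);
    return x && y && x->major == y->major && x->minor == y->minor;
}
```

So a GTX 950 (compute 5.2) should pick up the same 5.2 code paths as a GTX 970, while a 750 Ti uses the 5.0 ones.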

go6ooo1212
September 06, 2015, 09:58:22 AM
 #5637

"Any optimizations for the GTX 950? I have three cards waiting."

Yesterday I tested one of these babies. It reaches 8.77 MH/s on quark when overclocked...
skunk
September 06, 2015, 11:06:43 AM
 #5638

master's whirlpoolx seems broken, it segfaults under linux...

ldp5500
September 06, 2015, 11:41:35 AM
 #5639

"Yesterday I tested one of these babies. It reaches 8.77 MH/s on quark when overclocked..."

That is really good. Which version should I use? Thanks.
go6ooo1212
September 06, 2015, 12:52:34 PM
 #5640

r.65, the latest release