Bitcoin Forum
June 18, 2025, 03:59:12 PM *
News: Latest Bitcoin Core release: 29.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 ... 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 [227] 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 ... 1240 »
  Print  
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2347845 times)
bathrobehero
Legendary
*
Offline Offline

Activity: 2002
Merit: 1051


ICO? Not even once.


View Profile
August 05, 2015, 02:54:53 PM
 #4521

They probably don't care about mining and rendering and other CUDA calculations might even improved.

Not your keys, not your coins!
jjjordan
Sr. Member
****
Offline Offline

Activity: 271
Merit: 251


View Profile
August 05, 2015, 03:00:13 PM
 #4522

Not for sale. But you can keep donating, so I can publish small increases in the hashrates for free.

Next up is another groestlcoin optimalization.

From 23,7 to 24 MHASH on the gtx 970 windoforce oc.(stock) (will submitt later tonight)

The speed on AMD cards with the pallas opensource is:

Quote
v1 - to be compiled with catalyst 14.6 or 14.7:

R9 290x @1125 Mhz: ~26.4 Mh/s
R9 290 @1200: ~25 Mh/s
R9 280x (stock): ~18 Mh/s
7950 @1200: ~16 Mh/s
R9 270X: ~9.7 Mh/s

v2 - experimental hawaii only bin:

R9 290x @1125 Mhz: ~34.4 Mh/s
R9 290 @1100: ~30.6 Mh/s


Mine is faster on Tahiti - bin is public.

that falls into the "v2" category ;-)
in order to avoid confusion, I might link your tahiti bin in the OP and add its hashrate to that list.
are you ok with that?

Yeah, go ahead; just credit me.

I was looking around about mining groestl and it's only groestlcoin, right?
Only one exchange without volume, so bag holding only?

you can mine diamond as well.

I see, that makes more sense... Was just wondering why you guys are optimizing groestl if it's not that popular...
chrysophylax
Legendary
*
Offline Offline

Activity: 3080
Merit: 1093


--- ChainWorks Industries ---


View Profile WWW
August 05, 2015, 03:01:16 PM
 #4523

They probably don't care about mining and rendering and other CUDA calculations might even improved.

the funny thing is - they originally built cuda to spearhead the 'render farm' movie niche and scientific market that was wide open a little while ago ...

why would they screw things up now? ...

you are probably right though - they dont care about mining and the rest of the community :| ...

#crysx

bathrobehero
Legendary
*
Offline Offline

Activity: 2002
Merit: 1051


ICO? Not even once.


View Profile
August 05, 2015, 03:12:18 PM
 #4524

They probably don't care about mining and rendering and other CUDA calculations might even improved.

the funny thing is - they originally built cuda to spearhead the 'render farm' movie niche and scientific market that was wide open a little while ago ...

why would they screw things up now? ...

you are probably right though - they dont care about mining and the rest of the community :| ...

#crysx

We should probably make them more aware of the mining community.

To add to my previous comment:



Not your keys, not your coins!
Epsylon3
Legendary
*
Offline Offline

Activity: 1484
Merit: 1122


ccminer/cpuminer developer


View Profile WWW
August 05, 2015, 03:23:51 PM
 #4525

you dont have to use the distribution packages to use cuda... all the versions can be used (and even mixed on linux) for a project.

I repeat it again... there was an improvement in the 7.5 RC (over the 7.0 Wink ) and a big part of the speed reduction is related to the fine tuning which should be redone due to the different register count in the output "binaries". Kernels which were "oppressed" (with a low limit compared to the required count of registers) are slower

BTC: 1FhDPLPpw18X4srecguG3MxJYe4a1JsZnd - My Projects: ccminer - cpuminer-multi - yiimp - Forum threads : ccminer - cpuminer-multi - yiimp
flipclip
Member
**
Offline Offline

Activity: 111
Merit: 10


View Profile
August 05, 2015, 03:45:18 PM
 #4526

I see, that makes more sense... Was just wondering why you guys are optimizing groestl if it's not that popular...

Groestl is used in quark (possibly others), so faster Groestl means faster quark.
pallas
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
August 05, 2015, 03:47:09 PM
 #4527

I see, that makes more sense... Was just wondering why you guys are optimizing groestl if it's not that popular...

Groestl is used in quark (possibly others), so faster Groestl means faster quark.

and x11 and x13 and lyra2 etc....
but, the code is only shared at times, so it takes work to spread to all the algos.

joblo
Legendary
*
Offline Offline

Activity: 1470
Merit: 1114


View Profile
August 05, 2015, 04:06:16 PM
 #4528

you dont have to use the distribution packages to use cuda... all the versions can be used (and even mixed on linux) for a project.

I repeat it again... there was an improvement in the 7.5 RC (over the 7.0 Wink ) and a big part of the speed reduction is related to the fine tuning which should be redone due to the different register count in the output "binaries". Kernels which were "oppressed" (with a low limit compared to the required count of registers) are slower

There are two issues here and I don't know which one you are responding to.

Yes multiple versions of cuda can be used on the same Linux installation, just need to specify
--with-cuda when running ./configure. The RPM repo supports this as well with seperate meta-packages
for cuda 6.5 and 7.

Given that cuda 6.5 is still the best for mining at this time the other issue is finding a Linux distro that
has 6.5 available from Nvidia, either in the distro's package format or run file. The last 6.5 for Ubuntu was
14.04, for Fedora it was 20.

I know that in Fedora some RPMs for release n will work in release n+1. In fact the Fedora 18 version of
Virtualbox was carried over all the way to 21. it's only in Fedora 22 that VBox created a new release version.

I don't know if this is the case with cuda but it might be worth a try.

AKA JayDDee, cpuminer-opt developer. https://github.com/JayDDee/cpuminer-opt
https://bitcointalk.org/index.php?topic=5226770.msg53865575#msg53865575
BTC: 12tdvfF7KmAsihBXQXynT6E6th2c2pByTT,
pallas
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
August 05, 2015, 05:29:07 PM
 #4529

There is something wrong with the dependencies of groestl_functions_quad.cu: modifying it doesn't trigger recompile of the files that include it, like groestlcoin.cu

sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
August 05, 2015, 05:37:19 PM
 #4530

they are exluded to speedup the build. If you save the file in the editor, they will recompile.

1>------ Build started: Project: ccminer, Configuration: Release Win32 ------
1>  Skipping CUDA source file bitslice_transformations_quad.cu (excluded from build).
1>  Skipping CUDA source file groestl_functions_quad.cu (excluded from build).
1>  Compiling CUDA source file quark\cuda_quark_compactionTest.cu...

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
pallas
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
August 05, 2015, 06:08:28 PM
 #4531

That's not how a Makefile is supposed to work.
When I modify a file, the files that include it should be recompiled on the next build.
Otherwise you need to find all that files and touch them manually. And if you don't remember to do it, the changes will not be reflected in the binary.

hashbrown9000
Sr. Member
****
Offline Offline

Activity: 427
Merit: 250


View Profile
August 05, 2015, 09:19:15 PM
 #4532

@chrysophylax , if you'd like i could try to upload an image to Mega when I had an HDD installation with overclocking working on the 750ti's. It's Ubuntu 14.04 with CUDA 6.5 and 346.xx nvidia drivers.  The thing was very stable and I have the HDD saved for a backup in case one of my USB sticks goes down.

Pinkcoin:
ETH:
VTC:
BTC:
chrysophylax
Legendary
*
Offline Offline

Activity: 3080
Merit: 1093


--- ChainWorks Industries ---


View Profile WWW
August 06, 2015, 01:07:52 AM
 #4533

@chrysophylax , if you'd like i could try to upload an image to Mega when I had an HDD installation with overclocking working on the 750ti's. It's Ubuntu 14.04 with CUDA 6.5 and 346.xx nvidia drivers.  The thing was very stable and I have the HDD saved for a backup in case one of my USB sticks goes down.

tanx hashbrown9000,

that would be great ... is it a usb stick install? ...

im not familiar with debian based distros - though i have tinkered with them ...

im more a rhel based ( rhel - fedora ) user and admin guy Smiley ...

but i would really like that - yes ... please share ...

i have a great deal of running around to do today - so ill pop in when time permits ...

tanx ...

#crysx

bathrobehero
Legendary
*
Offline Offline

Activity: 2002
Merit: 1051


ICO? Not even once.


View Profile
August 06, 2015, 01:17:49 AM
 #4534

Bit off topic but in GPU-Z there's an ASIC quality estimation tool and I checked all of my GPUs and it seems there is a correlation between the estimated quality and overclocking capabilities.

It apparently takes into account the voltage requires to operate and some other factors to generate a number that doesn't change with overclock or anything.
Higher ASIC quality cards require less voltage to operate and they also naturally boost higher and can take the most OC without overvolting.

So I started incrementally overclocking all cards in a rig with 6 x Asus non-6-pin cards which had a wide spectrum of ASIC quality (62.7% 69.4% 70.6% 70.7% 72% 80.6%) and sure enough the highest quality was the last to crash by a large margin.
Here's a picture showing how to check it. (source)

My highest card is a Gigabyte GTX 750 Ti Windforce with 83.8% and my lowest is my main card I'm not mining with but kept for the occasional gaming with 66.6% which is a 780 Ti and it overclocks absolutely terribly. The average of all my cards is 75%.


So ideally cards should be overclocked based on these figures for maximizing efficiency and stability.
High quality high end cards might even have a decent value for enthusiast gamers.

Not your keys, not your coins!
AliMan
Hero Member
*****
Offline Offline

Activity: 2296
Merit: 502


Dragonslots | +13k Slots & Casino Games


View Profile
August 06, 2015, 03:35:01 AM
 #4535

I gotta say, I'm not a believer in that ASIC thing, that just seems to be a bunch of horse.

I've had lower and higher ASIC cards, both OC quite well, and also the higher hasn't always been the best. Basically, most of these cards OC till a certain limit, regardless of how well they're made.

From what I've seen, extremely bad cards can't even hold their factory boost clock, but then again, it's got nothing to do with ASIC.
pallas
Legendary
*
Offline Offline

Activity: 2716
Merit: 1094


Black Belt Developer


View Profile
August 06, 2015, 11:39:53 AM
 #4536

I wanted to try cuda 7.5 and here's my experience with it and neoscrypt mining.

On cuda 6.5 I'm working on an optimized version which does about 560 Mh/s on 970.
Simply switching to cuda 7.5 brought it down to 420.
So I wanted to try to reach at least the same hashrate as before, but I failed: after tuning it for the new cuda version for half a day, I was only able to bring it up to about 460.

Any suggestion on how to do better than that? Or should I just leave 7.5 alone? :-)

djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
August 06, 2015, 12:00:49 PM
 #4537

I wanted to try cuda 7.5 and here's my experience with it and neoscrypt mining.

On cuda 6.5 I'm working on an optimized version which does about 560 Mh/s on 970.
Simply switching to cuda 7.5 brought it down to 420.
So I wanted to try to reach at least the same hashrate as before, but I failed: after tuning it for the new cuda version for half a day, I was only able to bring it up to about 460.

Any suggestion on how to do better than that? Or should I just leave 7.5 alone? :-)
same here... (with an additional crash on the 780ti)
you can try to add that to the compilation option: -Xptxas --def-load-cache=cs -Xptxas --def-store-cache=cs, haven't really try on neo, the problem it is 1hour to recompile each time... might help (however it is rather for the 780ti...)

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
sp_ (OP)
Legendary
*
Offline Offline

Activity: 2926
Merit: 1087

Team Black developer


View Profile
August 06, 2015, 12:07:07 PM
 #4538

I wanted to try cuda 7.5 and here's my experience with it and neoscrypt mining.

Cuda 7.5 is broken. Try to compile shavite.
With 6.5 the kernal compiles with 64 registers 0 spill stores 0 stack.
In cuda 7.5 the stack and memory usage explodes. The x11  performance is down 30%. I have tried to tweak it with register count and launchbounds, but just small gains.

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW EVRPROGPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
zTheWolfz
Full Member
***
Offline Offline

Activity: 231
Merit: 150



View Profile
August 06, 2015, 03:10:00 PM
 #4539

Had expect more out of 32 cores, but core speed is 1Ghz below my Intel rigs. Still not to bad I guess.   Roll Eyes

djm34
Legendary
*
Offline Offline

Activity: 1400
Merit: 1050


View Profile WWW
August 06, 2015, 03:31:30 PM
 #4540

your intel rig is using an amd processor... on an amd board
(but yeah, nobody can stop you from putting an intel sticker on it  Grin)

djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
Pages: « 1 ... 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 [227] 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 ... 1240 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!