DougB62
|
|
July 23, 2014, 02:31:36 PM |
|
When I run this build I get "Unable to query number of CUDA device! Is an nVidia driver installed?" Working with 2x EVGA 750TI SC on driver 337.88 Ideas?
I get the same error. 750Ti's & 660's
|
|
|
|
sp_
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
July 23, 2014, 02:40:35 PM |
|
download cuda 6.5
|
|
|
|
sp_
Legendary
Offline
Activity: 2954
Merit: 1087
Team Black developer
|
|
July 23, 2014, 02:42:50 PM |
|
Ye, full warps do sound tasty. We're starting to get there too. The launch config isn't exactly about threads per block anymore, the kernels are starting to use more than one thread per hash and the launch config is actually hashes per block and blocks per grid. For example the kernels I modified earlier are now running eight threads per hash, so they're actually already at full warp size at four hashes per block. The latest experimental build takes the slowest kernel that is running only a single thread per hash on the latest committed source and spreads it out between four threads per hash. Again, full warp at eight hashes per block while four hashes per block remains kinda iffy.
Good job. Wonder if we can do something similar in the x11/x13 etc. implementations. Anything that could boost the hashrate is worth a try.
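To make the arithmetic in the quoted post concrete, here is a minimal sketch, not code from the miner itself. It assumes the usual NVIDIA warp size of 32 threads; the function names are made up for illustration.

```python
# Sketch only: relates "threads per hash" and "hashes per block" to warp size.
WARP_SIZE = 32  # threads per warp on NVIDIA GPUs


def threads_per_block(threads_per_hash, hashes_per_block):
    """Actual CUDA threads in one block for a hashes-per-block launch config."""
    return threads_per_hash * hashes_per_block


def fills_whole_warps(threads_per_hash, hashes_per_block):
    """True when the block is made of complete warps only."""
    return threads_per_block(threads_per_hash, hashes_per_block) % WARP_SIZE == 0


# The cases mentioned in the post: 8 threads/hash at 4 hashes/block,
# and 4 threads/hash at 8 or 4 hashes/block.
for tph, hpb in [(8, 4), (4, 8), (4, 4)]:
    print(tph, hpb, threads_per_block(tph, hpb), fills_whole_warps(tph, hpb))
```

With four threads per hash, four hashes per block gives only 16 threads, i.e. half a warp, which matches the "kinda iffy" remark.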
|
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
July 23, 2014, 02:47:27 PM |
|
When I run this build I get "Unable to query number of CUDA device! Is an nVidia driver installed?" Working with 2x EVGA 750TI SC on driver 337.88 Ideas?
I get the same error. 750Ti's & 660's
Might be a problem with the drivers... try to use the latest. (Forgot about that: CUDA 6.5 installs driver 340.46 (not even on the Nvidia site), so it "might" (meaning: "I don't know") work with the latest one on the Nvidia site, which is also a 340.something...) Or maybe wait for someone with a fast computer to compile with 5.5 (won't be me, sorry... it takes 2 hours...)
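A quick way to compare driver versions like the ones above, as a rough sketch. The 340.46 figure is just the version the post says ships with CUDA 6.5; treating it as the minimum a CUDA 6.5 build needs is an assumption, not something confirmed here. The helper names are hypothetical.

```python
# Sketch only: compares dotted driver version strings numerically.
def parse_version(text):
    """Turn '337.88' into (337, 88) so versions compare field by field."""
    return tuple(int(part) for part in text.split("."))


def driver_at_least(installed, required):
    """True when the installed driver version is >= the required one."""
    return parse_version(installed) >= parse_version(required)


# 337.88 is the driver from the error report; 340.46 is the version the post
# says CUDA 6.5 installs (assumed here to be the one the build wants).
print(driver_at_least("337.88", "340.46"))
```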
|
djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
AizenSou
|
|
July 23, 2014, 02:51:26 PM |
|
CCMiner algos:
anime (C&C)
cryptonight (tsiv)
dmd-gr (Bombadil)
fresh (djm34)
fugue256 (C&C)
groestl (C&C)
heavy (C&C-based off reorder's cgminer code)
jackpot (C&C)
mjollnir (C&C-based off reorder's cgminer code)
myr-gr (C&C)
nist5 (C&C)
quark (C&C)
qubit (djm34)
Whirlcoin (djm34)
x11 (C&C)
x13 (C&C)
x14 (djm34)
x15 (djm34)
x17 (djm34)

1 Bombadil
1 tsiv
6 djm34
11 C&C

djm34 is on a massive roll! Change ccminer's name to cdminer (Christian&djm34) or even dcminer (after djm34 passes Christian's number of algos) soon
|
|
|
|
DougB62
|
|
July 23, 2014, 02:51:39 PM |
|
download cuda 6.5
Wow - I gotta wait for my Registered Developer Program submission to be approved... bummer.
|
|
|
|
tarzanbigcity
|
|
July 23, 2014, 02:52:00 PM |
|
download cuda 6.5
Looks like it requires you to register as a dev to download. Anyone got a link to download it?
|
|
|
|
DrAlco
Newbie
Offline
Activity: 43
Merit: 0
|
|
July 23, 2014, 02:52:59 PM |
|
At this point I'm starting to think I'll just forget about that part and start looking if there's something else to be improved. I'm still curious as to how it runs on other hardware, so if a couple of gents on Win boxes with something else than a 750 Ti in would be willing to take it for a spin, I'd appreciate it. I've added the number for SMX/SMM/Whateverthingmabobs into the miner thread start-up info, you'll probably find your card performing best when the block count is a multiple of the SMX count and the number of threads a power of 2. 4/8/16/32/64 are the best bets.
https://github.com/tsiv/ccminer-cryptonight/releases/download/v0.15-rc1/ccminer-cryptonight_20140723_exp.zip
Improved hashrate of about 70H/s on a 780ti. Up from 320 to about 390 (using 8x60). Also doesn't seem to hang and bring the system to its knees when using all GFX cards.
Seems to be in line with the ~18% improvements I saw when benchmarking only the AES part of the kernel. Have you tried other configs? 390 is still pretty low for a 780 Ti, I think people were getting best results with 4x120 on the 780 Ti.
Getting about 420h/s with 4x120.
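For anyone hunting for a launch config, the rule of thumb quoted above (block count a multiple of the SMX count, thread count one of 4/8/16/32/64) can be enumerated with a small sketch like this. The function name and the `max_multiplier` cap are made up for illustration, and the "threads x blocks" ordering is inferred from the "8x60" and "15 blocks of 32 threads" figures in this thread. Read the SMX/SMM count from the miner's start-up info rather than trusting the numbers below.

```python
# Sketch only: lists launch configs that follow the rule of thumb from the post.
def candidate_configs(sm_count, max_multiplier=4, thread_choices=(4, 8, 16, 32, 64)):
    """Return (threads, blocks) pairs where blocks is a multiple of sm_count."""
    configs = []
    for multiplier in range(1, max_multiplier + 1):
        blocks = sm_count * multiplier
        for threads in thread_choices:
            configs.append((threads, blocks))
    return configs


# Assumed SM counts: 5 for a GTX 750 Ti, 15 for a GTX 780 Ti.
for threads, blocks in candidate_configs(5, max_multiplier=3):
    print(f"{threads}x{blocks}")
```

Both configs mentioned in the thread fit the pattern: 8x60 is 12 times 5 blocks, and 4x120 is 8 times 15 blocks.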
|
|
|
|
cayars
|
|
July 23, 2014, 02:57:21 PM |
|
When I run this build I get "Unable to query number of CUDA device! Is an nVidia driver installed?" Working with 2x EVGA 750TI SC on driver 337.88 Ideas?
I get the same error. 750Ti's & 660's
Might be a problem with the drivers... try to use the latest. (Forgot about that: CUDA 6.5 installs driver 340.46 (not even on the Nvidia site), so it "might" (meaning: "I don't know") work with the latest one on the Nvidia site, which is also a 340.something...) Or maybe wait for someone with a fast computer to compile with 5.5 (won't be me, sorry... it takes 2 hours...)
I'm adding this to nvMiner as I type this, but it's a 3 or 4 hour build with 5.5 for me with all the extra stuff. I'll post the compiled version as soon as it's complete.
|
|
|
|
cayars
|
|
July 23, 2014, 02:58:37 PM |
|
download cuda 6.5
Wow - I gotta wait for my Registered Developer Program submission to be approved... bummer.
I've been waiting over 2 weeks.
|
|
|
|
DougB62
|
|
July 23, 2014, 02:59:17 PM |
|
When I run this build I get "Unable to query number of CUDA device! Is an nVidia driver installed?" Working with 2x EVGA 750TI SC on driver 337.88 Ideas?
I get the same error. 750Ti's & 660's
Might be a problem with the drivers... try to use the latest. (Forgot about that: CUDA 6.5 installs driver 340.46 (not even on the Nvidia site), so it "might" (meaning: "I don't know") work with the latest one on the Nvidia site, which is also a 340.something...) Or maybe wait for someone with a fast computer to compile with 5.5 (won't be me, sorry... it takes 2 hours...)
I'm adding this to nvMiner as I type this, but it's a 3 or 4 hour build with 5.5 for me with all the extra stuff. I'll post the compiled version as soon as it's complete.
Bravo! cayars to the rescue!
|
|
|
|
tarzanbigcity
|
|
July 23, 2014, 03:09:39 PM |
|
When I run this build I get "Unable to query number of CUDA device! Is an nVidia driver installed?" Working with 2x EVGA 750TI SC on driver 337.88 Ideas?
I get the same error. 750Ti's & 660's
Might be a problem with the drivers... try to use the latest. (Forgot about that: CUDA 6.5 installs driver 340.46 (not even on the Nvidia site), so it "might" (meaning: "I don't know") work with the latest one on the Nvidia site, which is also a 340.something...) Or maybe wait for someone with a fast computer to compile with 5.5 (won't be me, sorry... it takes 2 hours...)
I'm adding this to nvMiner as I type this, but it's a 3 or 4 hour build with 5.5 for me with all the extra stuff. I'll post the compiled version as soon as it's complete.
For those who don't want to wait, here is the Nvidia Driver 340.46 with support for CUDA 6.5. I don't feel like updating untested drivers on multiple rigs, so I will wait for the 5.5 compile.
https://mega.co.nz/#!UgF2nKxQ!iAUukN9fJndF4askDWwsddjOb69bm2J02DgT1riMVlY
|
|
|
|
DougB62
|
|
July 23, 2014, 03:14:53 PM |
|
Meh... the window of opportunity is pretty much closed on PPL now anyways... back to watching for a good (haha) ninja launch.
|
|
|
|
PVmining
|
|
July 23, 2014, 03:34:59 PM |
|
DJM... is your hint still working for all the Maxwell guys to speed up the compile by reducing it to the compute_3.5 version? Then this could be an idea for others too:
makefile.in
# we're now targeting all major compute architectures within one binary.
.cu.o:
	$(NVCC) @CFLAGS@ -I . -Xptxas "-abi=no -v" -gencode=arch=compute_20,code=\"sm_20,compute_20\" -gencode=arch=compute_30,code=\"sm_30,compute_30\" -gencode=arch=compute_35,code=\"sm_35,compute_35\" --maxrregcount=80 --ptxas-options=-v $(JANSSON_INCLUDES) -o $@ -c $<
|
|
|
|
7854587
Newbie
Offline
Activity: 43
Merit: 0
|
|
July 23, 2014, 03:49:47 PM |
|
[2014-07-20 17:13:32] Stratum detected new block
[2014-07-20 17:15:08] Stratum detected new block
[2014-07-20 17:17:21] Stratum detected new block
[2014-07-20 17:19:54] Stratum detected new block
[2014-07-20 17:20:12] Stratum detected new block
[2014-07-20 17:22:34] Stratum detected new block
[2014-07-20 17:25:45] Stratum detected new block
[2014-07-20 17:26:04] Stratum detected new block
[2014-07-20 17:29:18] Stratum detected new block
[2014-07-20 17:29:50] Stratum detected new block
[2014-07-20 17:31:57] Stratum detected new block
[2014-07-20 17:32:32] Stratum detected new block
[2014-07-20 17:33:32] Stratum detected new block
Why is it so?
ccminer21.exe -a jackpot -o stratum+tcp://us-east1.jackpotcoin.miningpoolhub.com:20484 -u end.aa -p xx
GTS 450, help me
|
|
|
|
wetland
Newbie
Offline
Activity: 11
Merit: 0
|
|
July 23, 2014, 03:56:13 PM |
|
[2014-07-20 17:13:32] Stratum detected new block
[2014-07-20 17:15:08] Stratum detected new block
[2014-07-20 17:17:21] Stratum detected new block
[2014-07-20 17:19:54] Stratum detected new block
[2014-07-20 17:20:12] Stratum detected new block
[2014-07-20 17:22:34] Stratum detected new block
[2014-07-20 17:25:45] Stratum detected new block
[2014-07-20 17:26:04] Stratum detected new block
[2014-07-20 17:29:18] Stratum detected new block
[2014-07-20 17:29:50] Stratum detected new block
[2014-07-20 17:31:57] Stratum detected new block
[2014-07-20 17:32:32] Stratum detected new block
[2014-07-20 17:33:32] Stratum detected new block
Why is it so?
ccminer21.exe -a jackpot -o stratum+tcp://us-east1.jackpotcoin.miningpoolhub.com:20484 -u end.aa -p xx
GTS 450, help me
Just a guess: did you upgrade your wallet? "V 1.4.1 MANDATORY UPGRADE (1) PoS bug patch - Start from : Block # : 451000 * Estimated time : July 23 22:00PM (USA EAST TIME ZONE)"
https://bitcointalk.org/index.php?topic=584481.0
Edit: never mind, it wouldn't have anything to do with that.
|
|
|
|
Bombadil
|
|
July 23, 2014, 04:02:39 PM |
|
Welp. Managed to split the most offensive part of the kernel into four parallel threads per hash, result is spectacularly unimpressive. The best I've come up with breaks even with the current single thread per hash implementation. Well, almost. It's actually a percent slower AND loses compute 2.0 compatibility due to using shuffle. On the other hand it performs a lot more reasonably with various launch configurations, 15 blocks of 32 threads works out equally well as the original 8x60 magic bullet for 750 Ti. At this point I'm starting to think I'll just forget about that part and start looking if there's something else to be improved. I'm still curious as to how it runs on other hardware, so if a couple of gents on Win boxes with something else than a 750 Ti in would be willing to take it for a spin, I'd appreciate it. I've added the number for SMX/SMM/Whateverthingmabobs into the miner thread start-up info, you'll probably find your card performing best when the block count is a multiple of the SMX count and the number of threads a power of 2. 4/8/16/32/64 are the best bets.
https://github.com/tsiv/ccminer-cryptonight/releases/download/v0.15-rc1/ccminer-cryptonight_20140723_exp.zip
Also, any chances for this code to get released already? Or are you competing against Wolf0
It works like a charm, 220H/s for GTX760, before it was 190. GTX750TIs seem unchanged.
|
|
|
|
7854587
Newbie
Offline
Activity: 43
Merit: 0
|
|
July 23, 2014, 04:06:18 PM |
|
[2014-07-20 17:13:32] Stratum detected new block
[2014-07-20 17:15:08] Stratum detected new block
[2014-07-20 17:17:21] Stratum detected new block
[2014-07-20 17:19:54] Stratum detected new block
[2014-07-20 17:20:12] Stratum detected new block
[2014-07-20 17:22:34] Stratum detected new block
[2014-07-20 17:25:45] Stratum detected new block
[2014-07-20 17:26:04] Stratum detected new block
[2014-07-20 17:29:18] Stratum detected new block
[2014-07-20 17:29:50] Stratum detected new block
[2014-07-20 17:31:57] Stratum detected new block
[2014-07-20 17:32:32] Stratum detected new block
[2014-07-20 17:33:32] Stratum detected new block
Why is it so?
ccminer21.exe -a jackpot -o stratum+tcp://us-east1.jackpotcoin.miningpoolhub.com:20484 -u end.aa -p xx
GTS 450, help me
Just a guess: did you upgrade your wallet? "V 1.4.1 MANDATORY UPGRADE (1) PoS bug patch - Start from : Block # : 451000 * Estimated time : July 23 22:00PM (USA EAST TIME ZONE)"
https://bitcointalk.org/index.php?topic=584481.0
Edit: never mind, it wouldn't have anything to do with that.
No speed, NO JPC
|
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
July 23, 2014, 04:13:41 PM |
|
DJM... is your hint still working for all the Maxwell guys to speed up the compile by reducing it to the compute_3.5 version? Then this could be an idea for others too:
makefile.in
# we're now targeting all major compute architectures within one binary.
.cu.o:
	$(NVCC) @CFLAGS@ -I . -Xptxas "-abi=no -v" -gencode=arch=compute_20,code=\"sm_20,compute_20\" -gencode=arch=compute_30,code=\"sm_30,compute_30\" -gencode=arch=compute_35,code=\"sm_35,compute_35\" --maxrregcount=80 --ptxas-options=-v $(JANSSON_INCLUDES) -o $@ -c $<
That's the problem: if I target the Maxwell guys, then the Fermi guys complain (however, the binaries I released are only compute_35 and compute_50; I forgot to add compute_30). Also, the makefile is only for Linux (I guess people compiling on Linux can select the compute version they need and remove the others...). Regarding the window of opportunity for PPL... well, that was last week. However, the coin has never been profitable at all (no matter what percentage of the hashrate you got).
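For Linux builders who want to trim the target list, here is a rough sketch that prints only the -gencode flags for the compute versions you pick, in the same form as the makefile line quoted above. The helper name is hypothetical and this is not part of the ccminer build system; paste the output into the .cu.o rule by hand.

```python
# Sketch only: builds -gencode flags in the same shape as the quoted makefile rule.
def gencode_flags(compute_versions):
    """Return one -gencode flag per compute version, e.g. '35' or '50'."""
    flags = []
    for version in compute_versions:
        # \" is kept so the flag matches the escaping used in the makefile.
        flags.append(
            f'-gencode=arch=compute_{version},code=\\"sm_{version},compute_{version}\\"'
        )
    return " ".join(flags)


# Example: a Maxwell-only build (compute_50), fewer targets to compile.
print(gencode_flags(["50"]))
```

Fewer targets means fewer passes per .cu file, which is presumably where the shorter compile time comes from.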
|
|
|
zelante
|
|
July 23, 2014, 04:24:22 PM |
|
Welp. Managed to split the most offensive part of the kernel into four parallel threads per hash, result is spectacularly unimpressive. The best I've come up with breaks even with the current single thread per hash implementation. Well, almost. It's actually a percent slower AND loses compute 2.0 compatibility due to using shuffle. On the other hand it performs a lot more reasonably with various launch configurations, 15 blocks of 32 threads works out equally well as the original 8x60 magic bullet for 750 Ti. At this point I'm starting to think I'll just forget about that part and start looking if there's something else to be improved. I'm still curious as to how it runs on other hardware, so if a couple of gents on Win boxes with something else than a 750 Ti in would be willing to take it for a spin, I'd appreciate it. I've added the number for SMX/SMM/Whateverthingmabobs into the miner thread start-up info, you'll probably find your card performing best when the block count is a multiple of the SMX count and the number of threads a power of 2. 4/8/16/32/64 are the best bets.
https://github.com/tsiv/ccminer-cryptonight/releases/download/v0.15-rc1/ccminer-cryptonight_20140723_exp.zip
Also, any chances for this code to get released already? Or are you competing against Wolf0
It works like a charm, 220H/s for GTX760, before it was 190. GTX750TIs seem unchanged.
What speed for the 750 Ti?
|
|
|
|
|