buysolar
Member
Offline
Activity: 125
Merit: 12
|
|
April 15, 2013, 02:13:11 PM |
|
Huge improvement in interactive mode. Not sure if it is the new update or that I used -C 0 this time, but interactive mode autotuned me to 8x8, and that has my khash at ~46 now on my GTX 460M. That is better than my previous non-interactive hash rate by 1 khash, and better than my previous interactive hash rate by 10 khash, about a 30% improvement. Was nervous at first because it gave me 3 boos in a row, but then the yays started coming. I'll try it in non-interactive mode tonight and see if the huge bump in performance carries over to that as well.
|
|
|
|
nst6563
|
|
April 15, 2013, 02:24:12 PM |
|
Added my gtx560se and gt430 oc to the list.
|
|
|
|
cbuchner1 (OP)
|
|
April 15, 2013, 02:45:34 PM |
|
Huge improvement in interactive mode. Not sure if it is the new update or that I used -C 0 this time, but interactive mode autotuned me to 8x8, and that has my khash at ~46 now on my GTX 460M.
The x8 kernel launch configurations were added with the very latest update. And interactive mode performance on Windows was also greatly increased. The only remaining problem is the increased CPU use in interactive mode.
|
|
|
|
haderach
Newbie
Offline
Activity: 12
Merit: 0
|
|
April 15, 2013, 03:14:42 PM |
|
Following the computation on total memory used, what is the reason that not even half of the memory available on the Titan is being used? Up until the last release today (4/14) autotune has always picked a configuration of 300-307x2 (now 263x2), which comes out to about 2.5GB of used memory at most. Since scrypt is a space/computation tradeoff algorithm, isn't there potential for almost a doubling in hash performance if the full 6GB could be used?
At the moment there is no trade-off. I use the full scratchpad size per thread. So required memory size scales with the number of threads. A trade-off happens when you try to reduce the scratchpad size at the cost of increased computation. Trying this is on my TODO list. Christian
Ah, so if you're using the full scratchpad space right now, do you have an idea why I can't use any launch configurations that go over 2.8GB? The program says there isn't enough memory, but according to the calculation there should be. I'm wondering if there is an integer overflow comparison somewhere?
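For anyone who wants to redo the memory calculation discussed above, here is a minimal sketch. It assumes a launch configuration "BxW" means B blocks of W warps each and that every thread gets one full scrypt scratchpad, as Christian describes. The function name and constants are illustrative and not taken from the cudaminer source; the 128 KiB scratchpad follows from Litecoin's scrypt parameters (N=1024, r=1).

```python
# Scratchpad size per hash for scrypt is 128 * r * N bytes.
SCRYPT_N = 1024          # Litecoin's scrypt N
SCRYPT_R = 1             # Litecoin's scrypt r
SCRATCHPAD_BYTES = 128 * SCRYPT_R * SCRYPT_N  # 131072 bytes = 128 KiB

THREADS_PER_WARP = 32    # warp size on NVIDIA GPUs


def scratchpad_total_bytes(blocks: int, warps_per_block: int) -> int:
    """Total scratchpad memory for a launch configuration.

    ASSUMPTION: 'blocks x warps_per_block' is how the launch
    configuration is read, with one full scratchpad per thread.
    """
    threads = blocks * warps_per_block * THREADS_PER_WARP
    return threads * SCRATCHPAD_BYTES


if __name__ == "__main__":
    # The two Titan configurations mentioned in the post above.
    for blocks, warps in [(307, 2), (263, 2)]:
        total = scratchpad_total_bytes(blocks, warps)
        print(f"{blocks}x{warps}: {total / 2**30:.2f} GiB")
```

Under these assumptions each warp costs 4 MiB, so 307x2 lands a little under 2.5 GB, which is consistent with the figure quoted in the post.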
|
|
|
|
crunchy
Newbie
Offline
Activity: 19
Merit: 0
|
|
April 15, 2013, 04:53:11 PM |
|
Hmmm, tried cudaminer on another clean Win8 PC, and it won't run, complaining that msvcp100.dll is missing. The Visual C++ 2010 runtimes are installed and the file is where it should be, in windows\system32. When I copy msvcp100.dll to the cudaminer directory, it still won't start, giving me Windows error code 0xc000007b. On a clean PC that is VERY strange, to say the least?
|
|
|
|
Lacan82
|
|
April 15, 2013, 05:06:24 PM |
|
Hmmm, tried cudaminer on another clean Win8 PC, and it won't run, complaining that msvcp100.dll is missing. The Visual C++ 2010 runtimes are installed and the file is where it should be, in windows\system32. When I copy msvcp100.dll to the cudaminer directory, it still won't start, giving me Windows error code 0xc000007b. On a clean PC that is VERY strange, to say the least?
From the command line, try:
regsvr32 /u msvcp100.dll
regsvr32 msvcp100.dll
This program runs fine on my Windows 8 laptop.
|
|
|
|
Lacan82
|
|
April 15, 2013, 05:17:46 PM |
|
Is that a typo on the google docs? Someone saying they are receiving 5000 KH solo mining?
|
|
|
|
Wole
Newbie
Offline
Activity: 22
Merit: 0
|
|
April 15, 2013, 06:15:56 PM |
|
Oh, if the pool isn't responding, like this:
[2013-04-15 20:16:39] HTTP request failed: Failed connect to eu-pool.give-me-ltc.com:8080; No error
[2013-04-15 20:16:39] submit_upstream_work json_rpc_call failed
[2013-04-15 20:16:39] ...retry after 15 seconds
then Ctrl-C isn't working.
|
|
|
|
Lacan82
|
|
April 15, 2013, 06:25:53 PM |
|
Oh, if the pool isn't responding, like this:
[2013-04-15 20:16:39] HTTP request failed: Failed connect to eu-pool.give-me-ltc.com:8080; No error
[2013-04-15 20:16:39] submit_upstream_work json_rpc_call failed
[2013-04-15 20:16:39] ...retry after 15 seconds
then Ctrl-C isn't working.
I have this issue on my 650M card, but not on my GTX 570. If I hit Ctrl+C it just takes forever to end. I can force it to quit by end-tasking cudaminer.
|
|
|
|
cbuchner1 (OP)
|
|
April 15, 2013, 06:53:05 PM Last edit: April 15, 2013, 07:42:08 PM by cbuchner1 |
|
Ah, so if you're using the full scratchpad space right now do you have an idea why I can't use any launch configurations that go over 2.8GB? The program says there isn't enough memory, but according to the calculation there should be. I'm wondering if there is an integer overflow comparison somewhere?
The Windows Display Driver Model (WDDM) doesn't allow for really huge single memory allocations. For this to work you need to run the card on Windows XP, on Linux, or on Windows with the TCC driver (which is officially only for Tesla cards). Vista, 7 and 8 use WDDM. ALSO: on 32-bit systems, or when using 32-bit binaries (like cudaminer.exe), one can only address 4 GB of memory on the card at most. Christian
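To put numbers on the 4 GB addressing limit mentioned above, here is a rough sketch. It assumes 4 MiB of scratchpad per warp (32 threads at 128 KiB each, one full scratchpad per thread) and shows both the largest warp count that fits under a given byte limit and what a byte count looks like after being truncated to 32 bits. The helper names are hypothetical, and nothing here claims that cudaminer actually performs such a truncation; it only illustrates the kind of wraparound haderach was asking about.

```python
BYTES_PER_WARP = 32 * 128 * 1024   # 32 threads x 128 KiB = 4 MiB (assumed layout)
ADDRESS_LIMIT_32BIT = 2**32        # bytes addressable with a 32-bit pointer


def max_warps_under(limit_bytes: int) -> int:
    """Largest total warp count whose scratchpads fit in limit_bytes."""
    return limit_bytes // BYTES_PER_WARP


def wrapped_32bit(total_bytes: int) -> int:
    """Value of total_bytes after truncation to an unsigned 32-bit integer."""
    return total_bytes & 0xFFFFFFFF


if __name__ == "__main__":
    print("warps that fit in 4 GiB:", max_warps_under(ADDRESS_LIMIT_32BIT))
    # A request above 4 GiB, stored in a 32-bit variable, silently shrinks.
    request = 1100 * BYTES_PER_WARP
    print("requested bytes:", request)
    print("after 32-bit truncation:", wrapped_32bit(request))
```

In practice the usable amount is lower than the raw limit, since the driver and the miner's other buffers also take up card memory, and the WDDM restriction on single allocations applies on top of this.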
|
|
|
|
haderach
Newbie
Offline
Activity: 12
Merit: 0
|
|
April 15, 2013, 08:12:24 PM |
|
I was hoping it was just a 32 vs. 64bit issue with cudaminer. But that driver issue is unfortunate. I'll have to look into TCC or running linux.
|
|
|
|
cbuchner1 (OP)
|
|
April 15, 2013, 08:36:15 PM |
|
Anyone else having problems downloading from mega? I've tried everything, mega just loads a blank page. I've tried on several PCs.
Try disabling any flash and popup blockers temporarily. Christian
|
|
|
|
wndrbr3d
|
|
April 15, 2013, 09:58:36 PM |
|
Nope, not working on my droidx either, could u add a 2nd alternative download source, please?
Christian, could you just post the source to git and host the binaries there?
|
|
|
|
grosminer
|
|
April 16, 2013, 03:09:52 AM |
|
Is it normal that auto-tune sets a new config every time I start cudaminer? I've got a 660 Ti and the last 4 configs were: 42x7, 98x3, 166x2, 84x4.
|
|
|
|
wyodude
Newbie
Offline
Activity: 35
Merit: 0
|
|
April 16, 2013, 04:02:26 AM |
|
|
|
|
|
SubNoize
Newbie
Offline
Activity: 47
Merit: 0
|
|
April 16, 2013, 07:02:10 AM |
|
Finally found a thread from 2011 with instructions on how to fix the Gigabyte GTX 580 SOC heating/temperature issues. The cards come set to 1111 mV and you're able to knock them back to 1025 mV, which takes something like 20-30 degrees off the cards.
I know this doesn't have much to do with cudaminer, but it's now running amazingly smoothly and not causing my cards to heat up. If anyone else has problems with the Gigabyte SOC, try the above.
|
|
|
|
dentldir
|
|
April 16, 2013, 07:32:34 AM |
|
When compiling 04-14 in Linux (Ubuntu 12.04), I'm getting the following message not seen in 04-09:
/usr/local/cuda-5.0/bin/nvcc -g -O2 -arch=compute_10 --maxrregcount=124 --ptxas-options=-v -I./compat/jansson -o salsa_kernel.o -c salsa_kernel.cu
salsa_kernel.cu(269): error: no instance of overloaded function "tex2D" matches the argument list
argument types are: (texture<ulong2, 2, cudaReadModeElementType>, unsigned int, unsigned int)
Found this when playing with an Amazon EC2 GPU cluster instance on a pair of M2050's. Same error on known working Linux box.
04-09 gave about 200kH/s on the pair. That with the 12 hyperthreads on the CPU nets about 250kH/s. Still ~60x too expensive to be worth it, but fun to try anyway.
Cheers.
|
1DentLdiRMv3dpmpmqWsQev8BUaty9vN3v
|
|
|
cbuchner1 (OP)
|
|
April 16, 2013, 07:35:17 AM |
|
I've got a 660 Ti and the last 4 configs were: 42x7, 98x3, 166x2, 84x4.
And which one performed best? Take notes, and keep using that config.
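The note-taking suggested above can be as simple as the following sketch: write down a few hash-rate readings per autotuned configuration, average them, and keep the winner. The function name is illustrative, and the readings in the example are made-up placeholders, not measurements from any real card.

```python
from statistics import mean


def best_config(readings: dict[str, list[float]]) -> str:
    """Return the launch configuration with the highest average hash rate.

    readings maps a config string such as "84x4" to the kH/s values
    noted for it; configs with no readings are ignored.
    """
    averages = {cfg: mean(vals) for cfg, vals in readings.items() if vals}
    if not averages:
        raise ValueError("no hash-rate readings recorded")
    return max(averages, key=averages.get)


if __name__ == "__main__":
    # PLACEHOLDER numbers, invented purely to show the call.
    notes = {
        "42x7": [1.0, 1.2],
        "98x3": [1.1],
        "166x2": [0.9, 1.0],
        "84x4": [1.3, 1.4],
    }
    print("keep using:", best_config(notes))
```

Averaging several readings per configuration helps because the reported rate fluctuates from run to run, which may also be why autotune settles on a different configuration each start.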
|
|
|
|
gchil0
Newbie
Offline
Activity: 59
Merit: 0
|
|
April 16, 2013, 08:18:10 AM |
|
When compiling 04-14 in Linux (Ubuntu 12.04), I'm getting the following message not seen in 04-09:
/usr/local/cuda-5.0/bin/nvcc -g -O2 -arch=compute_10 --maxrregcount=124 --ptxas-options=-v -I./compat/jansson -o salsa_kernel.o -c salsa_kernel.cu
salsa_kernel.cu(269): error: no instance of overloaded function "tex2D" matches the argument list
argument types are: (texture<ulong2, 2, cudaReadModeElementType>, unsigned int, unsigned int)
Found this when playing with an Amazon EC2 GPU cluster instance on a pair of M2050's. Same error on known working Linux box.
04-09 gave about 200kH/s on the pair. That with the 12 hyperthreads on the CPU nets about 250kH/s. Still ~60x too expensive to be worth it, but fun to try anyway.
Cheers.
The newest code no longer supports a 64-bit compile. You need to compile a 32-bit binary.
|
|
|
|
Listener
Newbie
Offline
Activity: 17
Merit: 0
|
|
April 16, 2013, 08:33:20 AM |
|
Thank you for your work! I think you did a great job! What I miss is a variable to control the system/GPU load. The --interactive flag does not really work for me, I even experienced greater desktop lags with "interactive 1"...
|
|
|
|
|