djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
November 18, 2014, 02:43:35 PM |
|
i'm not so sure P0 is really faster on the 9xx... memory speed is a bit lowered in P2 but (max) core freq seems to be the same :
gpu temp on the 970 77°C How do you do that ? (my 980 never go beyond 73~74 at 120% tdp) never mind I see your 750ti at 71°C (hope it is only a setting and not its temp...) p0 isn't necessarily the fastest but it is the easiest to overclock... The main problem with p2, is you also need nvidia inspector to overclock it... I can overclock core clock, tdp with MSI AB, but to overclock mem clock, I need nvidia inspector, and 2 programs to overclock the same cards is a bit too much... Now I am not sure if it is possible to change it, from what I read it seems that any cuda application (not game obviously) run at p2 on design...
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
November 18, 2014, 03:11:00 PM |
|
i'm not so sure P0 is really faster on the 9xx... memory speed is a bit lowered in P2 but (max) core freq seems to be the same :
gpu temp on the 970 77°C How do you do that ? (my 980 never go beyond 73~74 at 120% tdp) never mind I see your 750ti at 71°C (hope it is only a setting and not its temp...) p0 isn't necessarily the fastest but it is the easiest to overclock... The main problem with p2, is you also need nvidia inspector to overclock it... I can overclock core clock, tdp with MSI AB, but to overclock mem clock, I need nvidia inspector, and 2 programs to overclock the same cards is a bit too much... Now I am not sure if it is possible to change it, from what I read it seems that any cuda application (not game obviously) run at p2 on design... Games run at p0, therefore, miner can run at p0 even if you need to disassemble the game and rip the code out! I don't think it is really needed, but I haven't seen anything which says what trigger p0... so it is a bit puzzling
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
sp_
Legendary
Offline
Activity: 2940
Merit: 1087
Team Black developer
|
|
November 18, 2014, 07:12:13 PM |
|
The API is open, It seems just to call the method NVAPI_INTERFACE NvAPI_GPU_GetPstates20 ( __in NvPhysicalGpuHandle hPhysicalGpu, __inout NV_GPU_PERF_PSTATES20_INFO * pPstatesInfo http://docs.nvidia.com/gameworks/content/gameworkslibrary/coresdk/nvapi/group__gpupstate.html#gaeffe0838ca9850b9984fa9be117f637eDetailed Description The GPU performance state APIs are used to get and set various performance levels on a per-GPU basis. P-States are GPU active/executing performance capability and power consumption states. P-States range from P0 to P15, with P0 being the highest performance/power state, and P15 being the lowest performance/power state. Each P-State maps to a performance level. Not all P-States are available on a given system. The definition of each P-States are currently as follows: P0/P1 - Maximum 3D performance P2/P3 - Balanced 3D performance-power P8 - Basic HD video playback P10 - DVD playback P12 - Minimum idle power consumption .. Here is the method to call: NVAPI_INTERFACE NvAPI_GPU_GetPstates20 ( __in NvPhysicalGpuHandle hPhysicalGpu, __inout NV_GPU_PERF_PSTATES20_INFO * pPstatesInfo ) DESCRIPTION: This API retrieves all performance states (P-States) 2.0 information. P-States are GPU active/executing performance capability states. They range from P0 to P15, with P0 being the highest performance state, and P15 being the lowest performance state. Each P-State, if available, maps to a performance level. Not all P-States are available on a given system. The definition of each P-States are currently as follow: P0/P1 - Maximum 3D performance P2/P3 - Balanced 3D performance-power P8 - Basic HD video playback P10 - DVD playback P12 - Minimum idle power consumption TCC_SUPPORTED Since: Release: 295 SUPPORTED OS: Windows XP and higher Parameters: [in] hPhysicalGPU GPU selection [out] pPstatesInfo P-States information retrieved, as documented in declaration above Returns: This API can return any of the error codes enumerated in NvAPI_Status. If there are return error codes with specific meaning for this API, they are listed below.
|
|
|
|
Epsylon3 (OP)
Legendary
Offline
Activity: 1484
Merit: 1082
ccminer/cpuminer developer
|
|
November 19, 2014, 11:02:35 AM |
|
i'm not so sure P0 is really faster on the 9xx... memory speed is a bit lowered in P2 but (max) core freq seems to be the same :
gpu temp on the 970 77°C How do you do that ? (my 980 never go beyond 73~74 at 120% tdp) never mind I see your 750ti at 71°C (hope it is only a setting and not its temp...) p0 isn't necessarily the fastest but it is the easiest to overclock... The main problem with p2, is you also need nvidia inspector to overclock it... I can overclock core clock, tdp with MSI AB, but to overclock mem clock, I need nvidia inspector, and 2 programs to overclock the same cards is a bit too much... Now I am not sure if it is possible to change it, from what I read it seems that any cuda application (not game obviously) run at p2 on design... Simple: 2 cards in the same tower running ccminer
|
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
November 19, 2014, 12:11:37 PM |
|
i'm not so sure P0 is really faster on the 9xx... memory speed is a bit lowered in P2 but (max) core freq seems to be the same :
gpu temp on the 970 77°C How do you do that ? (my 980 never go beyond 73~74 at 120% tdp) never mind I see your 750ti at 71°C (hope it is only a setting and not its temp...) p0 isn't necessarily the fastest but it is the easiest to overclock... The main problem with p2, is you also need nvidia inspector to overclock it... I can overclock core clock, tdp with MSI AB, but to overclock mem clock, I need nvidia inspector, and 2 programs to overclock the same cards is a bit too much... Now I am not sure if it is possible to change it, from what I read it seems that any cuda application (not game obviously) run at p2 on design... Simple: 2 cards in the same tower running ccminer still this is hot, I have 2 gtx980 in my tower (750 and 780 outside) and it doesn't go that high... but fans run higher too (~90% for the 980), but for the 750ti this is really anormal, I never saw that card going higher than 60°C (outside or inside the tower). You should add some autofan to ccminer (or use msi ab)
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
tbearhere
Legendary
Offline
Activity: 3178
Merit: 1003
|
|
November 19, 2014, 12:52:13 PM Last edit: November 19, 2014, 02:46:05 PM by tbearhere |
|
anyone notice much difference in a 970 and a 970 oc hash rates at stockclock offsets ? thanks
|
|
|
|
Dotcommie
|
|
November 19, 2014, 04:44:06 PM |
|
i'm not so sure P0 is really faster on the 9xx... memory speed is a bit lowered in P2 but (max) core freq seems to be the same :
gpu temp on the 970 77°C How do you do that ? (my 980 never go beyond 73~74 at 120% tdp) never mind I see your 750ti at 71°C (hope it is only a setting and not its temp...) p0 isn't necessarily the fastest but it is the easiest to overclock... The main problem with p2, is you also need nvidia inspector to overclock it... I can overclock core clock, tdp with MSI AB, but to overclock mem clock, I need nvidia inspector, and 2 programs to overclock the same cards is a bit too much... Now I am not sure if it is possible to change it, from what I read it seems that any cuda application (not game obviously) run at p2 on design... Simple: 2 cards in the same tower running ccminer still this is hot, I have 2 gtx980 in my tower (750 and 780 outside) and it doesn't go that high... but fans run higher too (~90% for the 980), but for the 750ti this is really anormal, I never saw that card going higher than 60°C (outside or inside the tower). You should add some autofan to ccminer (or use msi ab) My 970 Gigabyte G1's are never colder than 71 while mining. That's not normal? I have plenty of fans, custom water cooling on CPU, and fans are set to 75-100% most of the time. I used to run with the side panel off so it would stay around 75C, but it only goes up to 78C with it on so I just set the temp limit in Precision X to 77C(95% power) and stick with that. Even with a large fan blowing directly at them it stays around 70. Pretty sure those temps are completely normal.
|
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
November 19, 2014, 04:55:42 PM |
|
i'm not so sure P0 is really faster on the 9xx... memory speed is a bit lowered in P2 but (max) core freq seems to be the same :
gpu temp on the 970 77°C How do you do that ? (my 980 never go beyond 73~74 at 120% tdp) never mind I see your 750ti at 71°C (hope it is only a setting and not its temp...) p0 isn't necessarily the fastest but it is the easiest to overclock... The main problem with p2, is you also need nvidia inspector to overclock it... I can overclock core clock, tdp with MSI AB, but to overclock mem clock, I need nvidia inspector, and 2 programs to overclock the same cards is a bit too much... Now I am not sure if it is possible to change it, from what I read it seems that any cuda application (not game obviously) run at p2 on design... Simple: 2 cards in the same tower running ccminer still this is hot, I have 2 gtx980 in my tower (750 and 780 outside) and it doesn't go that high... but fans run higher too (~90% for the 980), but for the 750ti this is really anormal, I never saw that card going higher than 60°C (outside or inside the tower). You should add some autofan to ccminer (or use msi ab) My 970 Gigabyte G1's are never colder than 71 while mining. That's not normal? I have plenty of fans, custom water cooling on CPU, and fans are set to 75-100% most of the time. I used to run with the side panel off so it would stay around 75C, but it only goes up to 78C with it on so I just set the temp limit in Precision X to 77C(95% power) and stick with that. Even with a large fan blowing directly at them it stays around 70. Pretty sure those temps are completely normal. custom fan or stock (nvidia) fan ? I have the stock fan which I prefer actually... my 290x was a msi twin froz and the 780ti which was on top was totally unmanageable because the 290x was blowing hot air on it... I use also the speed fan curve with msi (temp limit is 79°C but it never went that far...) No matter what this is within nvidia standard temp, but this is still surprising...
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
Dotcommie
|
|
November 19, 2014, 05:47:58 PM |
|
custom fan or stock (nvidia) fan ? I have the stock fan which I prefer actually... my 290x was a msi twin froz and the 780ti which was on top was totally unmanageable because the 290x was blowing hot air on it...
I use also the speed fan curve with msi (temp limit is 79°C but it never went that far...)
No matter what this is within nvidia standard temp, but this is still surprising...
G1's have the custom Windforce setup which should be plenty of cooling. I'm sure it would go down to maybe 65 degrees if they weren't inside the case and set up for using SLI sometimes, but I can't imagine they'd ever be cooler than that. All of my 750s and 750ti's run 50-61 most of the time so it seems normal to me that the 970 would be about 10-20C more.
|
|
|
|
scryptr
Legendary
Offline
Activity: 1797
Merit: 1028
|
|
November 22, 2014, 04:28:39 PM |
|
tpruvot's ccminer-
Since adding extranonce compatibility and other recent improvements to ccminer, i have seen a much lower error rate when mining x11/x13. --scryptr
|
|
|
|
Bobbs
Newbie
Offline
Activity: 36
Merit: 0
|
|
November 22, 2014, 04:52:47 PM |
|
Is there and AMD version of this? I skimmed through the other post and didn't see an answer.
|
|
|
|
Epsylon3 (OP)
Legendary
Offline
Activity: 1484
Merit: 1082
ccminer/cpuminer developer
|
|
November 22, 2014, 05:12:19 PM |
|
Is there and AMD version of this? I skimmed through the other post and didn't see an answer. no.... cuda is a nvidia compute language tpruvot's ccminer-
Since adding extranonce compatibility and other recent improvements to ccminer, i have seen a much lower error rate when mining x11/x13. --scryptr
Tx I try to reduce the problems i see... Not perfect for the moment, 1.5 is not ready, but usable
|
|
|
|
skunk
|
|
November 23, 2014, 02:53:28 AM Last edit: November 23, 2014, 03:13:14 AM by skunk |
|
thank you Epsylon3 for this miner, i'm testing 1.5-git and found 2 issues: *** ccminer 1.5-git for nVidia GPUs by tpruvot@github *** Built with the nVidia CUDA SDK 6.5
Based on pooler cpuminer 2.3.2 CUDA support by Christian Buchner and Christian H. Include some of djm34 additions and sp optimisations
BTC donation address: 1AJdfCpLWPNoAMDfHF1wD5y8VgKSSTHxPo
[2014-11-23 03:34:06] Starting Stratum on stratum+tcp://yaamp.com:3633 [2014-11-23 03:34:06] NVML GPU monitoring enabled. [2014-11-23 03:34:06] 1 miner threads started, using 'x13' algorithm. [2014-11-23 03:34:06] Stratum extranonce answer id is not correct! [2014-11-23 03:34:06] stratum time is at least 34s in the future [2014-11-23 03:34:06] yaamp.com:3633 x13 block 8885 [2014-11-23 03:34:06] accepted: 1/1 (100.00%), 0.00 khash/s yay!!! [2014-11-23 03:34:16] stratum time is at least 87s in the future [2014-11-23 03:34:16] yaamp.com:3633 x13 block 8885 [2014-11-23 03:34:16] GPU #0: GeForce GTX 660, 1291 kH/s [2014-11-23 03:35:16] GPU #0: GeForce GTX 660, 1287 kH/s [2014-11-23 03:35:19] stratum time is at least 88s in the future [2014-11-23 03:35:19] yaamp.com:3633 x13 block 8885 [2014-11-23 03:35:19] GPU #0: GeForce GTX 660, 1290 kH/s [2014-11-23 03:36:20] GPU #0: GeForce GTX 660, 1285 kH/s [2014-11-23 03:36:24] yaamp.com:3633 x13 block 8885 [2014-11-23 03:36:24] GPU #0: GeForce GTX 660, 1290 kH/s [2014-11-23 03:36:31] yaamp.com:3633 x13 block 78884 [2014-11-23 03:36:31] GPU #0: GeForce GTX 660, 1290 kH/s
while mining.set_extranonce feature is working fine with nicehash and has no side effects with trademybit.com, it breaks mining on yaamp.com (after the first share no more yays). second, the hash rate rate has decreased ~200khs on all x## algos compared with version 1.4.9 on sm_30 (gtx660), do you think it's caused by changes made in the calculation of the average hash rate or by recent sm_50 optimizations?
|
|
|
|
Epsylon3 (OP)
Legendary
Offline
Activity: 1484
Merit: 1082
ccminer/cpuminer developer
|
|
November 23, 2014, 07:32:43 AM Last edit: November 23, 2014, 08:39:31 AM by Epsylon3 |
|
i will check yamp... And yes there was a small decrease but not related to optimisations, but to a change in scan ranges. v1.5 is not finished, under hard work do you know the commit hash of your build ? (git show) edit: for sm 3.0 yes i know, i will make a special change for that (pushed the sp commit by mistake) edit2: yamp fixed, tx for the bug report
|
|
|
|
skunk
|
|
November 23, 2014, 11:02:06 AM |
|
do you know the commit hash of your build ? (git show)
3be08ad9445ddc7e594c916e5cc85bef2b58af66
|
|
|
|
Epsylon3 (OP)
Legendary
Offline
Activity: 1484
Merit: 1082
ccminer/cpuminer developer
|
|
November 23, 2014, 11:10:51 AM |
|
yes ok... just after Simd patch which reduce a bit the speed on SM 3.0
|
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
November 23, 2014, 03:05:55 PM |
|
was looking into blake256, what exactly do crcsum ? Looks to me as some useless calculation...
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
Epsylon3 (OP)
Legendary
Offline
Activity: 1484
Merit: 1082
ccminer/cpuminer developer
|
|
November 23, 2014, 05:29:04 PM |
|
its no more used, was to check when pdata change, but in fact never in one "scan" (now i compute the first 64 bytes bloc on the cpu)
I think there is a problem with blake... Shares are accepted but i never find a real block... Already seen this issue ?
|
|
|
|
djm34
Legendary
Offline
Activity: 1400
Merit: 1050
|
|
November 23, 2014, 05:32:27 PM Last edit: November 23, 2014, 05:50:22 PM by djm34 |
|
its no more used, was to check when pdata change, but in fact never in one "scan" (now i compute the first 64 bytes bloc on the cpu)
I think there is a problem with blake... Shares are accepted but i never find a real block... Already seen this issue ?
pdata changes only when you request a new block regarding share/block issue, I don't know... A block is the same as a share except that the difficulty is higher (meaning more digits should match...) So assuming the algo is tuned to find shares, you should find also blocks. (may-be you should try on testnet and see if it happens) edit: but yeah there is this strange thing when you swap h[6] and not h[7] may-be it explains a little the problem.
|
djm34 facebook pageBTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
|
|
|
Epsylon3 (OP)
Legendary
Offline
Activity: 1484
Merit: 1082
ccminer/cpuminer developer
|
|
November 23, 2014, 06:16:07 PM |
|
Yep, could be related... i asked for a testnet but never had an answer... Will try to double check this for the final 1.5.0...
Any news from you ? We dont see you on irc
edit, for h7, blake is so fast than... difficulty is in h6 and h7 set to 0
|
|
|
|
|