d3m0n1q_733rz
|
|
February 10, 2012, 01:58:57 PM |
|
*Cough*Diapolo VECTORS8 GOFFSET=false*cough* Play around with the worksize.
|
Funroll_Loops, the theoretically quicker breakfast cereal! Check out http://www.facebook.com/JupiterICT for all of your computing needs. If you need it, we can get it. We have solutions for your computing conundrums. BTC accepted! 12HWUSguWXRCQKfkPeJygVR1ex5wbg3hAq
|
|
|
DiabloD3
Legendary
Offline
Activity: 1162
Merit: 1000
DiabloMiner author
|
|
February 10, 2012, 02:14:01 PM |
|
Will the CPU bug be gone with reversing to 2.1? How do I do that when I'm on 2.6 already? Heard all over the board going back isn't easy AT ALL.
Thx!
Actually Quite easy.. Let me know if you want help with it. Except his problem isn't the SDK. There are two known CPU bugs, the first is SDK side and exists in 2.2 and 2.3 only. 2.1, 2.4, 2.5, and 2.6 do not exhibit it, and 2.2 and 2.3 exhibit it on any driver version; the other bug is in driver and exists in 11.7 through 11.11* and exhibit it with any SDK including both 2.1 and 2.6. * Depends on the user, some had it fixed in 11.9 and 11.10. There are no known instances of this bug in 12.1 and up.
|
|
|
|
deepceleron (OP)
Legendary
Offline
Activity: 1512
Merit: 1036
|
|
February 10, 2012, 07:06:04 PM |
|
I finally got a 5870 to mess around with, and, yeah, the "best" memory setting on that is diff than the 5830.
At 1000/395, it gets 440mhash, at 1000/310, 448mhash, etc.
1/3rd core speed is considered the rule. So, in your case, 1000/333. Not a good rule. I did a SI shit ton of benchmarking every 10MHz (and 5MHz around the peak) on SDK 2.5 (worksize 256). On a 5770 at 950MHz, the performance peak was at 295MHz RAM; at 800MHz it was around 260MHz. On a 5830 at 1050MHz peak performance was obtained at 375MHz-390MHz RAM; at 800MHz, it was still around 360MHz. There is probably a better formula that involves number of stream processors and memory bus width vs worksize and vector size, but benchmarking your particular configuration is ideal. Of course this topic is about SDK 2.6, where any memory clock other than ~1000Mhz will hurt your hashrate.
|
|
|
|
Joshwaa
|
|
February 10, 2012, 07:23:51 PM |
|
I finally got a 5870 to mess around with, and, yeah, the "best" memory setting on that is diff than the 5830.
At 1000/395, it gets 440mhash, at 1000/310, 448mhash, etc.
1/3rd core speed is considered the rule. So, in your case, 1000/333. Not a good rule. I did a SI shit ton of benchmarking every 10MHz (and 5MHz around the peak) on SDK 2.5 (worksize 256). On a 5770 at 950MHz, the performance peak was at 295MHz RAM; at 800MHz it was around 260MHz. On a 5830 at 1050MHz peak performance was obtained at 375MHz-390MHz RAM; at 800MHz, it was still around 360MHz. There is probably a better formula that involves number of stream processors and memory bus width vs worksize and vector size, but benchmarking your particular configuration is ideal. Of course this topic is about SDK 2.6, where any memory clock other than ~1000Mhz will hurt your hashrate. Except for on a 7970. It helped!
|
|
|
|
DiabloD3
Legendary
Offline
Activity: 1162
Merit: 1000
DiabloMiner author
|
|
February 10, 2012, 10:31:33 PM |
|
I finally got a 5870 to mess around with, and, yeah, the "best" memory setting on that is diff than the 5830.
At 1000/395, it gets 440mhash, at 1000/310, 448mhash, etc.
1/3rd core speed is considered the rule. So, in your case, 1000/333. Not a good rule. I did a SI shit ton of benchmarking every 10MHz (and 5MHz around the peak) on SDK 2.5 (worksize 256). On a 5770 at 950MHz, the performance peak was at 295MHz RAM; at 800MHz it was around 260MHz. On a 5830 at 1050MHz peak performance was obtained at 375MHz-390MHz RAM; at 800MHz, it was still around 360MHz. There is probably a better formula that involves number of stream processors and memory bus width vs worksize and vector size, but benchmarking your particular configuration is ideal. Of course this topic is about SDK 2.6, where any memory clock other than ~1000Mhz will hurt your hashrate. Memory speed is largely voodoo magic. Every "best" speed I've seen that doesn't fit the 1/3rd rule (such as on 1200mhz mem) 57xx/58xx) seems to still be between 1/3rd /1200*1000 (1000mhz core == 278mhz) and 1/3rd /1000*1200 (== 400 mhz), your lower and higher settings seem to be within that margin.
|
|
|
|
DrG
Legendary
Offline
Activity: 2086
Merit: 1035
|
|
February 10, 2012, 11:08:49 PM |
|
Is there a good app that let's me see the SDK version installed? I installed 11.12 drivers on a new build (using hardware that I know hashes higher - ie my 6870s are only getting 270MH even with DiabloMiner) and then uninstalled everything and reinstalled 11.7 drivers but I don't think it cleaned out the 2.6 SDK properly.
|
|
|
|
DiabloD3
Legendary
Offline
Activity: 1162
Merit: 1000
DiabloMiner author
|
|
February 10, 2012, 11:11:33 PM |
|
Is there a good app that let's me see the SDK version installed? I installed 11.12 drivers on a new build (using hardware that I know hashes higher - ie my 6870s are only getting 270MH even with DiabloMiner) and then uninstalled everything and reinstalled 11.7 drivers but I don't think it cleaned out the 2.6 SDK properly.
DM says what your SDK is on startup. If it says AMD-APP (851.4), you're on SDK 2.6.
|
|
|
|
d3m0n1q_733rz
|
|
February 11, 2012, 01:24:00 AM |
|
I find it funny how everyone sort of ignored me there. If you're going to test the kernels, test them to the fullest extent of their capabilities. I would like to see how they measure up completely. Which means include VECTORS8 with the GOFFSET=false
|
Funroll_Loops, the theoretically quicker breakfast cereal! Check out http://www.facebook.com/JupiterICT for all of your computing needs. If you need it, we can get it. We have solutions for your computing conundrums. BTC accepted! 12HWUSguWXRCQKfkPeJygVR1ex5wbg3hAq
|
|
|
zvs
Legendary
Offline
Activity: 1680
Merit: 1000
https://web.archive.org/web/*/nogleg.com
|
|
February 14, 2012, 08:44:11 AM |
|
I find it funny how everyone sort of ignored me there. If you're going to test the kernels, test them to the fullest extent of their capabilities. I would like to see how they measure up completely. Which means include VECTORS8 with the GOFFSET=false
Slower, on all worksizes (ed: well, cpu was faster)
|
|
|
|
d3m0n1q_733rz
|
|
February 14, 2012, 01:28:23 PM |
|
I find it funny how everyone sort of ignored me there. If you're going to test the kernels, test them to the fullest extent of their capabilities. I would like to see how they measure up completely. Which means include VECTORS8 with the GOFFSET=false
Slower, on all worksizes (ed: well, cpu was faster) Hmm, I do know that VLIW is faster with VECTORS8. I would have thought that GCN could handle it. I wonder why it decreased like that. Hmm...
|
Funroll_Loops, the theoretically quicker breakfast cereal! Check out http://www.facebook.com/JupiterICT for all of your computing needs. If you need it, we can get it. We have solutions for your computing conundrums. BTC accepted! 12HWUSguWXRCQKfkPeJygVR1ex5wbg3hAq
|
|
|
stoppots
|
|
March 02, 2012, 05:43:52 PM |
|
Is there a good app that let's me see the SDK version installed? I installed 11.12 drivers on a new build (using hardware that I know hashes higher - ie my 6870s are only getting 270MH even with DiabloMiner) and then uninstalled everything and reinstalled 11.7 drivers but I don't think it cleaned out the 2.6 SDK properly.
GPU Caps Viewer does a good job of providing plenty of information on wat ATI Stream/APP SDK's are installed on your system and allows you to sellect each GPU/CPU individually and displays a read out of wat the openCL/GL software is doing. Seen it also allow you to select between either ATI Stream or APP SDK within the app when I had both installed at one time. Have not yet see it offered for linux systems but I suspect something similiar will soon surface. Here is a link to the latest version. http://www.geeks3d.com/20120202/gpu-caps-viewer-1-15-0-opengl-opencl-cuda-graphics-card-utility/
|
|
|
|
|