I usually gets lower output when i mess with threads.
I am getting ~720 on linux and ~660 on windows 7 64
The engine memory part is what i found worked best for me. I can tune it a little higher, but i got some unstability and noise problems
Make sure you use the latest cgminer, 2.11.4 it should be.