-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 27, 2011, 10:20:00 AM |
|
With 4 vectors, this change actually slows down the hash rate. With 2 vectors it speeds it up, but then I get runs of rejected shares. Not sure why but this is consistent now so I'm reluctant to include it at this stage.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
iopq
|
|
June 27, 2011, 10:22:05 AM |
|
With 4 vectors, this change actually slows down the hash rate. With 2 vectors it speeds it up, but then I get runs of rejected shares. Not sure why but this is consistent now so I'm reluctant to include it at this stage.
are you sure?
|
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 27, 2011, 10:26:00 AM |
|
With 4 vectors, this change actually slows down the hash rate. With 2 vectors it speeds it up, but then I get runs of rejected shares. Not sure why but this is consistent now so I'm reluctant to include it at this stage.
are you sure? I can keep trying it on and off to see, but every time so far it has happened. It could well be my pool as they're experiencing technical difficulties, but it's always been the same time I enable it that I get the rejects. 2011-06-27 20:22:46] [173.08 | 191.67 Mhash/s] [81 Accepted] [40 Rejected] Look at that reject rate. Normally it's <5%
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
iopq
|
|
June 27, 2011, 10:27:13 AM |
|
With 4 vectors, this change actually slows down the hash rate. With 2 vectors it speeds it up, but then I get runs of rejected shares. Not sure why but this is consistent now so I'm reluctant to include it at this stage.
are you sure? I can keep trying it on and off to see, but every time so far it has happened. It could well be my pool as they're experiencing technical difficulties, but it's always been the same time I enable it that I get the rejects. 2011-06-27 20:22:46] [173.08 | 191.67 Mhash/s] [81 Accepted] [40 Rejected] Look at that reject rate. Normally it's <5% I'm running GUIMiner with this change and I see no difference other than slight speed increase
|
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 27, 2011, 10:30:44 AM |
|
I don't doubt it, and no one else is reporting this issue. The other machine I've tried it on it does give a speed up (with minerd) but this one 6770 I'm using it on reliably spits out tons of rejects when I make this change. It's not a heating issue, the card is at 64 degrees.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 27, 2011, 10:42:55 AM |
|
Maybe it's just my pool. They're having a funky time so that would explain it.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
Naven
Newbie
Offline
Activity: 22
Merit: 0
|
|
June 27, 2011, 10:48:58 AM |
|
@ckolivas, could u share daily builds of this minner?
|
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 27, 2011, 10:50:48 AM |
|
@ckolivas, could u share daily builds of this minner?
linux only at this stage, sure I could do that.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 27, 2011, 02:37:09 PM |
|
Update tree:
I did incorporate that change into my kernel. It turns out that even though my hardware reports 4 as the preferred vector width, it's faster with 2. I assume many people have experienced the same. So I've made the default to be 2 when the hardware says its preferred vector width is anything larger than 1.
I found a little buglet that also would repeat some blocks, thereby artificially raising the hash rate, so the overall rate has dropped slightly (about the same amount it's increased with the other code!).
As for the daily builds, I assume the requester meant windows builds? Most people who have linux will likely be able to build it. It's not building on windows yet, but will in the near future I hope. If you really do want linux binaries, just say the word.
The problem with repeated blocks was my pool not sending me out longpoll information reliably.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
iopq
|
|
June 27, 2011, 03:42:57 PM |
|
Update tree:
I did incorporate that change into my kernel. It turns out that even though my hardware reports 4 as the preferred vector width, it's faster with 2.
yeah, same thing in poclbm, window size 128, vectors 2 is the fastest setting for me
|
|
|
|
burp
Member
Offline
Activity: 98
Merit: 10
|
|
June 27, 2011, 05:37:05 PM |
|
Current status for my dual 5830 setup: - one poclbm with phatk kernel for each card: 2*308MH/s = 616MH/s - minerd with 2 threads for each card, gives me 605MH/s so there is still some room for improvements
|
|
|
|
jgarzik (OP)
Legendary
Offline
Activity: 1596
Merit: 1100
|
|
June 27, 2011, 06:04:28 PM |
|
- one poclbm with phatk kernel for each card: 2*308MH/s = 616MH/s - minerd with 2 threads for each card, gives me 605MH/s
Just for knowledge... what performance do you get with 1 thread per card?
|
Jeff Garzik, Bloq CEO, former bitcoin core dev team; opinions are my own. Visit bloq.com / metronome.io Donations / tip jar: 1BrufViLKnSWtuWGkryPsKsxonV2NQ7Tcj
|
|
|
burp
Member
Offline
Activity: 98
Merit: 10
|
|
June 27, 2011, 06:11:45 PM Last edit: June 27, 2011, 06:24:10 PM by burp |
|
- one poclbm with phatk kernel for each card: 2*308MH/s = 616MH/s - minerd with 2 threads for each card, gives me 605MH/s
Just for knowledge... what performance do you get with 1 thread per card? About 586MH/s, means 293MH/s per card. EDIT: Considering minerd uses poclbm kernel (which is slower for me than phatk), minerd might be already on par (with twice the number of threads).
|
|
|
|
figvam
Newbie
Offline
Activity: 42
Merit: 0
|
|
June 28, 2011, 08:01:20 AM |
|
It appears it's not possible to use minerd as a pure CPU miner anymore - setting GPU threads to zero doesn't work.
|
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 28, 2011, 11:38:22 AM |
|
Updated tree:
I've imported the phatk kernel into minerd. The maximum possible throughput is slightly faster on machines that support amd media ops which is nice. However, even nicer is that on sane intensity levels (including the default value of 4), the throughput is significantly faster now as well. The phatk kernel unfortunately doesn't even work on hardware that doesn't have amd media ops (radeon 4x cards and nvidia) so for now it defaults back to the poclbm kernel.
I've also updated the cpu mining component. Now it tries to keep its work sizes within the log update interval instead of the scan interval so that the hash rate doesn't fluctuate all over the place. It is also possible now to set number of gpu threads to 0 to run minerd as just a cpu miner again.
TODO: -I want to find ways of allowing even larger settings for intensity that would only be suitable for headless boxes. Currently the code ends up racing too much (with all the parallel processing) and generates far too many rejected blocks when the intensity is set to >10. Making the cl code synchronous would avoid that but it also slows it down, thereby making it pointless to push it further. -Store binary versions of the kernels that could be loaded faster when restarting the app. -Any bugfixing remaining. -Profit.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
gat3way
|
|
June 28, 2011, 11:57:59 AM |
|
Sorry for the rude OT question but was that you that maintained the -ck tree? Update tree:
I did incorporate that change into my kernel. It turns out that even though my hardware reports 4 as the preferred vector width, it's faster with 2. I assume many people have experienced the same. So I've made the default to be 2 when the hardware says its preferred vector width is anything larger than 1.
It's due to the high GPR usage, it is high enough to balance the poorer ALUPacking coming from uint2, not uint4 vectors. In fact I found out 3-component vectors to work best and they should be supported by opencl 1.1 standart, but the OpenCL compiler is buggy and generates bad code with uint3. Interlacing uint2 and uint works though
|
|
|
|
burp
Member
Offline
Activity: 98
Merit: 10
|
|
June 28, 2011, 05:40:28 PM Last edit: June 28, 2011, 08:00:07 PM by burp |
|
OK, it looks very good for me with 1 thread per gpu, intensity 10, and worksize 256. I get 619MH/s in total, means ~609MH/s per card. Rejection rate is at a normal level. For me it seems to be beneficial to increase intensity and worksize in favor of 2 gpu threads (which leads to more rejections for me).
EDIT: Rejection rate for now is higher than with poclbm (equal settings), minerd so far: 10/220 ~ 4.5%, poclbm: 41/2752 ~1.5% EDIT2: Better results with intensity 8, gives me "just" 617MH/s in total but no rejections for 100 accepted shares so far. Probably the perfect settings for me.
|
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 28, 2011, 08:49:38 PM |
|
Sorry for the rude OT question but was that you that maintained the -ck tree? Not rude at all. Yes it is me and I still do
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
gat3way
|
|
June 28, 2011, 09:46:58 PM |
|
I thought you quit kernel hacking. I've compiled some of your kernels a while ago on my desktop Had no idea you are into bitcoin stuff and OpenCL. Nice
|
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
June 28, 2011, 10:43:30 PM Last edit: June 28, 2011, 11:01:24 PM by ckolivas |
|
I thought you quit kernel hacking. I've compiled some of your kernels a while ago on my desktop Had no idea you are into bitcoin stuff and OpenCL. Nice Actually I'm very new to opencl and bitcoin. Just started a week ago, and had to learn all about opencl. I've put in over a hundred hours on this code already to get up to speed To see what I'm doing with linux kernel, check out http://ck-hack.blogspot.com
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
|