QuantumFoam
Full Member
Offline
Activity: 200
Merit: 100
|Quantum|World's First Cloud Management Platform
|
|
February 09, 2012, 07:22:59 AM |
|
It looked almost like a thread starvation problem to me, but I've never looked at the code so take that with a lot of salt
|
|Quantum|World's First Cloud Management Platform on the Blockchain
|
|
|
-ck (OP)
Legendary
Offline
Activity: 4242
Merit: 1644
Ruu \o/
|
|
February 09, 2012, 07:24:32 AM |
|
Haha no chance. It just would have been waiting on a network response presumably.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
QuantumFoam
Full Member
Offline
Activity: 200
Merit: 100
|Quantum|World's First Cloud Management Platform
|
|
February 09, 2012, 07:30:26 AM |
|
Just re-enabled the still dead pool, now stats are behaving normally. Maybe the problem is dependent on the type of network failure. When stats were frozen before, the "accepted/rejected" messages below were updating as normal. Stats were updating maybe once every 3 or 4 minutes.
|
|Quantum|World's First Cloud Management Platform on the Blockchain
|
|
|
Diapolo
|
|
February 09, 2012, 12:44:14 PM Last edit: February 09, 2012, 01:32:55 PM by Diapolo |
|
Wohoo, looks good so far ... I forked cgminer and set diakgcn as branch, added a remote for the diakgcn branch in your repo. I now can edit files and do commits . Con, if you are now doing commits to your diakgcn branch, can I merge them via "git fetch upstream" and "git merge upstream/diakgcn" afterwards? Can you have a look at https://github.com/Diapolo/cgminer/commits/diakgcn ... I now need to figure out how to create a pull request for the branch diakgcn. Thanks, Dia
|
|
|
|
bulanula
|
|
February 09, 2012, 01:28:25 PM |
|
I just encountered a weird problem with the latest version 2.2.3. One of my backup pools went dead and it seemed to be interfering with cgminer's ability to update the statistics on top (mhs, gpu temp and fan rpm). Basically, these stats were frozen and cgminer was only acting on what they last said (so fan rpms and such were not being adjusted properly, a potentially dangerous situation). When I disabled the offending pool, stats began updating again normally. Enabled the pool again and right back to frozen stats. A couple of times tonight this caused the fans to go to 100% due to overheating because the rpms were being kept too low due to the last stat update being too long ago.
Not sure what else I can do to help track this down, pool management is set to failover, but failover only flag is not enabled.
This on what OS ? Thanks !
|
|
|
|
-ck (OP)
Legendary
Offline
Activity: 4242
Merit: 1644
Ruu \o/
|
|
February 09, 2012, 01:57:43 PM |
|
Wohoo, looks good so far ... I forked cgminer and set diakgcn as branch, added a remote for the diakgcn branch in your repo. I now can edit files and do commits . Con, if you are now doing commits to your diakgcn branch, can I merge them via "git fetch upstream" and "git merge upstream/diakgcn" afterwards? Can you have a look at https://github.com/Diapolo/cgminer/commits/diakgcn ... I now need to figure out how to create a pull request for the branch diakgcn. Thanks, Dia I just tested it. Now instead of producing no shares at all, it is only producing hardware errors... Still needs work I expect. Likely something in the API is broken. Check the code in findnonce.c in precalc hash to see what variables are being used and then the code in device-gpu.c for what parameters are being passed to your kernel in queue_diakgcn_kernel in what order. It should make sense.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
Diapolo
|
|
February 09, 2012, 02:26:55 PM Last edit: February 09, 2012, 03:37:50 PM by Diapolo |
|
Wohoo, looks good so far ... I forked cgminer and set diakgcn as branch, added a remote for the diakgcn branch in your repo. I now can edit files and do commits . Con, if you are now doing commits to your diakgcn branch, can I merge them via "git fetch upstream" and "git merge upstream/diakgcn" afterwards? Can you have a look at https://github.com/Diapolo/cgminer/commits/diakgcn ... I now need to figure out how to create a pull request for the branch diakgcn. Thanks, Dia I just tested it. Now instead of producing no shares at all, it is only producing hardware errors... Still needs work I expect. Likely something in the API is broken. Check the code in findnonce.c in precalc hash to see what variables are being used and then the code in device-gpu.c for what parameters are being passed to your kernel in queue_diakgcn_kernel in what order. It should make sense. I checked how you precompute the kernel parameters yesterday, every parameter looked good. I will investigate further, did you use vectors or no vectors? Would be best to first get the non vectors code working ... Edit: I need some input, the values A to H in findnonce.c are "mixed" via R() into new values, so I guess A to H correspond to state2 in my Python code and ctx_a - ctx_h would be state0 in my Python code. If this is the case, I have to recheck all kernel arguments ... I'm a bit confused right now . self.state = np.array(unpack('IIIIIIII', nonceRange.unit.midstate), dtype=np.uint32) self.state2 = np.array(unpack('IIIIIIII', calculateMidstate(nonceRange.unit.data[64:80] + '\x00\x00\x00\x80' + '\x00' * 40 + '\x80\x02\x00\x00', nonceRange.unit.midstate, 3)), dtype=np.uint32) self.state2 = np.array(list(self.state2)[3:] + list(self.state2)[:3], dtype=np.uint32) Dia
|
|
|
|
QuantumFoam
Full Member
Offline
Activity: 200
Merit: 100
|Quantum|World's First Cloud Management Platform
|
|
February 09, 2012, 06:36:16 PM |
|
This on what OS ?
Thanks ! Xubuntu linux 11.04. I have not been able to recreate the problem again since I posted. The pool also has come back online now. When the problem was occurring it was easily recreated by disabling/enabling the pool. At first I thought it was a problem with my machine so I shut it off and made sure the cards were all seated properly, then turned it back on. When cgminer ran again at startup, the problem recurred. I was able to fix it by disabling the dead pool again.
|
|Quantum|World's First Cloud Management Platform on the Blockchain
|
|
|
Diapolo
|
|
February 09, 2012, 07:25:36 PM Last edit: February 09, 2012, 08:32:49 PM by Diapolo |
|
Hey Con, I looked again through every kernel argument and compared line by line with my Python code. I found 2 small differences and 2 brackets, that are not needed (see last commit https://github.com/Diapolo/cgminer/commit/68e36c657318fbe1e7714be470cf954a1d512333), but I guess they don't fix the persisting problem with false-positive nonces (perhaps you can give it a try - I have no compiler or IDE setup to test it by myself). The argument order is exactly as DiaKGCN awaits it, so that can't be the problem either. It could be a problem of your changes to the output code in the kernel, a problem with the base-nonces, who are passed to the kernel or something with the output-buffer in the CGMINER host code ... :-/. Where resides the output-buffer processing? As I said my kernel used ulong * natively, which I changed to uint * in one commit of my fork, I guess I need to look at it. Edit: OMFG, I introduced a bug with one of my former commits, which changed the type of the output buffer from uint * to int * ... fixed that one! It's time for another try Con . Dia
|
|
|
|
rcocchiararo
Newbie
Offline
Activity: 78
Merit: 0
|
|
February 09, 2012, 09:03:03 PM |
|
i had an ubuntu box that was ok running phoenix
but i like cgminer more, so i wanted to compile it (it has ubuntu x86)
I tried, but failed, because ncurses version was not correct.
I then upgraded from 10.04 to 10.11 (all through ssh)
After reinstalling the drivers (ssh -X), i was able to compile cgminer.
but i cant start mining, except with cpu only (on 2.1.2, 2.2.3 has no cpu support i think), it tells me that there is no valid gpu available.
I tried automatically starting cgminer with screen and upstart (like i was told arroung page 178 to 180).
It gives the same error.
i then tried with the same trick i used on debian, and nothing happens.
Im not sure if this PC with ubuntu has any trouble with not having a display connected.
|
|
|
|
jjiimm_64
Legendary
Offline
Activity: 1876
Merit: 1000
|
|
February 09, 2012, 09:11:25 PM |
|
i had an ubuntu box that was ok running phoenix
but i like cgminer more, so i wanted to compile it (it has ubuntu x86)
I tried, but failed, because ncurses version was not correct.
I then upgraded from 10.04 to 10.11 (all through ssh)
After reinstalling the drivers (ssh -X), i was able to compile cgminer.
but i cant start mining, except with cpu only (on 2.1.2, 2.2.3 has no cpu support i think), it tells me that there is no valid gpu available.
I tried automatically starting cgminer with screen and upstart (like i was told arroung page 178 to 180).
It gives the same error.
i then tried with the same trick i used on debian, and nothing happens.
Im not sure if this PC with ubuntu has any trouble with not having a display connected.
did you accept the 'sdk license?'
|
1jimbitm6hAKTjKX4qurCNQubbnk2YsFw
|
|
|
-ck (OP)
Legendary
Offline
Activity: 4242
Merit: 1644
Ruu \o/
|
|
February 09, 2012, 09:34:41 PM Last edit: February 09, 2012, 10:08:05 PM by ckolivas |
|
Hey Con, I looked again through every kernel argument and compared line by line with my Python code. I found 2 small differences and 2 brackets, that are not needed (see last commit https://github.com/Diapolo/cgminer/commit/68e36c657318fbe1e7714be470cf954a1d512333), but I guess they don't fix the persisting problem with false-positive nonces (perhaps you can give it a try - I have no compiler or IDE setup to test it by myself). The argument order is exactly as DiaKGCN awaits it, so that can't be the problem either. It could be a problem of your changes to the output code in the kernel, a problem with the base-nonces, who are passed to the kernel or something with the output-buffer in the CGMINER host code ... :-/. Where resides the output-buffer processing? As I said my kernel used ulong * natively, which I changed to uint * in one commit of my fork, I guess I need to look at it. Edit: OMFG, I introduced a bug with one of my former commits, which changed the type of the output buffer from uint * to int * ... fixed that one! It's time for another try Con . Dia Diapolo... I appreciate the effort you're putting in, and I realise you're new to this collaborative coding and source control management, but probably a good idea to see your code actually compiles before you ask someone to test it. Usually people compile and test their own code before asking someone else to test it for them. Anyway... I fixed the !(find) in my local copy and it still produces hardware errors. edit: It doesn't matter what vectors or worksize I try this with.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
rcocchiararo
Newbie
Offline
Activity: 78
Merit: 0
|
|
February 09, 2012, 10:42:18 PM |
|
i had an ubuntu box that was ok running phoenix
but i like cgminer more, so i wanted to compile it (it has ubuntu x86)
I tried, but failed, because ncurses version was not correct.
I then upgraded from 10.04 to 10.11 (all through ssh)
After reinstalling the drivers (ssh -X), i was able to compile cgminer.
but i cant start mining, except with cpu only (on 2.1.2, 2.2.3 has no cpu support i think), it tells me that there is no valid gpu available.
I tried automatically starting cgminer with screen and upstart (like i was told arroung page 178 to 180).
It gives the same error.
i then tried with the same trick i used on debian, and nothing happens.
Im not sure if this PC with ubuntu has any trouble with not having a display connected.
did you accept the 'sdk license?' when, where and how am i supposed to do that ? don't remember doing it on my debian machine. This ubuntu pc is at my parents house (i moved last week xD), and has no display attached right now. Important to mention tho, is that right now, i can't use phoenix either. And ati commands for OC and such, fail as the "normal user" telling me that i MUST start X if i want to run them without being "su", and if i run them with "sudo", they fail too. I guess that my only choice is to go back, plug a display, and see what happens
|
|
|
|
Endeavour79
|
|
February 09, 2012, 10:54:11 PM |
|
Hi @ll, first I want to say CGMINER is a fantastic app and I really appreciate all the work and time in it. I already donated One thing I notices since version 1.5 (when I started to use it).. I have a 4xHD5830 Windows x64 rig with clocks at 960@300 and had in the beginning constant hashrates around 315MHash or more. Over time, with new CGMINER versions, never kernels and for sure updated drivers/app sdk the performance is lower and lower. Currently with 12.1 driver and APP SDK 2.5 I only get around 280Mhash with poclbm and worksize 128 (I8) (tried many settings and different kernels and this works out best). I am wondering what may be the issue here? What settings do you currently use for best performance and what driver-app sdk combination. Any recommendations would be nice. (Remember Windows OS please.) Thanks!!
|
NSW, Australia - Rigs, Mining, Pools - Local help needed? Send me a message!
|
|
|
-ck (OP)
Legendary
Offline
Activity: 4242
Merit: 1644
Ruu \o/
|
|
February 09, 2012, 10:54:32 PM |
|
but i cant start mining, except with cpu only (on 2.1.2, 2.2.3 has no cpu support i think), it tells me that there is no valid gpu available.
What did ./configure show when you built it? Also, when running remotely: then start it.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
-ck (OP)
Legendary
Offline
Activity: 4242
Merit: 1644
Ruu \o/
|
|
February 09, 2012, 10:56:21 PM |
|
Over time, with new CGMINER versions, never kernels and for sure updated drivers/app sdk the performance is lower and lower.
Currently with 12.1 driver and APP SDK 2.5 I only get around 280Mhash with poclbm and worksize 128 (I8) (tried many settings and different kernels and this works out best).
See this is the thing. You're saying it's the newer kernels and the updated drivers and sdk.... but there have been no updated kernels. They are essentially unchanged for 7 months now. So look at the other things you've blamed instead.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
rcocchiararo
Newbie
Offline
Activity: 78
Merit: 0
|
|
February 09, 2012, 11:02:15 PM |
|
./configure said OpenCL and ADL sdks where available. (i have fought long enough with compiling cgminer xD) adrian@mine01:~$ export DISPLAY=:0 adrian@mine01:~$ cgminer [2012-02-09 19:57:58] Started cgminer 2.2.3 No protocol specified [2012-02-09 19:57:58] Error: Getting Device IDs (num) [2012-02-09 19:57:58] clDevicesNum returned error, no GPUs usable All devices disabled, cannot mine! adrian@mine01:~$
Also adrian@mine01:~/cgminer-2.2.3$ export DISPLAY=:0.0 adrian@mine01:~/cgminer-2.2.3$ cgminer [2012-02-09 20:01:01] Started cgminer 2.2.3 No protocol specified [2012-02-09 20:01:02] Error: Getting Device IDs (num) [2012-02-09 20:01:02] clDevicesNum returned error, no GPUs usable All devices disabled, cannot mine! adrian@mine01:~/cgminer-2.2.3$ aticonfig --pplib-cmd "set fanspeed 0 65" No protocol specified aticonfig: This program must be run as root when no X server is active adrian@mine01:~/cgminer-2.2.3$ sudo aticonfig --pplib-cmd "set fanspeed 0 65" [sudo] password for adrian: No protocol specified No protocol specified ati_pplib_cmd: Unable to open display `:0.1'. aticonfig: parsing the command-line failed.
i have a 5850 and a 5830 on this machine.
|
|
|
|
gnar1ta$
Donator
Hero Member
Offline
Activity: 798
Merit: 500
|
|
February 09, 2012, 11:04:23 PM |
|
Hello ck!
Apparently the problem with "OFF" decreased but not disappeared completely:
Same here. Ran 18 hrs and got 3 of 4 cards showing OFF with 2.2.3. Started 2.1.2 with the same flags last night, I'll check it in 8 hours. Are you using --auto-fan? Not using auto-fan seemed to solve it in 2.2.1, until my power crashed anyway. Can you check in the menu when the GPUs were "last initialised" ? Didn't' forget about this I ran 2.1.2 with no errors for 18 hours then started 2.2.3 yesterday with the same flags and got this: cgminer version 2.2.3 - Started: [2012-02-08 22:52:08]
GPU 2: 74.0C 1801RPM | OFF / 36.9Mh/s | A: 545 R: 2 HW:0 U: 0.48/m I: 8
GPU 2: 0.0 / 37.1 Mh/s | A:545 R:2 HW:0 U:0.48/m I:8 74.0 C F: 31% (1805 RPM) E: 157 MHz M: 200 Mhz V: 0.950V A: 0% P: 0% Last initialised: [2012-02-08 22:52:12] Intensity: 8 Thread 4: 0.0 Mh/s Disabled ALIVE Thread 5: 0.0 Mh/s Disabled ALIVE
Log entry: [2012-02-09 00:48:34] Device 2 idle for more than 60 seconds, GPU 2 declared SICK! [2012-02-09 00:48:34] Attempting to restart GPU [2012-02-09 00:48:34] Thread 4 still exists, killing it off [2012-02-09 00:48:34] Thread 5 still exists, killing it off [2012-02-09 00:48:35] Thread 4 restarted [2012-02-09 00:48:35] Thread 5 restarted [2012-02-09 00:48:36] Thread 4 being disabled [2012-02-09 00:48:36] Thread 5 being disabled Weird how las initialised is slightly after start time, but it wasn't disabled for a few hours. You might be on to something here.
|
Losing hundreds of Bitcoins with the best scammers in the business - BFL, Avalon, KNC, HashFast.
|
|
|
Endeavour79
|
|
February 09, 2012, 11:06:51 PM |
|
Over time, with new CGMINER versions, never kernels and for sure updated drivers/app sdk the performance is lower and lower.
Currently with 12.1 driver and APP SDK 2.5 I only get around 280Mhash with poclbm and worksize 128 (I8) (tried many settings and different kernels and this works out best).
See this is the thing. You're saying it's the newer kernels and the updated drivers and sdk.... but there have been no updated kernels. They are essentially unchanged for 7 months now. So look at the other things you've blamed instead. Thanks for the reply ckolivas and again thanks for the good work.. If I remember right, with version 1.5.1 or a bit later you've upgraded the kernels and again with version 2.2.1or3. I don't blame you for the performance decrease ckolivas! I just want to find out what other users do for best performance, what the best config is. Btw.. do you still support SDK 2.1, in your FAQ you mention 2.4/2.5 only.
|
NSW, Australia - Rigs, Mining, Pools - Local help needed? Send me a message!
|
|
|
SAC
|
|
February 09, 2012, 11:09:04 PM |
|
Didn't' forget about this I ran 2.1.2 with no errors for 18 hours then started 2.2.3 yesterday with the same flags and got this: Weird how las initialised is slightly after start time, but it wasn't disabled for a few hours. You might be on to something here. Both 2.2.2 and 2.2.3 will not even start on the machine I compile on they fail to initialize the GPUs never tried on the others I have.
|
|
|
|
|