Bogart
Legendary
Offline
Activity: 966
Merit: 1000
|
|
July 15, 2013, 07:53:24 PM |
|
It's not the pool connection. Something happens and all work ends up "discarded" until cgminer is restarted. What would cause that?
It's got nowhere to go. It prepares work and then no device is working to take it so it just discards it. That's why I have a backup pool and use failover mode. I even have eloipool running locally, set as the secondary backup, so even in the internet fails completely, it can still mine against that (until it becomes out-of-sync if the internet failure lasts long enough). I can watch MHS5s slowly drop to 0, and eventually cgminer-monitor will restart cgminer. Why is cgminer stopping hashing?
Common reasons: Wifi kernel problem Overdoing the overclocking Pool failure and the cgminer-monitor watchdog is trigger happy and kills cgminer when all it's doing is waiting for a pool to come back online. FPGA failure in the avalon.If the fans stop running entirely, does that indicate FPGA failure? My Batch 1 unit does this sometimes. I now have a script that queries the API, and if it has a zero hashrate for too long, it will call the connected Web Power Switch to cycle the power on the unit, which always brings it back up. (I do have to have it leave the power off for a full 30 seconds.)
|
"All safe deposit boxes in banks or financial institutions have been sealed... and may only be opened in the presence of an agent of the I.R.S." - President F.D. Roosevelt, 1933
|
|
|
mgio
|
|
July 15, 2013, 09:06:55 PM |
|
I get a REALLY large number of invalid shares with my Avalon.
I'm not sure whats going on. It's not consistent.
One day I'll be mining and I'll get 99+% valid shares and the next day or two it will only be 88% valid shares.
I've tried mining on Ozcoin and 50btc and it seems to be bad on both.
I don't think it is my network because my jalapenos only get <1% invalid shares.
I'm using the dynamically adjusting frequency and it usually settle at about 352 MHz or so and 82 GH/s on average. Of course, according to the pool, I'm only getting 70 GH/s or so due to all the invalid shares.
I have an AC feeding directly into the intake on the Avalon and no temperature gets above 48 or 49 or so (very rarely 50).
What could be wrong?
|
|
|
|
Aseras
|
|
July 15, 2013, 09:11:32 PM |
|
I get a REALLY large number of invalid shares with my Avalon.
I'm not sure whats going on. It's not consistent.
One day I'll be mining and I'll get 99+% valid shares and the next day or two it will only be 88% valid shares.
I've tried mining on Ozcoin and 50btc and it seems to be bad on both.
I don't think it is my network because my jalapenos only get <1% invalid shares.
I'm using the dynamically adjusting frequency and it usually settle at about 352 MHz or so and 82 GH/s on average. Of course, according to the pool, I'm only getting 70 GH/s or so due to all the invalid shares.
I have an AC feeding directly into the intake on the Avalon and no temperature gets above 48 or 49 or so (very rarely 50).
What could be wrong?
This is normal on stratum. Every time the block changes, you get invalids. Some days burn more blocks than others.
|
|
|
|
mgio
|
|
July 15, 2013, 09:33:55 PM |
|
I get a REALLY large number of invalid shares with my Avalon.
I'm not sure whats going on. It's not consistent.
One day I'll be mining and I'll get 99+% valid shares and the next day or two it will only be 88% valid shares.
I've tried mining on Ozcoin and 50btc and it seems to be bad on both.
I don't think it is my network because my jalapenos only get <1% invalid shares.
I'm using the dynamically adjusting frequency and it usually settle at about 352 MHz or so and 82 GH/s on average. Of course, according to the pool, I'm only getting 70 GH/s or so due to all the invalid shares.
I have an AC feeding directly into the intake on the Avalon and no temperature gets above 48 or 49 or so (very rarely 50).
What could be wrong?
This is normal on stratum. Every time the block changes, you get invalids. Some days burn more blocks than others. It seems like I have way more than other Avalon users, at least from what I've seen on Ozcoin. Most of them have only a couple percent invalid shares at most, while I will have 12 or 13% invalid shares. Right now I'm standing at about 94% efficiency on average, that's with 400,000 invalid shares in the less than a week I've been mining there.
|
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
July 16, 2013, 04:06:08 AM Last edit: July 16, 2013, 06:46:27 AM by ckolivas |
|
I get a REALLY large number of invalid shares with my Avalon.
I'm not sure whats going on. It's not consistent.
One day I'll be mining and I'll get 99+% valid shares and the next day or two it will only be 88% valid shares.
I've tried mining on Ozcoin and 50btc and it seems to be bad on both.
I don't think it is my network because my jalapenos only get <1% invalid shares.
I'm using the dynamically adjusting frequency and it usually settle at about 352 MHz or so and 82 GH/s on average. Of course, according to the pool, I'm only getting 70 GH/s or so due to all the invalid shares.
I have an AC feeding directly into the intake on the Avalon and no temperature gets above 48 or 49 or so (very rarely 50).
What could be wrong?
This is normal on stratum. Every time the block changes, you get invalids. Some days burn more blocks than others. It seems like I have way more than other Avalon users, at least from what I've seen on Ozcoin. Most of them have only a couple percent invalid shares at most, while I will have 12 or 13% invalid shares. Right now I'm standing at about 94% efficiency on average, that's with 400,000 invalid shares in the less than a week I've been mining there. That is definitely far more than you should be getting. There is a good chance you're submitting heaps of duplicates which may also be a different form of instability that the auto mode can't check for. Try setting a lower maximum speed if you're using auto mode because clearly you're not doing 82GH of useful work.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
bitdude
|
|
July 16, 2013, 06:23:41 AM |
|
today I have experienced a "bigger problem" for the second time since I have it
the machine was unresponsive on web interface, so I thought it has crashed completely and would need a physical restart, which is very bad for me
however, I tried to connect to SSH and somehow it worked, but extremely slowly, took my a couple of minutes but finally I was able to type reboot command and reboot the machine, which it did and everything is OK again
so, if someone has this totally unresponsive machine, try SSH and be very very patient, it may come up
funny thing is that it did not stop hashing until I reboot it - i.e. even in this strange unresponsive state, it was doing its job, just web interface could not be reached and SSH extremely hardly
since the operation with the whole machine was impossibly slow I could not analyze any logs before reboot, so I have no idea what happened
|
|
|
|
loshia
Legendary
Offline
Activity: 1610
Merit: 1000
|
|
July 16, 2013, 07:54:08 AM |
|
today I have experienced a "bigger problem" for the second time since I have it
the machine was unresponsive on web interface, so I thought it has crashed completely and would need a physical restart, which is very bad for me
however, I tried to connect to SSH and somehow it worked, but extremely slowly, took my a couple of minutes but finally I was able to type reboot command and reboot the machine, which it did and everything is OK again
so, if someone has this totally unresponsive machine, try SSH and be very very patient, it may come up
funny thing is that it did not stop hashing until I reboot it - i.e. even in this strange unresponsive state, it was doing its job, just web interface could not be reached and SSH extremely hardly
since the operation with the whole machine was impossibly slow I could not analyze any logs before reboot, so I have no idea what happened
Out of mem - free? WiFi disabled yes/no dmesg? just a few basic commands will tell you what went wrong. Make sure you post the output next time
|
|
|
|
psjw4450
|
|
July 16, 2013, 12:37:23 PM |
|
Yesterday my avalon started showing a super high number of hw. Almost 75% of the accepted diff1a shares. For instance shares 100000 hw 75000. Any ideas on what to check?
|
|
|
|
ProfMac
Legendary
Offline
Activity: 1246
Merit: 1002
|
|
July 16, 2013, 01:40:05 PM |
|
Yesterday my avalon started showing a super high number of hw. Almost 75% of the accepted diff1a shares. For instance shares 100000 hw 75000. Any ideas on what to check?
I had a maximum temp of 51C last night, and my HW values were almost as high as my accepted. The past week, it has stayed below 50 during routine operation. It has been stable around 343-347 frequency. I had --avalon-auto set to 285-375. I just changed it to 285-350. My output is conveniently shown at http://eligius.st/~wizkid057/newstats/userstats.php/18bLcVkviErQi75zB8X39jZXxHNpSZggdCAt the start of 16 Jul I moved the unit from a warm upstairs room where the fans fairly consistently ran at 3800 to the basement where they have run closer to 2200. The dip is from the time it took to move the machine. The stability and speed both seemed to improve, but during the night the basement started warming up from it's initial 69°F. I also put a filter in front of the fans, and the fan speed seems to be a little unstable now. Without the filter, their speed stays pretty constant.
|
I try to be respectful and informed.
|
|
|
SolarSilver
Legendary
Offline
Activity: 1112
Merit: 1000
|
|
July 16, 2013, 01:42:22 PM |
|
I had a maximum temp of 51C last night, and my HW values were almost as high as my accepted. The past week, it has stayed below 50 during routine operation.
Are you talking about "Accepted" vs "HW" or "Diff1Shares" vs "HW"?
|
|
|
|
ProfMac
Legendary
Offline
Activity: 1246
Merit: 1002
|
|
July 16, 2013, 01:48:58 PM |
|
I had a maximum temp of 51C last night, and my HW values were almost as high as my accepted. The past week, it has stayed below 50 during routine operation.
Are you talking about "Accepted" vs "HW" or "Diff1Shares" vs "HW"? "Accepted" I don't see a number labeled "Diff1Shares" in the status panel.
|
I try to be respectful and informed.
|
|
|
cypherdoc
Legendary
Offline
Activity: 1764
Merit: 1002
|
|
July 16, 2013, 02:17:08 PM |
|
I had a maximum temp of 51C last night, and my HW values were almost as high as my accepted. The past week, it has stayed below 50 during routine operation.
Are you talking about "Accepted" vs "HW" or "Diff1Shares" vs "HW"? "Accepted" I don't see a number labeled "Diff1Shares" in the status panel. it's there. look harder. you should be using that as part of your denominator.
|
|
|
|
GenTarkin
Legendary
Offline
Activity: 2450
Merit: 1002
|
|
July 16, 2013, 02:43:22 PM |
|
in case you have multiple pools in the list that switch between you will add up diff1shares then do the calculation. Also, I find setting up miner.php an easier method of lookin at the data =)
|
|
|
|
WinTame2012
|
|
July 16, 2013, 10:32:28 PM |
|
I get a REALLY large number of invalid shares with my Avalon.
I'm not sure whats going on. It's not consistent.
One day I'll be mining and I'll get 99+% valid shares and the next day or two it will only be 88% valid shares.
I've tried mining on Ozcoin and 50btc and it seems to be bad on both.
I don't think it is my network because my jalapenos only get <1% invalid shares.
I'm using the dynamically adjusting frequency and it usually settle at about 352 MHz or so and 82 GH/s on average. Of course, according to the pool, I'm only getting 70 GH/s or so due to all the invalid shares.
I have an AC feeding directly into the intake on the Avalon and no temperature gets above 48 or 49 or so (very rarely 50).
What could be wrong?
This is normal on stratum. Every time the block changes, you get invalids. Some days burn more blocks than others. It seems like I have way more than other Avalon users, at least from what I've seen on Ozcoin. Most of them have only a couple percent invalid shares at most, while I will have 12 or 13% invalid shares. Right now I'm standing at about 94% efficiency on average, that's with 400,000 invalid shares in the less than a week I've been mining there. That is definitely far more than you should be getting. There is a good chance you're submitting heaps of duplicates which may also be a different form of instability that the auto mode can't check for. Try setting a lower maximum speed if you're using auto mode because clearly you're not doing 82GH of useful work. I also have 11% of rejected shares while HW is ~1.3% Max temp is 48 C, ambient 28 C. My command line is: --quiet --avalon-auto --avalon-freq 282-375 --avalon-cutoff 60 Last night it somehow stopped dumping rejected shares and so was till today, then I've caught blackout. After resuming it still generates 11% reject shares working at 352 MHz (autotuned).
|
|
|
|
Bogart
Legendary
Offline
Activity: 966
Merit: 1000
|
|
July 17, 2013, 12:30:45 AM Last edit: July 17, 2013, 03:07:32 AM by Bogart |
|
I can watch MHS5s slowly drop to 0, and eventually cgminer-monitor will restart cgminer. Why is cgminer stopping hashing?
Common reasons: Wifi kernel problem Overdoing the overclocking Pool failure and the cgminer-monitor watchdog is trigger happy and kills cgminer when all it's doing is waiting for a pool to come back online. FPGA failure in the avalon.If the fans stop running entirely, does that indicate FPGA failure? My Batch 1 unit does this sometimes. I now have a script that queries the API, and if it has a zero hashrate for too long, it will call the connected Web Power Switch to cycle the power on the unit, which always brings it back up. (I do have to have it leave the power off for a full 30 seconds.) Well today power-cycleing the unit did not bring it back to life. I found it stuck repeatedly cycling. The blue LED on the TP-Link would light, and the red LED on the front would come on, but the fans never spun up. I opened it up, and out came the stock Antec EarthWatts 650 power supply, and in went a new Corsair HX850. It was a tight fit. I had to dismount the PDU board, and reinstall it after the PSU went in. I'm happy to report that it's once again hashing away. --avalon-auto has now settled on 360MHz and it's been running for over an hour now, whereas before I had to limit it to 345 to keep things stable. I'm glad I thought ahead to buy that PSU as a spare for exactly this kind of situation. I guess I should now buy another spare. http://www.newegg.com/Product/Product.aspx?Item=N82E16817139011
|
"All safe deposit boxes in banks or financial institutions have been sealed... and may only be opened in the presence of an agent of the I.R.S." - President F.D. Roosevelt, 1933
|
|
|
bitdude
|
|
July 17, 2013, 05:51:24 AM |
|
today I have experienced a "bigger problem" for the second time since I have it
the machine was unresponsive on web interface, so I thought it has crashed completely and would need a physical restart, which is very bad for me
however, I tried to connect to SSH and somehow it worked, but extremely slowly, took my a couple of minutes but finally I was able to type reboot command and reboot the machine, which it did and everything is OK again
so, if someone has this totally unresponsive machine, try SSH and be very very patient, it may come up
funny thing is that it did not stop hashing until I reboot it - i.e. even in this strange unresponsive state, it was doing its job, just web interface could not be reached and SSH extremely hardly
since the operation with the whole machine was impossibly slow I could not analyze any logs before reboot, so I have no idea what happened
Out of mem - free? WiFi disabled yes/no dmesg? just a few basic commands will tell you what went wrong. Make sure you post the output next time thanks, will try next time wifi disabled -> yes
|
|
|
|
-ck
Legendary
Offline
Activity: 4284
Merit: 1645
Ruu \o/
|
|
July 17, 2013, 07:13:34 AM |
|
That is definitely far more than you should be getting. There is a good chance you're submitting heaps of duplicates which may also be a different form of instability that the auto mode can't check for. Try setting a lower maximum speed if you're using auto mode because clearly you're not doing 82GH of useful work.
I also have 11% of rejected shares while HW is ~1.3% Max temp is 48 C, ambient 28 C. My command line is: --quiet --avalon-auto --avalon-freq 282-375 --avalon-cutoff 60 Last night it somehow stopped dumping rejected shares and so was till today, then I've caught blackout. After resuming it still generates 11% reject shares working at 352 MHz (autotuned). Like I said, try setting a lower maximum.
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
psjw4450
|
|
July 17, 2013, 01:38:13 PM |
|
Yesterday my avalon started showing a super high number of hw. Almost 75% of the accepted diff1a shares. For instance shares 100000 hw 75000. Any ideas on what to check?
Found out that running any temps higher then 41 starts giving hw higher then 70%. Im using current firmware from cgminer and using ethernet not wifi. Anyone able to help or any have ideas.
|
|
|
|
GenTarkin
Legendary
Offline
Activity: 2450
Merit: 1002
|
|
July 17, 2013, 02:46:52 PM |
|
Yesterday my avalon started showing a super high number of hw. Almost 75% of the accepted diff1a shares. For instance shares 100000 hw 75000. Any ideas on what to check?
Found out that running any temps higher then 41 starts giving hw higher then 70%. Im using current firmware from cgminer and using ethernet not wifi. Anyone able to help or any have ideas. What clocks / temps?
|
|
|
|
psjw4450
|
|
July 17, 2013, 02:52:51 PM |
|
Yesterday my avalon started showing a super high number of hw. Almost 75% of the accepted diff1a shares. For instance shares 100000 hw 75000. Any ideas on what to check?
Found out that running any temps higher then 41 starts giving hw higher then 70%. Im using current firmware from cgminer and using ethernet not wifi. Anyone able to help or any have ideas. What clocks / temps? Pretty much at any clock, I have tried 282 and once it climbs over 40-41 then it starts with high errors. I currently have an airconditioner by it and running stable on 300 clock with temps at 40 and hw errors less then 2%. If I move the airconditioner the hw starts climbing fast.
|
|
|
|
|