jgarzik (OP)
Legendary
Offline
Activity: 1596
Merit: 1099
|
|
February 08, 2013, 05:14:12 PM |
|
Another reliability update:
Now that difficulty is sufficiently high, no longer seeing machine or cgminer restarts.
The most common symptom now is a cessation of mining; cgminer and machine are both responding to status queries, but no work is occurring.
This symptom occurs every 24-48 hours.
A simple machine restart fixes the problem immediately.
|
Jeff Garzik, Bloq CEO, former bitcoin core dev team; opinions are my own. Visit bloq.com / metronome.io Donations / tip jar: 1BrufViLKnSWtuWGkryPsKsxonV2NQ7Tcj
|
|
|
Photon939
|
|
February 08, 2013, 05:30:40 PM |
|
That's rather poor performance TBH, I imagine it will quickly become frustrating to manually restart your miner every/every other day. My GPU miners easily went over a month with no restarts or failures (on Windows even)
Too bad nobody can do anything about it since they haven't made any of the code available. Disappointing after their "it's not for the money" statements.
|
|
|
|
Bogart
Legendary
Offline
Activity: 966
Merit: 1000
|
|
February 08, 2013, 05:58:19 PM |
|
Another reliability update:
Now that difficulty is sufficiently high, no longer seeing machine or cgminer restarts.
The most common symptom now is a cessation of mining; cgminer and machine are both responding to status queries, but no work is occurring.
This symptom occurs every 24-48 hours.
A simple machine restart fixes the problem immediately.
Sure sounds like a memory leak. If you could check and record every hour or so, the free memory reported by 'free', and maybe capture which processes are using how much with 'ps axu', you may be able to find more definitive proof. For remote/automated restart capability, I like these: http://www.digital-loggers.com/lpc.html
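For anyone who wants to collect the numbers suggested above, here is a minimal sketch of the logging step in Python. It assumes a procps-style `free` and `ps axu` (a stripped-down busybox build on the controller may print different columns), and the function names and the log file name are made up for illustration.

```python
import subprocess
import time


def parse_free_kib(free_output):
    """Return the 'free' column of the 'Mem:' row from `free` output, in KiB."""
    for line in free_output.splitlines():
        fields = line.split()
        if fields and fields[0] == "Mem:":
            return int(fields[3])  # columns: label, total, used, free, ...
    raise ValueError("no 'Mem:' row found")


def top_by_rss(ps_output, count=5):
    """Return (rss_kib, command) pairs for the largest processes in `ps axu` output."""
    rows = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split(None, 10)  # the 11th field is the full command line
        if len(fields) == 11 and fields[5].isdigit():
            rows.append((int(fields[5]), fields[10]))
    return sorted(rows, reverse=True)[:count]


def log_once(path="memlog.txt"):
    """Append one timestamped snapshot of free memory and the biggest processes."""
    free_out = subprocess.check_output(["free"], text=True)
    ps_out = subprocess.check_output(["ps", "axu"], text=True)
    with open(path, "a") as log:
        stamp = time.strftime("%Y-%m-%d %H:%M:%S")
        log.write("%s free_kib=%d\n" % (stamp, parse_free_kib(free_out)))
        for rss, command in top_by_rss(ps_out):
            log.write("    rss_kib=%d %s\n" % (rss, command))
```

Calling `log_once()` from an hourly cron entry (or a loop with `time.sleep(3600)`) would build up the record. A steadily shrinking `free_kib` next to a steadily growing `rss_kib` for one process is the pattern a leak would show.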
|
"All safe deposit boxes in banks or financial institutions have been sealed... and may only be opened in the presence of an agent of the I.R.S." - President F.D. Roosevelt, 1933
|
|
|
-ck
Legendary
Offline
Activity: 4256
Merit: 1645
Ruu \o/
|
|
February 08, 2013, 09:19:39 PM |
|
It's a damn shame, because the memory leak was very real, trivial to fix, and fixed in the next release of cgminer. I'd suggest just using ozcoin (or a pool like it that supports setting high static diffs) to mine with, and setting the difficulty to 10,000. Then the number of shares submitted will be minuscule and it will take much longer to run out of memory. The only cost of doing that would be some more variance.
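To put rough numbers on that trade-off: a difficulty-1 share takes about 2^32 hashes on average, so the expected share rate is hashrate / (difficulty × 2^32). The sketch below compares the difficulty-32 setting mentioned elsewhere in the thread with a static difficulty of 10,000. The 60 GH/s hashrate is a placeholder assumption, not a figure from this thread, and the variance estimate simply treats the share count as Poisson.

```python
HASHES_PER_DIFF1_SHARE = 2**32  # average hashes needed for one difficulty-1 share


def shares_per_hour(hashrate_hps, share_difficulty):
    """Expected number of shares found per hour at a given pool share difficulty."""
    return hashrate_hps * 3600 / (share_difficulty * HASHES_PER_DIFF1_SHARE)


def relative_spread(expected_shares):
    """Rough relative standard deviation of a share count, treating it as Poisson."""
    return expected_shares ** -0.5


ASSUMED_HASHRATE_HPS = 60e9  # hypothetical figure, not taken from the thread

for difficulty in (32, 10000):
    hourly = shares_per_hour(ASSUMED_HASHRATE_HPS, difficulty)
    print("diff %5d: %.1f shares/hour, daily spread about %.0f%%"
          % (difficulty, hourly, 100 * relative_spread(hourly * 24)))
```

Whatever the real hashrate is, going from difficulty 32 to 10,000 cuts the number of submitted shares by a factor of 10,000 / 32 = 312.5, which is where both the slower leak and the extra variance come from.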
|
Developer/maintainer for cgminer, ckpool/ckproxy, and the -ck kernel 2% Fee Solo mining at solo.ckpool.org -ck
|
|
|
jgarzik (OP)
Legendary
Offline
Activity: 1596
Merit: 1099
|
|
February 09, 2013, 01:40:55 AM |
|
Another reliability update:
Now that difficulty is sufficiently high, no longer seeing machine or cgminer restarts.
The most common symptom now is a cessation of mining; cgminer and machine are both responding to status queries, but no work is occurring.
This symptom occurs every 24-48 hours.
A simple machine restart fixes the problem immediately.
Sure sounds like a memory leak. If you could check and record every hour or so, the free memory reported by 'free', and maybe capture which processes are using how much with 'ps axu', you may be able to find more definitive proof.
'free' is happy as a clam. The previous behavior can be attributed to a memory leak. Now that difficulty is sufficiently high, the box reaches a condition where the controller (Linux kernel, cgminer) is active and accessible remotely, but no work is progressing. The box will restart if the memory leak condition is reached. The box does not restart upon this no-mining condition.
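Since the box still answers status queries in this state, one possible workaround is a small watchdog that polls cgminer and flags the stall. The sketch below is only that, a sketch: it assumes cgminer was started with its API enabled (`--api-listen`) on the default port 4028, and that the `summary` reply carries an `Accepted` counter as in stock cgminer. The Avalon controller firmware may differ, and the function names are invented for illustration.

```python
import json
import socket


def query_summary(host="127.0.0.1", port=4028, timeout=10.0):
    """Send the JSON 'summary' command to a cgminer-style API and return the reply as a dict."""
    with socket.create_connection((host, port), timeout=timeout) as conn:
        conn.sendall(json.dumps({"command": "summary"}).encode())
        chunks = []
        while True:
            data = conn.recv(4096)
            if not data:  # the server closes the connection after replying
                break
            chunks.append(data)
    # Strip any trailing NUL byte before parsing (assumed from stock cgminer behaviour).
    return json.loads(b"".join(chunks).rstrip(b"\x00").decode())


def counter_value(reply, field="Accepted"):
    """Pull one counter out of the SUMMARY section (field name is an assumption)."""
    return int(reply["SUMMARY"][0][field])


def is_stalled(previous_value, current_value):
    """True when the counter has not moved between two polls.

    A lower value is treated as a restart of the miner, not as a stall.
    """
    return current_value == previous_value
```

The poll interval has to be much longer than the expected gap between shares, otherwise a quiet stretch at high difficulty looks like a stall. What to do on a stall (alert, restart cgminer, power-cycle the box) is left open here.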
|
|
|
|
Bogart
Legendary
Offline
Activity: 966
Merit: 1000
|
|
February 09, 2013, 02:19:41 AM |
|
Another reliability update:
Now that difficulty is sufficiently high, no longer seeing machine or cgminer restarts.
The most common symptom now is a cessation of mining; cgminer and machine are both responding to status queries, but no work is occurring.
This symptom occurs every 24-48 hours.
A simple machine restart fixes the problem immediately.
Sure sounds like a memory leak. If you could check and record every hour or so, the free memory reported by 'free', and maybe capture which processes are using how much with 'ps axu', you may be able to find more definitive proof.
'free' is happy as a clam. The previous behavior can be attributed to a memory leak. Now that difficulty is sufficiently high, the box reaches a condition where the controller (Linux kernel, cgminer) is active and accessible remotely, but no work is progressing. The box will restart if the memory leak condition is reached. The box does not restart upon this no-mining condition.
I see. Does 'dmesg' report anything new once this condition is reached?
|
|
|
fcmatt
Legendary
Offline
Activity: 2072
Merit: 1001
|
|
February 09, 2013, 02:40:45 AM |
|
Another reliability update:
Now that difficulty is sufficiently high, no longer seeing machine or cgminer restarts.
The most common symptom now is a cessation of mining; cgminer and machine are both responding to status queries, but no work is occurring.
This symptom occurs every 24-48 hours.
A simple machine restart fixes the problem immediately.
Sure sounds like a memory leak. If you could check and record every hour or so, the free memory reported by 'free', and maybe capture which processes are using how much with 'ps axu', you may be able to find more definitive proof.
'free' is happy as a clam. The previous behavior can be attributed to a memory leak. Now that difficulty is sufficiently high, the box reaches a condition where the controller (Linux kernel, cgminer) is active and accessible remotely, but no work is progressing. The box will restart if the memory leak condition is reached. The box does not restart upon this no-mining condition.
I am using a semi-new version of cgminer and that behavior is what I am seeing. The program is running but no actual work gets to btcguild. Restarting cgminer every few weeks fixes it. No reboot needed on Windows 7 with GPU. Can't you drop in a new version of cgminer but somehow keep the critical bits for those ASICs? Or are the drivers compiled into one big binary? Just thinking out loud. I am probably wrong.
|
|
|
|
iCEBREAKER
Legendary
Offline
Activity: 2156
Merit: 1072
Crypto is the separation of Power and State.
|
|
February 09, 2013, 03:21:24 AM |
|
Avalon ASIC miner is currently surviving >24 hour stretches without restarting, now that we're on a stable pool with a reasonably high difficulty value (BTC Guild, difficulty 32.0).
However, have now seen the machine get "stuck" in a strange state, where it is not mining or restarting. The fans ramp up, then ramp down, in a cycle.
BTC Guild is the most advanced and best pool on the planet. Maybe this is why we've only 3 ASICs in the wild at present? It's great that Avalon was first to market and all, but your unit was specifically intended for review and it seems sort of... HALF BAKED
|
|
| "The difference between bad and well-developed digital cash will determine whether we have a dictatorship or a real democracy." David Chaum 1996 "Fungibility provides privacy as a side effect." Adam Back 2014
|
| | |
|
|
|
Nemesis
|
|
February 09, 2013, 03:47:19 AM |
|
Avalon ASIC miner is currently surviving >24 hour stretches without restarting, now that we're on a stable pool with a reasonably high difficulty value (BTC Guild, difficulty 32.0).
However, have now seen the machine get "stuck" in a strange state, where it is not mining or restarting. The fans ramp up, then ramp down, in a cycle.
BTC Guild is the most advanced and best pool on the planet. Maybe this is why we've only 3 ASICs in the wild at present? It's great that Avalon was first to market and all, but your unit was specifically intended for review and it seems sort of... HALF BAKED
If it's a true review, he would have said "This shit doesn't work, still prototyping. Buyers beware." You know, like any real review of a product being sold in working state.
|
|
|
|
jgarzik (OP)
Legendary
Offline
Activity: 1596
Merit: 1099
|
|
February 10, 2013, 06:42:42 PM |
|
Uptime: 1d 13h 49m 29s
solo mining w/ eloipool + high diff
|
|
|
|
fcmatt
Legendary
Offline
Activity: 2072
Merit: 1001
|
|
February 10, 2013, 07:52:57 PM |
|
Uptime: 1d 13h 49m 29s
solo mining w/ eloipool + high diff
Good luck. Are you going for a block no matter how long it takes?
|
|
|
|
BitSyncom
Sr. Member
Offline
Activity: 336
Merit: 251
Avalon ASIC Team
|
|
February 10, 2013, 09:43:26 PM |
|
overheating is not the issue here. see http://imgur.com/a/5Gqix
I'm sure Jeff can confirm these numbers on his device as well. as for memleak, we found it but that didn't solve all the issues. expect a controller update soon and a newsletter.
|
|
|
|
Bicknellski
|
|
February 11, 2013, 12:20:01 AM |
|
overheating is not the issue here. see http://imgur.com/a/5Gqix
I'm sure Jeff can confirm these numbers on his device as well. as for memleak, we found it but that didn't solve all the issues. expect a controller update soon and a newsletter.
+1
|
|
|
|
fcmatt
Legendary
Offline
Activity: 2072
Merit: 1001
|
|
February 11, 2013, 01:52:39 AM |
|
overheating is not the issue here. see http://imgur.com/a/5Gqix
I'm sure Jeff can confirm these numbers on his device as well. as for memleak, we found it but that didn't solve all the issues. expect a controller update soon and a newsletter.
+1
+1 what? I am unclear how one can interpret those pics without knowing what is too hot for this gear. Am I supposed to think I can compare it to GPU cards and how hot they get? What numbers exactly? Does the GUI report temp values? Good they found a memleak, but if it did not solve all issues... what exact issues remain?
|
|
|
|
Inaba
Legendary
Offline
Activity: 1260
Merit: 1000
|
|
February 11, 2013, 02:37:38 AM |
|
That entire heatsink on each card is saturated at 50C! Holy hell on a crutch!
Are those IR pics accurate? That seems excessive... if it actually is 40 - 50C, what's going to happen to them in hot weather?
Maybe Jeff can heat up a room to 85F or so and let it run to see what happens.
|
If you're searching these lines for a point, you've probably missed it. There was never anything there in the first place.
|
|
|
kjj
Legendary
Offline
Activity: 1302
Merit: 1026
|
|
February 11, 2013, 02:53:12 AM |
|
That entire heatsink on each card is saturated at 50C! Holy hell on a crutch!
Are those IR pics accurate? That seems excessive... if it actually is 40 - 50C, what's going to happen to them in hot weather?
Maybe Jeff can heat up a room to 85F or so and let it run to see what happens.
Even better, get a sheet of IR-transparent plexiglass and put it where the missing side of the case should be so that the fans can generate proper airflow, and then take the thermal images again.
|
17Np17BSrpnHCZ2pgtiMNnhjnsWJ2TMqq8 I routinely ignore posters with paid advertising in their sigs. You should too.
|
|
|
Dalkore
Legendary
Offline
Activity: 1330
Merit: 1026
Mining since 2010 & Hosting since 2012
|
|
February 11, 2013, 03:27:41 AM |
|
That entire heatsink on each card is saturated at 50C! Holy hell on a crutch!
Are those IR pics accurate? That seems excessive... if it actually is 40 - 50C, what's going to happen to them in hot weather?
Maybe Jeff can heat up a room to 85F or so and let it run to see what happens.
Why would he want to do that? I think we can guess what happens. Either they slow down or it hits a temp cutoff that shuts down mining. FYI, high temperatures will have adverse effects on mining operations. When BFL Singles get too hot, they down-clock until they are within the optimal temperature range.
|
Hosting: Low as $60.00 per KW - LinkTransaction List: jayson3 +5 - ColdHardMetal +3 - Nolo +2 - CoinHoarder +1 - Elxiliath +1 - tymm0 +1 - Johnniewalker +1 - Oscer +1 - Davidj411 +1 - BitCoiner2012 +1 - dstruct2k +1 - Philj +1 - camolist +1 - exahash +1 - Littleshop +1 - Severian +1 - DebitMe +1 - lepenguin +1 - StringTheory +1 - amagimetals +1 - jcoin200 +1 - serp +1 - klintay +1 - -droid- +1 - FlutterPie +1
|
|
|
crazyates
Legendary
Offline
Activity: 952
Merit: 1000
|
|
February 11, 2013, 05:19:55 AM |
|
That entire heatsink on each card is saturated at 50C! Holy hell on a crutch!
Are those IR pics accurate? That seems excessive... if it actually is 40 - 50C, what's going to happen to them in hot weather?
Maybe Jeff can heat up a room to 85F or so and let it run to see what happens.
Why would he want to do that? I think we can guess what happens. Either they slow down or it hits a temp cutoff that shuts down mining. FYI, high temperatures will have adverse effects on mining operations. When BFL Singles get too hot, they down-clock until they are within the optimal temperature range.
Inaba: Hey, let's do some scientific and procedural testing so we can determine an outcome given a set of constants (hardware) and variables (temperature).
Dalkore: That's stupid. It's either going to be situation A or situation B. I don't know which, and I have no way of finding out unless we follow Inaba's suggestion, but it's still stupid.
Really?
|
|
|
|
Inaba
Legendary
Offline
Activity: 1260
Merit: 1000
|
|
February 11, 2013, 05:27:24 AM |
|
That entire heatsink on each card is saturated at 50C! Holy hell on a crutch!
Are those IR pics accurate? That seems excessive... if it actually is 40 - 50C, what's going to happen to them in hot weather?
Maybe Jeff can heat up a room to 85F or so and let it run to see what happens.
Why would he want to do that? I think we can guess what happens. Either they slow down or it hits a temp cutoff that shuts down mining. FYI, high temperatures will have adverse effects on mining operations. When BFL Singles get too hot, they down-clock until they are within the optimal temperature range.
I don't know if you noticed, but many parts of the world get to be 85F or more during the summer months. Personally, I would want to know if my expensive mining hardware was going to shut down when the temp rose, as it would dictate how much power I had to consume to cool the thing with an AC unit. Maybe you don't care, but I sure as hell would.
|
|
|
|
BitSyncom
Sr. Member
Offline
Activity: 336
Merit: 251
Avalon ASIC Team
|
|
February 11, 2013, 05:28:20 AM |
|
Maybe Jeff can heat up a room to 85F or so and let it run to see what happens.
Avalon is designed to operate at 35C, not a problem.
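For readers mixing units in this exchange, a quick conversion sketch. The only inputs are the two figures from the thread (the 85F room and the 35C design point); the function names are just for illustration.

```python
def fahrenheit_to_celsius(deg_f):
    """Convert a temperature from degrees Fahrenheit to degrees Celsius."""
    return (deg_f - 32.0) * 5.0 / 9.0


def celsius_to_fahrenheit(deg_c):
    """Convert a temperature from degrees Celsius to degrees Fahrenheit."""
    return deg_c * 9.0 / 5.0 + 32.0


room_c = fahrenheit_to_celsius(85.0)    # the room temperature proposed above
design_f = celsius_to_fahrenheit(35.0)  # the stated design temperature

print("85F room is %.1fC; 35C design point is %.0fF" % (room_c, design_f))
```

So an 85F room is a little under 30C, below the stated 35C figure. That is ambient air temperature, though, not the heatsink temperature shown in the IR pictures.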
|
|
|
|
|