Bitcoin Forum
November 02, 2024, 10:26:27 AM *
News: Latest Bitcoin Core release: 28.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 4 5 [6] 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 ... 221 »
  Print  
Author Topic: Avalon ASIC users thread  (Read 438567 times)
jgarzik (OP)
Legendary
*
Offline Offline

Activity: 1596
Merit: 1100


View Profile
February 03, 2013, 05:11:24 AM
Last edit: February 03, 2013, 05:22:55 AM by jgarzik
 #101

Still having some amount of reliability problems.  Moving the miner in the office helped a great deal.  However, still seeing machine restarts for unknown reasons (note: must ping support, inquire about reboot logging).

And right now, the miner is in a strange state where the controller (running Linux) and cgminer are alive, but fans are not running, and no work is occurring:

Code:
   [Elapsed] => 10477
   [MHS av] => 15419.77   (this is not changing)
   [Found Blocks] => 0
   [Getworks] => 385         (these are increasing, slowly)
   [Accepted] => 37082     (these are not increasing)
   [Rejected] => 398
...
   [fan1] => 0                     (indeed, fans are not moving)
   [fan2] => 0
   [fan3] => 0
   [temp1] => 26
   [temp2] => -1
   [temp3] => 48
   [temp_max] => 49

And the pool agrees that no shares are incoming.


Jeff Garzik, Bloq CEO, former bitcoin core dev team; opinions are my own.
Visit bloq.com / metronome.io
Donations / tip jar: 1BrufViLKnSWtuWGkryPsKsxonV2NQ7Tcj
kano
Legendary
*
Offline Offline

Activity: 4606
Merit: 1851


Linux since 1997 RedHat 4


View Profile
February 03, 2013, 05:45:31 AM
 #102

i.e. still hashing, still getting work, but not sending anything back to the pool/bitcoind.
So either there's an issue of heat causing bad/no nonces to come back or there's some failure to send out the answers (failure to send is unlikely)

Counting the amount of work done is up to the driver supplying a response back to the call to scanhash()
Is Hardware Errors increasing?

... and yes the driver module can reply in such a way as cgminer thinks it is working but no nonces are being found.
Since a work item doesn't have to find a none (they only average one per nonce range as you know) if it isn't actually returning any nonces then that would also match your description (i.e. due to some over heat MCU/whatever issue that might happen on occasion and the reboot fixes it)

Edit: this happens with Icarus on rare occasions Smiley

Pool: https://kano.is - low 0.5% fee PPLNS 3 Days - Most reliable Solo with ONLY 0.5% fee   Bitcointalk thread: Forum
Discord support invite at https://kano.is/ Majority developer of the ckpool code - k for kano
The ONLY active original developer of cgminer. Original master git: https://github.com/kanoi/cgminer
jgarzik (OP)
Legendary
*
Offline Offline

Activity: 1596
Merit: 1100


View Profile
February 03, 2013, 05:46:16 AM
 #103

i.e. still hashing, still getting work, but not sending anything back to the pool/bitcoind.

No, not hashing.


Jeff Garzik, Bloq CEO, former bitcoin core dev team; opinions are my own.
Visit bloq.com / metronome.io
Donations / tip jar: 1BrufViLKnSWtuWGkryPsKsxonV2NQ7Tcj
kano
Legendary
*
Offline Offline

Activity: 4606
Merit: 1851


Linux since 1997 RedHat 4


View Profile
February 03, 2013, 05:51:24 AM
 #104

i.e. still hashing, still getting work, but not sending anything back to the pool/bitcoind.

No, not hashing.

Hmm but you said MHs isn't changing i.e. that hash rate isn't dropping.
Check "Total MH" which should be increasing.

Edit: yes it may not *actually* be hashing, but the driver module may think it is and isn't getting an error from the devices and thus assuming it is hashing and thus it's effectively saying that it's just not finding any nonces.
i.e. it's able to send the work and the expected responses are coming back, but without any nonces.

Pool: https://kano.is - low 0.5% fee PPLNS 3 Days - Most reliable Solo with ONLY 0.5% fee   Bitcointalk thread: Forum
Discord support invite at https://kano.is/ Majority developer of the ckpool code - k for kano
The ONLY active original developer of cgminer. Original master git: https://github.com/kanoi/cgminer
jgarzik (OP)
Legendary
*
Offline Offline

Activity: 1596
Merit: 1100


View Profile
February 03, 2013, 06:01:53 AM
 #105

Again, no, it was not hashing.  Avoid speculating wildly based on a tiny subset of hundreds of variables.

Jeff Garzik, Bloq CEO, former bitcoin core dev team; opinions are my own.
Visit bloq.com / metronome.io
Donations / tip jar: 1BrufViLKnSWtuWGkryPsKsxonV2NQ7Tcj
rchapoteau
Newbie
*
Offline Offline

Activity: 56
Merit: 0



View Profile WWW
February 03, 2013, 06:12:18 AM
 #106

Mighr be a stupid question but are you sure that your office isn't doing anything on the network that might be giving you an issue?  Like proxy server settings?
jgarzik (OP)
Legendary
*
Offline Offline

Activity: 1596
Merit: 1100


View Profile
February 03, 2013, 06:12:55 AM
 #107

Mighr be a stupid question but are you sure that your office isn't doing anything on the network that might be giving you an issue?  Like proxy server settings?

Yes.


Jeff Garzik, Bloq CEO, former bitcoin core dev team; opinions are my own.
Visit bloq.com / metronome.io
Donations / tip jar: 1BrufViLKnSWtuWGkryPsKsxonV2NQ7Tcj
kano
Legendary
*
Offline Offline

Activity: 4606
Merit: 1851


Linux since 1997 RedHat 4


View Profile
February 03, 2013, 06:15:31 AM
 #108

Again, no, it was not hashing.  Avoid speculating wildly based on a tiny subset of hundreds of variables.

No speculation.
MHs says it is hashing - reread my last reply.

Pool: https://kano.is - low 0.5% fee PPLNS 3 Days - Most reliable Solo with ONLY 0.5% fee   Bitcointalk thread: Forum
Discord support invite at https://kano.is/ Majority developer of the ckpool code - k for kano
The ONLY active original developer of cgminer. Original master git: https://github.com/kanoi/cgminer
Unacceptable
Legendary
*
Offline Offline

Activity: 2212
Merit: 1001



View Profile
February 03, 2013, 06:17:36 AM
 #109

Maybe someone can get on teamviewer or the like & see whats happening  Huh

"If you run into an asshole in the morning, you ran into an asshole. If you run into assholes all day long, you are the asshole."  -Raylan Givens
Got GOXXED ?? https://www.youtube.com/watch?v=9KiqRpPiJAU&feature=youtu.be
"An ASIC being late is perfectly normal, predictable, and legal..."Hashfast & BFL slogan Smiley
kano
Legendary
*
Offline Offline

Activity: 4606
Merit: 1851


Linux since 1997 RedHat 4


View Profile
February 03, 2013, 06:19:10 AM
 #110

Maybe someone can get on teamviewer or the like & see whats happening  Huh
As I said, this already happens with Icarus.
It's not surprising - and the API output says it is doing the same thing.

Pool: https://kano.is - low 0.5% fee PPLNS 3 Days - Most reliable Solo with ONLY 0.5% fee   Bitcointalk thread: Forum
Discord support invite at https://kano.is/ Majority developer of the ckpool code - k for kano
The ONLY active original developer of cgminer. Original master git: https://github.com/kanoi/cgminer
jgarzik (OP)
Legendary
*
Offline Offline

Activity: 1596
Merit: 1100


View Profile
February 03, 2013, 06:30:26 AM
 #111

No speculation.
MHs says it is hashing - reread my last reply.

And as noted, that number is not changing.  The number is constantly changing, normally.

Other stats bear out the conclusion.


Jeff Garzik, Bloq CEO, former bitcoin core dev team; opinions are my own.
Visit bloq.com / metronome.io
Donations / tip jar: 1BrufViLKnSWtuWGkryPsKsxonV2NQ7Tcj
rudrigorc2
Legendary
*
Offline Offline

Activity: 1064
Merit: 1000



View Profile
February 03, 2013, 07:04:36 AM
 #112

 is it dead Huh
kano
Legendary
*
Offline Offline

Activity: 4606
Merit: 1851


Linux since 1997 RedHat 4


View Profile
February 03, 2013, 07:13:07 AM
 #113

No speculation.
MHs says it is hashing - reread my last reply.

And as noted, that number is not changing.  The number is constantly changing, normally.

Other stats bear out the conclusion.

Well the fact that the API is replying (and counting getworks) says of course cgminer is running.
The only ways *I* can imagine that number to be exactly static:
1) The system clock stopped at the same time it stopped hashing - simple to check - look at some of the other information you removed - [When] and see if it changes form one output to the next ...
2) The software changes have messed with the main control of cgminer ... and got something wrong there
3) Some number is overflowing somewhere due to the higher hash rate ... and screwing up the work thread(s) in some weird way
4) The hashing speed is perfectly 15419.77 when it thinks it is working but is indeed doing nothing - since the timing to do nothing is the same every time it does nothing Smiley
5) Who knows - anyway I'm wasting my time here ... have fun with it Smiley

Pool: https://kano.is - low 0.5% fee PPLNS 3 Days - Most reliable Solo with ONLY 0.5% fee   Bitcointalk thread: Forum
Discord support invite at https://kano.is/ Majority developer of the ckpool code - k for kano
The ONLY active original developer of cgminer. Original master git: https://github.com/kanoi/cgminer
jgarzik (OP)
Legendary
*
Offline Offline

Activity: 1596
Merit: 1100


View Profile
February 03, 2013, 07:16:30 AM
 #114

is it dead Huh

No, not at all.  A restart fixes everything.  This is just describing the behavior before the restart.


Jeff Garzik, Bloq CEO, former bitcoin core dev team; opinions are my own.
Visit bloq.com / metronome.io
Donations / tip jar: 1BrufViLKnSWtuWGkryPsKsxonV2NQ7Tcj
BitSyncom
Sr. Member
****
Offline Offline

Activity: 336
Merit: 251

Avalon ASIC Team


View Profile
February 03, 2013, 08:45:01 AM
 #115

power consumption on site fixed to reflect end-user situations.

Chip power efficienty: 6.6W/GHs @ 1.15 V
Module power consumption: 149W @ 20GHs
machine power consumption: 595W @ 220V-AC | 620W @ 120V-AC

Elokane
Hero Member
*****
Offline Offline

Activity: 817
Merit: 1000


Truth is a consensus among neurons www.synereo.com


View Profile WWW
February 03, 2013, 10:25:03 AM
 #116

Still having some amount of reliability problems.  Moving the miner in the office helped a great deal.  However, still seeing machine restarts for unknown reasons (note: must ping support, inquire about reboot logging).

And right now, the miner is in a strange state where the controller (running Linux) and cgminer are alive, but fans are not running, and no work is occurring:

And the pool agrees that no shares are incoming.


How often does it restart?
How much trouble is it to start mining again once it does?


Synereo: liberating the Internet from abusive business models.

Beware of he who would deny you access to information, for in his heart, he dreams himself your master.
<br>
Icoin
Hero Member
*****
Offline Offline

Activity: 585
Merit: 501



View Profile WWW
February 03, 2013, 11:22:28 AM
Last edit: February 03, 2013, 04:23:48 PM by Icoin
 #117

DVB allready started to build up the team for GPL AVALON Watercooling elements production. I see heat as a major problem for ASIC devices in general, this heat probably causes the errors, more noise and higher power consumption, aswell the need for the restart. We will investigate, with a thermo camera, as soon the AVALON trade-in harware arrives here in Switzerland.

https://bitcointalk.org/index.php?topic=101559.msg1493823#msg1493823(

DVB is a DEVCOIN (DEVELOPERS COIN) based and we guys use for development devcoins in between us.

Jeff or BitSyncom: can you make heat pictures from your AVALON? It will show us the heat distribution among the device.

BitSyncom: i guess your shop system will be fixed soon, and we gonna be able to make the trade-in order today Smiley

Thanks

Icoin

SLok
Hero Member
*****
Offline Offline

Activity: 568
Merit: 500


View Profile
February 03, 2013, 04:07:56 PM
 #118

Chip's power consumption is <6.6W/Ghs

ATX power loses(82%), DC/DC power loses(87%), Fan(5~20w), etc…

So…

650W * 0.82 * 0.87 - 20W = 443W, About right.

so an equivalent bfl number would be something like:

60w / .87 / .82 + 2W (fan) + 5W (host) = 91 W    (host can be discounted for many since it can run several units)

or "bfl's worst case" 60W * 1.2 = 72W ===> 72w / .87 / .82 + 2W (fan) + 5W (host) = ~108W


BFL's single uses 72W at the wall, there is no ATX psu or DC/DC or extra for fans etc, just the host usage to be added. All speculating of course coz it ain't alive yet, but it is definitely no 620W, or 108W.

@jgarzik, have you ran it without the ups to see if that isn't messing up? Checked/reseated internal connections, it has been on a long and bumpy ride?

WARNING! Don't trade BTC with Bruno Kucinskas aka Gleb Gamow, Phinnaeus Gage, etc Laundering BTC from anonymous sellers, avoid!https://bitcointalk.org/index.php?topic=649176.msg7279994#msg7279994 #TELLFBI #TELLKSAG #TELLIRS WARNING! Darin M. Bicknell, a proclaimed atheist, teaching at the Jakarta CanadianMontessori School. Drop your kids there at your own risk! WARNING! Christian Otzipka - Hildesheim is a known group-buy scammer, avoid! WARNING! Frizz Supertramp, faker with dozens of accounts here! WARNING! Christian "2 coins to see SLOk's" Antkow, still playing his little microphone...WARNING! Slobodan "Stolen Valor" Bogovac, faking being a ProfessorWARNING!Marion Sydney Lynn, google him, errr her, errr.. and lol
MrTeal
Legendary
*
Offline Offline

Activity: 1274
Merit: 1004


View Profile
February 03, 2013, 04:23:45 PM
 #119

Chip's power consumption is <6.6W/Ghs

ATX power loses(82%), DC/DC power loses(87%), Fan(5~20w), etc…

So…

650W * 0.82 * 0.87 - 20W = 443W, About right.

so an equivalent bfl number would be something like:

60w / .87 / .82 + 2W (fan) + 5W (host) = 91 W    (host can be discounted for many since it can run several units)

or "bfl's worst case" 60W * 1.2 = 72W ===> 72w / .87 / .82 + 2W (fan) + 5W (host) = ~108W


BFL's single uses 72W at the wall, there is no ATX psu or DC/DC or extra for fans etc, just the host usage to be added. All speculating of course coz it ain't alive yet, but it is definitely no 620W, or 108W.

@jgarzik, have you ran it without the ups to see if that isn't messing up? Checked/reseated internal connections, it has been on a long and bumpy ride?

There is an onboard DC/DC module that converts the 12V to 1.2V or whatever core voltage the chips are running at, along with whatever they're using for IO.
Quote from: BFL_Josh
We have been quoting our power as power at the wall, not the chip power. If we want to talk chip power, it's .8w GH, so using the Avalon system of measure, 48w per Single, or 24w for a Little Single.

Even if we have worst case scenario, it's 1.2w per GH and that gives you 72w via the Avalon measuring system and probably somewhere around 80w - 85w at the wall. We are not expecting 1.2w GH/s, that is just the upper limit of the power draw per chip under the worst possible conditions.

Earlier Josh had posted that they were aiming for 0.8J/GH and that the possible worst case was 1.2J/GH. I'm a little confused by this too as BFL has always said that 60W would be at the wall with the included power adapter, but if 0.8J/GH is just the chips you will have to factor in both the onboard convertor losses, other draws (microcontroller, fan, etc) and the brick PSU to get the final power draw.

Hopefully we'll know the answer in a month's time.
Aseras
Hero Member
*****
Offline Offline

Activity: 658
Merit: 500


View Profile
February 03, 2013, 04:29:05 PM
 #120

DVB allready started to build up the team for GPL AVALON Watercooling elements production. I see heat as a major problem for ASIC devices in general, this heat probably causes the errors, more noise and higher power consumption, aswell the need for the restart. We will investigate, with a thermo camera, as soon the AVALON trade-in harware arrives here in Switzerland.

https://bitcointalk.org/index.php?topic=101559.msg1493823#msg1493823(

DVB is a DEVCOIN (DEVELOPERS COIN) based and we guys use for development devcoins in between us.

Jeff or BitSyncom: can you make heat pictures from your AVALON? It will show us the heat distribution among the device.

BitSyncom: i guess your shop system will be fixed soon, and we gonna be able to make the trade-in order today Smiley

Thanks

Icoin

I can do thermal imaging of my units when they arrive. I'm in the first batch expecting them soon.
Pages: « 1 2 3 4 5 [6] 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 ... 221 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!