Bitcoin Forum
May 11, 2024, 12:18:19 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1] 2 3 4 5 »  All
  Print  
Author Topic: [FIXED] Avalon URGENT ISSUE: both of my avalon do not work any more  (Read 11845 times)
libertybuck (OP)
Full Member
***
Offline Offline

Activity: 137
Merit: 100


View Profile
March 01, 2013, 10:59:07 PM
Last edit: March 02, 2013, 05:21:59 AM by libertybuck
 #1

Today morning I login btcguild to check my avalon. To my surprise I find both of my avalon IDLE there with mining speed 0.  Shocked



I am not sure what happened so I login openwrt console and run the following command:

Code:
root@OpenWrt:~# /etc/init.d/cgminer stop
no cgminer found; none killed


Code:
root@OpenWrt:~# 
root@OpenWrt:~# /etc/init.d/cgminer start
ntpd: resolved peer 3.openwrt.pool.ntp.org to 202.112.31.197
ntpd: sent query to 202.112.31.197
ntpd: resolved peer 2.openwrt.pool.ntp.org to 202.118.1.130
ntpd: sent query to 202.118.1.130
ntpd: resolved peer 1.openwrt.pool.ntp.org to 202.112.29.82
ntpd: sent query to 202.112.29.82
ntpd: resolved peer 0.openwrt.pool.ntp.org to 218.75.4.130
ntpd: sent query to 218.75.4.130
ntpd: reply from 202.112.31.197: reach 0x01 offset -0.325146 delay 0.656282 status 0x24 strat 2 refid 0xca76012e rootdelay 0.049805
ntpd: reply from 202.118.1.130: reach 0x01 offset -0.275834 delay 0.559704 status 0x24 strat 2 refid 0xca76012e rootdelay 0.000137
ntpd: reply from 202.112.29.82: reach 0x01 offset -0.178994 delay 0.365631 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: reply from 218.75.4.130: reach 0x01 offset -0.003093 delay 0.060521 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x03 offset -0.009608 delay 0.048428 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: sent query to 202.112.31.197
ntpd: sent query to 202.118.1.130
ntpd: sent query to 202.112.29.82
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x07 offset -0.007173 delay 0.051018 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: reply from 202.112.31.197: reach 0x03 offset -0.026169 delay 0.058811 status 0x24 strat 2 refid 0xca76012e rootdelay 0.049805
ntpd: reply from 202.118.1.130: reach 0x03 offset -0.025689 delay 0.060743 status 0x24 strat 2 refid 0xca76012e rootdelay 0.000137
ntpd: reply from 202.112.29.82: reach 0x03 offset -0.024491 delay 0.062413 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: sent query to 202.112.29.82
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x0f offset -0.006441 delay 0.050414 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: reply from 202.112.29.82: reach 0x07 offset -0.022051 delay 0.057496 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: sent query to 202.112.31.197
ntpd: sent query to 202.118.1.130
ntpd: sent query to 202.112.29.82
ntpd: reply from 202.118.1.130: reach 0x07 offset -0.022068 delay 0.058948 status 0x24 strat 2 refid 0xca76012e rootdelay 0.000137
ntpd: reply from 202.112.31.197: reach 0x07 offset -0.024856 delay 0.061325 status 0x24 strat 2 refid 0xca76012e rootdelay 0.049805
ntpd: reply from 202.112.29.82: reach 0x0f offset -0.020955 delay 0.061241 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: sent query to 202.118.1.130
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x1f offset -0.007505 delay 0.047845 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
ntpd: reply from 202.118.1.130: reach 0x0f offset -0.023846 delay 0.054938 status 0x24 strat 2 refid 0xca76012e rootdelay 0.000137
ntpd: sent query to 202.112.31.197
ntpd: sent query to 202.112.29.82
ntpd: reply from 202.112.31.197: reach 0x0f offset -0.019475 delay 0.062804 status 0x24 strat 2 refid 0xca76012e rootdelay 0.049805
ntpd: reply from 202.112.29.82: reach 0x1f offset -0.020469 delay 0.069275 status 0x24 strat 2 refid 0x7bc773cb rootdelay 0.048386
ntpd: sent query to 202.112.31.197
ntpd: sent query to 202.118.1.130
ntpd: sent query to 218.75.4.130
ntpd: reply from 218.75.4.130: reach 0x3f offset -0.006091 delay 0.052049 status 0x24 strat 2 refid 0xd1510907 rootdelay 0.171008
root@OpenWrt:~#

And then I run ps command still could not find cgminer running.

I am not sure if it has relation with btcguild pool so I change pool to ozco and run the above commands again. Still nothing luck.


Now, I run the following command:

Code:
root@OpenWrt:~# cgminer -S /dev/ttyUSB0 -o http://us.ozco.in:8331 -O xxxxxx.0:yyyyy--avalon-options 115200:24:10:45:282
 [2013-03-01 22:47:22] Started cgminer 2.10.4                    
 [2013-03-01 22:47:22] Avalon: Reset succeeded                    
 [2013-03-01 22:47:22] Probing for an alive pool                    
 Bus error

root@OpenWrt:~#

Base on the above fact I am afraid both of my two avalon are broken.
How stange it is !  
Both of them broken at the same time !
My avalons run well for the past 14 days and then crash at the same time !

Any one could understand it ?

Because I fail to contact nzhang via email and phone call so I have to post topic here.  

YiFu, nzhang, xiangfu, anyone of your three please contact with me via phone call or email I will prepare teamviewer for you with it you could login my avalon to have a check.

Urget help needed!


1715429899
Hero Member
*
Offline Offline

Posts: 1715429899

View Profile Personal Message (Offline)

Ignore
1715429899
Reply with quote  #2

1715429899
Report to moderator
1715429899
Hero Member
*
Offline Offline

Posts: 1715429899

View Profile Personal Message (Offline)

Ignore
1715429899
Reply with quote  #2

1715429899
Report to moderator
1715429899
Hero Member
*
Offline Offline

Posts: 1715429899

View Profile Personal Message (Offline)

Ignore
1715429899
Reply with quote  #2

1715429899
Report to moderator
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1715429899
Hero Member
*
Offline Offline

Posts: 1715429899

View Profile Personal Message (Offline)

Ignore
1715429899
Reply with quote  #2

1715429899
Report to moderator
Beepbop
Full Member
***
Offline Offline

Activity: 126
Merit: 100



View Profile
March 01, 2013, 11:04:49 PM
 #2

You one of the Chinese custormers, right?

Silly of me to ask maybe, but have you tried turning it off and on again? Did you get the firmware update that jgarzik got?
darkip
Full Member
***
Offline Offline

Activity: 160
Merit: 100


View Profile
March 01, 2013, 11:06:27 PM
 #3

Silly of me to ask maybe, but have you tried turning it off and on again?

Unlikely both have failed at exactly the same time. More likely is you started them at the same time and some sort of memory leak / OS error is causing this. I expect a restart will probably fix the problem.
kaerf
Hero Member
*****
Offline Offline

Activity: 631
Merit: 500


View Profile
March 01, 2013, 11:06:53 PM
 #4

check the cable connections inside the box too
libertybuck (OP)
Full Member
***
Offline Offline

Activity: 137
Merit: 100


View Profile
March 01, 2013, 11:07:51 PM
 #5

You one of the Chinese custormers, right?

Yes.


have you tried turning it off and on again?

Sure.  I tried reboot command under openwrt console several times. Each time after avalon rebooted with ps command I could find cgminer dispear soon.

Beepbop
Full Member
***
Offline Offline

Activity: 126
Merit: 100



View Profile
March 01, 2013, 11:09:05 PM
 #6

Do you not have physical access, or a remote controlled PDU? Sometimes hard reset (power off, and on again) might be needed.
libertybuck (OP)
Full Member
***
Offline Offline

Activity: 137
Merit: 100


View Profile
March 01, 2013, 11:10:27 PM
 #7


Unlikely both have failed at exactly the same time. More likely is you started them at the same time and some sort of memory leak / OS error is causing this. I expect a restart will probably fix the problem.

Sir, I run them at 14 days before at the same time. And based on the above btcguild picture we could see they do not mine at the same time.

I just restarted them many times nothing turns good.

libertybuck (OP)
Full Member
***
Offline Offline

Activity: 137
Merit: 100


View Profile
March 01, 2013, 11:11:53 PM
 #8

check the cable connections inside the box too

Sir, they are two avalon located in different rooms. No one touch them because they are all working well last night before I went to sleep.

SellingMyGPUs
Newbie
*
Offline Offline

Activity: 55
Merit: 0


View Profile
March 01, 2013, 11:12:21 PM
 #9

My avalons run well for the past 14 days and then crash at the same time !
You had 2 working avalon for 14 days and thought was not worth just mentioning that in the relevant threads where people are begging for delivery info? Wow... that is going to score karma points...

anyways, a bus error might indicate a core dump due to many reasons, a change in data supplied (increase in difficulty to 4*10ˆ6) or over heating

Can you point it to another pool? Turn one machine off and let it cool down for an hour?

What does dmesg give you as unusual errors?
libertybuck (OP)
Full Member
***
Offline Offline

Activity: 137
Merit: 100


View Profile
March 01, 2013, 11:13:28 PM
 #10

Do you not have physical access, or a remote controlled PDU? Sometimes hard reset (power off, and on again) might be needed.

Nothing.  Anyway I will unplug power cable and plug it again to have a check now.

Beepbop
Full Member
***
Offline Offline

Activity: 126
Merit: 100



View Profile
March 01, 2013, 11:15:00 PM
 #11

ATX power supplies always provide power on some of their rails unless they're physically disconnected or have a hard switch. So I really hope this is just a hardware controller hanging (inconsistent state cleared by power reset) or something like that.
darkip
Full Member
***
Offline Offline

Activity: 160
Merit: 100


View Profile
March 01, 2013, 11:17:45 PM
 #12

Hopefully this is not related to the comment in their most recent email:

Quote from: Avalon email
Lessons learned. Batch #2 process will have improvements, minor design adjustments and other goodies.
libertybuck (OP)
Full Member
***
Offline Offline

Activity: 137
Merit: 100


View Profile
March 01, 2013, 11:21:27 PM
 #13

You had 2 working avalon for 14 days and thought was not worth

I have to mention 2 avalon because they crash at the very same time. I feel it quite abnormal.


anyways, a bus error might indicate a core dump due to many reasons, a change in data supplied (increase in difficulty to 4*10ˆ6) or over heating

I am afraid it is hardware related issue.


Can you point it to another pool?

Sure. I mentioned it. I tested them with ozco nothing different.


Turn one machine off and let it cool down for an hour?

Yes.


What does dmesg give you as unusual errors?

Nothing useful. Seems avalon does not output error message to system log.

libertybuck (OP)
Full Member
***
Offline Offline

Activity: 137
Merit: 100


View Profile
March 01, 2013, 11:23:14 PM
 #14

ATX power supplies always provide power on some of their rails unless they're physically disconnected or have a hard switch. So I really hope this is just a hardware controller hanging (inconsistent state cleared by power reset) or something like that.

Sir, I unplug power cable from wall and plug it again after a while. Nothing help.

Beepbop
Full Member
***
Offline Offline

Activity: 126
Merit: 100



View Profile
March 01, 2013, 11:24:02 PM
 #15

Firmware question:
Did you get the firmware update that jgarzik got?
PuertoLibre
Legendary
*
Offline Offline

Activity: 1834
Merit: 1003


View Profile
March 01, 2013, 11:31:24 PM
 #16

ATX power supplies always provide power on some of their rails unless they're physically disconnected or have a hard switch. So I really hope this is just a hardware controller hanging (inconsistent state cleared by power reset) or something like that.

Sir, I unplug power cable from wall and plug it again after a while. Nothing help.
What temperature is the room at when you ran the devices?
kaerf
Hero Member
*****
Offline Offline

Activity: 631
Merit: 500


View Profile
March 01, 2013, 11:38:51 PM
 #17

Firmware question:
Did you get the firmware update that jgarzik got?


doesn't look like it. he is on cgminer 2.10.4.
CrazyGuy
Legendary
*
Offline Offline

Activity: 1973
Merit: 1007



View Profile
March 01, 2013, 11:40:45 PM
 #18

Same time different rooms, maybe network issue? Don't they come with a static ip? Perhaps they are trying to use the same address? Maybe try running only one of them. Or, send one to my house and I'll let you know if it still works.

ASICPuppy.net ASIC Mining Hardware and Accessories - Compac F in stock!
Beepbop
Full Member
***
Offline Offline

Activity: 126
Merit: 100



View Profile
March 01, 2013, 11:42:53 PM
 #19

Things to try
  • Get the latest firmware from jgarzik or some other Avalon customer who downloaded it
  • Check if it's related to change in difficulty. Are you mining with stratum and high enough difficulty?
kaerf
Hero Member
*****
Offline Offline

Activity: 631
Merit: 500


View Profile
March 01, 2013, 11:44:23 PM
 #20

I have to mention 2 avalon because they crash at the very same time. I feel it quite abnormal.

...

I am afraid it is hardware related issue.


Since they're failing at the same time, I would actually think it's a software issue. Very unlikely that hardware would fail at the same time. However, if it's software, the same problem could happen to both machines at the same time.
Pages: [1] 2 3 4 5 »  All
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!