Bitcoin Forum
December 12, 2024, 10:35:11 AM *
News: Latest Bitcoin Core release: 28.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1]
  Print  
Author Topic: S9 Hashboard keeps reseting  (Read 195 times)
frozenwave11 (OP)
Newbie
*
Offline Offline

Activity: 4
Merit: 0


View Profile
April 05, 2020, 12:17:42 PM
Last edit: April 06, 2020, 12:32:03 AM by frodocooper
 #1

Hello. can somebody help me, I'm having S9 with latest stock firmware, but one of the hashboard works shows normal GH/S(RT) and after few minutes 0. and thats keep going all day. i have tried different firmwares, unplugged and replugged all the cables a but the result remained same.

Here is log(its divided into 3 links due to the size) and pictures.

https://imgur.com/a/bbLqFf8
https://pastebin.com/ZLAJynQk
https://pastebin.com/BLVznWvr
https://pastebin.com/u6aJw7Hr
philipma1957
Legendary
*
Offline Offline

Activity: 4340
Merit: 9013


'The right to privacy matters'


View Profile WWW
April 05, 2020, 12:50:54 PM
Last edit: April 06, 2020, 12:32:42 AM by frodocooper
 #2

okay  I don't know why the board drops off.

But try this,

Board    Board      Board
   1           2            3

pull  the 3 pcie power cables and pull the data cable  on board  1 and board 2

Run just board 3 for an hour see if it works  the whole hour

Next run just board 2  for an hour  see if it works for an hour

last run just  board 1 for an hour  see if it works for an hour.

once you do this  let us know  what happened.

I suspect  one will not work.  let us know if all work alone or if 1 does not work alone.

▄▄███████▄▄
▄██████████████▄
▄██████████████████▄
▄████▀▀▀▀███▀▀▀▀█████▄
▄█████████████▄█▀████▄
███████████▄███████████
██████████▄█▀███████████
██████████▀████████████
▀█████▄█▀█████████████▀
▀████▄▄▄▄███▄▄▄▄████▀
▀██████████████████▀
▀███████████████▀
▀▀███████▀▀
.
 MΞTAWIN  THE FIRST WEB3 CASINO   
.
.. PLAY NOW ..
mikeywith
Legendary
*
Offline Offline

Activity: 2450
Merit: 6651


be constructive or S.T.F.U


View Profile
April 06, 2020, 06:28:11 AM
 #3

Run just board 3 for an hour see if it works  the whole hour

Next run just board 2  for an hour  see if it works for an hour

last run just  board 1 for an hour  see if it works for an hour.

There is really no need for all that, line 463 in the kernel logs tells the whole story

Code:
driver-btm-c5.c:15886:re_open_core: Chain[J6] has 63 asic
driver-btm-c5.c:15886:re_open_core: Chain[J7] has 63 asic
driver-btm-c5.c:15886:re_open_core: Chain[J8] has 0 asic

Chain J8 ( most left ) is failing, in general, you can't do anything about it but watch it toast even more, soon enough it could start making the miner restart itself every now and then - and thus affecting the performance of the two working boards J6 and J7, however, there is hope and that would be by trying a different firmware such as BriianOs for Vnish (Asic.to / Awoesmeminer), and you want to play with the frequency of the bad board, start by going really low, test it for say 30 mins and then go higher slowly, sometimes increasing the voltage helps but keep that as your last resort.

if all the above fails, you should simply discount the bad board, and then separate the two boards, put on to the left and one to right (keep the middle empty) this way you ensure the boards get better cooling since the board in the middle usually runs hotter than the once on the sides.

█▀▀▀











█▄▄▄
▀▀▀▀▀▀▀▀▀▀▀
e
▄▄▄▄▄▄▄▄▄▄▄
█████████████
████████████▄███
██▐███████▄█████▀
█████████▄████▀
███▐████▄███▀
████▐██████▀
█████▀█████
███████████▄
████████████▄
██▄█████▀█████▄
▄█████████▀█████▀
███████████▀██▀
████▀█████████
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
c.h.
▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄
▀▀▀█











▄▄▄█
▄██████▄▄▄
█████████████▄▄
███████████████
███████████████
███████████████
███████████████
███░░█████████
███▌▐█████████
█████████████
███████████▀
██████████▀
████████▀
▀██▀▀
thierry4wd
Sr. Member
****
Offline Offline

Activity: 446
Merit: 347



View Profile WWW
April 06, 2020, 08:32:39 AM
 #4

hi, i look your kernel log, Mikeywith as true, and for me, the hashboard need more power, is possible at 668mhz and 850 / 860mv the hashboard run ok for only litle time and after showing 0 GH/S ...

I thinks need more power, try my mod, is fixed value, solve some problem, here : https://bitcointalk.org/index.php?topic=5127323.0

install my mod, and try 675mhz, at 900mv at first time, and see here your result, and if solved your problem Wink

frozenwave11 (OP)
Newbie
*
Offline Offline

Activity: 4
Merit: 0


View Profile
April 06, 2020, 09:00:02 AM
 #5

Yes I agree it must be board 3.
It kind set itself. From last night (21:00 approx.) I had 0 restarts, CRC error counter remained the same and no hash rate oscillations on all boards.

I believe that auto frequency found best setting for that board which is 8.9 Volt and 656 Mhz.

Since the seller agreed to send me a new board for free. Is it safe to run miner with all 3 boards or is better to disconnect  faulty one?
mikeywith
Legendary
*
Offline Offline

Activity: 2450
Merit: 6651


be constructive or S.T.F.U


View Profile
April 06, 2020, 03:57:42 PM
 #6

I believe that auto frequency found best setting for that board which is 8.9 Volt and 656 Mhz.

Since the seller agreed to send me a new board for free. Is it safe to run miner with all 3 boards or is better to disconnect  faulty one?

It is safe, no physical damage can be caused by the faulty dashboard, it might, however, cause the mine to restart every now and then which means financial damage.

I would still try to fix the board, I mean some time in the future another board will surely stop hashing, so it's good to have a spare, and your isn't completely dead since it shows all 63 basics at certain points, I suggest you try to play with the frequency and voltage in an attempt to get it to work, you can try one of the firmware I told you about or thierry4wd's suggestion, and then you can easily roll back to your desired firmware.

█▀▀▀











█▄▄▄
▀▀▀▀▀▀▀▀▀▀▀
e
▄▄▄▄▄▄▄▄▄▄▄
█████████████
████████████▄███
██▐███████▄█████▀
█████████▄████▀
███▐████▄███▀
████▐██████▀
█████▀█████
███████████▄
████████████▄
██▄█████▀█████▄
▄█████████▀█████▀
███████████▀██▀
████▀█████████
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
c.h.
▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄
▀▀▀█











▄▄▄█
▄██████▄▄▄
█████████████▄▄
███████████████
███████████████
███████████████
███████████████
███░░█████████
███▌▐█████████
█████████████
███████████▀
██████████▀
████████▀
▀██▀▀
thierry4wd
Sr. Member
****
Offline Offline

Activity: 446
Merit: 347



View Profile WWW
April 06, 2020, 11:19:32 PM
Last edit: April 07, 2020, 10:30:36 AM by thierry4wd
 #7

Of corse Wink in my experience, run miner by autotune firmware is not very good, so, i run all my miner by FIXED mod, and never problem ! and i have purchass some miner by same problem of you, and solved by my mod fixed Wink

Just try it, is very simply to install (by webgui), if you don't satisfied, just reflash other firmware you want.


Edit : if you want return at any firmware, never flash 2019 firmware, is locked...

frozenwave11 (OP)
Newbie
*
Offline Offline

Activity: 4
Merit: 0


View Profile
April 07, 2020, 09:43:27 AM
 #8

I have switched, bad board to middle position (board 6), also mixed power cables just to be sure that the problem isn't in PSU, also switched to Braiins-os.

and still have the same problem. Now middle board keeps dropping to 0.
here is the log:
https://pastebin.com/TRcTBPea

I would like to try thierry4wd's firmware but for patch link is expired.
mikeywith
Legendary
*
Offline Offline

Activity: 2450
Merit: 6651


be constructive or S.T.F.U


View Profile
April 07, 2020, 09:45:54 PM
 #9

I have switched, bad board to middle position (board 6), also mixed power cables just to be sure that the problem isn't in PSU,

We always tell people to switch ribbon cables and 6pin PSU cables, that's like the ABC for troubleshooting a bad board, but I very rarely hear from someone that a ribbon cable or a 6pin cable caused the problem, it's almost always the hashboard itself.

Quote
also switched to Braiins-os.

Switching to BraiinOS alone doesn't fix the problem, you should lower the frequency / increase the voltage or both,  thierry4wd will assist you better in this regards, he probably knows some exact figures to use, but you can start by lowering the frequency to below 500mhz, you can do that from the setting in the GUI of BraiinOS.

Quote

Kernel log is incomplete, it's missing the part where it counts board's Asics.

█▀▀▀











█▄▄▄
▀▀▀▀▀▀▀▀▀▀▀
e
▄▄▄▄▄▄▄▄▄▄▄
█████████████
████████████▄███
██▐███████▄█████▀
█████████▄████▀
███▐████▄███▀
████▐██████▀
█████▀█████
███████████▄
████████████▄
██▄█████▀█████▄
▄█████████▀█████▀
███████████▀██▀
████▀█████████
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
c.h.
▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄
▀▀▀█











▄▄▄█
▄██████▄▄▄
█████████████▄▄
███████████████
███████████████
███████████████
███████████████
███░░█████████
███▌▐█████████
█████████████
███████████▀
██████████▀
████████▀
▀██▀▀
frozenwave11 (OP)
Newbie
*
Offline Offline

Activity: 4
Merit: 0


View Profile
April 08, 2020, 10:22:45 AM
Last edit: April 09, 2020, 12:15:10 AM by frodocooper
 #10

This is log with thierry4wd's firmware, also had the same issue: https://pastebin.com/KY7VkBrN

https://imgur.com/a/aQlWI8r
thierry4wd
Sr. Member
****
Offline Offline

Activity: 446
Merit: 347



View Profile WWW
April 08, 2020, 07:58:52 PM
 #11

So , is not a problem by power ...

your problem is cleary by temp sensor on chaine 6 (midle hashboard) :

read failed on Chain[6] Chip[62] middle Temp old value:15
Special fix Chain[6] Chip[62] middle Temp = 15
Done read temp on Chain[6]

this value is say by defective sensor ... this hashboard need repaire Smiley

But, is possible for resolve, please try unplug data cable on the hashboard and clean up ... (all at power off)

Or, on my mode, i have unlock security of over head, please try this without this ! (Stop running when temprerature pcb is over 90) ... if possible run good ! (but dangerous)

Pages: [1]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!