af_newbie
Legendary
Offline
Activity: 2702
Merit: 1468
|
|
December 12, 2013, 01:44:48 PM |
|
OK yesterday(today) I was up till 4:00am... One board do work the rest don't... No idea why... MrTeal also didn't give me the answer...
Today we redid x-ray of the boards and compeer it visually... It is the same... We also made some boards with the ather board just to be sure that the board is not to blame... They act the same way...
Board that is working has only about 60% of engines working. So it runs at about 23-24GH(not sure if that is related). Others have almost all working and they run for some time then they crash... It looks like it always happens at same chips voltage... Some crash complicity others start producing HW errors and chip voltage go to 0V...
Will need MrTeal to figure out... It would make sense if none would be working but this is not the case... All were made from same batch of components the same way and only difference we can find is number of working engines on chips...
Lucko, Chips are from the same batch as Coindo 16 chips board?
|
|
|
|
Lucko (OP)
|
|
December 12, 2013, 01:53:51 PM |
|
Lucko,
Chips are from the same batch as Coindo 16 chips board?
No... It was one of the ideas I had... I will probably send one board to MrTeal as soon as he say he is OK with that. The other is now going on oscilloscope to look for a problem... But personally I think it is ether problem with one of the components or firmware... EDIT: By component I was thinking batch of components made with error...
|
|
|
|
bx8389
Member
Offline
Activity: 80
Merit: 10
|
|
December 12, 2013, 01:56:39 PM |
|
OK yesterday(today) I was up till 4:00am... One board do work the rest don't... No idea why... MrTeal also didn't give me the answer...
Today we redid x-ray of the boards and compeer it visually... It is the same... We also made some boards with the ather board just to be sure that the board is not to blame... They act the same way...
Board that is working has only about 60% of engines working. So it runs at about 23-24GH(not sure if that is related). Others have almost all working and they run for some time then they crash... It looks like it always happens at same chips voltage... Some crash complicity others start producing HW errors and chip voltage go to 0V...
Will need MrTeal to figure out... It would make sense if none would be working but this is not the case... All were made from same batch of components the same way and only difference we can find is number of working engines on chips...
Can be a heat related problem - probably on DC-DC converter ? If all temperatures are OK I would check next decoupling capacitors and DC converter filtering capacitors: with so high currents flowing capacitor impedance is important to overall stability and usually is not primary concern buying components
|
|
|
|
Lucko (OP)
|
|
December 12, 2013, 05:24:30 PM |
|
|
|
|
|
capa
|
|
December 12, 2013, 05:54:50 PM |
|
yay! thats Awesome Lucko great work. Keep it up, we are all rooting for you
|
|
|
|
GrapeApe
|
|
December 12, 2013, 06:01:42 PM |
|
At the end of the video when it crashes and stops hashing what is the message it repeats over and over? Also I hope that loud crack wasn't coming from the chili.
|
|
|
|
dwdoc
Legendary
Offline
Activity: 966
Merit: 1000
- - -Caveat Aleo- - -
|
|
December 12, 2013, 06:49:46 PM |
|
Looks like progress.
|
|
|
|
Lucko (OP)
|
|
December 12, 2013, 07:47:02 PM |
|
At the end of the video when it crashes and stops hashing what is the message it repeats over and over? Also I hope that loud crack wasn't coming from the chili.
No it wasn't... It is glass table and I guess it was my zipper hitting it... [2013-12-12 20:40:32] Started cgminer 3.8.5 [2013-12-12 20:40:34] Probing for an alive pool [2013-12-12 20:40:36] Pool 0 difficulty changed to 2 [2013-12-12 20:40:37] Network diff set to 908M [2013-12-12 20:40:38] Accepted 3174e6ec Diff 5/2 BAL 0 [2013-12-12 20:40:38] Reconnect requested from pool 0 to eu-stratum-lb489kj.btcguild.com:3333 [2013-12-12 20:40:38] Pool 0 stratum share submission failure [2013-12-12 20:40:39] Pool 0 difficulty changed to 2 [2013-12-12 20:40:39] Pool 0 difficulty changed to 4 [2013-12-12 20:40:39] Stratum from pool 0 requested work restart [2013-12-12 20:40:43] Pool 0 communication resumed, submitting work [2013-12-12 20:40:44] Rejected 7cdb7cfa Diff 2/2 BAL 0 (unknown) [2013-12-12 20:40:45] Rejected 7ba7095d Diff 2/2 BAL 0 (unknown) [2013-12-12 20:40:45] Accepted 1ce88b42 Diff 9/4 BAL 0 [2013-12-12 20:40:45] Accepted 2c7b1dfc Diff 6/4 BAL 0 [2013-12-12 20:40:45] Accepted 33a09445 Diff 5/4 BAL 0 [2013-12-12 20:40:48] Accepted 08ccc023 Diff 29/4 BAL 0 [2013-12-12 20:40:50] Accepted 31abe929 Diff 5/4 BAL 0 [2013-12-12 20:40:51] Accepted 2cabcef9 Diff 6/4 BAL 0 [2013-12-12 20:40:51] Accepted 29371e1c Diff 6/4 BAL 0 [2013-12-12 20:40:52] Accepted 01888088 Diff 167/4 BAL 0 [2013-12-12 20:40:54] Accepted 20d8c763 Diff 8/4 BAL 0 [2013-12-12 20:40:55] Accepted 220a6257 Diff 8/4 BAL 0 [2013-12-12 20:40:55] Accepted 16672733 Diff 11/4 BAL 0 [2013-12-12 20:40:55] Accepted 0fe687e3 Diff 16/4 BAL 0 [2013-12-12 20:40:56] Accepted 2b01dbb4 Diff 6/4 BAL 0 [2013-12-12 20:40:58] Accepted 22f92408 Diff 7/4 BAL 0 [2013-12-12 20:40:59] Accepted 17e9d090 Diff 11/4 BAL 0 [2013-12-12 20:40:59] Accepted 070d248c Diff 36/4 BAL 0 [2013-12-12 20:40:59] Accepted dd5e1b9e Diff 296/4 BAL 0 [2013-12-12 20:41:00] Accepted 1844612b Diff 11/4 BAL 0 [2013-12-12 20:41:00] Accepted 01ec72a4 Diff 133/4 BAL 0 [2013-12-12 20:41:03] Accepted 1f1ed030 Diff 8/4 BAL 0 [2013-12-12 20:41:03] Accepted 34657f65 Diff 5/4 BAL 0 [2013-12-12 20:41:03] Accepted 39952efc Diff 4/4 BAL 0 [2013-12-12 20:41:03] Accepted 15327dfe Diff 12/4 BAL 0 [2013-12-12 20:41:03] Accepted 20ff6ce7 Diff 8/4 BAL 0 [2013-12-12 20:41:04] Accepted 06336317 Diff 41/4 BAL 0 [2013-12-12 20:41:04] Accepted 114b7a07 Diff 15/4 BAL 0 [2013-12-12 20:41:04] Accepted 2993ff80 Diff 6/4 BAL 0 [2013-12-12 20:41:04] Accepted 2e43d328 Diff 6/4 BAL 0 [2013-12-12 20:41:04] Accepted 30c5e121 Diff 5/4 BAL 0 [2013-12-12 20:41:05] Accepted 25f4e0cd Diff 7/4 BAL 0 [2013-12-12 20:41:10] Accepted 2186a67a Diff 8/4 BAL 0 [2013-12-12 20:41:11] Accepted 078f564c Diff 34/4 BAL 0 [2013-12-12 20:41:11] Accepted 15c2277d Diff 12/4 BAL 0 [2013-12-12 20:41:13] Accepted 09a0cf86 Diff 27/4 BAL 0 [2013-12-12 20:41:13] Accepted 3dc2b7ee Diff 4/4 BAL 0 [2013-12-12 20:41:14] Accepted 294817df Diff 6/4 BAL 0 [2013-12-12 20:41:15] BAL 0 RequestResults usb write err:(-1) LIBUSB_ERROR_IO [2013-12-12 20:41:15] BAL 0 attempted reset got err:(0) LIBUSB_SUCCESS [2013-12-12 20:41:15] BAL 0 RequestResults usb write err:(-1) LIBUSB_ERROR_IO [2013-12-12 20:41:15] BAL 0 attempted reset got err:(0) LIBUSB_SUCCESS [2013-12-12 20:41:15] BAL 0 RequestResults usb write err:(-1) LIBUSB_ERROR_IO [2013-12-12 20:41:15] BAL 0 attempted reset got err:(0) LIBUSB_SUCCESS
|
|
|
|
GrapeApe
|
|
December 12, 2013, 08:02:56 PM |
|
At the end of the video when it crashes and stops hashing what is the message it repeats over and over? Also I hope that loud crack wasn't coming from the chili.
No it wasn't... It is glass table and I guess it was my zipper hitting it... Yeah I was joking thanks for the response I was just curious as to what it was saying.
|
|
|
|
Mudbankkeith
|
|
December 12, 2013, 09:05:12 PM |
|
At the end of the video when it crashes and stops hashing what is the message it repeats over and over? Also I hope that loud crack wasn't coming from the chili.
No it wasn't... It is glass table and I guess it was my zipper hitting it... [2013-12-12 20:40:32] Started cgminer 3.8.5 [2013-12-12 20:40:34] Probing for an alive pool [2013-12-12 20:40:36] Pool 0 difficulty changed to 2 [2013-12-12 20:40:37] Network diff set to 908M [2013-12-12 20:40:38] Accepted 3174e6ec Diff 5/2 BAL 0 [2013-12-12 20:40:38] Reconnect requested from pool 0 to eu-stratum-lb489kj.btcguild.com:3333 [2013-12-12 20:40:38] Pool 0 stratum share submission failure [2013-12-12 20:40:39] Pool 0 difficulty changed to 2 [2013-12-12 20:40:39] Pool 0 difficulty changed to 4 [2013-12-12 20:40:39] Stratum from pool 0 requested work restart [2013-12-12 20:40:43] Pool 0 communication resumed, submitting work [2013-12-12 20:40:44] Rejected 7cdb7cfa Diff 2/2 BAL 0 (unknown) [2013-12-12 20:40:45] Rejected 7ba7095d Diff 2/2 BAL 0 (unknown) [2013-12-12 20:40:45] Accepted 1ce88b42 Diff 9/4 BAL 0 [2013-12-12 20:40:45] Accepted 2c7b1dfc Diff 6/4 BAL 0 [2013-12-12 20:40:45] Accepted 33a09445 Diff 5/4 BAL 0 [2013-12-12 20:40:48] Accepted 08ccc023 Diff 29/4 BAL 0 [2013-12-12 20:40:50] Accepted 31abe929 Diff 5/4 BAL 0 [2013-12-12 20:40:51] Accepted 2cabcef9 Diff 6/4 BAL 0 [2013-12-12 20:40:51] Accepted 29371e1c Diff 6/4 BAL 0 [2013-12-12 20:40:52] Accepted 01888088 Diff 167/4 BAL 0 [2013-12-12 20:40:54] Accepted 20d8c763 Diff 8/4 BAL 0 [2013-12-12 20:40:55] Accepted 220a6257 Diff 8/4 BAL 0 [2013-12-12 20:40:55] Accepted 16672733 Diff 11/4 BAL 0 [2013-12-12 20:40:55] Accepted 0fe687e3 Diff 16/4 BAL 0 [2013-12-12 20:40:56] Accepted 2b01dbb4 Diff 6/4 BAL 0 [2013-12-12 20:40:58] Accepted 22f92408 Diff 7/4 BAL 0 [2013-12-12 20:40:59] Accepted 17e9d090 Diff 11/4 BAL 0 [2013-12-12 20:40:59] Accepted 070d248c Diff 36/4 BAL 0 [2013-12-12 20:40:59] Accepted dd5e1b9e Diff 296/4 BAL 0 [2013-12-12 20:41:00] Accepted 1844612b Diff 11/4 BAL 0 [2013-12-12 20:41:00] Accepted 01ec72a4 Diff 133/4 BAL 0 [2013-12-12 20:41:03] Accepted 1f1ed030 Diff 8/4 BAL 0 [2013-12-12 20:41:03] Accepted 34657f65 Diff 5/4 BAL 0 [2013-12-12 20:41:03] Accepted 39952efc Diff 4/4 BAL 0 [2013-12-12 20:41:03] Accepted 15327dfe Diff 12/4 BAL 0 [2013-12-12 20:41:03] Accepted 20ff6ce7 Diff 8/4 BAL 0 [2013-12-12 20:41:04] Accepted 06336317 Diff 41/4 BAL 0 [2013-12-12 20:41:04] Accepted 114b7a07 Diff 15/4 BAL 0 [2013-12-12 20:41:04] Accepted 2993ff80 Diff 6/4 BAL 0 [2013-12-12 20:41:04] Accepted 2e43d328 Diff 6/4 BAL 0 [2013-12-12 20:41:04] Accepted 30c5e121 Diff 5/4 BAL 0 [2013-12-12 20:41:05] Accepted 25f4e0cd Diff 7/4 BAL 0 [2013-12-12 20:41:10] Accepted 2186a67a Diff 8/4 BAL 0 [2013-12-12 20:41:11] Accepted 078f564c Diff 34/4 BAL 0 [2013-12-12 20:41:11] Accepted 15c2277d Diff 12/4 BAL 0 [2013-12-12 20:41:13] Accepted 09a0cf86 Diff 27/4 BAL 0 [2013-12-12 20:41:13] Accepted 3dc2b7ee Diff 4/4 BAL 0 [2013-12-12 20:41:14] Accepted 294817df Diff 6/4 BAL 0 [2013-12-12 20:41:15] BAL 0 RequestResults usb write err:(-1) LIBUSB_ERROR_IO [2013-12-12 20:41:15] BAL 0 attempted reset got err:(0) LIBUSB_SUCCESS [2013-12-12 20:41:15] BAL 0 RequestResults usb write err:(-1) LIBUSB_ERROR_IO [2013-12-12 20:41:15] BAL 0 attempted reset got err:(0) LIBUSB_SUCCESS [2013-12-12 20:41:15] BAL 0 RequestResults usb write err:(-1) LIBUSB_ERROR_IO [2013-12-12 20:41:15] BAL 0 attempted reset got err:(0) LIBUSB_SUCCESS All those accepted results look to be arriving very fast. Is the software clocking up and running too fast for the rest of the hardware to process?
|
BTc donations welcome:- 13c2KuzWCaWFTXF171Zn1HrKhMYARPKv97
|
|
|
ChipGeek
|
|
December 12, 2013, 09:08:53 PM |
|
With only a single LED flashing, it looks like maybe the MCU has rebooted. The LEDs are numbered 1 (closest to USB connector) to 8 (closest to the PCI-E power connector). Is it LED 7 that is blinking?
|
Tip jar: 1ChipGeeK7PDxaAWG4VgsTi31SfJ6peKHw
|
|
|
Lucko (OP)
|
|
December 12, 2013, 10:07:00 PM |
|
With only a single LED flashing, it looks like maybe the MCU has rebooted. The LEDs are numbered 1 (closest to USB connector) to 8 (closest to the PCI-E power connector). Is it LED 7 that is blinking? Yes this board reboots(7 blinking)... But some just drops 1 V rail to 0 V and start producing HW errors and hashing blinking speeds up... But this board is now hashing at 41GH... But if I turn it off it will probably stop working again... I need to restart it enough times and it works again... But if it reboots it reboots at 0.97V and 27GH... After about 200 shares...
|
|
|
|
asjfdlksfd
|
|
December 13, 2013, 07:34:49 PM |
|
Lucky boy, my chilies are hashing with to less performance: cgminer version 3.8.5 - Started: [2013-12-13 08:11:43] -------------------------------------------------------------------------------- ALL (10s):162.7G (avg):162.6Gh/s | A:1579858 R:6000 HW:100679 WU:2137.2/m ST: 10 SS: 39 NB: 83 LW: 1705533 GF: 1 RF: 1 ... Block: 2dce8686... Diff:908M Started: [20:21:53] Best share: 995K -------------------------------------------------------------------------------- [P]ool management [S]ettings [D]isplay options [Q]uit BAL 0: max 69C 1.16V | 27.92G/28.05Gh/s | A:277760 R:1168 HW: 4781 WU:385.4/m BAL 1: max 43C 1.00V | 9.745G/9.694Gh/s | A: 46864 R: 192 HW:54790 WU: 64.1/m BAL 2: max 69C 1.15V | 21.23G/21.42Gh/s | A:215920 R: 576 HW: 3540 WU:294.1/m BAJ 0: max 49C 1.00V | 7.614G/7.752Gh/s | A: 82432 R: 464 HW: 399 WU:107.6/m BAL 3: max 69C 1.16V | 25.02G/25.45Gh/s | A:261538 R: 896 HW: 6695 WU:345.3/m BAL 4: max 67C 1.16V | 27.25G/27.23Gh/s | A:265600 R:1056 HW:18941 WU:354.5/m BAL 5: max 69C 1.16V | 27.31G/27.39Gh/s | A:272928 R: 944 HW: 5337 WU:376.1/m BAL 6: max 70C 1.02V | 15.60G/15.66Gh/s | A:156880 R: 704 HW: 6196 WU:210.0/m -------------------------------------------------------------------------------- BAL 1 and BAJ0 are Jallies, BAL1 with cooling problem. BAL6 and BAL2 are Chillies, also with cooling problem. Rest looks ok for non good vrm cooled boards at ambient temperature of 30 °C. Some of them has not a backplate which I will mount on weekend. Maybe than they will hashing faster. What I'm missing is a way to understand the which temps are shown in bfgminer the detailed view of each board as like it does for BFL miners in bfgminer (processor like stats) check the internal temperatures of each chip shows the frequencies of each chip ... Whishes over whishes... Cheers...
|
|
|
|
Lucko (OP)
|
|
December 13, 2013, 08:34:16 PM |
|
The problem is that is not stable... Every Chili has voltage/hashing speed(they do go up in parallel) that is critical. And if it goes over it it will work but if not will crash... Using more then one is impossible since it crashes mining program...
I really can't find anything wrong with them... Today test showed noting strange. Nothing is to hot no spikes... I do need to make some tests with Digital Power Designer software(need to figure out how to use it) that control chip power supply... That will be done tomorrow...
And it is different problem with different firmware on Chili. If I use old one it start producing HW errors if I use new one it restarts...
One card that is working is probably below problematic hashing speed...
|
|
|
|
asjfdlksfd
|
|
December 14, 2013, 11:39:23 AM |
|
Hm, how much pcbs you now have allready tested? Maybe they are only spikes in the whole order and so are really lemons ? How much hashing engines are activated after some hashing? Do you tried to use bfgminer instead of cgminer? I've here a new Thermaltake 730W PSU which looks like give also "not so ideal" hashing results with my both boards which have hashing problems. They have also problems with my new Enermax Revo PSU, but have little bit less hw errors so they hash ~ 1 GH/s better than with the Thermaltake. Connected to my good boards I cannot see a difference. But if I read some posts in Mr. Teals forum it looks like that some have problem where most other didn't have. If hashing with 30 GH/s is possible without special cooling of the vrm the board should be ok. So my last question is, how much boards never hashes or hashes really pool, arrive <25 GH/s, > 25 GH/s, >30 GH/s, >35 GH/s? Also how boards are affected and when will the boards which are not affected send to us?
|
|
|
|
Lucko (OP)
|
|
December 14, 2013, 12:29:17 PM |
|
1 board 23GH no problems... Only 70 engines.
The rest of the boards are over 118 engines...
5 boards looks to be 30+ but can't get them to hash for longer then minute to get to speed...
2 boards 36 and 38(with additional cooling 41 and 39) but they don't run every time for cold start... But if I stop mining software and restart is fast enough(so that the voltage is still over 1V) board will start every time...
In most cases if problems starts at 24 to 29GH. Only one board crashes at 38GH.
Now I can't figure it out why boards run for some time then restarts. I would say that it is put together right but no idea why it then restart and they restart even USB so they get disconnected from computer.
|
|
|
|
Mudbankkeith
|
|
December 14, 2013, 12:37:44 PM |
|
BFL did have problems with engine 0 making the miner unstable,
Can you disable engine 0 within the software? and then test without engine 0 running on each chip?
|
BTc donations welcome:- 13c2KuzWCaWFTXF171Zn1HrKhMYARPKv97
|
|
|
Lucko (OP)
|
|
December 14, 2013, 12:47:35 PM |
|
No. I don't have source code for firmware...
|
|
|
|
jelin1984
Legendary
Offline
Activity: 2408
Merit: 1004
|
|
December 14, 2013, 02:22:50 PM |
|
Any new firmware for the chili board .?
|
|
|
|
Bogart
Legendary
Offline
Activity: 966
Merit: 1000
|
|
December 14, 2013, 02:48:19 PM |
|
I recall my lot of chips being divided into 2 groups of "A" and "B", with the laser etchings bearing different numbers. I think A was week 38 and B was week 37, or something like that. I wonder if they behave differently.
|
"All safe deposit boxes in banks or financial institutions have been sealed... and may only be opened in the presence of an agent of the I.R.S." - President F.D. Roosevelt, 1933
|
|
|
|