Title: Fault on Antminer S17e Post by: Marcusgowell on May 14, 2022, 12:55:43 PM
Hi guys,
I've been advised to look for help here with issues on my S17e miner. When I power it up, it works for a while, then stops hashing. Sometimes it starts hashing again for a short time, sometimes it doesn't. If I switch the power off and back on, it starts hashing again. I have copied parts of the kernel log below; maybe you will be able to advise me on potential causes of this issue.

chain 2 domain 12: d0 0.311, d1 0.309, d2 0.310, d3 0.308, sum = 1.238241
chain 2 domain 13: d0 0.312, d1 0.307, d2 0.309, d3 0.310, sum = 1.237549
chain 2 domain 14: d0 0.314, d1 0.312, d2 0.313, d3 0.314, sum = 1.254110
2022-05-12 09:05:08:driver-hash-chip.c:493:check_adc_voltage: PASS domainn volt check: request 0.800 (index 0, open_core)
2022-05-12 09:05:08:frequency.c:442:inc_asic_diff_freq_by_steps: chain = 0, start = 365, freq_step = 5
2022-05-12 09:05:19:frequency.c:442:inc_asic_diff_freq_by_steps: chain = 1, start = 395, freq_step = 5
2022-05-12 09:05:26:frequency.c:442:inc_asic_diff_freq_by_steps: chain = 2, start = 365, freq_step = 5
2022-05-12 09:05:37:driver-btm-api.c:642:set_timeout: freq = 425, percent = 90, hcn = 235928, timeout = 554
2022-05-12 09:05:37:power_api.c:353:set_to_working_voltage_by_steps: Set to voltage raw 1860, step by step.
2022-05-12 09:05:53:thread.c:1515:create_check_system_status_thread: create thread
2022-05-12 09:05:53:driver-btm-api.c:2571:bitmain_soc_init: Init done!
2022-05-12 09:05:59:driver-btm-api.c:247:set_miner_status: STATUS_OKAY
2022-05-12 09:06:00:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 62323
2022-05-12 09:06:00:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 60000
2022-05-12 09:06:02:driver-btm-api.c:1610:dhash_chip_send_job: Version num 8
2022-05-12 09:06:02:driver-btm-api.c:1754:dhash_chip_send_job: stime.tv_sec 1652346362, block_ntime 1652346251
2022-05-12 09:36:37:thread.c:233:calc_hashrate_avg: avg rate is 62533.67 in 30 mins
2022-05-12 09:36:37:temperature.c:440:temp_statistics_show: pcb temp 42~67 chip temp 58~71
2022-05-12 10:07:14:thread.c:233:calc_hashrate_avg: avg rate is 62541.08 in 30 mins
2022-05-12 10:07:14:temperature.c:440:temp_statistics_show: pcb temp 43~67 chip temp 59~72
2022-05-12 10:37:46:thread.c:233:calc_hashrate_avg: avg rate is 62595.34 in 30 mins
2022-05-12 10:37:46:temperature.c:440:temp_statistics_show: pcb temp 46~71 chip temp 61~75
2022-05-12 11:08:16:thread.c:233:calc_hashrate_avg: avg rate is 62737.02 in 30 mins
2022-05-12 11:08:16:temperature.c:440:temp_statistics_show: pcb temp 46~70 chip temp 60~75
2022-05-12 11:38:49:thread.c:233:calc_hashrate_avg: avg rate is 63035.07 in 30 mins
2022-05-12 11:38:49:temperature.c:440:temp_statistics_show: pcb temp 46~72 chip temp 62~76
2022-05-12 11:48:12:thread.c:1136:asic_status_monitor_thread: ERROR: chain 0 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:12:thread.c:1136:asic_status_monitor_thread: ERROR: chain 1 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:12:thread.c:1136:asic_status_monitor_thread: ERROR: chain 2 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 0, chip = 27, reg = 0
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 0, chip = 27, reg = 1
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 1, chip = 35, reg = 0
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 1, chip = 35, reg = 1
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 2, chip = 99, reg = 0
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 2, chip = 99, reg = 1
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 3, chip = 107, reg = 0
2022-05-12 11:48:13:temperature.c:737:get_temp_info: read temp sensor failed: chain = 0, sensor = 3, chip = 107, reg = 1
2022-05-12 11:48:13:temperature.c:754:get_temp_info: ERROR: chain 0 can get NONE temp info or temp value abnormal, power it off
2022-05-12 11:48:13:thread.c:1172:asic_status_monitor_thread: chain 0 can't get enough hashrate reg val for 0 times.
2022-05-12 11:48:13:thread.c:1172:asic_status_monitor_thread: chain 1 can't get enough hashrate reg val for 0 times.
2022-05-12 11:48:13:thread.c:1172:asic_status_monitor_thread: chain 2 can't get enough hashrate reg val for 0 times.
2022-05-12 11:48:13:driver-hash-chip.c:591:recalc_invalid_volt: chain 0 domain 00 column 00 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:13:driver-hash-chip.c:591:recalc_invalid_volt: chain 0 domain 00 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:16:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 14 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:16:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 14 column 02 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:16:thread.c:1172:asic_status_monitor_thread: chain 0 can't get enough hashrate reg val for 1 times.
2022-05-12 11:48:16:thread.c:1172:asic_status_monitor_thread: chain 1 can't get enough hashrate reg val for 1 times.
2022-05-12 11:48:16:thread.c:1172:asic_status_monitor_thread: chain 2 can't get enough hashrate reg val for 1 times.
2022-05-12 11:48:16:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 14 column 03 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:16:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 14 column 04 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:16:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 12 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:16:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 13 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:16:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 14 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:17:thread.c:1136:asic_status_monitor_thread: ERROR: chain 0 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:17:thread.c:1136:asic_status_monitor_thread: ERROR: chain 1 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:17:thread.c:1136:asic_status_monitor_thread: ERROR: chain 2 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:17:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 41848
2022-05-12 11:48:17:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 40000
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 0, chip = 27, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 0, chip = 27, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 1, chip = 35, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 1, chip = 35, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 2, chip = 99, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 2, chip = 99, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 3, chip = 107, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 1, sensor = 3, chip = 107, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 0, chip = 27, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 0, chip = 27, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 1, chip = 35, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 1, chip = 35, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 2, chip = 99, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 2, chip = 99, reg = 1
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 3, chip = 107, reg = 0
2022-05-12 11:48:18:temperature.c:737:get_temp_info: read temp sensor failed: chain = 2, sensor = 3, chip = 107, reg = 1
2022-05-12 11:48:18:temperature.c:754:get_temp_info: ERROR: chain 1 can get NONE temp info or temp value abnormal, power it off
2022-05-12 11:48:18:thread.c:1172:asic_status_monitor_thread: chain 1 can't get enough hashrate reg val for 2 times.
2022-05-12 11:48:18:thread.c:1172:asic_status_monitor_thread: chain 2 can't get enough hashrate reg val for 2 times.
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 1 domain 00 column 00 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 1 domain 00 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
chain 1 domain 12: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 1 domain 13: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 1 domain 14: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
2022-05-12 11:48:19:thread.c:1136:asic_status_monitor_thread: ERROR: chain 1 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:19:thread.c:1136:asic_status_monitor_thread: ERROR: chain 2 get hashrate_reg_counter 0, require 135, failed times 1
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 00 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:19:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 02 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:20:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 12 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:20:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 13 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:20:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 14 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:21:temperature.c:754:get_temp_info: ERROR: chain 2 can get NONE temp info or temp value abnormal, power it off
2022-05-12 11:48:21:thread.c:1172:asic_status_monitor_thread: chain 2 can't get enough hashrate reg val for 3 times.
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 00 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 01 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 20475
2022-05-12 11:48:21:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 20000
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 02 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 03 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:driver-hash-chip.c:591:recalc_invalid_volt: chain 2 domain 00 column 04 single core_domain: no useful core domain volt: 0.00 0.00 0.00 0.00.
2022-05-12 11:48:21:driver-hash-chip.c:707:dump_adc_voltage_v2: chain 2 domain 14 core_domain 1: all asic timeout or overrange.
2022-05-12 11:48:21:driver-hash-chip.c:707:dump_adc_voltage_v2: chain 2 domain 14 core_domain 2: all asic timeout or overrange.
2022-05-12 11:48:21:driver-hash-chip.c:707:dump_adc_voltage_v2: chain 2 domain 14 core_domain 3: all asic timeout or overrange.
2022-05-12 11:48:21:driver-hash-chip.c:737:dump_adc_voltage_v2: get ADC_DATAOUT from chain 2 with 540 regs timeout.
chain 2 domain 0: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 1: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 2: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 3: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 4: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 5: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 6: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 7: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 8: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 9: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 10: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 11: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 12: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 13: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
chain 2 domain 14: x d0 0.000, x d1 0.000, x d2 0.000, x d3 0.000, x sum = 0.000000
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 00 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 01 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 02 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 03 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 04 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 05 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 06 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 07 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 08 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 09 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 10 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 11 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 12 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 13 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:22:driver-hash-chip.c:486:check_adc_voltage: FAIL domain volt check: chain 2 domain 14 volt 0.000 less then request 0.800 (index 0)
2022-05-12 11:48:23:driver-btm-api.c:247:set_miner_status: ERROR_TEMP_LOST
2022-05-12 11:48:23:driver-btm-api.c:176:stop_mining: stop mining: no chain exists, maybe caused by sensor lost
2022-05-12 11:48:23:thread.c:1588:cancel_check_miner_status_thread: cancel thread
2022-05-12 11:48:23:thread.c:1583:cancel_check_system_status_thread: cancel thread
2022-05-12 11:48:23:thread.c:1572:cancel_read_nonce_reg_thread: cancel thread
2022-05-12 11:48:23:thread.c:1593:cancel_asic_status_monitor_thread: cancel thread
2022-05-12 11:48:23:driver-btm-api.c:147:killall_hashboard: ****power off hashboard****
2022-05-12 11:48:23:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 0
2022-05-12 11:48:23:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 0
2022-05-12 11:48:24:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 0
2022-05-12 11:48:24:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 0
2022-05-12 11:48:25:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 0
2022-05-12 11:48:25:frequency.c:128:get_sale_hash_rate_GH: sale_hash_rate = 0
2022-05-12 11:48:26:frequency.c:107:get_ideal_hash_rate_GH: ideal_hash_rate = 0

Title: Re: Fault on Antminer S17e Post by: BitMaxz on May 15, 2022, 12:03:22 AM
Based on the kernel log you posted above, the fault might be overheating, which would explain why the sensor is lost.
What I would like you to do first is flash it with the latest firmware available from Bitmain here: https://shop.bitmain.com/support/download
Then post the result here. If the issue is still there, I suggest you test all the hash boards one by one; you can follow the guide from Bitmain here: "Test hash board one by one Guide (https://support.bitmain.com/hc/en-us/articles/226142788-Test-hash-board-one-by-one)"

Title: Re: Fault on Antminer S17e Post by: THANOSMINING on June 14, 2022, 01:03:11 PM
This may be a fault caused by the PSU. It is recommended to try another PSU.
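
For anyone who wants to check the overheating theory against the kernel log in the first post, a short script can pull out the logged chip temperatures and the time each chain first reports an error. This is only a minimal sketch: it assumes every entry starts with `date time:file.c:line:function: ` as in the excerpt, and the names `parse_entries` and `summarize` are made up for this example, not part of any Bitmain tool.

```python
import re

# Assumed entry prefix, copied from the layout seen in the posted log, e.g.
#   2022-05-12 11:48:12:thread.c:1136:asic_status_monitor_thread: ERROR: ...
ENTRY = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}):([\w.-]+):(\d+):(\w+): "
)
TEMPS = re.compile(r"pcb temp (\d+)~(\d+) chip temp (\d+)~(\d+)")
CHAIN = re.compile(r"chain\s*=?\s*(\d+)")


def parse_entries(text):
    """Split a (possibly flattened) log into (timestamp, function, message)."""
    marks = list(ENTRY.finditer(text))
    entries = []
    for i, m in enumerate(marks):
        # A message runs until the next timestamped prefix or the end of text.
        end = marks[i + 1].start() if i + 1 < len(marks) else len(text)
        entries.append((m.group(1), m.group(4), text[m.end():end].strip()))
    return entries


def summarize(text):
    """Return (highest chip temp logged, {chain: timestamp of first error})."""
    max_chip = None
    first_error = {}
    for ts, _func, msg in parse_entries(text):
        t = TEMPS.search(msg)
        if t:
            hi = int(t.group(4))
            max_chip = hi if max_chip is None else max(max_chip, hi)
        if "ERROR" in msg or "failed" in msg:
            c = CHAIN.search(msg)
            if c:
                # Keep only the earliest error seen for each chain.
                first_error.setdefault(int(c.group(1)), ts)
    return max_chip, first_error
```

Reading the posted log by hand gives the same picture the script is meant to extract: the last chip temperature range logged before the failure is 62~76, and all three chains log their first error at 11:48:12.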
Title: Re: Fault on Antminer S17e Post by: philipma1957 on June 14, 2022, 01:22:53 PM
Pull two boards and try with one board.
See if that works. If it does, take the good board out, place it in a safe place, and go to board two. Run it and see if it works. If it works, take it out and place it in a safe place. Go to board three, run it, and see if it works. If it works, the issue is either overheating or a weak PSU, so try two boards and see if that works. If two boards work, settle for that and sell the third board. My guess is nothing will work and the PSU is bad. Also look long and hard at the fans on the PSU; maybe one is failing or has failed.

Title: Re: Fault on Antminer S17e Post by: GFDhd on July 25, 2022, 02:23:33 AM
I was wondering, have you tried replacing the control board? The temperature sensing circuit of the Antminer 17 series hash board uses an I2C serial bus, which is powered by the control board. Therefore, if the control board fails, it cannot supply power to the temperature sensor, and the temperature sensor readings will not be collected.
Title: Re: Fault on Antminer S17e Post by: MinerMEDIC on July 26, 2022, 05:44:46 PM
To move forward we need to know if the power supply is outputting the required 20V. The control board puts out an additional 3.3V through the data cable to power the PIC, which collects the temperature data via I2C. And almost finally, the control board is powered from the power supply by the 6-wire 12V line. Plenty of failure points for this error. Let’s start with the first. Do you have a voltage meter?
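
Once the meter readings are in hand, the three rails named above can be checked in the same order with something like the sketch below. The nominal values come from the post; the rail names, the `check_rails` helper, and the 5% tolerance are assumptions for illustration only, not a Bitmain specification.

```python
# Nominal rails as described in the post above, in the order they should be
# checked. The dictionary keys are hypothetical labels chosen for this example.
EXPECTED_RAILS = {
    "psu_hashboard_output": 20.0,   # PSU output to the hash boards
    "control_board_supply": 12.0,   # 6-wire line feeding the control board
    "data_cable_pic_supply": 3.3,   # control board to PIC over the data cable
}


def check_rails(measured, tolerance=0.05):
    """Return (rail, measured, nominal) for each rail missing or out of range.

    `tolerance` is a fraction of nominal; 5% is an assumed default.
    """
    problems = []
    for rail, nominal in EXPECTED_RAILS.items():
        value = measured.get(rail)
        if value is None or abs(value - nominal) > nominal * tolerance:
            problems.append((rail, value, nominal))
    return problems


if __name__ == "__main__":
    # Example readings: the 3.3V rail is dead, the others look fine.
    readings = {
        "psu_hashboard_output": 20.1,
        "control_board_supply": 12.0,
        "data_cable_pic_supply": 0.0,
    }
    for rail, value, nominal in check_rails(readings):
        print(f"{rail}: measured {value} V, expected about {nominal} V")
```

A rail that was not measured is reported as a problem too, so an incomplete set of readings cannot pass by accident.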