Title: T17e problem
Post by: Masoudes72 on May 05, 2021, 05:34:53 PM
My miner device gives this error, where is the problem?
2021-05-05 17:18:18:power_api.c:322:set_to_check_asic_voltage_by_steps: Set to voltage raw 1800, step by step.
chain 1, IO_DRIVE_STRENGTH_CONFIGURATION reg = 0x 7777777
2021-05-05 17:18:38:driver-btm-api.c:908:check_asic_number_with_power_on: Chain[1]: find 78 asic, times 0
chain 2, IO_DRIVE_STRENGTH_CONFIGURATION reg = 0x 7777777
2021-05-05 17:18:42:driver-btm-api.c:908:check_asic_number_with_power_on: Chain[2]: find 78 asic, times 0
2021-05-05 17:18:42:power_api.c:388:set_to_highest_voltage_by_steps: Set to voltage raw 2000, step by step.
2021-05-05 17:19:01:driver-hash-chip.c:451:set_uart_relay: set uart relay to 0x330003
chain 1 domain 0: d0 0.402, d1 0.396, d2 0.394, d3 0.308, sum = 1.499634
chain 1 domain 1: d0 0.402, d1 0.396, d2 0.417, d3 0.285, sum = 1.499634
chain 1 domain 2: d0 0.393, d1 0.378, d2 0.391, d3 0.338, sum = 1.499634
chain 1 domain 3: d0 0.378, d1 0.378, d2 0.384, d3 0.360, sum = 1.499634
chain 1 domain 4: d0 0.374, d1 0.372, d2 0.357, d3 0.383, sum = 1.485718
chain 1 domain 5: d0 0.374, d1 0.379, d2 0.375, d3 0.363, sum = 1.491577
chain 1 domain 6: d0 0.418, d1 0.422, d2 0.414, d3 0.245, sum = 1.499634
chain 1 domain 7: d0 0.376, d1 0.374, d2 0.373, d3 0.368, sum = 1.491577
chain 1 domain 8: d0 0.359, d1 0.360, d2 0.354, d3 0.357, sum = 1.430054
chain 1 domain 9: d0 0.359, d1 0.359, d2 0.354, d3 0.328, sum = 1.400024
chain 1 domain 10: d0 0.322, d1 0.327, d2 0.312, d3 0.323, sum = 1.282837
chain 1 domain 11: d0 0.398, d1 0.379, d2 0.393, d3 0.329, sum = 1.499634
chain 1 domain 12: d0 0.403, d1 0.398, d2 0.394, d3 0.304, sum = 1.499634
chain 2 domain 0: d0 0.402, d1 0.397, d2 0.396, d3 0.305, sum = 1.499634
chain 2 domain 1: d0 0.389, d1 0.372, d2 0.380, d3 0.356, sum = 1.498169
chain 2 domain 2: d0 0.380, d1 0.373, d2 0.389, d3 0.352, sum = 1.494507
chain 2 domain 3: d0 0.367, d1 0.359, d2 0.380, d3 0.337, sum = 1.443237
chain 2 domain 4: d0 0.379, d1 0.388, d2 0.386, d3 0.347, sum = 1.499634
chain 2 domain 5: d0 0.379, d1 0.379, d2 0.376, d3 0.362, sum = 1.495972
chain 2 domain 6: d0 0.393, d1 0.383, d2 0.380, d3 0.343, sum = 1.499634
chain 2 domain 7: d0 0.371, d1 0.371, d2 0.384, d3 0.366, sum = 1.493042
chain 2 domain 8: d0 0.367, d1 0.376, d2 0.362, d3 0.345, sum = 1.449463
chain 2 domain 9: d0 0.392, d1 0.394, d2 0.389, d3 0.324, sum = 1.499634
chain 2 domain 10: d0 0.387, d1 0.387, d2 0.383, d3 0.343, sum = 1.499634
chain 2 domain 11: d0 0.375, d1 0.377, d2 0.354, d3 0.372, sum = 1.478027
chain 2 domain 12: d0 0.376, d1 0.376, d2 0.375, d3 0.365, sum = 1.492310
2021-05-05 17:19:02:driver-hash-chip.c:839:set_ldo_ctrl: Set LDO to 0x2000203
2021-05-05 17:19:02 voltage[1] = 1800
2021-05-05 17:19:02 voltage[2] = 1800
2021-05-05 17:19:02:power_api.c:272:set_working_voltage_raw: working_voltage_raw = 1800
2021-05-05 17:19:03:temperature.c:305:calibrate_temp_sensor_one_chain: chain 1 temp sensor NCT218
2021-05-05 17:19:05:temperature.c:305:calibrate_temp_sensor_one_chain: chain 2 temp sensor NCT218
2021-05-05 17:19:05:uart.c:69:set_baud: set UART baud to 12000000
2021-05-05 17:19:06:driver-btm-api.c:240:check_bringup_temp: Bring up temperature is 36
2021-05-05 17:19:06:thread.c:848:create_check_miner_status_thread: create thread
2021-05-05 17:19:06:thread.c:838:create_show_miner_status_thread: create thread
2021-05-05 17:19:06:thread.c:818:create_temperature_monitor_thread: create thread
2021-05-05 17:19:06:sweep.c:1168:sweep_get_max_freq: max_freq_with_diff = 720
2021-05-05 17:19:06:frequency.c:473:inc_freq_with_fixed_vco: chain = 255, freq = 570, is_higher_voltage = true
2021-05-05 17:19:06:power_api.c:400:set_to_voltage_by_steps: Set to voltage raw 2100, step by step.
2021-05-05 17:19:29:power_api.c:400:set_to_voltage_by_steps: Set to voltage raw 2000, step by step.
2021-05-05 17:19:44:power_api.c:400:set_to_voltage_by_steps: Set to voltage raw 1920, step by step.
2021-05-05 17:19:53:power_api.c:400:set_to_voltage_by_steps: Set to voltage raw 1880, step by step.
2021-05-05 17:20:02:power_api.c:400:set_to_voltage_by_steps: Set to voltage raw 1830, step by step.
2021-05-05 17:20:12:frequency.c:508:inc_freq_with_fixed_step: chain = 1, freq_start = 570, freq_end = 570, freq_step = 5, is_higher_voltage = true
2021-05-05 17:20:12:frequency.c:508:inc_freq_with_fixed_step: chain = 2, freq_start = 570, freq_end = 570, freq_step = 5, is_higher_voltage = true
2021-05-05 17:20:12:frequency.c:347:inc_asic_diff_freq_by_steps: chain = 1, start = 570, freq_step = 5
2021-05-05 17:20:17:thread.c:630:check_temperature: over max temp, pcb temp 67 (max 85), chip temp 101(max 98) pcb temp rise 0 (max 13) chip temp rise 6 (max 15)
2021-05-05 17:20:17:driver-btm-api.c:198:set_miner_status: ERROR_TEMP_TOO_HIGH
2021-05-05 17:20:17:driver-btm-api.c:139:stop_mining: stop mining: over max temp
2021-05-05 17:20:17:thread.c:868:cancel_temperature_monitor_thread: cancel thread
2021-05-05 17:20:17:thread.c:878:cancel_read_nonce_reg_thread: cancel thread
2021-05-05 17:20:17:driver-btm-api.c:124:killall_hashboard: ****power off hashboard****
Title: Re: T17e problem
Post by: HagssFIN on May 05, 2021, 06:00:04 PM
See the end of your log sample: a chip is running over the maximum temperature, at 101C (the limit is 98C).
Is there damage to the heat sinks, or bad contact between a heat sink and its chip?
Is your ambient temperature too warm?
Are the fans running ok?
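If you want to pull that over-temperature reading out of a longer log automatically, here is a minimal Python sketch. The helper name `parse_temp_check` is hypothetical, and the pattern is based only on the single `check_temperature` line in the log posted above, so other firmware versions may print it differently.

```python
import re

# Pattern derived from the one check_temperature line in the posted log;
# this is an assumption, not a documented log format.
TEMP_RE = re.compile(
    r"pcb temp (\d+) \(max (\d+)\), chip temp (\d+)\s*\(max (\d+)\)"
)


def parse_temp_check(line):
    """Return readings and limits from a check_temperature line, or None."""
    m = TEMP_RE.search(line)
    if m is None:
        return None
    pcb, pcb_max, chip, chip_max = map(int, m.groups())
    return {
        "pcb": pcb,
        "pcb_max": pcb_max,
        "chip": chip,
        "chip_max": chip_max,
        "pcb_over": pcb > pcb_max,      # board temperature above its limit?
        "chip_over": chip > chip_max,   # chip temperature above its limit?
    }


line = (
    "2021-05-05 17:20:17:thread.c:630:check_temperature: over max temp, "
    "pcb temp 67 (max 85), chip temp 101(max 98) pcb temp rise 0 (max 13) "
    "chip temp rise 6 (max 15)"
)
result = parse_temp_check(line)
print(result)
```

On the line from this log, the board temperature is within its limit and only the chip temperature is over, which points at the chip and heat sink rather than the board as a whole.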
Title: Re: T17e problem
Post by: Masoudes72 on May 05, 2021, 06:14:48 PM
Last time the device ran at a high temperature and turned off; now it can't start hashing. It can usually identify the chips on the hashboard, but sometimes it can't. Could this be due to the device overheating? The fans work fine.
Title: Re: T17e problem
Post by: mikeywith on May 06, 2021, 04:31:33 AM
The temperature part has to do with heatsink contact. Finding fewer than the total number of ASICs has to do with the terrible quality of the solder paste Bitmain uses; it's a very common issue across the whole 17 series, and the E version is the worst of all, whether it's the T or the S type. The hash board is probably dead and needs repair.
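To see whether a given boot found fewer ASICs than expected, the `check_asic_number_with_power_on` lines can be scanned with a rough Python sketch like the one below. The function names are hypothetical, the pattern is taken only from the two lines in the log above, and the expected count per chain is something you have to supply yourself; the 78 used here is simply what both chains reported in this particular log, not a confirmed spec value.

```python
import re

# Pattern derived from the "Chain[N]: find M asic" lines in the posted log.
CHAIN_RE = re.compile(r"Chain\[(\d+)\]: find (\d+) asic")


def asic_counts(log_text):
    """Map chain number -> last ASIC count reported for that chain."""
    counts = {}
    for chain, found in CHAIN_RE.findall(log_text):
        counts[int(chain)] = int(found)
    return counts


def chains_missing_asics(counts, expected_per_chain):
    """Return the chains that reported fewer ASICs than expected."""
    return sorted(c for c, n in counts.items() if n < expected_per_chain)


log_text = (
    "2021-05-05 17:18:38:driver-btm-api.c:908:check_asic_number_with_power_on: "
    "Chain[1]: find 78 asic, times 0\n"
    "2021-05-05 17:18:42:driver-btm-api.c:908:check_asic_number_with_power_on: "
    "Chain[2]: find 78 asic, times 0\n"
)
counts = asic_counts(log_text)
# expected_per_chain=78 is an assumption taken from this log, not from a datasheet.
print(counts, chains_missing_asics(counts, expected_per_chain=78))
```

Running this over several boots would show whether the count drops only intermittently, as described in the thread.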