Bitcoin Forum

Bitcoin => Mining support => Topic started by: CryptoLLC on January 19, 2021, 10:18:59 PM



Title: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 19, 2021, 10:18:59 PM
I have 250 units. About 8 of them randomly turn off for no apparent reason. There are no errors and temps are fine. I tried a different PSU and a different control board, and the fans are at 97%. If you know what the reason is, please let me know. Thank you.

Code:
CPU: PIPT / VIPT nonaliasing data cache, VIPT aliasing instruction cache
Machine model: Xilinx Zynq
cma: Reserved 16 MiB at 0x0e000000
Memory policy: Data cache writealloc
On node 0 totalpages: 61440
free_area_init_node: node 0, pgdat c0b3c040, node_mem_map cde10000
  Normal zone: 480 pages used for memmap
  Normal zone: 0 pages reserved
  Normal zone: 61440 pages, LIFO batch:15
percpu: Embedded 12 pages/cpu @cddf0000 s19916 r8192 d21044 u49152
pcpu-alloc: s19916 r8192 d21044 u49152 alloc=12*4096
pcpu-alloc: [0] 0 [0] 1
Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 60960
Kernel command line: mem=240M console=ttyPS0,115200 ramdisk_size=33554432 root=/dev/ram rw earlyprintk
PID hash table entries: 1024 (order: 0, 4096 bytes)
Dentry cache hash table entries: 32768 (order: 5, 131072 bytes)
Inode-cache hash table entries: 16384 (order: 4, 65536 bytes)
Memory: 209572K/245760K available (6317K kernel code, 243K rwdata, 1932K rodata, 1024K init, 232K bss, 19804K reserved, 16384K cma-reserved, 0K highmem)
Virtual kernel memory layout:
    vector  : 0xffff0000 - 0xffff1000   (   4 kB)
    fixmap  : 0xffc00000 - 0xfff00000   (3072 kB)
    vmalloc : 0xcf800000 - 0xff800000   ( 768 MB)
    lowmem  : 0xc0000000 - 0xcf000000   ( 240 MB)
    pkmap   : 0xbfe00000 - 0xc0000000   (   2 MB)
    modules : 0xbf000000 - 0xbfe00000   (  14 MB)
      .text : 0xc0008000 - 0xc090e410   (9242 kB)
      .init : 0xc0a00000 - 0xc0b00000   (1024 kB)
      .data : 0xc0b00000 - 0xc0b3cda0   ( 244 kB)
       .bss : 0xc0b3cda0 - 0xc0b77024   ( 233 kB)
Preemptible hierarchical RCU implementation.
Build-time adjustment of leaf fanout to 32.
RCU restricting CPUs from NR_CPUS=4 to nr_cpu_ids=2.
RCU: Adjusting geometry for rcu_fanout_leaf=32, nr_cpu_ids=2
NR_IRQS:16 nr_irqs:16 16
efuse mapped to cf800000
ps7-slcr mapped to cf802000
L2C: platform modifies aux control register: 0x72360000 -> 0x72760000
L2C: DT/platform modifies aux control register: 0x72360000 -> 0x72760000
L2C-310 erratum 769419 enabled
L2C-310 enabling early BRESP for Cortex-A9
L2C-310 full line of zeros enabled for Cortex-A9
L2C-310 ID prefetch enabled, offset 1 lines
L2C-310 dynamic clock gating enabled, standby mode enabled
L2C-310 cache controller enabled, 8 ways, 512 kB
L2C-310: CACHE_ID 0x410000c8, AUX_CTRL 0x76760001
zynq_clock_init: clkc starts at cf802100
Zynq clock init
sched_clock: 64 bits at 333MHz, resolution 3ns, wraps every 4398046511103ns
clocksource: arm_global_timer: mask: 0xffffffffffffffff max_cycles: 0x4ce07af025, max_idle_ns: 440795209040 ns
Switching to timer-based delay loop, resolution 3ns
clocksource: ttc_clocksource: mask: 0xffff max_cycles: 0xffff, max_idle_ns: 537538477 ns
ps7-ttc #0 at cf80a000, irq=18
Console: colour dummy device 80x30
Calibrating delay loop (skipped), value calculated using timer frequency.. 666.66 BogoMIPS (lpj=3333333)
pid_max: default: 32768 minimum: 301
Mount-cache hash table entries: 1024 (order: 0, 4096 bytes)
Mountpoint-cache hash table entries: 1024 (order: 0, 4096 bytes)
CPU: Testing write buffer coherency: ok
CPU0: thread -1, cpu 0, socket 0, mpidr 80000000
Setting up static identity map for 0x100000 - 0x100058
CPU1: failed to boot: -1
Brought up 1 CPUs
SMP: Total of 1 processors activated (666.66 BogoMIPS).
CPU: All CPU(s) started in SVC mode.
devtmpfs: initialized
VFP support v0.3: implementor 41 architecture 3 part 30 variant 9 rev 4
clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 19112604462750000 ns
pinctrl core: initialized pinctrl subsystem
NET: Registered protocol family 16
DMA: preallocated 256 KiB pool for atomic coherent allocations
cpuidle: using governor menu
hw-breakpoint: found 5 (+1 reserved) breakpoint and 1 watchpoint registers.
hw-breakpoint: maximum watchpoint size is 4 bytes.
zynq-ocm f800c000.ps7-ocmc: ZYNQ OCM pool: 256 KiB @ 0xcf880000
vgaarb: loaded
SCSI subsystem initialized
usbcore: registered new interface driver usbfs
usbcore: registered new interface driver hub
usbcore: registered new device driver usb
media: Linux media interface: v0.10
Linux video capture interface: v2.00
pps_core: LinuxPPS API ver. 1 registered
pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti <giometti@linux.it>
PTP clock support registered
EDAC MC: Ver: 3.0.0
Advanced Linux Sound Architecture Driver Initialized.
clocksource: Switched to clocksource arm_global_timer
NET: Registered protocol family 2
TCP established hash table entries: 2048 (order: 1, 8192 bytes)
TCP bind hash table entries: 2048 (order: 2, 16384 bytes)
TCP: Hash tables configured (established 2048 bind 2048)
UDP hash table entries: 256 (order: 1, 8192 bytes)
UDP-Lite hash table entries: 256 (order: 1, 8192 bytes)
NET: Registered protocol family 1
RPC: Registered named UNIX socket transport module.
RPC: Registered udp transport module.
RPC: Registered tcp transport module.
RPC: Registered tcp NFSv4.1 backchannel transport module.
PCI: CLS 0 bytes, default 64
Trying to unpack rootfs image as initramfs...
rootfs image is not initramfs (no cpio magic); looks like an initrd
Freeing initrd memory: 6732K (cd466000 - cdaf9000)
hw perfevents: enabled with armv7_cortex_a9 PMU driver, 7 counters available
futex hash table entries: 512 (order: 3, 32768 bytes)
workingset: timestamp_bits=28 max_order=16 bucket_order=0
jffs2: version 2.2. (NAND) (SUMMARY)  © 2001-2006 Red Hat, Inc.
io scheduler noop registered
io scheduler deadline registered
io scheduler cfq registered (default)
dma-pl330 f8003000.ps7-dma: Loaded driver for PL330 DMAC-241330
dma-pl330 f8003000.ps7-dma: DBUFF-128x8bytes Num_Chans-8 Num_Peri-4 Num_Events-16
e0000000.serial: ttyPS0 at MMIO 0xe0000000 (irq = 159, base_baud = 6249999) is a xuartps
console [ttyPS0] enabled
xdevcfg f8007000.ps7-dev-cfg: ioremap 0xf8007000 to cf86e000
[drm] Initialized drm 1.1.0 20060810
brd: module loaded
loop: module loaded
CAN device driver interface
gpiod_set_value: invalid GPIO
libphy: MACB_mii_bus: probed
macb e000b000.ethernet eth0: Cadence GEM rev 0x00020118 at 0xe000b000 irq 31 (00:0a:35:00:00:00)
Generic PHY e000b000.etherne:00: attached PHY driver [Generic PHY] (mii_bus:phy_addr=e000b000.etherne:00, irq=-1)
e1000e: Intel(R) PRO/1000 Network Driver - 3.2.6-k
e1000e: Copyright(c) 1999 - 2015 Intel Corporation.
ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
ehci-pci: EHCI PCI platform driver
usbcore: registered new interface driver usb-storage
mousedev: PS/2 mouse device common for all mice
i2c /dev entries driver
cdns-i2c e0005000.ps7_i2c: 100 kHz mmio e0005000 irq 154
Xilinx Zynq CpuIdle Driver started
sdhci: Secure Digital Host Controller Interface driver
sdhci: Copyright(c) Pierre Ossman
sdhci-pltfm: SDHCI platform and OF driver helper
mmc0: SDHCI controller on e0100000.ps7-sdio [e0100000.ps7-sdio] using ADMA
ledtrig-cpu: registered to indicate activity on CPUs
usbcore: registered new interface driver usbhid
usbhid: USB HID core driver
nand: disable subpage write
nand: device found, Manufacturer ID: 0x2c, Chip ID: 0xda
nand: Micron MT29F2G08ABAEAWP
nand: 256 MiB, SLC, erase size: 128 KiB, page size: 2048, OOB size: 64
nand: NAND_ECC_HW
nand: NAND_ECC_HW_SYNDROME
mtd->writesize = 2048
ecc->strength = 1
ecc->size = 2048
mtd->writesize = 2048
chip->ecc_strength_ds = 4
chip->ecc_step_ds = 512
nand: WARNING: pl35x-nand: the ECC used on your system is too weak compared to the one required by the NAND chip
Bad block table found at page 131008, version 0x01
Bad block table found at page 130944, version 0x01
8 ofpart partitions found on MTD device pl35x-nand
Creating 8 MTD partitions on "pl35x-nand":
0x000000000000-0x000002800000 : "BOOT.bin-dts-marker-kernel"
0x000002800000-0x000004800000 : "ramfs"
0x000004800000-0x000005000000 : "configs"
0x000005000000-0x000005200000 : "sig"
0x000005200000-0x000006000000 : "reserve1"
0x000006000000-0x000007000000 : "upgrade-ramfs"
0x000007000000-0x00000a800000 : "upgrade-file"
0x00000a800000-0x000010000000 : "reserve2"
nf_conntrack version 0.5.0 (3635 buckets, 14540 max)
ip_tables: (C) 2000-2006 Netfilter Core Team
NET: Registered protocol family 10
ip6_tables: (C) 2000-2006 Netfilter Core Team
sit: IPv6 over IPv4 tunneling driver
NET: Registered protocol family 17
can: controller area network core (rev 20120528 abi 9)
NET: Registered protocol family 29
can: raw protocol (rev 20120528)
can: broadcast manager protocol (rev 20120528 t)
can: netlink gateway (rev 20130117) max_hops=1
zynq_pm_ioremap: no compatible node found for 'xlnx,zynq-ddrc-a05'
zynq_pm_late_init: Unable to map DDRC IO memory.
Registering SWP/SWPB emulation handler
hctosys: unable to open rtc device (rtc0)
ALSA device list:
  No soundcards found.
RAMDISK: gzip image found at block 0
EXT4-fs (ram0): couldn't mount as ext3 due to feature incompatibilities
EXT4-fs warning (device ram0): ext4_update_dynamic_rev:746: updating to rev 1 because of new feature flag, running e2fsck is recommended
EXT4-fs (ram0): mounted filesystem without journal. Opts: (null)
VFS: Mounted root (ext4 filesystem) on device 1:0.
devtmpfs: mounted
Freeing unused kernel memory: 1024K (c0a00000 - c0b00000)
EXT4-fs (ram0): re-mounted. Opts: block_validity,delalloc,barrier,user_xattr,errors=remount-ro
devpts: called with bogus options
ubi0: attaching mtd2
ubi0: scanning is finished
ubi0: attached mtd2 (name "configs", size 8 MiB)
ubi0: PEB size: 131072 bytes (128 KiB), LEB size: 126976 bytes
ubi0: min./max. I/O unit sizes: 2048/2048, sub-page size 2048
ubi0: VID header offset: 2048 (aligned 2048), data offset: 4096
ubi0: good PEBs: 64, bad PEBs: 0, corrupted PEBs: 0
ubi0: user volume: 1, internal volumes: 1, max. volumes count: 128
ubi0: max/mean erase counter: 6/2, WL threshold: 4096, image sequence number: 262741923
ubi0: available PEBs: 36, total reserved PEBs: 28, PEBs reserved for bad PEB handling: 4
ubi0: background thread "ubi_bgt0d" started, PID 729
UBIFS (ubi0:0): background thread "ubifs_bgt0_0" started, PID 733
UBIFS (ubi0:0): UBIFS: mounted UBI device 0, volume 0, name "configs"
UBIFS (ubi0:0): LEB size: 126976 bytes (124 KiB), min./max. I/O unit sizes: 2048 bytes/2048 bytes
UBIFS (ubi0:0): FS size: 1396736 bytes (1 MiB, 11 LEBs), journal size 888833 bytes (0 MiB, 5 LEBs)
UBIFS (ubi0:0): reserved for root: 65970 bytes (64 KiB)
UBIFS (ubi0:0): media format: w4/r0 (latest is w4/r0), UUID A9E8BDA4-70DE-45D9-83BE-2DD9F129C0C6, small LPT model
ubi2: attaching mtd4
ubi2: scanning is finished
ubi2: attached mtd4 (name "reserve1", size 14 MiB)
ubi2: PEB size: 131072 bytes (128 KiB), LEB size: 126976 bytes
ubi2: min./max. I/O unit sizes: 2048/2048, sub-page size 2048
ubi2: VID header offset: 2048 (aligned 2048), data offset: 4096
ubi2: good PEBs: 112, bad PEBs: 0, corrupted PEBs: 0
ubi2: user volume: 1, internal volumes: 1, max. volumes count: 128
ubi2: max/mean erase counter: 6/2, WL threshold: 4096, image sequence number: 2255068344
ubi2: available PEBs: 0, total reserved PEBs: 112, PEBs reserved for bad PEB handling: 4
ubi2: background thread "ubi_bgt2d" started, PID 740
UBIFS (ubi2:0): background thread "ubifs_bgt2_0" started, PID 744
UBIFS (ubi2:0): UBIFS: mounted UBI device 2, volume 0, name "misc"
UBIFS (ubi2:0): LEB size: 126976 bytes (124 KiB), min./max. I/O unit sizes: 2048 bytes/2048 bytes
UBIFS (ubi2:0): FS size: 11935744 bytes (11 MiB, 94 LEBs), journal size 1015809 bytes (0 MiB, 6 LEBs)
UBIFS (ubi2:0): reserved for root: 563754 bytes (550 KiB)
UBIFS (ubi2:0): media format: w4/r0 (latest is w4/r0), UUID 0EC0A221-F712-4D31-9EF9-BCF170A12764, small LPT model
IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
random: avahi-daemon urandom read with 2 bits of entropy available
macb e000b000.ethernet eth0: unable to generate target frequency: 25000000 Hz
macb e000b000.ethernet eth0: link up (100/Full)
IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
In axi fpga driver!
request_mem_region OK!
AXI fpga dev virtual address is 0xcfbc8000
*base_vir_addr = 0xb031
In fpga mem driver!
request_mem_region OK!
fpga mem virtual address is 0xd2000000
random: nonblocking pool is initialized


===========================================Miner log===========================================
1970-01-01 00:00:10 Miner compile time: Mon Oct 26 17:55:33 CST 2020 type: Antminer S19 Pro
1970-01-01 00:00:11 This is fix-freq version
1970-01-01 00:00:11 Miner compile time: Mon Oct 26 17:55:33 CST 2020 type: Antminer S19 Pro
1970-01-01 00:00:11 commit version: 62e4736 2020-09-21 11:17:56, build by: jenkins 2020-10-26 18:06:03
1970-01-01 00:00:11 opt_multi_version     = 1
1970-01-01 00:00:11 opt_bitmain_ab        = 1
1970-01-01 00:00:11 mid_auto_gen          = 0
1970-01-01 00:00:11 opt_bitmain_work_mode = 0
1970-01-01 00:00:11 mmap fpga_mem_addr_hal = 0xb5900000
1970-01-01 00:00:11 HASH_ON_PLUG V9 = 0x7
1970-01-01 00:00:11 Note: front fan is power on!
1970-01-01 00:00:11 Note: rear fan is power on!
1970-01-01 00:00:11 start the http log.
1970-01-01 00:00:11 httpListenThread start ret=0
1970-01-01 00:00:11 start listen on 6060 ...
1970-01-01 00:00:11 load machine NBP1901 conf
1970-01-01 00:00:11 machine : NBP1901
1970-01-01 00:00:11 chain_num 4, chain_domain_num 38, chain_asic_num 114, domain_asic_num 3
2021-01-19 20:55:40 miner ID : 814254205710481c
2021-01-19 20:55:40 FPGA Version = 0xB031
2021-01-19 20:55:40 HASH_ON_PLUG V9 = 0x7
2021-01-19 20:55:40 ==========================capability start==========================
2021-01-19 20:55:40 board num = 3
2021-01-19 20:55:40 board id = 0, chain num = 1
2021-01-19 20:55:40 chain id = 0
2021-01-19 20:55:40 board id = 1, chain num = 1
2021-01-19 20:55:40 chain id = 1
2021-01-19 20:55:40 board id = 2, chain num = 1
2021-01-19 20:55:40 chain id = 2
2021-01-19 20:55:40 ==========================capability end============================
2021-01-19 20:55:40 chain num = 3
2021-01-19 20:55:42 [chain 0]
2021-01-19 20:55:42 0x0000 11 42 3D C8 D9 C8 26 0A   9E 12 7A E9 BB EA D3 A4
2021-01-19 20:55:42 0x0010 97 77 F0 AE C9 85 B0 EA   CE 56 A3 EE 0C 9A 28 78
2021-01-19 20:55:42 0x0020 94 F8 8C 41 A5 BA F9 91   FB 7A 25 AE AF 3E 70 81
2021-01-19 20:55:42 0x0030 E1 FA 69 61 96 60 E9 A8   E7 8B 9D 38 6A 50 59 B8
2021-01-19 20:55:42 0x0040 31 F1 FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x0050 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x0060 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x0070 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x0080 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x0090 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x00A0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x00B0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x00C0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x00D0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x00E0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:42 0x00F0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF 5A
2021-01-19 20:55:42
2021-01-19 20:55:44 [chain 1]
2021-01-19 20:55:44 0x0000 11 42 1B 38 9B 6C 44 C6   A6 7C F2 03 3B C3 4F 49
2021-01-19 20:55:44 0x0010 99 A1 D0 04 7C F4 C8 F2   C5 04 B7 79 D6 1F 74 CA
2021-01-19 20:55:44 0x0020 20 3E 65 D4 E3 ED A9 1D   84 79 D8 60 00 86 E2 9B
2021-01-19 20:55:44 0x0030 7F B5 52 4C 5C 80 41 A8   E4 72 E6 6A 66 99 F3 70
2021-01-19 20:55:44 0x0040 02 3B FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x0050 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x0060 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x0070 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x0080 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x0090 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x00A0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x00B0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x00C0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x00D0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x00E0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:44 0x00F0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF 5A
2021-01-19 20:55:44
2021-01-19 20:55:45 [chain 2]
2021-01-19 20:55:45 0x0000 11 42 A4 20 E1 58 B0 FB   AB AD E8 B6 F7 23 3A A0
2021-01-19 20:55:45 0x0010 4B C0 CC 1F 88 C7 F0 B5   45 AB 6E EE E0 55 02 DA
2021-01-19 20:55:45 0x0020 88 D0 CE 01 95 E2 4D F4   78 5D 68 A8 54 4D F4 61
2021-01-19 20:55:45 0x0030 E4 87 B8 25 FB E0 2E 34   ED B5 3A A5 F9 A2 3D BC
2021-01-19 20:55:45 0x0040 E4 DD FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x0050 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x0060 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x0070 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x0080 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x0090 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x00A0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x00B0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x00C0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x00D0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x00E0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-01-19 20:55:45 0x00F0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF 5A
2021-01-19 20:55:45
2021-01-19 20:55:45 fan_eft : 1  fan_pwm : 97
2021-01-19 20:55:45 create thread
2021-01-19 20:55:45 fixed working voltage = 1260
2021-01-19 20:55:45 Chain [0] PCB Version: 0x0100
2021-01-19 20:55:45 Chain [0] BOM Version: 0x0100
2021-01-19 20:55:45 Chain [1] PCB Version: 0x0100
2021-01-19 20:55:45 Chain [1] BOM Version: 0x0100
2021-01-19 20:55:45 Chain [2] PCB Version: 0x0100
2021-01-19 20:55:45 Chain [2] BOM Version: 0x0100
2021-01-19 20:55:49 Fan check passed.
2021-01-19 20:55:50 chain[0] PIC jump to app
2021-01-19 20:55:52 Check chain[0] PIC fw version=0x89
2021-01-19 20:55:53 chain[1] PIC jump to app
2021-01-19 20:55:54 Check chain[1] PIC fw version=0x89
2021-01-19 20:55:56 chain[2] PIC jump to app
2021-01-19 20:55:57 Check chain[2] PIC fw version=0x89
2021-01-19 20:55:57 create thread
2021-01-19 20:55:57 max sensor num = 4
2021-01-19 20:55:57 temperature_monitor_thread start...
2021-01-19 20:56:02 power type version: 0x0071
2021-01-19 20:56:02 Enter sleep to make sure power release finish.
2021-01-19 20:56:45 Slept 42 seconds, diff = 5.
2021-01-19 20:56:45 set_voltage_by_steps to 1500.
2021-01-19 20:56:48 start up min temp by 75a = 25
2021-01-19 20:56:48 set UART baud to 115200
2021-01-19 20:56:56 Chain[0]: find 114 asic, times 0
2021-01-19 20:57:04 Chain[1]: find 114 asic, times 0
2021-01-19 20:57:12 Chain[2]: find 114 asic, times 0
2021-01-19 20:57:12 pulse_mode = 1, ccdly_sel = 1, pwth_sel = 1
2021-01-19 20:57:16 min freq in eeprom = 525
2021-01-19 20:57:16 fixed frequency is 525
2021-01-19 20:57:16 Bring up temperature is 25
2021-01-19 20:57:16 set UART baud to 12000000
2021-01-19 20:57:17 set_voltage_by_steps to 1260.
2021-01-19 20:57:24 STATUS_INITED: soc init done!
2021-01-19 20:57:24 create thread
2021-01-19 20:57:24 create thread
2021-01-19 20:57:25 fan_etf: Set fixed fan speed=97
2021-01-19 20:57:26 start to init...
2021-01-19 20:57:35 Init done!
2021-01-19 20:57:35 STATUS_OKAY
2021-01-19 20:57:35 start the cached job
2021-01-19 20:57:35 Version num 8
2021-01-19 20:57:35 Mask num 0xe000
2021-01-19 20:57:35 freq = 525, percent = 90, hcn = 12480, timeout = 449
2021-01-19 20:57:35 set_start_time_point total_tv_start_sys=133 total_tv_end_sys=134


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: philipma1957 on January 19, 2021, 10:33:54 PM
so when fans are 97% what rpm do they show?

and what temps do they show?

my lone s19pro is set to 70% fan

fans read

4080
3980
4440
4440

temps read

51.45------72.71
51.45------73.70
51.48------75.72


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on January 19, 2021, 11:54:12 PM
Quote
I have 250 units. About 8 of these randomly turn off for no reason at all.

What do these 8 miners have in common? Are they on the same breaker? Switch? Rack? When more than one miner acts the same way, it's usually an outside cause.

Quote
There are no errors, temps are fine, I tried different psu, different control board, fans are at 97%. If you know what the reason is please let me know. Thank you.

Miners turn off or reboot for many reasons, and heat is the most common. If the miner's temps are fine, it could be the PSU: if the PSU is dirty or isn't getting enough airflow, it will stop. Also, if the voltage goes out of the 200-240 V range, it can trigger protection mode.

The good news is here
Code:
2021-01-19 20:56:56 Chain[0]: find 114 asic, times 0
2021-01-19 20:57:04 Chain[1]: find 114 asic, times 0
2021-01-19 20:57:12 Chain[2]: find 114 asic, times 0

All three hash boards are in good shape. Finding the complete ASIC count on the first try is a good sign.

Since you have 250 of these, I probably don't need to tell you this, but are you sure your router/switch can handle all the traffic? That could also be the reason.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: philipma1957 on January 19, 2021, 11:59:31 PM
Good point. He may not have a good enough set of switches and/or router.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 20, 2021, 04:36:29 AM
Miners are very cold.
43 - 64
41 - 58
40 - 59

PCB Temps Are from low 35 to high 59
Chip Temps Are from 38 to high 64
These are the min and max for all chips on all 3 boards. Average is way below the high I listed above.

Fan Speeds do show running at 97%
Speeds Are: 5040 5040 6000 6000



My router is set up to handle 750+ IP addresses, so that's not the issue. But it could be a voltage drop or a voltage increase from the panel. What happens is that these miners randomly turn off, and yes, they are all located in one area, even though some miners in that same area do not have these issues. It might be the panel's voltage changing for some reason. I will look into that. As for the other possibilities: all these miners are brand new and have brand new PSUs. I even tried replacing 2 PSUs, but still the same problem. I tried a new control board, but still the same problem. The really weird part is that this happens when the temperature outside gets to 50 degrees, even though the miners are super cold. It might be something with the panel. The electricians could have made a mistake somewhere.



It can't be the router. On top of these S19 Pros I also have 220 S17 units running on the same network, so there are over 500 units on the network. I expanded the network to handle 750+ IP addresses. It goes from 10.1.10.1 to 10.1.12.255. Unless I did something wrong, but I can connect to these bad miners with no problem. It always works. Also, why would they shut down if there is a network problem? They would show a network error, but I don't see any errors at all.
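
For anyone who wants to double-check the size of that range, here is a minimal sketch using Python's standard `ipaddress` module. The variable names are only illustrative, and the 750 figure is the target mentioned in this post. It only counts addresses; it says nothing about how much traffic the router or switches can carry.

```python
import ipaddress

# Range endpoints as given in the post.
first = ipaddress.IPv4Address("10.1.10.1")
last = ipaddress.IPv4Address("10.1.12.255")

# Number of addresses in the inclusive range first..last.
address_count = int(last) - int(first) + 1

# Target taken from the post ("750+ IP addresses").
target = 750

print(address_count)            # 767
print(address_count >= target)  # True
```

So the range itself is large enough for 750 devices, with 17 addresses to spare.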


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on January 20, 2021, 06:36:17 AM
Just because you changed your network settings to class A doesn't mean the infrastructure can handle all the traffic. This is why some routers/switches cost 10x the price of others. But since the miners completely shut down, I would rule out the network altogether.

I would like to hear more about the "shut down / turn off" part. Does the PSU itself shut down and its fan stop spinning, or does the PSU shut down its main rail and keep the 12 V rail running, so the miner's fans and control board work but the hash boards don't?

Since you mentioned that you changed the control board and PSU, what I would do is move one of the bad miners to another rack, just to make sure this isn't an electricity-related issue. If it is not, then chances are one or more of these boards have a loose heatsink that loses contact with the chip when the miner gets warm. This is very common with the 17 series and could happen with the 19 series too; since this gear is new, we don't have much info about it. To troubleshoot that, you will need to run the miners with one board at a time to figure out which board or boards are causing the problem.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 20, 2021, 07:05:22 AM
We have a business modem/router from Comcast. It should handle all these miners just fine, and we have good upload and download speeds. What happens is that at random times the miners turn off, or to be more exact, they look like they restart and don't start mining after restarting. Or they just stop mining. The kernel log suggests that they are most likely restarting. Here is some kernel log output from a miner that has been running for 7 hours:

Code:
2021-01-19 23:48:58 freq = 525, percent = 90, hcn = 12480, timeout = 449
2021-01-19 23:48:58 set_start_time_point total_tv_start_sys=121 total_tv_end_sys=122
2021-01-19 23:48:58 set_voltage_by_steps to 1307.
2021-01-19 23:54:31 set_voltage_by_steps to 1287


You can see that it's running the set_voltage_by_steps function, which automatically regulates the voltage as time goes on. The miners that have the restart issue do not show multiple set_voltage_by_steps entries after they stop mining. That gives me the impression that the miners restarted, since there is only one entry from the bad miner, and once it restarted it's not mining anymore. The problem could be the heat sinks, but it seems that Bitmain acknowledged this problem with the S17 series and supposedly fixed it on the S19, so I would not put much thought into the heat sink problem. I will try to diagnose the panel for these miners. I can also try moving a bad miner to a different location and see if the problem goes away. If anyone has any other thoughts, let me know. This problem is really hard to figure out since there is ZERO kernel info.
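
Comparing good and bad miners by eye gets tedious with this many units, so here is a rough sketch of how the set_voltage_by_steps entries could be pulled out of a saved log. It assumes the log has been copied into a string; the names `STEP_LINE` and `voltage_steps` are made up for this example, and the line format is taken only from the excerpts in this thread.

```python
import re
from datetime import datetime

# Matches e.g. "2021-01-19 23:48:58 set_voltage_by_steps to 1307."
# The trailing period is optional because some log lines omit it.
STEP_LINE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) set_voltage_by_steps to (\d+)\.?[ \t]*$",
    re.MULTILINE,
)

def voltage_steps(log_text):
    """Return [(timestamp, value)] for every set_voltage_by_steps line, in log order."""
    return [
        (datetime.strptime(ts, "%Y-%m-%d %H:%M:%S"), int(value))
        for ts, value in STEP_LINE.findall(log_text)
    ]

sample = """\
2021-01-19 23:48:58 freq = 525, percent = 90, hcn = 12480, timeout = 449
2021-01-19 23:48:58 set_start_time_point total_tv_start_sys=121 total_tv_end_sys=122
2021-01-19 23:48:58 set_voltage_by_steps to 1307.
2021-01-19 23:54:31 set_voltage_by_steps to 1287
"""

steps = voltage_steps(sample)
print(len(steps), "adjustments; last at", steps[-1][0])
```

A log with only one entry and a last adjustment far in the past would match the pattern described for the bad miners, though that is a heuristic and not a diagnosis.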


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on January 20, 2021, 08:16:45 PM
Quote
The problem could be the heat sinks but it seems that Bitmain acknowledged this problem with the S17 series and supposedly fixed the issue on the S19.

I agree. On second thought, I remembered that the S19 has large heatsinks that each cover more than one chip and are fastened with screws, so it's unlikely. Still, one bad chip can cause a mess on these hash boards, so I would still do the board-by-board troubleshooting if the miner still acts weird even after you move it to a new location.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 20, 2021, 10:27:36 PM
Here is another miner. It just now stopped mining. Kernel Log is similar to all the other miners with the same problem.

Code:
2021-01-19 23:52:22 freq = 525, percent = 90, hcn = 12480, timeout = 449
2021-01-19 23:52:22 set_start_time_point total_tv_start_sys=270 total_tv_end_sys=271
2021-01-19 23:52:22 set_voltage_by_steps to 1292.
2021-01-19 23:53:47 set_voltage_by_steps to 1272.
2021-01-20 20:18:42 set_voltage_by_steps to 1260.
2021-01-20 21:45:40 set_voltage_by_steps to 1282.
2021-01-20 21:53:27 avg rate is lower than ideal rate, 72720.92 in 30 mins
2021-01-20 21:53:27 chain 0 hash rate 11401.00 low in 15 mins
2021-01-20 21:53:27 chain 1 hash rate 10699.00 low in 15 mins
2021-01-20 21:53:27 chain 2 hash rate 11628.00 low in 15 mins
2021-01-20 22:08:28 chain 0 hash rate 0.00 low in 15 mins
2021-01-20 22:08:28 chain 1 hash rate 0.00 low in 15 mins
2021-01-20 22:08:28 chain 2 hash rate 0.00 low in 15 mins
2021-01-20 22:23:29 avg rate is lower than ideal rate, 0.00 in 30 mins

You can see that the hash rate slowly came down to zero for no apparent reason. No errors, temps are good. It worked fine for 22 hours. It only happens when outside temps reach 50°F. If anyone has any other tips, please leave them. I will inspect the copper wires for any loose connection at the outlet and the breaker.
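
To pin down when each chain dropped out, the "hash rate ... low in 15 mins" lines can be grouped per chain. This is only a sketch based on the line format in the log above; `RATE_LINE`, `chain_rates`, and `first_zero_report` are hypothetical names.

```python
import re
from collections import defaultdict

# Matches e.g. "2021-01-20 21:53:27 chain 0 hash rate 11401.00 low in 15 mins"
RATE_LINE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) chain (\d+) hash rate (\d+(?:\.\d+)?) low in 15 mins",
    re.MULTILINE,
)

def chain_rates(log_text):
    """Map chain id -> list of (timestamp string, reported hash rate) in log order."""
    rates = defaultdict(list)
    for ts, chain, rate in RATE_LINE.findall(log_text):
        rates[int(chain)].append((ts, float(rate)))
    return dict(rates)

def first_zero_report(log_text):
    """Map chain id -> timestamp of its first 0.00 report; chains never at zero are omitted."""
    result = {}
    for chain, entries in chain_rates(log_text).items():
        for ts, rate in entries:
            if rate == 0.0:
                result[chain] = ts
                break
    return result

sample = """\
2021-01-20 21:53:27 avg rate is lower than ideal rate, 72720.92 in 30 mins
2021-01-20 21:53:27 chain 0 hash rate 11401.00 low in 15 mins
2021-01-20 21:53:27 chain 1 hash rate 10699.00 low in 15 mins
2021-01-20 21:53:27 chain 2 hash rate 11628.00 low in 15 mins
2021-01-20 22:08:28 chain 0 hash rate 0.00 low in 15 mins
2021-01-20 22:08:28 chain 1 hash rate 0.00 low in 15 mins
2021-01-20 22:08:28 chain 2 hash rate 0.00 low in 15 mins
2021-01-20 22:23:29 avg rate is lower than ideal rate, 0.00 in 30 mins
"""

print(first_zero_report(sample))
```

On this log all three chains report zero at the same timestamp, which points at something shared by the boards (power, control, environment) more than at a single bad board.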


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on January 21, 2021, 01:14:24 AM
Quote
You can see that the hash rate slowly came down to zero with no reason. No errors, temps are good. Worked fine for 22 hours. Only happens when outside temps go to 50°F. Any other tips from anyone please leave them. I will inspect the copper wires for any loose connection in the outlet and the breaker.

Some miners don't like to run in cold weather. I have seen many issues like this where people had to point the hot exhaust air of one miner at another miner that doesn't like temps that are too low. Your miner's fans are at 97%, and it seems like you set that on purpose. Could you drop it to 50%, or just auto, and see how it goes? You would know better how to troubleshoot the low temps, but just know it's not uncommon for some gear to stop hashing when it gets too cold.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 21, 2021, 02:22:24 AM
I don't think they have a cold problem. They run just fine at night when it's even colder. They only have a problem in the daytime, when temps outside go above 50°F.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: philipma1957 on January 21, 2021, 02:24:19 AM
[...]

So, as I asked you here: what do your temps read at 97%?

My room is over 90°F, and my fans are slower than yours.

please look at temps of the rigs.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 21, 2021, 02:31:24 AM
I posted my temp above. But here they are again:

Miners are very cold inlet and outlet:

43 - 64
41 - 58
40 - 59

PCB Temps Are from low 35 to high 59. Chip Temps Are from 38 to high 64. These are the min and max for all chips on all 3 boards. Average is way below the high I listed above.

Fan Speeds do show running at 97%. Speeds Are: 5040 5040 6000 6000.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: philipma1957 on January 21, 2021, 02:47:47 AM
Set fans to 80% and give back temps.

those cold chips could be the issue.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 22, 2021, 08:06:55 PM
I don't think it is the cold temps. I have 240 miners right next to these that are as cold or even colder, and none of those have these issues. It's about 8 miners out of 257 that have the issue, and all 8 are located in the same area and on the same panel.



Update:

I have tightened all the bolts in the panel: the bolts holding the breaker to the panel, the copper wire going into the breaker, and the copper wire going into the outlet. All have been tightened, but the problem is still present. Today the outside temps have reached 50°F, and inside the container it's also 50°F. Inlet and outlet temps are below 75 for all these miners, but they still stop mining. They are failing after mining for 20-30 minutes.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on January 23, 2021, 05:46:29 AM
Quote
I have tightened all the bolts in the panel. The bolts holding the breaker to the panel, the copper wire going into the breaker and the copper wire going into the outlet. All have been tightened but the problem is still present. Today the outside temps have reached 50°F and inside the container it's also 50°F. Inlet and outlet are below 75 for all these miners. But they still stop mining. They are breaking after mining for 20-30 minutes.

The reason I asked whether these "bad" miners were located in the same area is simply to help you troubleshoot further. Now that you have answered, it's actually good news: there is a good chance that the problem isn't the miners themselves. To confirm this, take 1-2 miners, put them elsewhere, and see if they act any differently.

You should also try reducing the fan speed on some of these miners to see if they run any longer or better. If this rack happens to face the intake fans directly, or sits slightly differently in the airflow, the miners can still act weird. I would also check the PSU screws and make sure all hash boards are tightened, connect the bad miners to a different switch using different cables, and use different power cords.

As for checking the breaker and the wires, you need to check the voltage right before the miners stop hashing. A damaged wire can cause a voltage drop, and this gear doesn't do well with low voltage. What could be happening is that after the wire has been loaded for 20-30 minutes it gets warm, the voltage drops, and the PSU goes into protection mode and stops feeding the hash boards.
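
To get a feel for the numbers, the voltage at the miner can be estimated with Ohm's law. This is only a sketch: the 13.5 A load, the 15 m run of 12 AWG copper, and the 0.5 ohm bad-connection figure are made-up example values, not measurements from this farm. The copper constants are standard textbook values, and the function names are hypothetical.

```python
RHO_CU_20C = 1.68e-8  # ohm*m, resistivity of copper at 20 C
ALPHA_CU = 0.00393    # per degree C, temperature coefficient of copper


def wire_resistance(length_m, area_mm2, temp_c):
    """Resistance in ohms of a copper conductor at the given temperature."""
    r20 = RHO_CU_20C * length_m / (area_mm2 * 1e-6)
    return r20 * (1 + ALPHA_CU * (temp_c - 20.0))


def voltage_at_load(supply_v, current_a, one_way_m, area_mm2, temp_c,
                    extra_ohm=0.0):
    """Voltage left at the load after the drop over both conductors.

    extra_ohm models added series resistance, e.g. a damaged strand or a
    loose lug (a hypothetical value, to be replaced by a measurement).
    """
    loop_ohm = wire_resistance(2 * one_way_m, area_mm2, temp_c) + extra_ohm
    return supply_v - current_a * loop_ohm


# Example values (assumptions): 240 V supply, 13.5 A load,
# 15 m one-way run of 12 AWG copper (3.31 mm^2).
for label, temp_c, extra in (("cool wire", 20.0, 0.0),
                             ("warm wire", 60.0, 0.0),
                             ("warm wire + bad joint", 60.0, 0.5)):
    volts = voltage_at_load(240.0, 13.5, 15.0, 3.31, temp_c, extra)
    print(f"{label}: {volts:.1f} V at the miner")
```

In this example the added series resistance accounts for most of the drop, so the numbers only tell you what to look for. Measuring at the outlet under load, just before a shutdown, is still the real test.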


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 23, 2021, 06:27:49 AM
I was thinking that maybe the PSU is overheating? It feels warm but not hot. The very back of the PSU, next to the miner's exhaust fans, is probably hot? The idea I have is that maybe they are getting too warm and shutting down. I have a half-inch space between the miners to allow some warm air to come back into the conex. It's only half an inch and seems like nothing. Could this be causing the problems? Maybe these PSUs are sensitive to heat and shut down? I have set up a test scenario with some of my miners: I moved some to a different location, and I made some have zero space between them. Next time temps are 50+ I will know if these miners have the same problem. If anyone knows anything about the S19 PSU shutting down because of heat, please leave some info. Thanks.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on January 23, 2021, 07:24:34 AM
Quote
If anyone knows anything about the S19 PSU shutting down because of heat, please leave some info. Thanks.

Of course! Nearly all PSUs have temperature protection and will shut down to protect themselves. A very common issue that miners face on a daily basis is their PSUs shutting down due to heat, mostly caused by dust and/or not enough space between the PSUs and the wall or other miners.

Quote
The very back of the psu next to the miners exhaust fans is probably hot

What's behind the PSU's exhaust is what matters most, but a bit of spacing between the miners will still help cool down the PSUs.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on January 31, 2021, 11:14:39 PM
I have some more info. I have replaced the PSU and left no space between any of the miners, so there is absolutely zero chance that the PSUs are overheating or that the boards / control boards are overheating. I even made a real-time video of one of my miners breaking / stopping mining. The issue has to be either bad hash board(s) or maybe voltage fluctuation. I'm thinking it's the bad boards, or maybe one bad board that causes the whole miner to shut down. Can you guys watch the video please and let me know what you think?

https://www.youtube.com/watch?v=Ip-y7uKqFgc


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on February 01, 2021, 05:05:49 AM
I saw the video. It shows basically nothing new compared to what you mentioned here; you explained the problem perfectly in this topic. However, you were supposed to report back the results of moving the miners to another location. That should be fairly easy to do, and it isn't just a random thing I asked you to try; it's the backbone of this troubleshooting process. Unplug one of the miners, take it somewhere far away, put it outside of the farm if you like, and see if it behaves any differently.

The result of this test will eliminate many doubts and narrow the problem down to only a few possibilities.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: baro0k on February 05, 2021, 12:26:49 AM
We have a business modem / router from Comcast. It should handle all these miners just fine. We also have good upload and download speeds. What happens is that at random times the miners turn off; to be more exact, they look like they restart and don't start mining after restarting, or they just stop mining. The kernel log shows that they are most likely restarting. Here is some kernel info from a miner that has been running for 7 hours:

2021-01-19 23:48:58 freq = 525, percent = 90, hcn = 12480, timeout = 449
2021-01-19 23:48:58 set_start_time_point total_tv_start_sys=121 total_tv_end_sys=122
2021-01-19 23:48:58 set_voltage_by_steps to 1307.
2021-01-19 23:54:31 set_voltage_by_steps to 1287


You can see that it's running the set_voltage_by_steps function, which auto-regulates the voltage as time goes on. The miners that have the restart issue do not show multiple entries for set_voltage_by_steps after they stop mining. That gives me the impression that the miners restarted, since there is only one entry from the bad miner, and once it restarted it's not mining anymore. The problem could be the heat sinks, but it seems that Bitmain acknowledged this problem with the S17 series and supposedly fixed it on the S19, so I would not put much thought into the heat sink problem. I will try to diagnose the panel for these miners. I can also try moving a bad miner to a different location and see if the problem goes away. If anyone has any other thoughts, let me know. This problem is really hard to figure out since there is ZERO kernel info.

I have a similar case

Code:
2021-01-22 13:00:13
2021-01-22 13:00:13
2021-01-22 13:00:13
2021-01-22 13:00:13
2021-01-22 13:00:13
2021-01-22 13:00:13
2021-01-22 13:00:13
2021-01-22 13:00:13
2021-01-22 13:00:13
2021-01-22 13:00:17
2021-01-22 13:00:17
2021-01-22 13:00:19
2021-01-22 13:00:20 chain[1] PIC jump to app
2021-01-22 13:00:22 Check chain[1] PIC fw version=0x89
2021-01-22 13:00:23 chain[2] PIC jump to app
2021-01-22 13:00:24 Check chain[2] PIC fw version=0x89
2021-01-22 13:00:25 create thread
2021-01-22 13:00:25 max sensor num = 4
2021-01-22 13:00:25 temperature_monitor_thread start...
2021-01-22 13:00:29 power type version: 0x0071
0x0060FFFFFFFFFFFF 0x0070FFFFFFFFFFFF 0x0080FFFFFFFFFFFF 0x0090FFFFFFFFFFFF 0x00A0FFFFFFFFFFFF 0x00B0FFFFFFFFFFFF 0x00C0FFFFFFFFFFFF 0x00D0FFFFFFFFFFFF 0x00E0FFFFFFFFFFFF 0x00F0FFFFFFFFFFFF
FF FF FF FF FF FF
FF FF FF FF FF FF FF FF FF FF FF FF
FF FF
FF FF FF FF FF FF FF FF FF FF FF FF FF FF FFFFFFFFFFFFFF
FFFFFFFFFFFFFF FFFFFFFFFFFFFF FFFFFFFFFFFFFF FF FF FF FF FF FF FF FF FF FF FF FF FF FF FF FF FF FF FF FF FF
FF FF FF FF FF FF FF
fan_eft : 0 fan_pwm : 100
create thread
fixed working voltage = 1260
Chain [0] PCB Version: 0x0100
Chain [0] BOM Version: 0x0100
Chain [1] PCB Version: 0x0100
Chain [1] BOM Version: 0x0100
Chain [2] PCB Version: 0x0100
Chain [2] BOM Version: 0x0100
Fan check passed.
chain[0] PIC jump to app
Check chain[0] PIC fw version=0x89
2021-01-22 13:00:29 Enter sleep to make sure power release finish.
2021-01-22 13:01:00 Slept 30 seconds, diff = 1.
2021-01-22 13:01:00 set_voltage_by_steps to 1500.
2021-01-22 13:01:05 start up min temp by 75a = 24
2021-01-22 13:01:05 set UART baud to 115200
2021-01-22 13:01:10 chain avg vol rise to 15.17
2021-01-22 13:01:13 Chain[0]: find 114 asic, times 0
2021-01-22 13:01:21 Chain[1]: find 114 asic, times 0
2021-01-22 13:01:29 Chain[2]: find 114 asic, times 0
2021-01-22 13:01:29 pulse_mode = 1, ccdly_sel = 1, pwth_sel = 1
2021-01-22 13:01:33 min freq in eeprom = 525
2021-01-22 13:01:33 fixed frequency is 525
2021-01-22 13:01:33 Bring up temperature is 24
2021-01-22 13:01:34 set UART baud to 12000000
2021-01-22 13:01:34 set_voltage_by_steps to 1260.
2021-01-22 13:01:41 STATUS_INITED: soc init done!
2021-01-22 13:01:41 create thread
2021-01-22 13:01:41 create thread
2021-01-22 13:01:43 start to init...
2021-01-22 13:01:52 Init done!
2021-01-22 13:01:52 STATUS_OKAY
2021-01-22 13:01:52 start the cached job
2021-01-22 13:01:52 Version num 8
2021-01-22 13:01:52 Mask num 0xe000
2021-01-22 13:01:52 freq = 525, percent = 90, hcn = 12480, timeout = 449
2021-01-22 13:01:52 set_start_time_point total_tv_start_sys=122 total_tv_end_sys=123
2021-01-22 13:57:32 set_voltage_by_steps to 1240.
2021-01-22 13:57:38 set_voltage_by_steps to 1260.
2021-01-22 13:58:54 set_voltage_by_steps to 1240.
2021-01-22 13:59:34 set_voltage_by_steps to 1260.
2021-01-22 13:59:38 set_voltage_by_steps to 1240.
2021-01-22 13:59:42 set_voltage_by_steps to 1260.
2021-01-22 14:00:54 set_voltage_by_steps to 1240.
2021-01-22 14:01:12 set_voltage_by_steps to 1

can you please explain more?
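
One way to compare a good miner with a bad one is to pull only the set_voltage_by_steps entries out of each log and look at how many there are and how often the value flips. Below is a small Python sketch, assuming the log format shown above. The function names are hypothetical. It matches on the timestamp, so it also works when several entries have ended up on one line.

```python
import re

# One entry: "YYYY-MM-DD HH:MM:SS set_voltage_by_steps to NNNN"
STEP = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) set_voltage_by_steps to (\d+)"
)


def voltage_steps(log_text):
    """Return [(timestamp, value), ...] in the order they appear."""
    return [(stamp, int(value)) for stamp, value in STEP.findall(log_text)]


def count_changes(steps):
    """Count how many times the requested value differs from the previous one."""
    values = [value for _, value in steps]
    return sum(1 for prev, cur in zip(values, values[1:]) if prev != cur)


# Entries copied from the log above, left on a single line on purpose
sample = (
    "2021-01-22 13:01:34 set_voltage_by_steps to 1260. "
    "2021-01-22 13:57:32 set_voltage_by_steps to 1240. "
    "2021-01-22 13:57:38 set_voltage_by_steps to 1260. "
    "2021-01-22 13:58:54 set_voltage_by_steps to 1240."
)

steps = voltage_steps(sample)
print(len(steps), "entries,", count_changes(steps), "changes")
for stamp, value in steps:
    print(stamp, value)
```

If a paste was cut off in the middle of the last entry, that entry will parse as a bogus small number, so check the final value before reading anything into it.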


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on February 05, 2021, 06:45:31 AM
Try tightening the bolts on the boards and the psu. Tighten all the bolts that transfer power. Also update the firmware. There is new firmware that came out 2 days ago. Let us know if that fixes the issue for you.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on February 06, 2021, 04:38:26 AM
I moved one of the bad miners to a different outlet and now it breaks even faster. I'm getting a little more info in the kernel log. Before I would get zero info, but now I'm getting: (chain avg vol rise), (chain avg vol drop), and (read asic reg error).

Something is wrong with these miners. Here is the full log for the miner I moved to a different outlet. Before, these errors would not be in the kernel log; it would restart with no error. Now you can see I'm getting more info. Any ideas what it could be? Keep in mind that if it's the PSU, that would mean there are 20+ bad PSUs out of the 257 miners. Also, these miners are brand new and have only been mining for 1 month. The PSUs are brand new.

Code:
free_area_init_node: node 0, pgdat c0b3c040, node_mem_map cde10000
  Normal zone: 480 pages used for memmap
  Normal zone: 0 pages reserved
  Normal zone: 61440 pages, LIFO batch:15
percpu: Embedded 12 pages/cpu @cddf0000 s19916 r8192 d21044 u49152
pcpu-alloc: s19916 r8192 d21044 u49152 alloc=12*4096
pcpu-alloc: [0] 0 [0] 1
Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 60960
Kernel command line: mem=240M console=ttyPS0,115200 ramdisk_size=33554432 root=/dev/ram rw earlyprintk
PID hash table entries: 1024 (order: 0, 4096 bytes)
Dentry cache hash table entries: 32768 (order: 5, 131072 bytes)
Inode-cache hash table entries: 16384 (order: 4, 65536 bytes)
Memory: 209672K/245760K available (6317K kernel code, 243K rwdata, 1932K rodata, 1024K init, 232K bss, 19704K reserved, 16384K cma-reserved, 0K highmem)
Virtual kernel memory layout:
    vector  : 0xffff0000 - 0xffff1000   (   4 kB)
    fixmap  : 0xffc00000 - 0xfff00000   (3072 kB)
    vmalloc : 0xcf800000 - 0xff800000   ( 768 MB)
    lowmem  : 0xc0000000 - 0xcf000000   ( 240 MB)
    pkmap   : 0xbfe00000 - 0xc0000000   (   2 MB)
    modules : 0xbf000000 - 0xbfe00000   (  14 MB)
      .text : 0xc0008000 - 0xc090e410   (9242 kB)
      .init : 0xc0a00000 - 0xc0b00000   (1024 kB)
      .data : 0xc0b00000 - 0xc0b3cda0   ( 244 kB)
       .bss : 0xc0b3cda0 - 0xc0b77024   ( 233 kB)
Preemptible hierarchical RCU implementation.
Build-time adjustment of leaf fanout to 32.
RCU restricting CPUs from NR_CPUS=4 to nr_cpu_ids=2.
RCU: Adjusting geometry for rcu_fanout_leaf=32, nr_cpu_ids=2
NR_IRQS:16 nr_irqs:16 16
efuse mapped to cf800000
ps7-slcr mapped to cf802000
L2C: platform modifies aux control register: 0x72360000 -> 0x72760000
L2C: DT/platform modifies aux control register: 0x72360000 -> 0x72760000
L2C-310 erratum 769419 enabled
L2C-310 enabling early BRESP for Cortex-A9
L2C-310 full line of zeros enabled for Cortex-A9
L2C-310 ID prefetch enabled, offset 1 lines
L2C-310 dynamic clock gating enabled, standby mode enabled
L2C-310 cache controller enabled, 8 ways, 512 kB
L2C-310: CACHE_ID 0x410000c8, AUX_CTRL 0x76760001
zynq_clock_init: clkc starts at cf802100
Zynq clock init
sched_clock: 64 bits at 333MHz, resolution 3ns, wraps every 4398046511103ns
clocksource: arm_global_timer: mask: 0xffffffffffffffff max_cycles: 0x4ce07af025, max_idle_ns: 440795209040 ns
Switching to timer-based delay loop, resolution 3ns
clocksource: ttc_clocksource: mask: 0xffff max_cycles: 0xffff, max_idle_ns: 537538477 ns
ps7-ttc #0 at cf80a000, irq=18
Console: colour dummy device 80x30
Calibrating delay loop (skipped), value calculated using timer frequency.. 666.66 BogoMIPS (lpj=3333333)
pid_max: default: 32768 minimum: 301
Mount-cache hash table entries: 1024 (order: 0, 4096 bytes)
Mountpoint-cache hash table entries: 1024 (order: 0, 4096 bytes)
CPU: Testing write buffer coherency: ok
CPU0: thread -1, cpu 0, socket 0, mpidr 80000000
Setting up static identity map for 0x100000 - 0x100058
CPU1: failed to boot: -1
Brought up 1 CPUs
SMP: Total of 1 processors activated (666.66 BogoMIPS).
CPU: All CPU(s) started in SVC mode.
devtmpfs: initialized
VFP support v0.3: implementor 41 architecture 3 part 30 variant 9 rev 4
clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 19112604462750000 ns
pinctrl core: initialized pinctrl subsystem
NET: Registered protocol family 16
DMA: preallocated 256 KiB pool for atomic coherent allocations
cpuidle: using governor menu
hw-breakpoint: found 5 (+1 reserved) breakpoint and 1 watchpoint registers.
hw-breakpoint: maximum watchpoint size is 4 bytes.
zynq-ocm f800c000.ps7-ocmc: ZYNQ OCM pool: 256 KiB @ 0xcf880000
vgaarb: loaded
SCSI subsystem initialized
usbcore: registered new interface driver usbfs
usbcore: registered new interface driver hub
usbcore: registered new device driver usb
media: Linux media interface: v0.10
Linux video capture interface: v2.00
pps_core: LinuxPPS API ver. 1 registered
pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti <giometti@linux.it>
PTP clock support registered
EDAC MC: Ver: 3.0.0
Advanced Linux Sound Architecture Driver Initialized.
clocksource: Switched to clocksource arm_global_timer
NET: Registered protocol family 2
TCP established hash table entries: 2048 (order: 1, 8192 bytes)
TCP bind hash table entries: 2048 (order: 2, 16384 bytes)
TCP: Hash tables configured (established 2048 bind 2048)
UDP hash table entries: 256 (order: 1, 8192 bytes)
UDP-Lite hash table entries: 256 (order: 1, 8192 bytes)
NET: Registered protocol family 1
RPC: Registered named UNIX socket transport module.
RPC: Registered udp transport module.
RPC: Registered tcp transport module.
RPC: Registered tcp NFSv4.1 backchannel transport module.
PCI: CLS 0 bytes, default 64
Trying to unpack rootfs image as initramfs...
rootfs image is not initramfs (no cpio magic); looks like an initrd
Freeing initrd memory: 6632K (cd480000 - cdafa000)
hw perfevents: enabled with armv7_cortex_a9 PMU driver, 7 counters available
futex hash table entries: 512 (order: 3, 32768 bytes)
workingset: timestamp_bits=28 max_order=16 bucket_order=0
jffs2: version 2.2. (NAND) (SUMMARY)  © 2001-2006 Red Hat, Inc.
io scheduler noop registered
io scheduler deadline registered
io scheduler cfq registered (default)
dma-pl330 f8003000.ps7-dma: Loaded driver for PL330 DMAC-241330
dma-pl330 f8003000.ps7-dma: DBUFF-128x8bytes Num_Chans-8 Num_Peri-4 Num_Events-16
e0000000.serial: ttyPS0 at MMIO 0xe0000000 (irq = 159, base_baud = 6249999) is a xuartps
console [ttyPS0] enabled
xdevcfg f8007000.ps7-dev-cfg: ioremap 0xf8007000 to cf86e000
[drm] Initialized drm 1.1.0 20060810
brd: module loaded
loop: module loaded
CAN device driver interface
gpiod_set_value: invalid GPIO
libphy: MACB_mii_bus: probed
macb e000b000.ethernet eth0: Cadence GEM rev 0x00020118 at 0xe000b000 irq 31 (00:0a:35:00:00:00)
Generic PHY e000b000.etherne:00: attached PHY driver [Generic PHY] (mii_bus:phy_addr=e000b000.etherne:00, irq=-1)
e1000e: Intel(R) PRO/1000 Network Driver - 3.2.6-k
e1000e: Copyright(c) 1999 - 2015 Intel Corporation.
ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
ehci-pci: EHCI PCI platform driver
usbcore: registered new interface driver usb-storage
mousedev: PS/2 mouse device common for all mice
i2c /dev entries driver
cdns-i2c e0005000.ps7_i2c: 100 kHz mmio e0005000 irq 154
Xilinx Zynq CpuIdle Driver started
sdhci: Secure Digital Host Controller Interface driver
sdhci: Copyright(c) Pierre Ossman
sdhci-pltfm: SDHCI platform and OF driver helper
mmc0: SDHCI controller on e0100000.ps7-sdio [e0100000.ps7-sdio] using ADMA
ledtrig-cpu: registered to indicate activity on CPUs
usbcore: registered new interface driver usbhid
usbhid: USB HID core driver
nand: disable subpage write
nand: device found, Manufacturer ID: 0x2c, Chip ID: 0xda
nand: Micron MT29F2G08ABAEAWP
nand: 256 MiB, SLC, erase size: 128 KiB, page size: 2048, OOB size: 64
nand: NAND_ECC_HW
nand: NAND_ECC_HW_SYNDROME
mtd->writesize = 2048
ecc->strength = 1
ecc->size = 2048
mtd->writesize = 2048
chip->ecc_strength_ds = 4
chip->ecc_step_ds = 512
nand: WARNING: pl35x-nand: the ECC used on your system is too weak compared to the one required by the NAND chip
Bad block table found at page 131008, version 0x01
Bad block table found at page 130944, version 0x01
8 ofpart partitions found on MTD device pl35x-nand
Creating 8 MTD partitions on "pl35x-nand":
0x000000000000-0x000002800000 : "BOOT.bin-dts-marker-kernel"
0x000002800000-0x000004800000 : "ramfs"
0x000004800000-0x000005000000 : "configs"
0x000005000000-0x000005200000 : "sig"
0x000005200000-0x000006000000 : "reserve1"
0x000006000000-0x000007000000 : "upgrade-ramfs"
0x000007000000-0x00000a800000 : "upgrade-file"
0x00000a800000-0x000010000000 : "reserve2"
nf_conntrack version 0.5.0 (3635 buckets, 14540 max)
ip_tables: (C) 2000-2006 Netfilter Core Team
NET: Registered protocol family 10
ip6_tables: (C) 2000-2006 Netfilter Core Team
sit: IPv6 over IPv4 tunneling driver
NET: Registered protocol family 17
can: controller area network core (rev 20120528 abi 9)
NET: Registered protocol family 29
can: raw protocol (rev 20120528)
can: broadcast manager protocol (rev 20120528 t)
can: netlink gateway (rev 20130117) max_hops=1
zynq_pm_ioremap: no compatible node found for 'xlnx,zynq-ddrc-a05'
zynq_pm_late_init: Unable to map DDRC IO memory.
Registering SWP/SWPB emulation handler
hctosys: unable to open rtc device (rtc0)
ALSA device list:
  No soundcards found.
RAMDISK: gzip image found at block 0
EXT4-fs (ram0): couldn't mount as ext3 due to feature incompatibilities
EXT4-fs warning (device ram0): ext4_update_dynamic_rev:746: updating to rev 1 because of new feature flag, running e2fsck is recommended
EXT4-fs (ram0): mounted filesystem without journal. Opts: (null)
VFS: Mounted root (ext4 filesystem) on device 1:0.
devtmpfs: mounted
Freeing unused kernel memory: 1024K (c0a00000 - c0b00000)
EXT4-fs (ram0): re-mounted. Opts: block_validity,delalloc,barrier,user_xattr,errors=remount-ro
devpts: called with bogus options
ubi0: attaching mtd2
ubi0: scanning is finished
ubi0: attached mtd2 (name "configs", size 8 MiB)
ubi0: PEB size: 131072 bytes (128 KiB), LEB size: 126976 bytes
ubi0: min./max. I/O unit sizes: 2048/2048, sub-page size 2048
ubi0: VID header offset: 2048 (aligned 2048), data offset: 4096
ubi0: good PEBs: 64, bad PEBs: 0, corrupted PEBs: 0
ubi0: user volume: 1, internal volumes: 1, max. volumes count: 128
ubi0: max/mean erase counter: 4/1, WL threshold: 4096, image sequence number: 262741923
ubi0: available PEBs: 36, total reserved PEBs: 28, PEBs reserved for bad PEB handling: 4
ubi0: background thread "ubi_bgt0d" started, PID 729
UBIFS (ubi0:0): background thread "ubifs_bgt0_0" started, PID 733
UBIFS (ubi0:0): recovery needed
UBIFS (ubi0:0): recovery completed
UBIFS (ubi0:0): UBIFS: mounted UBI device 0, volume 0, name "configs"
UBIFS (ubi0:0): LEB size: 126976 bytes (124 KiB), min./max. I/O unit sizes: 2048 bytes/2048 bytes
UBIFS (ubi0:0): FS size: 1396736 bytes (1 MiB, 11 LEBs), journal size 888833 bytes (0 MiB, 5 LEBs)
UBIFS (ubi0:0): reserved for root: 65970 bytes (64 KiB)
UBIFS (ubi0:0): media format: w4/r0 (latest is w4/r0), UUID A9E8BDA4-70DE-45D9-83BE-2DD9F129C0C6, small LPT model
ubi2: attaching mtd4
ubi2: scanning is finished
ubi2: attached mtd4 (name "reserve1", size 14 MiB)
ubi2: PEB size: 131072 bytes (128 KiB), LEB size: 126976 bytes
ubi2: min./max. I/O unit sizes: 2048/2048, sub-page size 2048
ubi2: VID header offset: 2048 (aligned 2048), data offset: 4096
ubi2: good PEBs: 112, bad PEBs: 0, corrupted PEBs: 0
ubi2: user volume: 1, internal volumes: 1, max. volumes count: 128
ubi2: max/mean erase counter: 5/2, WL threshold: 4096, image sequence number: 443719576
ubi2: available PEBs: 0, total reserved PEBs: 112, PEBs reserved for bad PEB handling: 4
ubi2: background thread "ubi_bgt2d" started, PID 740
UBIFS (ubi2:0): background thread "ubifs_bgt2_0" started, PID 744
UBIFS (ubi2:0): recovery needed
UBIFS (ubi2:0): recovery completed
UBIFS (ubi2:0): UBIFS: mounted UBI device 2, volume 0, name "misc"
UBIFS (ubi2:0): LEB size: 126976 bytes (124 KiB), min./max. I/O unit sizes: 2048 bytes/2048 bytes
UBIFS (ubi2:0): FS size: 11935744 bytes (11 MiB, 94 LEBs), journal size 1015809 bytes (0 MiB, 6 LEBs)
UBIFS (ubi2:0): reserved for root: 563754 bytes (550 KiB)
UBIFS (ubi2:0): media format: w4/r0 (latest is w4/r0), UUID 83280604-31C3-4236-B043-08F4084F4891, small LPT model
IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
random: avahi-daemon urandom read with 2 bits of entropy available
macb e000b000.ethernet eth0: unable to generate target frequency: 25000000 Hz
macb e000b000.ethernet eth0: link up (100/Full)
IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
In axi fpga driver!
request_mem_region OK!
AXI fpga dev virtual address is 0xcfbc8000
*base_vir_addr = 0xb031
In fpga mem driver!
request_mem_region OK!
fpga mem virtual address is 0xd2000000
random: nonblocking pool is initialized


===========================================Miner log===========================================
1970-01-01 00:00:09 Miner compile time: Fri Dec 11 11:23:44 CST 2020 type: Antminer S19 Pro
1970-01-01 00:00:10 This is fix-freq version
1970-01-01 00:00:10 Miner compile time: Fri Dec 11 11:23:44 CST 2020 type: Antminer S19 Pro
1970-01-01 00:00:10 commit version: 1821c90 2020-11-16 16:05:37, build by: jenkins 2020-12-11 11:35:43
1970-01-01 00:00:10 opt_multi_version     = 1
1970-01-01 00:00:10 opt_bitmain_ab        = 1
1970-01-01 00:00:10 mid_auto_gen          = 0
1970-01-01 00:00:10 opt_bitmain_work_mode = 0
1970-01-01 00:00:10 mmap fpga_mem_addr_hal = 0xb5900000
1970-01-01 00:00:10 HASH_ON_PLUG V9 = 0x7
1970-01-01 00:00:10 Note: front fan is power on!
1970-01-01 00:00:10 Note: rear fan is power on!
1970-01-01 00:00:10 start the http log.
1970-01-01 00:00:10 httpListenThread start ret=0
1970-01-01 00:00:10 start listen on 6060 ...
1970-01-01 00:00:10 load machine NBP1901 conf
1970-01-01 00:00:10 machine : NBP1901
1970-01-01 00:00:10 chain_num 4, chain_domain_num 38, chain_asic_num 114, domain_asic_num 3
2021-02-06 01:23:05 miner ID : 801265c85710481c
2021-02-06 01:23:05 FPGA Version = 0xB031
2021-02-06 01:23:05 HASH_ON_PLUG V9 = 0x7
2021-02-06 01:23:05 ==========================capability start==========================
2021-02-06 01:23:05 board num = 3
2021-02-06 01:23:05 board id = 0, chain num = 1
2021-02-06 01:23:05 chain id = 0
2021-02-06 01:23:05 board id = 1, chain num = 1
2021-02-06 01:23:05 chain id = 1
2021-02-06 01:23:05 board id = 2, chain num = 1
2021-02-06 01:23:05 chain id = 2
2021-02-06 01:23:05 ==========================capability end============================
2021-02-06 01:23:05 chain num = 3
2021-02-06 01:23:07 [chain 0]
2021-02-06 01:23:07 0x0000 11 42 72 E7 87 CF FD 7F   93 C8 CF A1 C9 26 17 EA
2021-02-06 01:23:07 0x0010 3E FB 3F 8C 61 DC B4 77   C9 04 15 70 A9 F6 E7 A4
2021-02-06 01:23:07 0x0020 C0 F2 A1 97 33 5E FD 7F   2E 57 E6 7D 9E 2B FE 39
2021-02-06 01:23:07 0x0030 D3 0E BE FE 70 E7 7B BD   10 35 D2 05 82 8C 8C 63
2021-02-06 01:23:07 0x0040 F5 CC FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x0050 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x0060 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x0070 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x0080 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x0090 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x00A0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x00B0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x00C0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x00D0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x00E0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:07 0x00F0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF 5A
2021-02-06 01:23:07
2021-02-06 01:23:09 [chain 1]
2021-02-06 01:23:09 0x0000 11 42 D2 04 7D 08 1D 10   6F 71 FE CA A1 80 14 0D
2021-02-06 01:23:09 0x0010 C8 4A 73 F1 78 EC 4F 1B   8D 8B CF D8 C4 19 E8 1F
2021-02-06 01:23:09 0x0020 73 A4 F0 AA 52 48 E8 91   36 B8 FA C5 FD DB CC C9
2021-02-06 01:23:09 0x0030 AC 3A 05 B3 E0 2E 74 4E   E0 FB C7 F0 18 10 4C BE
2021-02-06 01:23:09 0x0040 69 90 FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x0050 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x0060 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x0070 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x0080 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x0090 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x00A0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x00B0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x00C0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x00D0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x00E0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:09 0x00F0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF 5A
2021-02-06 01:23:09
2021-02-06 01:23:10 [chain 2]
2021-02-06 01:23:10 0x0000 11 42 FE 43 A1 7F ED E1   4B F6 00 7C 82 B4 59 14
2021-02-06 01:23:10 0x0010 36 50 5D 70 F6 5B 9C 33   88 B2 BF E0 2D 85 79 57
2021-02-06 01:23:10 0x0020 BA 6A 14 67 BC 45 C7 01   6A EB AC 0E B4 39 7E 2F
2021-02-06 01:23:10 0x0030 75 48 96 9D 3C 74 50 D3   EB 6E DE C1 41 5B 54 9F
2021-02-06 01:23:10 0x0040 04 B4 FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x0050 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x0060 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x0070 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x0080 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x0090 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x00A0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x00B0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x00C0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x00D0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x00E0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF FF
2021-02-06 01:23:10 0x00F0 FF FF FF FF FF FF FF FF   FF FF FF FF FF FF FF 5A
2021-02-06 01:23:10
2021-02-06 01:23:10 fan_eft : 1  fan_pwm : 90
2021-02-06 01:23:10 create thread
2021-02-06 01:23:10 fixed working voltage = 1260
2021-02-06 01:23:10 Chain [0] PCB Version: 0x0100
2021-02-06 01:23:10 Chain [0] BOM Version: 0x0100
2021-02-06 01:23:10 Chain [1] PCB Version: 0x0100
2021-02-06 01:23:10 Chain [1] BOM Version: 0x0100
2021-02-06 01:23:10 Chain [2] PCB Version: 0x0100
2021-02-06 01:23:10 Chain [2] BOM Version: 0x0100
2021-02-06 01:23:14 Fan check passed.
2021-02-06 01:23:15 chain[0] PIC jump to app
2021-02-06 01:23:17 Check chain[0] PIC fw version=0x89
2021-02-06 01:23:18 chain[1] PIC jump to app
2021-02-06 01:23:19 Check chain[1] PIC fw version=0x89
2021-02-06 01:23:21 chain[2] PIC jump to app
2021-02-06 01:23:22 Check chain[2] PIC fw version=0x89
2021-02-06 01:23:22 create thread
2021-02-06 01:23:22 max sensor num = 4
2021-02-06 01:23:22 temperature_monitor_thread start...
2021-02-06 01:23:27 power type version: 0x0071
2021-02-06 01:23:27 Enter sleep to make sure power release finish.
2021-02-06 01:26:28 Slept 180 seconds, diff = 8.
2021-02-06 01:26:28 set_voltage_by_steps to 1500.
2021-02-06 01:26:33 start up min temp by 75a = 17
2021-02-06 01:26:33 set UART baud to 115200
2021-02-06 01:26:41 Chain[0]: find 114 asic, times 0
2021-02-06 01:26:43 chain avg vol rise to 15.46
2021-02-06 01:26:48 Chain[1]: find 114 asic, times 0
2021-02-06 01:26:56 Chain[2]: find 114 asic, times 0
2021-02-06 01:26:56 pulse_mode = 1, ccdly_sel = 1, pwth_sel = 1
2021-02-06 01:27:01 min freq in eeprom = 525
2021-02-06 01:27:01 fixed frequency is 525
2021-02-06 01:27:01 Bring up temperature is 17
2021-02-06 01:27:01 set UART baud to 12000000
2021-02-06 01:27:01 set_voltage_by_steps to 1267.
2021-02-06 01:27:08 STATUS_INITED: soc init done!
2021-02-06 01:27:08 create thread
2021-02-06 01:27:08 create thread
2021-02-06 01:27:09 fan_etf: Set fixed fan speed=90
2021-02-06 01:27:10 start to init...
2021-02-06 01:27:19 Init done!
2021-02-06 01:27:19 STATUS_OKAY
2021-02-06 01:27:19 start the cached job
2021-02-06 01:27:19 Version num 8
2021-02-06 01:27:19 Mask num 0xe000
2021-02-06 01:27:19 freq = 525, percent = 90, hcn = 12480, timeout = 449
2021-02-06 01:27:19 set_start_time_point total_tv_start_sys=271 total_tv_end_sys=272
2021-02-06 01:27:19 set_voltage_by_steps to 1290.
2021-02-06 01:28:42 set_voltage_by_steps to 1270.
2021-02-06 01:32:48 set_voltage_by_steps to 1260.
2021-02-06 02:32:10 chain avg vol drop to 0.99
2021-02-06 02:34:47 set_voltage_by_steps to 1280.
2021-02-06 02:35:27 read asic reg error: expect chain = 0, chip = 0, reg = 176, got chain = 1, chip = 48, reg = 128
2021-02-06 02:35:35 chain avg vol rise to 13.25
2021-02-06 02:42:11 chain 0 hash rate 12253.00 low in 15 mins
2021-02-06 02:42:11 chain 1 hash rate 12031.00 low in 15 mins
2021-02-06 02:42:11 chain 2 hash rate 12252.00 low in 15 mins
2021-02-06 02:57:12 avg rate is lower than ideal rate, 18268.86 in 30 mins
2021-02-06 02:57:12 chain 0 hash rate 0.00 low in 15 mins
2021-02-06 02:57:12 chain 1 hash rate 0.00 low in 15 mins
2021-02-06 02:57:12 chain 2 hash rate 0.00 low in 15 mins
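
For anyone digging through a log like the one above, here is a rough Python sketch that pulls out the lines of interest. The marker strings are copied from the log. The helper names (`scan`, `MARKERS`) are hypothetical, and what each marker means electrically is an assumption, not documented firmware behavior.

```python
import re
from collections import defaultdict

# Timestamped entry: "YYYY-MM-DD HH:MM:SS message"
LINE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) (.*)$")
# Per-chain rate line as it appears in the log above
CHAIN_RATE = re.compile(r"^chain (\d+) hash rate ([\d.]+) low in 15 mins$")
# Marker strings copied from the log; their exact meaning is an assumption
MARKERS = ("chain avg vol drop", "chain avg vol rise", "read asic reg error")


def scan(log_text):
    """Return (events, rates).

    events: list of (timestamp, message) for lines starting with a marker.
    rates:  dict mapping timestamp -> sum of the per-chain hash rates
            reported at that timestamp.
    """
    events = []
    rates = defaultdict(float)
    for raw in log_text.splitlines():
        match = LINE.match(raw.strip())
        if not match:
            continue  # kernel lines and blank lines carry no timestamp
        stamp, msg = match.groups()
        if msg.startswith(MARKERS):
            events.append((stamp, msg))
        rate = CHAIN_RATE.match(msg)
        if rate:
            rates[stamp] += float(rate.group(2))
    return events, dict(rates)


# Lines copied from the log above
sample = """\
2021-02-06 01:32:48 set_voltage_by_steps to 1260.
2021-02-06 02:32:10 chain avg vol drop to 0.99
2021-02-06 02:35:27 read asic reg error: expect chain = 0, chip = 0, reg = 176, got chain = 1, chip = 48, reg = 128
2021-02-06 02:35:35 chain avg vol rise to 13.25
2021-02-06 02:42:11 chain 0 hash rate 12253.00 low in 15 mins
2021-02-06 02:42:11 chain 1 hash rate 12031.00 low in 15 mins
2021-02-06 02:42:11 chain 2 hash rate 12252.00 low in 15 mins
2021-02-06 02:57:12 chain 0 hash rate 0.00 low in 15 mins
2021-02-06 02:57:12 chain 1 hash rate 0.00 low in 15 mins
2021-02-06 02:57:12 chain 2 hash rate 0.00 low in 15 mins
"""

events, rates = scan(sample)
for stamp, msg in events:
    print(stamp, msg)
for stamp, total in sorted(rates.items()):
    print(stamp, "total of chains:", total)
```

On this sample the two totals are 36536 and 0. Their mean is 18268, close to the 18268.86 the log reports as the 30-minute average. That would fit a miner that hashed normally in one 15-minute window and not at all in the next, though how the firmware actually computes the figure is a guess.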


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: irrWN on February 07, 2021, 12:16:07 AM
Hey, I have the same problem, but I found one nuance. Have you tried starting only one of these boards?

When I start only one of the 3 boards, it works. With more than one board, I get the error.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on February 07, 2021, 07:31:04 PM
Please use the code tag to paste kernel logs. My take on this issue is that you damaged all those 20 PSUs by feeding them improper voltage or through some other electrical issue. The fact that the PSU acts differently when plugged in elsewhere gives more strength to the theory. To confirm or deny it, use a PSU which has not been plugged in that "bad" area of the farm on one of the "bad" miners. Obviously, the test should be done on a different outlet/area. Chances are it will work perfectly fine; if not, then there is even worse news, which I doubt for now.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on February 12, 2021, 09:48:57 PM
Quote
Please use the code tag to paste kernel logs. My take on this issue is that you damaged all those 20 PSUs by feeding them improper voltage or through some other electrical issue. The fact that the PSU acts differently when plugged in elsewhere gives more strength to the theory. To confirm or deny it, use a PSU which has not been plugged in that "bad" area of the farm on one of the "bad" miners. Obviously, the test should be done on a different outlet/area. Chances are it will work perfectly fine; if not, then there is even worse news, which I doubt for now.

Looks like I solved the issue. The back of the PSU is heating up, even though the chips, the PCB, and the rest of the PSU are cool. The very back of the PSU is not. I think the fans on the PSU are not fast enough, which causes this issue. Make sure you remove all the heat. This problem does not exist on the S17 series; it is only on the S19. Watch the video.
https://www.youtube.com/watch?v=qg7HFFAkB7A


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: mikeywith on February 13, 2021, 01:04:10 AM
I said in the very first post in this "long" thread:

... if the PSU is dirty or isn't getting enough airflow it will stop.

You even suspected that the back of the PSU was getting hot. It looks like your initial troubleshooting went wrong and led you (and us, for that matter) into thinking it was something else.

And FYI, this isn't an S19-only problem. It's common with the 17 series as well; the S17 Pro is kind of an exception due to its high efficiency and relatively low overall power consumption. Of course, weaker fans on the S19 PSU could make this problem even worse, but bad PSU airflow is a very common problem.

Thanks for reporting back (most people don't), it was quite a ride, but we all learned some stuff along the way. Mine on.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: CryptoLLC on February 13, 2021, 02:32:28 AM
Yeah. The reason I did not suspect the PSU is that the chips on the boards are below 50 and the PSU is cool to the touch. Also, it's 40°F inside the container, so I assumed it couldn't be the PSU. (The PSUs are brand new and have zero dust.) Also, I have an even worse heat situation with the S17+ and none of them have problems. These S19 Pro units have very sensitive PSUs compared to the S17+.


Title: Re: S19 Pro Randomly Stops working with no Kernel Errors
Post by: PT-Mining on October 17, 2021, 07:09:26 PM
I think I have a similar issue.
What was shown in the YouTube video?
Sadly, it is offline now...

I wonder how units work that use intake and exhaust adapters for ducted colder air. Usually the PSU has no separate duct on these, so it still gets only room-temperature air, with no extra or cooler airflow.

Or do such ducted solutions work badly anyhow?