Bitcoin Forum
May 13, 2024, 11:15:13 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1]
  Print  
Author Topic: PLEASE help :( Antminer Z11's Hashboards randomly dis- and reappearing  (Read 188 times)
Exl (OP)
Newbie
*
Offline Offline

Activity: 13
Merit: 0


View Profile
June 23, 2020, 06:31:33 AM
Last edit: July 04, 2020, 07:50:29 AM by Exl
 #1

Quote
** PS: summarized my issue in the last post with some additional testing. If anyone helped me troubleshoot this further, I'd be so so grateful. Really don't know what's the issue here. **


Hi people.

my Antminer Z11 is doing weird things and I can't really comprehend what is going on. I hope someone more experienced can make a guess what the problem is or give me an idea how to figure it out. I feel literally fooled by this machine.


Pre-story, somehow interconnected:

Situation began when some time ago my Z11's hashrate was dropping all of a sudden by one third. I believe it was after dusting off / cleaning. Since then a few times a board went missing. After reconnecting PCI-E cables, the situation has kind of resolved, although I found it weird and didn't quite understand. Maybe power cables are not tight enough or cables too stretched, who knows. Miner kept mining, then a week or two later:

One fan suddenly failed. Stopped spinning, not being detected anymore. Coincidence? I believed not but replugging and exchanging power cables hasn't helped like with the hash boards. The fan is dead. Not the cable, not the connector, the fan itself. Consequently I ordered a new one, replaced it and ....drum roll ... all good! Fans spinning, boards hashing.

I then started isolating my IT room properly and renovating a bit (months of lockdown in SE-Asia), so mining gear was mainly switched off and only operating on a weekly basis.


Now


my room's eventually finished ,computers booting up and surprise surprise, a hash board is missing. Sometimes two are missing. Sometimes asics on each chain missing.

Code:
Jun 23 02:08:25 (none) local0.notice cgminer[971]: software_set_address_chain chainID0 asics 0 addrInterval 64
Jun 23 02:08:25 (none) local0.notice cgminer[971]: software_set_address_chain chainID1 asics 2 addrInterval 64
Jun 23 02:08:25 (none) local0.notice cgminer[971]: software_set_address_chain chainID2 asics 0 addrInterval 64

or

Code:
Jun 23 02:23:45 (none) local0.notice cgminer[969]: software_set_address_chain chainID0 asics 3 addrInterval 64
Jun 23 02:23:46 (none) local0.notice cgminer[969]: software_set_address_chain chainID1 asics 3 addrInterval 64
Jun 23 02:23:46 (none) local0.notice cgminer[969]: software_set_address_chain chainID2 asics 0 addrInterval 64

or any combination.

Sometimes however, all three boards would show up working properly for 12 or 24 hours without interruption. Bad situation for trial and error. Resetting /reflashing firmware doesn't help.

Control board failing? PSU failing? Unstable electricity?

Thank you for reading, I could really take an advice.


Random Kernel log at the end:

Code:
"Booting Linux on physical CPU 0x0
Linux version 4.6.0-xilinx-g0387054-dirty (lzq@ubuntu) (gcc version 4.8.3 20140320 (prerelease) (Sourcery CodeBench Lite 2014.05-23) ) #45 SMP PREEMPT Wed Jun 19 15:03:51 CST 2019
CPU: ARMv7 Processor [413fc090] revision 0 (ARMv7), cr=18c5387d
CPU: PIPT / VIPT nonaliasing data cache, VIPT aliasing instruction cache
Machine model: Xilinx Zynq
cma: Reserved 16 MiB at 0x0e000000
Memory policy: Data cache writealloc
On node 0 totalpages: 61440
free_area_init_node: node 0, pgdat c0b3c280, node_mem_map cde10000
  Normal zone: 480 pages used for memmap
  Normal zone: 0 pages reserved
  Normal zone: 61440 pages, LIFO batch:15
percpu: Embedded 12 pages/cpu @cddf1000 s20032 r8192 d20928 u49152
pcpu-alloc: s20032 r8192 d20928 u49152 alloc=12*4096
pcpu-alloc: [0] 0 [0] 1
Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 60960
Kernel command line: mem=240M console=ttyPS0,115200 ramdisk_size=33554432 root=/dev/ram rw earlyprintk
PID hash table entries: 1024 (order: 0, 4096 bytes)
Dentry cache hash table entries: 32768 (order: 5, 131072 bytes)
Inode-cache hash table entries: 16384 (order: 4, 65536 bytes)
Memory: 203808K/245760K available (6348K kernel code, 243K rwdata, 1928K rodata, 1024K init, 233K bss, 25568K reserved, 16384K cma-reserved, 0K highmem)
Virtual kernel memory layout:
    vector  : 0xffff0000 - 0xffff1000   (   4 kB)
    fixmap  : 0xffc00000 - 0xfff00000   (3072 kB)
    vmalloc : 0xcf800000 - 0xff800000   ( 768 MB)
    lowmem  : 0xc0000000 - 0xcf000000   ( 240 MB)
    pkmap   : 0xbfe00000 - 0xc0000000   (   2 MB)
    modules : 0xbf000000 - 0xbfe00000   (  14 MB)
      .text : 0xc0008000 - 0xc0915150   (9269 kB)
      .init : 0xc0a00000 - 0xc0b00000   (1024 kB)
      .data : 0xc0b00000 - 0xc0b3cfe0   ( 244 kB)
       .bss : 0xc0b3cfe0 - 0xc0b77428   ( 234 kB)
Preemptible hierarchical RCU implementation.
Build-time adjustment of leaf fanout to 32.
RCU restricting CPUs from NR_CPUS=4 to nr_cpu_ids=2.
RCU: Adjusting geometry for rcu_fanout_leaf=32, nr_cpu_ids=2
NR_IRQS:16 nr_irqs:16 16
efuse mapped to cf800000
ps7-slcr mapped to cf802000
L2C: platform modifies aux control register: 0x72360000 -> 0x72760000
L2C: DT/platform modifies aux control register: 0x72360000 -> 0x72760000
L2C-310 erratum 769419 enabled
L2C-310 enabling early BRESP for Cortex-A9
L2C-310 full line of zeros enabled for Cortex-A9
L2C-310 ID prefetch enabled, offset 1 lines
L2C-310 dynamic clock gating enabled, standby mode enabled
L2C-310 cache controller enabled, 8 ways, 512 kB
L2C-310: CACHE_ID 0x410000c8, AUX_CTRL 0x76760001
zynq_clock_init: clkc starts at cf802100
Zynq clock init
sched_clock: 64 bits at 333MHz, resolution 3ns, wraps every 4398046511103ns
clocksource: arm_global_timer: mask: 0xffffffffffffffff max_cycles: 0x4ce07af025, max_idle_ns: 440795209040 ns
Switching to timer-based delay loop, resolution 3ns
clocksource: ttc_clocksource: mask: 0xffff max_cycles: 0xffff, max_idle_ns: 537538477 ns
ps7-ttc #0 at cf80a000, irq=18
Console: colour dummy device 80x30
Calibrating delay loop (skipped), value calculated using timer frequency.. 666.66 BogoMIPS (lpj=3333333)
pid_max: default: 32768 minimum: 301
Mount-cache hash table entries: 1024 (order: 0, 4096 bytes)
Mountpoint-cache hash table entries: 1024 (order: 0, 4096 bytes)
CPU: Testing write buffer coherency: ok
CPU0: thread -1, cpu 0, socket 0, mpidr 80000000
Setting up static identity map for 0x100000 - 0x100058
CPU1: failed to boot: -1
Brought up 1 CPUs
SMP: Total of 1 processors activated (666.66 BogoMIPS).
CPU: All CPU(s) started in SVC mode.
devtmpfs: initialized
VFP support v0.3: implementor 41 architecture 3 part 30 variant 9 rev 4
clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 19112604462750000 ns
pinctrl core: initialized pinctrl subsystem
NET: Registered protocol family 16
DMA: preallocated 256 KiB pool for atomic coherent allocations
cpuidle: using governor menu
hw-breakpoint: found 5 (+1 reserved) breakpoint and 1 watchpoint registers.
hw-breakpoint: maximum watchpoint size is 4 bytes.
zynq-ocm f800c000.ps7-ocmc: ZYNQ OCM pool: 256 KiB @ 0xcf880000
vgaarb: loaded
SCSI subsystem initialized
usbcore: registered new interface driver usbfs
usbcore: registered new interface driver hub
usbcore: registered new device driver usb
media: Linux media interface: v0.10
Linux video capture interface: v2.00
pps_core: LinuxPPS API ver. 1 registered
pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti <giometti@linux.it>
PTP clock support registered
EDAC MC: Ver: 3.0.0
Advanced Linux Sound Architecture Driver Initialized.
clocksource: Switched to clocksource arm_global_timer
NET: Registered protocol family 2
TCP established hash table entries: 2048 (order: 1, 8192 bytes)
TCP bind hash table entries: 2048 (order: 2, 16384 bytes)
TCP: Hash tables configured (established 2048 bind 2048)
UDP hash table entries: 256 (order: 1, 8192 bytes)
UDP-Lite hash table entries: 256 (order: 1, 8192 bytes)
NET: Registered protocol family 1
RPC: Registered named UNIX socket transport module.
RPC: Registered udp transport module.
RPC: Registered tcp transport module.
RPC: Registered tcp NFSv4.1 backchannel transport module.
PCI: CLS 0 bytes, default 64
Trying to unpack rootfs image as initramfs...
rootfs image is not initramfs (no cpio magic); looks like an initrd
Freeing initrd memory: 12504K (cceca000 - cdb00000)
hw perfevents: enabled with armv7_cortex_a9 PMU driver, 7 counters available
futex hash table entries: 512 (order: 3, 32768 bytes)
workingset: timestamp_bits=28 max_order=16 bucket_order=0
jffs2: version 2.2. (NAND) (SUMMARY)  © 2001-2006 Red Hat, Inc.
io scheduler noop registered
io scheduler deadline registered
io scheduler cfq registered (default)
dma-pl330 f8003000.ps7-dma: Loaded driver for PL330 DMAC-241330
dma-pl330 f8003000.ps7-dma: DBUFF-128x8bytes Num_Chans-8 Num_Peri-4 Num_Events-16
e0000000.serial: ttyPS0 at MMIO 0xe0000000 (irq = 158, base_baud = 6249999) is a xuartps
console [ttyPS0] enabled
xdevcfg f8007000.ps7-dev-cfg: ioremap 0xf8007000 to cf86e000
[drm] Initialized drm 1.1.0 20060810
brd: module loaded
loop: module loaded
CAN device driver interface
gpiod_set_value: invalid GPIO
libphy: MACB_mii_bus: probed
macb e000b000.ethernet eth0: Cadence GEM rev 0x00020118 at 0xe000b000 irq 31 (00:0a:35:00:00:00)
Broadcom BCM5241 e000b000.etherne:00: attached PHY driver [Broadcom BCM5241] (mii_bus:phy_addr=e000b000.etherne:00, irq=-1)
e1000e: Intel(R) PRO/1000 Network Driver - 3.2.6-k
e1000e: Copyright(c) 1999 - 2015 Intel Corporation.
ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
ehci-pci: EHCI PCI platform driver
usbcore: registered new interface driver usb-storage
mousedev: PS/2 mouse device common for all mice
i2c /dev entries driver
Xilinx Zynq CpuIdle Driver started
sdhci: Secure Digital Host Controller Interface driver
sdhci: Copyright(c) Pierre Ossman
sdhci-pltfm: SDHCI platform and OF driver helper
mmc0: SDHCI controller on e0100000.ps7-sdio [e0100000.ps7-sdio] using ADMA
ledtrig-cpu: registered to indicate activity on CPUs
usbcore: registered new interface driver usbhid
usbhid: USB HID core driver
nand: device found, Manufacturer ID: 0x2c, Chip ID: 0xda
nand: Micron MT29F2G08ABAEAWP
nand: 256 MiB, SLC, erase size: 128 KiB, page size: 2048, OOB size: 64
nand: NAND_ECC_HW
nand: NAND_ECC_HW_SYNDROME
mtd->writesize = 2048
ecc->strength = 1
ecc->size = 2048
mtd->writesize = 2048
chip->ecc_strength_ds = 4
chip->ecc_step_ds = 512
nand: WARNING: pl35x-nand: the ECC used on your system is too weak compared to the one required by the NAND chip
Bad block table found at page 131008, version 0x01
Bad block table found at page 130944, version 0x01
5 ofpart partitions found on MTD device pl35x-nand
Creating 5 MTD partitions on "pl35x-nand":
0x000000000000-0x000002800000 : "BOOT.bin-env-dts-kernel"
0x000002800000-0x000004800000 : "ramfs"
0x000004800000-0x000005000000 : "configs"
0x000005000000-0x000006000000 : "reserve"
0x000006000000-0x000008000000 : "ramfs-bak"
nf_conntrack version 0.5.0 (3635 buckets, 14540 max)
ip_tables: (C) 2000-2006 Netfilter Core Team
NET: Registered protocol family 10
ip6_tables: (C) 2000-2006 Netfilter Core Team
sit: IPv6 over IPv4 tunneling driver
NET: Registered protocol family 17
can: controller area network core (rev 20120528 abi 9)
NET: Registered protocol family 29
can: raw protocol (rev 20120528)
can: broadcast manager protocol (rev 20120528 t)
can: netlink gateway (rev 20130117) max_hops=1
zynq_pm_ioremap: no compatible node found for 'xlnx,zynq-ddrc-a05'
zynq_pm_late_init: Unable to map DDRC IO memory.
Registering SWP/SWPB emulation handler
hctosys: unable to open rtc device (rtc0)
ALSA device list:
  No soundcards found.
RAMDISK: gzip image found at block 0
EXT4-fs (ram0): couldn't mount as ext3 due to feature incompatibilities
EXT4-fs (ram0): mounted filesystem without journal. Opts: (null)
VFS: Mounted root (ext4 filesystem) on device 1:0.
devtmpfs: mounted
Freeing unused kernel memory: 1024K (c0a00000 - c0b00000)
EXT4-fs (ram0): re-mounted. Opts: block_validity,delalloc,barrier,user_xattr
random: dd urandom read with 0 bits of entropy available
IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
macb e000b000.ethernet eth0: unable to generate target frequency: 25000000 Hz
macb e000b000.ethernet eth0: link up (100/Full)
IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
In axi fpga driver!
request_mem_region OK!
AXI fpga dev virtual address is 0xcf9cc000
*base_vir_addr = 0x200ac513
In fpga mem driver!
request_mem_region OK!
fpga mem virtual address is 0xd2000000
random: nonblocking pool is initialized
Jun 23 02:22:39 (none) local0.err cgminer: Miner compile time: Thu Oct 24 11:09:07 CST 2019 type: Antminer Z11
Jun 23 02:22:39 (none) local0.warn cgminer: Started cgminer 4.9.0
Jun 23 02:22:39 (none) local0.notice cgminer[969]: z11 init.
Jun 23 02:22:39 (none) local0.notice cgminer[969]: httpListenThread start ret=0
Jun 23 02:22:39 (none) local0.warn cgminer[969]: bitmain_ZCASH_init
Jun 23 02:22:39 (none) local0.notice cgminer[969]: No 2 Fan find, check again
Jun 23 02:22:41 (none) local0.notice cgminer[969]: fan-num 2 fan-map 3
Jun 23 02:22:41 (none) local0.notice cgminer[969]: check_chain
Jun 23 02:22:41 (none) local0.notice cgminer[969]: Chain 0 existed!
Jun 23 02:22:41 (none) local0.notice cgminer[969]: Chain 1 existed!
Jun 23 02:22:41 (none) local0.notice cgminer[969]: Chain 2 existed!
Jun 23 02:22:41 (none) local0.notice cgminer[969]: chain0 every_chain_reset_PIC16F1704_pic_new
Jun 23 02:22:42 (none) local0.notice cgminer[969]: reset_PIC16F1704_pic_new ok
Jun 23 02:22:42 (none) local0.notice cgminer[969]: chain1 every_chain_reset_PIC16F1704_pic_new
Jun 23 02:22:43 (none) local0.notice cgminer[969]: reset_PIC16F1704_pic_new ok
Jun 23 02:22:43 (none) local0.notice cgminer[969]: chain2 every_chain_reset_PIC16F1704_pic_new
Jun 23 02:22:44 (none) local0.notice cgminer[969]: reset_PIC16F1704_pic_new ok
Jun 23 02:22:45 (none) local0.notice cgminer[969]: chain0 every_chain_jump_from_loader_to_app_PIC16F1704_new
Jun 23 02:22:45 (none) local0.notice cgminer[969]: jump_from_loader_to_app_PIC16F1704_new ok
Jun 23 02:22:45 (none) local0.notice cgminer[969]: chain1 every_chain_jump_from_loader_to_app_PIC16F1704_new
Jun 23 02:22:46 (none) local0.notice cgminer[969]: jump_from_loader_to_app_PIC16F1704_new ok
Jun 23 02:22:46 (none) local0.notice cgminer[969]: chain2 every_chain_jump_from_loader_to_app_PIC16F1704_new
Jun 23 02:22:46 (none) local0.notice cgminer[969]: jump_from_loader_to_app_PIC16F1704_new ok
Jun 23 02:22:48 (none) local0.notice cgminer[969]: reset_all_hash_board_low 4235
Jun 23 02:22:48 (none) local0.notice cgminer[969]: pic_heart_beat_func_new
Jun 23 02:22:48 (none) local0.notice cgminer[969]: every_chain_disable_PIC16F1704_dc_dc_new
Jun 23 02:22:48 (none) local0.notice cgminer[969]: disable_PIC16F1704_dc_dc_new ok
Jun 23 02:22:48 (none) local0.notice cgminer[969]: disable_PIC16F1704_dc_dc_new ok
Jun 23 02:22:49 (none) local0.notice cgminer[969]: disable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:05 (none) local0.notice cgminer[969]: every_chain_enable_PIC16F1704_dc_dc_new
Jun 23 02:23:05 (none) local0.notice cgminer[969]: enable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:05 (none) local0.notice cgminer[969]: enable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:06 (none) local0.notice cgminer[969]: enable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:06 (none) local0.notice cgminer[969]: reset_all_hash_board_high 4241
Jun 23 02:23:06 (none) local0.notice cgminer[969]: tty_init
Jun 23 02:23:06 (none) local0.notice cgminer[969]: tty_init_chain chainid = 0
Jun 23 02:23:06 (none) local0.err cgminer[969]: create rx read thread for chain 0 ok
Jun 23 02:23:06 (none) local0.notice cgminer[969]: Start A New Asic Response.Chain Id:[0]
Jun 23 02:23:06 (none) local0.err cgminer[969]: create tx read thread for chain 0 ok
Jun 23 02:23:06 (none) local0.notice cgminer[969]: open device over
Jun 23 02:23:06 (none) local0.notice cgminer[969]: tty_init_chain chainid = 1
Jun 23 02:23:06 (none) local0.err cgminer[969]: create rx read thread for chain 1 ok
Jun 23 02:23:06 (none) local0.notice cgminer[969]: Start A New Asic Response.Chain Id:[1]
Jun 23 02:23:07 (none) local0.err cgminer[969]: create tx read thread for chain 1 ok
Jun 23 02:23:07 (none) local0.notice cgminer[969]: open device over
Jun 23 02:23:07 (none) local0.notice cgminer[969]: tty_init_chain chainid = 2
Jun 23 02:23:07 (none) local0.err cgminer[969]: create rx read thread for chain 2 ok
Jun 23 02:23:07 (none) local0.notice cgminer[969]: Start A New Asic Response.Chain Id:[2]
Jun 23 02:23:07 (none) local0.err cgminer[969]: create tx read thread for chain 2 ok
Jun 23 02:23:07 (none) local0.notice cgminer[969]: open device over
Jun 23 02:23:09 (none) local0.notice cgminer[969]: start listen on 6060 ...
Jun 23 02:23:11 (none) local0.notice cgminer[969]: calculate_address_interval:addrInterval = 64
Jun 23 02:23:13 (none) local0.notice cgminer[969]: check_every_chain_asic_number
Jun 23 02:23:15 (none) local0.notice cgminer[969]: check_asic_reg: chain J0 has 3 ASIC
Jun 23 02:23:16 (none) local0.notice cgminer[969]: check_asic_reg: chain J1 has 3 ASIC
Jun 23 02:23:18 (none) local0.notice cgminer[969]: check_asic_reg: chain J2 has 0 ASIC
Jun 23 02:23:18 (none) local0.notice cgminer[969]: recheck: chainid 2 exist 1 asicnum 0 realnum 3
Jun 23 02:23:18 (none) local0.warn cgminer[969]: reset_chain chainid 2
Jun 23 02:23:18 (none) local0.notice cgminer[969]: disable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:19 (none) local0.notice cgminer[969]: reset_PIC16F1704_pic_new ok
Jun 23 02:23:21 (none) local0.notice cgminer[969]: jump_from_loader_to_app_PIC16F1704_new ok
Jun 23 02:23:22 (none) local0.notice cgminer[969]: reset_hash_board_low 2
Jun 23 02:23:22 (none) local0.notice cgminer[969]: enable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:23 (none) local0.notice cgminer[969]: reset_hash_board_high 2
Jun 23 02:23:23 (none) local0.notice cgminer[969]: tty_init_chain chainid = 2
Jun 23 02:23:23 (none) local0.err cgminer[969]: create rx read thread for chain 2 ok
Jun 23 02:23:23 (none) local0.notice cgminer[969]: Start A New Asic Response.Chain Id:[2]
Jun 23 02:23:23 (none) local0.err cgminer[969]: create tx read thread for chain 2 ok
Jun 23 02:23:23 (none) local0.notice cgminer[969]: open device over
Jun 23 02:23:26 (none) local0.notice cgminer[969]: check_asic_reg: chain J2 has 0 ASIC
Jun 23 02:23:26 (none) local0.notice cgminer[969]: recheck_asic_num DONE!
Jun 23 02:23:26 (none) local0.notice cgminer[969]: recheck: chainid 2 exist 1 asicnum 0 realnum 3
Jun 23 02:23:26 (none) local0.warn cgminer[969]: reset_chain chainid 2
Jun 23 02:23:26 (none) local0.notice cgminer[969]: disable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:27 (none) local0.notice cgminer[969]: reset_PIC16F1704_pic_new ok
Jun 23 02:23:29 (none) local0.notice cgminer[969]: jump_from_loader_to_app_PIC16F1704_new ok
Jun 23 02:23:30 (none) local0.notice cgminer[969]: reset_hash_board_low 2
Jun 23 02:23:30 (none) local0.notice cgminer[969]: enable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:31 (none) local0.notice cgminer[969]: reset_hash_board_high 2
Jun 23 02:23:31 (none) local0.notice cgminer[969]: tty_init_chain chainid = 2
Jun 23 02:23:31 (none) local0.err cgminer[969]: create rx read thread for chain 2 ok
Jun 23 02:23:31 (none) local0.notice cgminer[969]: Start A New Asic Response.Chain Id:[2]
Jun 23 02:23:31 (none) local0.err cgminer[969]: create tx read thread for chain 2 ok
Jun 23 02:23:31 (none) local0.notice cgminer[969]: open device over
Jun 23 02:23:34 (none) local0.notice cgminer[969]: check_asic_reg: chain J2 has 0 ASIC
Jun 23 02:23:34 (none) local0.notice cgminer[969]: recheck_asic_num DONE!
Jun 23 02:23:34 (none) local0.notice cgminer[969]: recheck: chainid 2 exist 1 asicnum 0 realnum 3
Jun 23 02:23:34 (none) local0.warn cgminer[969]: reset_chain chainid 2
Jun 23 02:23:34 (none) local0.notice cgminer[969]: disable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:35 (none) local0.notice cgminer[969]: reset_PIC16F1704_pic_new ok
Jun 23 02:23:37 (none) local0.notice cgminer[969]: jump_from_loader_to_app_PIC16F1704_new ok
Jun 23 02:23:38 (none) local0.notice cgminer[969]: reset_hash_board_low 2
Jun 23 02:23:38 (none) local0.notice cgminer[969]: enable_PIC16F1704_dc_dc_new ok
Jun 23 02:23:39 (none) local0.notice cgminer[969]: reset_hash_board_high 2
Jun 23 02:23:39 (none) local0.notice cgminer[969]: tty_init_chain chainid = 2
Jun 23 02:23:39 (none) local0.err cgminer[969]: create rx read thread for chain 2 ok
Jun 23 02:23:39 (none) local0.notice cgminer[969]: Start A New Asic Response.Chain Id:[2]
Jun 23 02:23:39 (none) local0.err cgminer[969]: create tx read thread for chain 2 ok
Jun 23 02:23:39 (none) local0.notice cgminer[969]: open device over
Jun 23 02:23:42 (none) local0.notice cgminer[969]: check_asic_reg: chain J2 has 0 ASIC
Jun 23 02:23:42 (none) local0.notice cgminer[969]: recheck_asic_num DONE!
Jun 23 02:23:42 (none) local0.notice cgminer[969]: chainid0 set_frequency_chain freq=681
Jun 23 02:23:42 (none) local0.notice cgminer[969]: chainid1 set_frequency_chain freq=681
Jun 23 02:23:42 (none) local0.notice cgminer[969]: chainid2 set_frequency_chain freq=681
Jun 23 02:23:43 (none) local0.notice cgminer[969]: chain0 pic version 04
Jun 23 02:23:43 (none) local0.notice cgminer[969]: chain1 pic version 04
Jun 23 02:23:44 (none) local0.notice cgminer[969]: chain2 pic version 04
Jun 23 02:23:45 (none) local0.notice cgminer[969]: 1 Goto turbo mode
Jun 23 02:23:45 (none) local0.notice cgminer[969]: set_ticket_mask_chain chainID0 ticket_mask = 0x0000000e
Jun 23 02:23:45 (none) local0.notice cgminer[969]: set_ticket_mask_chain chainID1 ticket_mask = 0x0000000e
Jun 23 02:23:45 (none) local0.notice cgminer[969]: set_ticket_mask_chain chainID2 ticket_mask = 0x0000000e
Jun 23 02:23:45 (none) local0.notice cgminer[969]: software_set_address_chain chainID0 asics 3 addrInterval 64
Jun 23 02:23:46 (none) local0.notice cgminer[969]: software_set_address_chain chainID1 asics 3 addrInterval 64
Jun 23 02:23:46 (none) local0.notice cgminer[969]: software_set_address_chain chainID2 asics 0 addrInterval 64
Jun 23 02:23:46 (none) local0.notice cgminer[969]: enable_read_temperature_from_asic_chain: reg_value=0x07006061 chainid 0
Jun 23 02:23:46 (none) local0.notice cgminer[969]: enable_read_temperature_from_asic_chain: reg_value=0x07006061 chainid 1
Jun 23 02:23:47 (none) local0.notice cgminer[969]: enable_read_temperature_from_asic_chain: reg_value=0x07006061 chainid 2
Jun 23 02:23:47 (none) local0.notice cgminer[969]: dev.timeout = 100000000 us
Jun 23 02:23:47 (none) local0.notice cgminer[969]: bitmain_ZCASH_init: fan_eft : 0  fan_pwm : 30
Jun 23 02:23:50 (none) local0.notice cgminer[969]:  430412one
Jun 23 02:23:50 (none) local0.err cgminer[969]: cgminer time error total_secs = 1592879030.147710 last_total_secs = 1.000000
Jun 23 02:23:55 (none) local0.info cgminer[969]: Pool 0 extranonce set to 0000000000000098
Jun 23 02:23:55 (none) local0.info cgminer[969]: Pool 1 extranonce set to 77b27727e6
Jun 23 02:23:55 (none) local0.info cgminer[969]: Pool 2 extranonce set to 8424230d16
Jun 23 02:24:01 (none) local0.warn cgminer[969]: API running in IP access mode on port 4028 (9)
Jun 23 02:24:03 (none) local0.notice cgminer[969]: All asics arrive target freq"
Unlike traditional banking where clients have only a few account numbers, with Bitcoin people can create an unlimited number of accounts (addresses). This can be used to easily track payments, and it improves anonymity.
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1715642113
Hero Member
*
Offline Offline

Posts: 1715642113

View Profile Personal Message (Offline)

Ignore
1715642113
Reply with quote  #2

1715642113
Report to moderator
1715642113
Hero Member
*
Offline Offline

Posts: 1715642113

View Profile Personal Message (Offline)

Ignore
1715642113
Reply with quote  #2

1715642113
Report to moderator
Exl (OP)
Newbie
*
Offline Offline

Activity: 13
Merit: 0


View Profile
June 26, 2020, 04:10:00 PM
 #2

I will keep updating this thread. Maybe it will help someone at some point or I'll get unexpected advise.

So, I tested each and every hash board separately for a couple of hours. All were good without failures.
Then I started adding one more hashboard. Two boards are hashing for around 36 hours at usual hashrate without any failures.

I'll switch Hashboards and keep on checking. If two boards are hashing reliably but not three I suppose it is the PSU? Or just one of it's power cables. I can hardly imagine how the control board might cause such behaviour.
Exl (OP)
Newbie
*
Offline Offline

Activity: 13
Merit: 0


View Profile
July 04, 2020, 07:47:49 AM
 #3

Please guys, help is greatly appreciated. I'm kind of desperate. To make it short here's a summary of my issue and new things I've checked.

Summary / Problem:
One hashboard is randomly disconnecting while mining, sometimes from boot up, sometimes happening after mining for a while. I can not tell that it's a specific board as each board works fine. Two boards at the same time work fine aswell two. The problem seems to occure when three boards are connected.

A while ago a fan failed - maybe it is common, Idk, it is my first asic. Now, about 2-3 months after the first fan failed, the second one failed aswell. There might be no corellation but I find it weird.

I supposed it could be the PSU and got a multimeter.
-PSU's voltage is 12.4V not being under load. As the second fan has just failed, I can't test in under load.
-Continuity test of PCI cables as well as female connectors at the hashboards seem okay.
-Electrical socket's grounding is good aswell.
-Voltage is stable at 240V out of the socket.

Do you think it could still be a PSU issue? Or rather Control board?
skittang
Newbie
*
Offline Offline

Activity: 2
Merit: 0


View Profile
December 15, 2020, 03:02:11 PM
 #4

Please guys, help is greatly appreciated. I'm kind of desperate. To make it short here's a summary of my issue and new things I've checked.

Summary / Problem:
One hashboard is randomly disconnecting while mining, sometimes from boot up, sometimes happening after mining for a while. I can not tell that it's a specific board as each board works fine. Two boards at the same time work fine aswell two. The problem seems to occure when three boards are connected.

A while ago a fan failed - maybe it is common, Idk, it is my first asic. Now, about 2-3 months after the first fan failed, the second one failed aswell. There might be no corellation but I find it weird.

I supposed it could be the PSU and got a multimeter.
-PSU's voltage is 12.4V not being under load. As the second fan has just failed, I can't test in under load.
-Continuity test of PCI cables as well as female connectors at the hashboards seem okay.
-Electrical socket's grounding is good aswell.
-Voltage is stable at 240V out of the socket.

Do you think it could still be a PSU issue? Or rather Control board?
Hey Exl, going to bump this thread as I too am having relatively same issue and looking to get some solution or at least to find out what is plaguing my Z11s.

My units have been running for 1.5 years with occasional fan and board failures. Most repairs involve replacing burnt NPCs (picture below) by a third-party contact I know, who mentioned a faulty PSU could be causing the issue.
https://www.veswin.com/uploads/product/stockpic/NPC/356438/SM-IS-2103.jpg

I was using an OEM PSU 2400w 12.4v at 240v (which is probably at the end of its lifetime), which I suspect could be causing the NPC components to burn. If you could check the current load voltage on your PSU while under loading from your Z11, you might get some insight as to what's causing your issue. Else try swapping to an original Bitmain APW to see if your issues persist.

I see my units and their hashboards reporting "OOO" chips to "OOX". They eventually report 0 ASIC under Kernel Log and show no data on the Miner Status tab.
xxcsu
Hero Member
*****
Offline Offline

Activity: 1498
Merit: 597


View Profile WWW
December 15, 2020, 04:44:08 PM
 #5

when something like this happening/happened with my ASICs , after i went through the basic troubleshooting , ( checking cables , connection...) the first thing i did , replaced the psu... most of the time its helped ... 

Learn about Merit & new rank requirements , Learn how to use MERIT , make this community better
If you like the answer you got for your question from any member ,
If you find any post useful , informative use the +Merit button.
Pages: [1]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!