Title: Bminer: Irrecoverable errors from miner in GPU 0: GPU hang detected
Post by: jamiemetzger600 on February 22, 2018, 09:19:28 PM

Hi,

I am using Ubuntu with (6) 1080ti's running bminer-v5.1.0-c65bb74, and I get this error every so often:

Irrecoverable errors from miner in GPU 0: GPU hang detected

I've searched online and on the Bminer site, but I can't find anything that explains what this issue is. I've checked the riser, card, cables, etc. The cards are OC'd, so I will try to back it off a bit. A reboot always fixes this issue. Mining ZCL, if that makes a difference.

Thanks!

Title: Re: Bminer: Irrecoverable errors from miner in GPU 0: GPU hang detected
Post by: smoolae on February 22, 2018, 09:27:33 PM

Yeah, try to decrease OC a little. For me, this "Hang error" has always shown up because of mad OC/undervolting.

Title: Re: Bminer: Irrecoverable errors from miner in GPU 0: GPU hang detected
Post by: jamiemetzger600 on February 22, 2018, 09:40:02 PM

OK, thanks. Here is my overclock file, would you mind taking a look at it for me?
Quote
#!/bin/bash
# A script for overclocking Nvidia graphics card under Linux
nvidia-smi -pm DISABLED
export GPU_FORCE_64BIT_PTR=0
export GPU_MAX_HEAP_SIZE=100
export GPU_USE_SYNC_OBJECTS=1
export GPU_MAX_ALLOC_PERCENT=100
export GPU_SINGLE_ALLOC_PERCENT=100
#export CUDA_DEVICE_ORDER=PCI_BUS_ID
#X :0 &
#sleep 5
#export DISPLAY=:0
#sleep 3

# 1080
## set power to 200 watts max
nvidia-smi -i 0 -pl 200
nvidia-smi -i 1 -pl 200
nvidia-smi -i 2 -pl 200
nvidia-smi -i 3 -pl 200
nvidia-smi -i 4 -pl 200
nvidia-smi -i 5 -pl 200

# DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings -c :0 -q gpus

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
-a "[gpu:0]/GPULogoBrightness=1" \
-a "[gpu:0]/GPUFanControlState=1" \
-a "[fan:0]/GPUTargetFanSpeed=75" \
-a "[gpu:0]/GPUGraphicsClockOffset[3]=150" \
-a "[gpu:0]/GPUMemoryTransferRateOffset[3]=1400"

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
-a "[gpu:1]/GPULogoBrightness=1" \
-a "[gpu:1]/GPUFanControlState=1" \
-a "[fan:1]/GPUTargetFanSpeed=75" \
-a "[gpu:1]/GPUGraphicsClockOffset[3]=150" \
-a "[gpu:1]/GPUMemoryTransferRateOffset[3]=1400"

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
-a "[gpu:2]/GPULogoBrightness=1" \
-a "[gpu:2]/GPUFanControlState=1" \
-a "[fan:2]/GPUTargetFanSpeed=75" \
-a "[gpu:2]/GPUGraphicsClockOffset[3]=150" \
-a "[gpu:2]/GPUMemoryTransferRateOffset[3]=1400"

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
-a "[gpu:3]/GPULogoBrightness=1" \
-a "[gpu:3]/GPUFanControlState=1" \
-a "[fan:3]/GPUTargetFanSpeed=75" \
-a "[gpu:3]/GPUGraphicsClockOffset[3]=150" \
-a "[gpu:3]/GPUMemoryTransferRateOffset[3]=1400"

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
-a "[gpu:4]/GPULogoBrightness=1" \
-a "[gpu:4]/GPUFanControlState=1" \
-a "[fan:4]/GPUTargetFanSpeed=75" \
-a "[gpu:4]/GPUGraphicsClockOffset[3]=150" \
-a "[gpu:4]/GPUMemoryTransferRateOffset[3]=1400"

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
-a "[gpu:5]/GPULogoBrightness=1" \
-a "[gpu:5]/GPUFanControlState=1" \
-a "[fan:5]/GPUTargetFanSpeed=75" \
-a "[gpu:5]/GPUGraphicsClockOffset[3]=150" \
-a "[gpu:5]/GPUMemoryTransferRateOffset[3]=1400"

Title: Re: Bminer: Irrecoverable errors from miner in GPU 0: GPU hang detected
Post by: jamiemetzger600 on February 22, 2018, 09:50:43 PM

I changed the fan speed on all the GPUs from 75 -> 80.
I will report back.
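
A note on the overclock file posted above: the six per-GPU blocks differ only in the GPU index, so when backing the overclock off it may be easier to keep the values in one place and loop over the cards. Below is a minimal sketch of that idea, not a tested configuration. The reduced offsets (100 core, 1000 memory) are placeholder values picked only for illustration, not known-stable settings for these cards. The variable names and the run helper are hypothetical, and the sketch assumes one fan per GPU with matching indices, as in the original file. It leaves out the GPULogoBrightness setting, and by default it only prints the commands; set APPLY=1 to actually run them (nvidia-smi -pl normally needs root).

```shell
#!/bin/bash
# Sketch: apply one set of overclock values to every GPU in a loop.
# All default values below are illustrative assumptions, not recommendations.

GPU_COUNT=${GPU_COUNT:-6}        # six cards, as in the original post
CORE_OFFSET=${CORE_OFFSET:-100}  # assumed value, backed off from 150
MEM_OFFSET=${MEM_OFFSET:-1000}   # assumed value, backed off from 1400
FAN_SPEED=${FAN_SPEED:-80}       # fan speed mentioned in the last post
POWER_LIMIT=${POWER_LIMIT:-200}  # watts, as in the original file
APPLY=${APPLY:-0}                # 0 = only print commands, 1 = run them

# Hypothetical helper: print the command, and run it only when APPLY=1.
run() {
    echo "$*"
    if [ "$APPLY" = "1" ]; then
        "$@"
    fi
}

apply_overclock() {
    i=0
    while [ "$i" -lt "$GPU_COUNT" ]; do
        # Power limit for this card.
        run nvidia-smi -i "$i" -pl "$POWER_LIMIT"
        # Fan and clock offsets, same attributes as the original file.
        run env DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
            -a "[gpu:$i]/GPUFanControlState=1" \
            -a "[fan:$i]/GPUTargetFanSpeed=$FAN_SPEED" \
            -a "[gpu:$i]/GPUGraphicsClockOffset[3]=$CORE_OFFSET" \
            -a "[gpu:$i]/GPUMemoryTransferRateOffset[3]=$MEM_OFFSET"
        i=$((i + 1))
    done
}

apply_overclock
```

Running it as-is lists the commands it would issue, so the values can be checked before anything is applied. Lowering CORE_OFFSET and MEM_OFFSET in small steps and watching whether the hang on GPU 0 returns is one way to follow the advice given earlier in the thread.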