Bitcoin Forum
May 26, 2024, 03:58:34 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1]
  Print  
Author Topic: Bminer: Irrecoverable errors from miner in GPU 0: GPU hang detected  (Read 135 times)
jamiemetzger600 (OP)
Newbie
*
Offline Offline

Activity: 3
Merit: 0


View Profile
February 22, 2018, 09:19:28 PM
 #1

Hi,
I am using Ubuntu with (6) 1080ti's running bminer-v5.1.0-c65bb74 and I get this error every so often: Irrecoverable errors from miner in GPU 0: GPU hang detected

I've searched online, and on the Bminer site but I can't find anything to indicate what this issue is. I've checked the riser, card, cables...etc The cards are OC'd so I ill try to back it off a bit.

A reboot always fixes this issue.

Mining ZCL if that makes a difference.

Thanks!
smoolae
Full Member
***
Offline Offline

Activity: 602
Merit: 106



View Profile
February 22, 2018, 09:27:33 PM
 #2

Hi,
I am using Ubuntu with (6) 1080ti's running bminer-v5.1.0-c65bb74 and I get this error every so often: Irrecoverable errors from miner in GPU 0: GPU hang detected

I've searched online, and on the Bminer site but I can't find anything to indicate what this issue is. I've checked the riser, card, cables...etc The cards are OC'd so I ill try to back it off a bit.

A reboot always fixes this issue.

Mining ZCL if that makes a difference.

Thanks!

Yeah, try to decrease OC a little. For me, this "Hang error" has always shown up because of mad OC/undervolting.

jamiemetzger600 (OP)
Newbie
*
Offline Offline

Activity: 3
Merit: 0


View Profile
February 22, 2018, 09:40:02 PM
 #3

Hi,
I am using Ubuntu with (6) 1080ti's running bminer-v5.1.0-c65bb74 and I get this error every so often: Irrecoverable errors from miner in GPU 0: GPU hang detected

I've searched online, and on the Bminer site but I can't find anything to indicate what this issue is. I've checked the riser, card, cables...etc The cards are OC'd so I ill try to back it off a bit.

A reboot always fixes this issue.

Mining ZCL if that makes a difference.

Thanks!

Yeah, try to decrease OC a little. For me, this "Hang error" has always shown up because of mad OC/undervolting.

OK, thanks. Here is my overclock file, would you mind taking a look at it for me?

Quote
#!/bin/bash
# A script for overclocking Nvidia graphics card under Linux

nvidia-smi -pm DISABLED

export GPU_FORCE_64BIT_PTR=0
export GPU_MAX_HEAP_SIZE=100
export GPU_USE_SYNC_OBJECTS=1
export GPU_MAX_ALLOC_PERCENT=100
export GPU_SINGLE_ALLOC_PERCENT=100

#export CUDA_DEVICE_ORDER=PCI_BUS_ID

#X :0 &
#sleep 5
#export DISPLAY=:0
#sleep 3

# 1080

## set power to 200 watts max
nvidia-smi -i 0 -pl 200
nvidia-smi -i 1 -pl 200
nvidia-smi -i 2 -pl 200
nvidia-smi -i 3 -pl 200
nvidia-smi -i 4 -pl 200
nvidia-smi -i 5 -pl 200
# DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings -c :0 -q gpus

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
    -a "[gpu:0]/GPULogoBrightness=1" \
    -a "[gpu:0]/GPUFanControlState=1" \
    -a "[fan:0]/GPUTargetFanSpeed=75" \
    -a "[gpu:0]/GPUGraphicsClockOffset[3]=150" \
    -a "[gpu:0]/GPUMemoryTransferRateOffset[3]=1400" \


# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
    -a "[gpu:1]/GPULogoBrightness=1" \
    -a "[gpu:1]/GPUFanControlState=1" \
    -a "[fan:1]/GPUTargetFanSpeed=75" \
    -a "[gpu:1]/GPUGraphicsClockOffset[3]=150" \
    -a "[gpu:1]/GPUMemoryTransferRateOffset[3]=1400" \

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
    -a "[gpu:2]/GPULogoBrightness=1" \
    -a "[gpu:2]/GPUFanControlState=1" \
    -a "[fan:2]/GPUTargetFanSpeed=75" \
    -a "[gpu:2]/GPUGraphicsClockOffset[3]=150" \
    -a "[gpu:2]/GPUMemoryTransferRateOffset[3]=1400" \

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
     -a "[gpu:3]/GPULogoBrightness=1" \
     -a "[gpu:3]/GPUFanControlState=1" \
     -a "[fan:3]/GPUTargetFanSpeed=75" \
     -a "[gpu:3]/GPUGraphicsClockOffset[3]=150" \
     -a "[gpu:3]/GPUMemoryTransferRateOffset[3]=1400" \

# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
     -a "[gpu:4]/GPULogoBrightness=1" \
     -a "[gpu:4]/GPUFanControlState=1" \
     -a "[fan:4]/GPUTargetFanSpeed=75" \
     -a "[gpu:4]/GPUGraphicsClockOffset[3]=150" \
     -a "[gpu:4]/GPUMemoryTransferRateOffset[3]=1400" \
# 1080ti
DISPLAY=:0 XAUTHORITY=/var/run/lightdm/root/:0 nvidia-settings \
     -a "[gpu:5]/GPULogoBrightness=1" \
     -a "[gpu:5]/GPUFanControlState=1" \
     -a "[fan:5]/GPUTargetFanSpeed=75" \
     -a "[gpu:5]/GPUGraphicsClockOffset[3]=150" \
     -a "[gpu:5]/GPUMemoryTransferRateOffset[3]=1400" \
jamiemetzger600 (OP)
Newbie
*
Offline Offline

Activity: 3
Merit: 0


View Profile
February 22, 2018, 09:50:43 PM
 #4

I changed the fan speed on all the GPU's from 75 -> 80

I will report back
Pages: [1]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!