Bitcoin Forum
Topic: Nvidia GPU Rig Goes Down Periodically - ASUS H170 Pro Gaming
MrN1ce9uy (OP)
May 18, 2018, 02:54:00 AM
 #1

The problem is that for the past 2 days my rig has stopped mining completely after I left for work. I'm running EWBF Miner 0.3.4b and it reports "GPU 0,1,3 stopped working". It requires a restart of the system to work again. I've reinstalled drivers several times.

Also, before this I was using Nicehash and only 1 GPU would stop working, pretty much every day at some point. That's why I switched to mining ZEC.

It doesn't really happen with only 4x GPUs, but with 5x it seems to have issues and restarting or reinstalling the driver is a temporary fix.

My idea is the motherboard has some temp monitoring feature that doesn't work properly with so many GPUs. I have a 7x AMD mining rig on a GA-Z270XP-SLI and it will run non-stop without issue.

Think I should buy another GA-Z270XP-SLI??

Or any other suggestions?

Asus H170 Pro Gaming
4GB RAM
G3900
2x EVGA GTX 1080 Ti SC2
2x Zotac GTX 1070 Ti Mini
1x EVGA GTX 1070 SC
1x EVGA 850 B2
1x EVGA 750 B2
24-Pin Dual Power Supply Adapter Cable For PC ATX Motherboard
Windows 10
Latest Nvidia Driver (also previous drivers did same thing)
leonix007
May 18, 2018, 03:14:40 AM
 #2

Have you tried lowering the clocks a bit?

How are your GPU temps?

Did you try DSTM miner?
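
To answer the temps question with logged numbers rather than a glance at the miner window, here is a minimal Python sketch that polls `nvidia-smi` for per-GPU temperature and power draw. It assumes `nvidia-smi` is on the PATH and that the driver reports the `temperature.gpu` and `power.draw` query fields for these cards; the helper names `parse_gpu_csv` and `read_gpus` are made up for this example.

```python
import subprocess

# Fields requested from nvidia-smi, in the order they are parsed below.
QUERY = "index,name,temperature.gpu,power.draw"


def to_float(value):
    """Return a float, or None when nvidia-smi reports something like [N/A]."""
    try:
        return float(value)
    except ValueError:
        return None


def parse_gpu_csv(text):
    """Parse `--format=csv,noheader,nounits` output into a list of dicts.

    Assumes GPU names contain no commas.
    """
    rows = []
    for line in text.strip().splitlines():
        index, name, temp, power = [field.strip() for field in line.split(",")]
        rows.append({
            "index": int(index),
            "name": name,
            "temp_c": to_float(temp),
            "power_w": to_float(power),
        })
    return rows


def read_gpus():
    """Run nvidia-smi once and return one dict per GPU."""
    result = subprocess.run(
        ["nvidia-smi", f"--query-gpu={QUERY}", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_gpu_csv(result.stdout)
```

Calling `read_gpus()` every minute or so and appending the result to a file would leave a record of the last readings before the GPUs drop out.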
MrN1ce9uy (OP)
May 18, 2018, 03:49:02 AM
 #3

Temps are good. I just changed some settings in BIOS and hopefully it fixes the issue.

*I just remembered a setting in the BIOS, from when I first set it up, that said it could cause issues when monitoring temps, but I can't remember exactly what or where it is.
kiki1
May 18, 2018, 07:52:48 AM
 #4

Double-check the PSUs. I had a recurring crash on an Nvidia rig, changed risers and tried countless fixes, and finally realised my problem was the cable feeding power to the 1080 Ti.
The 2x 6+2-pin cable from my Corsair 850 RMX couldn't feed the 1080 Ti properly, and the rig crashed every few hours or days. I plugged the same card into another 2x 6+2-pin cable from an EVGA SuperNOVA 750 G3 PSU, and the rig has been hashing non-stop for 30+ days now. Good luck!
philipma1957
May 18, 2018, 12:25:49 PM
 #5


A PSU issue is likely.

You have 1600 watts of bronze-rated PSUs.

2x 1080 Ti should be at 180 watts each (70%): 360 watts
2x 1070 Ti should be at 105 watts each (70%): 210 watts
1x 1070 should be at 105 watts (70%): 105 watts

That is 675 watts; add 75 more and you get 750 watts.


That means the PSU below is what you should be using:



https://www.corsair.com/us/en/Categories/Products/Certified-Refurbished/Power-Supplies/RMx-Series%E2%84%A2-RM1000x-%E2%80%94-1000-Watt-80-PLUS%C2%AE-Gold-Certified-Fully-Modular-PSU-%28NA%29-%28Refurbished%29/p/CP-9020094-NA/RF

if you want more overhead use this one

https://www.corsair.com/us/en/Categories/Products/Certified-Refurbished/Power-Supplies/HXi-Series%E2%84%A2-HX1200i-High-Performance-ATX-Power-Supply-%E2%80%94-1200-Watt-80-Plus%C2%AE-PLATINUM-Certified-PSU-%28Refurbished%29/p/CP-9020070-NA/RF
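
The wattage estimate in this post can be re-run with a few lines of Python, which makes it easy to swap in different per-card numbers later. This is only a sketch of the arithmetic as quoted: the per-card watts and the extra 75 W come from the post, the single card is treated as a GTX 1070 per the hardware list in the first post, and the variable names are made up here.

```python
# Per-card targets quoted in the post (watts at roughly a 70% power limit).
CARDS = [
    ("GTX 1080 Ti", 2, 180),
    ("GTX 1070 Ti", 2, 105),
    ("GTX 1070", 1, 105),
]
EXTRA_W = 75                # the "add 75 more" from the post
INSTALLED_PSU_W = 850 + 750  # the two bronze units in the rig


def gpu_total(cards):
    """Sum count * watts over all card entries."""
    return sum(count * watts for _, count, watts in cards)


gpu_w = gpu_total(CARDS)
total_w = gpu_w + EXTRA_W

print(gpu_w)    # 675
print(total_w)  # 750
print(round(total_w / INSTALLED_PSU_W, 2))  # 0.47, share of installed capacity
```

These are estimated DC loads, not wall readings, and they say nothing about how the load is divided between the two supplies or across individual cables.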

MrN1ce9uy (OP)
May 18, 2018, 04:42:32 PM
 #6

Yeah I was wondering if PSU is the cause.

I've got it set up like this:

850W PSU:
System
2x 1080 Ti

750W PSU:
2x 1070 Ti mini
1x 1070

I have an energy meter; I'll see how much each is pulling later this evening.

I was wanting to buy that HX1200i because they are in stock and on sale. Nothing was in stock when I bought the bronze units. Everything was overpriced. The efficiency isn't that big of a deal, like $30/year or something like that. But if it's causing instability then it's worth the upgrade IMO.
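
Until the meter readings are in, the split listed above can be roughed out per PSU. The sketch below is an estimate under assumptions, not a measurement: the per-card watts are the figures quoted in post #5, and the 75 W system draw is borrowed from the "add 75 more" in that post. The names `CARD_W`, `SYSTEM_W`, `PSUS` and `psu_load` are made up for this example.

```python
# Per-card estimates quoted in post #5 (watts at roughly a 70% power limit).
CARD_W = {"1080 Ti": 180, "1070 Ti": 105, "1070": 105}
SYSTEM_W = 75  # assumption: motherboard, CPU, risers and so on

# Which loads hang off which supply, as described in this post.
PSUS = {
    "850W": {"capacity": 850, "system": True, "cards": {"1080 Ti": 2}},
    "750W": {"capacity": 750, "system": False, "cards": {"1070 Ti": 2, "1070": 1}},
}


def psu_load(psu):
    """Estimated DC load on one PSU in watts."""
    watts = sum(CARD_W[name] * count for name, count in psu["cards"].items())
    if psu["system"]:
        watts += SYSTEM_W
    return watts


for label, psu in PSUS.items():
    load = psu_load(psu)
    print(label, load, f"{load / psu['capacity']:.0%}")
# 850W 435 51%
# 750W 315 42%
```

A wall meter will read somewhat higher than these numbers because of PSU conversion losses, and the totals do not capture short power spikes or the limit of any single cable.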
madnessteat
May 18, 2018, 07:35:31 PM
 #7

Your power supply should be enough. I read somewhere that the latest NVIDIA drivers are unstable. Try installing driver 23.21.13.9077, but first remove the old one using Display Driver Uninstaller (DDU). Another possibility is that your overclock is too aggressive.

fanatic26
May 18, 2018, 10:07:09 PM
 #8

Run Linux for a stable mining rig. None of this DDU and which-driver-to-run nonsense.

Stop buying industrial miners, running them at home, and then complaining about the noise.
ciciteng
May 18, 2018, 10:13:10 PM
 #9

If you are running overclocked or with a modded BIOS, try resetting back to the default (factory) config.
Test it for several hours or days and observe how it behaves.
There's also no harm in checking your power supply; it might be starting to fail and giving your rig inconsistent amps and voltages.
MrN1ce9uy (OP)
May 20, 2018, 02:33:18 PM
Last edit: May 20, 2018, 04:02:05 PM by MrN1ce9uy
 #10

Okay, so it happened again since I changed some BIOS settings. This time when I hit restart in Windows, a BSOD occurred with the following error code:

Video_TDR_Failure (nvlddmkm.sys) on Windows 10
https://www.drivereasy.com/knowledge/nvlddmkm-sys-video_tdr_failure-blue-screen-error-solved-on-windows-10/

Do note that I've clean reinstalled the drivers numerous times yet it continues to happen.

I just now reset GPU clocks to factory defaults and I'll see if that keeps it from happening.

I had them set at:
GTX 1080 Ti +50core +200mem
GTX 1070 Ti +100core +400mem
GTX 1070 +100core +400mem
all at 75% power limit

I don't want to run them at 100% power, so I only set clocks to default.

I'll also be running checkdisk and memtest to see if that's an issue.

*Checkdisk and memtest were good.