Bitcoin Forum

Alternate cryptocurrencies => Mining (Altcoins) => Topic started by: rwtrader on November 20, 2017, 06:53:05 PM



Title: Rebooting 2x Per Day
Post by: rwtrader on November 20, 2017, 06:53:05 PM
Can't seem to diagnose why my rig is rebooting itself twice a day. Running a mix of 6 GPUs, all Nvidia, on a Biostar TB250-BTC with GPU temps running well under 70C. Anyone else have an issue like this, and what was the culprit? Thanks!


Title: Re: Rebooting 2x Per Day
Post by: Deathwing on November 20, 2017, 06:53:34 PM
What are the specs? Most importantly, the operating system?


Title: Re: Rebooting 2x Per Day
Post by: Miderian on November 20, 2017, 07:12:27 PM
Hm, interesting. Try running your fans faster, not just a percent or two higher but above 80%, and leave them like that for a few hours. Either way, something is not right.


Title: Re: Rebooting 2x Per Day
Post by: percy_tc on November 20, 2017, 07:22:55 PM
1. Check the latest miner log.
2. Check that all of the extra power plugs are connected to the motherboard.
3. I guess you use 2 separate PSUs. Make sure that one PSU powers all of the motherboard power inputs (ATX, CPU, 2x 4-pin on the mobo) and the second PSU powers only the video cards and risers.





Title: Re: Rebooting 2x Per Day
Post by: rwtrader on November 20, 2017, 07:38:26 PM
Thanks all! Here are some answers.
OS: Win 10 Pro
Fans are all at 100%
Will look at the logs
All power is connected to the mobo, including the PCIe supplemental power
Using a 1000W PSU for the mobo and the 3 demanding GPUs, and a 500W PSU for the less demanding GPUs. They draw about 500W and 350W respectively
Does this help with the diagnosis? Thanks


Title: Re: Rebooting 2x Per Day
Post by: fapar on November 20, 2017, 08:41:39 PM
You wrote that you will look at the miner log. You also need to check the system log: compmgmt.msc -> Event Viewer -> Windows Logs -> System
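Once the System log is open (or exported), it helps to narrow it down to the entries that usually surround an unexpected restart. Below is a minimal Python sketch of that filtering. The event IDs are the ones Windows commonly logs for this (41 from Kernel-Power, 1001 from BugCheck, 6008 from EventLog); the row layout with "time" and "id" keys and the sample rows themselves are made up for illustration and are not from the poster's machine.

```python
# Event IDs commonly seen in the Windows System log around an unexpected restart.
REBOOT_EVENT_IDS = {
    41: "Kernel-Power: system rebooted without shutting down cleanly",
    1001: "BugCheck: computer rebooted from a bugcheck",
    6008: "EventLog: previous shutdown was unexpected",
}


def reboot_events(rows):
    """Return (time, event_id, description) for reboot-related rows.

    `rows` is an iterable of dicts with hypothetical keys "time" and "id",
    for example rows loaded from a log exported out of Event Viewer.
    """
    hits = []
    for row in rows:
        event_id = int(row["id"])
        if event_id in REBOOT_EVENT_IDS:
            hits.append((row["time"], event_id, REBOOT_EVENT_IDS[event_id]))
    return hits


# Made-up sample rows, only to show the shape of the input.
sample_rows = [
    {"time": "placeholder-1", "id": "7036"},
    {"time": "placeholder-2", "id": "41"},
    {"time": "placeholder-3", "id": "1001"},
]

for time, event_id, description in reboot_events(sample_rows):
    print(time, event_id, description)
```

The entries logged just before each hit are usually the interesting ones, since they show what the system was complaining about right before it went down.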


Title: Re: Rebooting 2x Per Day
Post by: wacko on November 20, 2017, 08:54:35 PM
Is this a new rig that you just built? Or was it working fine for some time and started rebooting only recently?
There aren't that many good 500W PSUs out there; most of them are crap, so if you're powering 3 cards with one of those, that might be your problem. Even though the cards only take 350W, that could still be too much for a cheap PSU. Then again, you didn't mention the exact specs, so we're all guessing here.
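For reference, the load figures quoted earlier in the thread (about 500W drawn from the 1000W unit and about 350W from the 500W unit) work out as below. This is only arithmetic on the numbers rwtrader gave; the function name is made up for the example, and a load percentage says nothing about a PSU's quality or age, which is the concern raised above.

```python
def load_fraction(draw_watts, rated_watts):
    """Fraction of a PSU's rated output that is actually being drawn."""
    if rated_watts <= 0:
        raise ValueError("rated_watts must be positive")
    return draw_watts / rated_watts


# (approximate draw, rated output) in watts, as reported in the thread.
psus = {
    "1000W unit": (500, 1000),
    "500W unit": (350, 500),
}

for name, (draw, rated) in psus.items():
    print(f"{name}: {load_fraction(draw, rated):.0%} of rated output")
# 1000W unit: 50% of rated output
# 500W unit: 70% of rated output
```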


Title: Re: Rebooting 2x Per Day
Post by: Lampaster on November 20, 2017, 08:58:03 PM
There can be many reasons for this. Maybe you need to call a specialist. Windows keeps a log of faults, and if there was a restart, information about it must appear in that log. I also want to draw your attention to the fact that 70 degrees is a high temperature. My GPU has never heated above 58 degrees. Perhaps the driver monitors the temperature and is protecting your GPU from overheating.


Title: Re: Rebooting 2x Per Day
Post by: rwtrader on November 20, 2017, 09:59:13 PM
I don't see anything out of the ordinary in the Claymore logs.
Win says
A connected hardware error has occurred
Component: PCIE Root Port
Error Source: Advanced Error Reporting (PCIE)
Bus Device Function  0x0:0x1C:0x6
VendorID:DeviceID: 0x8086:0xA296
Class Code: 0x30400

New rig working fine for a couple of days
500w is an Antec but it is a couple of years old

After I learn and exhaust all my resources, I might have to call someone
All psus are at 100 with temps running from 49-66c. Adding another fan today.

Thanks for the feedback.  I hope the Win log points us to something.


Title: Re: Rebooting 2x Per Day
Post by: wacko on November 20, 2017, 10:07:02 PM
It's hard to decode these, might be the motherboard, but more likely problems with the risers (one or more might be faulty). Try changing them if you have spares.
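Part of that entry can be read mechanically. The sketch below, in Python, pulls the bus/device/function address and the vendor and device IDs out of the text as it was posted. The function name and the one-entry vendor table are made up for the example; the only lookup it relies on is that 0x8086 is Intel's PCI vendor ID. Which physical slot sits behind that port is board-specific and is not stated anywhere in this thread.

```python
import re

# Only the vendor needed for this example; 0x8086 is Intel's PCI vendor ID.
KNOWN_VENDORS = {0x8086: "Intel"}

HEX = r"(0x[0-9A-Fa-f]+)"


def parse_pcie_error(text):
    """Extract the PCIe address and IDs from the Windows hardware error text."""
    bdf = re.search(rf"Bus\W+Device\W+Function\W+{HEX}:{HEX}:{HEX}", text)
    ids = re.search(rf"VendorID:DeviceID:\s*{HEX}:{HEX}", text)
    if bdf is None or ids is None:
        raise ValueError("expected fields not found in text")
    bus, device, function = (int(x, 16) for x in bdf.groups())
    vendor_id, device_id = (int(x, 16) for x in ids.groups())
    return {
        "address": f"{bus:02x}:{device:02x}.{function:x}",
        "vendor": KNOWN_VENDORS.get(vendor_id, "unknown"),
        "vendor_id": vendor_id,
        "device_id": device_id,
    }


# The fields exactly as posted in the thread.
log_text = """Component: PCIE Root Port
Error Source: Advanced Error Reporting (PCIE)
Bus Device Function  0x0:0x1C:0x6
VendorID:DeviceID: 0x8086:0xA296
Class Code: 0x30400"""

print(parse_pcie_error(log_text))
```

So the error is being reported by an Intel root port at address 00:1c.6, that is, a PCIe port on the board itself. A root port reporting errors typically means trouble on the link behind it, which fits the riser suspicion, though the log alone cannot say which riser.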


Title: Re: Rebooting 2x Per Day
Post by: dawidt on November 20, 2017, 10:56:31 PM
check risers


Title: Re: Rebooting 2x Per Day
Post by: cpmcgrat on November 21, 2017, 12:37:16 AM
For me, my rig was rebooting a couple of times per day due to issues with memory sharding/overflow (my RAM was a barebones 4 GB with 16 GB of swap). The error code reported to the error monitor was 0x116. I was able to solve this and increase stability by upgrading the rig to 16 GB of RAM (enough to hold any DAG files or buffer any I/O from the GPUs without having to use the SSD as swap). After doing this, my machine went from rebooting itself 1-2 times a day to being alive and well for the past 2 weeks straight.

If you're running Windows, you can find the event logs at Event Viewer > Windows Logs > System. Below is the error I was seeing that tipped me off.

Quote
The computer has rebooted from a bugcheck.  The bugcheck was: 0x00000116 (0xffffe1842ec0b250, 0xfffff802ff76f7d8, 0xffffffffc000009a, 0x0000000000000004). A dump was saved in: C:\Windows\MEMORY.DMP. Report Id: b139f2b1-3d17-48a3-a61f-493013377152.
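A message like the one quoted above can be split into the bugcheck code and its four parameters with a few lines of Python. This is a sketch, with a made-up function name and a one-entry name table: 0x116 is the bugcheck Microsoft documents as VIDEO_TDR_FAILURE (a display driver timeout/recovery failure), and, assuming the documented parameter layout, the third parameter carries the status code of the last failed operation.

```python
import re

# Only the code that appears in this thread.
BUGCHECK_NAMES = {0x116: "VIDEO_TDR_FAILURE"}


def parse_bugcheck(message):
    """Return (code, [param1..param4]) from a 'rebooted from a bugcheck' message."""
    match = re.search(r"bugcheck was:\s*(0x[0-9A-Fa-f]+)\s*\(([^)]*)\)", message)
    if match is None:
        raise ValueError("no bugcheck found in message")
    code = int(match.group(1), 16)
    params = [int(p.strip(), 16) for p in match.group(2).split(",")]
    return code, params


message = (
    "The computer has rebooted from a bugcheck.  The bugcheck was: 0x00000116 "
    "(0xffffe1842ec0b250, 0xfffff802ff76f7d8, 0xffffffffc000009a, "
    "0x0000000000000004). A dump was saved in: C:\\Windows\\MEMORY.DMP."
)

code, params = parse_bugcheck(message)
print(hex(code), BUGCHECK_NAMES.get(code, "unknown"))
# The low 32 bits of the third parameter are the status code.
print(hex(params[2] & 0xFFFFFFFF))
```

Here the status comes out as 0xc000009a, which is STATUS_INSUFFICIENT_RESOURCES. That is at least consistent with the fix described in this post, where adding RAM stopped the reboots.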


Title: Re: Rebooting 2x Per Day
Post by: fanatic26 on November 21, 2017, 12:42:19 AM
Dump Windows and mine on Linux if you want reliability and stability.


Title: Re: Rebooting 2x Per Day
Post by: dagarair on November 21, 2017, 12:47:14 AM
Run 1 card for 24 hours.
No reboot?
Then run 2 cards for 24 hours,
and so on.
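The procedure above is an incremental isolation test, and it can be written down as a small Python sketch. The `is_stable` callback is hypothetical: it stands in for "ran this set of cards for 24 hours without a reboot", which in practice is a manual check, not something a script can decide. The card names and the simulated fault are made up for the example.

```python
def first_unstable_card(cards, is_stable):
    """Add cards one at a time and return the first card whose addition
    makes the rig unstable, or None if every configuration is stable.

    `is_stable(active_cards)` is a hypothetical callback standing in for
    a 24-hour soak test of the rig with exactly those cards installed.
    """
    active = []
    for card in cards:
        active.append(card)
        if not is_stable(list(active)):
            return card
    return None


# Simulated example: pretend the fourth card (or its riser) is the bad one.
cards = ["gpu1", "gpu2", "gpu3", "gpu4", "gpu5", "gpu6"]


def simulated_soak_test(active_cards):
    return "gpu4" not in active_cards


print(first_unstable_card(cards, simulated_soak_test))  # gpu4
```

One caveat with this approach: a reboot after adding a card points at that card, its riser, or its cable, but it can also mean the total power draw has crossed what a PSU can handle, so the result still needs a swap test to confirm.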


Title: Re: Rebooting 2x Per Day
Post by: cpmcgrat on November 21, 2017, 12:55:18 AM
Quote
dump Windows and mine on linux if you want reliability and stability

I mine on both. For my Nvidia cards I prefer Windows, since it is incredibly difficult to overclock/undervolt them on Linux systems.


Title: Re: Rebooting 2x Per Day
Post by: rwtrader on November 21, 2017, 01:41:44 AM
I will try swapping risers as I bring the cards on one by one.
If I were more comfortable with Linux I would give it a try.
I thought about the RAM, so I brought it up to 8 GB. No effect...


Title: Re: Rebooting 2x Per Day
Post by: wacko on November 21, 2017, 01:58:25 AM
Looking at your logs, I would say it's more likely to be a hardware problem, so adding more RAM or switching to Linux is not going to help. For now the main suspects are the board (less likely) and the risers (more likely).


Title: Re: Rebooting 2x Per Day
Post by: rwtrader on November 26, 2017, 06:48:03 PM
Ok. Seemed to be working fine, no reboots for 3 days. I noticed the extension was getting warm, so I shut it down to use a larger gauge extension and now it's worse! Rebooting every few hours. Any clue why this would be? Thanks again.


Title: Re: Rebooting 2x Per Day
Post by: wacko on November 26, 2017, 06:52:21 PM
Did you try to do what was suggested to you? Changing risers at least?


Title: Re: Rebooting 2x Per Day
Post by: rwtrader on November 27, 2017, 02:28:30 PM
Absolutely! That's how I got a good 3 days before I manually shut it down.


Title: Re: Rebooting 2x Per Day
Post by: wacko on November 27, 2017, 02:44:47 PM
So you changed the risers and then the rig worked for 3 days without reboots. Then you changed the extension and it's rebooting every few hours? What exactly is the "extension"? Is that the power cable from the PSU to the outlet? Can you put the previous one back and see whether the rig is stable with that one, just to make sure that it is indeed the cause of these reboots and not just a coincidence?


Title: Re: Rebooting 2x Per Day
Post by: rwtrader on November 27, 2017, 03:09:46 PM
Yes, it is in between the power strip and the wall. However unlikely, I tried that and still have the problem. I put it back and the frequency of reboots is gradually going down. I am down to 3-4 per day... Thanks


Title: Re: Rebooting 2x Per Day
Post by: Tidsdilatation on November 27, 2017, 03:40:51 PM
First of all, I would switch to a Linux-based OS. Then I would check the heat on ALL components, not only the GPUs. After that I would check the hardware: risers first, then RAM. Reboots could literally be anything.


Title: Re: Rebooting 2x Per Day
Post by: wacko on November 27, 2017, 03:43:35 PM
If you want any meaningful advice, you'll need to at least properly describe your setup. After all this time you haven't even said what GPUs you run. :-\ There are plenty of reasons for any computer to reboot by itself, and even more so for a mining rig, both hardware and software. Unless it's something very obvious, it's unlikely that someone will figure out what's going on with your rig when you haven't spent a few minutes actually describing it: cards, risers, PSUs, how exactly it's all connected together, etc. All we know is that it's a tb250btc and you're using 1000w and 500w PSUs, one of which is an old Antec. That's like a riddle: you post only a few clues and then wait for someone to solve it by guessing everything else. Except obviously no one here cares whether your rig will keep rebooting or not. :) You're the only party interested in solving this, yet you don't even want to bother with providing all the details.

The usual suspects in problems like this are the risers, the power (both how it is supplied and with what PSUs), the GPUs themselves and the system (motherboard/cpu/ram). You start by swapping the risers. If it doesn't help then you check the power, install another PSU, check whether all the connections are alright (so there's no nonsense like 3 risers sitting on a single cable from the PSU etc). Didn't help? Change the board. If it doesn't help either then test all the GPUs one by one. That's how it's done in general, it's just with more experience you're more likely to find the culprit faster, but the process remains more or less the same.
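The wiring check mentioned above (no "3 risers sitting on a single cable") is easy to do on paper, and the Python sketch below just makes it explicit. Everything here is hypothetical: the riser and cable names, the wiring map, and the limit of 2 risers per cable are illustrative assumptions, since the post only says that 3 on one cable is too many.

```python
from collections import defaultdict


def overloaded_cables(riser_to_cable, max_per_cable=2):
    """Return {cable: [risers]} for every PSU cable feeding too many risers.

    `max_per_cable` is an assumed limit for illustration, not a figure
    taken from the thread or from any PSU specification.
    """
    by_cable = defaultdict(list)
    for riser, cable in riser_to_cable.items():
        by_cable[cable].append(riser)
    return {
        cable: risers
        for cable, risers in by_cable.items()
        if len(risers) > max_per_cable
    }


# Made-up wiring map: which PSU cable powers which riser.
wiring = {
    "riser1": "psu1-cable-a",
    "riser2": "psu1-cable-a",
    "riser3": "psu1-cable-a",
    "riser4": "psu2-cable-a",
    "riser5": "psu2-cable-b",
    "riser6": "psu2-cable-b",
}

print(overloaded_cables(wiring))
```

Writing the map down like this also produces the kind of wiring description that was being asked for earlier in the thread.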


Title: Re: Rebooting 2x Per Day
Post by: RedUkulele on November 27, 2017, 03:52:05 PM
Try running your fans faster, not just a percent or two higher but above 80%, and leave them like that for a few hours. Either way, something is not right.


Title: Re: Rebooting 2x Per Day
Post by: kapipindot on November 27, 2017, 04:06:46 PM
Usually a PC will reboot if there is faulty RAM, an overheating processor, an overheating GPU, or a faulty power supply. You may try mining with 3 cards first and then adding 1 more every day until a reboot occurs. You should always check the GPU and CPU temps; maybe your setup or the place where you keep your rig has poor ventilation. As for the risers, that is one hell of a troubleshooting job. If possible, you may change all of the risers.


Title: Re: Rebooting 2x Per Day
Post by: cryptocoinfarmer on November 27, 2017, 04:10:52 PM
Can you answer a few questions to help determine your problem?

1.) What kind of Nvidia cards are they?
2.) Are the cards overclocked or not?
3.) What is the riser model? Specify it or just add a photo.
4.) How is everything wired together? Describe it or add a photo.
5.) What OS do you use?
6.) Did you try swapping the risers?
7.) Did you try connecting only one card to the PCI-e x16 slot and running the mining process?
8.) Did the problem start as soon as the rig was built, or did the rig work for some time and then suddenly start restarting?


Title: Re: Rebooting 2x Per Day
Post by: rwtrader on November 27, 2017, 07:15:03 PM
I understand, especially about adequately describing rig. Let me put something together I can post with each question. Your time is much too valuable to be running down my problem piece by piece. Give me a day. I really do appreciate your help and this has been a great community!


Title: Re: Rebooting 2x Per Day
Post by: rwtrader on November 30, 2017, 11:35:28 PM
Finally figured out problem (I think). Pulled all cables and one of the pins pulled out of a connector. Traded cable and glued back side of all connectors. Up for a day so far. One more day and I start tweeting hash rates. Thanks for all your help and I now have a system config I can post!

https://docs.google.com/document/d/1-j7dvV2r-WhJ1_iG9posVMKZ8WI4B7iyC_SNZWviHzA