Bitcoin Forum
May 01, 2024, 12:55:22 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
  Home Help Search Login Register More  
  Show Posts
Pages: [1]
1  Alternate cryptocurrencies / Mining (Altcoins) / GPUs stop mining in Linux on: May 14, 2018, 10:55:16 PM
After constantly fighting with windows for months. I decided to make the leap to Linux for my mining rig thinking it would be more reliable. Well.... not really.



When I start mining, everything is fine. Temps never get above 73 on hottest card during mid-day. It appears that after about 30+ mins or so GPUs start disappearing from the miner terminal window and stop hashing (they still show in Nvidia x-server). Some GPUs can take 10+hours to disappear. After about 24 hours of mining, only 4 are still working (confirmed with pool). Only way to get them back is to reboot the rig. I have tried changing risers and I still have this issue. Any ideas?




Currently I am running,

 Ubuntu 16.04
B250 Mining Expert MOBO
DSTM mining ZelCash
7 GPUs
1x Gigabyte G1 GTX1070
2x Gigabyte Mini GTX1070
1x Asus Strix GTX1070
2x Zotac Mini GTX1070ti
1x Gigabyte GTX1080


All card undervolted to 150W or less. No OC no anything.

Power:
1200w PSU ---> 3x GPU + MOBO + HDD, etc.    ~~500W used
1200w Server PSU ---> 4x GPU                         ~~600W used
2  Alternate cryptocurrencies / Mining (Altcoins) / Re: Stability Issues on: January 30, 2018, 10:45:48 PM
UPDATE:

Since my last update I have changed my miner to neoscrypt-hsrminer, changed video TDRDelay to 10 seconds, moved all cards to new risers and moved 3 of the cards to my new DPS-1200 PSU. I was still having issues with windows freezing. Absolutely no change.


but wait, there's more.

After uninstalling my nvidia drivers and reinstalling using Windows 7 compatibility mode, the rig is finally showing improvement. I can actually leave it on all day with little to no human intervention. I still have a babysitter script that restarts hsrminer if the application closes (just in case) and a 3-hour reboot cycle on my wifi outlet. Hopefully I can ween the rig off of all my automated scripts and timers a little bit.

3  Alternate cryptocurrencies / Mining (Altcoins) / Re: Stability Issues on: January 23, 2018, 05:19:30 PM
It appears the system is now only soft crashing ccminer every so often due to non responding video drivers. Vid Driver updated to 390.65.
Fingers crossed...

Otherwise I will have to try pulling the newest GPU out of the mix to see what happens.

4  Alternate cryptocurrencies / Mining (Altcoins) / Re: Stability Issues on: January 22, 2018, 02:10:28 AM
I am still having issues with crashing. I tested skunkhash and it appeared to last a bit longer than Neocrypt but neither runs for more than 2.5 hours reliably. I have since swapped all cards to new risers from a different brand in hopes I had a bad riser in the mix.
5  Alternate cryptocurrencies / Mining (Altcoins) / Re: Stability Issues on: January 20, 2018, 01:13:55 AM
you've got a 98gb virtual memory, and your wondering about stability, maybe you need to look up what a pagefile does on a ssd or hard drive. any IT person knows a large page file affects performance really badly, to the point it can affect stability. theres NO need for anything larger than 16gb virtual memory PERIOD. if you need to increase it that means the system is using too much somewhere and you need to figure out what and fix it instead of just adding too the problem

98gb is a little much, but I get out of memory errors in pretty much any miner instantly on all of my rigs (6gpu vega 56, 6gpu 1070ti's, 6gpu 1080ti's), if I don't have my virtual memory set at 50gb.  I'm running 8gb of ddr4 in each.  The OS runs fine without that big of a page file, until I start the miner.

When I check to see what's using so much memory, it's the miner.  I don't know how I can get away with a 16gb page file, unless I upgrade them all to 32gb ram.

 

After running Virtual @20 GB im getting memory exhaustion errors again. I'll try your magic number and see how that goes. I have a feeling this is going to turn into a balance act by the time i connect all 19 GPUs to this thing.
6  Alternate cryptocurrencies / Mining (Altcoins) / Re: Stability Issues on: January 19, 2018, 09:13:34 PM


you've got a 98gb virtual memory, and your wondering about stability, maybe you need to look up what a pagefile does on a ssd or hard drive. any IT person knows a large page file affects performance really badly, to the point it can affect stability. theres NO need for anything larger than 16gb virtual memory PERIOD. if you need to increase it that means the system is using too much somewhere and you need to figure out what and fix it instead of just adding too the problem

Maybe I took some bad advice. From what I heard around here is, the sum of pagefile+PYS RAM>sum of gpu RAM. Changing this max allotment actually helped windows stability dramatically. The rig actually stays on now. All errors pertaining to memory have stopped. The miner application crash is the only thing happening now and it appears to be Nvidia driver related from what the event viewer is telling me.

What the heck, I'll try turning pagefile down too.
7  Alternate cryptocurrencies / Mining (Altcoins) / Re: Stability Issues on: January 19, 2018, 07:18:55 PM
I have stopped all overclock and lowered TDP to 80% 

From what I can tell Windows appears more stable but, now ccminer keeps crashing (wtf?). I have switched to ccminer KlausT Cuda 9 to see if this clears things up.

You take one down issue and another pops up. Sheesh.
8  Alternate cryptocurrencies / Mining (Altcoins) / Re: Stability Issues on: January 19, 2018, 03:27:02 AM

Edit:
I just noticed you said the WHOLE rig locks up? damn, maybe a short or power issue?



Ya, I am lucky if the screen displays the cursor when it goes down. Concerning thing is that the card fans keep going at mining speed. 

I think the power issue could be from a bad riser or maybe I am pushing the PSU too far. Another PSU is already on the way.
9  Alternate cryptocurrencies / Mining (Altcoins) / Re: Stability Issues on: January 19, 2018, 03:19:00 AM
Besides the Virtual Memory Exhaustion issue which I have already addressed is:


The highlighted event is the approximate time it crashed





Error Description:

The computer restarted from a bugcheck  - (Im looking at the .DMP next.)

The server {AB8902B4-09CA-4BB6-B78D-A8F59079A8D5} did not register with DCOM within the required timeout.

The WarpJITSvc service terminated unexpectedly.  It has done this 1 time(s).
10  Alternate cryptocurrencies / Mining (Altcoins) / Stability Issues on: January 19, 2018, 02:47:57 AM
Hey All!

This forum has been super helpful while putting my rig together (thanks guys!) but, I still can't iron out these last few issues I am having.

It seems that every 3-4 hrs or so my rig hard crashes. The PC can only be recovered by completely unplugging it. With my luck, this usually happens as soon as I leave for work. I have had to resort to purchasing a Wifi outlet in the event this happens while I am away. Currently, the rig is mining 24/7 only with the aid of a BAT file I made that auto reboots every 2.5 hours. This seems to have happened shortly after adding the 6th card but, I also changed from Skunkhash to Neoscrypt around that time so I am not sure. The only thing that sticks out to me is I get weird power fluctuations (See pic) in MSI AB that seem to lower my hashrate while they happen. After about 5 mins, the flucuation goes away. Am I missing something?

The Satoshi Smasher


Fluctuation



MSI AfterBurner
Power 84%
Temp Limit 85%
1080-Core +45, Mem +450
1070-Core +70, Mem +375

Power:
~940W via Kill-a-Watt

Algo:
Neoscrypt ~6.15/MHs

Miner
Ccminer KlausT Cuda 8

System Specs:
Celeron G3920
Asus B250 Mining Expert MB
Crucial DDR4 2400Mhz 8GB
250GB HDD (SSD to come)
1200W 87% PSU
Virtual Mem= 98GB
Nvidia Driver-388.43
 
Cards:
1x Gigabyte 1080
3x Gigabyte 1070 Mini
1x Gigabyte G1 1070
1x Asus Strix Rog 1070
Pages: [1]
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!