Hi,
I've tried googling this, but haven't managed to find much on the topic. I thought folks here might be able to offer some concrete tips and advice.
Basically, I'm suspecting one of the GPUs in my mining rig of being close to expiry, but have absolutely no way of really confirming (or disproving) this. Are there any "tests", or anything I can do to at least fairly accurately determine the card is (or isn't) dying, or is it all pretty much undeterminable with any real degree of certainty?
Stress testing is the way to go. Since mining is basically stress testing you might up the intensity and see how many HW errors you get. Go lower with the intensitiy untill the card can run without or very few HW errors.
For info, the card is a Sapphire R9 270 (non-X), and it's only about 4 months old. It has been mining nearly 24/7 in those 4 months, with temperatures between 75 and 80°C for 99% of the time, and just over 80, but not over 85 for literally a few hours in its entire life.
Sounds like bad cooling. Sure the GPU can take the heat but its not going to last long under these conditions.
Further info, the reason I'm suspecting the card (which is a primary card in my rig, with the monitor connected to it) is that, although it still mines stably, the GUI on this BAMT based rig often hangs, and I recently tried switching over to PiMP, and again, when booting up the rig from a freshly imaged brand new USB stick, the GUI just hangs for about 10-15 minutes, then sort of comes back to life, but it's still laggy and largely unusable. Someone in the PiMP irc support channel suggested my card might be dying and that got me thinking if that really might be the case, and hanging GUI could be the symptom. Find that odd though, displaying the GUI (in my non-expert opinion) should be a fairly basic and simple task for the card, far easier than mining, which the card can still do.
Thanks in advance.
Yes GUI is "easier" but thats not the matter. Usually the memory dies first and calculating the GUI uses the same memory as calculating hashes. Try cranking up the intensity and see how much the card still can handle. You also should improve the cooling.