Bitcoin Forum
November 16, 2024, 01:25:44 AM *
News: Latest Bitcoin Core release: 28.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1] 2 »  All
  Print  
Author Topic: [0.25 BTC bounty] Radeon 6990 2nd gpu overheat  (Read 5315 times)
Jack of Diamonds (OP)
Sr. Member
****
Offline Offline

Activity: 252
Merit: 251



View Profile
July 04, 2011, 07:20:36 PM
Last edit: July 04, 2011, 07:36:55 PM by Jack of Diamonds
 #1

To cut a long story short, here is the GPU in OC mode in Furmark & Metro 2033 & idle shortly after,
some of the most demanding 3d apps out there.



Here is GPU #1 after 30 seconds of bitcoin mining in Phoenix.



I know the GPU is not faulty since it works fine in everything else and doesn't overheat.
Why would bitcoin miner clients do this, and can it be solved? Both cores running at identical clocks (840 mem, 880 core, max. fan speed while mining).

I've triple checked that both miners also have the exact same flags, cpu affinity etc... Even when creating 2 blank miners with no extra settings this happens.

1f3gHNoBodYw1LLs3ndY0UanYB1tC0lnsBec4USeYoU9AREaCH34PBeGgAR67fx
shata
Sr. Member
****
Offline Offline

Activity: 547
Merit: 253



View Profile
July 04, 2011, 10:54:10 PM
 #2

How close together are the two 6990's is there small gap so top card can pull air? I dont think its faulty card, if gpu cant get enough air either try extender to give it more air or down clock gpu a bit to keep heat down. I have 6970 and 5870 and one card is always gona run at higher temp if there right next to each other.

▄▄█████████▄▄
▄█████████████████▄
▄████████████ ▀███████▄
▄█████████████   ▀██████▄
▄█████████ ▀████▄   ▀█████▄
██████████  ██████▄   █████
█████ ▀████▄ ▀██████▄ █████
█████   ▀████▄ ▀ ██████████
▀█████▄   ▀████▄ █████████▀
▀██████▄   █████████████▀
▀███████▄ ████████████▀
▀█████████████████▀
▀▀█████████▀▀
Emporium.
Finance
.
Decentralized Peer-to-Peer
Marketplace and DeFi
Liquidity Mining Platform
.
▄▄█▀▀██▀██▀▄▄
▄███▀██▀▀▀▀▀ ▄▄   
 ▀          ▄█▀▄█▄    
▄▄▄▄▄        ▀   ▀██▄███▄
▄██████▄          ▄▄██████▄
███████▌       ▄███████████
█████████▄  ▀█▄████████████
███████████▄▄▄▀▀▀▀▀████████
▀█████████████▀     ▀▀████▀
▀████████████▄        ██▀
▀████████████▌    ▄▄██▀
▀██████████▌  ▄███▀
▀▀██████ ▄█▀▀
Available
in +125
Countries
▄███▄
█████
▀███▀
▄▄▄     ▄█████▄     ▄▄▄
█████    ███████    █████
█████    ███████    █████
▄███▄               ▄███▄
███████     ███     ███████
███████ ██▄█████▄██ ███████
▀▀▀▀▀▀▀  ███▀ ▀███  ▀▀▀▀▀▀▀
███▄ ▄███
██▀█████▀██
███
Community
Governance
System
▄▄██████▄▄ ▄▀▄
      ▀▀▀ ▄██▄
          ▀██ ▄██▄       ▄█
        ▄██   ▀▀███▄   ▄███
       ▄██        ▀█▄   ███
      ▄██           ▀  ▄███
     ▄██        ▄▄     ▀███
    ▄██        ██▀      ███
   ▄██                ▄████
  ▄██         ▄█████████▄ █
  ▀▀      ▄▄▄█████ █▀  ████
      ▄▄██▀▀██▀  ███▄  ▄███
    ▄██████████████████████
Liquidity
Mining
Platform
.
████████████████████████████████████████████████████████
.
JOIN NOW
.
████████████████████████████████████████████████████████
Jack of Diamonds (OP)
Sr. Member
****
Offline Offline

Activity: 252
Merit: 251



View Profile
July 04, 2011, 11:14:56 PM
 #3

It's actually just 1 card, but as you know it houses two GPU's on the board. It's the only card in the system.

1f3gHNoBodYw1LLs3ndY0UanYB1tC0lnsBec4USeYoU9AREaCH34PBeGgAR67fx
shata
Sr. Member
****
Offline Offline

Activity: 547
Merit: 253



View Profile
July 05, 2011, 12:40:04 AM
 #4

Good point forgot... um has it always done this since you got it? maybe thermal paste job on hot cpu was not done very good. My diamond 5770 had HORRIBLE paste job n i ran in the mid 90's during load. Took it apart n there was paste all over the chip not just on die. long story short cleaned it n put some AS5 paste on it and it dropped down to the normal 70-80C range.

▄▄█████████▄▄
▄█████████████████▄
▄████████████ ▀███████▄
▄█████████████   ▀██████▄
▄█████████ ▀████▄   ▀█████▄
██████████  ██████▄   █████
█████ ▀████▄ ▀██████▄ █████
█████   ▀████▄ ▀ ██████████
▀█████▄   ▀████▄ █████████▀
▀██████▄   █████████████▀
▀███████▄ ████████████▀
▀█████████████████▀
▀▀█████████▀▀
Emporium.
Finance
.
Decentralized Peer-to-Peer
Marketplace and DeFi
Liquidity Mining Platform
.
▄▄█▀▀██▀██▀▄▄
▄███▀██▀▀▀▀▀ ▄▄   
 ▀          ▄█▀▄█▄    
▄▄▄▄▄        ▀   ▀██▄███▄
▄██████▄          ▄▄██████▄
███████▌       ▄███████████
█████████▄  ▀█▄████████████
███████████▄▄▄▀▀▀▀▀████████
▀█████████████▀     ▀▀████▀
▀████████████▄        ██▀
▀████████████▌    ▄▄██▀
▀██████████▌  ▄███▀
▀▀██████ ▄█▀▀
Available
in +125
Countries
▄███▄
█████
▀███▀
▄▄▄     ▄█████▄     ▄▄▄
█████    ███████    █████
█████    ███████    █████
▄███▄               ▄███▄
███████     ███     ███████
███████ ██▄█████▄██ ███████
▀▀▀▀▀▀▀  ███▀ ▀███  ▀▀▀▀▀▀▀
███▄ ▄███
██▀█████▀██
███
Community
Governance
System
▄▄██████▄▄ ▄▀▄
      ▀▀▀ ▄██▄
          ▀██ ▄██▄       ▄█
        ▄██   ▀▀███▄   ▄███
       ▄██        ▀█▄   ███
      ▄██           ▀  ▄███
     ▄██        ▄▄     ▀███
    ▄██        ██▀      ███
   ▄██                ▄████
  ▄██         ▄█████████▄ █
  ▀▀      ▄▄▄█████ █▀  ████
      ▄▄██▀▀██▀  ███▄  ▄███
    ▄██████████████████████
Liquidity
Mining
Platform
.
████████████████████████████████████████████████████████
.
JOIN NOW
.
████████████████████████████████████████████████████████
Xenland
Legendary
*
Offline Offline

Activity: 980
Merit: 1003


I'm not just any shaman, I'm a Sha256man


View Profile
July 05, 2011, 03:15:25 AM
 #5

My Radeon 6990 made by Diamond is in over clocked mode and I basically have to have the Fans running at 95-100%(usually a constant 100%) unless it is at night because it gets freezing were I live so usualy open a window about that time and can set the fans to around 65-80%.
 Any ways what are your fan speeds at?

P.s. my gpu temps are wierd like that aswell even with fans full blast.
Jack of Diamonds (OP)
Sr. Member
****
Offline Offline

Activity: 252
Merit: 251



View Profile
July 05, 2011, 04:30:59 AM
 #6

Ok, good to hear it's normal then. I do run 5 other 6990's since March but none with this big temp fluctuations.

Fans are constantly at 90-100% due to that one core. The 1st gpu temp is just as it should be around 70-75, but the 2nd goes nuts.
And it only happens while mining so it can't be misapplied thermal paste etc. (temps stay within 1-2c of each other in different apps)

1f3gHNoBodYw1LLs3ndY0UanYB1tC0lnsBec4USeYoU9AREaCH34PBeGgAR67fx
Xenland
Legendary
*
Offline Offline

Activity: 980
Merit: 1003


I'm not just any shaman, I'm a Sha256man


View Profile
July 05, 2011, 05:24:30 AM
 #7

Id like to note that although not reccomended my 6990 runs at 105c stably but i dont even run it at that temp after the inital stress test.
checkmate
Newbie
*
Offline Offline

Activity: 57
Merit: 0


View Profile
July 05, 2011, 02:34:39 PM
 #8

Did you try to reduce the memory clock in order to get a lower temperature?

Works fine for me...
Jack of Diamonds (OP)
Sr. Member
****
Offline Offline

Activity: 252
Merit: 251



View Profile
July 05, 2011, 03:41:13 PM
Last edit: July 05, 2011, 04:36:55 PM by Jack of Diamonds
 #9

Did you try to reduce the memory clock in order to get a lower temperature?

Works fine for me...

Tried to 840mhz ages ago, works great for the other core. Also puts temps in games etc. at about 65c for both cores, both equally loaded to 99-100%.

2nd core still goes up to 99c while mining.

1f3gHNoBodYw1LLs3ndY0UanYB1tC0lnsBec4USeYoU9AREaCH34PBeGgAR67fx
kookiekrak
Full Member
***
Offline Offline

Activity: 238
Merit: 100


View Profile
July 05, 2011, 03:59:50 PM
 #10

the temperature difference is because data moves in a stream in the gpu

input -> GPU 1 -> GPU 2 -> output

GPU 1 will have a continuous stream of work, while gpu 2 will get the work that gpu 1 hasn't done.

thats why gpu1 is running at full capacity while gpu 2 is slower. chances are if you turn up your aggression, it'll increase the temp.

as for your temp, 99*C is a safe temperature for the gpu, so you're fine. try increasing circulation in your case to drop a few deg, but it wont really matter.


if i've answered your q, http://payb.tc/kookiekrak/ =D

FREE 100 GME if you post your Cryptsy TradeKey on the official GameCoinTalk forums

http://www.gamecointalk.org/index.php?topic=7.msg9#new
Jack of Diamonds (OP)
Sr. Member
****
Offline Offline

Activity: 252
Merit: 251



View Profile
July 05, 2011, 07:57:02 PM
 #11

the temperature difference is because data moves in a stream in the gpu

input -> GPU 1 -> GPU 2 -> output

GPU 1 will have a continuous stream of work, while gpu 2 will get the work that gpu 1 hasn't done.

thats why gpu1 is running at full capacity while gpu 2 is slower. chances are if you turn up your aggression, it'll increase the temp.

as for your temp, 99*C is a safe temperature for the gpu, so you're fine. try increasing circulation in your case to drop a few deg, but it wont really matter.


if i've answered your q, http://payb.tc/kookiekrak/ =D

That stream of work theory definitely makes sense.

Any idea why it would only happen while mining though? The load is spread pretty much 50/50 in everything else that stresses the GPUs. During mining both cores have same hash rates but only the 2nd core overheats.

Or maybe it's GUIminer/Poclbm?

1f3gHNoBodYw1LLs3ndY0UanYB1tC0lnsBec4USeYoU9AREaCH34PBeGgAR67fx
SmokeAndMirrors
Full Member
***
Offline Offline

Activity: 168
Merit: 100


View Profile
July 06, 2011, 01:17:55 AM
 #12

I'm not sure about the 6990's, but most other 6x series cards are stable at 300mhz mem clock without hurting hashing speeds. I run multiple 6970's which don't run too much hotter than 6990 and this lowers my temps by almost 10C.

On top of this, go into your catalyst control panel -> performance -> overdrive and reduce power consumption until you notice drops in temp without stability issues. I don't know 100% if this will help temps(it should), but I used to keep my 6970's at -%50 power without having to underclock anything in order for my 650 watt PSU to power 2 cards without lockups.

Let me know if this helps.

Help Bitcoins by buying clothes, technology, books, etc. through people/stores that accept BTC. This will increase overall value of BTC as well as mitigate unnecessary bank transaction fees.

My address -
1EM9HGg1SEa5Bux1rVEPxGqGSfNTTc9EkC
kiwiasian
Full Member
***
Offline Offline

Activity: 168
Merit: 100


View Profile
July 06, 2011, 02:50:29 AM
Last edit: July 06, 2011, 03:05:41 AM by kiwiasian
 #13

Some misinformation here...

Furmark is the one thing that will push your card the hardest. Nothing will use more resources than Xtreme Burn-in mode in Furmark. Period.

If Pheonix yields higher temperatures you probably did not have Burn-in selected during your Furmark run.

That's not how Crossfire multi-GPU rendering works--each card works in an alternate pattern meaning GPU1 does the first portion of the work and GPU2 does the second portion, and it repeats.

What's your fan speed?

And what is the exact model of your card? Reference or non-reference?

What case? Or open case environment?

Have you tried reseating the heatsink with a fresh TIM application? (Preferrably Shin-etsu, IC Diamond, or anything of the likes)

Edit:
What's your vcore?

And to the above poster: 99c is definitely NOT safe. Yes, your GPU will not die at that temperature, but running them at that temperature for such a prolonged time will definitely shorten the lifespan of your GPU or damage its components. While you're at it check VRM temps too to make sure you're not frying those as well. My GPU cores mining are about 45c (two 5850s)

Also, do this for me will you:
Measure the temperature of the air coming out of the card from the back. In celsius.

I'm a computer hardware enthusiast and I have been way before the whole mining hype began, I would know.

Tradehill referral link, save 10% | http://www.tradehill.com/?r=TH-R12328
www.payb.tc/kiwiasian | 1LHNW1JGMBo2e7rKiiFz7KJPKE57bqCdEC
Jack of Diamonds (OP)
Sr. Member
****
Offline Offline

Activity: 252
Merit: 251



View Profile
July 06, 2011, 06:39:14 AM
 #14

It's a reference Sapphire 6990 running at 100% fan speed in an open case.

I changed the pastes on my other cards to IC diamond 7,

but AMD says 6990's have a special phase-change TIM which should not be changed because it already has better cooling properties than synthetic diamonds or silver.
So I left that alone, besides the temps are very cool outside of mining.

Vcore is at stock, afterburner wont allow tweaking it

1f3gHNoBodYw1LLs3ndY0UanYB1tC0lnsBec4USeYoU9AREaCH34PBeGgAR67fx
kiwiasian
Full Member
***
Offline Offline

Activity: 168
Merit: 100


View Profile
July 06, 2011, 08:47:36 PM
 #15

What AMD says is BS, switch it out with some IC7 anyways

Also have you measured the exhaust temps

Tradehill referral link, save 10% | http://www.tradehill.com/?r=TH-R12328
www.payb.tc/kiwiasian | 1LHNW1JGMBo2e7rKiiFz7KJPKE57bqCdEC
MysteryMiner
Legendary
*
Offline Offline

Activity: 1512
Merit: 1049


Death to enemies!


View Profile
July 06, 2011, 11:23:36 PM
 #16

Don't know how your card looks like, but my guess is that it have reference cooler and that it is blowing hot air from one GPU past other GPU heatsink. Was a minor problem in some HD5970 models.

try to buy card with fan positioned in center between GPU's or but aftermarket heatsink for 6990.

The ultimate solution might be installing water cooling on your video card. Forget about the special thermal compound, in my opinion it's marketing bullshit. Properly installed Arctic Silver 5 or similar high-end compound will be just as good.

bc1q59y5jp2rrwgxuekc8kjk6s8k2es73uawprre4j
kiwiasian
Full Member
***
Offline Offline

Activity: 168
Merit: 100


View Profile
July 06, 2011, 11:42:34 PM
 #17

Don't know how your card looks like, but my guess is that it have reference cooler and that it is blowing hot air from one GPU past other GPU heatsink. Was a minor problem in some HD5970 models.

try to buy card with fan positioned in center between GPU's or but aftermarket heatsink for 6990.

The ultimate solution might be installing water cooling on your video card. Forget about the special thermal compound, in my opinion it's marketing bullshit. Properly installed Arctic Silver 5 or similar high-end compound will be just as good.

All dual GPU cards have a single heatsink with two GPU mounting bases
That heat transfer issue is not what causes one GPU to be hotter than the other. Each GPU is transferring an equal amount of heat to the same heatsink...which would just make the heatsink heat up faster than a single GPU heatsink.

My guess is that the TIM application on the hot GPU is bad and the application on the cooler GPU is fine.

But a spread with any quality aftermarket TIM will improve temps on both GPUs anyways (provided you do it correctly)

Tradehill referral link, save 10% | http://www.tradehill.com/?r=TH-R12328
www.payb.tc/kiwiasian | 1LHNW1JGMBo2e7rKiiFz7KJPKE57bqCdEC
Jack of Diamonds (OP)
Sr. Member
****
Offline Offline

Activity: 252
Merit: 251



View Profile
July 06, 2011, 11:45:04 PM
 #18

or buy an aftermarket heatsink for 6990

That's the most obvious solution. Looked into it the first time this occurred.

Of course, it just happens the 6990 is the only graphics card in existence with no aftermarket coolers available. Sure there's some ref. waterblocks, but no custom air coolers.

Quote
All dual GPU cards have a single heatsink

Well the 6990 is the first card with two separate heatsinks with vapor chambers beneath.
Here are the schematics

1f3gHNoBodYw1LLs3ndY0UanYB1tC0lnsBec4USeYoU9AREaCH34PBeGgAR67fx
kiwiasian
Full Member
***
Offline Offline

Activity: 168
Merit: 100


View Profile
July 06, 2011, 11:56:41 PM
 #19

Interesting, I have never seen that before...
Have you taken those temp measurements yet..

Tradehill referral link, save 10% | http://www.tradehill.com/?r=TH-R12328
www.payb.tc/kiwiasian | 1LHNW1JGMBo2e7rKiiFz7KJPKE57bqCdEC
Jack of Diamonds (OP)
Sr. Member
****
Offline Offline

Activity: 252
Merit: 251



View Profile
July 07, 2011, 12:21:54 AM
 #20

Heatgun says 78.2

1f3gHNoBodYw1LLs3ndY0UanYB1tC0lnsBec4USeYoU9AREaCH34PBeGgAR67fx
Pages: [1] 2 »  All
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!