Bitcoin Forum
May 10, 2024, 07:56:42 AM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: 1 2 3 [All]
  Print  
Author Topic: Mining rig randomly shuts off?  (Read 13054 times)
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
April 29, 2011, 09:55:46 PM
 #1

I finally got two of my 6990s hooked up and both of them are mining just fine now under windows 7.
but then randomly the whole computer just shuts off, but not completely.
the monitor no longer displays anything and i know they have stopped mining because i checked deepbit.


im using 11.4 cata, not sure about sdk, i never installed any so it might be what ever it is default
its also @ 100 percent cpu and 56 percent ram.

anyone know whats going on here?
1715327802
Hero Member
*
Offline Offline

Posts: 1715327802

View Profile Personal Message (Offline)

Ignore
1715327802
Reply with quote  #2

1715327802
Report to moderator
1715327802
Hero Member
*
Offline Offline

Posts: 1715327802

View Profile Personal Message (Offline)

Ignore
1715327802
Reply with quote  #2

1715327802
Report to moderator
1715327802
Hero Member
*
Offline Offline

Posts: 1715327802

View Profile Personal Message (Offline)

Ignore
1715327802
Reply with quote  #2

1715327802
Report to moderator
Bitcoin addresses contain a checksum, so it is very unlikely that mistyping an address will cause you to lose money.
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
Cablesaurus
Sr. Member
****
Offline Offline

Activity: 302
Merit: 250



View Profile WWW
April 29, 2011, 10:13:41 PM
 #2

Yes, your Power Supply is supplying insufficient power. You need a better PSU.

PCIe Extender Cables; Dummy Plugs, Fans; PSU Cables; Cases & More
Visit www.Cablesaurus.com and our forum thread at http://bitcointalk.org/index.php?topic=6128.0
eleuthria
Legendary
*
Offline Offline

Activity: 1750
Merit: 1007



View Profile
April 29, 2011, 10:14:15 PM
 #3

What size power supply are you running, and are the cards overclocked?  Sounds like the cards are crashing, and they're crashing in a way that Windows can't recover from.  Two 6990s require a MASSIVE power supply, especially if the machine isn't dedicated to mining meaning other components may be drawing a reasonable amount of power as well.

RIP BTC Guild, April 2011 - June 2015
rezin777
Full Member
***
Offline Offline

Activity: 154
Merit: 100


View Profile
April 29, 2011, 10:28:11 PM
 #4

If he bought the power supply from the build specs he listed before, he has ample power.

Unless it's defective. But it seems to be a very good unit.
Tannke
Member
**
Offline Offline

Activity: 84
Merit: 10


View Profile
April 29, 2011, 10:44:04 PM
 #5

Do you have enabled the energy tweeks (shutdown monitor and sleep mode features)?

I had to disable them, that features may cause that kind of issues.  Undecided
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
April 29, 2011, 11:59:16 PM
 #6

i got a 1250 watt power supply, planning on running three 6990s on it, they will not be overclocked however.

i already changed the sleep settings. anyone know how to hook up the third one using windows 7?
grndzero
Sr. Member
****
Offline Offline

Activity: 392
Merit: 250


View Profile
April 30, 2011, 12:03:17 AM
 #7

Are you going through a regulating power strip or UPS?

Could be the power supply being sensitive to the incoming power.

Ubuntu Desktop x64 -  HD5850 Reference - 400Mh/s w/ cgminer  @ 975C/325M/1.175V - 11.6/2.1 SDK
Donate if you find this helpful: 1NimouHg2acbXNfMt5waJ7ohKs2TtYHePy
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 12:18:05 AM
 #8

Single rail or multi rail power supply? If multi rail make sure you spread the load across all the available PCIE rails.

clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
April 30, 2011, 01:17:03 AM
 #9

yeah its connected to a power strip, as for rails im not sure what that means, but im using all but two of the pcie ports for power, since the 6990 needs 2 power cables each.

what should i do, connect it to the wall instead?

actually do you think logmein is messing with it? cus i left it alone for two hours and it hadnt shut off, then i used logmein to check up on it, and it shut off within an hour.
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 01:32:33 AM
 #10

yeah its connected to a power strip, as for rails im not sure what that means, but im using all but two of the pcie ports for power, since the 6990 needs 2 power cables each.

what should i do, connect it to the wall instead?

actually do you think logmein is messing with it? cus i left it alone for two hours and it hadnt shut off, then i used logmein to check up on it, and it shut off within an hour.

It could maybe possibly remotely cause a driver crash..also try to tone down on the aggressiveness of the miner in case the system being at 100% GPU load is somehow causing driver crashes. As for power supply how about you tell me what power supply specifically you have and I'll tell you if it has dual rails or single. OR you can google it Smiley


clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
April 30, 2011, 01:39:21 AM
 #11

OCZ ZX Series OCZ-ZX1250W 1250W ATX12V v2.2 / EPS12V SLI Ready CrossFire Ready 80 PLUS GOLD Certified Modular Active PFC Power Supply Compatible with Core i5 & i7

+12V Rails
    Single

thats the one.

i cant seem to stop it from being 100 percent cpu, i didnt put any flags (im using guiminer)
ghost
Newbie
*
Offline Offline

Activity: 34
Merit: 0


View Profile
April 30, 2011, 02:02:35 AM
 #12

Single rail or multi rail power supply? If multi rail make sure you spread the load across all the available PCIE rails.

This post helped me tremendously. I have dual 6990s and an HX1000 power supply. The 6990s have an over-clock switch that raises the voltage and produces an extra 15% to my hash rates. It was causing the computer to freeze after a couple of minutes and I figured it might be overloading the power supply. Sure enough I didn't have the two cards spread evenly between the rails (in the case of the HX1000, it doesn't have rails but is kind of like 2 500 watt power supplies bundle together).
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 02:06:09 AM
 #13

Single rail or multi rail power supply? If multi rail make sure you spread the load across all the available PCIE rails.

This post helped me tremendously. I have dual 6990s and an HX1000 power supply. The 6990s have an over-clock switch that raises the voltage and produces an extra 15% to my hash rates. It was causing the computer to freeze after a couple of minutes and I figured it might be overloading the power supply. Sure enough I didn't have the two cards spread evenly between the rails (in the case of the HX1000, it doesn't have rails but is kind of like 2 500 watt power supplies bundle together).

I have the same power supply and even with a single 5970 it is still recommended that one spreads the load across the two 12v rails. I'm amazed the PSU handled two 6990 on the same rail. However when I think of it I am not surprised as I've read somewhere that the HX1000W is conservatively rated (ie it can deliver more than 1000W if pushed) and it really a 1200W PSU sold as 1000W (designed by an OEM called CWT)


allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 02:08:57 AM
 #14

OCZ ZX Series OCZ-ZX1250W 1250W ATX12V v2.2 / EPS12V SLI Ready CrossFire Ready 80 PLUS GOLD Certified Modular Active PFC Power Supply Compatible with Core i5 & i7

+12V Rails
    Single

thats the one.

i cant seem to stop it from being 100 percent cpu, i didnt put any flags (im using guiminer)

That is more than enough to power two 6990s, unless of course the PSU is faulty. Do you happen to have a spare computer you can test it on?

clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
April 30, 2011, 02:11:16 AM
 #15

by testing do you mean just plugging it in and running regular computer parts? I can do that, but is there any other solution? i spent a lotta time putting that power supply in heh Grin
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 02:15:46 AM
 #16

by testing do you mean just plugging it in and running regular computer parts? I can do that, but is there any other solution? i spent a lotta time putting that power supply in heh Grin

That and also putting as much load on it as possible.

Also to ensure this is not a driver issue uninstall all the catalyst drivers, boot into safe mode and run driver cleaner (available for free on guru3d site or just google it) and remove all traces of the ATI drivers, boot back into windows and reinstall 11.4. If the problem persists then try downgrading to 11.3 (repeat the same procedure)

If even THAT does not work then something terribly wrong is going on. Check to make sure all the PCIE cables are solidly in place, and also do a memtest. Also keep an eye on your temps (for the 6990s) - possible overheat? Bad RAM would not cause the system to shutdown with no vide though (you'd get a blue screen) so I doubt it's the memory. So far I suspect either a faulty PSU or a catalyst driver issue.


clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
April 30, 2011, 02:22:49 AM
 #17

hmm well i had put in a fresh clean windows 7 with 11.4 cata and NO sdk, it seems fine but only for a couple of hours. im scared to logmein cus i dont wanna taint the testing. ill try out the psu on my other computer , but im afraid i dont have enough to overload it. i have 3 6990s but the third one wont fit into my computer case im gonna have to cut out a hole just for the third one.

thanks for your help though ill test the ram tomorrow and the cata if this problem persists. so far it has been running for an hour with no problem i only logmein once to start it up.

turns out i didnt need to use a dummy monitor with crossfire hope people make use outta this info.

Single rail or multi rail power supply? If multi rail make sure you spread the load across all the available PCIE rails.

This post helped me tremendously. I have dual 6990s and an HX1000 power supply. The 6990s have an over-clock switch that raises the voltage and produces an extra 15% to my hash rates. It was causing the computer to freeze after a couple of minutes and I figured it might be overloading the power supply. Sure enough I didn't have the two cards spread evenly between the rails (in the case of the HX1000, it doesn't have rails but is kind of like 2 500 watt power supplies bundle together).

glad someone made use out of this thread other than me.

ill give an update tomorrow so hopefully it can be useful
ghost
Newbie
*
Offline Offline

Activity: 34
Merit: 0


View Profile
April 30, 2011, 02:23:41 AM
 #18

Single rail or multi rail power supply? If multi rail make sure you spread the load across all the available PCIE rails.

This post helped me tremendously. I have dual 6990s and an HX1000 power supply. The 6990s have an over-clock switch that raises the voltage and produces an extra 15% to my hash rates. It was causing the computer to freeze after a couple of minutes and I figured it might be overloading the power supply. Sure enough I didn't have the two cards spread evenly between the rails (in the case of the HX1000, it doesn't have rails but is kind of like 2 500 watt power supplies bundle together).

I have the same power supply and even with a single 5970 it is still recommended that one spreads the load across the two 12v rails. I'm amazed the PSU handled two 6990 on the same rail. However when I think of it I am not surprised as I've read somewhere that the HX1000W is conservatively rated (ie it can deliver more than 1000W if pushed) and it really a 1200W PSU sold as 1000W (designed by an OEM called CWT)



One of the cards was split between the rails, so one rail had 3 of the 4 PCIe cables plugged into it, and the other rail had 1. Even still, the power supply was pulling over 800 (I believe close to 900) watts from the socket.
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 02:30:26 AM
 #19

Single rail or multi rail power supply? If multi rail make sure you spread the load across all the available PCIE rails.

This post helped me tremendously. I have dual 6990s and an HX1000 power supply. The 6990s have an over-clock switch that raises the voltage and produces an extra 15% to my hash rates. It was causing the computer to freeze after a couple of minutes and I figured it might be overloading the power supply. Sure enough I didn't have the two cards spread evenly between the rails (in the case of the HX1000, it doesn't have rails but is kind of like 2 500 watt power supplies bundle together).

I have the same power supply and even with a single 5970 it is still recommended that one spreads the load across the two 12v rails. I'm amazed the PSU handled two 6990 on the same rail. However when I think of it I am not surprised as I've read somewhere that the HX1000W is conservatively rated (ie it can deliver more than 1000W if pushed) and it really a 1200W PSU sold as 1000W (designed by an OEM called CWT)



One of the cards was split between the rails, so one rail had 3 of the 4 PCIe cables plugged into it, and the other rail had 1. Even still, the power supply was pulling over 800 (I believe close to 900) watts from the socket.

Nice, you got yourself one serious space heater there Smiley However I'd be concerned running the PSU at 80 to 90% rated capacity. I think that will shorten its lifespan.

Did you also try to underclock the memory on those cards? You can save 20-40 watts per card doing so.


Hey btw, checkout this picture to see how the rails are distributed on the HX1000W:

http://forum.corsair.com/forums/showthread.php?t=70317


allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 02:34:29 AM
 #20

hmm well i had put in a fresh clean windows 7 with 11.4 cata and NO sdk, it seems fine but only for a couple of hours. im scared to logmein cus i dont wanna taint the testing. ill try out the psu on my other computer , but im afraid i dont have enough to overload it. i have 3 6990s but the third one wont fit into my computer case im gonna have to cut out a hole just for the third one.

thanks for your help though ill test the ram tomorrow and the cata if this problem persists. so far it has been running for an hour with no problem i only logmein once to start it up.

turns out i didnt need to use a dummy monitor with crossfire hope people make use outta this info.

Single rail or multi rail power supply? If multi rail make sure you spread the load across all the available PCIE rails.

This post helped me tremendously. I have dual 6990s and an HX1000 power supply. The 6990s have an over-clock switch that raises the voltage and produces an extra 15% to my hash rates. It was causing the computer to freeze after a couple of minutes and I figured it might be overloading the power supply. Sure enough I didn't have the two cards spread evenly between the rails (in the case of the HX1000, it doesn't have rails but is kind of like 2 500 watt power supplies bundle together).

glad someone made use out of this thread other than me.

ill give an update tomorrow so hopefully it can be useful

I don't think you necessarily need to overload it. Even with those two 6990s you aren't overloading it. Just put as much strain on it as possible in a different system to see whether the fault lies with one of the components of the other system or with the PSU.

Btw, have you tried using something like VNC or RDP instead of logmein? I would be surprised of logmein caused all this.


ghost
Newbie
*
Offline Offline

Activity: 34
Merit: 0


View Profile
April 30, 2011, 02:42:03 AM
 #21

Single rail or multi rail power supply? If multi rail make sure you spread the load across all the available PCIE rails.

This post helped me tremendously. I have dual 6990s and an HX1000 power supply. The 6990s have an over-clock switch that raises the voltage and produces an extra 15% to my hash rates. It was causing the computer to freeze after a couple of minutes and I figured it might be overloading the power supply. Sure enough I didn't have the two cards spread evenly between the rails (in the case of the HX1000, it doesn't have rails but is kind of like 2 500 watt power supplies bundle together).

I have the same power supply and even with a single 5970 it is still recommended that one spreads the load across the two 12v rails. I'm amazed the PSU handled two 6990 on the same rail. However when I think of it I am not surprised as I've read somewhere that the HX1000W is conservatively rated (ie it can deliver more than 1000W if pushed) and it really a 1200W PSU sold as 1000W (designed by an OEM called CWT)



One of the cards was split between the rails, so one rail had 3 of the 4 PCIe cables plugged into it, and the other rail had 1. Even still, the power supply was pulling over 800 (I believe close to 900) watts from the socket.

Nice, you got yourself one serious space heater there Smiley However I'd be concerned running the PSU at 80 to 90% rated capacity. I think that will shorten its lifespan.

Did you also try to underclock the memory on those cards? You can save 20-40 watts per card doing so.


Hey btw, checkout this picture to see how the rails are distributed on the HX1000W:

http://forum.corsair.com/forums/showthread.php?t=70317



No, I haven't had much luck figuring out how to lower mem speeds on linux. I think that's about the only thing left to get this computer running optimally.

I found that rail diagram when your initial post got me on the right track.
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 03:35:14 AM
 #22

Quote

No, I haven't had much luck figuring out how to lower mem speeds on linux. I think that's about the only thing left to get this computer running optimally.

I found that rail diagram when your initial post got me on the right track.

Hmm, there is a thread (or two perhaps) in the mining section which mentions a specific program you can use to lower the mem even under linux. I am pretty sure it's possible. It's really worth giving it your best because it can help you save some $ by lowering the power consumption of your card(s)

Yeah that diagram was very useful to me too a while back.

gigabytecoin
Sr. Member
****
Offline Offline

Activity: 280
Merit: 252


View Profile
April 30, 2011, 04:14:22 AM
 #23

Probably cheap parts. I have the exact same problem Sad
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 04:42:40 AM
 #24

Probably cheap parts. I have the exact same problem Sad

Well I dunno. I don't think the OP mentioned whether the power supply was new or whether the cards are new.

Therein lies the problem..in our drive to lower our costs us miners often run into problems that totally wipes out the savings we're chasing after.

clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
April 30, 2011, 12:54:45 PM
 #25

okay well guys its been running for over 10 hours now, im pretty confident it WAS indeed logmein.
i havened used logmein the whole tiem, and when i finally just did, i saw the mhash shoot down from 333mhash to 29mhash and then shoot back to 333mhash

i really hope this means my psu is not messed up.

thanks for your help guys, ill try team viewer maybe it wont crash it. but in anycase, logmein may be a suspect. if your gonna use it, use it for a couple of seconds just to check on it. im not gonna be able to test if logmein really does it because i got alotta stuff to do this weekend.

thanks again guys
grndzero
Sr. Member
****
Offline Offline

Activity: 392
Merit: 250


View Profile
April 30, 2011, 01:09:43 PM
 #26

okay well guys its been running for over 10 hours now, im pretty confident it WAS indeed logmein.
i havened used logmein the whole tiem, and when i finally just did, i saw the mhash shoot down from 333mhash to 29mhash and then shoot back to 333mhash

i really hope this means my psu is not messed up.

thanks for your help guys, ill try team viewer maybe it wont crash it. but in anycase, logmein may be a suspect. if your gonna use it, use it for a couple of seconds just to check on it. im not gonna be able to test if logmein really does it because i got alotta stuff to do this weekend.

thanks again guys

From your second statement about the Mh/s crashing when using it, it sounds like logmein is accessing OpenCL and probably causing a conflict in the device driver. Why it would just poweroff instead of throwing up a BSOD is kind of odd.

Ubuntu Desktop x64 -  HD5850 Reference - 400Mh/s w/ cgminer  @ 975C/325M/1.175V - 11.6/2.1 SDK
Donate if you find this helpful: 1NimouHg2acbXNfMt5waJ7ohKs2TtYHePy
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
April 30, 2011, 01:20:41 PM
 #27

hah! i can confirm it now! i just used logmein about 40 minutes ago when i woke up to see if it was still on, it indeed was then it shut off within the last half hour.

so people avoid logmein for rigs.
okay well guys its been running for over 10 hours now, im pretty confident it WAS indeed logmein.
i havened used logmein the whole tiem, and when i finally just did, i saw the mhash shoot down from 333mhash to 29mhash and then shoot back to 333mhash

i really hope this means my psu is not messed up.

thanks for your help guys, ill try team viewer maybe it wont crash it. but in anycase, logmein may be a suspect. if your gonna use it, use it for a couple of seconds just to check on it. im not gonna be able to test if logmein really does it because i got alotta stuff to do this weekend.

thanks again guys

From your second statement about the Mh/s crashing when using it, it sounds like logmein is accessing OpenCL and probably causing a conflict in the device driver. Why it would just poweroff instead of throwing up a BSOD is kind of odd.

you are probably right.
the thing is, it dosent entirely shut off, it kinda of just stops working, but the fans are still running, even though the operating system is now.
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
April 30, 2011, 02:41:56 PM
 #28

That is highly odd. I googled your problem with logmein and no useful results came up. So this is something that you may wish to report to the logmein developers.

Good thing that your PSU wasn't at fault Smiley

grndzero
Sr. Member
****
Offline Offline

Activity: 392
Merit: 250


View Profile
April 30, 2011, 05:43:15 PM
 #29

hah! i can confirm it now! i just used logmein about 40 minutes ago when i woke up to see if it was still on, it indeed was then it shut off within the last half hour.

so people avoid logmein for rigs.
okay well guys its been running for over 10 hours now, im pretty confident it WAS indeed logmein.
i havened used logmein the whole tiem, and when i finally just did, i saw the mhash shoot down from 333mhash to 29mhash and then shoot back to 333mhash

i really hope this means my psu is not messed up.

thanks for your help guys, ill try team viewer maybe it wont crash it. but in anycase, logmein may be a suspect. if your gonna use it, use it for a couple of seconds just to check on it. im not gonna be able to test if logmein really does it because i got alotta stuff to do this weekend.

thanks again guys

From your second statement about the Mh/s crashing when using it, it sounds like logmein is accessing OpenCL and probably causing a conflict in the device driver. Why it would just poweroff instead of throwing up a BSOD is kind of odd.

you are probably right.
the thing is, it dosent entirely shut off, it kinda of just stops working, but the fans are still running, even though the operating system is now.

If you have another computer that you can test with if it happens again I would suggest the following:

ping to see if the system and networking are still up
try to connect to a file share to see if higher level networking is still online
try to do a remote desktop connection and see whether it's just the video card that is cutting out and killing the signal to the monitor as well

At least then you have some idea at what level it's actually happening.

Ubuntu Desktop x64 -  HD5850 Reference - 400Mh/s w/ cgminer  @ 975C/325M/1.175V - 11.6/2.1 SDK
Donate if you find this helpful: 1NimouHg2acbXNfMt5waJ7ohKs2TtYHePy
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 01, 2011, 12:20:28 AM
 #30

hmmm now im confused. sorry about the confirmation on logmein earlier. it happened again today twice. once after it happened, i turned it on again and windows was completeing updates, so that MIGHT be it, im not gonna confirm it though. sigh ill have to test the psu
JJG
Member
**
Offline Offline

Activity: 70
Merit: 20


View Profile
May 01, 2011, 12:29:07 AM
 #31

hmmm now im confused. sorry about the confirmation on logmein earlier. it happened again today twice. once after it happened, i turned it on again and windows was completeing updates, so that MIGHT be it, im not gonna confirm it though. sigh ill have to test the psu

Best of luck. Keep us posted.
TurboK
Full Member
***
Offline Offline

Activity: 136
Merit: 100



View Profile
May 01, 2011, 12:57:20 AM
 #32

It's not your PSU. It's a driver bug. Accessing the video overlay when OpenCL apps are running will lock up a 69xx card. 58xx series suffered from this too, but they fixed it in catalyst 10.4. The 69xx is affected even with catalyst 11.4.

http://bitcointalk.org/index.php?topic=4669
http://forums.amd.com/devforum/messageview.cfm?catid=390&threadid=128404&enterthread=y

12zJNWtM2HknS2EPLkT9QPSuSq1576aKx7

Tradehill viral bullshit code: TH-R114411
Steve
Hero Member
*****
Offline Offline

Activity: 868
Merit: 1007



View Profile WWW
May 01, 2011, 01:01:34 AM
 #33

What motherboard are you using that will let you run with 3 cards?  Are they spaced far enough apart to allow airflow between?

(gasteve on IRC) Does your website accept cash? https://bitpay.com
TurboK
Full Member
***
Offline Offline

Activity: 136
Merit: 100



View Profile
May 01, 2011, 01:07:03 AM
 #34

What motherboard are you using that will let you run with 3 cards?  Are they spaced far enough apart to allow airflow between?

The card scales back its own clockspeed if temps reach too high. You'd have to instantly reach a meltdown temperature for it to hang outright.

I'm telling you, it's the opencl + video bug. Easy enough to reproduce: Have your mining up running, then go to Catalyst Control Center and try opening the Video Settings menu. Or just launch GPU-Z AFTER you have started up the mining app.

12zJNWtM2HknS2EPLkT9QPSuSq1576aKx7

Tradehill viral bullshit code: TH-R114411
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 01, 2011, 02:34:17 AM
 #35

What motherboard are you using that will let you run with 3 cards?  Are they spaced far enough apart to allow airflow between?

The card scales back its own clockspeed if temps reach too high. You'd have to instantly reach a meltdown temperature for it to hang outright.

I'm telling you, it's the opencl + video bug. Easy enough to reproduce: Have your mining up running, then go to Catalyst Control Center and try opening the Video Settings menu. Or just launch GPU-Z AFTER you have started up the mining app.

ahh so what do you recommend then? should i downgrade to 10.4?
TurboK
Full Member
***
Offline Offline

Activity: 136
Merit: 100



View Profile
May 01, 2011, 03:05:08 AM
 #36

10.4 is where they fixed it for the 58xx series (after half a year of the card being out). Earliest Catalyst to support 6950 is 10.11a, I think, and 6990 support was just added recently. So you can't roll back drivers.
68xx cards are not affected, only the 69xx ones that use the new VLIW4 architecture. Chances are, the upcoming Radeon 7xxx will have this bug too, unless they fix it till then.

The only thing you can do is watch out and not open any apps that trigger the video overlay, while mining. That means most video players, untick "hardware acceleration" for Flash, the Video Settings in CCC, GPU-Z at startup (works fine if you run your miner after gpu-z started up though), pcsx2, and Google Chrome with hardware acceleration enabled - these are the ones I've stumbled across so far.

And, perhaps keep bugging AMD about it. Maybe they'll fix it sooner if more people complain. I wouldn't keep my hopes up, the 58xx series got fixed cause it affected an entire generation of cards, here it only affects 3 cards.

12zJNWtM2HknS2EPLkT9QPSuSq1576aKx7

Tradehill viral bullshit code: TH-R114411
allinvain
Legendary
*
Offline Offline

Activity: 3080
Merit: 1080



View Profile WWW
May 01, 2011, 03:53:17 AM
 #37

Wow, way the go AMD. Way to keep up your reputation for having shitty drivers.

clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 01, 2011, 04:00:12 AM
 #38

i see... i shouldnt have gotten these cards then... dam
thanks for your help. im not even sure what to do now.
im thinking of setting up a program to restart my computer every hour and then turn on guiminer
that way i can restart my comp b4 the driver issue shuts me down.
what do you think of this plan? maybe it will slow my bitcoin mining down a lot i suppose
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 01, 2011, 10:48:57 AM
 #39

i see... i shouldnt have gotten these cards then... dam
thanks for your help. im not even sure what to do now.
im thinking of setting up a program to restart my computer every hour and then turn on guiminer
that way i can restart my comp b4 the driver issue shuts me down.
what do you think of this plan? maybe it will slow my bitcoin mining down a lot i suppose

What he is saying is that any app that uses the video overlay will crash the card. If this is a dedicated mining rig, no such app should be opening, and it won't crash. So, are you using the computer when this is happening, or is it just sitting somewhere mining?

i was initially using logmein, now im not...but its still installed... ill uninstall it today but i doubt it would make a difference. i dont use this thing for anything else otherwise.
TurboK
Full Member
***
Offline Offline

Activity: 136
Merit: 100



View Profile
May 01, 2011, 02:25:04 PM
 #40

Wow, way the go AMD. Way to keep up your reputation for having shitty drivers.


The problem affects a very slim percent of users. If you are running the card as a dedicated opencl accelerator, chances are you aren't the type of guy to hang out on Youtube all day. On a dedicated mining rig, no other app should be running that triggers the bug.
Besides, nVidia drivers aren't any better either. Remember the nforce chipset?

clonedone: run through the windows services, see if there's anything suspicious there (such as Windows Media Center). Also check if there are any scheduled tasks. I run a 6950 and any time it hangs, it's from an application I know I opened, never from something that opened in the background.

12zJNWtM2HknS2EPLkT9QPSuSq1576aKx7

Tradehill viral bullshit code: TH-R114411
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 01, 2011, 03:19:08 PM
 #41

Wow, way the go AMD. Way to keep up your reputation for having shitty drivers.


The problem affects a very slim percent of users. If you are running the card as a dedicated opencl accelerator, chances are you aren't the type of guy to hang out on Youtube all day. On a dedicated mining rig, no other app should be running that triggers the bug.
Besides, nVidia drivers aren't any better either. Remember the nforce chipset?

clonedone: run through the windows services, see if there's anything suspicious there (such as Windows Media Center). Also check if there are any scheduled tasks. I run a 6950 and any time it hangs, it's from an application I know I opened, never from something that opened in the background.

i just reformatted the whole system, installed the 11.4 cata, installed guiminer, and started it. it mined for 20 mins, then did the same wierd shutdown, where it dosent fully shut down, but nothing is running except the fans(as far as i can tell). then i restarted it and tried it again, it ran for an hour and shut off. now im trying it again, i never run any other programs or anything else, i updates windows though to service pack 1 for windows 7, but that was after i reformatted, doubt it has anything to do with it.

media center isnt running im sure of it. any other suggestion?
TurboK
Full Member
***
Offline Offline

Activity: 136
Merit: 100



View Profile
May 01, 2011, 03:26:30 PM
 #42

Screensavers?

12zJNWtM2HknS2EPLkT9QPSuSq1576aKx7

Tradehill viral bullshit code: TH-R114411
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 01, 2011, 10:32:17 PM
 #43

Have you tried it with a single card? If your power supply is actually defective, perhaps it will be able to handle one card.

When you install the drivers, do you choose custom and only install the driver and APP? I have no reason to think the other crap will cause any instability, but I always deselect it anyway.

nope i selected express.
Screensavers?

not at all.

the thing about testing it out, is that sometiems it takes a couple of hours sometimes it takes minutes, its so confusing. i was thinking of setting it up without a moniter and see what happens
Mugabuga
Newbie
*
Offline Offline

Activity: 15
Merit: 0


View Profile
May 02, 2011, 01:31:15 AM
 #44

i got a 1250 watt power supply, planning on running three 6990s on it, they will not be overclocked however.

i already changed the sleep settings. anyone know how to hook up the third one using windows 7?

You can't run three 6990s, max of 4 GPUs in CF.
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 02, 2011, 01:56:48 AM
 #45

i got a 1250 watt power supply, planning on running three 6990s on it, they will not be overclocked however.

i already changed the sleep settings. anyone know how to hook up the third one using windows 7?

You can't run three 6990s, max of 4 GPUs in CF.

yeah im probably gonna sell it or return it. my third 6990 is useless right now.

but other than that, I am now running phoenix miner instead of guiminer. atfirst it shut down after 20  minutes twice. then i removed VECTORS and also removed WORKSPACE=128 and also removed BFI thingy and only left agression=10.
it has been running for two hours straight, other than the fact that deepbit crashed for 10 minutes, it still ran smoothly, but at 290 per gpu instead of 333.
I am hoping it will run nonstop finally, then i can adjust the script to see which one will let my computer last longer.
ill update tomorrow
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 03, 2011, 04:44:51 PM
 #46

UPDATE
so heres whats going on. I ran it without vectors, workspace=128, an bfi_int. (pheonix miner)
it never shut down on me. so what i did afterwards was to run it with all of them again, and guess what it still didnt shut down on me for a full 24 hours.
so i decided that it must have been fixed. i decided to just restart the computer just to see what would happen if i shut the computer down, and it started messing up again, this time @ 20 minutes it would just die on me. so here I am again removing the three flags and its running fine again. we will see what happens next time.

hope this can help ppl with 6990
TurboK
Full Member
***
Offline Offline

Activity: 136
Merit: 100



View Profile
May 03, 2011, 05:26:11 PM
 #47

I still think you have some specific application in the background that kicks in only periodically. Perhaps check your scheduled tasks, or leave task manager open and note down the running applications - since the screen freezes when the radeons hang, you can still read the taskmanager.

12zJNWtM2HknS2EPLkT9QPSuSq1576aKx7

Tradehill viral bullshit code: TH-R114411
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 03, 2011, 06:44:42 PM
 #48

I still think you have some specific application in the background that kicks in only periodically. Perhaps check your scheduled tasks, or leave task manager open and note down the running applications - since the screen freezes when the radeons hang, you can still read the taskmanager.

the screen loses connection. i checked sched tasks, there are none that i can find. if i am even looking correctly lol.
well ill try the task manager today, im really desperate to find out whats going on cus i wanna buy another rig.

anyway do u think ubuntu might react differently?'

my two 6990s are still running to this moment with the three flags off.(workspace vectors bfi_int)
TurboK
Full Member
***
Offline Offline

Activity: 136
Merit: 100



View Profile
May 03, 2011, 08:07:26 PM
 #49

Ubuntu might behave differently since it is a wholly different OS architecture.

See if you have turned off all power saving options. I know that "turning off monitor after x mins" will not crash my 6950, but perhaps there's something else there about sleep state and somesuch. If the monitor loses connection as you are using the machine, then that may be unrelated to the UVD bug, since when that happens, the monitor picture stays frozen and even a hard reset takes a few seconds to be acknowledged.

12zJNWtM2HknS2EPLkT9QPSuSq1576aKx7

Tradehill viral bullshit code: TH-R114411
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 03, 2011, 08:19:15 PM
 #50

Ubuntu might behave differently since it is a wholly different OS architecture.

See if you have turned off all power saving options. I know that "turning off monitor after x mins" will not crash my 6950, but perhaps there's something else there about sleep state and somesuch. If the monitor loses connection as you are using the machine, then that may be unrelated to the UVD bug, since when that happens, the monitor picture stays frozen and even a hard reset takes a few seconds to be acknowledged.

yeah everything is on high performance, no sleep, no moniter sleep, no hibernate no nuttin.
clonedone (OP)
Full Member
***
Offline Offline

Activity: 350
Merit: 100


View Profile
May 03, 2011, 08:20:44 PM
 #51

its very strange. now that i started mining without teh three flags (vectors, workspace, and bfiint)
then i add them afterwards
now it is NOT shutting down...
anyone know why?
Pages: 1 2 3 [All]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!