MrTeal (OP)
Legendary
Offline
Activity: 1274
Merit: 1004
|
|
June 14, 2014, 04:49:10 PM |
|
WOW... before you blow yours up... just send it to me... 4 pin... then not using all 4 PCIe plugs... and running it without a cooler? You're gonna break this thing if you don't just wait for your gear... that it has not died already is a testament to Dr Teal + team's excellent work.
Tell me about it. But in my defense, there are absolutely no instructions available for this unit on peppermintmining.com. I had to figure everything out by other means and am surely lacking information. Edit: I was told that I wouldn't be able to blow it up without a fan since it has thermal protection. Currently running with a temporary air heat sink at 75MHz with a temperature of 99C. Is that bad too?
Is the 75MHz a typo? I've run air cooling on a $30 Hyper 212 up to 450MHz* and 99C is fine, but do not rely on the thermal protection to save a unit. It is very effective and in my experience has prevented any permanent damage, but you should treat it like an airbag in your car. It's there in case something bad happens; you shouldn't leave your driveway planning to use it a few times on the way to work.
* https://bitcointalk.org/index.php?topic=519943.msg6714539#msg6714539
|
|
|
|
daddyfatsax
|
|
June 14, 2014, 06:14:23 PM |
|
MrTeal, did you ever try using the Liquid Metal on the dies?
|
|
|
|
Batshark
Newbie
Offline
Activity: 33
Merit: 0
|
|
June 15, 2014, 04:40:51 AM |
|
this might be a cgminer config issue but can anyone help me with this error? "HFA : OP_USB_INIT failed! Operation status 20 (regulator programming error)" edit: using cgminer 4.3.4
Now I get this one: "OP_USB_INIT: Tossing packet, valid but unexpected type 144" Can someone point me in a direction? I'm lost.
I had a similar error, but I can't remember if it was exactly the same. My issue was a missing WinUSB driver. I installed the missing driver using zadig (http://zadig.akeo.ie/), and haven't had any issues since.
I'm okay now, it wasn't clear to me that power was required on all four plugs (at least it is in my case). USB errors are for the most part gone. My cooler has yet to arrive and I'm reaching thermal overload rather quickly even with "--hfa-hash-clock 100". I'm assuming this is normal behavior without a cooler on board. I'm going to see how much hashing I can achieve with my cpu air cooler resting on the die with the fan on it.
My face when I read this: https://i.imgur.com/kf5h3LY.jpg
No, don't do that, and don't just 'sit' the cooler on top; you need to attach it properly and securely with quality thermal paste and appropriate pressure on all four dies. But you seriously need a liquid cooling system to run these properly. I think you should get an off the shelf cooling system; corsair and coolermaster make factory sealed liquid cooling options (the coolermaster nepton 280 fits perfectly if you plan on cooling the bottom of the board with the radiator exhaust, which I recommend).
I'm giving you the benefit of the doubt that you aren't trolling us, but if you still have power going to your habanero, please turn it off. You can contact me and I can walk you through the steps if you really need me to, but please don't run it without all of the proper equipment and installation. It would be a shame to blow it up before you even got it working properly. There is plenty of information on how to install these on the forums; you can clearly see every habanero picture posted has some kind of liquid cooling attached.
If you need any questions answered before you power it on again, please ask away, but for everyone's sake please turn it off until you are completely sure what you need to do.
|
|
|
|
QuiveringGibbage
|
|
June 15, 2014, 06:15:17 AM |
|
ok, give me a couple of days to get it delivered. did you like that answer? ... So I'll have dual Habaneros running off the back of the mdf frame below.
this is what i mean, more pics here: http://imgur.com/a/SMBJP#10 and the other side.. still WIP. OCing is taking a little longer than planned. or performance is already maxed, i may not have lucky chips
Cheers, QG
|
Bitcoin is at the tippity top of the mountain...but it's really only half way up..
|
|
|
SVK
|
|
June 15, 2014, 10:29:27 AM |
|
ok, give me a couple of days to get it delivered. did you like that answer? Cheers, QG
Yep. Hopefully you didn't take it the wrong way. They are the same price, XSPC = Alphacool, so I thought that you might as well test it out. Thank you very much
|
|
|
|
QuiveringGibbage
|
|
June 15, 2014, 11:14:02 AM |
|
Yep. Hopefully you didn't take it the wrong way. They are the same price, XSPC = Alphacool, so I thought that you might as well test it out. Thank you very much
nope, sorry, changed my mind last min. It occurred to me that having a different water block will affect the water flow. This is my first ever water cooling rig. I really have nfi wtf I am doing. I better keep it simple and go with the same water head. Looks better this way anyhows.. and i doubt the difference will make much difference in hash rate. I've already spent enough funds on the damn thing. having the same water head gives a better reference on how i'm doing with trying to OC. Having two different heads would complicate things. Shoulda thunk before i answered
QG
|
Bitcoin is at the tippity top of the mountain...but it's really only half way up..
|
|
|
SVK
|
|
June 15, 2014, 05:36:16 PM |
|
Yep. Hopefully you didn't take it the wrong way. They are the same price, XSPC = Alphacool, so I thought that you might as well test it out. Thank you very much
nope, sorry, changed my mind last min. It occurred to me that having a different water block will affect the water flow. This is my first ever water cooling rig. I really have nfi wtf I am doing. I better keep it simple and go with the same water head. Looks better this way anyhows.. and i doubt the difference will make much difference in hash rate. I've already spent enough funds on the damn thing. having the same water head gives a better reference on how i'm doing with trying to OC. Having two different heads would complicate things. Shoulda thunk before i answered QG
That's fine. I'm running two different water heads and the difference is 2C; however, they are both cheap china stuff. I was fortunate and had all the watercooling stuff from my old projects so didn't need to invest that much. Your last picture is showing temps. Which one is XSPC and which one is H100 ? Many thanks
|
|
|
|
MrTeal (OP)
Legendary
Offline
Activity: 1274
Merit: 1004
|
|
June 15, 2014, 09:31:48 PM |
|
For higher speeds, you will probably want some airflow over the bottom of the boards to help cool off the power supplies. The heatsink/backplate helps, but there's only so much it can do without airflow. My little ghetto datacenter is coming together nicely. Now it's just a matter of getting the power and networking wired in, and sealing everything up so that all the air that is going over the mining gear is coming from the window. According to the spec on the fans they're supposed to be a little more than 6000CFM, so the two of them should bring in enough fresh air to keep a whole bunch of Habaneros cool.
|
|
|
|
QuiveringGibbage
|
|
June 15, 2014, 10:07:48 PM |
|
Your last picture is showing temps. Which one is XSPC and which one is H100 ?
Readings at the time of posting.
0: HFB JOLOKIA : 725MHz 83C 100% 0.86V | 661.5G / 551.2Gh/s WU:7700.2/m A:4392305 R:4814 HW:1084
1: HFB FRUTESCE: 725MHz 71C 100% 0.87V | 471.7G / 547.7Gh/s WU:7652.2/m A:4367387 R:8278 HW:1210
JOLOKIA @ 83C is the Corsair H100i, with fan mods (2 x 1amp, ~150 CFM server fans)
FRUTESCENS @ 71C is the XSPC Raystorm Kit. (with only 2 of the 3 server fans going atm)
The server fans are Sunon PMD1212PTB1-A(2).F.GN
Ambient temp outside the room is 10C atm and the windows are wide open. I need a new battery for the thermometer in the room. I'd say it's high 20s C if not 30C in the room.
Still slowly increasing the clocks and voltages. Was too rushed before and had odd results. Taking my time now.. I did reapply the thermal paste and re-attached the water heads to the Habanero boards. Swapped them over the 2nd time to see if there's much difference in the boards. No noticeable difference in performance after I switched the water heads on the boards. I assume both boards are performing relatively the same on stock settings.
Next up is making a fan attachment to the frame so I can attach fans to blow over the boards. Should have this done when the 2nd water head arrives in the next few days.
Cheers, QG
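As a rough cross-check of status lines like the two above, the WU figure can be turned back into a hashrate. This is only a sketch: it assumes WU is cgminer's work utility (difficulty-1 shares per minute) and that one difficulty-1 share stands for about 2^32 hashes on average. The function name `wu_to_ghs` is made up for illustration.

```python
# Sketch: convert a cgminer-style WU figure into an approximate hashrate.
# Assumption: WU counts difficulty-1 shares per minute, and each
# difficulty-1 share corresponds to about 2**32 hashes on average.

HASHES_PER_DIFF1_SHARE = 2 ** 32


def wu_to_ghs(wu_per_minute):
    """Return the approximate hashrate in GH/s implied by a WU value."""
    hashes_per_second = wu_per_minute * HASHES_PER_DIFF1_SHARE / 60.0
    return hashes_per_second / 1e9


if __name__ == "__main__":
    # WU values copied from the two status lines in the post.
    for name, wu in (("JOLOKIA", 7700.2), ("FRUTESCENS", 7652.2)):
        print(f"{name}: {wu_to_ghs(wu):.1f} GH/s")
```

Under those assumptions the results land within about 0.1 GH/s of the 551.2Gh/s and 547.7Gh/s figures in the status lines, so WU looks like a reasonable number to watch while changing clocks and voltages.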
|
Bitcoin is at the tippity top of the mountain...but it's really only half way up..
|
|
|
MrTeal (OP)
Legendary
Offline
Activity: 1274
Merit: 1004
|
|
June 16, 2014, 12:59:54 AM |
|
PSU for the boards, I'll be using one of these per 3 boards. 2400W, 92% efficient, over two feet long, $50. I love eBay.
|
|
|
|
ZiG
|
|
June 16, 2014, 01:33:52 AM |
|
PSU for the boards, I'll be using one of these per 3 boards. 2400W, 92% efficient, over two feet long, $50. I love eBay.
Brand...?...Model...? ...Link...?
Thanks, ZiG
|
|
|
|
MrTeal (OP)
Legendary
Offline
Activity: 1274
Merit: 1004
|
|
June 16, 2014, 02:04:21 AM |
|
|
|
|
|
ZiG
|
|
June 16, 2014, 02:58:35 AM |
|
|
|
|
|
GenTarkin
Legendary
Offline
Activity: 2450
Merit: 1002
|
|
June 16, 2014, 05:13:20 AM |
|
So, is it possible to see the status of the 96 cores per die? Reason I ask is, when I got these, before I tweaked voltages / cooler mounting etc... I remember the MHav over a 10hr period was right around 650GH per hab @ 850MHz. Which, 850 * 96 * 2 * 4 = 652GH ... add the HW% and the 650GH sounds about right.
After tweaking the waterblock heads, voltages and messing w/ clocks ... I have the cooler hab running at 875MHz, 1mV higher than stock, it shows no dropouts (due to not enough voltage) in the pepper app .. after a few hours of mining, it's only showing 645GHav ... it should be closer to 660-670 .. HW% is largely unchanged. So, I'm confused as hell, if I put it back to 850MHz, it's right around 630GHav now =(
My other hab, I'm not able to get it stable at 875MHz, so it's at 850MHz ... it's showing the same drop, down to 630GH now =/
So, I'm wondering, is it possible for individual cores or an engine in a core to just die / not work and go unnoticed? Hence why I'm wondering if it's possible to see the status?
I have no idea whats causing this random drop.
My temperatures are all well within range too.... help!
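The arithmetic in the post above (clock * 96 * 2 * 4) can be written out as a small sketch to see how far the observed numbers fall short. The 96 cores per die and 4 dies come from the thread; the factor of 2 is taken from the post as written and its meaning is not stated there, so it is treated as a plain multiplier. The function names are hypothetical.

```python
# Sketch of the back-of-envelope hashrate estimate used in the post.
# Assumptions: 96 cores per die and 4 dies per board (from the thread);
# the factor of 2 is copied from the post without knowing what it models.

def theoretical_ghs(clock_mhz, cores_per_die=96, dies=4, factor=2):
    """Return the estimated board hashrate in GH/s at a given clock."""
    return clock_mhz * cores_per_die * factor * dies / 1000.0


def shortfall_percent(observed_ghs, clock_mhz):
    """Return how far an observed hashrate sits below the estimate, in %."""
    expected = theoretical_ghs(clock_mhz)
    return 100.0 * (expected - observed_ghs) / expected


if __name__ == "__main__":
    for clock, observed in ((850, 630.0), (875, 645.0)):
        print(f"{clock}MHz: expected {theoretical_ghs(clock):.1f} GH/s, "
              f"observed {observed:.0f} GH/s, "
              f"shortfall {shortfall_percent(observed, clock):.1f}%")
```

With these inputs the estimate is 652.8 GH/s at 850MHz and 672.0 GH/s at 875MHz, which is consistent with the roughly 3% or more drop described in the post.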
|
|
|
|
MrTeal (OP)
Legendary
Offline
Activity: 1274
Merit: 1004
|
|
June 16, 2014, 05:32:53 AM |
|
So, is it possible to see the status of the 96 cores per die? Reason I ask is, when I got these, before I tweaked voltages / cooler mounting etc... I remember the MHav over a 10hr period was right around 650GH per hab @ 850MHz. Which, 850 * 96 * 2 * 4 = 652GH ... add the HW% and the 650GH sounds about right.
After tweaking the waterblock heads, voltages and messing w/ clocks ... I have the cooler hab running at 875MHz, 1mV higher than stock, it shows no dropouts (due to not enough voltage) in the pepper app .. after a few hours of mining, it's only showing 645GHav ... it should be closer to 660-670 .. HW% is largely unchanged. So, I'm confused as hell, if I put it back to 850MHz, it's right around 630GHav now =(
My other hab, I'm not able to get it stable at 875MHz, so it's at 850MHz ... it's showing the same drop, down to 630GH now =/
So, I'm wondering, is it possible for individual cores or an engine in a core to just die / not work and go unnoticed? Hence why I'm wondering if it's possible to see the status?
I have no idea whats causing this random drop.
My temperatures are all well within range too.... help!
I have noticed this sometimes happens as well. It's a peculiarity of the chip, and a little more voltage usually cures it.
Remember there are two kinds of hardware errors. One is where a share is submitted that shouldn't be; cgminer will see this and flag it as a HW error. The other is a dark HW error, where a specific bit of work should give a valid share but doesn't. This one won't get reported as a HW error, since cgminer doesn't know it's happened. There seems to be a small zone between the point where work stops noticeably enough to affect the temperatures you can see in the PepperApp and normal operation, in which (probably) valid nonces don't get reported.
As a general rule, we shipped the boards with the voltage set high enough to get the 650GH/s we advertised, and most will hit 875MHz at that voltage as well. It's a compromise between power consumption and overclocking ability. If your temperatures are still good, I'd try giving it another 10mV and see if the hashrate moves to what you expect.
|
|
|
|
faxfan2002
Member
Offline
Activity: 83
Merit: 10
|
|
June 16, 2014, 10:03:56 AM |
|
ok, give me a couple of days to get it delivered. did you like that answer? ... So I'll have dual Habaneros running off the back of the mdf frame below.
this is what i mean, more pics here: http://imgur.com/a/SMBJP#10 and the other side.. still WIP. OCing is taking a little longer than planned. or performance is already maxed, i may not have lucky chips Cheers, QG
Are those push or pull fans? I would expect them to be the other way round, pushing the hot air away from the boards?
|
|
|
|
firejuan
|
|
June 16, 2014, 10:39:37 AM |
|
You may already know this but you can chain those PSUs together and load balance them. Also sidehack is working on a larger PSU like Mr. Teal's behemoth big ass PSU.
Look at the server PSUs from sidehack. Just bought 2 for $120 shipped. They are 750W PSUs and 91% efficient or more.
|
|
|
|
SVK
|
|
June 16, 2014, 12:05:07 PM |
|
Your last picture is showing temps. Which one is XSPC and which one is H100 ?
Readings at the time of posting.
0: HFB JOLOKIA : 725MHz 83C 100% 0.86V | 661.5G / 551.2Gh/s WU:7700.2/m A:4392305 R:4814 HW:1084
1: HFB FRUTESCE: 725MHz 71C 100% 0.87V | 471.7G / 547.7Gh/s WU:7652.2/m A:4367387 R:8278 HW:1210
JOLOKIA @ 83C is the Corsair H100i, with fan mods (2 x 1amp, ~150 CFM server fans)
FRUTESCENS @ 71C is the XSPC Raystorm Kit. (with only 2 of the 3 server fans going atm)
The server fans are Sunon PMD1212PTB1-A(2).F.GN
Ambient temp outside the room is 10C atm and the windows are wide open. I need a new battery for the thermometer in the room. I'd say it's high 20s C if not 30C in the room.
Still slowly increasing the clocks and voltages. Was too rushed before and had odd results. Taking my time now.. I did reapply the thermal paste and re-attached the water heads to the Habanero boards. Swapped them over the 2nd time to see if there's much difference in the boards. No noticeable difference in performance after I switched the water heads on the boards. I assume both boards are performing relatively the same on stock settings.
Next up is making a fan attachment to the frame so I can attach fans to blow over the boards. Should have this done when the 2nd water head arrives in the next few days.
Cheers, QG
Many thanks. Are you running both boards from that AX1200 PSU ? If yes, that might be your problem. I'm using 2 PSUs, 850W + 750W, and power draw from the wall is around 1450W at 775MHz.
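A quick way to see why a single 1200W unit could be marginal for two boards is to convert the wall draw quoted above into an approximate DC load. This is a sketch only: the 90% efficiency is an assumed round number rather than a measured value for any PSU in this thread, and the function names are hypothetical.

```python
# Sketch: estimate DC load from wall draw and compare it with PSU ratings.
# Assumption: 0.90 efficiency is a placeholder, not a measured figure.

def dc_load_watts(wall_watts, efficiency):
    """Return the approximate DC power delivered for a given wall draw."""
    return wall_watts * efficiency


def headroom_watts(rated_watts, wall_watts, efficiency):
    """Return rated output minus estimated DC load (negative = overloaded)."""
    return rated_watts - dc_load_watts(wall_watts, efficiency)


if __name__ == "__main__":
    wall = 1450.0          # wall draw for two boards at 775MHz, from the post
    assumed_eff = 0.90     # assumed efficiency, see note above
    for label, rated in (("single 1200W PSU", 1200.0),
                         ("850W + 750W pair", 1600.0)):
        spare = headroom_watts(rated, wall, assumed_eff)
        print(f"{label}: {spare:+.0f} W headroom")
```

Under that assumption the two-PSU setup has a few hundred watts to spare, while a single 1200W unit would be asked for more than its rating at the same clocks.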
|
|
|
|
joeltang
Newbie
Offline
Activity: 35
Merit: 0
|
|
June 16, 2014, 02:06:33 PM Last edit: June 16, 2014, 09:09:49 PM by joeltang |
|
this might be a cgminer config issue but can anyone help me with this error? "HFA : OP_USB_INIT failed! Operation status 20 (regulator programming error)" edit: using cgminer 4.3.4
Now I get this one: "OP_USB_INIT: Tossing packet, valid but unexpected type 144" Can someone point me in a direction? I'm lost.
I had a similar error, but I can't remember if it was exactly the same. My issue was a missing WinUSB driver. I installed the missing driver using zadig (http://zadig.akeo.ie/), and haven't had any issues since.
I'm okay now, it wasn't clear to me that power was required on all four plugs (at least it is in my case). USB errors are for the most part gone. My cooler has yet to arrive and I'm reaching thermal overload rather quickly even with "--hfa-hash-clock 100". I'm assuming this is normal behavior without a cooler on board. I'm going to see how much hashing I can achieve with my cpu air cooler resting on the die with the fan on it.
My face when I read this: https://i.imgur.com/kf5h3LY.jpg
No, don't do that, and don't just 'sit' the cooler on top; you need to attach it properly and securely with quality thermal paste and appropriate pressure on all four dies. But you seriously need a liquid cooling system to run these properly. I think you should get an off the shelf cooling system; corsair and coolermaster make factory sealed liquid cooling options (the coolermaster nepton 280 fits perfectly if you plan on cooling the bottom of the board with the radiator exhaust, which I recommend).
I'm giving you the benefit of the doubt that you aren't trolling us, but if you still have power going to your habanero, please turn it off. You can contact me and I can walk you through the steps if you really need me to, but please don't run it without all of the proper equipment and installation. It would be a shame to blow it up before you even got it working properly. There is plenty of information on how to install these on the forums; you can clearly see every habanero picture posted has some kind of liquid cooling attached.
If you need any questions answered before you power it on again, please ask away, but for everyone's sake please turn it off until you are completely sure what you need to do.
Not trolling, thanks for your offer to help but I think those few items were the only pieces of information I was lacking. I agree that this thread does have lots of disjointed information I could have gleaned if I had been aware of its existence. I've been running this setup for the whole weekend never exceeding 96C, but I highly look forward to my Nepton 280 arriving early this week. I work in electronics and have tons of hardware experience like many of you, so I do understand the basics of heat dissipation.
I am not upset, but do not expect me to have sympathy for a business that sells a product without providing instructions or even informing customers of a thread that requires reading to avoid destroying the product. That kind of info should be included somehow... you would think. Just a suggestion.
|
|
|
|
GenTarkin
Legendary
Offline
Activity: 2450
Merit: 1002
|
|
June 16, 2014, 02:47:06 PM |
|
So, is it possible to see the status of the 96 cores per die? Reason I ask is, when I got these, before I tweaked voltages / cooler mounting etc... I remember the MHav over a 10hr period was right around 650GH per hab @ 850MHz. Which, 850 * 96 * 2 * 4 = 652GH ... add the HW% and the 650GH sounds about right.
After tweaking the waterblock heads, voltages and messing w/ clocks ... I have the cooler hab running at 875MHz, 1mV higher than stock, it shows no dropouts (due to not enough voltage) in the pepper app .. after a few hours of mining, it's only showing 645GHav ... it should be closer to 660-670 .. HW% is largely unchanged. So, I'm confused as hell, if I put it back to 850MHz, it's right around 630GHav now =(
My other hab, I'm not able to get it stable at 875MHz, so it's at 850MHz ... it's showing the same drop, down to 630GH now =/
So, I'm wondering, is it possible for individual cores or an engine in a core to just die / not work and go unnoticed? Hence why I'm wondering if it's possible to see the status?
I have no idea whats causing this random drop.
My temperatures are all well within range too.... help!
I have noticed this sometimes happens as well. It's a peculiarity of the chip, and a little more voltage usually cures it. Remember there are two kinds of hardware errors. One is where a share is submitted that shouldn't be; cgminer will see this and flag it as a HW error. The other is a dark HW error, where a specific bit of work should give a valid share but doesn't. This one won't get reported as a HW error, since cgminer doesn't know it's happened. There seems to be a small zone between the point where work stops noticeably enough to affect the temperatures you can see in the PepperApp and normal operation, in which (probably) valid nonces don't get reported. As a general rule, we shipped the boards with the voltage set high enough to get the 650GH/s we advertised, and most will hit 875MHz at that voltage as well. It's a compromise between power consumption and overclocking ability. If your temperatures are still good, I'd try giving it another 10mV and see if the hashrate moves to what you expect.
Yeah, this sucks, I guess they are damaged now somehow, no idea. I've upped the voltage plenty on the 875MHz one and temps are still under 100C, but the hashrate is still stuck at 645-650GH. Even tried the next mV setting up for the one at 850MHz and it has no effect, just runs hotter ... ugh, its hashrate is still at 630-635GH. So, somewhere I'm losing 3% on each one =*(
What do the 3 buttons do? More specifically the reconfig one, does that restore firmware defaults or allow firmware installation?
Also, like in my original post, any way to see the status of each of the 96 cores?
Thanks
|
|
|
|
|