chromeguy
Newbie
Offline
Activity: 28
Merit: 0
|
|
January 09, 2012, 12:31:14 PM |
|
6990 is dual GPU so has total of 3072 shaders gets about 800 mhash/s using two cores total. 7970 is single GPU so has total of 2048 shaders get about 666 mhash/s using one core total.
i stand corrected, thought it was also dual. i feel the need for read
|
|
|
|
sadpandatech
|
|
January 09, 2012, 01:27:01 PM |
|
With those two changes to the default configuration of cgminer hashes start to get accepted, but the 290MH/s hashing performance with the default settings (-g 2 -v 2 -w 128) for this kernel were slower than the 310MH/s from the trusty OC'd HD5850 that this new card replaced, so I played around with the --gpu-threads, --vectors and --worksize settings and here's a small table with the results: --gpu-threads 1 --vectors 2 --worksize 32 : 141MH/s --gpu-threads 1 --vectors 2 --worksize 64 : 285MH/s --gpu-threads 1 --vectors 2 --worksize 128 : 283MH/s --gpu-threads 1 --vectors 2 --worksize 256 : 284MH/s
--gpu-threads 1 --vectors 4 --worksize 32 : 66MH/s --gpu-threads 1 --vectors 4 --worksize 64 : 133MH/s --gpu-threads 1 --vectors 4 --worksize 128 : 133MH/s --gpu-threads 1 --vectors 4 --worksize 256 : 133MH/s
Not that it might matter much at this point but with vectors 4, and I believe 2, to some extent. There is a need to adjust the memory clock in order to optimize it. I am not sure it would even help being CGN. But, if you get time, I'd check it out. Sadly, I've no clue where that thread is at this time. :/ **** UPDATE **** Someone suggested that I give a recent version of the DiabloMiner a try since it should have decent support for GCN, so I did. ~650MH/s with the default diablominer settings and the card OC'd @ 1125/975MHz: ~530MH/s at standard clocks: pretty freakin awesome, if you ask me. Now if they can just sell the things for <$400 I'd be happy. Do you have any TPD numbers with this card?
|
If you're not excited by the idea of being an early adopter 'now', then you should come back in three or four years and either tell us "Told you it'd never work!" or join what should, by then, be a much more stable and easier-to-use system. - GA
It is being worked on by smart people. -DamienBlack
|
|
|
poppyh
Newbie
Offline
Activity: 10
Merit: 0
|
|
January 09, 2012, 01:50:50 PM |
|
In the UK currently :
5870 costs 170 GBP and gets 440 mhash/s so about 2.6 mhash/GBP
7970 will cost rougly 430 GBP and get 666 mhash/s so about 1.6 mhash/GBP
Thus, the 5870 is still much better and you can also get a 5970 that gets 850 mhash/s for about 400 GBP.
Power figures ?
|
|
|
|
wndrbr3d
|
|
January 09, 2012, 01:54:08 PM |
|
Very interesting results! The only missing piece is the power draw from the wall. My only hesitations at this point are: 1) Price Point/Performance is still super high when compared to used 58xx series cards 2) Lack of optimization in Miners for any new features in GCN/SDK 2.6. Current Miners are heavily optimized for VLIW4/5, so obviously there's going to need to be some re-working for full GCN support. The only way I can see this card being a viable miner is that it needs to outperform 5970/6990 in performance per watt and $/mhash, otherwise it's just a good excuse to see more 58xx's hitting eBay since gamers will be upgrading... Thanks for the initial benchmarks though OP!
|
|
|
|
DiabloD3
Legendary
Offline
Activity: 1162
Merit: 1000
DiabloMiner author
|
|
January 09, 2012, 03:16:37 PM |
|
In the UK currently :
5870 costs 170 GBP and gets 440 mhash/s so about 2.6 mhash/GBP
7970 will cost rougly 430 GBP and get 666 mhash/s so about 1.6 mhash/GBP
Thus, the 5870 is still much better and you can also get a 5970 that gets 850 mhash/s for about 400 GBP.
Power figures ?
7970 is going to be 200 watts I believe, and the 5870 is 188 (both at stock watts). This is where the 7970 suddenly shines. Even if the 7970 is 250 watts, thats still a jump in efficiency.
|
|
|
|
DeathAndTaxes
Donator
Legendary
Offline
Activity: 1218
Merit: 1079
Gerald Davis
|
|
January 09, 2012, 03:20:35 PM |
|
In the UK currently :
5870 costs 170 GBP and gets 440 mhash/s so about 2.6 mhash/GBP
7970 will cost rougly 430 GBP and get 666 mhash/s so about 1.6 mhash/GBP
Thus, the 5870 is still much better and you can also get a 5970 that gets 850 mhash/s for about 400 GBP.
Power figures ?
7970 is going to be 200 watts I believe, and the 5870 is 188 (both at stock watts). This is where the 7970 suddenly shines. Even if the 7970 is 250 watts, thats still a jump in efficiency. You think it will be 200 watts w/ a 20% overclock? I wish the OP had a kill-a-watt. Hey OP do you have a kill-a-watt you could purchase locally. If you are in the states Home Depot and Lowes carry them. If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.
|
|
|
|
DiabloD3
Legendary
Offline
Activity: 1162
Merit: 1000
DiabloMiner author
|
|
January 09, 2012, 03:24:20 PM |
|
You think it will be 200 watts w/ a 20% overclock? I wish the OP had a kill-a-watt.
Thats at stock clocks obviously. I don't know what the mining values will be, all the cards draw less than their full wattage at stock speeds when mining (because large parts of the chip shut off). I imagine 79xx may even get a larger efficiency boost due to this because of AMD's work on power saving, but without a killawatt test, no one knows.
|
|
|
|
rjk
Sr. Member
Offline
Activity: 448
Merit: 250
1ngldh
|
|
January 09, 2012, 03:26:56 PM |
|
(because large parts of the chip shut off).
I know that the shaders are used to do the hashing, but is it possible to utilize more of the chip, even if it were at dramatically lower efficiency?
|
|
|
|
DiabloD3
Legendary
Offline
Activity: 1162
Merit: 1000
DiabloMiner author
|
|
January 09, 2012, 03:29:25 PM |
|
(because large parts of the chip shut off).
I know that the shaders are used to do the hashing, but is it possible to utilize more of the chip, even if it were at dramatically lower efficiency? No. I already tried to abuse the texture/memory fetch units, but couldn't figure out a useful way of doing it. Its all fixed function hardware and its not particularly interesting for what we do. Although, I may go try that again, SDK 2.6 seems to be a much better compiler in some areas.
|
|
|
|
stoppots
|
|
January 09, 2012, 04:20:23 PM |
|
man wat a beast
|
|
|
|
1onevvolf (OP)
Newbie
Offline
Activity: 43
Merit: 0
|
|
January 09, 2012, 05:55:18 PM |
|
Hey OP do you have a kill-a-watt you could purchase locally. If you are in the states Home Depot and Lowes carry them. If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.
The kill-a-watt brand doesn't appear to be commercialized here in europe, and I've been searching for an equivalent device locally each time I've had a chance to head out to a store for the past couple of days, but no luck so far. I also took a stab at modifying DiabloMiner and managed to get it to use 16component vectors, which is what GCN is supposed to be tuned for, but performance isn't what I expect and its really hard to profile/debug the tahiti since I could not find any development tools that specificly support it yet.
|
|
|
|
DiabloD3
Legendary
Offline
Activity: 1162
Merit: 1000
DiabloMiner author
|
|
January 09, 2012, 06:10:19 PM |
|
Hey OP do you have a kill-a-watt you could purchase locally. If you are in the states Home Depot and Lowes carry them. If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.
The kill-a-watt brand doesn't appear to be commercialized here in europe, and I've been searching for an equivalent device locally each time I've had a chance to head out to a store for the past couple of days, but no luck so far. I also took a stab at modifying DiabloMiner and managed to get it to use 16component vectors, which is what GCN is supposed to be tuned for, but performance isn't what I expect and its really hard to profile/debug the tahiti since I could not find any development tools that specificly support it yet. Wait wait wait. Are we sure uint16 is such a good idea? Last time I tried >4 (which was before 2.6, btw, I haven't tested with 2.6), it would crash in the compiler. Also, does anyone have a count on the number of registers per CU? There might not be enough registers to handle that. Also, check some of the larger -vs, -v 40 is two sets of uint4 and -v 44 does three uint4s (unlike cgminer, -v 4 does two uint2s).
|
|
|
|
luo demin
Newbie
Offline
Activity: 70
Merit: 0
|
|
January 09, 2012, 06:26:28 PM |
|
I can't wait till the 7990 that is going to be impressive but expensive I might have missed this but what is the heat like hashing overclocked ? and what fan speed
|
|
|
|
1onevvolf (OP)
Newbie
Offline
Activity: 43
Merit: 0
|
|
January 09, 2012, 07:02:03 PM |
|
Wait wait wait. Are we sure uint16 is such a good idea? Last time I tried >4 (which was before 2.6, btw, I haven't tested with 2.6), it would crash in the compiler. Also, does anyone have a count on the number of registers per CU? There might not be enough registers to handle that.
I'm not sure if it's a good idea or not so I wanted to measure it GCN has 64KB worth of registers per CU, and like you said I'm not sure if that's enough. The reason for my curiosity was because GCN's compute units each contain 4 x SIMD units with a width of 16 elements (same size as Larrabee & Intel's MIC, coincidentally), and I recall reading somewhere that each of these SIMD units can retire one 16-way instruction every 4 cycles, so those 16element vectors kind of rang out at me. I also wanted to get familiar with the OpenCL bitcoin mining code and thought it would be a neat exercise (which it was!). Nice code by the way. I can say for sure that 16element vectors DO compile with the drivers that came with the card. The -ds code dump for 16 element vectors came out nice and clean, although the last few lines where the result is stored in output seem a bit branchy. It looks something like this: if(XG2.s0 == 0x136032ED) { output[Xnonce.s0 & 0xF] = Xnonce.s0; } if(XG2.s1 == 0x136032ED) { output[Xnonce.s1 & 0xF] = Xnonce.s1; } if(XG2.s2 == 0x136032ED) { output[Xnonce.s2 & 0xF] = Xnonce.s2; } ... ... if(XG2.sd == 0x136032ED) { output[Xnonce.sd & 0xF] = Xnonce.sd; } if(XG2.se == 0x136032ED) { output[Xnonce.se & 0xF] = Xnonce.se; } if(XG2.sf == 0x136032ED) { output[Xnonce.sf & 0xF] = Xnonce.sf; }
I tried replacing it with a branch-less expression using shuffle() and vstore16() but haven't managed to get it working. What I've come up with looks something like this: x mask = Xnonce & 0xF; x temp = shuffle(select(Xnonce, 0, selection), mask); vstore16(temp, 0, output);
Anyhow I'm sure that my code modifications are doing all sorts of dumb things. I'm still learning how it all works so please ignore. Also, check some of the larger -vs, -v 40 is two sets of uint4 and -v 44 does three uint4s (unlike cgminer, -v 4 does two uint2s).
I've tried all of the different -v settings available (according to the source) but haven't been able to get any higher than the 666MH/s with the default settings and 3 compute threads.
|
|
|
|
DeathAndTaxes
Donator
Legendary
Offline
Activity: 1218
Merit: 1079
Gerald Davis
|
|
January 09, 2012, 07:07:42 PM |
|
Hey OP do you have a kill-a-watt you could purchase locally. If you are in the states Home Depot and Lowes carry them. If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.
The kill-a-watt brand doesn't appear to be commercialized here in europe, and I've been searching for an equivalent device locally each time I've had a chance to head out to a store for the past couple of days, but no luck so far. Well that sucks. A more universal albeit expensive tool is a clamp meter.
|
|
|
|
terrytibbs
|
|
January 09, 2012, 07:14:07 PM |
|
okay, i will fly to Singapore and pick one up if it all makes you happy....
i got a girl there:P
Is it Mrs. Zhou Tong?
|
|
|
|
1onevvolf (OP)
Newbie
Offline
Activity: 43
Merit: 0
|
|
January 09, 2012, 07:15:39 PM |
|
I can't wait till the 7990 that is going to be impressive but expensive I might have missed this but what is the heat like hashing overclocked ? and what fan speed Overclocked @ 1125/975MHz with automatic fan speed I'm getting temperatures hovering 81-83C, and the fan runs at 47-49% speed. You can see some screencaps on one of the earlier pages. But since I prefer lower temperatures and am worried about VRM and memory temps not yet being reported by GPU-Z, I usually run it at 60% fan speed and get temps around 72C. The blower fan at 60% speed is quite loud (its a reference design from Sapphire). At 100% fan speed, the overclocked card gets below 60C while mining but you can hear it from outside of the house at this point , so as lovely as these temps are this is not an option for me as it is also my gaming and work PC.
|
|
|
|
The King
Newbie
Offline
Activity: 13
Merit: 0
|
|
January 09, 2012, 07:17:14 PM |
|
I can't wait till the 7990 that is going to be impressive but expensive I might have missed this but what is the heat like hashing overclocked ? and what fan speed Overclocked @ 1125/975MHz with automatic fan speed I'm getting temperatures hovering 81-83C, and the fan runs at 47-49% speed. You can see some screencaps on one of the earlier pages. But since I prefer lower temperatures and am worried about VRM and memory temps not yet being reported by GPU-Z, I usually run it at 60% fan speed and get temps around 72C. The blower fan at 60% speed is quite loud (its a reference design from Sapphire). At 100% fan speed, the overclocked card gets below 60C while mining but you can hear it from outside of the house at this point , so as lovely as these temps are this is not an option for me as it is also my gaming and work PC. Yeah, so they still have not fixed that damn reference fan design. Aftermarket coolers FTW ! Damn ATI and their crap loud fan designs
|
|
|
|
DiabloD3
Legendary
Offline
Activity: 1162
Merit: 1000
DiabloMiner author
|
|
January 09, 2012, 07:51:59 PM |
|
Hey OP do you have a kill-a-watt you could purchase locally. If you are in the states Home Depot and Lowes carry them. If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.
The kill-a-watt brand doesn't appear to be commercialized here in europe, and I've been searching for an equivalent device locally each time I've had a chance to head out to a store for the past couple of days, but no luck so far. I also took a stab at modifying DiabloMiner and managed to get it to use 16component vectors, which is what GCN is supposed to be tuned for, but performance isn't what I expect and its really hard to profile/debug the tahiti since I could not find any development tools that specificly support it yet. BTW, they do make 240v/50hz euro Killawatts, but you might have to order it from the US. They also make 240v/60hz (double hot, like ovens and water heaters) ones and 208v ones for DC shit. Might have to look around, I love mine, its been essential for planning stuff out.
|
|
|
|
stick_wood
Newbie
Offline
Activity: 8
Merit: 0
|
|
January 09, 2012, 08:11:16 PM |
|
Really nice cards and performance but the price really sucks !
5XXX is much more cost effective ATM. That may change in the future.
Maybe wait for FPGA ?
|
|
|
|
|