Bitcoin Forum
April 27, 2024, 02:49:32 AM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 [4] 5 6 7 8 9 10 11 »  All
  Print  
Author Topic: My initial Radeon HD 7970 mining benchmarks  (Read 46739 times)
chromeguy
Newbie
*
Offline Offline

Activity: 28
Merit: 0



View Profile WWW
January 09, 2012, 12:31:14 PM
 #61

6990 is dual GPU so has total of 3072 shaders gets about 800 mhash/s using two cores total.
7970 is single GPU so has total of 2048 shaders get about 666 mhash/s using one core total.
i stand corrected, thought it was also dual.
i feel the need for read
"Bitcoin: mining our own business since 2009" -- Pieter Wuille
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
sadpandatech
Hero Member
*****
Offline Offline

Activity: 504
Merit: 500



View Profile
January 09, 2012, 01:27:01 PM
 #62

With those two changes to the default configuration of cgminer hashes start to get accepted, but the 290MH/s hashing performance with the default settings (-g 2 -v 2 -w 128) for this kernel were slower than the 310MH/s from the trusty OC'd HD5850 that this new card replaced, so I played around with the --gpu-threads, --vectors and --worksize settings and here's a small table with the results:
--gpu-threads 1 --vectors 2 --worksize  32 : 141MH/s
--gpu-threads 1 --vectors 2 --worksize  64 : 285MH/s
--gpu-threads 1 --vectors 2 --worksize 128 : 283MH/s
--gpu-threads 1 --vectors 2 --worksize 256 : 284MH/s

--gpu-threads 1 --vectors 4 --worksize  32 :  66MH/s
--gpu-threads 1 --vectors 4 --worksize  64 : 133MH/s
--gpu-threads 1 --vectors 4 --worksize 128 : 133MH/s
--gpu-threads 1 --vectors 4 --worksize 256 : 133MH/s
Not that it might matter much at this point but with vectors 4, and I believe 2, to some extent. There is a need to adjust the memory clock in order to optimize it. I am not sure it would even help being CGN. But, if you get time, I'd check it out. Sadly, I've no clue where that thread is at this time. :/

**** UPDATE ****

Someone suggested that I give a recent version of the DiabloMiner a try since it should have decent support for GCN, so I did.

~650MH/s with the default diablominer settings and the card OC'd @ 1125/975MHz:



~530MH/s at standard clocks:



pretty freakin awesome, if you ask me. Now if they can just sell the things for <$400 I'd be happy.  Do you have any TPD numbers with this card?

If you're not excited by the idea of being an early adopter 'now', then you should come back in three or four years and either tell us "Told you it'd never work!" or join what should, by then, be a much more stable and easier-to-use system.
- GA

It is being worked on by smart people.  -DamienBlack
poppyh
Newbie
*
Offline Offline

Activity: 10
Merit: 0


View Profile
January 09, 2012, 01:50:50 PM
 #63

In the UK currently :

5870 costs 170 GBP and gets 440 mhash/s so about 2.6 mhash/GBP

7970 will cost rougly 430 GBP and get 666 mhash/s so about 1.6 mhash/GBP

Thus, the 5870 is still much better and you can also get a 5970 that gets 850 mhash/s for about 400 GBP.

Power figures ?
wndrbr3d
Hero Member
*****
Offline Offline

Activity: 914
Merit: 500


View Profile
January 09, 2012, 01:54:08 PM
 #64

Very interesting results! The only missing piece is the power draw from the wall.

My only hesitations at this point are:

1) Price Point/Performance is still super high when compared to used 58xx series cards

2) Lack of optimization in Miners for any new features in GCN/SDK 2.6. Current Miners are heavily optimized for VLIW4/5, so obviously there's going to need to be some re-working for full GCN support.

The only way I can see this card being a viable miner is that it needs to outperform 5970/6990 in performance per watt and $/mhash, otherwise it's just a good excuse to see more 58xx's hitting eBay since gamers will be upgrading...

Thanks for the initial benchmarks though OP! Smiley
DiabloD3
Legendary
*
Offline Offline

Activity: 1162
Merit: 1000


DiabloMiner author


View Profile WWW
January 09, 2012, 03:16:37 PM
 #65

In the UK currently :

5870 costs 170 GBP and gets 440 mhash/s so about 2.6 mhash/GBP

7970 will cost rougly 430 GBP and get 666 mhash/s so about 1.6 mhash/GBP

Thus, the 5870 is still much better and you can also get a 5970 that gets 850 mhash/s for about 400 GBP.

Power figures ?

7970 is going to be 200 watts I believe, and the 5870 is 188 (both at stock watts). This is where the 7970 suddenly shines. Even if the 7970 is 250 watts, thats still a jump in efficiency.

DeathAndTaxes
Donator
Legendary
*
Offline Offline

Activity: 1218
Merit: 1079


Gerald Davis


View Profile
January 09, 2012, 03:20:35 PM
 #66

In the UK currently :

5870 costs 170 GBP and gets 440 mhash/s so about 2.6 mhash/GBP

7970 will cost rougly 430 GBP and get 666 mhash/s so about 1.6 mhash/GBP

Thus, the 5870 is still much better and you can also get a 5970 that gets 850 mhash/s for about 400 GBP.

Power figures ?

7970 is going to be 200 watts I believe, and the 5870 is 188 (both at stock watts). This is where the 7970 suddenly shines. Even if the 7970 is 250 watts, thats still a jump in efficiency.

You think it will be 200 watts w/ a 20% overclock?  I wish the OP had a kill-a-watt.

Hey OP do you have a kill-a-watt you could purchase locally.  If you are in the states Home Depot and Lowes carry them.  If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.
DiabloD3
Legendary
*
Offline Offline

Activity: 1162
Merit: 1000


DiabloMiner author


View Profile WWW
January 09, 2012, 03:24:20 PM
 #67

You think it will be 200 watts w/ a 20% overclock?  I wish the OP had a kill-a-watt.

Thats at stock clocks obviously. I don't know what the mining values will be, all the cards draw less than their full wattage at stock speeds when mining (because large parts of the chip shut off). I imagine 79xx may even get a larger efficiency boost due to this because of AMD's work on power saving, but without a killawatt test, no one knows.

rjk
Sr. Member
****
Offline Offline

Activity: 448
Merit: 250


1ngldh


View Profile
January 09, 2012, 03:26:56 PM
 #68

(because large parts of the chip shut off).
I know that the shaders are used to do the hashing, but is it possible to utilize more of the chip, even if it were at dramatically lower efficiency?

Mining Rig Extraordinaire - the Trenton BPX6806 18-slot PCIe backplane [PICS] Dead project is dead, all hail the coming of the mighty ASIC!
DiabloD3
Legendary
*
Offline Offline

Activity: 1162
Merit: 1000


DiabloMiner author


View Profile WWW
January 09, 2012, 03:29:25 PM
 #69

(because large parts of the chip shut off).
I know that the shaders are used to do the hashing, but is it possible to utilize more of the chip, even if it were at dramatically lower efficiency?

No. I already tried to abuse the texture/memory fetch units, but couldn't figure out a useful way of doing it. Its all fixed function hardware and its not particularly interesting for what we do. Although, I may go try that again, SDK 2.6 seems to be a much better compiler in some areas.

stoppots
Sr. Member
****
Offline Offline

Activity: 271
Merit: 250


View Profile
January 09, 2012, 04:20:23 PM
 #70

man wat a beast
1onevvolf (OP)
Newbie
*
Offline Offline

Activity: 43
Merit: 0


View Profile
January 09, 2012, 05:55:18 PM
 #71

Hey OP do you have a kill-a-watt you could purchase locally.  If you are in the states Home Depot and Lowes carry them.  If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.

The kill-a-watt brand doesn't appear to be commercialized here in europe, and I've been searching for an equivalent device locally each time I've had a chance to head out to a store for the past couple of days, but no luck so far.

I also took a stab at modifying DiabloMiner and managed to get it to use 16component vectors, which is what GCN is supposed to be tuned for, but performance isn't what I expect and its really hard to profile/debug the tahiti since I could not find any development tools that specificly support it yet.
DiabloD3
Legendary
*
Offline Offline

Activity: 1162
Merit: 1000


DiabloMiner author


View Profile WWW
January 09, 2012, 06:10:19 PM
 #72

Hey OP do you have a kill-a-watt you could purchase locally.  If you are in the states Home Depot and Lowes carry them.  If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.

The kill-a-watt brand doesn't appear to be commercialized here in europe, and I've been searching for an equivalent device locally each time I've had a chance to head out to a store for the past couple of days, but no luck so far.

I also took a stab at modifying DiabloMiner and managed to get it to use 16component vectors, which is what GCN is supposed to be tuned for, but performance isn't what I expect and its really hard to profile/debug the tahiti since I could not find any development tools that specificly support it yet.

Wait wait wait. Are we sure uint16 is such a good idea? Last time I tried >4 (which was before 2.6, btw, I haven't tested with 2.6), it would crash in the compiler. Also, does anyone have a count on the number of registers per CU? There might not be enough registers to handle that.

Also, check some of the larger -vs, -v 40 is two sets of uint4 and -v 44 does three uint4s (unlike cgminer, -v 4 does two uint2s).

luo demin
Newbie
*
Offline Offline

Activity: 70
Merit: 0


View Profile
January 09, 2012, 06:26:28 PM
 #73

I can't wait till the 7990 that is going to be impressive but expensive  Sad I might have missed this but what is the heat like hashing overclocked ? and what fan speed
1onevvolf (OP)
Newbie
*
Offline Offline

Activity: 43
Merit: 0


View Profile
January 09, 2012, 07:02:03 PM
 #74

Wait wait wait. Are we sure uint16 is such a good idea? Last time I tried >4 (which was before 2.6, btw, I haven't tested with 2.6), it would crash in the compiler. Also, does anyone have a count on the number of registers per CU? There might not be enough registers to handle that.

I'm not sure if it's a good idea or not so I wanted to measure it Wink GCN has 64KB worth of registers per CU, and like you said I'm not sure if that's enough. The reason for my curiosity was because GCN's compute units each contain 4 x SIMD units with a width of 16 elements (same size as Larrabee & Intel's MIC, coincidentally), and I recall reading somewhere that each of these SIMD units can retire one 16-way instruction every 4 cycles, so those 16element vectors kind of rang out at me. I also wanted to get familiar with the OpenCL bitcoin mining code and thought it would be a neat exercise (which it was!). Nice code by the way.

I can say for sure that 16element vectors DO compile with the drivers that came with the card.

The -ds code dump for 16 element vectors came out nice and clean, although the last few lines where the result is stored in output seem a bit branchy. It looks something like this:

Code:
    if(XG2.s0 == 0x136032ED) { output[Xnonce.s0 & 0xF] = Xnonce.s0; }
    if(XG2.s1 == 0x136032ED) { output[Xnonce.s1 & 0xF] = Xnonce.s1; }
    if(XG2.s2 == 0x136032ED) { output[Xnonce.s2 & 0xF] = Xnonce.s2; }
    ...
    ...
    if(XG2.sd == 0x136032ED) { output[Xnonce.sd & 0xF] = Xnonce.sd; }
    if(XG2.se == 0x136032ED) { output[Xnonce.se & 0xF] = Xnonce.se; }
    if(XG2.sf == 0x136032ED) { output[Xnonce.sf & 0xF] = Xnonce.sf; }

I tried replacing it with a branch-less expression using shuffle() and vstore16() but haven't managed to get it working. What I've come up with looks something like this:

Code:
    x mask = Xnonce & 0xF;
    x temp = shuffle(select(Xnonce, 0, selection), mask);
    vstore16(temp, 0, output);

Anyhow I'm sure that my code modifications are doing all sorts of dumb things. I'm still learning how it all works so please ignore.

Also, check some of the larger -vs, -v 40 is two sets of uint4 and -v 44 does three uint4s (unlike cgminer, -v 4 does two uint2s).

I've tried all of the different -v settings available (according to the source) but haven't been able to get any higher than the 666MH/s with the default settings and 3 compute threads.
DeathAndTaxes
Donator
Legendary
*
Offline Offline

Activity: 1218
Merit: 1079


Gerald Davis


View Profile
January 09, 2012, 07:07:42 PM
 #75

Hey OP do you have a kill-a-watt you could purchase locally.  If you are in the states Home Depot and Lowes carry them.  If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.

The kill-a-watt brand doesn't appear to be commercialized here in europe, and I've been searching for an equivalent device locally each time I've had a chance to head out to a store for the past couple of days, but no luck so far.

Well that sucks.  A more universal albeit expensive tool is a clamp meter. 

terrytibbs
Hero Member
*****
Offline Offline

Activity: 560
Merit: 501



View Profile
January 09, 2012, 07:14:07 PM
 #76

okay, i will fly to Singapore and pick one up if it all makes you happy....


i got a girl there:P
Is it Mrs. Zhou Tong?
1onevvolf (OP)
Newbie
*
Offline Offline

Activity: 43
Merit: 0


View Profile
January 09, 2012, 07:15:39 PM
 #77

I can't wait till the 7990 that is going to be impressive but expensive  Sad I might have missed this but what is the heat like hashing overclocked ? and what fan speed

Overclocked @ 1125/975MHz with automatic fan speed I'm getting temperatures hovering 81-83C, and the fan runs at 47-49% speed. You can see some screencaps on one of the earlier pages. But since I prefer lower temperatures and am worried about VRM and memory temps not yet being reported by GPU-Z, I usually run it at 60% fan speed and get temps around 72C. The blower fan at 60% speed is quite loud (its a reference design from Sapphire).

At 100% fan speed, the overclocked card gets below 60C while mining but you can hear it from outside of the house at this point Tongue, so as lovely as these temps are this is not an option for me as it is also my gaming and work PC.
The King
Newbie
*
Offline Offline

Activity: 13
Merit: 0


View Profile
January 09, 2012, 07:17:14 PM
 #78

I can't wait till the 7990 that is going to be impressive but expensive  Sad I might have missed this but what is the heat like hashing overclocked ? and what fan speed

Overclocked @ 1125/975MHz with automatic fan speed I'm getting temperatures hovering 81-83C, and the fan runs at 47-49% speed. You can see some screencaps on one of the earlier pages. But since I prefer lower temperatures and am worried about VRM and memory temps not yet being reported by GPU-Z, I usually run it at 60% fan speed and get temps around 72C. The blower fan at 60% speed is quite loud (its a reference design from Sapphire).

At 100% fan speed, the overclocked card gets below 60C while mining but you can hear it from outside of the house at this point Tongue, so as lovely as these temps are this is not an option for me as it is also my gaming and work PC.

Yeah, so they still have not fixed that damn reference fan design. Aftermarket coolers FTW !

Damn ATI and their crap loud fan designs Sad
DiabloD3
Legendary
*
Offline Offline

Activity: 1162
Merit: 1000


DiabloMiner author


View Profile WWW
January 09, 2012, 07:51:59 PM
 #79

Hey OP do you have a kill-a-watt you could purchase locally.  If you are in the states Home Depot and Lowes carry them.  If you can find one locally I am sure we could get together the 3 or 4 BTC to get some accurate power readings.

The kill-a-watt brand doesn't appear to be commercialized here in europe, and I've been searching for an equivalent device locally each time I've had a chance to head out to a store for the past couple of days, but no luck so far.

I also took a stab at modifying DiabloMiner and managed to get it to use 16component vectors, which is what GCN is supposed to be tuned for, but performance isn't what I expect and its really hard to profile/debug the tahiti since I could not find any development tools that specificly support it yet.

BTW, they do make 240v/50hz euro Killawatts, but you might have to order it from the US. They also make 240v/60hz (double hot, like ovens and water heaters) ones and 208v ones for DC shit. Might have to look around, I love mine, its been essential for planning stuff out.

stick_wood
Newbie
*
Offline Offline

Activity: 8
Merit: 0


View Profile
January 09, 2012, 08:11:16 PM
 #80

Really nice cards and performance but the price really sucks !

5XXX is much more cost effective ATM. That may change in the future.

Maybe wait for FPGA ?
Pages: « 1 2 3 [4] 5 6 7 8 9 10 11 »  All
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!