Bitcoin Forum
May 07, 2024, 05:11:39 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
  Home Help Search Login Register More  
  Show Posts
Pages: « 1 ... 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 [70] 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 »
1381  Other / Off-topic / Re: Butterfly Labs - Bitforce Single and Rig Box on: March 26, 2012, 06:39:28 AM
sorry to spam the thread but I just want to encourage people to use and contribute to the Bitcoin FPGA wiki at http://wiki.btcfpga.com
Our goal is to provide an informative wiki to display and compare all hardware and software related to mining Bitcoin with FPGA

There is a lot of bad data in this table. Pretty much all of the Radeons have incorrect Mh/s/$ values (eg. Radeon HD 6990, 800 Mh/s, $623 -> 1.28 Mh/s/$... but the table lists 1.46 Mh/s/$).

The prices are wrong too (Radeon HD 6990 sells for more than $623).

The wattages are wrong to (Radeon HD 6990 o/c'd to mine at 800 Mh/s will consume MUCH more than 346 Watt).

Etc.
1382  Other / CPU/GPU Bitcoin mining hardware / Re: GTX 680 Perf Estimations on: March 25, 2012, 05:26:37 AM
Schleicher: very funny Cheesy

Edit: I was privately contacted to explain my reaction to his post... Schleicher's post is obviously untrue. There are no such performance improvements in the GTX 680. I assume he was being humorous.
1383  Other / CPU/GPU Bitcoin mining hardware / Re: GTX 680 Perf Estimations on: March 25, 2012, 03:09:01 AM
Your math is almost right, except this:

The Kepler SMX can execute 2x as many instruction as a 48-shader Fermi SM, but it can execute 3x as many instructions as a 32-shader Fermi SM (which the GTX 580 is an example of). Therefore do not multiply by 2, but by 3.

You forgot to divide the end result by 2 because, as you pointed out, the number of SMXs in the GTX 680 is half the number of SMs in the GTX 580.

Finally, a GTX 580 can mine at 149 Mh/s with a properly fine-tuned miner, not 140 Mh/s. (An entry in the wiki claims this with rpcminer-cuda.exe set to use a high aggression level parameter.)

Correcting these 3 mistakes give a GTX 680 estimation of: 149*3*1058/772/2 = 306 Mh/s... which matches exactly my first estimation (300-315 Mh/s).
1384  Other / CPU/GPU Bitcoin mining hardware / Re: GTX 680 Perf Estimations on: March 23, 2012, 07:22:54 PM
I found this interesting:

The value of these benchmarks to predict Bitcoin performance is null. SiSoft publishes almost no information as to what actually their GPU code is doing. For all we know, they are probably memory-bound instead of ALU-bound. And the huge discrepancy disfavoring AES-128 on AMD, and SHA-256 on Nvidia suggests their code is poorly optimized in these respective cases and/or exposes flaws in the OpenCL compilers... There is no reason for SHA-256 hashing to be that slow on Nvidia (even if GTX 680 has no BFI_INT and no int rotate, it should be at least about half as fast as HD 7970). Conversely, there is no reason for AES-128 to be that slow on AMD.
1385  Other / CPU/GPU Bitcoin mining hardware / Re: GTX 680 Perf Estimations on: March 23, 2012, 06:49:59 PM
Vbs, if this interpretation was true, then GTX 680 would be a 6180 single precision GFLOPS chip, but it is actually only 3090 SP GFLOPS...

Perhaps talking about GFLOPS is an easier way to convince you... In all recent Nvidia and AMD GPUs, the number of 32-bit integer instructions executable per clock is linearly proportional to the number of single precision floating point instructions executable per clock. (This was not true with the old GT200 because Nvidia counted GFLOPS assuming mul+mad or 3 fp operations/clock, anyway I digress.) Per Nvidia's published figures:

GTX 460 = 907 SP GFLOPS
GTX 680 = 3090 SP GFLOPS
And 3090/907 = 3.4x (or about the same 4x ratio I mentioned earlier)

Therefore GTX 680 would only mine 3.4x faster than the GTX 460 (again assuming no BFI_INT and int rotate).

What Nvidia mean by "twice the number of instructions per clock" is that a whole SM/SMX executes twice as many instructions per core clock:  a GTX 460's SM executes 48 (# of shaders) * 2 (shader clock is 2x the core clock) = 96 instructions per core clock; a GTX 680's SMX executes 192 (# of shaders) * 1 (shader clock same as core clock) = 192 instructions per core clock.
1386  Other / CPU/GPU Bitcoin mining hardware / Re: GTX 680 Perf Estimations on: March 23, 2012, 11:23:22 AM
Yes. Two doublings. Number of cores quadrupled in an SM (now SMX). From 48 to 192. But I already take this into account in my calculation (192 cores in an SMX * 8 SMX = 1536 cores, which I base my numbers on). I think you are picking up isolated sentences without understanding the whole picture of how the GPU works.

Another way of seeing it, if I may grossly simplify, is that this Anandtech article says the GTX 680 approximately quadrupled the performance of a GTX 460 (which is approximately true: 4x the # of cores per SM/SMX, about same number of SM/SMX 7 vs 8, and the same shader clock 1006 vs 1350 MHz (+/- 30%)). GTX 460 mines at ~70 Mh/s, therefore GTX 680 would mine at ~280 Mh/s (+/- 30%), which is again consistent with my first estimation of 300-315 Mhash/s (assuming no BFI_INT or int rotate). This is a gross approximation but you get the idea.
1387  Other / CPU/GPU Bitcoin mining hardware / Re: GTX 680 Perf Estimations on: March 23, 2012, 11:01:56 AM
Having 4 warp schedulers instead of 2 does not mean that you can execute twice the number of instructions per clock. It just means you have 4 warps of threads which take turn to be executed instead of 2.

In other words, divide your 576 Mh/s number by two: 288 Mh/s, which is within 4% of my first prediction (300-315 Mh/s).
1388  Other / CPU/GPU Bitcoin mining hardware / GTX 680 Perf Estimations on: March 23, 2012, 09:57:39 AM
Nvidia GTX 680 has 1536 ALUs running at 1006-1058 MHz. There are a few unknowns because AFAIK there is still no public information whether the Kepler architecture has a native instruction doing (a & b) | (~a & c) (aka BFI_INT on AMD GPUs, useful to implement maj() and ch() in SHA-256), and a native integer rotate instruction. Nonetheless, it is easy to calculate the upper bound of mining performance because it is an embarrassingly parallel workload, and we know how many instructions are required in these different scenarios:

  • 300-315 Mh/s assuming the Kepler microarchitecture still lacks BFI_INT and int rotate
  • 375-395 Mh/s if Nvidia added only int rotate
  • 450-475 Mh/s if Nvidia implemented both BFI_INT and int rotate natively

This would still be worse than AMD's comparable HD 7970 card at stock clocks (554 Mh/s); but definitely a step in the right direction for Nvidia who finally recognized that having (much) more ALUs at a (slightly) lower clock is better than fewer ALUs at a higher clock.
1389  Other / CPU/GPU Bitcoin mining hardware / Re: GK104: nVidia's Kepler to be the First Mining Card? on: March 22, 2012, 09:30:44 PM
Newegg listed "Shader clock: 2012MHz".

Interesting... 1536 ALUs at 2012MHz would give it more processing power than a HD 6990. Either the GTX 680 is going to be the fastest GPU for mining, or it is a blatant mistake from Newegg or the microarchitecture has undisclosed limitations that would prevent exploiting all this apparent power.

Per the NDA lifted today, this was a blatant error. The ALUs will run at 1006-1058 MHz, which should allow this card to mine at an upper bound of 450-470 Mh/s (80-85% the speed of a HD 7970.) This is assuming of course thar Nvidia added a BFI_INT-like instruction to the architecture, which is not certain. If not, performance would be much lower...

Edit: for more accurate perf estimations see https://bitcointalk.org/index.php?topic=73627.0
1390  Other / CPU/GPU Bitcoin mining hardware / Re: GK104: nVidia's Kepler to be the First Mining Card? on: March 22, 2012, 08:31:02 AM
Newegg listed "Shader clock: 2012MHz".

Interesting... 1536 ALUs at 2012MHz would give it more processing power than a HD 6990. Either the GTX 680 is going to be the fastest GPU for mining, or it is a blatant mistake from Newegg or the microarchitecture has undisclosed limitations that would prevent exploiting all this apparent power.
1391  Bitcoin / Development & Technical Discussion / Re: v0.1 on: March 22, 2012, 08:10:24 AM
Hal sent me two archives of version 0.1.0 (a rar, and a tgz, both of the same source tree). I uploaded them to my site:

http://www.zorinaq.com/pub/bitcoin-0.1.0.rar
http://www.zorinaq.com/pub/bitcoin-0.1.0.tgz

SHA1:
ec9ed4ccbc990eceb922ff0c4d71d1ad466990dd  bitcoin-0.1.0.rar
35f83eaa334e0e447ceea77a7cc955a4ccdd1a1d  bitcoin-0.1.0.tgz

MD5:
91e2dfa2af043eabbb38964cbf368500  bitcoin-0.1.0.rar
dca1095f053a0c2dc90b19c92bd1ec00  bitcoin-0.1.0.tgz


All, please mirror them!
1392  Bitcoin / Hardware / Re: Algorithmically placed FPGA miner: 227MH/s and rising on: March 21, 2012, 09:40:55 AM
I certainly feel I was properly compensated.

Today, I would either do it the same way, or do it using the Kickstarter model like suggested on this thread. For those not familiar with it: http://en.wikipedia.org/wiki/Kickstarter
In 2 words: funds are pledged by potential customers. If a funding target is not reached by a certain date, money is returned to those who funded. Else the funds go to the seller who can finish producing and releasing the product.
1393  Bitcoin / Hardware / Re: Algorithmically placed FPGA miner: 227MH/s and rising on: March 21, 2012, 05:54:30 AM
Whatever happened to that guy that figured out how to implement BF_INT (or something) on the GPU miners before the rest of the coders did shortly after?

I looked for a few min on to forum and did not find it as if people have completly forgotten.

I am the guy.

What happened to me? Well for a while I was selling my BFI_INT-enabled GPU miner. My strategy to prevent it from being leaked and pirated (I think the only strategy that can work) was to price it quite high (400 BTC which was about $200 at the time), so that the buyers would feel they had a valuable, exclusive product that they would not want to leak. AFAIK it worked and was never leaked. At the time I was 10-15% faster than the other open source miners.

Then the open source ones progressively caught up, so I discounted the price down to 300, then 250 BTC. And eventually I stopped selling it altogether.
1394  Other / Off-topic / Re: Mini-Rig from Butterflylabs on: March 21, 2012, 05:13:05 AM
Really guys?  You're going to base your purchase decision on "can I fit 1 or 2 or 3 on a 15/20 amp circuit" ..??   We're talking $15,000 here .. If you have a doubt, bring in an electrician to run a dedicated circuit.

It is about maximum utilization, and efficiency.

It does not matter if you buy only 1 rig. But the larger your scale, the more you should care about these details.

I would rather spend 1 or 2 grands in mining hardware than in installing extra circuits because the BFL designers chose an awkward wattage that prevents me from utilizing my circuits to their maximum capacity.
1395  Other / Off-topic / Re: Mini-Rig from Butterflylabs on: March 20, 2012, 05:44:08 PM
I am pretty sure 20A is more common than 15A, at least in all US residences built in the last few decades. My 30-year old apartment building has 20A circuits in the units. My office has 20A circuits. My previous apt had 20A circuits.

Anyway, perhaps the mini-rig should be sized for 640W. One could run 2 per 15A circuit with 160W headroom (eg. in an old residence where the circuit might be shared by a lightbulb). Or 3 per 20A circuit with no headroom (for those operating them on dedicated circuits).
1396  Other / Off-topic / Re: Mini-Rig from Butterflylabs on: March 20, 2012, 05:30:26 PM
BFL-Engineer, I would recommend to size the mini-rig either for 960W, or 1920W.
This way, either 1 or 2 could be placed on a standard 120V-20A circuit, while drawing no more than 80% of its rated capacity per the National Electric Code (1920W).
1397  Other / Off-topic / Re: Can anyone tell me what chip is used in BFL single? on: March 20, 2012, 08:50:20 AM
As discussed, they are almost certainly using a cheap source of old-gen 65nm FPGAs: https://bitcointalk.org/index.php?topic=66314.msg769355#msg769355
1398  Bitcoin / Development & Technical Discussion / Re: PetaFLOPS and how it relates to Bitcoin on: March 18, 2012, 03:06:46 AM
That's not what's at issue. FLOPS is not, in fact, FLoating Point Operations Per Second but the numeric result of running a very specific, standardized benchmark from the LINPACK codes solving a large, dense system of linear equations. It is a benchmark that is meaningful for most scientific and technical disciplines, but says *nothing* about the ability to crank out SHA-256 hashes.

FLOPS is floating point operations per second. LINPACK FLOPS are just meaningful in the context of the TOP500 supercomputer rank because this organization decided to use this benchmark to establish the rank. A supercomputer might achieve "x" FLOPS on linear equations with LINPACK, "y" FLOPS when doing protein folding, and "z" FLOPS when doing some other work. The "x" FLOPS value of the LINPACK benchmark is no more significant than "y" or "z".

All are a fraction of "t", the theoretical peak FLOPS of the hardware (which can often be reached within 1-2% with a useless loop of multiply-add instructions). This theoretical peak can be used to predict the performance of SHA256-based Bitcoin mining because it scales linearly with the peak theoretical integer performance of a chip, which is itself directly related to its peak theoretical FLOPS performance by a fixed ratio. For example, there is exactly a 1:4 ratio between the number of integer and double precision floating point instructions that an HD 69xx series GPU can execute.
1399  Bitcoin / Development & Technical Discussion / Re: v0.1 on: March 16, 2012, 08:07:38 AM
I would love it: m.bevand at gmail.com
Take your time, it is not urgent.
1400  Other / CPU/GPU Bitcoin mining hardware / Re: 7990 delayed on: March 15, 2012, 05:20:18 PM
Radeon HD 7990 coming in April
Pages: « 1 ... 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 [70] 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 »
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!