Bitcoin Forum
May 25, 2024, 10:27:34 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
  Home Help Search Login Register More  
  Show Posts
Pages: « 1 2 3 4 5 6 7 8 9 10 [11] 12 »
201  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 11, 2018, 02:34:52 AM
Are there many individual components of the Xnn series of algos which won't fit on an arria or even a cyclone ?

Yes. Some of the components hardly fit on a 9P. The cubehash example i gave a few posts ago.


Are you saying the algorithm  for an individual cubehash pipeline hardly fits on a 9P???... while I haven’t studied cubehash specifically, it’s claimed to take 200 cycle on a basic CPU, and I can implement a lot of basic CPU cores on a 9P...

A pipeline, sure, lots of pipelines, if you want to unroll it fully and obtain real (1Gh/s+) performance it would take the entire 9P and it's not clear that a fully unrolled version of it would fit at all.



Ahh ok - I completely understand what you’re saying now. I misread as the individual algorithm took the chip.

The way i’d attack Lyra2Rev2 in hardware is a literal pipeline of chips, sized according to paralyzed throughput. It looks like the whole chain is 256 bit hashes, so 400gbps interconnect could handle your 1Gh+. For chip to hip interconnect on the same board 3 quads of 32 Gbps should be sufficient.  The Blake/keccak skein is probably all on the same chip or a much smaller chip.

Your 9P is probably $3000, you could buy 2-3x the luts for pipelines on smaller chips for that...

All that said it looks like the 1GH is worth about $1000/mo right now, so still quite a long payout if you’re using $12k in hardware.
202  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 11, 2018, 02:13:04 AM
Are there many individual components of the Xnn series of algos which won't fit on an arria or even a cyclone ?

Yes. Some of the components hardly fit on a 9P. The cubehash example i gave a few posts ago.





Are you saying the algorithm  for an individual cubehash pipeline hardly fits on a 9P???... while I haven’t studied cubehash specifically, it’s claimed to take 200 cycle on a basic CPU, and I can implement a lot of basic CPU cores on a 9P...
203  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 11, 2018, 02:01:43 AM
You can’t fit a Stratix 10 on a nVME stick... I’ve tried. Kintex or Arrria is about as big as you can get. Damn 22x80 form factor.

I know.

I question the need for ultra large and expensive FPGAs instead of clusters of smaller FPGAs with high speed interconnects.
Perhaps a custom PCI-E format board with 4-6 last generation FPGA devices with some fast memory and a cross point switch. Are there many individual components of the Xnn series of algos which won't fit on an arria or even a cyclone ?



That’s exactly what I’ve been working on, for reasonable definitions of small and fast.

I have two active projects in the first spin batch phase. One is nVME with basically the biggest thing you can fit on there, and it augments GPUs more than works standalone.

The second is 4 chips on one PCIe card, with a switch, but the most reasonable chip that can be used in that configuration is still not what you would call cheap. Frankly even the nVME chip is as much as some graphics cards to get 4x 3.0 PCIe lanes.

The one advantage is the 4-chip board uses modules, so you could buy one with 1 module populated in the 3 figure range. When it is ready, which is likely August at this point for mass production.
204  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 11, 2018, 01:20:47 AM
In the next couple of years we'll be buying Stratix 10 PCI-E boards at walmart for $600 a pop.


And gamers will be complaining that miners have bought up all the NvmE Stratix 10 FPGA sticks

You can’t fit a Stratix 10 on a nVME stick... I’ve tried. Kintex or Arrria is about as big as you can get. Damn 22x80 form factor.
205  Alternate cryptocurrencies / Mining (Altcoins) / Re: VCU1525 (FPGA MINER BOARD) - $3,000 to $4,000 on: May 11, 2018, 12:41:00 AM

Those aren't in mass production yet.


You have said in the other thread that you mine using AWS and own a couple FPGAs yourself....  Could you please display them working before you start asking people about money?

Here's one of my VCU118 mining NIST5 with a PCI-E interface. Couple caveats... 1) This board is only 80A 0.85V vccint, so I need to stay under that to keep from frying it. The NIST5 design is operating at a fraction of it's maximum frequency because I can't operate it faster on this board. 2) Never completed / fully optimized nist5 because literally the day I was planning to start mining it baikal started mining it with their x10. 3) The picture says "AWS FPGA" because I use the same software on both.

Excuse the dust, that case was a GPU miner back in 2011. It's the only case I had that was big enough to fit the VCU118 and still put the side on it.

https://imgur.com/a/tmebe6W

And if you're curious why I named the software SuperMiner 31337, I was getting sick of pool operators getting curious about about my Fpgaminer software version string and superior hashrates. Moral of that story, pool server operators are watching and checking hashrates to try to gain an edge by seeing what's possible.

I'll see if I can make a video at some point showing it working with a monitor.


I’ll back up senseless here - the board he is proposing is useful and valuable for all the FPGA mining being done. There will be many FPGA mining options soon, but if you’re interested in getting into this space it is a good option.  


As to the picture,
VCU118 - aka the ol’ Ultrascale+ test/dev bed. http://imgur.com/FKVHPNs

This is not the same space as GPU mining. Expect Bitstreams, miner development, etc. to be more ASIC like.

Edit: I would be interested in 8 of these for some testing.
206  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 11, 2018, 12:19:55 AM
As it relates to Ravencoin mining with FPGAs, OP will need to store over 300 million bitstreams to account for every possible combination. Better get back to the drawing board because this design will never work.
Partial reconfiguration - you don’t need every combination, just every building block.
Yeah, for X16r coins thats 16^2=256, for X16s coins thats 16!/14!=240. Certainly doable.

Hmmm, two accelerator cards will be daisy chained with pipelining and their performance will magically double. I will believe it when I see it like all other claims made by the OP.

This isn’t hard. I do it all the time for a few algorithms. Here’s a contrived example - fill the scratchpad for CN7 on one FPGA dedicated to that, spitting out 2MB scratch pads all day long, and taking them back in and compressing / finalizing them. Total bandwidth for (example) 22kH is 343 Gbps. That’s achievable on lots of current hardware.

This makes it a lot easier build two sets of pipelines on two FPGAs for two related but very different set of operations. Doing all the things on two separate FPGAs couldn’t achieve the same performance.

This was a back of the envelope example, but the Xxx algorithms that just chain more on to the process definitely lean them selves to this kind of operation. This (and memory bandiwdth, and easier cooling) is why my accelerator cards have 4x75W
Ultrascale + FPGAs and not one big Virtex. Interconnect on those is 256 Gbps.


Edit: Let me try to phrase this in a few words. Don’t waste the extremely high bandwidth interconnect and resources inside the FPGA for something you can use the slow external interfaces to accomplish.
207  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 10, 2018, 10:52:31 PM
As it relates to Ravencoin mining with FPGAs, OP will need to store over 300 million bitstreams to account for every possible combination. Better get back to the drawing board because this design will never work.

For Ravencoin and Bitcore, the DDR4 is not used for hashing at all, but rather to store hundreds of different FPGA configuration bitstreams, which allows the entire FPGA to rapidly reprogram itself on every block based on the algorithm sequence for that block.

Partial reconfiguration - you don’t need every combination, just every building block.
208  Alternate cryptocurrencies / Mining (Altcoins) / Re: VCU1525 (FPGA MINER BOARD) - $3,000 to $4,000 on: May 10, 2018, 03:45:39 AM
no interest in selling your miner or dev services?

also is that normal to have the intro pricing then jack it up, whitefire also says its $4k now but 5k + next month

It's the first time I've ever seen a company do that with FPGAs, that's for sure. Only avnet is doing it, digikey wants $5200/board for low QTY. My pricing just depends on chip order quantity, the more ordered, the lower the price is going to get.

I don't trust % fees for dev services because they're so easy to avoid paying.


Really? This is standard operating procedure for Xilinx. Dev boards are usually sold cheaper than even just the chip cost for normal quantities. They want to make it easy for people to build things around their expensive chips, but still make all the margins on chips. I too have been talking to Xilinx and trying to get them to see there is a big market here, big enough to make up for their margins on some of these chips.

There is more FPGA hardware coming pretty soon (over the next 2-3 months)  from a few sources (disclosure - myself included). Generally speaking most are going to be in the same ballpark cost/performance, but there may be outliers or easier entry level costs.

The problem with FPGAs is it is very hard to protect the bitstream if you aren’t selling the hardware. You can sell hardware that will work with your encrypted bitstreams and also support unencrypted bitstreams (anyone can make new miners). You can also lock hardware to only your encrypted bitstreams. But if you don’t sell the hardware (and burn in the encryption keys) anyone can copy it. So that leaves virtuous open source, or people making profit on hardware alone. The tools and time to develop these things are very expensive, so you likely won’t find a lot of people giving it away for free. Development time is also much longer. If I make a change that actually needs tested on real hardware of VCU1525 size, expect hours for a very fast computer  to synthesize and route it before I can test it. A lot can be done in simulation, but the last mile takes a lot of time.

It is possible to use normal methods to try and protect the “command and control” normal software that drives the FPGA, but someone could reverse engineer that - just like OpenCL kernels.

If you think the FPGA is magic and the OpenCL compiler will give you 10-50x performance over your GPU, you will be sorely disappointed. Those of us seeing these gains are (afaik) writing very low level RTL to do so. There’s a big difference between OpenCL and RTL for FPGAs vs OpenCL vs ISA Assembky for GPUs.



209  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 10, 2018, 03:15:39 AM
Sorry, I've been meaning to make a post related to this.  We have been working with the OP to determine the optimal configuration and board for this application, as we have a few to choose from.

If you could include a table showing how much (LP)DRAM and SRAM (or low-latency alternative) can be hooked up to each FPGA, as well as the resulting cost, that would be very helpful. Thanks in advance!

The best thing to hook up to the FPGA would be a hybrid memory cube +$500. This would get you the same level of memory performance as HBM memory. The other nice thing, the HMC has a silicon memory controller on it along with some basic logic functions that can speed up certain applications (xor, and, or).



Can the hybrid memory cube be used with the VCU1525?
.

HMC is usually soldered on to the board just like HBM.  HMC can provide staggering amount of bandwidth although is suffers in latency.  Uses SERDES communication.
There are HMC+Altera FPGA paired boards by PicoComputing iirc.

What is hoped to be accomplished with HBM/HBC? What algorithms ? These typically offer in the order of 500GB/s. That means for ETH they support a max of 62MH, and for CN7 they alone support “only” 16KH, but they add huge cost premiums to FPGAs, and offer no advantage over GPUs.

The FPGA advantage is in calculation bound code, or algorithms whose work happens in memory spaces < 40MB or so, where the onboard 11+ TB/s of ultraram/BRAM can be used.

External QDR is going to top out at 80-160 GB/s too because of pin count. Sure it is low latency but that is only part of the equation.

The absolute most bandwidth you can get off chip is if you used 128 32 GBit/s transceivers in a very expensive chip, and that will give you... 512 GByte/second again.

There’s little to be gained with off chip memory of the very expensive variety, in most cases.
210  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 09, 2018, 11:15:22 PM

45% -- Would have been lower but I lost a LOT in Jan 2018 when shitsmartcash got hacked

Oh yes, that was me. I was controlling the market as much as I could. Most FPGA i ever had at a single time was around 800.


Well, I am jealous and I knew something was up when the prices kept climbing.  Wink

Don't be, I wasted all my profits flying all over the world trying to raise $5M to build out a huge FPGA mine. All the investors thought it was too good to be true and did not believe that I was mining on AWS. They kept saying "why isn't anyone else doing this".  Roll Eyes -- Hey, at least I got to see india, china, hong kong, switzerland, germany, france Cheesy



Lol.Thats awsome

Ya, in some alternate version of reality I'm sitting in a mine with 3K fpgas cussing out whitefire990 for telling the world. For now, I'll have to settle trying to take the wind out of bittware's sales.



Aren’t we all cussing out whitefire990 for telling the world?

Poor smartcash, it’s not gonna know what to do with all the hashpower it’s about to get
211  Alternate cryptocurrencies / Mining (Altcoins) / Re: Dwarf FPGA – the anti-ASIC on: May 09, 2018, 10:53:24 PM
We are a group of developers from different countries. After working over the last six months we are ready to present the result - an ecological "green" miner.
Usually, ASICs are designed for “industrial mining” and they are expensive. In addition, these devices threaten decentralization. Changing the mining algorithm involves the risk to not recoup investment.
GPU mining is more flexible but produces a lot of heat and noise. This accelerates global warming and increases the owner's costs. This forms a feud with gamers!  Grin

We have developed an intermediate product. This small device is able to replace one rig and produce 7 kh /  10 W on the v7 and 3.5 kh / 8W on the heavy.  https://drive.google.com/open?id=1h_6uMPuAfIud_22hsuTlX7IHwLfanmIW
Dwarf supports decentralization because personal mining will now become profitable in the countries with high electricity prices. The device is of low cost and does not threaten large losses if the sudden change in the algorithm happens. This is available to anyone who does not want to pay thousands dollars for ASIC.

Dwarf can be updated. If the next fork will be similar to the previous one changing the firmware will allow the device to function further. This can be done remotely using a unique device encrypted key. We plan to sell updates for a small price.

Now we are negotiating with the factory to produce 5000 units. We expect about 300 euro for each and can tell exact price within 10 days. The sale will be launched soon.
If you have questions about a major purchase, please write to the official mailbox (this is a unique project mailbox!) DwarfMinerSales@gmail.com .

Why do we sell this? Because this board is not designed for industrial mining. It's not convenient way to use a thousand devices for one person. It is designed as an anti-ASIC for personal use in the living area.


What chip are you using?



I second this. What chip, or even brand are you using?

To sell for this price in Xilinx brand, you’d be Artix with some external QDR maybe? Still the hashrate is... tricky at that power consumption. I’d buy a bunch of them if I knew what FPGA so I could program them myself...

I too have a 7W board in development/ initial prototypes along with the big one. I don’t think I could come anywhere close to even the AES for 7KH on that one. @200Mhz artix chips would be capable of that’s around 5-10 M/AES full passes a second, 10 pipelines maybe on chip by the time other logic needed.. That only supports even scratch pad fill/inner/compress of 1 kh.  So I would say again what chip?
212  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 09, 2018, 12:50:47 PM
just think about it, it's not that crazy as may sound like at first glance

one of this is just 30 gpu 1070, at the cost of 10 o f them(you can find a 1070 for 330 or lower now so it's up to 12 of them), so it's 3:1 at best, but the new nvidia gpu will be 100% faster than a single 1070

this mean that this vs the 1170 will be only 1.5:1, 50% faster or lower...nothing special

https://www.techradar.com/news/nvidia-geforce-gtx-1180-leak-shows-that-its-faster-than-a-titan-xp

That Nvidia will release a GPU that is 100% faster than 1070 for 330 USD (which is you price in the comparison) is quite unrealistic Tongue
This generation we had quite a boost, yet the 1060 is only about 12% faster than a 970 as an example.
And dont forget the difference in power consumption! I would love a single FPGA instead of a 1080 ti rig, but have to see that it works first. Lead time in Norway is 12 weeks though Sad

will be 500 at launch( so 15 1170 vs this board will be $4000 vs $7500)as usual and 700 for the 1180 which is arguably as a fast as this fpga board for the same cost and same consumption with some tweaking

a 1170 will consume 100 watt, tdp of a 1180 which is a strongest card is the same as 1070ti which consume only 100, so a 1170 will consume less probably at lower tdp

Did I miss the detailed performances spec, price, and power consumption figures on the 1170 getting published?
213  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 09, 2018, 03:15:11 AM

FPGAs and ASICs are very closely ralated, and FPGAs are generally used to prototype ASICs.


FPGA data + 500k$ = ASIC
Good luck 4% guy Wink


Wait Wait.

You guys are saying that with the FPGA and bitstream code OP has, someone with deep pockets can actually produce an ASIC from it?

I believe you need the low-level source code (behavioral logic, transistor-gate level) to make an ASIC out.  
Or you would have to reverse engineer the bit-stream which would also require complete knowledge about entire FPGA layout and schematics?

There isn’t really much magic to the RTL for these things. Most of the performance gains are more about working around the nuances of the FPGA design and finding hardware where someone has laid out all the needed components (memory bandwidth and/or low latency memory, multiple chips with high speed interconnect, etc. I do wonder how the the OP intends to secure the bitstream if he distributed it. It would be pretty easy to write your own miner, and you can only encrypt the bitstream if it is your own hardware and you burn eFuses or similar.

Also with FPGAs and dev boards you are usually given very detailed manuals and all the schematics.

Anyone equipped to do ASICs could implement the algorithms themselves, it’s all about the capital - and accepting that all that cost is wasted if the algorithm is changed. For a lot of modern algorithms it is more about assembling the memory components than even the ASIC cores.

The reason some of us are long on FPGAs is they have the potential to be lower cost, much lower power, and higher performance than GPUs for algorithms that choose to resist ASIC development (aka change regularly). There is very little a GPU can do to gain advantage on modern (upcoming) FPGAs in that world. The transition to HBM as standard and Intel ending the artificial high price racket are the only barriers. It doesn’t take that long to stand up working RTL for an algorithm change. Personally I’m thrilled if Crypto pressure ushers in a whole new era of high performance computing.
214  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 06, 2018, 10:22:00 PM
Turn on the overtemperature shutdown in the bitstream, and control the temperature in the mining software. It is easy.
“Turn on overtemp shutdown in the bitstream”

AFAIK there’s no such magic button. If you want temp monitoring and thermal limits with an FPGA you have to include them in your design logic.
False. The better fpgas(like virtex 5/6, complete series 7 and newer xilinx) has pin terminals to an separated internal die temperature sensor transistor. A properly designed board should use it for temperature measurement and shutdown. It works even when FPGA isn't configured. An alternative is using jtag to read XADC (7 series and newer).
I actually went ahead and RTFM'd the "UltraScale Architecture SYStem MONitor User's Guide". It looks like it is very close to being a "magic button", it takes only one line of code:
Code:
set_property BITSTREAM.CONFIG.OVERTEMPSHUTDOWN ENABLE
On the other hand the whole guide to the SYSMON block has 113 pages, so it must be also relatively easy to accidentally mis-configure it.
 

I stand corrected, I was not aware of the existence of that flag! I knew you could read the XADC, but did not realize there was a bitstream flag. Good to know, although I don’t know if I would rely on that as your only protection.

I appreciate the wealth of knowledge and skill coming out on this forum regarding the FPGA work.

Regarding workstation GPUs, I said they could be useful for mining, and can work alongside FPGAs given the direct access to their memory, not that they were better than normal GPUs for mining or that they were like FPGAs. You can however very often find large batches of the older generation passive cards for cheap as most people don’t have the ability to use them. I last paid $130/ea about a month ago for a bunch of older FirePro workstation cards that are now hashing at 24MH ea.

215  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 06, 2018, 03:21:25 PM
From my practice there are simple measures to avoid failures.
Do not place these boards in the closed case, setup the fan to cool the bottom layer of PCB.
Do not allow the local overheat of any component like DC/DC or external power connector.
Turn on the overtemperature shutdown in the bitstream, and control the temperature in the mining software. It is easy.
Then the FPGA board will work for years.

“Turn on overtemp shutdown in the bitstream”

AFAIK there’s no such magic button. If you want temp monitoring and thermal limits with an FPGA you have to include them in your design logic. Same as an ASIC. Otherwise you very much can exceed junction temps and damage the hardware if you have enough power going into the board to begin with.

This is also possible on some GPUs with poor drivers. A few of my old Titan Blacks from HPC work had power stages for memory that would overheat if the memory was trashed too much for too long, and the mosfet would slip down on the boards till they caused a short. Actually caught a entire server on fire that way once...
216  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 05, 2018, 11:40:52 PM
We operate custom hardware. We’ve done the 100 device first spin, and are currently working on a full batch order > 1000 FPGAs for the Ultrascale+ end.

I haven’t posted here much, so I seem to have a lot of message reply and rate limits.

Regarding AWS - once the cat is out of the bag it won’t be profitable.
217  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 05, 2018, 11:34:43 PM
I won’t say it’s impossible, but I would be really genuinely surprised. 32MB / hash of total bandwidth (read + write) is needed, and 2MB or so of stashes per hashcore.

You have 1280 URAM blocks of 288kb by 72 bit interface dual ported in the biggest configuration .That’s an incredible amount of internal bandwidth but you can only store 23 or so simultaneous Cryptonight7 2MB blocks in that. The absolute biggest part (which isn’t on the 1525 board) has 360Mbit URAM, 96Mbit BRAM, and 48Mbit Distributed RAM, holding a theoretical 63 MB of pipelines, assuming you didn’t need a single bit of that for the rest of your logic (you do).

The external memory at say 4x64 DIMMs @2666 is only 85GB/s, or 2.6 KH worth of bandwidth with a perfect access pattern.

Even if you could imaginarily use all 2000+ balls on the FPGA for 2666 MT/s DDR style  speeds you’d still only clear 20KH against external memory and that isn’t even real bandwidth.

 Even if you took the biggest part with 128x32 Gbps transceivers to SERDES memory you’d only have 16kH limit from bandwidth.

Unless you break the algorithm itself, there’s no where to find the bandwidth + storage space for 64khs on a single FPGA.

You're missing a really big part of the ultraram. One of the most attractive things that ultraram has to offer. True dual port single clock read/write. Also, when you chain ultrarams together it increases the bus width proportionally to the amount it increases the latency. I never completed monero but my estimates were in the 4-8Kh/s per board range at 100W.





I wasn’t missing it - the true dual port is truly the reason it works as well as it does - write completion before read in the dependency chaining. The issue (at least with cryptonight) isn’t the bandwidth at all, it is the amount available. Ultraram is great in general.


There's also a bunch of block ram and distributed ram. AES itself is tiny (<30K luts) and the secondary hashes don't need to be completed on the FPGA (meaning, you don't really need to put groestl, jh, etc on the fpga, you can just read the 8Kh/s and complete the secondary on CPU). While I have those completed (the secondaries), I had never intended on putting them on the fpga for cryptonight.



Im not sure you actually read my posts on the topic, as you’re repeating a few things I already stated - such as I don’t do secondary hashes on the FPGA. 30k LUTs for AES? That’s a huge amount more than my cores...
218  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 05, 2018, 11:28:03 PM
I won’t say it’s impossible, but I would be really genuinely surprised. 32MB / hash of total bandwidth (read + write) is needed, and 2MB or so of stashes per hashcore.

You have 1280 URAM blocks of 288kb by 72 bit interface dual ported in the biggest configuration .That’s an incredible amount of internal bandwidth but you can only store 23 or so simultaneous Cryptonight7 2MB blocks in that. The absolute biggest part (which isn’t on the 1525 board) has 360Mbit URAM, 96Mbit BRAM, and 48Mbit Distributed RAM, holding a theoretical 63 MB of pipelines, assuming you didn’t need a single bit of that for the rest of your logic (you do).

The external memory at say 4x64 DIMMs @2666 is only 85GB/s, or 2.6 KH worth of bandwidth with a perfect access pattern.

Even if you could imaginarily use all 2000+ balls on the FPGA for 2666 MT/s DDR style  speeds you’d still only clear 20KH against external memory and that isn’t even real bandwidth.

 Even if you took the biggest part with 128x32 Gbps transceivers to SERDES memory you’d only have 16kH limit from bandwidth.

Unless you break the algorithm itself, there’s no where to find the bandwidth + storage space for 64khs on a single FPGA.

You're missing a really big part of the ultraram. One of the most attractive things that ultraram has to offer. True dual port single clock read/write. Also, when you chain ultrarams together it increases the bus width proportionally to the amount it increases the latency. I never completed monero but my estimates were in the 4-8Kh/s per board range at 100W.




[/quote

I wasn’t missing it - I even mentioned it. I know my FPGA prices are more than an order of magnitude below the “digikey” prices.  The true dual port is truly the reason it works as well as it does - write completion before read in the dependency chaining. The issue (at least with cryptonight) isn’t the bandwidth at all, it is the amount available. Ultraram is great in general.

On the Keccak, remember most crypto doesn’t need anywhere near the full 1600bit of initial or final state. Trace through those dependency chains over 24 rounds and you eliminate a lot of calculation.
219  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 05, 2018, 07:39:39 PM
Can workstation cards become useful for mining ? Damn ! They're gonna teach us FPGA next year in college.

You mean the FirePro type GPUs? Definitely. I have many many older FirePro cards that had plenty of memory bandwidth but underpowered (or over powered depending how you look at it) cores compared to the current gen. This cards got large page table support and some other niceties because of who the customers were. The also support an FPGA on the PCIe bus directly writing into / readying from their GPU memory (albeit at PCIe speeds). This works great with something like ETH, because you can do all the compute heavy Keccak off GPU but still use all the memory bandwidth.
220  Alternate cryptocurrencies / Mining (Altcoins) / Re: DIY FPGA Mining rig for any algorithm with fast ROI on: May 05, 2018, 07:02:17 PM
@GPUHoarder

What has been your experience with the latest HLS tools from Xilinx? A few years ago, the performance of their sample Bitcoin miner was atrocious. A $2K board with the throughput of 80 MH/s, not sure why they even included that sample.

We are low level RTL here, so I haven’t messed with much of the HLS side. The big issue with FPGAs is people used to buying graphics cards that are all commoditized close to a low MSRP don’t understand that the FPGA market is nothing like this. The traditional market for these things was Avionics, Defense, Government, Nation States, etc. the price on digikey is no where near the actual price for someone building their own hardware, and the dev kits are expensive because they come with every bell and whistle you might want to use with a particular chip and they are low low volume. This is changing a bit but these VCU1525 dev accelerators are no different than Nvidia’s P100/V100 PCIe card’s or AMD’s old FirePro, now Radeon Pro multi-thousand dollar cards. They don’t cost 10x more to make, the target market is just far less cost sensitive.

The gap between ASIC and FPGA is also much narrower now than it was a few years ago.

If a market emerges to consume 100,000 FPGA accelerators for mining, someone building their own hardware with a good relationship with Xilinx or Intel can and will drive the cost down at least an order of magnitude.
Pages: « 1 2 3 4 5 6 7 8 9 10 [11] 12 »
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!