Xeon Phi

Quote from: rjk on June 20, 2012, 04:44:37 PM

June 20, 2012, 06:19:21 PM

#61

Yeah, that's why their Tesla stuff shows up in pretty much all of the new supercomputer builds these days, right? (rolls eyes)

Their high-end Teslas kick some major ass.

If I've helped: 1CmguJhwW4sbtSMFsyaafikJ8jhYS61quz

Sold: 5850 to lepenguin. Quick, easy and trustworthy.

multi#lord

Member

Offline

Activity: 66
Merit: 10

June 21, 2012, 03:17:59 PM

#62

Probably covered some of the stuff in thread, but interesting read on the Xeon Phi:

http://vr-zone.com/articles/intel-xeon-family-finally-accepts-the-larrabee-in-xeon-phi-and-its-futures/16361.html

rjk

Sr. Member

Offline

Activity: 448
Merit: 250

1ngldh

Quote from: multi#lord on June 21, 2012, 03:17:59 PM

June 21, 2012, 03:26:46 PM

#63

Probably covered some of the stuff in thread, but interesting read on the Xeon Phi:

http://vr-zone.com/articles/intel-xeon-family-finally-accepts-the-larrabee-in-xeon-phi-and-its-futures/16361.html

Interesting!

Quote

The 50+ simple two-way in-order Pentium (yes, 1995 Pentium!) like cores feed the same number of 512-bit wide SIMD FP units, with the ability to deliver around 1 TFLOPs peak in double precision at around 1 GHz.
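
For what it's worth, the quoted peak figure is easy to sanity-check. Here is a rough back-of-the-envelope sketch, assuming each 512-bit unit holds 8 doubles and retires one fused multiply-add (counted as 2 FLOPs) per lane per cycle. The article only says "50+" cores at "around 1 GHz", so the core counts below are guesses, not Intel specs:

```python
# Back-of-the-envelope peak double-precision throughput for a many-core SIMD chip.
# Assumptions (not from the article): one FMA per lane per cycle, FMA = 2 FLOPs.

SIMD_WIDTH_BITS = 512   # from the article
DOUBLE_BITS = 64        # size of one double-precision value
FLOPS_PER_FMA = 2       # assumption: multiply and add counted separately


def peak_dp_gflops(cores: int, clock_ghz: float) -> float:
    """Theoretical peak in DP GFLOPS: cores * lanes * FLOPs per lane per cycle * GHz."""
    lanes = SIMD_WIDTH_BITS // DOUBLE_BITS  # 8 doubles per 512-bit vector
    return cores * lanes * FLOPS_PER_FMA * clock_ghz


if __name__ == "__main__":
    # Hypothetical configurations; the real core count and clock are not given.
    for cores, clock in [(50, 1.0), (60, 1.0), (64, 1.0)]:
        print(f"{cores} cores @ {clock} GHz -> {peak_dp_gflops(cores, clock):.0f} DP GFLOPS")
```

Under those assumptions, exactly 50 cores at 1 GHz gives 800 DP GFLOPS, so hitting "around 1 TFLOPs" would take a core count in the low 60s or a somewhat higher clock.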

Mining Rig Extraordinaire - the Trenton BPX6806 18-slot PCIe backplane [PICS] Dead project is dead, all hail the coming of the mighty ASIC!

cmg5461

Sr. Member

Offline

Activity: 369
Merit: 250

June 21, 2012, 03:32:06 PM

#64

Quote

And, at least behind the closed doors, both AMD and Nvidia GPUs have been shown booting Linux on their own, without requiring a CPU.

umm. wat

rjk

Sr. Member

Offline

Activity: 448
Merit: 250

1ngldh

June 21, 2012, 03:33:53 PM

#65

Quote

And, at least behind the closed doors, both AMD and Nvidia GPUs have been shown booting Linux on their own, without requiring a CPU.

umm. wat

True story, but only in the hands of the engineers who designed them; no one else that I know of has been able to make it happen.

cmg5461

Sr. Member

Offline

Activity: 369
Merit: 250

Quote from: rjk on June 21, 2012, 03:33:53 PM

June 21, 2012, 03:36:02 PM

#66

True story, but only in the hands of the engineers who designed them; no one else that I know of has been able to make it happen.

Ah. I never knew it was possible. I guess you could think of a GPU as a slower CPU. It must need a heavily modified kernel, though.

crazyates

Legendary

Offline

Activity: 952
Merit: 1000

Quote from: cmg5461 on June 21, 2012, 03:36:02 PM

June 21, 2012, 03:49:35 PM
Last edit: June 21, 2012, 05:05:48 PM by crazyates

#67

Quote from: rjk on June 21, 2012, 03:33:53 PM

True story, but only in the hands of the engineers who designed them; no one else that I know of has been able to make it happen.

Ah. I never knew it was possible. I guess you could think of a GPU as a slower CPU. It must need a heavily modified kernel, though.

Still. Gentoo on an APU is gonna be awesome come 2015!

Architecture:
[ ] x86
[ ] amd64
[X] opencl
[ ] arm

Tips? 1crazy8pMqgwJ7tX7ZPZmyPwFbc6xZKM9
Previous Trade History - Sale Thread

DiabloD3

Legendary

Offline

Activity: 1162
Merit: 1000

DiabloMiner author

June 22, 2012, 12:29:32 AM

#68

Quote

And, at least behind the closed doors, both AMD and Nvidia GPUs have been shown booting Linux on their own, without requiring a CPU.

umm. wat

AMD's Fusion is the product of years of research. AMD "demo'ed" an all-HyperTransport Radeon about a year after they bought ATI, and they've also been showing off prototype Fusions whose Radeon pipes aren't just on-die* but usable from the x86 interface side, although what "usable" means is still up in the air. If they've managed to use them as the backend for SIMD instructions (i.e., no more dedicated FPUs, with the x86 instruction scheduler issuing as many ops as it can in parallel across, say, 512 Radeon ALUs for the entire CPU instead of just 2 per core), this could mean a huge goddamned increase in FP performance without needing a dedicated HAL API like OpenCL.

* On-die Fusion Radeons don't have a Radeon memory controller and natively speak HyperTransport. The upside is that they have direct access to system memory as a native processor and can access stuff directly out of on-die cache: this means you have basically zero wait time to send stuff to the GPU for processing, and cache coherency comes at zero cost.

crazyates

Legendary

Offline

Activity: 952
Merit: 1000

Quote from: DiabloD3 on June 22, 2012, 12:29:32 AM

June 22, 2012, 03:52:19 AM

#69

Quote

And, at least behind the closed doors, both AMD and Nvidia GPUs have been shown booting Linux on their own, without requiring a CPU.

umm. wat

This is an old slide, but it gives a good picture of AMD's overall goal. We are somewhere between step 2 and step 3, and it's only going to get better! AMD has one of the most creative and innovative visions for the future of consumer computing (as opposed to Intel just shrinking the process node), and I think it's progressing quite well (just look at the success of their APU sales). I also think it's only going to get better for them as they move along with even more amazing features like what you just described.

/amdfanboyrant

DiabloD3

Legendary

Offline

Activity: 1162
Merit: 1000

DiabloMiner author

Quote from: crazyates on June 22, 2012, 03:52:19 AM

June 22, 2012, 04:17:24 AM

#70

Quote from: DiabloD3 on June 22, 2012, 12:29:32 AM

Quote

And, at least behind the closed doors, both AMD and Nvidia GPUs have been shown booting Linux on their own, without requiring a CPU.

umm. wat

Yeah, what I described is clearly Step 3 or later. Intel also seems to have finally sold a "step 3" type of device in the Phi, depending on what it actually can do.

crazyates

Legendary

Offline

Activity: 952
Merit: 1000

Quote from: DiabloD3 on June 22, 2012, 04:17:24 AM

June 22, 2012, 04:28:05 AM

#71

Yeah, what I described is clearly Step 3 or later. Intel also seems to have finally sold a "step 3" type of device in the Phi, depending on what it actually can do.

Intel seems more interested in incorporating the CPU into the GPU, while AMD is incorporating the GPU into the CPU. Totally different mindsets/endgames/results.

AzN1337c0d3r

Full Member

Offline

Activity: 238
Merit: 100

Quote from: DiabloD3 on June 19, 2012, 01:56:44 AM

June 22, 2012, 04:54:03 AM

#72

But if it still can perform just as well on highly branchy code, I might have a use for one of those.

That would make it crazy awesome for raytracing.

Quote from: cmg5461 on June 20, 2012, 06:19:21 PM

Quote from: rjk on June 20, 2012, 04:44:37 PM

Yeah, that's why their Tesla stuff shows up in pretty much all of the new supercomputer builds these days, right? (rolls eyes)

Their high-end Teslas kick some major ass.

On the wallet maybe. Last I checked, a Tesla M2090 was north of $4000.

Also, 1 TFLOPS is not that impressive. The HD 7970 does 947 DP GFLOPS, and it was released in January and doesn't have access to Intel's 22 nm 3D tri-gate tech.
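
For reference, the 947 figure falls out of the card's published specs. Here is a quick sketch, assuming the reference HD 7970's 2048 stream processors at 925 MHz, one FMA (2 FLOPs) per stream processor per clock, and double precision at 1/4 of the single-precision rate. None of those numbers come from this thread, so treat them as assumptions:

```python
# Rough peak-FLOPS estimate for a Radeon HD 7970 at reference clocks.
# Every hardware number below is an assumption taken from public specs,
# not something stated in this thread.

STREAM_PROCESSORS = 2048   # assumption: shader count of the reference card
CLOCK_GHZ = 0.925          # assumption: reference core clock
FLOPS_PER_CLOCK = 2        # assumption: one FMA per stream processor per clock
DP_RATE = 0.25             # assumption: DP runs at 1/4 of the SP rate


def peak_gflops(units: int, clock_ghz: float, flops_per_clock: int, rate: float = 1.0) -> float:
    """Theoretical peak in GFLOPS, scaled by `rate` for reduced-rate precisions."""
    return units * clock_ghz * flops_per_clock * rate


if __name__ == "__main__":
    sp = peak_gflops(STREAM_PROCESSORS, CLOCK_GHZ, FLOPS_PER_CLOCK)
    dp = peak_gflops(STREAM_PROCESSORS, CLOCK_GHZ, FLOPS_PER_CLOCK, DP_RATE)
    print(f"SP peak: {sp:.1f} GFLOPS")
    print(f"DP peak: {dp:.1f} GFLOPS")
```

Like any peak number, this is a ceiling on paper; real kernels rarely get close to it.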

DiabloD3

Legendary

Offline

Activity: 1162
Merit: 1000

DiabloMiner author

Quote from: crazyates on June 22, 2012, 04:28:05 AM

June 22, 2012, 05:42:50 AM

#73

Quote from: DiabloD3 on June 22, 2012, 04:17:24 AM

Yeah, what I described is clearly Step 3 or later. Intel also seems to have finally sold a "step 3" type of device in the Phi, depending on what it actually can do.

Intel seems more interested in incorporating the CPU into the GPU, while AMD is incorporating the GPU into the CPU. Totally different mindsets/endgames/results.

They both want branch/loop-happy, highly parallel computation. The Radeon's biggest "problem" (and I'm using the term loosely) is that wavefronts are run in lockstep: both sides of a branch are the same length, even if it requires inserting no-ops, and loops whose lengths are set at runtime (instead of statically at compile time) are just as nasty.
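
To make the lockstep cost concrete, here is a toy model (plain Python, not real GPU code). It assumes a 64-lane wavefront, and that when lanes disagree on a branch the whole wavefront steps through both sides with the inactive lanes masked off, which amounts to those lanes executing no-ops. The instruction counts are made up for illustration:

```python
# Toy cost model for branch divergence in a lockstep SIMD wavefront.
# Assumption: a diverged wavefront issues both sides of the branch, and
# lanes that did not take a side sit idle (masked) while it executes.
from typing import Sequence

WAVEFRONT_SIZE = 64  # assumption: lanes per wavefront


def branch_slots(takes_then: Sequence[bool], then_len: int, else_len: int) -> int:
    """Instruction slots the whole wavefront spends on one if/else."""
    slots = 0
    if any(takes_then):        # at least one lane needs the 'then' side
        slots += then_len
    if not all(takes_then):    # at least one lane needs the 'else' side
        slots += else_len
    return slots


def utilization(takes_then: Sequence[bool], then_len: int, else_len: int) -> float:
    """Fraction of lane-slots that do useful work (1.0 means no waste)."""
    useful = sum(then_len if t else else_len for t in takes_then)
    total = branch_slots(takes_then, then_len, else_len) * len(takes_then)
    return useful / total if total else 1.0


if __name__ == "__main__":
    uniform = [True] * WAVEFRONT_SIZE
    one_stray = [True] * (WAVEFRONT_SIZE - 1) + [False]
    for name, mask in [("uniform", uniform), ("one stray lane", one_stray)]:
        print(name, branch_slots(mask, 10, 40), f"{utilization(mask, 10, 40):.0%}")
```

In this model a single stray lane makes the whole wavefront pay for both paths (50 slots instead of 10 here), which is the sense in which branchy code hurts so much more on this kind of hardware than on a CPU.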

CPUs, OTOH, can't do highly parallel calculations because all the hardware dedicated to dealing with branching, branch prediction, cache prediction, etc. takes up a lot of room, produces a lot of heat, and uses a lot of power. I wonder how much stuff Intel removed to put 50 cores on a card.

goxed

Legendary

Offline

Activity: 1946
Merit: 1006

Bitcoin / Crypto mining Hardware.

Quote from: DiabloD3 on June 22, 2012, 05:42:50 AM

June 22, 2012, 06:21:53 AM

#74

I wonder how much stuff Intel removed to put 50 cores on a card.

Here's a pdf depicting the organization of Larrabee, the precursor of Phi.
http://users.ece.gatech.edu/lanterma/mpg08/Larrabee_ECE4893.pdf

Reviewing Bitcoin / Crypto mining Hardware.

DiabloD3

Legendary

Offline

Activity: 1162
Merit: 1000

DiabloMiner author

Quote from: goxed on June 22, 2012, 06:21:53 AM

June 22, 2012, 07:40:37 AM

#75

Quote from: DiabloD3 on June 22, 2012, 05:42:50 AM

I wonder how much stuff Intel removed to put 50 cores on a card.

Here's a pdf depicting the organization of Larrabee, the precursor of Phi.
http://users.ece.gatech.edu/lanterma/mpg08/Larrabee_ECE4893.pdf

I'm already well aware of how they designed that. It's more butchered than Atom. But from what I've heard, Phi isn't nearly as bad.

Gabi

Legendary

Offline

Activity: 1148
Merit: 1008

If you want to walk on water, get out of the boat

Quote from: AzN1337c0d3r on June 22, 2012, 04:54:03 AM

June 22, 2012, 11:26:23 AM

#76

Also, 1 TFLOPS is not that impressive. The HD 7970 does 947 DP GFLOPS, and it was released in January and doesn't have access to Intel's 22 nm 3D tri-gate tech.

Protip: Xeon Phi runs x86 code.

Good luck using the 7970's (or Nvidia's) computing power when you have to fight with OpenCL and CUDA.

Gabi

Legendary

Offline

Activity: 1148
Merit: 1008

If you want to walk on water, get out of the boat

Quote from: multi#lord on June 21, 2012, 03:17:59 PM

June 22, 2012, 11:30:19 AM

#77

Probably covered some of the stuff in thread, but interesting read on the Xeon Phi:

http://vr-zone.com/articles/intel-xeon-family-finally-accepts-the-larrabee-in-xeon-phi-and-its-futures/16361.html

This article fails:

Quote

So, how does it stand performance wise? Its double precision FP throughput is the same as the typical AMD Radeon HD7970 card which costs one quarter of the amount but with much smaller memory, 3 GB, and no ECC.

No ECC? The 7970 has ECC (rolls eyes)

rjk

Sr. Member

Offline

Activity: 448
Merit: 250

1ngldh

Quote from: Gabi on June 22, 2012, 11:30:19 AM

June 22, 2012, 11:53:45 AM

#78

The 7970 has ECC (rolls eyes)

Are you sure? That's usually reserved for the expensive enterprisey cards like FirePro.

DiabloD3

Legendary

Offline

Activity: 1162
Merit: 1000

DiabloMiner author

Quote from: Gabi on June 22, 2012, 11:30:19 AM

June 22, 2012, 02:46:37 PM

#79

Quote from: multi#lord on June 21, 2012, 03:17:59 PM

Probably covered some of the stuff in thread, but interesting read on the Xeon Phi:

http://vr-zone.com/articles/intel-xeon-family-finally-accepts-the-larrabee-in-xeon-phi-and-its-futures/16361.html

This article fails:

Quote

No ECC? The 7970 has ECC (rolls eyes)

No it doesn't. What GCN did was add ECC to all internal on-die memory (caches, local stores, etc), but the only cards AMD has that have ECC GDDR5 are FirePro/FireStream cards, and although they're normal GCN chips, they're not referred to as such.

multi#lord

Member

Offline

Activity: 66
Merit: 10