If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#1 |
|
Member
Join Date: Dec 2009
Posts: 171
|
Not strictly related to GPUs but most people working on GPGPU computing are in the HPC field so you may be interested.
There is a new project looking for money on Kickstart: http://www.kickstarter.com/projects/...r-for-everyone With the the HPC scene pretty much dominated by GPUs (i.e. SIMD or SIMT or whatever they are called today), trying a pure MIMD route could be interesting. Porting existing applications will still not be a trivial task (i.e. Parallella looks a lot like the old Transputer) but it could be a lot easier than with SIMD. |
|
|
|
|
|
#2 |
|
Invisible Member
Join Date: Apr 2002
Location: La-la land
Posts: 4,984
|
They have a very wonky way of counting chip performance by adding up the clock speed of all the processor cores. That's just not proper, from any point of view.
__________________
"If I were a science teacher and a student said the Universe is 6000 years old, I would mark that answer as wrong (why? Because it is)." -Phil Plait |
|
|
|
|
|
#3 |
|
Senior Member
Join Date: Mar 2005
Location: Oregon
Posts: 1,692
|
The performance per watt and price don't seem that impressive.
|
|
|
|
|
|
#4 |
|
Moderator
Join Date: Feb 2002
Location: Taiwan
Posts: 2,346
|
Its off-chip bandwidth seems to be just too low. Of course, such project is probably still good for people to experiment with MPI or OpenMP (i.e. not for real works, but can learn how to do proper parallel programming).
|
|
|
|
|
|
#5 |
|
Senior Member
|
Yeah, that was a forgivable marketing misstep in the early days of dual-core processors, when conveying the advantage of a dual-core was deemed difficult, but now it just makes them look amateurish and really hurts their credibility.
__________________
"Well, you mentioned Disneyland, I thought of this porn site, and then bam! A blue Hulk." —The Creature My (currently dormant) blog: Teχlog |
|
|
|
|
|
#6 |
|
Tea maker
Join Date: Feb 2002
Location: In the Island of Sodor, where the steam trains lie
Posts: 4,379
|
oooh. Deja vu indeed.
__________________
"Your work is both good and original. Unfortunately the part that is good is not original and the part that is original is not good." -(attributed to) Samuel Johnson "I invented the term Object-Oriented, and I can tell you I did not have C++ in mind." Alan Kay |
|
|
|
|
|
#7 | |
|
Senior Member
|
Quote:
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
|
#8 |
|
Senior Member
Join Date: Mar 2010
Location: Cleveland, OH
Posts: 1,553
|
If you think their MHz added numbers you should look at the performance "comparison" video they made:
http://www.youtube.com/watch?feature...&v=4sMWbaV1sRQ I think this could be the worst benchmark test I've ever seen. As for the actual product.. can anyone think of any applications where pure MIMD is a win, but only with 32KB of fast memory per core, and an emphasis on single-precision floating point arithmetic? It would be interesting to learn more about their core uarch, but unfortunately you're told to contact their sales division if you want their hardware manual (way to put your first foot forward in embracing that open hardware philosophy you go on about on your KS page guys). They do say somewhere that it's superscalar; I wonder if this is a simple VLIW. You'd have to think that the alleged power budget allocated for these cores wouldn't allow for much, if anything in the way of branch prediction, so I doubt they'd be any good at general purpose code even if it did exhibit extreme and explicit memory locality. |
|
|
|
|
|
#9 |
|
Member
Join Date: Jul 2005
Location: Austin, Tx
Posts: 408
|
I think you guys would be surprised how often companies add up MHz. I've seen it at least in two different companies now. They're both in the signal processing fields so it maybe specific to that industry,
From my experience, these types of processors are a good match for prototyping work of signal processing tasks. For example in a wireless receiver you may have a very simple processing chain look like: resampler->FFT->Channel Estimate->Equalizer->Symbol Demapper->Decoder. In theory, you can parallelize those components and map the whole chain onto the processor. This approach is basically FPGA like, but written in C (year right, it's all ASM) instead of RTL. If you can actually run your tasks in real time, you can actual test your performance before taping out an ASIC. Many companies do this by running the ASIC RTL on FPGA. For a final product I don't think this makes sense. However, if a task is highly parallel and requires a lot of processors, you are better off using one processor with a wide SIMD vector since it will be much more power efficient. Chances are that a task spread out over a whole processor like this will be bottlenecked by the interprocessor communications network. If that communication isn't deterministic, well, good luck both debugging and designing it in way that it works efficiently without overhead. If you're tasks are serial, chances are they will be the bottlenecked by the single thread performance of each core. It's doubtful these types of cores are powerful at all. Without a revolutionary BW/Shared memory innovation, I don't think these things will ever work. Transputer right. |
|
|
|
|
|
#10 |
|
Member
Join Date: Jul 2007
Location: Russia
Posts: 96
|
So far the most detailed description is here:
http://www.adapteva.com/wp-content/u...apteva_mpr.pdf |
|
|
|
|
|
#11 | |
|
Member
Join Date: Dec 2009
Posts: 171
|
Quote:
|
|
|
|
|
|
|
#12 | |
|
Senior Member
Join Date: Mar 2010
Location: Cleveland, OH
Posts: 1,553
|
Quote:
Actually, what I really wonder is how it compares with Ziilabs stemcell array SoCs, because on paper it looks pretty similar, although it appears to have a somewhat different core interconnect topology. |
|
|
|
|
|
|
#13 |
|
Member
Join Date: Jan 2009
Posts: 215
|
Epiphany architecture reference manuals released.
http://www.kickstarter.com/projects/...everyone/posts I have backed the project. I am actually pretty excited about this. |
|
|
|
|
|
#14 |
|
Senior Member
Join Date: Feb 2002
Posts: 2,019
|
Why are you excited about it?
|
|
|
|
|
|
#15 |
|
Member
Join Date: Dec 2009
Posts: 171
|
Because:
- it is a MIMD (and not a SIMD like all GPUs). It can solve some kind of problem a lot better than GPUs even with a lower peak performance. - it has an extremely good GFLOPS/Watt ratio. - the dies size is 2mm^2 (the guessed die size of a AMD 7970 is 354mm^2). There is a lot of room for improvements. - it usable in many embedded applications where GPUs aren't very practical because of their size/cooling/power requirements. Good luck installing a Fermi GPU in a small drone. - they want to open source all the software. - you can buy one at 99$ and you get a complete PC (ethernet, usb, hdmi, etc.) capable to run Ubuntu and where you can test OpenCL, etc. applications. It has a great educational value. |
|
|
|
|
|
#16 |
|
Tea maker
Join Date: Feb 2002
Location: In the Island of Sodor, where the steam trains lie
Posts: 4,379
|
Some GPUs are MIMD.
__________________
"Your work is both good and original. Unfortunately the part that is good is not original and the part that is original is not good." -(attributed to) Samuel Johnson "I invented the term Object-Oriented, and I can tell you I did not have C++ in mind." Alan Kay |
|
|
|
|
|
#17 |
|
Member
Join Date: Dec 2009
Posts: 171
|
|
|
|
|
|
|
#18 |
|
Darlek ******
Join Date: Jun 2004
Posts: 9,486
|
If this is a supercomputer why are they showing benchmarks comparing it to a cortex A9 ?
__________________
Guardian of the Most holy Two Terabytes of Gaming Goodness™ |
|
|
|
|
|
#19 |
|
Member
Join Date: Feb 2002
Posts: 586
|
|
|
|
|
|
|
#20 | |
|
Senior Member
|
Quote:
![]() And Tahiti has 32 CUs running simultaneously. It just happens to be SIMD on top of it. It's MIMerD: Multiple Instructions, Multipler Data
__________________
"Well, you mentioned Disneyland, I thought of this porn site, and then bam! A blue Hulk." —The Creature My (currently dormant) blog: Teχlog |
|
|
|
|
|
|
#21 |
|
Member
Join Date: Jul 2005
Location: Austin, Tx
Posts: 408
|
I do like they're way of addressing data. My previous experience with stuff like this required MPI and it was a nightmare to manage the data. Data partitioning and movement had the potential to be as hard, if not harder, to handle efficiently in software than the partitioning of the operations.
I do wonder how efficient the system is when all processors are loaded and you're hitting memory from all over the chip. |
|
|
|
|
|
#22 |
|
Member
Join Date: Aug 2004
Posts: 244
|
|
|
|
|
![]() |
| Thread Tools | |
| Display Modes | |
|
|