If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#76 |
|
Senior Member
|
I don't think it is SRAM.
|
|
|
|
|
|
#77 |
|
Member
Join Date: Jun 2008
Posts: 335
|
Really? Seems tiny if its DRAM on an interposer, I wouldn't think they would need an interposer at all if it was only 64MB DRAM. I mean, in 2013, they should have 4gbit DRAM chips, and 2gbit has a raw die size of ~55mm^2 nowadays IIRC? Thats gotta be below 10mm^2 for 64MB DRAM, whats the interposer for?
Unless of course this info is completely wrong and its much more than 64MB... I was expecting 512MB or 1GB stacked next to the die, personally. |
|
|
|
|
|
#78 |
|
Senior Member
|
maybe it was 512MBytes but the article authors thought it was 512Mbits and converted it to 64MBytes.
|
|
|
|
|
|
#79 |
|
Member
Join Date: Sep 2011
Posts: 109
|
|
|
|
|
|
|
#80 |
|
Red-headed step child
Join Date: Jun 2004
Location: Guess ;)
Posts: 3,084
|
And yet, would be epic badassery if there's 512MBytes of ram stacked on the die at full clock and a fat connection. I'm not counting on that, though...
__________________
"...twisting my words" |
|
|
|
|
|
#81 |
|
Member
Join Date: Jun 2008
Posts: 335
|
That would be awesome. Though as I think about it, I'm starting to see the appeal of using SRAM even if it was only 64MB. Would essentially be a huge L4 cache that could be shared with the CPU/GPU, and could be used as a framebuffer the way Xbox360 uses its eDRAM. The low latency and huge bandwidth could make for some interesting efficiency gains in the pipeline...
|
|
|
|
|
|
#82 |
|
Senior Member
|
Bear in mind that we're talking about a mainstream, notebook-oriented part, so cost and power are important concerns.
__________________
"Well, you mentioned Disneyland, I thought of this porn site, and then bam! A blue Hulk." —The Creature My (currently dormant) blog: Teχlog |
|
|
|
|
|
#83 | |
|
Senior Member
|
Quote:
They seem to have traded off capacity for much larger bandwidth. I hope they are able to hit ~100GBps, if they are going with only 64MB. The comment over on die framebuffers to feed the display during low power states is also interesting. They would need ~8MB just for that. I am not sure where they will find room for that given 32nm duals have 4MB L3 total. |
|
|
|
|
|
|
#84 |
|
Senior Member
|
![]() Haswell most likely to have 50% more ALUs, according to the die-shot from an early running sample.
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
|
#85 |
|
Member
Join Date: Sep 2011
Posts: 109
|
Who says that this is a Haswell die?
According to Hiroshige Goto Haswell GT2 has 80 ALUs while GT3 has 160. http://pc.watch.impress.co.jp/img/pc...tml/8.jpg.html |
|
|
|
|
|
#86 |
|
Senior Member
|
|
|
|
|
|
|
#87 |
|
Senior Member
Join Date: Apr 2007
Posts: 1,394
|
Kaveri with GDDR5?
http://www.donanimhaber.com/islemci/...re-gidiyor.htm |
|
|
|
|
|
#88 | |
|
French frog
Join Date: Jun 2005
Location: France
Posts: 4,172
|
Quote:
Amd bought a memory company not that long ago they could produce gddr5 modules. I don't think all the memiry would be gdd5 but a module could be gddr5. basivally kaveri could support two 128 bit buses one for gddr5 and one the ddr3.
__________________
What's trying to be a bunch of presentations PS360 youtube channel Sebbbi about virtual texturing Tuned EADGCF and liking it :) Last edited by liolio; 22-May-2012 at 08:18. |
|
|
|
|
|
|
#89 | |
|
Member
Join Date: Oct 2003
Posts: 320
|
Quote:
They should just make Kaveri boards w/ GDDR5 soldered on (much like a graphics board) and they'd much better memory bandwidth by using a fat bus and a memory controller similar to what they use in GPUs instead of using a bottlenecked dimm interface. I can see lower end parts having 8 GB and higher end parts 16 GB w/ 100+ GB/s compared to ~30 GB/s today. |
|
|
|
|
|
|
#90 |
|
French frog
Join Date: Jun 2005
Location: France
Posts: 4,172
|
Well Density for GDDR5 memory chips is not the same as for DDR3. Is more than 4GB doable? Which bus width?
Extra latencies would impact negatively the CPU performance, no? Two buses would fit easily in llano/trinity. Even a 64Bits bus would helps granting the chip with extra ~30GB/s of bandwidth.
__________________
What's trying to be a bunch of presentations PS360 youtube channel Sebbbi about virtual texturing Tuned EADGCF and liking it :) |
|
|
|
|
|
#91 |
|
Senior Member
Join Date: Apr 2007
Posts: 1,394
|
GenX (7.5) GT4 @14nm (Broadwell) with up to 2 TFLOPs?
http://www.inpai.com.cn/doc/hard/180995.htm HSW GT3 could be in the 1 TFLOPs range. |
|
|
|
|
|
#92 |
|
Senior Member
|
Sounds fake, if you ask me.
7970 is <4T today. And it eats >200W. EDIT: surreal ->fake |
|
|
|
|
|
#93 | |
|
Senior Member
|
Quote:
Ideal scaling would take that to 43.52 GFLOPS/W on 20nm, and 87.04 on 14nm. So you'd only need <12W to get to 1TFLOPS on 14nm. Of course, ideal scaling is a pipe dream these days, but with a power budget of ~75W for a desktop chip and full, aggressive bi-directional power management, it seems doable. They'd need to do something big about memory bandwidth, though.
__________________
"Well, you mentioned Disneyland, I thought of this porn site, and then bam! A blue Hulk." —The Creature My (currently dormant) blog: Teχlog |
|
|
|
|
|
|
#94 | |
|
Senior Member
|
Quote:
|
|
|
|
|
|
|
#95 | |
|
Senior Member
|
Quote:
GPU performance of that order could eliminate the need for discrete GPUs altogether, even for hi DPI displays. |
|
|
|
|
|
|
#96 | ||
|
hardly a Senior Member
Join Date: Jul 2008
Location: still camping with a mauler
Posts: 3,637
|
Quote:
Memory bandwidth is always the larger problem with iGPU, and it isn't clear how (or if) there is a plan to solve that. Although I hear there may be some crazy EDRAM type stuff coming down the pipe. The question then is how to make use of it staying in the D3D API, or if each iGPU architecture will require custom code to make use of it.
__________________
Quote:
|
||
|
|
|
|
|
#97 | |
|
Senior Member
Join Date: Dec 2004
Location: Toulouse
Posts: 4,144
|
Quote:
I think Intel kept things simple for themselves, they are strong at making SRAM and it's done at every process, it has to be low cost too and released quickly while being some of the latest tech, maybe this is the first memory-on-interposer mass product? |
|
|
|
|
|
|
#98 | |
|
Senior Member
Join Date: Dec 2004
Location: Toulouse
Posts: 4,144
|
Quote:
two buses would just make your chip and socket much more expensive, such wide CPU are made and sold on socket G34 and 2011. a memory controller, or a pair of 64bit ones that support both gddr5 and ddr3 would be more reasonable and is like the CPU I'm using, which supports both ddr2 and ddr3. |
|
|
|
|
|
|
#99 | |
|
Senior Member
|
Quote:
|
|
|
|
|
|
|
#100 | |
|
Member
Join Date: Sep 2006
Posts: 273
|
Quote:
I don't even know how Haswell will improve performance even by 2x on the 15W Ultrabook parts. If Anandtech's measurements are accurate, it takes ~4W for the CPU core, 9W for iGPU and 4W for the rest of the CPU(in typical games). If you want to bring that down to 15W, and double iGPU performance you need: 18W performance down to 7W, or 2.6x the improvement in perf/watt. Ivy Bridge is said by Intel to use 2x performance/watt at same performance level as Sandy Bridge, but does not reach the same 2x perf/watt at greater performance. Obviously because they had to use a combination of lowered clocks and process tech advancement to achieve 0.5x the power usage. How will they do that on higher performance? |
|
|
|
|
![]() |
| Thread Tools | |
| Display Modes | |
|
|