https://en.wikipedia.org/wiki/Chiplet
This seems to be about the third reason listed:
> Known good die (KGD): chiplets can be tested before assembly, improving the yield of the final device
Problem:
> In general, a killer defect is defined as a defect that is 20% the size of the fabrication node. For
> example, a defect that is less than 9nm may be acceptable for the 45nm fabrication node, but a defect
> larger than 2.8nm would be defined as a “killer” defect for the 14nm fabrication node. For the 5nm
> fabrication node, a defect measuring only 1nm could be a killer.
>
> This is one of the primary reasons that it has become increasingly difficult to yield large monolithic
> ICs (as measured in die area) when using leading edge fabrication process technology
Solution: I understood it from the visual explanation in the first chip image (AMDArt2.png) and its description in this article: https://www.nextplatform.com/2021/06/09/amd-on-why-chiplets-...
https://cdn.wccftech.com/wp-content/uploads/2018/11/AMD-EPYC...
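For concreteness, the 20% rule quoted above is easy to sanity-check with a couple of lines. A trivial sketch; the function name is mine and the 0.2 factor is just the rule of thumb from the quote, not real process data:

    # Rough sketch of the "20% of the node" rule of thumb quoted above;
    # nothing here is real process data.
    def killer_defect_threshold_nm(node_nm: float) -> float:
        """Approximate defect size (nm) that kills a die on a given node."""
        return 0.2 * node_nm

    for node in (45, 14, 5):
        print(f"{node}nm node: defects above ~{killer_defect_threshold_nm(node):.1f}nm are killers")
    # 45nm -> 9.0nm, 14nm -> 2.8nm, 5nm -> 1.0nm, matching the quoted examples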
Because a vacuum pretty much eliminates dust: with no air, dust just falls towards either the ground (if it is uncharged) or towards a positively or negatively charged surface (if the dust particle itself is charged).
Until 2005 or so, shrinking transistors automatically increased speed and reduced power consumption. When that ran out of steam, the industry went to multi core and massive parallelism with GPUs.
Until recently, each shrink also lowered the cost per transistor, but that seems to have run out as well. That has something to do with why Intel was stuck at 14nm for so long, and why new GPU prices are so insane despite a collapse in demand and the resolution of the supply chain crisis for high-end chips.
Chiplets at best are neutral with regard to cost. If manufacturing overhead is low, two chiplets give you twice the transistors at twice the cost. The industry did not pursue chiplets with a lot of vigor until now because they were a less competitive approach to scaling than shrinking transistors.
That business plan is no longer functional for Intel. Translated, Moore's business plan was to shrink die size and speed up clocks so aggressively as to obsolete their previous offering with an 18-month half-life.
It just doesn't work that way anymore, and hasn't for quite some time.
Engelbart's Scaling Observation[0], on the other hand, remains quite interesting, and from what I see remains in force. Genetics in particular is still pulling exponential gains out of the luminiferous ether.
So everyone who works in other domains, without access to libraries written by the GPU druids, largely ignores their existence.
Consumer machines vary wildly in their GPU capabilities, especially VRAM. So how do you know that your nice accelerated algorithm is going to work if the user has an old GPU? And what do you do if it doesn’t work? Run on the CPU? Tell the user their machine is too weak?
Here the advantage of GPUs (performance) is also the biggest disadvantage: a gigantic range of performance profiles. At least with CPUs the oldest CPU is only going to be a small integer factor slower than a new one in single thread.
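In practice this forces a capability check and a CPU fallback into the code. A minimal sketch of what that looks like, assuming PyTorch as the framework; the 6 GiB VRAM threshold is an arbitrary illustrative cutoff, not a recommendation:

    # Capability check with CPU fallback (assuming PyTorch).
    import torch

    def pick_device(min_vram_bytes: int = 6 * 1024**3) -> torch.device:
        """Use the GPU only if it exists and has enough VRAM; otherwise fall back to CPU."""
        if torch.cuda.is_available():
            props = torch.cuda.get_device_properties(0)
            if props.total_memory >= min_vram_bytes:
                return torch.device("cuda:0")
        # Old, small, or missing GPU: silently degrade to the (much slower) CPU path.
        return torch.device("cpu")

    device = pick_device()
    x = torch.randn(1024, 1024, device=device)
    y = x @ x  # same code, wildly different runtime depending on the user's hardware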
What unites gamers and machine learning is an expectation that the user has a reasonably recent and capable GPU. But these are small, self-selecting populations.
On the server side the issue is cost. GPUs are expensive, and usually not necessary, so nobody is going to write code that requires one without a good reason.
That cost about the same as a single mid-range video card (an Nvidia RTX 3070).
Why on earth would you add a requirement to your software/workflow that doubles your cost, and is just about impossible to find in stock?
However, they failed to get good results from the methods they had hoped would work, while the others, i.e. TSMC and Samsung, had much more realistic roadmaps that added EUV at the right moment.
Intel was not stalled by waiting for EUV; on the contrary, they were not prepared for the transition that was necessary when EUV was eventually ready.
https://www.theverge.com/2022/10/4/23385652/pat-gelsinger-in...
If progress continues then they will need some other expensive machine. Either that or they’ll try to stretch the life of EUV the same way Intel tried to delay EUV with extreme multiple patterning.
It seems to me though that the ASML machines ought to get some competition from something more like a free electron laser.
That doesn't take into account yields. One chip with twice the transistors is physically larger than two chips with half as many, and more likely to have a defect during production.
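A back-of-the-envelope sketch of that effect, using the simple Poisson yield model Y = exp(-D*A); the defect density and die areas below are made-up illustrative numbers, not real process data:

    # Poisson yield model: Y = exp(-D * A). All numbers are assumed for illustration.
    from math import exp

    D = 0.1              # defects per cm^2 (assumed)
    chiplet_area = 1.0   # cm^2 per chiplet (assumed)
    mono_area = 2.0      # cm^2 for one monolithic die with twice the transistors

    yield_mono = exp(-D * mono_area)        # the whole large die must be defect-free
    yield_chiplet = exp(-D * chiplet_area)  # each chiplet yields independently

    print(f"monolithic die yield: {yield_mono:.1%}")    # ~81.9%
    print(f"per-chiplet yield:    {yield_chiplet:.1%}") # ~90.5%
    # With known-good-die testing you only package chiplets that work,
    # so less silicon is thrown away per good transistor.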
> However, as it has in the past, the semiconductor ecosystem is adapting and as Chiplet technology builds traction, we will very likely see a period of accelerating innovation and new market opportunities opening as we move forward.
The whole premise is that chip innovation (and overall computing power) is continuing to accelerate, even though "Moore's Law as we've known it" has ended.
Especially in the startup space, I saw companies building software with the hypothesis “users need the latest device for our product and devices will get faster anyway, so we don’t need to optimize our code. Instead we deliver features at max speed, skipping optimizations, and wait until our users have upgraded to newer devices over the coming 2-4 years”.
I don’t even blame the companies for doing this. The benefit-to-cost ratio of using, idk, C++ for everything is just too bad.
https://www.cpubenchmark.net/singleThread.html
It's amazing how often this is parroted. Anyone with a passing familiarity with the numbers knows this is actually not true at all.
Better caching, better branch prediction, plus vast amounts of SRAM. There's been a slow & steady increase across the vast variety of single-threaded workloads grouped together under "instructions per clock."
That holds at the peak of the voltage-frequency curve for workstations & overclocking, at the apex of the optimization curve for data centres, and especially at the bare minimum for mobile devices with idle workloads.
Yes, it's a small fraction of the old days. It's still double in 10 years.
And as anyone who's migrated from an Intel Mac to Apple Silicon knows, "merely doubling" is a LOT.
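To put "double in 10 years" in perspective against the classic cadence, here is the implied annual growth rate (purely illustrative arithmetic):

    # Implied annual growth rate for a given doubling period.
    def annual_growth(years_to_double: float) -> float:
        return 2 ** (1 / years_to_double) - 1

    print(f"double every 18 months: ~{annual_growth(1.5):.0%} per year")  # ~59% per year
    print(f"double every 10 years:  ~{annual_growth(10):.1%} per year")   # ~7.2% per year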
4.5 years later and Intel is bragging that their upcoming topline CPU will run at 6GHz stock. I suppose one could call this a plateau compared to the good old days of the 80s and 90s, but it’s definitely still progress.
In late 2000, Intel promised that the Pentium 4 would hit 10GHz by 2005 (on a presumed 130W power budget), after the previous 5 years had seen clock speeds increase from 150MHz to 1.4GHz for the P6 architecture (at a stable 30-40W power budget), and other vendors saw similar increases.
Over 20 years later, we're barely scratching the 6GHz barrier, and only with an opportunistic turbo mode that isn't guaranteed to kick in if your cooling isn't up to the task of dissipating a record-breaking 250W of peak power consumption.
Pentium 4 HT 3.8F, November 2004, 3.8GHz, 115W TDP
Core i9-13900KF, October 2022, 3.0GHz, 125W TDP
Of course, the latter does give you 8 performance cores and 16 efficiency cores so performance-per-watt has clearly improved; and it has 'turbo boost'. But in terms of sustained single-core performance? It's clear Intel's attention has been elsewhere. Such as on the laptop market, where power efficiency is king.
And FWIW, I've had a 5GHz+ overclock on every CPU I've bought in the last ten years with a Corsair 240mm AIO, going back to the 3570K.
It's at least enough that we have to take it into account:
We run our workloads across multiple Intel CPU generations, and to be able to optimize utilization we maintain a "speedup factor", which is currently up to 1.7 for the latest generation we've tuned it for. The base 1.0 performance is Ivy Bridge, launched 2013.
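A minimal sketch of how such a factor gets applied when normalizing capacity across a mixed fleet; only the 1.0 Ivy Bridge baseline and the 1.7 figure come from the comment above, the generation labels and the intermediate value are placeholders:

    # Normalize physical cores into baseline-equivalent capacity units.
    SPEEDUP = {
        "ivy-bridge": 1.0,   # baseline generation (from the comment above)
        "mid-gen":    1.3,   # assumed intermediate value
        "latest":     1.7,   # latest generation tuned for (from the comment above)
    }

    def effective_cores(physical_cores: int, generation: str) -> float:
        """Convert physical cores into Ivy Bridge-equivalent capacity units."""
        return physical_cores * SPEEDUP[generation]

    # 32 cores of the latest generation do roughly the work of ~54 baseline cores.
    print(effective_cores(32, "latest"))  # 54.4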
It was basically a monopolistic warning from one of the founders not to release new products too fast so the company wouldn’t burn out. It worked. But it’s not some scientific or physical law and I hate when people refer to it as such.
According to the article, Moore's Law predates the founding of Intel by several years.
Not about transistor size or performance.
So it was an economic insight, more than anything else.
Moving to chiplets doesn’t change transistor density. This is a packaging feature and not a fabrication feature. This is done for manufacturing cost reduction and yield improvements.
Transistor density doubling every 18 months for a similar cost.
https://t7m8e9c8.rocketcdn.me/wp-content/uploads/2020/09/pre...
https://en.wikipedia.org/wiki/Moore%27s_law#cite_note-Moore_...
By those terms, Moore's law is totally extinct.
Folks haven't noticed, however, because the "leading edge" logic manufacturers have 60% gross margins. The vast majority of their costs are in design, distribution, and overhead.
Price rises of 30% to 100% have disguised the fact that the cost of manufacturing the silicon is an order of magnitude higher than it was a decade ago.
Granted, the above numbers are not the actual inflation adjusted wafer cost for leading edge nodes.
But, $16,000 for a 300mm wafer is extraordinary.
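For a sense of scale, here is what a $16,000 300mm wafer works out to per die, using the standard dies-per-wafer approximation; the die area and the assumption of zero yield loss are purely illustrative:

    # Dies-per-wafer approximation; only the $16,000 wafer price comes from above.
    from math import pi, sqrt

    wafer_cost = 16_000       # USD
    wafer_diameter = 300.0    # mm
    die_area = 100.0          # mm^2 (assumed, roughly a mid-size chiplet)

    dies_per_wafer = (pi * (wafer_diameter / 2) ** 2) / die_area \
                     - (pi * wafer_diameter) / sqrt(2 * die_area)

    print(f"~{dies_per_wafer:.0f} candidate dies per wafer")           # ~640
    print(f"~${wafer_cost / dies_per_wafer:.0f} per die before yield") # ~$25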
I am no silicon engineer, but I suspect a chip that fully takes advantage of the third dimension would be something like a sponge full of built-in channels for the working fluid to remove heat.
First, however, I suspect you will see chiplets arranged vertically like heatsink fins, with the whole CPU effectively being the water block; basically a VLSI version of the Cray-3.
What you're describing is called 3D stacking, and it works.
It's just extraordinarily complex to resolve the intra-die latency issues, and many others, when going vertical. Hence expensive.
3D is also used in AMD's 3D cache.
But 3D logic on logic is more complicated because of heat issues. AFAIK there's no workable technical solution for that yet.
If you look at flash memory, which isn't limited by heat, it already has dozens of functional layers, and then we also stack those silicon wafers.
3d stacking is another innovation that can help.
But I think within a decade or two there will be a move away from silicon-only transistors to something like memristors or some type of optical or optoelectronic system that hasn't even been invented yet. This will provide some iterations with again radical parallel interconnect and quite possibly single thread speedups.
Off-topic but I wonder how much cheaper mobile phones would be if the manufacturers did not have to come up with a hardware design update every year or so? What if mobile phones were built to last longer, which would reduce the cost per phone due to high volume? Of course, I am not suggesting this is good for business but as a mere thought experiment.
> With $100 million in funding, the IDEAS and POSH programs ... aim to combat the growing complexity and cost of designing chips, now approaching $500 million for a bleeding-edge SoC. Essentially, POSH aims to create an open-source library of silicon blocks, and IDEAS hopes to spawn a variety of open-source and commercial tools to automate testing of those blocks and knitting them into SoCs and printed circuit boards. If successful, the programs “will change the economics of the industry,” enabling companies to design in relatively low-volume chips that would be prohibitive today.
2017 vision, slide #22, https://www.darpa.mil/attachments/eri_design_proposers_day.p...
My DARPA dream
$ git clone https://github.com/darpa/idea
$ git clone https://github.com/darpa/posh
$ cd posh
$ make soc42
ERI Summit 2019, Intelligent Design of Electronic Assets (IDEA) & Posh Open Source Hardware (POSH), https://youtube.com/watch?v=pJubnAN3VKw
UCSD OpenROAD, https://theopenroadproject.org/ & https://vlsicad.ucsd.edu/Publications/Conferences/378/c378.p...
> OpenROAD is a front-runner in open-source semiconductor design automation tools and know-how. Our project reduces barriers of access and tool costs to democratize system and product innovation in silicon. The OpenROAD tool and flow provide autonomous, no-human-in-the-loop, 24-hour RTL-GDSII capability to support low-overhead design exploration and implementation through tapeout. We welcome a diverse community of designers, researchers, enthusiasts and entrepreneurs who use and contribute to OpenROAD to make a far-reaching impact.
https://semiengineering.com/will-open-source-eda-work/
> All the big EDA providers, as well as leading chip companies, are active contributors to ERI projects. In fact, Cadence, Synopsys, Mentor, NXP, Intel, IBM, Intel, Qualcomm, Arm, Nvidia, Analog Photonics, SRI International and Applied Materials all have contributed speakers and engineers or materials to ERI effort ... the key to getting industry players to accept open-source EDA is whether it makes the design process more efficient without breaking anything—and whether it is possible to extract decades worth of design experience from libraries of millions of existing designs and use that to spot errors in real time in existing designs.
Shocking