Some 4090s have this extra fp16 -> fp32 ALU disabled because it's defective on that chip.
Other 4090s have it disabled because it failed as an Ada 6000 for some other reason, but NVIDIA didn't want to make a 4095 SKU to sell it under.
Or if you generalize this for every fusable part of the chip: NVIDIA didn't want to make a 4094.997 SKU that only one person gets to buy (at what price?)
For the purposes you tested it for, sure. Maybe some esoteric feature you don't use is broken, and NVIDIA still can't sell it as the higher-end SKU. The tests a chip maker runs to bin their chips are not the same tests you might run.
I'm sure chip makers make small adjustments to supply via binning to satisfy market demand, but if the "technical binning" is too far out of line from the "market binning", that's a lot of money left on the table that will get corrected sooner or later.
edit: And that correction might be in the form of removing redundancies from the chip design, rather than increasing the supply/lowering the price of higher-end SKUs. The whole point here is that those are two sides of the same coin.
Runs fast? i9. Slower? i7. Missing cores? i5. Slowest? i3.
Perfect chips probably not only have all the cores working; they also run at low voltages, so they don't get as hot.
I wonder if they can figure out what parts of the chip run at what speeds, and disable the ones that run slow/hot.
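That's essentially what a binning flow does. As a toy sketch of the decision (every threshold, field, and bin name below is invented for illustration; real bin criteria are far more involved and not public):

    #include <stdio.h>

    /* Hypothetical per-die test report; all fields and thresholds invented. */
    typedef struct {
        int   good_cores;      /* cores that passed functional test */
        float max_stable_ghz;  /* highest clock that survived stress testing */
        float volts_at_base;   /* voltage needed to hold the base clock */
    } die_report;

    const char *assign_bin(die_report d) {
        if (d.good_cores >= 8 && d.max_stable_ghz >= 5.5f && d.volts_at_base < 1.10f)
            return "i9";   /* fully working, fast, runs cool */
        if (d.good_cores >= 8 && d.max_stable_ghz >= 5.0f)
            return "i7";   /* fully working, but slower or hotter */
        if (d.good_cores >= 6)
            return "i5";   /* fuse off the bad cores, sell the rest */
        if (d.good_cores >= 4)
            return "i3";
        return "scrap";
    }

    int main(void) {
        die_report d = { .good_cores = 6, .max_stable_ghz = 5.2f, .volts_at_base = 1.15f };
        printf("bin: %s\n", assign_bin(d));  /* prints "bin: i5" */
        return 0;
    }

And yes, the voltage part is caught the same way: each die is characterized at several voltage/frequency points and binned by the best curve it can hold.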
I'm pretty sure GPUs are overclocked by vendors, so there must be some sort of binning: either the vendors do it themselves or they buy pre-binned parts. I'll bet if parts could go faster, you would have an ASUS/MSI/etc 4090-2x-max-$$$$.
https://www.tomshardware.com/reviews/glossary-binning-defini...
It's like the logic people have on /r/pcmasterrace: if they didn't bin, they would just release all 4090s at 4080 prices. No, there would just be fewer 4080s for people to buy. No chip maker is going to sell their chips at sub-market rates just because they engineered them to / have a fab that can produce them at very low defect rates.
Now, Nvidia certainly has done dubious things. They've hurt their partners (EVGA, you're missed), and skyrocketing the baseline GPU prices is scummy as hell. But binning isn't something I necessarily consider dubious.
Sure, and more 4090s at a lower price.
Different markets, same techniques. It's in the company DNA. That and their aversion to open source drivers and various other bad decisions around open source support that make maintaining a working Linux GPU setup way harder than it should be even to this day.
For this performance nerf, IDK, seems fine to me. Software companies do this all the time. Same piece of software but you have to pay to unlock features. I don't see why hardware should be any different.
Granted, even for the CAD nerf, it's a gray area. You pay for features, not for silicon, and NVidia is clear about what you have to pay for what features, so. But I'm a bit more biased on that one because my 10-person HW company had to spring for several of those workstation cards.
Before AI and crypto were a thing, this so-called price discrimination was what kept the company afloat and let it continue to spend stupid money (according to many) on CUDA rather than on making gaming better.
And often when people say "X" needs competition, what they really want is just a cheaper price for the same thing. Would it be great if Nvidia had more competition? Absolutely. But is Nvidia sitting still and milking everything they have? Absolutely not. They invested even more in CUDA, large-die-size correction tooling, assisted EDA design, and much more in their arsenal to build their moat.
I've also often found that when a successful founder is still working at a company, that company is pushed far harder by its founder than by whatever market force is driving it. So we need competition for Intel circa 2009-2021, Microsoft in 2000, or any company that just sits there and no longer improves or fails to execute.
Nvidia? They are doing fine, if not better than I could have imagined. (I just wish they'd spend a little more money to compete in the mobile and desktop consumer SoC space. I guess that is coming soon.)
They'd probably be happy with competitive market pricing, which is unlikely to happen without a competitive market.
Nvidia's gross margins are extraordinary vs. AMD or Intel margins (or even "overpriced" Apple).
There is competition of a sort - for example at the low end with Intel Arc and at the high end with AMD Instinct-based supercomputers. However, competitors don't seem to be able to match the CUDA software platform, particularly for AI/ML. The deepening CUDA moat is hard for competitors to cross and for customers to escape.
In the cloud space it seems that Nvidia may face more competition with platforms like GCP/Cloud TPU and AWS/Trainium.
We're lucky they still do gaming at all, by limiting the datacenter chips.
It's like getting a speed-limited Ferrari for USD 20,000 and then complaining I don't get the acceleration of the USD 100,000 model. They sold the product cheaper, they cared, they adapted. I'm happy they are still improving year after year for the same dollar value.
No, they don't.
That they are able to price discriminate this way is a sign that they are functionally a monopoly exercising pricing power; otherwise they would be easily undercut by a competitor in the markets where they charge premium prices.
My point is these things don't seem to last in tech.
What makes Nvidia seem unbeatable is that Nvidia does the best job on hardware design, does a good job on the software for the hardware and gets its designs out quickly such that they can charge a premium. By the time the competition makes a competitive design, Nvidia has the next generation ready to go. They seem to be trying to accelerate their pace to kill attempts to compete with them and so far, it is working.
Nvidia does not just do the same thing better in each new generation; it tries to fundamentally change the paradigm to obtain better-than-generational improvements. That is how they introduced SIMT, tensor cores, FP8 and more recently FP4, just to name a few. While their competitors are still implementing the last round of improvements Nvidia made to the state of the art, Nvidia launches yet another round.
For example, Nvidia has had GPUs on the market with FP8 for two years. Intel just launched their B580 discrete GPUs and Lunar Lake CPUs with Xe2 cores, and there is no FP8 support to be seen as far as I have been able to gather. Meanwhile, Nvidia will soon be launching its 50 series GPUs with FP4 support. AMD's RDNA GPUs are not poised to gain FP8 until the yet-to-be-released RDNA 4, and I have no idea when Intel's Arc graphics will gain FP8. Apple's recent M4 series does have FP8, but no FP4 support.
Things look less bad for Nvidia's competitors in the enterprise market: CDNA 3 launched with FP8 support last year, Intel had Gaudi 2 with FP8 support around the same time as Nvidia and has since launched Gaudi 3, and then there is Tenstorrent with FP8 on the Wormhole processors they released 6 months ago. However, FP4 support is nowhere to be seen from any of them, and they will likely not release it until well after Nvidia, just like nearly all of them did with FP8. This is only naming a few companies, too; there are many others in this sector that have not even touched FP8 yet.
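For concreteness, FP8 E4M3 (the variant mostly used for inference) is just 1 sign bit, 4 exponent bits with bias 7, and 3 mantissa bits; per the published OCP 8-bit floating point spec it has no infinities and a single NaN bit pattern. A minimal decoder, with the function name being my own:

    #include <math.h>
    #include <stdint.h>
    #include <stdio.h>

    /* Decode an FP8 E4M3 byte. Max finite value is 448; only S.1111.111 is NaN. */
    float fp8_e4m3_to_float(uint8_t v) {
        int sign = (v >> 7) & 1;
        int exp  = (v >> 3) & 0xF;
        int man  = v & 0x7;
        float f;
        if (exp == 0)
            f = ldexpf(man / 8.0f, -6);              /* subnormal: man/8 * 2^(1-7) */
        else if (exp == 15 && man == 7)
            f = NAN;                                 /* the single NaN encoding */
        else
            f = ldexpf(1.0f + man / 8.0f, exp - 7);  /* normal: (1 + man/8) * 2^(exp-7) */
        return sign ? -f : f;
    }

    int main(void) {
        printf("%g\n", fp8_e4m3_to_float(0x38));  /* exp=7, man=0 -> 1 */
        printf("%g\n", fp8_e4m3_to_float(0x7E));  /* exp=15, man=6 -> 448, max finite */
        return 0;
    }

The draw is that multiply units for 8-bit values like these are far smaller than FP16 ones, so a tensor core can pack many more of them into the same silicon area; that is the kind of lead being described here.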
In any case, I am sure that in a generation or two after Blackwell, Nvidia will have some other bright idea for changing the paradigm and its competition will lag behind in adopting it.
So far, I have only discussed compute. I have not even touched on graphics, where Nvidia has had many more innovations, on top of some of the compute oriented changes being beneficial to graphics too. Off the top of my head, Nvidia has had variable rate shading to improve rendering performance, ray tracing cores to reinvent rendering, tensor cores to enable upscaling (I did mention overlap between compute and graphics), optical flow accelerators to enable frame generation and likely others that I do not recall offhand. These are some of the improvements of the past 10 years and I am sure that the next 10 years will have more.
We do not see Nvidia's competition put forward nearly as many paradigm-changing ideas. For example, AMD did "smart access memory" more than a decade after it had been standardized as Resizable BAR, which was definitely a contribution, but not one they invented. For something that they actually did invent, we need to look at HBM. I am not sure if they or anyone else I mentioned has done much else. Beyond the companies I mentioned, there are Groq and Cerebras (maybe Google too, but I am not sure) with their SRAM architectures, but that is about it as far as I know of companies implementing paradigm-changing ideas in the same space.
I do not expect Nvidia to stop being a juggernaut until they run out of fresh ideas. They have produced so many ideas that I would not bet on them running out of new ideas any time soon. If I were to bet against them, I would have expected them to run out of ideas years ago, yet here we are.
Going back to the discussion of Intel seeming to be unbeatable in the past, they largely did the same thing better in each generation (with occasional ISA extensions), which was enough when they had a process advantage, but it was not enough when they lost their process advantage. The last time Intel tried to do something innovative in its core market, they gave us Itanium, and it was such a flop that they kept doing the same thing incrementally better ever since then. Losing their process advantage took away what put them on top.
This is the most important point. Everyone seems to think that Nvidia just rests on its laurels while everyone and their dog tries to catch up with it. This is just not how (good) business works.
In summary, the software story was surprisingly better than I expected (no JAX though).
It's able to get the best process node from whoever is willing to sell it to Nvidia, which means it's vulnerable (however unlikely) to something very similar - a competitor with a process advantage.
BK failed to understand the moat Intel had was the Fab. The moat is now gone and so is the value.
Rich webapps hadn't been invented. Smartphones? If you're lucky your flip phone might have a colour screen. If you've got money to burn, you can insert a PCMCIA card into your Compaq iPAQ and try out this new "802.11b" thing. Java was... being Java.
Almost all the software out there - especially if it had a GUI, and a lot of it did - was distributed as binaries that only ran on x86.
I'm thinking about buying Nvidia
(this is bullshit)
AMD and Intel might only be competing in the entry-to-midrange market sector but my needs aren’t likely to exceed what RX 8000 or next-gen Intel cards are capable of anyway.
And this goes down to consumer drivers too. I've sworn to myself that I'm not buying AMD for my next laptop, after endless instability issues with the graphics driver. I don't care how great and cheap and performant and whatever it is when I'm afraid to open Google Maps because it might kernel panic my machine.
AMD is definitely not perfect but I don't think it's fair to say they decided not to invest in software. Better late than never, and I'm hoping AMD learned their lesson.
Perhaps they were distracted by dismantling Intel's CPU hegemony? I wouldn't fault them for that, fighting 2 Goliaths simultaneously isn't a sound strategy.
The hardware is a bit finicky, but honestly I prefer a thing to just be broken and tricky as opposed to Nvidia intentionally making my life hard.
Edit: the only downside is that the HW H.265 encoder is pretty bad. AV1 is fine though.
The drivers are unreliable, Adrenalin is buggy, slow, and bloated; AMD's cards have poor raytracing, and AMD's compute is a dumpster fire, especially on Windows; ROCm is a joke.
None of the LLM or Stability Matrix stuff works on AMD GPUs under Windows without substantial tweaking, and even then it's unreliable garbage, whereas the NVIDIA stuff Just Works.
If you don't care about any of that and just want "better than integrated graphics", especially if you're on Linux where you don't need to worry about the shitshow that is AMD Windows drivers - then sure, go for AMD - especially the cards that have been put on sale (don't pay MSRP for any AMD GPU, ever. They almost always rapidly discount.)
AMD simply doesn't care enough to compete with NVIDIA for the desktop market. They have barely a few percent of the desktop GPU market; they're more interested in stuff like gaming consoles.
Intel is the only one who will push AMD - and that push will force them to either compete, or let their product line stagnate and milk as much profit out of the AMD fanboys as they can.
On one hand, it's very bad because it reduces economic output from the exact same input resources (materials, labor, and R&D).
On the other hand, allowing market segmentation, and more profits from the higher segments, allows more progress and scaling for the next generation of parts (smaller process nodes aren't cheap, and neither is chip R&D).
man.
This is not true with the economies of scale in the semiconductor industry.
We should demand that it's unlockable after a certain time.
So I don't think the nerfing here was to lower power consumption. It's just market segmentation to extract maximum $$$$ from ML workloads.
nvidia have always been pretty open about this stuff - they have EULA terms saying the GeForce drivers can't be used in data centres, software features like virtual GPUs that are only available on certain cards, difficult cooling that makes it hard to put several cards into the same case, awkward product lifecycles, contracts with server builders not to put gaming GPUs into workstations or servers, removal of nvlink, and so on.
In a competitive market, if you have a surplus of top-tier chips, you lower prices and make a little more $$ selling more power.
With a monopoly (customers are still yours next upgrade cycle), giving customers more power now sabotages your future revenue.
By the way, AMD also uses fuse blowing if you e.g. overclock some of their CPUs, to mark them as warranty-voided. They give you a warning in the BIOS, and if you proceed, a fuse inside the CPU gets blown that will permanently indicate that the CPU has been used for overclocking (and thus void the warranty).
I wouldn't dismiss this so aggressively.
Frequently (more frequently than not), efuses are simply used as configuration fields checked by firmware. If that firmware can be modified, the value of the efuse can be ignored. It's substantially easier to implement a fused feature as a bit in a big bitfield of "chicken bits" in one-time programmable memory than to try to physically fuse off an entire power or clock domain, which would border on physically irreversible (this is done sometimes, but only where strictly necessary and not often).
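As a sketch of that pattern (every address and bit name below is invented purely for illustration, not any real NVIDIA or AMD register map):

    #include <stdbool.h>
    #include <stdint.h>

    /* Invented OTP fuse bank; bits can only ever go from 0 to 1. */
    #define FUSE_BANK_ADDR   0x40001000u
    #define FUSE_BIT_OC_USED (1u << 3)   /* "overclocking was enabled" fuse      */
    #define FUSE_BIT_FP32ACC (1u << 5)   /* "disable extra fp16->fp32 ALU" fuse  */

    static inline uint32_t read_fuses(void) {
        return *(volatile const uint32_t *)FUSE_BANK_ADDR;
    }

    /* The firmware merely honors the fuse; patched firmware could skip this
       check entirely, which is why this is weaker than physically fusing off
       a whole power or clock domain. */
    bool fp32_accum_allowed(void) {
        return (read_fuses() & FUSE_BIT_FP32ACC) == 0;
    }

    bool warranty_intact(void) {
        return (read_fuses() & FUSE_BIT_OC_USED) == 0;
    }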
Bullshit. There will be hackers in the future who can do it in their garage. Just... not anytime soon.
> By the way, AMD also uses fuse blowing if you e.g. overclock some of their CPUs, to mark them as warranty-voided. They give you a warning in the BIOS, and if you proceed, a fuse inside the CPU gets blown that will permanently indicate that the CPU has been used for overclocking (and thus void the warranty).
Emphasis on "some." You can buy plenty of CPUs from them made for overclocking.
There was a time when Intel was top dog, and for a long time; now AMD is competing for 1st place.
AMD and/or other competitors will have their day, but today it's Nvidia.
https://xcancel.com/realGeorgeHotz/status/1868356459542770087

If a product is made, and the cost to provide that product is the same one way or the other, but you cripple it to create segmentation, then that is greed. Period. Objectively. And if you're okay with that, then fine, no problem. Just don't try to tell me it isn't maximization of profit.
There are no heroes in the megacorp space, but it would be nice for AMD and Intel to bring Nvidia to heel.
With that out of the way, market segmentation is often good for budget customers, who, in the case of Nvidia GPUs, are gamers. They get GPUs that run their games just as well as the uncrippled model, for a much lower price. Without market segmentation, all the GPUs would go to Amazon, Microsoft, Google, etc., since they are the ones with the big budgets; gamers would be left with GPUs they can't afford, and Nvidia with less profit, as it would lose most of the gamer market.
With market segmentation, Nvidia wins, gamers win, AI companies and miners lose. And I don't know about you, but I think that AI companies and miners deserve the premiums they pay.
It sounds stupid to pay for crippled hardware, but when buying a GPU the silicon is only a small part of the price; the expensive part is all the R&D, and that cost is the same no matter how many chips they sell. It makes sense to maximize sales, and segmentation is how they do it without sacrificing their profits.
Of course, should AMD or Intel come back, they would do their own market segmentation too, in fact, they already do.
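A toy calculation makes the segmentation argument above concrete (all demand numbers and prices invented):

    #include <stdio.h>

    int main(void) {
        /* Invented demand: what each segment will pay for the same die. */
        double gamers = 10e6, gamer_price = 1500.0;
        double dcs    = 1e6,  dc_price    = 30000.0;
        double unit_cost = 500.0, rnd = 5e9;  /* R&D is fixed regardless of volume */

        /* One uniform price at the gamer level: everyone buys. */
        double uniform_low = (gamers + dcs) * (gamer_price - unit_cost) - rnd;

        /* One uniform price at the datacenter level: gamers priced out. */
        double uniform_high = dcs * (dc_price - unit_cost) - rnd;

        /* Segmented: each group pays its own price. */
        double segmented = gamers * (gamer_price - unit_cost)
                         + dcs * (dc_price - unit_cost) - rnd;

        printf("uniform low:  $%.1fB\n", uniform_low  / 1e9);  /*  $6.0B */
        printf("uniform high: $%.1fB\n", uniform_high / 1e9);  /* $24.5B */
        printf("segmented:    $%.1fB\n", segmented    / 1e9);  /* $34.5B */
        return 0;
    }

Segmentation beats either uniform price, and it is also the only scheme where gamers get cards at all, which is the point above.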
French polymath (economist, engineer, bureaucrat) Jules Dupuit famously described this concerning railway carriage accommodations and the parlous state of third-class carriages:
It is not because of the several thousand francs which they would have to spend to cover the third class wagons or to upholster the benches. ... [I]t would happily sacrifice this [expense] for the sake of its popularity.
Its goal is to stop the traveler who can pay for the second class trip from going third class. It hurts the poor not because it wants them to personally suffer, but to scare the rich.
<https://www.inc.com/bill-murphy-jr/why-does-air-travel-suck-...>
More on Dupuit:
<https://en.wikipedia.org/wiki/Jules_Dupuit>
Market segmentation by performance is a long-standing practice in the information technology world. IBM would degrade performance of its mainframes by ensuring that a certain fraction of CPU operations were no-ops (NOPs), meaning that for those clock cycles the system was not processing data. A service engineer would remove those limits on a higher lease fee (IBM leased rather than sold machines, ensuring a constant revenue stream). It's common practice in other areas to ship products with features installed but disabled and activated for only some paying customers.
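In software terms the mechanism amounts to something like the sketch below; this is a caricature, not IBM's actual implementation, and degrade_percent is an invented knob:

    #include <stdio.h>

    static long work_done;

    static void do_unit_of_work(long i) { work_done += i % 3; /* stand-in for real work */ }

    /* Waste a fixed fraction of iterations, in the spirit of the NOP-cycle
       degradation described above. */
    static void run(long iterations, int degrade_percent) {
        for (long i = 0; i < iterations; i++) {
            if (i % 100 < degrade_percent)
                __asm__ volatile ("nop");   /* this slot processes no data */
            else
                do_unit_of_work(i);
        }
    }

    int main(void) {
        run(1000000, 30);  /* 30% of slots burned; the paid "upgrade" just sets this to 0 */
        printf("work done: %ld\n", work_done);
        return 0;
    }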
Another classic example: the difference between Microsoft Windows NT server and workstation was the restriction of two registry keys:
We have found that NTS and NTW have identical kernels; in fact, NT is a single operating system with two modes. Only two registry settings are needed to switch between these two modes in NT 4.0, and only one setting in NT 3.51. This is extremely significant, and calls into question the related legal limitations and costly upgrades that currently face NTW users.
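For reference, the value that analysis pointed at lives under ProductOptions; a minimal Win32 reader (key and value names as reported at the time, so treat them as historical):

    #include <stdio.h>
    #include <windows.h>

    /* Reads the value that reportedly switched NT 4.0 between its two "modes". */
    int main(void) {
        HKEY key;
        char type[64];
        DWORD len = sizeof type;
        if (RegOpenKeyExA(HKEY_LOCAL_MACHINE,
                          "SYSTEM\\CurrentControlSet\\Control\\ProductOptions",
                          0, KEY_READ, &key) != ERROR_SUCCESS)
            return 1;
        if (RegQueryValueExA(key, "ProductType", NULL, NULL,
                             (LPBYTE)type, &len) == ERROR_SUCCESS)
            printf("ProductType: %s\n", type);  /* "WinNT" = Workstation, "ServerNT" = Server */
        RegCloseKey(key);
        return 0;
    }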